TOP ACTIVATIONS
MAX = 0.051

Can I keep the shiny rock ?" she asked .
. Can we own the rain , Mom ?" Mia asks
, " Isn 't the sunset beautiful ?" L
you pass me the heavy rock , please ?" So Jay
. Can you see the rain that way , Lily and
" Are you okay , Lily ?" he asked .
Do not worry ," the older brother said in a kind
say sorry to the pengu in ?" Lily asked .
the window to watch the rain ?" Her
go outside when it is raining ?" Lily thought for a
" Can we have an invitation too ?" Tom said .
play with your aer opl ane , Tim my ?" asked
What happened to my muff in ? It looks like it
I hug you too , Lily ?" Tom asked .
play with me and my doll ?" Tom says
Can I play with it ?", she said . Her mum
can I keep the talking ash tr ay ? "
boat fit in your big boat ?" The captain replied ,
Where should I start my search ?" Mom pointed outside and
do you do on your boat ?" "

MIN ACTIVATIONS
MIN = -0.063

s wrong , rh in oc eros ? " said
try to grab the av oc ados . But the rock
didn 't like wet av oc ados , so he decided
They forget about the av oc ados . They just want
playing ! The rh in oc eros was very worried ,
, " I love av oc ados ! They are so
his favourite treat - ch oc - chip ice cream !
saw a big rh in oc eros . It was dirty
on , Lily loved av oc ados and would ask for
" Hello , rh in oc eros !" Anna and Ben
at that big rh in oc eros !" Billy said ,
he saw a rh in oc eros ! It looked very
was a big rh in oc eros . He was very
happy and brought the av oc ados home to share with
was a big rh in oc eros . He was troubled
ongo loved to eat av oc ados . One day ,
paid him for his av oc ados . Tom
and picked the right av oc ados . Sadly
there were no other av oc ados . The
replied the rh in oc eros sadly . It had

INTERVAL 0.023 - 0.051
CONTAINS 0.398%

So the mom and the little girl went off to
Are we there yet ?" Sara asks . "
a swim ?" Her M ummy smiled and said " No
I will miss Snow y ," the little girl mentioned to
Tim my learned that it 's important to be kind

INTERVAL -0.006 - 0.023
CONTAINS 75.886%

This is my dress and I want to keep it ."
. They built a little house and played hide and seek
support you ." The frog thanked Lucy , and
" No , Tom . It looks scary .
well . But then she started to feel tired . She

INTERVAL -0.035 - -0.006
CONTAINS 23.701%

It was the best strawberry she had ever tasted . <|endoftext|>
He happily ch omp ed on the bone , content to
up and started jumping around the lawn too . He chased
seller said . " I will give you five coins for
playing on some equipment . She asked her mom if she

INTERVAL -0.063 - -0.035
CONTAINS 0.015%

The bad rh in oc eros was so happy .
thirty - two , thirty - three , thirty - four
said , " We ee e !" and off she went
suddenly saw a rh in oc eros in front of him
get her mom my 's attention , but she was busy
TokenCan
Token ID6090
Feature activation+4.52e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
ados+0.002
 sidelines+0.002
ting+0.002
Neg logit contributions
-0.003
 said-0.002
 printed-0.002
 once-0.002
 connected-0.002
Token I
Token ID314
Feature activation+6.20e-03

Pos logit contributions
+0.007
eth+0.006
 favorite+0.006
 said+0.006
 wish+0.006
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.006
ting-0.006
 sidelines-0.006
Token keep
Token ID1394
Feature activation+5.55e-03

Pos logit contributions
+0.008
 eng+0.007
eth+0.007
 connected+0.007
 favorite+0.007
Neg logit contributions
ocolate-0.010
 yourselves-0.010
 yourself-0.009
ting-0.009
 sidelines-0.009
Token the
Token ID262
Feature activation+5.08e-03

Pos logit contributions
+0.008
 favorite+0.006
 said+0.006
 sure+0.006
 once+0.006
Neg logit contributions
ocolate-0.010
 yourselves-0.009
ados-0.008
 sidelines-0.008
ting-0.008
Token shiny
Token ID22441
Feature activation+7.54e-03

Pos logit contributions
+0.007
 favorite+0.006
 wish+0.006
eth+0.006
 said+0.006
Neg logit contributions
ocolate-0.009
 yourselves-0.009
ados-0.008
ting-0.008
 sidelines-0.008
Token rock
Token ID3881
Feature activation+5.14e-02

Pos logit contributions
+0.010
 wish+0.009
 favorite+0.009
 wrapped+0.009
eth+0.008
Neg logit contributions
ocolate-0.013
 yourselves-0.013
 sidelines-0.012
ados-0.012
ting-0.012
Token?"
Token ID1701
Feature activation+2.55e-02

Pos logit contributions
+0.065
eth+0.056
 eng+0.053
 printed+0.053
 wish+0.051
Neg logit contributions
 yourselves-0.091
ocolate-0.091
ados-0.083
 yourself-0.081
ting-0.079
Token she
Token ID673
Feature activation+2.77e-02

Pos logit contributions
eth+0.041
+0.038
 printed+0.037
isc+0.035
ris+0.035
Neg logit contributions
 yourselves-0.035
 yourself-0.030
ocolate-0.029
 sidelines-0.028
 temptation-0.028
Token asked
Token ID1965
Feature activation+9.53e-03

Pos logit contributions
eth+0.029
+0.029
erg+0.026
isc+0.025
 favourite+0.025
Neg logit contributions
 yourselves-0.046
gging-0.045
Please-0.045
ting-0.044
 yourself-0.044
Token.
Token ID13
Feature activation+8.65e-03

Pos logit contributions
+0.009
 said+0.006
eth+0.006
 favorite+0.006
 once+0.006
Neg logit contributions
 yourselves-0.021
ocolate-0.021
 sidelines-0.020
ados-0.020
ting-0.019
Token
Token ID198
Feature activation+9.93e-03

Pos logit contributions
+0.005
eth+0.003
 said+0.003
 favorite+0.003
 wish+0.002
Neg logit contributions
 yourselves-0.022
ocolate-0.022
ados-0.021
ting-0.020
 sidelines-0.020
Token.
Token ID13
Feature activation+1.99e-03

Pos logit contributions
+0.003
eth+0.003
 eng+0.003
 favorite+0.003
 printed+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.004
ting-0.004
 yourself-0.004
Token Can
Token ID1680
Feature activation+1.87e-02

Pos logit contributions
+0.003
eth+0.002
 favorite+0.002
 said+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ados-0.003
ting-0.003
 sidelines-0.003
Token we
Token ID356
Feature activation+1.53e-02

Pos logit contributions
+0.021
eth+0.016
 said+0.016
 favorite+0.016
 wish+0.015
Neg logit contributions
ocolate-0.037
 yourselves-0.037
ados-0.035
ting-0.035
 sidelines-0.034
Token own
Token ID898
Feature activation+3.17e-03

Pos logit contributions
+0.020
eth+0.017
 eng+0.017
 favorite+0.017
 connected+0.016
Neg logit contributions
 yourselves-0.026
ocolate-0.026
ting-0.023
 sidelines-0.023
 yourself-0.023
Token the
Token ID262
Feature activation+4.71e-03

Pos logit contributions
+0.004
eth+0.003
 favorite+0.003
 said+0.003
 wish+0.003
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.005
ting-0.005
 sidelines-0.005
Token rain
Token ID6290
Feature activation+4.91e-02

Pos logit contributions
+0.007
 wish+0.006
 said+0.006
 favorite+0.006
eth+0.006
Neg logit contributions
ocolate-0.008
 yourselves-0.008
ados-0.007
ting-0.007
 sidelines-0.007
Token,
Token ID11
Feature activation+1.98e-02

Pos logit contributions
+0.075
eth+0.065
 eng+0.064
 wish+0.062
 Hatt+0.061
Neg logit contributions
 yourselves-0.077
ocolate-0.072
 yourself-0.066
 cream-0.066
ting-0.065
Token Mom
Token ID11254
Feature activation+3.55e-02

Pos logit contributions
+0.026
eth+0.023
 eng+0.021
 printed+0.021
 connected+0.020
Neg logit contributions
 yourselves-0.035
ocolate-0.035
 sidelines-0.030
ados-0.030
 yourself-0.030
Token?"
Token ID1701
Feature activation+1.96e-02

Pos logit contributions
 eng+0.068
eth+0.067
 favourite+0.065
 Hatt+0.065
+0.065
Neg logit contributions
 yourselves-0.037
 yourself-0.032
 cream-0.030
ting-0.029
ocolate-0.025
Token Mia
Token ID32189
Feature activation+1.86e-02

Pos logit contributions
eth+0.034
 printed+0.033
+0.033
 connected+0.031
 prized+0.031
Neg logit contributions
 yourselves-0.023
ocolate-0.021
ados-0.019
 course-0.019
 sidelines-0.019
Token asks
Token ID7893
Feature activation+1.38e-02

Pos logit contributions
eth+0.023
 printed+0.022
 favourite+0.022
 favorite+0.021
isc+0.020
Neg logit contributions
 yourselves-0.030
ting-0.029
 yourself-0.028
ados-0.027
ocolate-0.026
Token,
Token ID11
Feature activation+3.90e-03

Pos logit contributions
 yourselves+0.017
ocolate+0.016
 sidelines+0.016
ting+0.015
ados+0.015
Neg logit contributions
-0.006
 said-0.004
eth-0.004
 favorite-0.004
 sure-0.004
Token "
Token ID366
Feature activation-1.20e-02

Pos logit contributions
+0.005
 said+0.004
 sure+0.004
+0.004
 excited+0.003
Neg logit contributions
 yourselves-0.008
ocolate-0.008
ados-0.008
 sidelines-0.007
 yourself-0.007
TokenIsn
Token ID41451
Feature activation+4.28e-03

Pos logit contributions
 yourselves+0.015
ocolate+0.015
ting+0.013
ados+0.013
 yourself+0.013
Neg logit contributions
-0.022
eth-0.019
 favorite-0.019
 wish-0.019
 said-0.019
Token't
Token ID470
Feature activation-1.38e-02

Pos logit contributions
 wish+0.008
eth+0.008
+0.008
 favorite+0.007
 connected+0.007
Neg logit contributions
ocolate-0.005
 yourselves-0.005
 course-0.004
ting-0.004
 yourself-0.004
Token the
Token ID262
Feature activation+1.07e-03

Pos logit contributions
 yourselves+0.021
ting+0.017
 yourself+0.017
ocolate+0.017
ados+0.017
Neg logit contributions
eth-0.024
-0.022
 favourite-0.019
 favorite-0.019
 printed-0.019
Token sunset
Token ID26428
Feature activation+4.80e-02

Pos logit contributions
+0.001
 wish+0.001
 favorite+0.001
 said+0.001
eth+0.001
Neg logit contributions
ocolate-0.002
 yourselves-0.002
ados-0.002
ting-0.002
 sidelines-0.002
Token beautiful
Token ID4950
Feature activation+1.81e-02

Pos logit contributions
+0.059
eth+0.058
erg+0.048
 eng+0.048
 wish+0.047
Neg logit contributions
 yourselves-0.084
ocolate-0.083
ados-0.075
 yourself-0.075
ting-0.074
Token?"
Token ID1701
Feature activation+1.40e-02

Pos logit contributions
+0.023
 said+0.019
 favorite+0.019
 once+0.018
 wish+0.018
Neg logit contributions
ocolate-0.033
 yourselves-0.033
 sidelines-0.031
ados-0.030
ting-0.030
Token
Token ID198
Feature activation-2.05e-03

Pos logit contributions
+0.011
 connected+0.008
 printed+0.008
 favorite+0.008
 stuck+0.007
Neg logit contributions
 yourselves-0.031
ocolate-0.030
ting-0.028
ados-0.028
 yourself-0.028
Token
Token ID198
Feature activation-3.82e-03

Pos logit contributions
 yourselves+0.004
ocolate+0.004
 yourself+0.004
 sidelines+0.003
 course+0.003
Neg logit contributions
-0.002
 stuck-0.002
 connected-0.002
 favorite-0.002
 printed-0.002
TokenL
Token ID43
Feature activation-5.82e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
 yourself+0.005
 sidelines+0.004
ting+0.004
Neg logit contributions
-0.006
 favorite-0.005
 stuck-0.005
 connected-0.005
 printed-0.005
Token you
Token ID345
Feature activation+1.20e-02

Pos logit contributions
+0.005
 favorite+0.004
eth+0.004
 said+0.004
 wish+0.004
Neg logit contributions
ocolate-0.005
 yourselves-0.005
ados-0.004
ting-0.004
 sidelines-0.004
Token pass
Token ID1208
Feature activation+2.45e-03

Pos logit contributions
+0.018
 wish+0.015
eth+0.015
 said+0.015
 favorite+0.014
Neg logit contributions
 yourselves-0.020
ocolate-0.019
ados-0.018
ting-0.018
 sidelines-0.017
Token me
Token ID502
Feature activation+4.92e-03

Pos logit contributions
+0.003
 favorite+0.003
 said+0.003
 once+0.003
eth+0.002
Neg logit contributions
ocolate-0.004
 yourselves-0.004
 sidelines-0.004
ados-0.004
ting-0.004
Token the
Token ID262
Feature activation+1.98e-03

Pos logit contributions
+0.005
 favorite+0.004
eth+0.004
 said+0.003
 once+0.003
Neg logit contributions
ocolate-0.010
 yourselves-0.010
 sidelines-0.010
ados-0.010
ting-0.009
Token heavy
Token ID4334
Feature activation+1.04e-02

Pos logit contributions
+0.003
eth+0.002
 said+0.002
 sure+0.002
 printed+0.002
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ting-0.003
ados-0.003
 sidelines-0.003
Token rock
Token ID3881
Feature activation+4.73e-02

Pos logit contributions
+0.014
eth+0.011
 favorite+0.011
 wish+0.011
 said+0.011
Neg logit contributions
 yourselves-0.018
ocolate-0.018
ados-0.017
ting-0.016
 sidelines-0.016
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
+0.063
eth+0.051
 favorite+0.050
 said+0.050
 wish+0.048
Neg logit contributions
 yourselves-0.084
ocolate-0.083
ados-0.077
 sidelines-0.076
ting-0.076
Token please
Token ID3387
Feature activation+1.96e-02

Pos logit contributions
+0.023
eth+0.019
 favorite+0.018
 said+0.018
 wish+0.018
Neg logit contributions
 yourselves-0.027
ocolate-0.027
ados-0.024
ting-0.024
 sidelines-0.024
Token?"
Token ID1701
Feature activation+1.64e-02

Pos logit contributions
+0.022
 said+0.017
 printed+0.016
 wish+0.016
 favorite+0.016
Neg logit contributions
ocolate-0.039
 yourselves-0.039
ados-0.038
 sidelines-0.037
ting-0.036
Token So
Token ID1406
Feature activation+2.62e-04

Pos logit contributions
+0.021
 printed+0.018
eth+0.018
 wish+0.018
 connected+0.018
Neg logit contributions
 yourselves-0.028
ocolate-0.028
ting-0.026
ados-0.025
 sidelines-0.024
Token Jay
Token ID9180
Feature activation+7.36e-03

Pos logit contributions
+0.000
eth+0.000
 favorite+0.000
 said+0.000
 wish+0.000
Neg logit contributions
 yourselves-0.001
ocolate-0.001
ados-0.000
ting-0.000
 sidelines-0.000
Token.
Token ID13
Feature activation+9.09e-03

Pos logit contributions
 yourselves+0.018
ocolate+0.018
ados+0.017
ting+0.016
 yourself+0.016
Neg logit contributions
-0.014
eth-0.012
 eng-0.011
erg-0.011
 wish-0.010
Token Can
Token ID1680
Feature activation+1.68e-02

Pos logit contributions
+0.009
eth+0.008
 favorite+0.008
 wish+0.008
 worked+0.008
Neg logit contributions
 yourselves-0.018
ocolate-0.017
 yourself-0.016
ting-0.016
ados-0.015
Token you
Token ID345
Feature activation+2.17e-02

Pos logit contributions
+0.024
eth+0.021
 connected+0.020
 favorite+0.020
 wish+0.020
Neg logit contributions
 yourselves-0.028
ocolate-0.027
ados-0.025
ting-0.024
 sidelines-0.024
Token see
Token ID766
Feature activation+2.62e-03

Pos logit contributions
+0.031
eth+0.028
 eng+0.027
 favorite+0.027
 connected+0.026
Neg logit contributions
ocolate-0.033
 yourselves-0.033
 sidelines-0.029
 yourself-0.029
 course-0.029
Token the
Token ID262
Feature activation+6.22e-04

Pos logit contributions
+0.004
eth+0.004
 eng+0.003
 wish+0.003
 connected+0.003
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ting-0.004
ados-0.004
 yourself-0.004
Token rain
Token ID6290
Feature activation+4.71e-02

Pos logit contributions
+0.001
 wish+0.001
 favorite+0.001
 said+0.001
eth+0.001
Neg logit contributions
ocolate-0.001
 yourselves-0.001
ados-0.001
ting-0.001
 sidelines-0.001
Token that
Token ID326
Feature activation+4.49e-03

Pos logit contributions
+0.061
eth+0.052
 wish+0.048
 eng+0.048
 sure+0.046
Neg logit contributions
 yourselves-0.085
ocolate-0.083
ting-0.076
 yourself-0.075
ados-0.074
Token way
Token ID835
Feature activation-1.38e-02

Pos logit contributions
eth+0.006
+0.006
 eng+0.005
erg+0.005
 Hatt+0.005
Neg logit contributions
 yourselves-0.008
ocolate-0.007
ting-0.007
 course-0.006
 yourself-0.006
Token,
Token ID11
Feature activation+1.64e-02

Pos logit contributions
 yourselves+0.024
ocolate+0.023
ting+0.022
 yourself+0.021
 sidelines+0.021
Neg logit contributions
-0.018
eth-0.015
 eng-0.015
 printed-0.015
 wish-0.014
Token Lily
Token ID20037
Feature activation+3.16e-02

Pos logit contributions
+0.023
eth+0.021
 eng+0.019
 connected+0.018
 worked+0.018
Neg logit contributions
 yourselves-0.027
ocolate-0.026
 yourself-0.024
ados-0.023
 sidelines-0.023
Token and
Token ID290
Feature activation-8.91e-04

Pos logit contributions
+0.041
eth+0.036
 eng+0.035
 Georg+0.033
 favorite+0.032
Neg logit contributions
 yourselves-0.056
ocolate-0.053
 yourself-0.050
ting-0.050
ados-0.049
Token"
Token ID1
Feature activation-3.67e-03

Pos logit contributions
+0.001
eth+0.001
 favorite+0.001
 wish+0.001
 said+0.001
Neg logit contributions
 yourselves-0.002
ocolate-0.002
ados-0.001
ting-0.001
 sidelines-0.001
TokenAre
Token ID8491
Feature activation-3.76e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ados+0.005
ting+0.005
 sidelines+0.004
Neg logit contributions
-0.006
eth-0.005
 favorite-0.005
 said-0.005
 wish-0.005
Token you
Token ID345
Feature activation+1.06e-02

Pos logit contributions
ocolate+0.006
 yourselves+0.006
 sidelines+0.005
ados+0.005
ting+0.005
Neg logit contributions
-0.006
 said-0.005
 sure-0.005
 favorite-0.005
 wish-0.005
Token okay
Token ID8788
Feature activation+2.14e-02

Pos logit contributions
+0.015
eth+0.014
 printed+0.012
 favorite+0.012
erg+0.012
Neg logit contributions
 yourselves-0.017
ocolate-0.017
 yourself-0.016
ting-0.015
ados-0.015
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
+0.029
 connected+0.024
 eng+0.024
 printed+0.024
eth+0.024
Neg logit contributions
 yourselves-0.036
ocolate-0.036
ados-0.034
 yourself-0.032
ting-0.031
Token Lily
Token ID20037
Feature activation+4.62e-02

Pos logit contributions
+0.019
eth+0.016
 eng+0.014
 printed+0.014
 connected+0.014
Neg logit contributions
 yourselves-0.031
ocolate-0.030
ados-0.028
 sidelines-0.028
ting-0.027
Token?"
Token ID1701
Feature activation+3.08e-02

Pos logit contributions
+0.082
 favourite+0.081
 favorite+0.078
eth+0.077
 eng+0.077
Neg logit contributions
 yourselves-0.053
 yourself-0.045
ocolate-0.045
ting-0.043
ados-0.042
Token he
Token ID339
Feature activation+3.46e-02

Pos logit contributions
eth+0.049
iss+0.048
 favorite+0.048
vy+0.046
aved+0.046
Neg logit contributions
 yourselves-0.040
ting-0.034
 yourself-0.034
ados-0.033
 sidelines-0.033
Token asked
Token ID1965
Feature activation+1.58e-02

Pos logit contributions
+0.032
eth+0.029
 favorite+0.027
 favourite+0.027
erg+0.027
Neg logit contributions
 yourselves-0.066
 yourself-0.062
gging-0.061
ting-0.061
Please-0.059
Token.
Token ID13
Feature activation+7.20e-03

Pos logit contributions
+0.020
 wish+0.017
 printed+0.017
 favorite+0.017
eth+0.017
Neg logit contributions
 yourselves-0.027
ocolate-0.026
ting-0.025
ados-0.025
 yourself-0.023
Token
Token ID198
Feature activation+3.33e-03

Pos logit contributions
+0.009
 wish+0.009
 favorite+0.009
 favourite+0.008
 connected+0.008
Neg logit contributions
ocolate-0.011
 yourselves-0.011
 sidelines-0.010
ados-0.010
ting-0.010
TokenDo
Token ID5211
Feature activation+3.18e-03

Pos logit contributions
 yourselves+0.010
ocolate+0.010
ados+0.009
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.013
eth-0.011
 favorite-0.011
 said-0.011
 wish-0.011
Token not
Token ID407
Feature activation+3.11e-03

Pos logit contributions
+0.005
 sure+0.004
 said+0.004
 favorite+0.004
 special+0.004
Neg logit contributions
ocolate-0.006
ting-0.005
 yourselves-0.005
ados-0.005
 sidelines-0.005
Token worry
Token ID5490
Feature activation-4.57e-03

Pos logit contributions
+0.004
eth+0.004
 favorite+0.003
 printed+0.003
 said+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.005
ting-0.005
 yourself-0.005
Token,"
Token ID553
Feature activation+2.20e-02

Pos logit contributions
 yourselves+0.009
ocolate+0.009
ados+0.009
ting+0.009
 yourself+0.008
Neg logit contributions
-0.005
eth-0.004
 printed-0.003
 favorite-0.003
 wish-0.003
Token the
Token ID262
Feature activation+2.46e-02

Pos logit contributions
+0.024
eth+0.022
 favorite+0.020
 wish+0.018
 printed+0.018
Neg logit contributions
 yourselves-0.042
ocolate-0.041
ting-0.039
ados-0.039
 sidelines-0.038
Token older
Token ID4697
Feature activation+4.59e-02

Pos logit contributions
+0.026
eth+0.022
 wish+0.021
 favorite+0.020
 printed+0.020
Neg logit contributions
 yourselves-0.047
ocolate-0.047
 sidelines-0.044
ados-0.044
ting-0.044
Token brother
Token ID3956
Feature activation+4.64e-03

Pos logit contributions
+0.041
eth+0.030
 favorite+0.029
 wish+0.028
 printed+0.028
Neg logit contributions
 yourselves-0.100
ocolate-0.099
ados-0.094
ting-0.093
 sidelines-0.090
Token said
Token ID531
Feature activation+2.78e-03

Pos logit contributions
eth+0.003
+0.003
 favorite+0.003
 favourite+0.002
erg+0.002
Neg logit contributions
ocolate-0.010
 yourselves-0.010
ting-0.009
 course-0.009
ados-0.009
Token in
Token ID287
Feature activation+9.34e-03

Pos logit contributions
+0.003
eth+0.002
 said+0.002
 favorite+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.005
ting-0.005
 sidelines-0.005
Token a
Token ID257
Feature activation-2.29e-03

Pos logit contributions
+0.009
 sure+0.005
 wish+0.005
 special+0.005
 know+0.005
Neg logit contributions
 yourselves-0.022
ocolate-0.022
ados-0.021
 sidelines-0.020
ting-0.020
Token kind
Token ID1611
Feature activation+2.36e-03

Pos logit contributions
 yourselves+0.004
ocolate+0.004
ados+0.004
ting+0.004
 yourself+0.004
Neg logit contributions
-0.003
 wish-0.003
 said-0.003
eth-0.002
 special-0.002
Token say
Token ID910
Feature activation+3.24e-04

Pos logit contributions
+0.015
eth+0.013
 favorite+0.012
 eng+0.012
 connected+0.012
Neg logit contributions
 yourselves-0.022
ocolate-0.022
 sidelines-0.020
ting-0.020
ados-0.020
Token sorry
Token ID7926
Feature activation+2.00e-02

Pos logit contributions
+0.001
 favorite+0.000
 said+0.000
eth+0.000
 wish+0.000
Neg logit contributions
 yourselves-0.000
ocolate-0.000
ados-0.000
ting-0.000
 sidelines-0.000
Token to
Token ID284
Feature activation+8.45e-03

Pos logit contributions
+0.025
 eng+0.022
eth+0.021
 connected+0.021
 printed+0.020
Neg logit contributions
 yourselves-0.036
ocolate-0.035
ados-0.032
 yourself-0.032
ting-0.031
Token the
Token ID262
Feature activation+5.54e-03

Pos logit contributions
+0.012
eth+0.010
 favorite+0.010
 said+0.010
 wish+0.010
Neg logit contributions
 yourselves-0.014
ocolate-0.014
ados-0.013
ting-0.013
 sidelines-0.012
Token pengu
Token ID38373
Feature activation+2.18e-03

Pos logit contributions
+0.008
eth+0.007
 connected+0.006
 sure+0.006
 printed+0.006
Neg logit contributions
 yourselves-0.010
ocolate-0.009
 sidelines-0.008
ting-0.008
 yourself-0.008
Tokenin
Token ID259
Feature activation+4.52e-02

Pos logit contributions
+0.003
 said+0.002
eth+0.002
 favorite+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ados-0.004
 sidelines-0.004
ting-0.003
Token?"
Token ID1701
Feature activation+3.77e-02

Pos logit contributions
+0.070
eth+0.061
 eng+0.059
 connected+0.059
 favorite+0.058
Neg logit contributions
ocolate-0.068
 yourselves-0.067
ting-0.058
ados-0.058
 yourself-0.057
Token Lily
Token ID20037
Feature activation+3.49e-02

Pos logit contributions
+0.057
erg+0.055
rig+0.055
iss+0.054
ren+0.053
Neg logit contributions
 yourselves-0.050
 sidelines-0.042
Stop-0.041
Please-0.041
ocolate-0.041
Token asked
Token ID1965
Feature activation+2.16e-02

Pos logit contributions
+0.026
eth+0.025
 favorite+0.024
 favourite+0.024
 wish+0.024
Neg logit contributions
 yourselves-0.072
ting-0.070
ados-0.067
ocolate-0.067
 yourself-0.066
Token.
Token ID13
Feature activation+2.02e-02

Pos logit contributions
+0.023
eth+0.018
 said+0.017
 favorite+0.017
 printed+0.016
Neg logit contributions
 yourselves-0.044
ocolate-0.044
ados-0.041
 sidelines-0.040
ting-0.040
Token
Token ID198
Feature activation+1.85e-02

Pos logit contributions
+0.021
eth+0.016
 wish+0.016
 favorite+0.016
 said+0.015
Neg logit contributions
 yourselves-0.041
ocolate-0.040
ados-0.038
ting-0.038
 sidelines-0.037
Token the
Token ID262
Feature activation-2.94e-03

Pos logit contributions
+0.014
eth+0.012
 eng+0.012
 favorite+0.012
 printed+0.011
Neg logit contributions
 yourselves-0.017
ocolate-0.016
 yourself-0.015
 sidelines-0.015
ados-0.015
Token window
Token ID4324
Feature activation+2.05e-02

Pos logit contributions
ocolate+0.005
 yourselves+0.005
ados+0.005
ting+0.005
 sidelines+0.005
Neg logit contributions
-0.004
 favorite-0.003
eth-0.003
 wish-0.003
 printed-0.003
Token to
Token ID284
Feature activation+7.80e-03

Pos logit contributions
+0.030
eth+0.026
 wish+0.024
 favorite+0.024
 eng+0.024
Neg logit contributions
 yourselves-0.033
ocolate-0.033
ados-0.029
ting-0.029
 yourself-0.028
Token watch
Token ID2342
Feature activation-1.04e-03

Pos logit contributions
+0.010
eth+0.009
 favorite+0.008
 said+0.008
 eng+0.008
Neg logit contributions
 yourselves-0.014
ocolate-0.014
 sidelines-0.013
 yourself-0.012
ados-0.012
Token the
Token ID262
Feature activation+8.10e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
ados+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.001
eth-0.001
 favorite-0.001
 wish-0.001
 said-0.001
Token rain
Token ID6290
Feature activation+4.52e-02

Pos logit contributions
+0.013
eth+0.011
 favorite+0.010
 said+0.010
 wish+0.010
Neg logit contributions
 yourselves-0.013
ocolate-0.012
ados-0.011
ting-0.011
 sidelines-0.011
Token?"
Token ID1701
Feature activation+2.31e-02

Pos logit contributions
+0.066
eth+0.058
 wish+0.056
 eng+0.055
 favorite+0.055
Neg logit contributions
 yourselves-0.072
ocolate-0.070
ting-0.063
 yourself-0.061
 sidelines-0.061
Token
Token ID220
Feature activation-1.78e-03

Pos logit contributions
+0.023
 said+0.016
eth+0.016
 favorite+0.015
 wish+0.015
Neg logit contributions
 yourselves-0.051
ocolate-0.049
ados-0.046
ting-0.046
 sidelines-0.046
Token
Token ID198
Feature activation-7.11e-04

Pos logit contributions
 yourselves+0.003
ocolate+0.003
 yourself+0.003
ados+0.003
ting+0.003
Neg logit contributions
 wish-0.002
 printed-0.002
-0.002
 pretended-0.002
 wrapped-0.002
Token
Token ID198
Feature activation+1.88e-05

Pos logit contributions
 yourselves+0.002
ocolate+0.002
ados+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.000
eth-0.000
 favorite-0.000
 wish-0.000
 said-0.000
TokenHer
Token ID9360
Feature activation+1.07e-02

Pos logit contributions
+0.000
eth+0.000
 said+0.000
 printed+0.000
 know+0.000
Neg logit contributions
 yourselves-0.000
ocolate-0.000
ados-0.000
ting-0.000
 sidelines-0.000
Token go
Token ID467
Feature activation+1.18e-02

Pos logit contributions
 yourselves+0.004
ocolate+0.004
ados+0.004
ting+0.004
 sidelines+0.004
Neg logit contributions
-0.002
eth-0.002
 favorite-0.002
 said-0.002
 wish-0.002
Token outside
Token ID2354
Feature activation+5.64e-03

Pos logit contributions
+0.016
eth+0.014
 eng+0.013
 wish+0.013
 favorite+0.012
Neg logit contributions
 yourselves-0.021
 yourself-0.019
ocolate-0.018
ting-0.018
ados-0.017
Token when
Token ID618
Feature activation+7.09e-03

Pos logit contributions
+0.007
eth+0.006
 wish+0.006
 printed+0.005
 eng+0.005
Neg logit contributions
 yourselves-0.010
ocolate-0.010
ting-0.009
ados-0.009
 yourself-0.009
Token it
Token ID340
Feature activation+3.47e-02

Pos logit contributions
+0.011
 said+0.009
 favorite+0.009
 wish+0.009
eth+0.009
Neg logit contributions
ocolate-0.011
 yourselves-0.011
ados-0.010
ting-0.010
 sidelines-0.010
Token is
Token ID318
Feature activation+7.68e-03

Pos logit contributions
eth+0.039
+0.038
 Hatt+0.037
 Georg+0.035
 eng+0.035
Neg logit contributions
 yourselves-0.059
ocolate-0.054
ting-0.054
 yourself-0.053
ados-0.050
Token raining
Token ID43079
Feature activation+4.50e-02

Pos logit contributions
+0.008
eth+0.006
 wish+0.006
 favorite+0.006
 said+0.006
Neg logit contributions
 yourselves-0.016
ocolate-0.016
ados-0.014
ting-0.014
 sidelines-0.014
Token?"
Token ID1701
Feature activation+9.11e-03

Pos logit contributions
+0.060
eth+0.048
 favorite+0.047
 said+0.047
 wish+0.046
Neg logit contributions
 yourselves-0.080
ocolate-0.079
ados-0.073
ting-0.072
 sidelines-0.071
Token Lily
Token ID20037
Feature activation+1.66e-02

Pos logit contributions
+0.009
 connected+0.008
eth+0.008
 wish+0.008
 eng+0.008
Neg logit contributions
 yourselves-0.017
ocolate-0.017
ados-0.016
ting-0.016
 course-0.016
Token thought
Token ID1807
Feature activation-5.44e-03

Pos logit contributions
+0.019
eth+0.015
 favorite+0.014
 wish+0.014
 printed+0.014
Neg logit contributions
 yourselves-0.032
ocolate-0.032
ados-0.029
ting-0.029
 yourself-0.028
Token for
Token ID329
Feature activation+1.25e-02

Pos logit contributions
ocolate+0.011
 yourselves+0.011
ting+0.010
ados+0.010
 sidelines+0.010
Neg logit contributions
-0.006
eth-0.005
 printed-0.004
 wish-0.004
 connected-0.004
Token a
Token ID257
Feature activation+3.46e-03

Pos logit contributions
+0.011
eth+0.008
 favorite+0.008
 said+0.008
 wish+0.008
Neg logit contributions
 yourselves-0.028
ocolate-0.027
ados-0.026
ting-0.025
 sidelines-0.025
Token"
Token ID1
Feature activation-7.62e-03

Pos logit contributions
+0.004
eth+0.003
 said+0.003
 connected+0.003
 printed+0.003
Neg logit contributions
ocolate-0.004
 yourselves-0.004
ados-0.004
ting-0.004
 sidelines-0.004
TokenCan
Token ID6090
Feature activation+7.94e-04

Pos logit contributions
ocolate+0.010
 yourselves+0.010
ados+0.009
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.014
 said-0.012
 printed-0.012
eth-0.011
 connected-0.011
Token we
Token ID356
Feature activation+2.86e-03

Pos logit contributions
+0.001
eth+0.001
 favorite+0.001
 said+0.001
 wish+0.001
Neg logit contributions
 yourselves-0.001
ocolate-0.001
ados-0.001
ting-0.001
 sidelines-0.001
Token have
Token ID423
Feature activation-1.68e-03

Pos logit contributions
+0.004
eth+0.003
 eng+0.003
 favorite+0.003
 connected+0.003
Neg logit contributions
ocolate-0.005
 yourselves-0.005
ados-0.004
 yourself-0.004
 sidelines-0.004
Token an
Token ID281
Feature activation+2.61e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.002
 yourself+0.002
 course+0.002
Neg logit contributions
-0.003
eth-0.002
 eng-0.002
 connected-0.002
 wish-0.002
Token invitation
Token ID17023
Feature activation+4.48e-02

Pos logit contributions
+0.004
eth+0.004
s+0.004
 Georg+0.004
 involved+0.004
Neg logit contributions
ocolate-0.004
 yourselves-0.003
 course-0.003
ados-0.003
 temptation-0.003
Token too
Token ID1165
Feature activation+2.19e-02

Pos logit contributions
+0.065
 eng+0.058
eth+0.058
 Georg+0.054
 connected+0.054
Neg logit contributions
 yourselves-0.069
ocolate-0.068
 cream-0.061
 yourself-0.060
 course-0.059
Token?"
Token ID1701
Feature activation+1.95e-02

Pos logit contributions
+0.041
eth+0.036
 Hatt+0.035
 sure+0.035
 printed+0.035
Neg logit contributions
 yourselves-0.026
ocolate-0.024
 yourself-0.021
 cream-0.020
ados-0.018
Token Tom
Token ID4186
Feature activation+1.84e-02

Pos logit contributions
+0.030
eth+0.029
 favorite+0.028
 wish+0.028
 favourite+0.028
Neg logit contributions
 yourselves-0.026
 sidelines-0.024
ocolate-0.024
ados-0.023
ting-0.023
Token said
Token ID531
Feature activation-4.04e-03

Pos logit contributions
+0.015
 wish+0.014
eth+0.014
 favourite+0.014
 favorite+0.014
Neg logit contributions
ocolate-0.036
ting-0.036
 yourselves-0.036
gging-0.035
ados-0.035
Token.
Token ID13
Feature activation+2.12e-03

Pos logit contributions
ocolate+0.007
 yourselves+0.007
ados+0.007
ting+0.007
 sidelines+0.007
Neg logit contributions
-0.005
eth-0.004
 wish-0.004
 printed-0.004
 favorite-0.004
Token play
Token ID711
Feature activation+6.93e-03

Pos logit contributions
+0.003
eth+0.002
 eng+0.002
 sure+0.002
 connected+0.002
Neg logit contributions
ocolate-0.004
 yourselves-0.004
ados-0.004
 sidelines-0.004
 yourself-0.004
Token with
Token ID351
Feature activation+1.41e-02

Pos logit contributions
+0.009
eth+0.008
 wish+0.007
 eng+0.007
 connected+0.007
Neg logit contributions
 yourselves-0.013
ocolate-0.012
ados-0.011
ting-0.011
 yourself-0.011
Token your
Token ID534
Feature activation+3.48e-03

Pos logit contributions
+0.020
 said+0.016
 favorite+0.016
 special+0.015
 printed+0.015
Neg logit contributions
ocolate-0.025
 yourselves-0.023
 sidelines-0.022
ross-0.022
ting-0.021
Token aer
Token ID9551
Feature activation+2.34e-03

Pos logit contributions
+0.005
eth+0.004
 connected+0.004
 sure+0.004
 said+0.004
Neg logit contributions
 yourselves-0.006
ocolate-0.005
ados-0.005
 sidelines-0.005
 course-0.005
Tokenopl
Token ID20106
Feature activation+4.98e-03

Pos logit contributions
+0.005
eth+0.004
 favorite+0.004
 said+0.004
 wish+0.004
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ting-0.002
ados-0.002
 sidelines-0.002
Tokenane
Token ID1531
Feature activation+4.46e-02

Pos logit contributions
+0.006
eth+0.005
 favorite+0.005
 said+0.005
 printed+0.005
Neg logit contributions
 yourselves-0.009
ocolate-0.009
 sidelines-0.009
ados-0.008
ting-0.008
Token,
Token ID11
Feature activation+2.70e-02

Pos logit contributions
+0.067
eth+0.060
 eng+0.057
 connected+0.055
 wish+0.054
Neg logit contributions
 yourselves-0.068
ocolate-0.066
ados-0.062
 yourself-0.060
 course-0.059
Token Tim
Token ID5045
Feature activation+3.45e-02

Pos logit contributions
+0.032
eth+0.026
 favorite+0.024
 wish+0.024
 printed+0.024
Neg logit contributions
 yourselves-0.052
ocolate-0.051
ados-0.048
ting-0.047
 sidelines-0.046
Tokenmy
Token ID1820
Feature activation+2.86e-02

Pos logit contributions
+0.040
eth+0.032
 favorite+0.030
 wish+0.030
 eng+0.029
Neg logit contributions
 yourselves-0.067
ocolate-0.066
ados-0.062
ting-0.061
 yourself-0.059
Token?"
Token ID1701
Feature activation+2.68e-02

Pos logit contributions
+0.045
eth+0.042
 eng+0.040
 favourite+0.039
 favorite+0.039
Neg logit contributions
 yourselves-0.040
ocolate-0.036
 yourself-0.035
ados-0.035
ting-0.035
Token asked
Token ID1965
Feature activation+2.20e-02

Pos logit contributions
+0.036
eth+0.036
 favorite+0.032
 wish+0.031
erg+0.031
Neg logit contributions
 yourselves-0.041
ados-0.038
ocolate-0.038
ting-0.037
 yourself-0.036
TokenWhat
Token ID2061
Feature activation+1.50e-03

Pos logit contributions
 yourselves+0.019
ocolate+0.019
ados+0.017
 sidelines+0.016
ting+0.015
Neg logit contributions
-0.023
 said-0.018
 printed-0.017
 connected-0.017
 sure-0.017
Token happened
Token ID3022
Feature activation+1.65e-02

Pos logit contributions
+0.002
 favorite+0.001
 said+0.001
 wish+0.001
 printed+0.001
Neg logit contributions
 yourselves-0.003
ocolate-0.003
 sidelines-0.003
ados-0.003
ting-0.003
Token to
Token ID284
Feature activation+4.81e-03

Pos logit contributions
+0.018
 favorite+0.013
 said+0.013
eth+0.012
 sure+0.012
Neg logit contributions
ocolate-0.034
 yourselves-0.034
ados-0.032
 sidelines-0.032
ting-0.031
Token my
Token ID616
Feature activation-5.52e-03

Pos logit contributions
+0.008
eth+0.007
 said+0.007
 favorite+0.007
 wish+0.007
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.006
ting-0.006
 sidelines-0.006
Token muff
Token ID27563
Feature activation+2.82e-03

Pos logit contributions
 yourselves+0.008
ocolate+0.008
ados+0.007
 yourself+0.007
ting+0.007
Neg logit contributions
-0.009
eth-0.008
 connected-0.007
 printed-0.007
 sure-0.007
Tokenin
Token ID259
Feature activation+4.44e-02

Pos logit contributions
+0.003
 said+0.003
 favorite+0.003
 wish+0.003
 once+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.005
 sidelines-0.005
ting-0.005
ados-0.005
Token?
Token ID30
Feature activation+1.67e-04

Pos logit contributions
+0.065
eth+0.055
 favorite+0.053
 said+0.052
 wish+0.052
Neg logit contributions
 yourselves-0.072
ocolate-0.072
ados-0.065
ting-0.064
 sidelines-0.063
Token It
Token ID632
Feature activation+6.24e-03

Pos logit contributions
+0.000
eth+0.000
 favorite+0.000
 said+0.000
 wish+0.000
Neg logit contributions
 yourselves-0.000
ocolate-0.000
ados-0.000
ting-0.000
 sidelines-0.000
Token looks
Token ID3073
Feature activation+3.77e-03

Pos logit contributions
+0.006
eth+0.006
 eng+0.005
 favorite+0.005
 wish+0.005
Neg logit contributions
 yourselves-0.012
ocolate-0.012
ados-0.011
ting-0.011
 yourself-0.011
Token like
Token ID588
Feature activation-1.81e-03

Pos logit contributions
+0.005
eth+0.004
 favorite+0.003
 wish+0.003
 said+0.003
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.006
ting-0.006
 sidelines-0.006
Token it
Token ID340
Feature activation-2.04e-03

Pos logit contributions
ocolate+0.004
 yourselves+0.004
ting+0.004
 sidelines+0.004
ados+0.004
Neg logit contributions
-0.002
 special-0.001
 said-0.001
 favorite-0.001
 sure-0.001
Token I
Token ID314
Feature activation+1.10e-02

Pos logit contributions
+0.005
eth+0.004
 favorite+0.004
 wish+0.004
 said+0.004
Neg logit contributions
 yourselves-0.008
ocolate-0.008
ados-0.007
ting-0.007
 sidelines-0.007
Token hug
Token ID16225
Feature activation+4.98e-03

Pos logit contributions
+0.014
 eng+0.014
eth+0.013
 connected+0.013
 sure+0.013
Neg logit contributions
ocolate-0.017
 yourselves-0.017
 yourself-0.015
 course-0.015
 sidelines-0.015
Token you
Token ID345
Feature activation+2.99e-02

Pos logit contributions
+0.008
 said+0.007
 favorite+0.006
 says+0.006
 once+0.006
Neg logit contributions
ocolate-0.008
 yourselves-0.007
 sidelines-0.007
ados-0.007
ting-0.007
Token too
Token ID1165
Feature activation+1.90e-02

Pos logit contributions
+0.042
eth+0.036
 eng+0.034
 wish+0.034
 connected+0.033
Neg logit contributions
 yourselves-0.050
ocolate-0.050
ting-0.045
ados-0.045
 yourself-0.044
Token,
Token ID11
Feature activation+2.01e-02

Pos logit contributions
+0.028
 printed+0.024
 eng+0.024
 connected+0.023
 sure+0.023
Neg logit contributions
ocolate-0.030
 yourselves-0.030
 yourself-0.026
ados-0.025
 course-0.025
Token Lily
Token ID20037
Feature activation+4.43e-02

Pos logit contributions
+0.027
eth+0.024
 connected+0.022
 eng+0.022
 printed+0.022
Neg logit contributions
 yourselves-0.035
ocolate-0.034
ados-0.031
 sidelines-0.031
 yourself-0.030
Token?"
Token ID1701
Feature activation+2.30e-02

Pos logit contributions
+0.084
 eng+0.082
eth+0.081
 favourite+0.080
 Georg+0.078
Neg logit contributions
 yourselves-0.045
 yourself-0.039
ocolate-0.037
ados-0.036
 cream-0.036
Token Tom
Token ID4186
Feature activation+2.74e-02

Pos logit contributions
eth+0.044
sy+0.040
a+0.040
+0.039
 printed+0.039
Neg logit contributions
 yourselves-0.025
 sidelines-0.020
 course-0.020
 yourself-0.019
 hi-0.019
Token asked
Token ID1965
Feature activation+5.95e-03

Pos logit contributions
eth+0.027
+0.027
 favourite+0.026
aved+0.025
 wish+0.024
Neg logit contributions
ting-0.046
 yourselves-0.045
 wig-0.044
gging-0.044
 yourself-0.044
Token.
Token ID13
Feature activation+6.62e-03

Pos logit contributions
+0.007
eth+0.005
 favorite+0.005
 said+0.005
 wish+0.005
Neg logit contributions
 yourselves-0.012
ocolate-0.012
ados-0.011
ting-0.011
 sidelines-0.011
Token\n
Token ID198
Feature activation+6.29e-03

Pos logit contributions
+0.007
eth+0.006
 said+0.006
 favorite+0.006
 wish+0.005
Neg logit contributions
 yourselves-0.013
ocolate-0.013
ados-0.012
ting-0.012
 sidelines-0.012
Token play
Token ID711
Feature activation+7.03e-03

Pos logit contributions
+0.001
eth+0.001
 connected+0.001
 eng+0.001
 sure+0.001
Neg logit contributions
 yourselves-0.002
ocolate-0.002
ados-0.002
 sidelines-0.002
 yourself-0.002
Token with
Token ID351
Feature activation+1.50e-02

Pos logit contributions
+0.009
eth+0.007
 wish+0.007
 eng+0.007
 connected+0.007
Neg logit contributions
 yourselves-0.013
ocolate-0.013
ados-0.012
ting-0.012
 yourself-0.012
Token me
Token ID502
Feature activation+2.32e-02

Pos logit contributions
+0.025
eth+0.021
 favorite+0.021
 said+0.021
 wish+0.020
Neg logit contributions
ocolate-0.022
 yourselves-0.022
ados-0.019
ting-0.019
 sidelines-0.019
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
+0.033
eth+0.028
 wish+0.027
 favorite+0.026
 printed+0.026
Neg logit contributions
 yourselves-0.039
ocolate-0.038
ados-0.035
ting-0.035
 sidelines-0.034
Token my
Token ID616
Feature activation+8.28e-03

Pos logit contributions
+0.017
eth+0.015
 favorite+0.014
 wish+0.013
 eng+0.013
Neg logit contributions
 yourselves-0.022
ocolate-0.022
ados-0.020
 sidelines-0.020
ting-0.020
Token doll
Token ID3654
Feature activation+4.41e-02

Pos logit contributions
+0.011
eth+0.009
 printed+0.009
 connected+0.009
 sure+0.009
Neg logit contributions
 yourselves-0.015
ocolate-0.014
ados-0.013
 sidelines-0.013
ting-0.013
Token?"
Token ID1701
Feature activation+2.57e-02

Pos logit contributions
+0.073
 eng+0.065
eth+0.064
 sure+0.062
 wish+0.061
Neg logit contributions
 yourselves-0.062
ocolate-0.058
ados-0.054
 yourself-0.053
ting-0.053
Token\n
Token ID198
Feature activation+5.41e-03

Pos logit contributions
+0.017
eth+0.012
 wish+0.011
 printed+0.011
 favorite+0.011
Neg logit contributions
 yourselves-0.062
ocolate-0.061
ados-0.058
ting-0.057
 sidelines-0.057
Token\n
Token ID198
Feature activation+3.85e-03

Pos logit contributions
+0.004
 favorite+0.003
eth+0.003
 printed+0.003
 wish+0.003
Neg logit contributions
 yourselves-0.013
ocolate-0.012
 yourself-0.011
ados-0.011
 sidelines-0.011
TokenTom
Token ID13787
Feature activation+8.28e-03

Pos logit contributions
+0.006
 said+0.005
eth+0.005
 says+0.005
 connected+0.005
Neg logit contributions
ocolate-0.006
 yourselves-0.006
ados-0.006
ting-0.006
 sidelines-0.005
Token says
Token ID1139
Feature activation-1.27e-02

Pos logit contributions
+0.009
eth+0.007
 eng+0.007
 printed+0.007
 connected+0.007
Neg logit contributions
 yourselves-0.016
ocolate-0.016
ados-0.015
ting-0.015
 sidelines-0.015
TokenCan
Token ID6090
Feature activation-2.74e-03

Pos logit contributions
 yourselves+0.022
ados+0.021
ocolate+0.021
 sidelines+0.020
ting+0.018
Neg logit contributions
-0.026
 printed-0.020
 said-0.019
 wish-0.019
 sure-0.019
Token I
Token ID314
Feature activation-1.80e-03

Pos logit contributions
ocolate+0.004
 yourselves+0.004
ados+0.004
ting+0.004
 sidelines+0.004
Neg logit contributions
-0.005
 said-0.004
 favorite-0.004
eth-0.004
 wish-0.004
Token play
Token ID711
Feature activation-2.38e-03

Pos logit contributions
ocolate+0.003
 yourselves+0.003
ados+0.003
ting+0.003
 sidelines+0.003
Neg logit contributions
-0.002
eth-0.002
 favorite-0.002
 eng-0.002
 said-0.002
Token with
Token ID351
Feature activation+1.26e-03

Pos logit contributions
 yourselves+0.004
ocolate+0.004
ados+0.004
ting+0.004
 yourself+0.004
Neg logit contributions
-0.003
eth-0.003
 eng-0.002
 wish-0.002
 said-0.002
Token it
Token ID340
Feature activation+2.21e-02

Pos logit contributions
+0.002
 said+0.001
 sure+0.001
 favorite+0.001
 stuck+0.001
Neg logit contributions
ocolate-0.002
 yourselves-0.002
 sidelines-0.002
ting-0.002
ados-0.002
Token?",
Token ID35379
Feature activation+4.41e-02

Pos logit contributions
+0.035
eth+0.033
 eng+0.032
 wish+0.031
 printed+0.030
Neg logit contributions
ocolate-0.031
 yourselves-0.031
 yourself-0.027
ados-0.027
ting-0.027
Token she
Token ID673
Feature activation+1.86e-02

Pos logit contributions
 favorite+0.065
+0.064
 favourite+0.061
eth+0.059
erg+0.059
Neg logit contributions
 yourselves-0.056
ting-0.053
ados-0.050
 yourself-0.048
ocolate-0.048
Token said
Token ID531
Feature activation+3.90e-03

Pos logit contributions
eth+0.017
+0.016
erg+0.015
 favorite+0.015
 favourite+0.014
Neg logit contributions
 yourselves-0.033
 yourself-0.031
ting-0.031
ados-0.031
 wig-0.030
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
+0.004
eth+0.003
 favorite+0.003
 said+0.003
 wish+0.003
Neg logit contributions
 yourselves-0.008
ocolate-0.008
ados-0.007
ting-0.007
 sidelines-0.007
Token Her
Token ID2332
Feature activation-8.69e-03

Pos logit contributions
+0.017
 wish+0.016
 favorite+0.015
 connected+0.014
 printed+0.014
Neg logit contributions
ocolate-0.025
 yourselves-0.024
ados-0.023
ting-0.022
 sidelines-0.021
Token mum
Token ID25682
Feature activation+1.38e-03

Pos logit contributions
 yourselves+0.016
ocolate+0.016
ting+0.015
ados+0.014
 sidelines+0.014
Neg logit contributions
-0.012
 favorite-0.010
 wish-0.010
 said-0.010
 sure-0.009
Token can
Token ID460
Feature activation+6.56e-03

Pos logit contributions
+0.013
 said+0.010
eth+0.010
 sure+0.010
 wish+0.010
Neg logit contributions
 yourselves-0.015
ocolate-0.015
ados-0.014
 sidelines-0.014
ting-0.014
Token I
Token ID314
Feature activation+1.77e-02

Pos logit contributions
+0.012
eth+0.010
 favorite+0.010
 said+0.010
 wish+0.010
Neg logit contributions
 yourselves-0.009
ocolate-0.009
ados-0.008
ting-0.008
 sidelines-0.008
Token keep
Token ID1394
Feature activation+2.72e-03

Pos logit contributions
+0.022
eth+0.019
 eng+0.019
 favorite+0.019
 connected+0.018
Neg logit contributions
ocolate-0.030
 yourselves-0.030
ting-0.027
 yourself-0.027
 sidelines-0.027
Token the
Token ID262
Feature activation+8.64e-03

Pos logit contributions
+0.004
 favorite+0.003
 said+0.003
 sure+0.003
 printed+0.003
Neg logit contributions
ocolate-0.004
 yourselves-0.004
ados-0.004
 sidelines-0.004
ting-0.004
Token talking
Token ID3375
Feature activation+2.07e-02

Pos logit contributions
+0.016
eth+0.014
 connected+0.013
 printed+0.012
 eng+0.012
Neg logit contributions
 yourselves-0.011
ocolate-0.011
 yourself-0.010
 temptation-0.010
ados-0.009
Token ash
Token ID12530
Feature activation+4.39e-02

Pos logit contributions
+0.028
eth+0.023
 favorite+0.022
 said+0.022
 wish+0.022
Neg logit contributions
 yourselves-0.037
ocolate-0.036
ados-0.033
ting-0.033
 sidelines-0.033
Tokentr
Token ID2213
Feature activation-2.11e-03

Pos logit contributions
+0.049
 eng+0.039
 wish+0.038
eth+0.038
 connected+0.038
Neg logit contributions
ocolate-0.085
 yourselves-0.084
ados-0.078
ting-0.077
 yourself-0.075
Tokenay
Token ID323
Feature activation+7.50e-03

Pos logit contributions
ocolate+0.003
 yourselves+0.003
ados+0.003
ting+0.003
 yourself+0.003
Neg logit contributions
-0.003
 wish-0.003
 connected-0.003
 printed-0.003
 sure-0.003
Token?
Token ID30
Feature activation+3.98e-03

Pos logit contributions
+0.009
eth+0.007
 said+0.007
 favorite+0.007
 once+0.007
Neg logit contributions
 yourselves-0.014
ocolate-0.014
 sidelines-0.013
ados-0.013
ting-0.012
Tokenâ
Token ID22940
Feature activation-1.37e-03

Pos logit contributions
+0.005
 favorite+0.004
 wish+0.004
 printed+0.004
eth+0.004
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ting-0.007
ados-0.007
 sidelines-0.006
Token
Token ID26391
Feature activation+6.30e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
ados+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.002
 said-0.002
eth-0.002
 printed-0.002
 wish-0.002
Token boat
Token ID8848
Feature activation+3.81e-02

Pos logit contributions
+0.008
eth+0.007
 eng+0.007
erg+0.007
 sure+0.006
Neg logit contributions
 yourselves-0.008
ocolate-0.007
ados-0.007
ting-0.006
 yourself-0.006
Token fit
Token ID4197
Feature activation+2.39e-02

Pos logit contributions
+0.049
eth+0.040
 favorite+0.038
 wish+0.037
 said+0.037
Neg logit contributions
 yourselves-0.069
ocolate-0.068
ados-0.063
ting-0.062
 sidelines-0.061
Token in
Token ID287
Feature activation+1.41e-02

Pos logit contributions
+0.027
eth+0.020
 favorite+0.020
 said+0.020
 wish+0.019
Neg logit contributions
 yourselves-0.047
ocolate-0.047
ados-0.044
ting-0.043
 sidelines-0.043
Token your
Token ID534
Feature activation-1.52e-02

Pos logit contributions
+0.019
eth+0.016
 favorite+0.015
 said+0.015
 wish+0.015
Neg logit contributions
 yourselves-0.024
ocolate-0.024
ados-0.022
ting-0.022
 sidelines-0.021
Token big
Token ID1263
Feature activation+4.95e-03

Pos logit contributions
 yourselves+0.029
ocolate+0.028
ados+0.026
ting+0.026
 sidelines+0.026
Neg logit contributions
-0.019
eth-0.015
 favorite-0.014
 said-0.014
 wish-0.014
Token boat
Token ID8848
Feature activation+4.36e-02

Pos logit contributions
+0.006
eth+0.005
 favorite+0.005
 wish+0.005
 said+0.005
Neg logit contributions
 yourselves-0.009
ocolate-0.009
ados-0.009
ting-0.009
 sidelines-0.008
Token?"
Token ID1701
Feature activation+1.54e-02

Pos logit contributions
+0.061
eth+0.051
 favorite+0.049
 wish+0.048
 said+0.048
Neg logit contributions
 yourselves-0.074
ocolate-0.073
ados-0.067
ting-0.066
 sidelines-0.065
Token The
Token ID383
Feature activation-7.75e-03

Pos logit contributions
+0.015
eth+0.012
 said+0.011
 favorite+0.011
 wish+0.011
Neg logit contributions
 yourselves-0.033
ocolate-0.033
ados-0.030
ting-0.030
 sidelines-0.030
Token captain
Token ID10654
Feature activation-7.84e-03

Pos logit contributions
ocolate+0.017
 yourselves+0.017
ados+0.016
ting+0.016
 sidelines+0.015
Neg logit contributions
-0.007
 wish-0.005
 favorite-0.005
eth-0.005
 said-0.005
Token replied
Token ID8712
Feature activation-8.44e-03

Pos logit contributions
ocolate+0.014
 course+0.013
 cream+0.013
ting+0.013
endor+0.013
Neg logit contributions
-0.008
eth-0.007
 Hatt-0.006
 eng-0.006
 favourite-0.006
Token,
Token ID11
Feature activation+8.26e-03

Pos logit contributions
 yourselves+0.026
 sidelines+0.025
ocolate+0.023
 yourself+0.023
keeping+0.022
Neg logit contributions
-0.002
 together-0.002
 said-0.001
 once-0.001
 sure-0.001
TokenWhere
Token ID8496
Feature activation+4.49e-03

Pos logit contributions
 yourselves+0.013
ocolate+0.013
ados+0.012
 sidelines+0.012
 temptation+0.011
Neg logit contributions
-0.015
 said-0.010
 printed-0.010
 asked-0.010
 favorite-0.010
Token should
Token ID815
Feature activation-1.39e-03

Pos logit contributions
+0.005
eth+0.004
 printed+0.004
 connected+0.004
 favorite+0.004
Neg logit contributions
ocolate-0.008
 yourselves-0.008
ados-0.008
 yourself-0.007
ting-0.007
Token I
Token ID314
Feature activation+5.10e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
ados+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.002
eth-0.002
 favorite-0.002
 said-0.002
 printed-0.002
Token start
Token ID923
Feature activation+1.57e-02

Pos logit contributions
eth+0.006
 eng+0.006
+0.006
 favorite+0.006
om+0.005
Neg logit contributions
ocolate-0.008
 yourselves-0.008
 yourself-0.008
 course-0.007
ting-0.007
Token my
Token ID616
Feature activation+2.29e-03

Pos logit contributions
+0.024
eth+0.023
 wish+0.021
 eng+0.021
 favorite+0.021
Neg logit contributions
 yourselves-0.024
ocolate-0.023
 yourself-0.021
ados-0.021
ting-0.020
Token search
Token ID2989
Feature activation+4.36e-02

Pos logit contributions
+0.003
eth+0.003
 eng+0.002
 sure+0.002
 printed+0.002
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ados-0.004
 sidelines-0.004
 yourself-0.003
Token?"
Token ID1701
Feature activation+2.38e-02

Pos logit contributions
+0.067
eth+0.059
 eng+0.056
 wish+0.055
 printed+0.055
Neg logit contributions
 yourselves-0.068
ocolate-0.065
ados-0.060
 yourself-0.059
ting-0.058
Token Mom
Token ID11254
Feature activation+1.19e-02

Pos logit contributions
+0.024
eth+0.018
 said+0.017
 favorite+0.017
 wish+0.017
Neg logit contributions
 yourselves-0.051
ocolate-0.050
ados-0.047
ting-0.047
 sidelines-0.046
Token pointed
Token ID6235
Feature activation-2.33e-03

Pos logit contributions
 favorite+0.012
 wish+0.012
 eng+0.012
+0.011
 favourite+0.011
Neg logit contributions
ocolate-0.020
ting-0.019
 course-0.018
 cream-0.018
ados-0.017
Token outside
Token ID2354
Feature activation+1.43e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ados+0.005
ting+0.004
 sidelines+0.004
Neg logit contributions
-0.002
eth-0.002
 favorite-0.002
 wish-0.002
 said-0.002
Token and
Token ID290
Feature activation+7.30e-03

Pos logit contributions
+0.001
 said+0.001
 once+0.001
eth+0.001
 favorite+0.001
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ados-0.003
 sidelines-0.003
ting-0.003
Token do
Token ID466
Feature activation+3.86e-03

Pos logit contributions
 yourselves+0.011
ocolate+0.011
 sidelines+0.011
ados+0.011
ting+0.011
Neg logit contributions
-0.005
 favorite-0.003
 wish-0.003
 said-0.003
 printed-0.003
Token you
Token ID345
Feature activation+7.64e-03

Pos logit contributions
+0.007
eth+0.006
 favorite+0.006
 said+0.006
 wish+0.006
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.005
ting-0.005
 sidelines-0.004
Token do
Token ID466
Feature activation+1.33e-02

Pos logit contributions
eth+0.010
+0.009
 Georg+0.008
 favorite+0.008
 Sad+0.008
Neg logit contributions
ocolate-0.013
 yourselves-0.013
 yourself-0.012
 course-0.012
ting-0.011
Token on
Token ID319
Feature activation+8.06e-03

Pos logit contributions
+0.019
eth+0.017
 eng+0.015
 printed+0.015
 wish+0.015
Neg logit contributions
 yourselves-0.022
ocolate-0.022
 yourself-0.020
ados-0.020
ting-0.019
Token your
Token ID534
Feature activation-8.90e-03

Pos logit contributions
+0.012
eth+0.010
 said+0.009
 favorite+0.009
 printed+0.009
Neg logit contributions
 yourselves-0.014
ocolate-0.013
ados-0.012
ting-0.012
 sidelines-0.012
Token boat
Token ID8848
Feature activation+4.34e-02

Pos logit contributions
 yourselves+0.018
ocolate+0.018
ados+0.017
ting+0.016
 sidelines+0.016
Neg logit contributions
-0.010
eth-0.008
 said-0.007
 printed-0.007
 wish-0.007
Token?"
Token ID1701
Feature activation+1.23e-02

Pos logit contributions
+0.066
eth+0.056
 eng+0.054
 wish+0.052
 sure+0.052
Neg logit contributions
 yourselves-0.068
ocolate-0.066
ados-0.060
ting-0.059
 yourself-0.059
Token␣
Token ID220
Feature activation-5.73e-03

Pos logit contributions
+0.015
 said+0.011
eth+0.011
 favorite+0.011
 printed+0.011
Neg logit contributions
 yourselves-0.024
ocolate-0.024
ados-0.022
ting-0.022
 sidelines-0.021
Token\n
Token ID198
Feature activation-1.26e-03

Pos logit contributions
 yourselves+0.010
ocolate+0.010
 yourself+0.009
ados+0.009
ting+0.008
Neg logit contributions
 wish-0.007
 printed-0.006
 wrapped-0.006
-0.006
 pretended-0.006
Token\n
Token ID198
Feature activation-5.53e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ting+0.003
ados+0.003
 sidelines+0.003
Neg logit contributions
-0.001
 wish-0.001
eth-0.001
 favorite-0.001
 eng-0.001
Token"
Token ID1
Feature activation-1.38e-02

Pos logit contributions
 yourselves+0.010
ocolate+0.010
ados+0.009
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.008
 said-0.006
eth-0.006
 printed-0.006
's-0.006
Tokens
Token ID82
Feature activation+5.92e-03

Pos logit contributions
 yourselves+0.032
ocolate+0.031
ados+0.029
ting+0.029
 sidelines+0.028
Neg logit contributions
-0.019
eth-0.015
 favorite-0.014
 wish-0.014
 said-0.014
Token wrong
Token ID2642
Feature activation+1.24e-02

Pos logit contributions
+0.009
eth+0.008
 favorite+0.008
 said+0.008
 wish+0.008
Neg logit contributions
 yourselves-0.009
ocolate-0.009
ados-0.008
ting-0.008
 sidelines-0.008
Token,
Token ID11
Feature activation+5.77e-03

Pos logit contributions
+0.018
eth+0.015
 said+0.014
 favorite+0.014
 wish+0.014
Neg logit contributions
 yourselves-0.021
ocolate-0.021
ados-0.019
 sidelines-0.019
ting-0.019
Token rh
Token ID9529
Feature activation-1.79e-02

Pos logit contributions
+0.007
eth+0.006
 favorite+0.006
 said+0.006
 wish+0.006
Neg logit contributions
 yourselves-0.011
ocolate-0.010
ados-0.010
ting-0.009
 sidelines-0.009
Tokenin
Token ID259
Feature activation+2.76e-02

Pos logit contributions
ocolate+0.041
 sidelines+0.040
ados+0.039
 yourselves+0.038
 temptation+0.036
Neg logit contributions
-0.013
a-0.012
 said-0.011
 stuck-0.008
 lost-0.008
Tokenoc
Token ID420
Feature activation-6.32e-02

Pos logit contributions
+0.040
eth+0.033
 favorite+0.032
 said+0.032
 wish+0.031
Neg logit contributions
 yourselves-0.046
ocolate-0.046
ados-0.042
ting-0.041
 sidelines-0.041
Tokeneros
Token ID27498
Feature activation+8.57e-03

Pos logit contributions
 sidelines+0.085
 yourselves+0.078
ocolate+0.075
 temptation+0.072
Does+0.070
Neg logit contributions
-0.090
 once-0.085
 says-0.084
 said-0.082
eth-0.081
Token?
Token ID30
Feature activation+6.18e-03

Pos logit contributions
+0.012
eth+0.010
 said+0.010
 favorite+0.010
 wish+0.010
Neg logit contributions
 yourselves-0.014
ocolate-0.014
ados-0.013
ting-0.013
 sidelines-0.013
Tokenâ
Token ID22940
Feature activation-6.79e-03

Pos logit contributions
+0.008
 favorite+0.007
 wish+0.007
eth+0.007
 printed+0.007
Neg logit contributions
 yourselves-0.010
ocolate-0.010
ting-0.009
ados-0.009
 yourself-0.009
Token
Token ID26391
Feature activation+3.12e-03

Pos logit contributions
 yourselves+0.010
ocolate+0.010
ados+0.010
 sidelines+0.009
ting+0.009
Neg logit contributions
-0.011
 said-0.009
eth-0.009
 printed-0.009
 worked-0.009
Token said
Token ID531
Feature activation-7.66e-03

Pos logit contributions
+0.003
eth+0.003
 favorite+0.002
 said+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.006
ting-0.006
 sidelines-0.006
Token try
Token ID1949
Feature activation-1.74e-02

Pos logit contributions
+0.007
eth+0.006
 favorite+0.006
 printed+0.006
 said+0.006
Neg logit contributions
 yourselves-0.008
ocolate-0.008
ados-0.007
ting-0.007
 sidelines-0.007
Token to
Token ID284
Feature activation-1.23e-02

Pos logit contributions
ocolate+0.041
 yourselves+0.040
 sidelines+0.038
ados+0.038
ting+0.036
Neg logit contributions
-0.015
 favorite-0.011
 sure-0.010
 once-0.010
 said-0.010
Token grab
Token ID5552
Feature activation-1.25e-02

Pos logit contributions
 yourselves+0.021
ocolate+0.021
ados+0.019
ting+0.019
 sidelines+0.019
Neg logit contributions
-0.017
eth-0.014
 favorite-0.013
 said-0.013
 wish-0.013
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
ocolate+0.025
 yourselves+0.025
 sidelines+0.023
ados+0.023
ting+0.022
Neg logit contributions
-0.015
 favorite-0.011
 said-0.011
 sure-0.011
eth-0.011
Token av
Token ID1196
Feature activation-1.56e-02

Pos logit contributions
ocolate+0.026
 yourselves+0.025
ting+0.022
 sidelines+0.022
ross+0.021
Neg logit contributions
-0.023
 favorite-0.021
 special-0.020
 lost-0.019
 wish-0.019
Tokenoc
Token ID420
Feature activation-6.30e-02

Pos logit contributions
 yourselves+0.031
ocolate+0.030
ados+0.028
ting+0.028
 sidelines+0.028
Neg logit contributions
-0.018
eth-0.014
 favorite-0.013
 said-0.013
 wish-0.013
Tokenados
Token ID22484
Feature activation-4.53e-03

Pos logit contributions
 yourselves+0.055
 sidelines+0.053
Does+0.047
ocolate+0.044
Leave+0.044
Neg logit contributions
-0.117
 once-0.110
 favorite-0.107
's-0.106
om-0.105
Token.
Token ID13
Feature activation+3.62e-03

Pos logit contributions
 yourselves+0.008
ocolate+0.008
ting+0.008
ados+0.008
 yourself+0.007
Neg logit contributions
-0.005
eth-0.005
 wish-0.005
 eng-0.005
 printed-0.005
Token But
Token ID887
Feature activation+3.70e-03

Pos logit contributions
+0.004
 wish+0.003
eth+0.003
 printed+0.003
 connected+0.003
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.007
ting-0.006
 sidelines-0.006
Token the
Token ID262
Feature activation-4.97e-03

Pos logit contributions
+0.004
eth+0.004
 favorite+0.003
 wish+0.003
 said+0.003
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.007
ting-0.006
 sidelines-0.006
Token rock
Token ID3881
Feature activation+2.19e-02

Pos logit contributions
ocolate+0.009
 yourselves+0.008
ting+0.008
ross+0.007
 sidelines+0.007
Neg logit contributions
-0.007
 favorite-0.007
 special-0.006
 said-0.006
 lost-0.006
Token didn
Token ID1422
Feature activation-8.43e-03

Pos logit contributions
 yourselves+0.041
ocolate+0.040
ados+0.038
ting+0.038
 sidelines+0.037
Neg logit contributions
-0.024
eth-0.019
 favorite-0.018
 wish-0.018
 printed-0.017
Token't
Token ID470
Feature activation-4.66e-03

Pos logit contributions
ocolate+0.010
 yourselves+0.010
ados+0.008
ting+0.008
 sidelines+0.008
Neg logit contributions
-0.015
 wish-0.014
 favorite-0.014
eth-0.014
 said-0.013
Token like
Token ID588
Feature activation-1.35e-02

Pos logit contributions
ocolate+0.011
 yourselves+0.010
 yourself+0.010
ting+0.010
ados+0.010
Neg logit contributions
-0.004
eth-0.003
 favorite-0.003
 favourite-0.002
 printed-0.002
Token wet
Token ID9583
Feature activation-1.23e-02

Pos logit contributions
ocolate+0.028
 yourselves+0.027
 sidelines+0.025
endor+0.025
ross+0.025
Neg logit contributions
-0.014
 special-0.012
 lost-0.012
 favorite-0.012
 once-0.011
Token av
Token ID1196
Feature activation-1.48e-02

Pos logit contributions
ocolate+0.023
bek+0.022
 sidelines+0.022
gency+0.021
Deal+0.019
Neg logit contributions
 inside-0.015
 once-0.014
 at-0.014
-0.014
 lost-0.013
Tokenoc
Token ID420
Feature activation-6.20e-02

Pos logit contributions
 yourselves+0.027
ocolate+0.027
ados+0.026
ting+0.025
 sidelines+0.024
Neg logit contributions
-0.018
eth-0.015
 favorite-0.014
 wish-0.014
 printed-0.014
Tokenados
Token ID22484
Feature activation-1.40e-03

Pos logit contributions
 yourselves+0.050
 sidelines+0.048
Does+0.041
ocolate+0.040
gency+0.034
Neg logit contributions
-0.123
 once-0.120
 favorite-0.112
 stuck-0.108
 says-0.108
Token,
Token ID11
Feature activation+7.67e-03

Pos logit contributions
 yourselves+0.002
ting+0.002
ocolate+0.002
ados+0.002
 course+0.002
Neg logit contributions
-0.002
eth-0.002
 wish-0.001
 eng-0.001
 printed-0.001
Token so
Token ID523
Feature activation+1.42e-02

Pos logit contributions
+0.008
 said+0.006
 sure+0.005
 favorite+0.005
 stuck+0.005
Neg logit contributions
ocolate-0.017
 yourselves-0.017
ados-0.016
 sidelines-0.016
anish-0.015
Token he
Token ID339
Feature activation-1.35e-03

Pos logit contributions
+0.020
eth+0.017
 favorite+0.016
 printed+0.016
 wish+0.016
Neg logit contributions
 yourselves-0.024
ocolate-0.023
ados-0.021
ting-0.021
 sidelines-0.021
Token decided
Token ID3066
Feature activation+2.05e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.002
eth-0.001
 favorite-0.001
 wish-0.001
 printed-0.001
Token They
Token ID1119
Feature activation-1.07e-02

Pos logit contributions
+0.004
eth+0.003
 wish+0.003
 sure+0.003
 printed+0.003
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.005
ting-0.005
 sidelines-0.005
Token forget
Token ID6044
Feature activation-4.16e-03

Pos logit contributions
 yourselves+0.017
ocolate+0.017
 sidelines+0.016
ting+0.015
 yourself+0.015
Neg logit contributions
-0.015
eth-0.013
 favorite-0.013
 printed-0.012
 eng-0.012
Token about
Token ID546
Feature activation-1.01e-02

Pos logit contributions
 yourselves+0.010
ocolate+0.010
ados+0.010
 sidelines+0.010
ting+0.010
Neg logit contributions
-0.003
 said-0.002
 favorite-0.002
 sure-0.002
 wish-0.002
Token the
Token ID262
Feature activation-7.91e-03

Pos logit contributions
ocolate+0.023
 yourselves+0.023
ados+0.022
 sidelines+0.021
ting+0.021
Neg logit contributions
-0.008
 favorite-0.007
 once-0.006
 said-0.006
 lost-0.006
Token av
Token ID1196
Feature activation-4.08e-03

Pos logit contributions
ocolate+0.014
 yourselves+0.014
ting+0.013
ados+0.012
 sidelines+0.012
Neg logit contributions
-0.010
 favorite-0.009
 wish-0.009
 lost-0.009
 special-0.009
Tokenoc
Token ID420
Feature activation-6.10e-02

Pos logit contributions
 yourselves+0.007
ocolate+0.007
ados+0.007
ting+0.007
 sidelines+0.007
Neg logit contributions
-0.005
eth-0.004
 favorite-0.004
 wish-0.004
 printed-0.004
Tokenados
Token ID22484
Feature activation+2.78e-03

Pos logit contributions
 yourselves+0.052
 sidelines+0.052
 hi+0.047
ocolate+0.046
Does+0.043
Neg logit contributions
 once-0.110
-0.103
 favorite-0.100
 inside-0.099
 stuck-0.098
Token.
Token ID13
Feature activation+4.31e-03

Pos logit contributions
+0.003
eth+0.003
 favorite+0.003
 wish+0.003
 eng+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ting-0.005
ados-0.004
 course-0.004
Token They
Token ID1119
Feature activation-1.19e-02

Pos logit contributions
+0.005
eth+0.004
 wish+0.004
 connected+0.004
 printed+0.004
Neg logit contributions
 yourselves-0.008
ocolate-0.008
ados-0.007
ting-0.007
 yourself-0.007
Token just
Token ID655
Feature activation+1.25e-03

Pos logit contributions
 yourselves+0.020
ocolate+0.019
 sidelines+0.018
ting+0.017
 course+0.017
Neg logit contributions
-0.016
eth-0.014
 favorite-0.013
 printed-0.013
 eng-0.013
Token want
Token ID765
Feature activation-2.28e-03

Pos logit contributions
eth+0.001
+0.001
 favorite+0.001
 Hatt+0.001
 favourite+0.001
Neg logit contributions
 yourselves-0.002
ocolate-0.002
 sidelines-0.002
 yourself-0.002
ting-0.002
Token playing
Token ID2712
Feature activation-4.31e-03

Pos logit contributions
 yourselves+0.012
ocolate+0.011
ting+0.011
 sidelines+0.011
ados+0.010
Neg logit contributions
-0.008
eth-0.007
 favorite-0.007
 wish-0.006
 eng-0.006
Token!
Token ID0
Feature activation-7.38e-03

Pos logit contributions
ocolate+0.010
 yourselves+0.010
ados+0.009
 sidelines+0.009
ting+0.009
Neg logit contributions
-0.004
 said-0.003
 favorite-0.003
 connected-0.003
 printed-0.003
Token The
Token ID383
Feature activation-1.07e-02

Pos logit contributions
 yourselves+0.014
ocolate+0.014
ados+0.013
ting+0.013
 sidelines+0.013
Neg logit contributions
-0.007
 wish-0.007
 favorite-0.006
eth-0.006
 printed-0.006
Token rh
Token ID9529
Feature activation-2.61e-02

Pos logit contributions
ocolate+0.020
 yourselves+0.019
ados+0.018
ting+0.017
 yourself+0.017
Neg logit contributions
-0.014
 wish-0.012
 favorite-0.012
 said-0.012
eth-0.011
Tokenin
Token ID259
Feature activation+1.87e-03

Pos logit contributions
ocolate+0.049
 yourselves+0.048
 sidelines+0.045
ados+0.044
ting+0.043
Neg logit contributions
-0.031
 said-0.025
 printed-0.025
 connected-0.024
 once-0.024
Tokenoc
Token ID420
Feature activation-6.00e-02

Pos logit contributions
+0.002
eth+0.002
 favorite+0.002
 eng+0.002
 connected+0.002
Neg logit contributions
ocolate-0.003
 yourselves-0.003
ting-0.003
ados-0.003
 course-0.003
Tokeneros
Token ID27498
Feature activation-1.89e-02

Pos logit contributions
 sidelines+0.088
ocolate+0.087
 yourselves+0.085
 temptation+0.081
Does+0.079
Neg logit contributions
 once-0.072
-0.065
 favorite-0.061
 inside-0.060
 stuck-0.059
Token was
Token ID373
Feature activation-3.14e-03

Pos logit contributions
 yourselves+0.035
ocolate+0.035
ting+0.035
ados+0.033
 course+0.033
Neg logit contributions
-0.019
eth-0.017
 wish-0.015
 favorite-0.015
 eng-0.015
Token very
Token ID845
Feature activation+2.84e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ting+0.005
 sidelines+0.005
 yourself+0.005
Neg logit contributions
-0.004
eth-0.004
 wish-0.003
 favorite-0.003
erg-0.003
Token worried
Token ID7960
Feature activation-8.75e-03

Pos logit contributions
+0.003
eth+0.003
 Hatt+0.002
 Georg+0.002
 printed+0.002
Neg logit contributions
 yourselves-0.005
ocolate-0.005
 sidelines-0.005
 yourself-0.005
 temptation-0.005
Token,
Token ID11
Feature activation+2.53e-03

Pos logit contributions
 yourselves+0.018
ocolate+0.018
ados+0.017
ting+0.017
 sidelines+0.017
Neg logit contributions
-0.009
eth-0.007
 said-0.006
 favorite-0.006
 wish-0.006
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
 yourselves+0.005
ocolate+0.005
 sidelines+0.004
ting+0.004
ados+0.004
Neg logit contributions
-0.002
 said-0.002
 sure-0.002
 favorite-0.002
 printed-0.002
Token "
Token ID366
Feature activation-5.55e-03

Pos logit contributions
+0.018
 said+0.015
 sure+0.014
eth+0.014
 printed+0.014
Neg logit contributions
 yourselves-0.021
ocolate-0.020
ados-0.018
 sidelines-0.018
 yourself-0.017
TokenI
Token ID40
Feature activation+5.36e-03

Pos logit contributions
 yourselves+0.007
ocolate+0.007
ados+0.006
ting+0.006
 sidelines+0.006
Neg logit contributions
-0.010
eth-0.009
 favorite-0.009
 said-0.009
 wish-0.009
Token love
Token ID1842
Feature activation-9.34e-03

Pos logit contributions
eth+0.006
 Hatt+0.006
 Georg+0.006
 connected+0.006
 favorite+0.005
Neg logit contributions
ocolate-0.008
 course-0.007
 yourselves-0.007
 liar-0.007
 yourself-0.007
Token av
Token ID1196
Feature activation-1.14e-02

Pos logit contributions
ocolate+0.017
 yourselves+0.016
 sidelines+0.015
ting+0.015
endor+0.015
Neg logit contributions
-0.013
 said-0.011
 special-0.010
 sure-0.010
 once-0.010
Tokenoc
Token ID420
Feature activation-5.92e-02

Pos logit contributions
 yourselves+0.022
ocolate+0.022
ados+0.020
ting+0.020
 sidelines+0.020
Neg logit contributions
-0.013
eth-0.011
 favorite-0.010
 wish-0.010
 said-0.010
Tokenados
Token ID22484
Feature activation-5.69e-04

Pos logit contributions
 sidelines+0.048
 yourselves+0.048
ocolate+0.040
Does+0.037
 crashing+0.036
Neg logit contributions
-0.121
 once-0.109
 says-0.105
's-0.104
 said-0.103
Token!
Token ID0
Feature activation+3.12e-03

Pos logit contributions
 yourselves+0.001
ocolate+0.001
ados+0.001
 sidelines+0.001
ting+0.001
Neg logit contributions
-0.001
eth-0.000
 said-0.000
 favorite-0.000
 wish-0.000
Token They
Token ID1119
Feature activation-1.23e-02

Pos logit contributions
+0.005
eth+0.004
 favorite+0.004
 said+0.004
 wish+0.004
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.004
ting-0.004
 sidelines-0.004
Token are
Token ID389
Feature activation-3.50e-03

Pos logit contributions
 yourselves+0.021
ocolate+0.020
ados+0.019
ting+0.018
 yourself+0.018
Neg logit contributions
-0.016
eth-0.015
 eng-0.013
 favorite-0.013
erg-0.013
Token so
Token ID523
Feature activation+7.32e-03

Pos logit contributions
 yourselves+0.007
ocolate+0.006
ados+0.006
ting+0.006
 yourself+0.006
Neg logit contributions
-0.004
eth-0.004
 wish-0.003
 favorite-0.003
 said-0.003
Token his
Token ID465
Feature activation-5.83e-03

Pos logit contributions
 yourselves+0.022
ocolate+0.022
ting+0.020
ados+0.020
 sidelines+0.020
Neg logit contributions
-0.016
eth-0.014
 favorite-0.012
 wish-0.012
 printed-0.012
Token favourite
Token ID12507
Feature activation-5.09e-03

Pos logit contributions
 yourselves+0.010
ocolate+0.010
ados+0.009
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.008
 favorite-0.007
 said-0.007
eth-0.007
 wish-0.007
Token treat
Token ID2190
Feature activation-8.76e-03

Pos logit contributions
 yourselves+0.009
ocolate+0.008
ting+0.008
ados+0.008
 yourself+0.007
Neg logit contributions
-0.008
 favorite-0.007
 wish-0.007
 special-0.007
 said-0.007
Token -
Token ID532
Feature activation+2.43e-03

Pos logit contributions
 yourselves+0.018
ocolate+0.017
ados+0.016
ting+0.016
 sidelines+0.016
Neg logit contributions
-0.010
eth-0.008
 favorite-0.007
 said-0.007
 wish-0.007
Token ch
Token ID442
Feature activation-1.60e-03

Pos logit contributions
+0.003
 said+0.002
 sure+0.002
 favorite+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.005
 sidelines-0.004
ting-0.004
Tokenoc
Token ID420
Feature activation-5.91e-02

Pos logit contributions
 yourselves+0.003
ocolate+0.003
 sidelines+0.003
ting+0.003
ados+0.003
Neg logit contributions
-0.002
eth-0.002
 sure-0.002
 wish-0.002
 stuck-0.001
Token-
Token ID12
Feature activation-2.70e-02

Pos logit contributions
 sidelines+0.072
 yourselves+0.072
chard+0.065
Does+0.064
nam+0.063
Neg logit contributions
-0.089
 once-0.087
eth-0.085
 sure-0.080
 Georg-0.079
Tokenchip
Token ID35902
Feature activation-2.01e-02

Pos logit contributions
 yourselves+0.045
ocolate+0.042
ados+0.040
 sidelines+0.039
ting+0.038
Neg logit contributions
-0.043
eth-0.034
 said-0.034
 sure-0.033
 favorite-0.033
Token ice
Token ID4771
Feature activation-1.60e-03

Pos logit contributions
ocolate+0.035
 yourselves+0.035
 sidelines+0.032
ados+0.031
ting+0.031
Neg logit contributions
-0.028
 said-0.024
 favorite-0.022
 sure-0.022
 says-0.021
Token cream
Token ID8566
Feature activation-1.41e-02

Pos logit contributions
 yourselves+0.002
ocolate+0.002
 sidelines+0.001
ting+0.001
ados+0.001
Neg logit contributions
-0.003
 favorite-0.003
 said-0.003
eth-0.003
 printed-0.003
Token!
Token ID0
Feature activation-6.96e-03

Pos logit contributions
 yourselves+0.028
ocolate+0.027
ados+0.025
 sidelines+0.024
ting+0.024
Neg logit contributions
-0.018
 said-0.014
 favorite-0.013
eth-0.013
 printed-0.013
Token saw
Token ID2497
Feature activation+1.64e-03

Pos logit contributions
 yourselves+0.013
ocolate+0.013
ting+0.013
 sidelines+0.012
ados+0.012
Neg logit contributions
-0.008
eth-0.007
 wish-0.006
 dot-0.005
 eng-0.005
Token a
Token ID257
Feature activation+2.95e-03

Pos logit contributions
+0.002
 favorite+0.001
eth+0.001
 said+0.001
 wish+0.001
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ados-0.003
ting-0.003
 sidelines-0.003
Token big
Token ID1263
Feature activation+2.75e-03

Pos logit contributions
+0.003
eth+0.002
 favorite+0.002
 said+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.006
ting-0.005
 sidelines-0.005
Token rh
Token ID9529
Feature activation-9.65e-03

Pos logit contributions
+0.004
eth+0.003
 said+0.003
 wish+0.003
 favorite+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.004
ting-0.004
 sidelines-0.004
Tokenin
Token ID259
Feature activation+2.44e-02

Pos logit contributions
ocolate+0.018
 yourselves+0.018
ados+0.017
 sidelines+0.016
ting+0.016
Neg logit contributions
-0.012
eth-0.009
 printed-0.009
 said-0.009
 wish-0.009
Tokenoc
Token ID420
Feature activation-5.76e-02

Pos logit contributions
+0.027
eth+0.021
 said+0.020
 favorite+0.020
 wish+0.020
Neg logit contributions
 yourselves-0.049
ocolate-0.048
ados-0.045
ting-0.044
 sidelines-0.044
Tokeneros
Token ID27498
Feature activation+9.26e-05

Pos logit contributions
 yourselves+0.085
 sidelines+0.077
 yourself+0.076
ocolate+0.076
 temptation+0.070
Neg logit contributions
 once-0.078
eth-0.075
-0.073
 favorite-0.073
 wish-0.070
Token.
Token ID13
Feature activation-9.09e-03

Pos logit contributions
+0.000
eth+0.000
 said+0.000
 favorite+0.000
 wish+0.000
Neg logit contributions
 yourselves-0.000
ocolate-0.000
ados-0.000
 sidelines-0.000
ting-0.000
Token It
Token ID632
Feature activation+3.91e-03

Pos logit contributions
 yourselves+0.019
ocolate+0.019
ados+0.018
ting+0.018
 sidelines+0.018
Neg logit contributions
-0.009
eth-0.007
 said-0.007
 favorite-0.006
 wish-0.006
Token was
Token ID373
Feature activation+1.18e-03

Pos logit contributions
eth+0.005
+0.005
 wish+0.005
 eng+0.004
ends+0.004
Neg logit contributions
ting-0.005
 course-0.005
 yourselves-0.005
 cream-0.005
ocolate-0.005
Token dirty
Token ID11841
Feature activation+5.16e-04

Pos logit contributions
+0.002
eth+0.001
 wish+0.001
erg+0.001
 know+0.001
Neg logit contributions
 yourselves-0.002
ocolate-0.002
 yourself-0.002
ting-0.002
 cream-0.002
Token on
Token ID319
Feature activation+1.52e-04

Pos logit contributions
 yourselves+0.047
ocolate+0.045
 sidelines+0.043
ados+0.043
 yourself+0.041
Neg logit contributions
-0.020
eth-0.013
 once-0.013
 said-0.013
 sure-0.013
Token,
Token ID11
Feature activation-2.41e-03

Pos logit contributions
+0.000
eth+0.000
 favorite+0.000
 said+0.000
 wish+0.000
Neg logit contributions
 yourselves-0.000
ocolate-0.000
ados-0.000
ting-0.000
 sidelines-0.000
Token Lily
Token ID20037
Feature activation+1.40e-03

Pos logit contributions
 yourselves+0.004
ocolate+0.004
ados+0.004
ting+0.004
 course+0.004
Neg logit contributions
-0.003
eth-0.003
 wish-0.003
erg-0.002
 connected-0.002
Token loved
Token ID6151
Feature activation-1.20e-03

Pos logit contributions
+0.001
eth+0.001
 favorite+0.001
 said+0.001
 wish+0.001
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ados-0.003
ting-0.003
 sidelines-0.003
Token av
Token ID1196
Feature activation-7.95e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
 sidelines+0.002
ting+0.002
anish+0.002
Neg logit contributions
-0.001
 favorite-0.001
 sure-0.001
 said-0.001
 special-0.001
Tokenoc
Token ID420
Feature activation-5.75e-02

Pos logit contributions
ocolate+0.015
 yourselves+0.014
ados+0.014
ting+0.013
 sidelines+0.013
Neg logit contributions
-0.010
eth-0.008
 favorite-0.008
 wish-0.008
 printed-0.008
Tokenados
Token ID22484
Feature activation+6.95e-04

Pos logit contributions
 sidelines+0.054
 yourselves+0.052
Does+0.047
 crashing+0.046
 distance+0.045
Neg logit contributions
 once-0.101
-0.100
 favorite-0.094
's-0.089
 work-0.089
Token and
Token ID290
Feature activation-1.86e-03

Pos logit contributions
+0.001
 favorite+0.000
 said+0.000
 sure+0.000
 once+0.000
Neg logit contributions
 yourselves-0.002
ocolate-0.002
 sidelines-0.001
ados-0.001
ting-0.001
Token would
Token ID561
Feature activation-1.40e-02

Pos logit contributions
 yourselves+0.004
ocolate+0.004
 sidelines+0.004
ados+0.004
ting+0.004
Neg logit contributions
-0.002
 said-0.001
 favorite-0.001
 printed-0.001
 sure-0.001
Token ask
Token ID1265
Feature activation+2.95e-03

Pos logit contributions
 yourselves+0.031
ocolate+0.030
ting+0.029
ados+0.029
 sidelines+0.028
Neg logit contributions
-0.014
 wish-0.010
 favorite-0.010
 said-0.010
 know-0.009
Token for
Token ID329
Feature activation-2.35e-03

Pos logit contributions
+0.002
 favorite+0.002
 sure+0.002
 said+0.002
 special+0.002
Neg logit contributions
 yourselves-0.007
ocolate-0.007
 sidelines-0.006
ados-0.006
ting-0.006
Token"
Token ID1
Feature activation-8.91e-03

Pos logit contributions
 yourselves+0.004
ocolate+0.004
 yourself+0.004
ting+0.004
ados+0.003
Neg logit contributions
-0.005
 favorite-0.004
 wish-0.004
eth-0.004
 printed-0.004
TokenHello
Token ID15496
Feature activation-1.94e-03

Pos logit contributions
 yourselves+0.010
ocolate+0.010
ting+0.009
ados+0.009
 sidelines+0.009
Neg logit contributions
-0.017
eth-0.015
 favorite-0.015
 wish-0.015
 said-0.015
Token,
Token ID11
Feature activation-4.06e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.003
ting+0.003
 yourself+0.003
Neg logit contributions
-0.002
eth-0.002
 connected-0.002
 eng-0.002
 printed-0.002
Token rh
Token ID9529
Feature activation-1.84e-02

Pos logit contributions
 yourselves+0.007
ocolate+0.006
ting+0.006
ados+0.006
 course+0.006
Neg logit contributions
-0.006
eth-0.005
 eng-0.005
 favorite-0.005
 connected-0.005
Tokenin
Token ID259
Feature activation+1.10e-02

Pos logit contributions
ocolate+0.040
 sidelines+0.039
 yourselves+0.037
ados+0.037
 temptation+0.035
Neg logit contributions
-0.016
a-0.013
 said-0.013
 wish-0.012
 once-0.012
Tokenoc
Token ID420
Feature activation-5.72e-02

Pos logit contributions
+0.013
eth+0.010
 favorite+0.010
 wish+0.010
 printed+0.009
Neg logit contributions
 yourselves-0.021
ocolate-0.021
ados-0.020
ting-0.019
 sidelines-0.019
Tokeneros
Token ID27498
Feature activation-1.20e-02

Pos logit contributions
 sidelines+0.090
ocolate+0.082
 yourselves+0.079
 temptation+0.078
Does+0.072
Neg logit contributions
-0.066
 once-0.066
 Georg-0.062
 inside-0.062
eth-0.061
Token!"
Token ID2474
Feature activation+9.38e-03

Pos logit contributions
 yourselves+0.022
ocolate+0.022
ados+0.020
ting+0.020
 yourself+0.019
Neg logit contributions
-0.015
eth-0.013
 favorite-0.012
 printed-0.012
 wish-0.011
Token Anna
Token ID11735
Feature activation+1.97e-02

Pos logit contributions
eth+0.015
+0.014
erg+0.013
 favorite+0.013
 favourite+0.013
Neg logit contributions
 yourselves-0.015
ocolate-0.012
 yourself-0.012
 sidelines-0.011
ados-0.011
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
 favourite+0.024
 favorite+0.024
eth+0.023
+0.022
erg+0.022
Neg logit contributions
 yourselves-0.029
ting-0.029
ocolate-0.027
gging-0.026
ados-0.026
Token Ben
Token ID3932
Feature activation+2.11e-02

Pos logit contributions
 yourselves+0.028
ocolate+0.027
ados+0.025
ting+0.025
 sidelines+0.025
Neg logit contributions
-0.018
eth-0.014
 favorite-0.014
 wish-0.013
 said-0.013
Token at
Token ID379
Feature activation-3.57e-03

Pos logit contributions
 yourselves+0.001
ting+0.001
ados+0.001
ocolate+0.001
 yourself+0.001
Neg logit contributions
-0.001
eth-0.001
erg-0.001
 eng-0.001
 connected-0.001
Token that
Token ID326
Feature activation-5.59e-03

Pos logit contributions
ocolate+0.007
 yourselves+0.007
ados+0.006
 sidelines+0.006
ting+0.006
Neg logit contributions
-0.004
 said-0.003
eth-0.003
 favorite-0.003
 wish-0.003
Token big
Token ID1263
Feature activation-1.75e-03

Pos logit contributions
 yourselves+0.011
ocolate+0.011
ados+0.010
ting+0.010
 sidelines+0.009
Neg logit contributions
-0.007
eth-0.005
 favorite-0.005
 said-0.005
 wish-0.005
Token rh
Token ID9529
Feature activation-1.70e-02

Pos logit contributions
ocolate+0.004
 yourselves+0.003
 sidelines+0.003
ados+0.003
 yourself+0.003
Neg logit contributions
-0.002
 said-0.002
 wish-0.002
 says-0.002
 once-0.001
Tokenin
Token ID259
Feature activation+2.08e-02

Pos logit contributions
ocolate+0.033
 yourselves+0.032
 sidelines+0.032
ados+0.030
 temptation+0.029
Neg logit contributions
-0.019
a-0.015
 said-0.015
 printed-0.014
 stuck-0.014
Tokenoc
Token ID420
Feature activation-5.67e-02

Pos logit contributions
+0.024
 said+0.018
eth+0.017
 wish+0.017
 printed+0.017
Neg logit contributions
 yourselves-0.041
ocolate-0.041
ados-0.038
 sidelines-0.038
ting-0.037
Tokeneros
Token ID27498
Feature activation-1.70e-03

Pos logit contributions
 sidelines+0.080
 yourselves+0.076
ocolate+0.073
 temptation+0.070
Does+0.068
Neg logit contributions
-0.077
 once-0.076
om-0.068
 favorite-0.068
 Georg-0.068
Token!"
Token ID2474
Feature activation+1.97e-02

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.003
ting+0.003
 sidelines+0.003
Neg logit contributions
-0.002
eth-0.002
 favorite-0.002
 wish-0.002
 printed-0.002
Token Billy
Token ID15890
Feature activation+1.85e-04

Pos logit contributions
+0.018
eth+0.016
 favorite+0.015
erg+0.015
 wish+0.014
Neg logit contributions
 yourselves-0.040
ados-0.039
ocolate-0.039
ting-0.037
 sidelines-0.035
Token said
Token ID531
Feature activation-8.55e-03

Pos logit contributions
 favorite+0.000
eth+0.000
+0.000
 favourite+0.000
 wish+0.000
Neg logit contributions
ting-0.000
 yourselves-0.000
gging-0.000
ocolate-0.000
 yourself-0.000
Token,
Token ID11
Feature activation+4.29e-03

Pos logit contributions
 yourselves+0.017
ocolate+0.017
ados+0.016
ting+0.016
 sidelines+0.016
Neg logit contributions
-0.009
eth-0.007
 favorite-0.007
 said-0.007
 wish-0.007
Token he
Token ID339
Feature activation-1.14e-02

Pos logit contributions
 yourselves+0.029
ocolate+0.028
ados+0.026
ting+0.026
 sidelines+0.025
Neg logit contributions
-0.022
eth-0.018
 favorite-0.018
 said-0.018
 wish-0.018
Token saw
Token ID2497
Feature activation-9.49e-03

Pos logit contributions
 yourselves+0.020
ocolate+0.020
ting+0.019
 yourself+0.018
ados+0.018
Neg logit contributions
-0.013
eth-0.011
 favorite-0.011
 wish-0.010
 connected-0.010
Token a
Token ID257
Feature activation-3.36e-03

Pos logit contributions
ocolate+0.021
 yourselves+0.021
ados+0.019
 sidelines+0.019
ting+0.019
Neg logit contributions
-0.009
 sure-0.007
 said-0.007
 favorite-0.007
 wish-0.006
Token rh
Token ID9529
Feature activation-1.17e-02

Pos logit contributions
 yourselves+0.008
ocolate+0.008
ados+0.007
ting+0.007
 sidelines+0.007
Neg logit contributions
-0.003
 wish-0.002
eth-0.002
 said-0.002
 favorite-0.002
Tokenin
Token ID259
Feature activation+2.07e-02

Pos logit contributions
 yourselves+0.023
ocolate+0.023
ados+0.022
 sidelines+0.021
 yourself+0.020
Neg logit contributions
-0.013
a-0.010
 said-0.010
 stuck-0.010
eth-0.010
Tokenoc
Token ID420
Feature activation-5.62e-02

Pos logit contributions
+0.020
 wish+0.015
 said+0.015
eth+0.014
 favorite+0.014
Neg logit contributions
 yourselves-0.046
ocolate-0.044
ados-0.042
 sidelines-0.041
ting-0.041
Tokeneros
Token ID27498
Feature activation-7.03e-03

Pos logit contributions
 yourselves+0.088
 yourself+0.077
ocolate+0.072
 sidelines+0.072
Does+0.070
Neg logit contributions
 once-0.076
-0.074
eth-0.071
 stuck-0.067
 wish-0.066
Token!
Token ID0
Feature activation-1.44e-02

Pos logit contributions
ocolate+0.014
 yourselves+0.014
ados+0.013
 sidelines+0.013
 yourself+0.012
Neg logit contributions
-0.008
 said-0.006
 stuck-0.006
 wish-0.006
 printed-0.006
Token It
Token ID632
Feature activation-3.58e-03

Pos logit contributions
 yourselves+0.031
ocolate+0.030
ados+0.029
ting+0.028
 sidelines+0.028
Neg logit contributions
-0.014
eth-0.010
 said-0.010
 favorite-0.010
 wish-0.010
Token looked
Token ID3114
Feature activation-4.60e-03

Pos logit contributions
ting+0.006
 yourselves+0.006
ocolate+0.006
 course+0.006
 sidelines+0.005
Neg logit contributions
eth-0.004
-0.004
 favorite-0.003
 wish-0.003
 Hatt-0.003
Token very
Token ID845
Feature activation+2.43e-03

Pos logit contributions
 yourselves+0.009
ocolate+0.009
ados+0.009
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.005
eth-0.004
 favorite-0.003
 wish-0.003
 said-0.003
Token was
Token ID373
Feature activation-3.95e-03

Pos logit contributions
+0.026
eth+0.025
 prett+0.023
aly+0.023
 Hatt+0.023
Neg logit contributions
 yourselves-0.026
ocolate-0.026
 course-0.024
 yourself-0.023
anish-0.022
Token a
Token ID257
Feature activation+1.65e-03

Pos logit contributions
ocolate+0.006
 yourselves+0.006
 course+0.005
 yourself+0.005
ados+0.005
Neg logit contributions
eth-0.006
-0.006
 wish-0.005
 says-0.005
 know-0.005
Token big
Token ID1263
Feature activation+2.35e-03

Pos logit contributions
+0.002
eth+0.002
 said+0.002
 know+0.002
 connected+0.001
Neg logit contributions
 yourselves-0.003
ocolate-0.003
 sidelines-0.003
ting-0.003
ados-0.003
Token rh
Token ID9529
Feature activation-1.24e-02

Pos logit contributions
+0.003
eth+0.002
 favorite+0.002
 said+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ados-0.004
ting-0.004
 sidelines-0.004
Tokenin
Token ID259
Feature activation+2.03e-02

Pos logit contributions
ocolate+0.023
 yourselves+0.023
 sidelines+0.021
ados+0.021
ting+0.021
Neg logit contributions
-0.014
 said-0.012
 favorite-0.012
eth-0.012
 wish-0.012
Tokenoc
Token ID420
Feature activation-5.62e-02

Pos logit contributions
+0.024
eth+0.019
 favorite+0.018
 said+0.018
 wish+0.018
Neg logit contributions
 yourselves-0.039
ocolate-0.039
ados-0.036
ting-0.035
 sidelines-0.035
Tokeneros
Token ID27498
Feature activation-7.94e-03

Pos logit contributions
 yourselves+0.080
 sidelines+0.075
ocolate+0.074
 temptation+0.072
 yourself+0.069
Neg logit contributions
 once-0.074
 favorite-0.073
eth-0.073
-0.072
om-0.070
Token.
Token ID13
Feature activation-2.10e-03

Pos logit contributions
 yourselves+0.016
ocolate+0.015
ados+0.014
 sidelines+0.014
ting+0.014
Neg logit contributions
-0.009
 said-0.007
eth-0.007
 favorite-0.007
 printed-0.007
Token He
Token ID679
Feature activation-1.42e-02

Pos logit contributions
 yourselves+0.004
ocolate+0.004
ados+0.004
ting+0.004
 sidelines+0.004
Neg logit contributions
-0.002
eth-0.002
 favorite-0.002
 said-0.002
 wish-0.002
Token was
Token ID373
Feature activation+1.05e-03

Pos logit contributions
 yourselves+0.026
ocolate+0.026
ting+0.024
 course+0.024
 yourself+0.023
Neg logit contributions
-0.016
eth-0.014
 eng-0.012
 sure-0.012
 wish-0.012
Token very
Token ID845
Feature activation+5.71e-03

Pos logit contributions
+0.002
eth+0.002
 wish+0.001
 printed+0.001
 dot+0.001
Neg logit contributions
 yourselves-0.001
ocolate-0.001
ting-0.001
 yourself-0.001
ados-0.001
Token happy
Token ID3772
Feature activation+1.57e-03

Pos logit contributions
eth+0.004
 dot+0.004
issy+0.004
 Hatt+0.004
 Georg+0.003
Neg logit contributions
ocolate-0.005
 yourselves-0.005
 cream-0.005
 yourself-0.005
gging-0.004
Token and
Token ID290
Feature activation-2.68e-03

Pos logit contributions
+0.002
eth+0.001
 said+0.001
 favorite+0.001
 wish+0.001
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ados-0.003
 sidelines-0.003
ting-0.003
Token brought
Token ID3181
Feature activation+4.72e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ados+0.005
ting+0.005
 sidelines+0.005
Neg logit contributions
-0.003
eth-0.002
 favorite-0.002
 wish-0.002
 said-0.002
Token the
Token ID262
Feature activation-9.47e-03

Pos logit contributions
+0.006
 favorite+0.005
eth+0.005
 said+0.005
 printed+0.005
Neg logit contributions
 yourselves-0.009
ocolate-0.009
ados-0.008
ting-0.008
 sidelines-0.008
Token av
Token ID1196
Feature activation-1.33e-02

Pos logit contributions
 yourselves+0.014
ocolate+0.014
ting+0.013
 sidelines+0.012
 yourself+0.011
Neg logit contributions
-0.016
 favorite-0.014
 special-0.014
 said-0.013
 lost-0.013
Tokenoc
Token ID420
Feature activation-5.60e-02

Pos logit contributions
ocolate+0.024
 yourselves+0.024
ados+0.023
ting+0.022
 sidelines+0.021
Neg logit contributions
-0.016
eth-0.013
 wish-0.013
 favorite-0.013
 printed-0.013
Tokenados
Token ID22484
Feature activation-1.52e-03

Pos logit contributions
 yourselves+0.044
 sidelines+0.040
ocolate+0.039
gency+0.032
 wig+0.030
Neg logit contributions
-0.113
 once-0.107
 favorite-0.103
's-0.100
 says-0.099
Token home
Token ID1363
Feature activation-6.58e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.003
ting+0.003
 yourself+0.002
Neg logit contributions
-0.002
eth-0.002
 wish-0.001
 printed-0.001
 eng-0.001
Token to
Token ID284
Feature activation-2.24e-03

Pos logit contributions
 yourselves+0.015
ocolate+0.015
 sidelines+0.014
ting+0.014
ados+0.014
Neg logit contributions
-0.006
 favorite-0.004
 said-0.004
 once-0.004
eth-0.004
Token share
Token ID2648
Feature activation-6.68e-03

Pos logit contributions
 yourselves+0.005
ting+0.004
ocolate+0.004
ados+0.004
 sidelines+0.004
Neg logit contributions
-0.002
 said-0.002
 know-0.002
 wish-0.002
 fit-0.002
Token with
Token ID351
Feature activation+2.17e-02

Pos logit contributions
 sidelines+0.016
ocolate+0.016
 yourselves+0.016
ting+0.015
gency+0.015
Neg logit contributions
-0.005
 once-0.003
 said-0.003
 together-0.003
 special-0.003
Token was
Token ID373
Feature activation-4.74e-03

Pos logit contributions
eth+0.013
+0.013
 eng+0.012
 favorite+0.012
 wish+0.011
Neg logit contributions
ocolate-0.024
 yourselves-0.024
 course-0.022
 yourself-0.021
ting-0.021
Token a
Token ID257
Feature activation+2.84e-03

Pos logit contributions
 yourselves+0.010
ocolate+0.010
ados+0.010
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.005
eth-0.003
 favorite-0.003
 said-0.003
 wish-0.003
Token big
Token ID1263
Feature activation+4.21e-04

Pos logit contributions
+0.003
eth+0.003
 favorite+0.002
 said+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.006
ocolate-0.005
ados-0.005
ting-0.005
 sidelines-0.005
Token rh
Token ID9529
Feature activation-1.13e-02

Pos logit contributions
+0.001
 wish+0.000
eth+0.000
 said+0.000
 favorite+0.000
Neg logit contributions
 yourselves-0.001
ocolate-0.001
ados-0.001
ting-0.001
 sidelines-0.001
Tokenin
Token ID259
Feature activation+1.71e-02

Pos logit contributions
ocolate+0.022
 yourselves+0.022
 sidelines+0.020
ados+0.020
ting+0.019
Neg logit contributions
-0.012
a-0.010
 said-0.010
 stuck-0.010
 once-0.010
Tokenoc
Token ID420
Feature activation-5.59e-02

Pos logit contributions
+0.020
eth+0.017
 favorite+0.016
 printed+0.015
 said+0.015
Neg logit contributions
 yourselves-0.032
ocolate-0.032
ados-0.029
ting-0.029
 sidelines-0.029
Tokeneros
Token ID27498
Feature activation-9.89e-03

Pos logit contributions
 yourselves+0.079
 sidelines+0.074
ocolate+0.073
 temptation+0.069
 yourself+0.068
Neg logit contributions
-0.079
 once-0.078
 favorite-0.070
eth-0.070
 stuck-0.070
Token.
Token ID13
Feature activation-2.27e-03

Pos logit contributions
 yourselves+0.020
ocolate+0.019
ados+0.018
ting+0.018
 sidelines+0.018
Neg logit contributions
-0.011
eth-0.009
 favorite-0.008
 said-0.008
 wish-0.008
Token He
Token ID679
Feature activation-1.24e-02

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ados+0.004
ting+0.004
 sidelines+0.004
Neg logit contributions
-0.002
 said-0.002
eth-0.002
 favorite-0.002
 wish-0.002
Token was
Token ID373
Feature activation-3.19e-03

Pos logit contributions
 yourselves+0.024
ocolate+0.024
ting+0.022
ados+0.022
 course+0.022
Neg logit contributions
-0.012
eth-0.011
 favorite-0.009
 eng-0.009
 wish-0.009
Token troubled
Token ID17840
Feature activation+3.06e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ting+0.004
 yourself+0.004
 sidelines+0.004
Neg logit contributions
eth-0.005
-0.004
 wish-0.004
s-0.004
erg-0.004
Tokenongo
Token ID25162
Feature activation-2.00e-02

Pos logit contributions
 yourselves+0.008
ocolate+0.008
ados+0.007
ting+0.007
 sidelines+0.007
Neg logit contributions
-0.010
 wish-0.008
 printed-0.008
 connected-0.008
eth-0.008
Token loved
Token ID6151
Feature activation+3.08e-03

Pos logit contributions
 yourselves+0.030
ocolate+0.028
 course+0.028
 yourself+0.027
ados+0.027
Neg logit contributions
-0.028
eth-0.026
 eng-0.024
 wish-0.023
 dot-0.022
Token to
Token ID284
Feature activation+1.49e-03

Pos logit contributions
+0.002
 favorite+0.002
 said+0.002
 special+0.002
 lost+0.001
Neg logit contributions
 yourselves-0.008
ocolate-0.007
 sidelines-0.007
anish-0.007
ting-0.007
Token eat
Token ID4483
Feature activation-5.11e-03

Pos logit contributions
+0.002
 wish+0.001
 know+0.001
 said+0.001
 favorite+0.001
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ting-0.003
ados-0.003
 sidelines-0.003
Token av
Token ID1196
Feature activation-1.19e-02

Pos logit contributions
ocolate+0.007
 yourselves+0.007
 sidelines+0.007
ross+0.006
anish+0.006
Neg logit contributions
-0.009
 said-0.008
 favorite-0.008
 special-0.008
 once-0.007
Tokenoc
Token ID420
Feature activation-5.58e-02

Pos logit contributions
ocolate+0.020
 yourselves+0.020
ados+0.020
ting+0.018
 yourself+0.018
Neg logit contributions
-0.016
eth-0.014
 wish-0.013
 printed-0.013
 favorite-0.013
Tokenados
Token ID22484
Feature activation+1.64e-03

Pos logit contributions
 yourselves+0.045
 sidelines+0.043
ocolate+0.041
Does+0.038
Too+0.034
Neg logit contributions
-0.104
 favorite-0.102
 once-0.101
 says-0.096
 favourite-0.095
Token.
Token ID13
Feature activation-4.44e-03

Pos logit contributions
+0.002
eth+0.002
 wish+0.002
 eng+0.002
 printed+0.002
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ados-0.003
ting-0.003
 yourself-0.003
Token One
Token ID1881
Feature activation+6.75e-03

Pos logit contributions
 yourselves+0.008
ocolate+0.008
ados+0.008
ting+0.008
 sidelines+0.008
Neg logit contributions
-0.005
 printed-0.004
 wish-0.004
 connected-0.004
 eng-0.004
Token day
Token ID1110
Feature activation-5.70e-03

Pos logit contributions
+0.007
eth+0.006
 printed+0.006
 connected+0.005
 eng+0.005
Neg logit contributions
ocolate-0.013
 yourselves-0.012
ados-0.012
ting-0.012
 course-0.012
Token,
Token ID11
Feature activation+3.95e-03

Pos logit contributions
 yourselves+0.013
ocolate+0.013
ados+0.012
 sidelines+0.012
ting+0.011
Neg logit contributions
-0.005
eth-0.004
 said-0.004
 favorite-0.004
 printed-0.003
Token paid
Token ID3432
Feature activation+2.23e-03

Pos logit contributions
 yourselves+0.004
ocolate+0.004
ting+0.004
 course+0.003
 sidelines+0.003
Neg logit contributions
-0.003
eth-0.002
 favorite-0.002
 eng-0.002
 favourite-0.002
Token him
Token ID683
Feature activation-3.37e-03

Pos logit contributions
+0.003
eth+0.002
 favorite+0.002
 wish+0.002
 printed+0.002
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ados-0.004
ting-0.004
 sidelines-0.004
Token for
Token ID329
Feature activation+2.75e-03

Pos logit contributions
 yourselves+0.006
ocolate+0.006
ting+0.006
ados+0.006
 yourself+0.006
Neg logit contributions
-0.004
eth-0.003
 printed-0.003
 eng-0.003
 wish-0.003
Token his
Token ID465
Feature activation-4.11e-03

Pos logit contributions
+0.003
 sure+0.003
 said+0.003
 special+0.003
 know+0.003
Neg logit contributions
ocolate-0.005
ting-0.005
 yourselves-0.005
ados-0.005
anish-0.005
Token av
Token ID1196
Feature activation-9.28e-03

Pos logit contributions
 yourselves+0.007
ocolate+0.007
ados+0.007
 sidelines+0.007
ting+0.006
Neg logit contributions
-0.005
eth-0.004
 said-0.004
 printed-0.004
 sure-0.004
Tokenoc
Token ID420
Feature activation-5.57e-02

Pos logit contributions
 yourselves+0.016
ocolate+0.016
ados+0.015
ting+0.015
 sidelines+0.014
Neg logit contributions
-0.012
eth-0.010
 printed-0.010
 said-0.010
 eng-0.009
Tokenados
Token ID22484
Feature activation+7.17e-03

Pos logit contributions
 yourselves+0.046
 sidelines+0.043
ocolate+0.041
ting+0.038
Stop+0.036
Neg logit contributions
 once-0.105
-0.104
 favorite-0.102
eth-0.097
 says-0.096
Token.
Token ID13
Feature activation+6.17e-03

Pos logit contributions
+0.009
eth+0.007
 wish+0.007
 eng+0.007
 printed+0.007
Neg logit contributions
 yourselves-0.013
ocolate-0.013
ados-0.012
ting-0.012
 yourself-0.012
Token
Token ID198
Feature activation-2.53e-04

Pos logit contributions
+0.006
 wish+0.005
 printed+0.005
 connected+0.005
eth+0.005
Neg logit contributions
 yourselves-0.013
ocolate-0.012
ados-0.011
 yourself-0.011
 sidelines-0.011
Token
Token ID198
Feature activation+1.78e-04

Pos logit contributions
 yourselves+0.000
ocolate+0.000
 yourself+0.000
 sidelines+0.000
 course+0.000
Neg logit contributions
-0.000
 favorite-0.000
 wish-0.000
 printed-0.000
 connected-0.000
TokenTom
Token ID13787
Feature activation+2.44e-04

Pos logit contributions
+0.000
eth+0.000
 said+0.000
 favorite+0.000
 wish+0.000
Neg logit contributions
 yourselves-0.000
ocolate-0.000
ados-0.000
ting-0.000
 sidelines-0.000
Token and
Token ID290
Feature activation+4.85e-03

Pos logit contributions
+0.002
 said+0.002
eth+0.001
 sure+0.001
 favorite+0.001
Neg logit contributions
 yourselves-0.005
ocolate-0.005
 sidelines-0.005
ados-0.005
ting-0.004
Token picked
Token ID6497
Feature activation-5.24e-04

Pos logit contributions
+0.006
 said+0.005
 wish+0.004
 sure+0.004
 printed+0.004
Neg logit contributions
ocolate-0.010
 yourselves-0.010
ados-0.009
ting-0.009
 sidelines-0.009
Token the
Token ID262
Feature activation-3.62e-03

Pos logit contributions
 yourselves+0.001
ocolate+0.001
ados+0.001
 sidelines+0.001
ting+0.001
Neg logit contributions
-0.001
eth-0.000
 favorite-0.000
 said-0.000
 wish-0.000
Token right
Token ID826
Feature activation+6.28e-03

Pos logit contributions
ocolate+0.005
 yourselves+0.005
 sidelines+0.004
ting+0.004
 yourself+0.004
Neg logit contributions
-0.006
 favorite-0.006
 special-0.005
 lost-0.005
 once-0.005
Token av
Token ID1196
Feature activation-1.59e-02

Pos logit contributions
+0.008
 favorite+0.008
 once+0.008
 special+0.007
 wish+0.007
Neg logit contributions
ocolate-0.011
 yourselves-0.011
 sidelines-0.010
ting-0.010
ados-0.009
Tokenoc
Token ID420
Feature activation-5.56e-02

Pos logit contributions
 yourselves+0.032
ocolate+0.031
ados+0.029
ting+0.029
 sidelines+0.029
Neg logit contributions
-0.018
eth-0.014
 favorite-0.013
 said-0.013
 wish-0.013
Tokenados
Token ID22484
Feature activation-3.06e-03

Pos logit contributions
 yourselves+0.044
 sidelines+0.041
ocolate+0.038
 yourself+0.030
wered+0.029
Neg logit contributions
-0.115
 once-0.108
 favorite-0.102
's-0.101
eth-0.100
Token.
Token ID13
Feature activation-1.66e-03

Pos logit contributions
 yourselves+0.006
ocolate+0.006
ados+0.006
ting+0.006
 sidelines+0.005
Neg logit contributions
-0.003
eth-0.003
 wish-0.003
 favorite-0.003
 printed-0.003
Token
Token ID198
Feature activation-3.40e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.003
ting+0.003
 sidelines+0.003
Neg logit contributions
-0.002
eth-0.001
 wish-0.001
 favorite-0.001
 printed-0.001
Token
Token ID198
Feature activation-1.09e-04

Pos logit contributions
ocolate+0.008
 yourselves+0.007
 sidelines+0.007
 course+0.007
 yourself+0.007
Neg logit contributions
-0.003
 favorite-0.002
 wish-0.002
 printed-0.002
 connected-0.002
TokenSadly
Token ID36725
Feature activation-9.78e-03

Pos logit contributions
 yourselves+0.000
ocolate+0.000
ados+0.000
ting+0.000
 sidelines+0.000
Neg logit contributions
-0.000
eth-0.000
 said-0.000
 sure-0.000
 printed-0.000
Token there
Token ID612
Feature activation-1.30e-03

Pos logit contributions
+0.006
eth+0.006
 wish+0.005
 favorite+0.005
 eng+0.005
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.006
ting-0.006
 sidelines-0.006
Token were
Token ID547
Feature activation-6.72e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.001
eth-0.001
 favorite-0.001
 wish-0.001
 printed-0.001
Token no
Token ID645
Feature activation-2.50e-02

Pos logit contributions
ocolate+0.012
 yourselves+0.012
ados+0.011
ting+0.011
 sidelines+0.011
Neg logit contributions
-0.009
 favorite-0.007
 said-0.007
eth-0.007
 printed-0.007
Token other
Token ID584
Feature activation-1.92e-03

Pos logit contributions
ocolate+0.044
 yourselves+0.043
ting+0.039
 sidelines+0.039
ados+0.038
Neg logit contributions
-0.034
 wish-0.030
 favorite-0.029
 said-0.029
 know-0.028
Token av
Token ID1196
Feature activation-4.09e-04

Pos logit contributions
ocolate+0.003
 yourselves+0.003
 sidelines+0.003
ting+0.003
ados+0.003
Neg logit contributions
-0.003
 favorite-0.003
 said-0.003
 wish-0.002
 lost-0.002
Tokenoc
Token ID420
Feature activation-5.56e-02

Pos logit contributions
 yourselves+0.001
ocolate+0.001
ados+0.001
ting+0.001
 yourself+0.001
Neg logit contributions
-0.001
eth-0.000
 favorite-0.000
 wish-0.000
 said-0.000
Tokenados
Token ID22484
Feature activation+7.57e-03

Pos logit contributions
 sidelines+0.045
 yourselves+0.044
Does+0.043
ocolate+0.037
 distance+0.035
Neg logit contributions
 once-0.100
-0.100
 favorite-0.093
 stuck-0.092
 work-0.092
Token.
Token ID13
Feature activation-6.81e-03

Pos logit contributions
+0.009
eth+0.008
 wish+0.007
 favorite+0.007
 eng+0.007
Neg logit contributions
 yourselves-0.014
ocolate-0.014
ting-0.013
ados-0.013
 yourself-0.012
Token
Token ID198
Feature activation-7.20e-03

Pos logit contributions
ocolate+0.012
 yourselves+0.012
ados+0.012
ting+0.011
 sidelines+0.011
Neg logit contributions
-0.008
 wish-0.008
eth-0.007
 wrapped-0.007
 connected-0.007
Token
Token ID198
Feature activation-6.90e-03

Pos logit contributions
ocolate+0.013
 yourselves+0.013
 course+0.012
 yourself+0.011
endor+0.011
Neg logit contributions
 wish-0.007
-0.007
 prett-0.007
 favorite-0.007
 wrapped-0.007
TokenThe
Token ID464
Feature activation-6.31e-03

Pos logit contributions
 yourselves+0.011
ocolate+0.011
ados+0.010
ting+0.010
 sidelines+0.010
Neg logit contributions
-0.010
eth-0.009
 favorite-0.009
 wish-0.009
 said-0.009
Token
Token ID26391
Feature activation-5.62e-03

Pos logit contributions
 yourselves+0.020
ocolate+0.019
ados+0.018
 sidelines+0.017
ting+0.017
Neg logit contributions
-0.020
 said-0.016
eth-0.016
 worked-0.015
 sure-0.015
Token replied
Token ID8712
Feature activation-1.33e-02

Pos logit contributions
ocolate+0.009
 yourselves+0.009
ados+0.009
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.008
eth-0.007
 said-0.006
 sure-0.006
 favorite-0.006
Token the
Token ID262
Feature activation+5.74e-03

Pos logit contributions
 yourselves+0.030
ocolate+0.029
 sidelines+0.028
ados+0.027
ting+0.027
Neg logit contributions
-0.013
eth-0.009
 said-0.009
 sure-0.009
 favorite-0.008
Token rh
Token ID9529
Feature activation-1.93e-02

Pos logit contributions
+0.008
 wish+0.007
 said+0.006
eth+0.006
 favorite+0.006
Neg logit contributions
 yourselves-0.010
ocolate-0.010
ados-0.009
ting-0.009
 sidelines-0.009
Tokenin
Token ID259
Feature activation+1.11e-02

Pos logit contributions
ocolate+0.040
 yourselves+0.040
 sidelines+0.039
ados+0.037
ting+0.036
Neg logit contributions
-0.019
a-0.016
 said-0.014
eth-0.014
 favorite-0.014
Tokenoc
Token ID420
Feature activation-5.56e-02

Pos logit contributions
+0.012
eth+0.009
 favorite+0.009
 said+0.009
 wish+0.009
Neg logit contributions
 yourselves-0.023
ocolate-0.022
ados-0.021
 sidelines-0.021
ting-0.021
Tokeneros
Token ID27498
Feature activation-1.92e-02

Pos logit contributions
 sidelines+0.086
 yourselves+0.081
ocolate+0.077
 temptation+0.073
wered+0.071
Neg logit contributions
 once-0.068
eth-0.063
-0.063
 favorite-0.060
om-0.059
Token sadly
Token ID21098
Feature activation+2.24e-03

Pos logit contributions
 yourselves+0.041
ocolate+0.041
 sidelines+0.038
ados+0.038
ting+0.037
Neg logit contributions
-0.020
 said-0.014
eth-0.014
 favorite-0.014
 sure-0.013
Token.
Token ID13
Feature activation-8.93e-03

Pos logit contributions
+0.002
eth+0.002
 favorite+0.002
 wish+0.002
 printed+0.002
Neg logit contributions
 yourselves-0.005
ocolate-0.004
ados-0.004
ting-0.004
 sidelines-0.004
Token It
Token ID632
Feature activation+8.16e-03

Pos logit contributions
 yourselves+0.019
ocolate+0.019
ados+0.018
ting+0.018
 sidelines+0.018
Neg logit contributions
-0.009
eth-0.006
 said-0.006
 favorite-0.006
 wish-0.006
Token had
Token ID550
Feature activation+1.77e-03

Pos logit contributions
eth+0.010
 ago+0.008
 eng+0.008
+0.008
 Hatt+0.008
Neg logit contributions
 course-0.012
ting-0.012
 yourselves-0.011
ocolate-0.011
 cream-0.011
Token
Token ID198
Feature activation+1.54e-03

Pos logit contributions
ocolate+0.000
 yourselves+0.000
 yourself+0.000
ados+0.000
 course+0.000
Neg logit contributions
-0.000
 connected-0.000
 favorite-0.000
 wish-0.000
 printed-0.000
TokenSo
Token ID2396
Feature activation-1.45e-03

Pos logit contributions
+0.003
eth+0.002
 said+0.002
 wish+0.002
 printed+0.002
Neg logit contributions
 yourselves-0.002
ocolate-0.002
ados-0.002
ting-0.002
 sidelines-0.002
Token the
Token ID262
Feature activation+7.78e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.003
ting+0.002
 yourself+0.002
Neg logit contributions
-0.002
eth-0.001
 printed-0.001
 favorite-0.001
 wish-0.001
Token mom
Token ID1995
Feature activation-1.84e-02

Pos logit contributions
+0.009
eth+0.008
 printed+0.007
 said+0.007
 favorite+0.007
Neg logit contributions
 yourselves-0.015
ocolate-0.015
ados-0.014
 sidelines-0.013
ting-0.013
Token and
Token ID290
Feature activation-9.40e-03

Pos logit contributions
ting+0.028
ocolate+0.027
 cream+0.027
 course+0.026
ados+0.025
Neg logit contributions
 favorite-0.022
 wish-0.021
 eng-0.021
 favourite-0.021
-0.019
Token the
Token ID262
Feature activation+2.38e-02

Pos logit contributions
 yourselves+0.016
ocolate+0.016
ados+0.015
ting+0.015
 sidelines+0.014
Neg logit contributions
-0.013
eth-0.011
 favorite-0.010
 wish-0.010
 said-0.010
Token little
Token ID1310
Feature activation+4.75e-03

Pos logit contributions
+0.028
eth+0.023
 said+0.021
 connected+0.021
 printed+0.021
Neg logit contributions
 yourselves-0.045
ocolate-0.044
ados-0.041
 sidelines-0.041
ting-0.041
Token girl
Token ID2576
Feature activation+3.07e-03

Pos logit contributions
+0.007
eth+0.006
 eng+0.005
 know+0.005
erg+0.005
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.007
ting-0.007
 sidelines-0.007
Token went
Token ID1816
Feature activation+2.96e-03

Pos logit contributions
eth+0.004
 favorite+0.004
+0.004
 eng+0.003
erg+0.003
Neg logit contributions
ocolate-0.005
ting-0.004
 course-0.004
ados-0.004
 cream-0.004
Token off
Token ID572
Feature activation-2.17e-03

Pos logit contributions
+0.004
 wish+0.003
eth+0.003
 favorite+0.003
 eng+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ting-0.005
 sidelines-0.005
ados-0.005
Token to
Token ID284
Feature activation-7.03e-04

Pos logit contributions
ocolate+0.003
ting+0.003
 sidelines+0.002
 yourselves+0.002
ados+0.002
Neg logit contributions
-0.003
 wish-0.003
erg-0.003
 favorite-0.003
eth-0.003
TokenAre
Token ID8491
Feature activation-6.92e-03

Pos logit contributions
 yourselves+0.007
ocolate+0.007
ados+0.007
 sidelines+0.007
ting+0.006
Neg logit contributions
-0.009
 printed-0.008
 said-0.008
 favorite-0.008
 connected-0.007
Token we
Token ID356
Feature activation+7.38e-03

Pos logit contributions
ocolate+0.010
 yourselves+0.010
ados+0.009
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.011
 favorite-0.009
 said-0.009
eth-0.009
 wish-0.009
Token there
Token ID612
Feature activation+1.91e-02

Pos logit contributions
+0.008
eth+0.006
 wish+0.006
 favorite+0.006
 said+0.006
Neg logit contributions
 yourselves-0.015
ocolate-0.015
ados-0.014
ting-0.013
 sidelines-0.013
Token yet
Token ID1865
Feature activation+1.94e-02

Pos logit contributions
+0.025
eth+0.023
 eng+0.022
 wish+0.021
 connected+0.021
Neg logit contributions
ocolate-0.032
 yourselves-0.032
 yourself-0.028
ting-0.028
ados-0.028
Token?"
Token ID1701
Feature activation+2.06e-02

Pos logit contributions
+0.030
eth+0.029
 eng+0.027
 connected+0.026
 wish+0.026
Neg logit contributions
 yourselves-0.029
ocolate-0.027
 yourself-0.025
ados-0.023
 cream-0.023
Token Sara
Token ID24799
Feature activation+2.35e-02

Pos logit contributions
+0.037
eth+0.035
 printed+0.034
 fixed+0.034
 worked+0.033
Neg logit contributions
 yourselves-0.026
ocolate-0.021
 yourself-0.019
 temptation-0.017
Please-0.017
Token asks
Token ID7893
Feature activation+6.91e-03

Pos logit contributions
eth+0.031
 favourite+0.029
 printed+0.029
 favorite+0.028
 Hatt+0.028
Neg logit contributions
 yourselves-0.037
 yourself-0.034
ting-0.033
gging-0.033
ados-0.031
Token.
Token ID13
Feature activation+9.00e-03

Pos logit contributions
+0.008
eth+0.006
 said+0.006
 favorite+0.006
 wish+0.005
Neg logit contributions
 yourselves-0.014
ocolate-0.014
ados-0.013
 sidelines-0.013
ting-0.013
Token
Token ID198
Feature activation+9.68e-03

Pos logit contributions
+0.007
eth+0.005
 wish+0.005
 printed+0.004
 favorite+0.004
Neg logit contributions
 yourselves-0.021
ocolate-0.021
ados-0.019
ting-0.019
 sidelines-0.019
Token
Token ID198
Feature activation+9.69e-03

Pos logit contributions
+0.009
 printed+0.009
 wish+0.008
 favorite+0.008
 worked+0.008
Neg logit contributions
 yourselves-0.019
ocolate-0.018
 yourself-0.017
 sidelines-0.016
ados-0.016
Token"
Token ID1
Feature activation-5.61e-03

Pos logit contributions
+0.015
 favorite+0.013
eth+0.013
 wish+0.013
 printed+0.013
Neg logit contributions
 yourselves-0.015
ocolate-0.014
ados-0.013
ting-0.013
 sidelines-0.013
Token a
Token ID257
Feature activation+1.37e-02

Pos logit contributions
+0.010
eth+0.009
 wish+0.008
 connected+0.008
 printed+0.008
Neg logit contributions
 yourselves-0.016
ocolate-0.015
ados-0.014
 yourself-0.014
ting-0.014
Token swim
Token ID9422
Feature activation+2.54e-02

Pos logit contributions
+0.019
eth+0.015
 said+0.015
 connected+0.015
 printed+0.014
Neg logit contributions
ocolate-0.023
 yourselves-0.023
ados-0.021
 sidelines-0.021
ting-0.020
Token?"
Token ID1701
Feature activation+2.30e-02

Pos logit contributions
+0.036
eth+0.029
 wish+0.028
 said+0.028
 printed+0.028
Neg logit contributions
 yourselves-0.043
ocolate-0.043
ados-0.039
ting-0.038
 sidelines-0.038
Token Her
Token ID2332
Feature activation-1.63e-03

Pos logit contributions
+0.025
eth+0.019
 said+0.019
 favorite+0.018
 wish+0.018
Neg logit contributions
 yourselves-0.047
ocolate-0.047
ados-0.043
ting-0.043
 sidelines-0.043
Token M
Token ID337
Feature activation+1.54e-02

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.003
ting+0.003
 sidelines+0.003
Neg logit contributions
-0.002
eth-0.002
 favorite-0.002
 said-0.002
 wish-0.002
Tokenummy
Token ID13513
Feature activation+3.08e-02

Pos logit contributions
+0.023
eth+0.019
 favorite+0.019
 said+0.018
 wish+0.018
Neg logit contributions
 yourselves-0.025
ocolate-0.024
 sidelines-0.023
ting-0.023
ados-0.022
Token smiled
Token ID13541
Feature activation+1.45e-02

Pos logit contributions
+0.033
eth+0.030
 eng+0.029
 wish+0.027
erg+0.027
Neg logit contributions
ocolate-0.053
 course-0.050
ting-0.049
 cream-0.048
ados-0.048
Token and
Token ID290
Feature activation+1.67e-03

Pos logit contributions
+0.014
eth+0.010
 wish+0.009
 printed+0.009
 favorite+0.009
Neg logit contributions
 yourselves-0.031
ocolate-0.031
ados-0.029
ting-0.029
 sidelines-0.028
Token said
Token ID531
Feature activation-8.16e-03

Pos logit contributions
+0.001
eth+0.001
 wish+0.001
 favorite+0.001
 eng+0.001
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ting-0.004
ados-0.004
 sidelines-0.004
Token "
Token ID366
Feature activation-1.42e-02

Pos logit contributions
 yourselves+0.021
 sidelines+0.020
ocolate+0.018
ting+0.018
ados+0.018
Neg logit contributions
-0.007
 said-0.005
 favorite-0.005
 sure-0.005
 excited-0.005
TokenNo
Token ID2949
Feature activation-8.58e-03

Pos logit contributions
 yourselves+0.020
ocolate+0.019
ados+0.018
 sidelines+0.016
ting+0.016
Neg logit contributions
-0.026
 said-0.021
eth-0.021
 favorite-0.021
 printed-0.021
TokenI
Token ID40
Feature activation+8.06e-03

Pos logit contributions
 yourselves+0.014
ocolate+0.013
ting+0.011
ados+0.011
 yourself+0.011
Neg logit contributions
-0.022
eth-0.020
 favorite-0.020
 said-0.020
 wish-0.020
Token will
Token ID481
Feature activation-8.98e-03

Pos logit contributions
eth+0.010
 connected+0.010
 prett+0.009
 Hatt+0.009
 favourite+0.009
Neg logit contributions
ocolate-0.010
 yourselves-0.010
 temptation-0.010
 liar-0.010
 course-0.010
Token miss
Token ID2051
Feature activation+2.19e-03

Pos logit contributions
 yourselves+0.015
ocolate+0.015
ados+0.014
ting+0.014
 sidelines+0.013
Neg logit contributions
-0.013
 wish-0.011
eth-0.011
 said-0.011
 sure-0.011
Token Snow
Token ID7967
Feature activation-8.32e-04

Pos logit contributions
+0.003
 sure+0.003
 favorite+0.003
 once+0.003
 special+0.003
Neg logit contributions
ocolate-0.004
 yourselves-0.003
ados-0.003
 sidelines-0.003
ting-0.003
Tokeny
Token ID88
Feature activation+1.87e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
ados+0.002
 sidelines+0.002
ting+0.002
Neg logit contributions
-0.001
eth-0.001
 favorite-0.001
 said-0.001
 printed-0.001
Token,"
Token ID553
Feature activation+2.37e-02

Pos logit contributions
+0.002
eth+0.002
 favorite+0.002
 printed+0.002
 said+0.002
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ados-0.003
ting-0.003
 sidelines-0.003
Token the
Token ID262
Feature activation+9.30e-03

Pos logit contributions
+0.025
eth+0.023
 favorite+0.021
 printed+0.020
 connected+0.020
Neg logit contributions
 yourselves-0.046
ocolate-0.042
ados-0.041
ting-0.041
 yourself-0.041
Token little
Token ID1310
Feature activation+2.42e-03

Pos logit contributions
+0.012
eth+0.010
 favorite+0.009
 printed+0.009
 connected+0.009
Neg logit contributions
 yourselves-0.017
ocolate-0.016
 sidelines-0.015
ados-0.015
ting-0.015
Token girl
Token ID2576
Feature activation+1.19e-02

Pos logit contributions
+0.003
eth+0.003
 eng+0.003
 printed+0.003
 connected+0.003
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ting-0.004
ados-0.004
 sidelines-0.004
Token mentioned
Token ID4750
Feature activation+6.72e-03

Pos logit contributions
eth+0.009
 dot+0.009
+0.009
erg+0.008
 ago+0.008
Neg logit contributions
 yourselves-0.021
ting-0.021
ocolate-0.020
ados-0.020
 course-0.020
Token to
Token ID284
Feature activation+1.24e-02

Pos logit contributions
+0.006
 said+0.005
 once+0.004
 sure+0.004
 favorite+0.004
Neg logit contributions
 yourselves-0.015
ocolate-0.015
 sidelines-0.014
ados-0.014
ting-0.013
Token
Token ID198
Feature activation+5.51e-03

Pos logit contributions
+0.004
 wish+0.003
 favorite+0.003
eth+0.003
 printed+0.003
Neg logit contributions
ocolate-0.005
 yourselves-0.005
 course-0.005
 sidelines-0.005
 yourself-0.004
TokenTim
Token ID14967
Feature activation+3.82e-03

Pos logit contributions
+0.009
eth+0.007
 favorite+0.007
 wish+0.007
 said+0.007
Neg logit contributions
 yourselves-0.008
ocolate-0.008
ados-0.008
ting-0.007
 sidelines-0.007
Tokenmy
Token ID1820
Feature activation-4.05e-03

Pos logit contributions
+0.005
eth+0.005
 eng+0.004
 favorite+0.004
 wish+0.004
Neg logit contributions
 yourselves-0.006
ting-0.006
ocolate-0.006
 course-0.006
 yourself-0.005
Token learned
Token ID4499
Feature activation+1.45e-02

Pos logit contributions
ting+0.007
 yourselves+0.006
ocolate+0.006
 course+0.006
ados+0.006
Neg logit contributions
eth-0.004
erg-0.004
 favorite-0.004
-0.004
 eng-0.004
Token that
Token ID326
Feature activation-2.48e-03

Pos logit contributions
+0.015
 favorite+0.011
 said+0.011
 sure+0.011
 special+0.010
Neg logit contributions
ocolate-0.032
 yourselves-0.031
 sidelines-0.030
ados-0.029
ting-0.029
Token it
Token ID340
Feature activation+2.31e-02

Pos logit contributions
 yourselves+0.004
ocolate+0.004
ados+0.004
ting+0.004
 yourself+0.004
Neg logit contributions
-0.003
eth-0.003
 favorite-0.003
 printed-0.003
 said-0.003
Token's
Token ID338
Feature activation+2.36e-02

Pos logit contributions
eth+0.024
 eng+0.020
 Georg+0.019
 wish+0.019
isc+0.018
Neg logit contributions
 yourselves-0.043
ting-0.042
 yourself-0.041
ados-0.041
 course-0.039
Token important
Token ID1593
Feature activation+1.84e-02

Pos logit contributions
+0.022
eth+0.019
 said+0.016
 wish+0.016
 favorite+0.016
Neg logit contributions
 yourselves-0.051
ocolate-0.049
ados-0.046
 yourself-0.046
ting-0.046
Token to
Token ID284
Feature activation+6.61e-03

Pos logit contributions
+0.017
 said+0.012
 favorite+0.012
 sure+0.011
 printed+0.011
Neg logit contributions
ocolate-0.042
 yourselves-0.042
 sidelines-0.040
ados-0.039
ting-0.039
Token be
Token ID307
Feature activation+1.27e-02

Pos logit contributions
+0.008
 wish+0.006
 said+0.006
 know+0.006
 sure+0.006
Neg logit contributions
 yourselves-0.012
ocolate-0.012
ados-0.012
ting-0.012
 sidelines-0.011
Token kind
Token ID1611
Feature activation+8.87e-03

Pos logit contributions
+0.017
 favorite+0.013
 said+0.013
 sure+0.013
 connected+0.013
Neg logit contributions
ocolate-0.023
 yourselves-0.023
ados-0.021
 sidelines-0.021
ting-0.021
TokenThis
Token ID1212
Feature activation+2.62e-03

Pos logit contributions
 yourselves+0.014
ocolate+0.014
ados+0.012
ting+0.012
 sidelines+0.012
Neg logit contributions
-0.022
eth-0.019
 favorite-0.019
 said-0.019
 wish-0.019
Token is
Token ID318
Feature activation+4.85e-03

Pos logit contributions
+0.003
eth+0.003
 connected+0.003
 eng+0.003
 favorite+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.004
 yourself-0.004
ados-0.004
ting-0.004
Token my
Token ID616
Feature activation-2.72e-03

Pos logit contributions
+0.007
eth+0.006
 wish+0.006
 favorite+0.006
 eng+0.006
Neg logit contributions
 yourselves-0.008
ocolate-0.007
ados-0.007
ting-0.007
 yourself-0.007
Token dress
Token ID6576
Feature activation+8.72e-06

Pos logit contributions
 yourselves+0.004
ocolate+0.004
 yourself+0.004
 sidelines+0.004
ados+0.004
Neg logit contributions
-0.004
eth-0.003
 connected-0.003
 said-0.003
 eng-0.003
Token and
Token ID290
Feature activation+9.42e-03

Feature not active on previous token; no predictions were affected.
Token I
Token ID314
Feature activation+1.16e-02

Pos logit contributions
+0.014
eth+0.012
 favorite+0.011
 printed+0.011
 eng+0.011
Neg logit contributions
 yourselves-0.015
ocolate-0.014
ados-0.013
 yourself-0.013
ting-0.013
Token want
Token ID765
Feature activation+2.90e-03

Pos logit contributions
+0.012
 eng+0.012
 Georg+0.012
eth+0.011
 Sad+0.011
Neg logit contributions
 yourselves-0.017
ocolate-0.016
 yourself-0.016
 course-0.015
Too-0.015
Token to
Token ID284
Feature activation-8.96e-04

Pos logit contributions
+0.004
eth+0.003
 favorite+0.003
 said+0.003
 wish+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.005
ting-0.005
 sidelines-0.005
Token keep
Token ID1394
Feature activation-4.28e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
 sidelines+0.001
 yourself+0.001
ados+0.001
Neg logit contributions
-0.001
eth-0.001
 favorite-0.001
 said-0.001
 connected-0.001
Token it
Token ID340
Feature activation-1.65e-03

Pos logit contributions
 yourselves+0.007
ocolate+0.007
ados+0.007
ting+0.006
 sidelines+0.006
Neg logit contributions
-0.006
eth-0.005
 favorite-0.005
 said-0.005
 wish-0.005
Token."
Token ID526
Feature activation+1.34e-02

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.003
ting+0.003
 yourself+0.003
Neg logit contributions
-0.002
eth-0.002
 said-0.002
 wish-0.002
 printed-0.002
Token.
Token ID13
Feature activation-8.36e-04

Pos logit contributions
 yourselves+0.034
ocolate+0.033
 sidelines+0.032
ados+0.032
ting+0.032
Neg logit contributions
-0.023
 wish-0.021
eth-0.020
 know-0.019
erg-0.019
Token They
Token ID1119
Feature activation-8.69e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
ados+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.001
eth-0.001
 favorite-0.001
 said-0.001
 printed-0.001
Token built
Token ID3170
Feature activation-3.13e-03

Pos logit contributions
ocolate+0.016
 yourselves+0.016
ting+0.015
 course+0.015
 sidelines+0.015
Neg logit contributions
-0.010
eth-0.009
 favorite-0.008
 wish-0.007
 eng-0.007
Token a
Token ID257
Feature activation-1.28e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ados+0.005
ting+0.005
 yourself+0.005
Neg logit contributions
-0.004
eth-0.004
 said-0.003
 wish-0.003
 printed-0.003
Token little
Token ID1310
Feature activation+4.98e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
 sidelines+0.002
ados+0.002
 yourself+0.002
Neg logit contributions
-0.001
eth-0.001
 said-0.001
 connected-0.001
 printed-0.001
Token house
Token ID2156
Feature activation-4.69e-03

Pos logit contributions
+0.007
eth+0.007
 sure+0.006
erg+0.006
 said+0.006
Neg logit contributions
 yourselves-0.007
 sidelines-0.007
ocolate-0.007
ados-0.007
 course-0.007
Token and
Token ID290
Feature activation+2.95e-05

Pos logit contributions
 yourselves+0.009
ocolate+0.009
ados+0.009
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.005
eth-0.004
 wish-0.004
 favorite-0.004
 said-0.004
Token played
Token ID2826
Feature activation+1.36e-03

Pos logit contributions
+0.000
eth+0.000
 wish+0.000
 favorite+0.000
 eng+0.000
Neg logit contributions
 yourselves-0.000
ocolate-0.000
ados-0.000
ting-0.000
 sidelines-0.000
Token hide
Token ID7808
Feature activation-1.13e-02

Pos logit contributions
+0.001
eth+0.001
 favorite+0.001
 said+0.001
 wish+0.001
Neg logit contributions
 yourselves-0.003
ocolate-0.003
ados-0.003
ting-0.003
 sidelines-0.003
Token and
Token ID290
Feature activation+6.60e-03

Pos logit contributions
ocolate+0.027
 yourselves+0.025
 sidelines+0.025
ados+0.024
ting+0.023
Neg logit contributions
-0.009
 favorite-0.007
eth-0.006
 sure-0.006
 connected-0.006
Token seek
Token ID5380
Feature activation+2.13e-03

Pos logit contributions
+0.010
eth+0.009
 favorite+0.009
 wish+0.008
 said+0.008
Neg logit contributions
 yourselves-0.010
ocolate-0.010
ados-0.009
ting-0.009
 sidelines-0.009
Token support
Token ID1104
Feature activation-6.12e-03

Pos logit contributions
 yourselves+0.024
ocolate+0.024
ados+0.022
ting+0.022
 sidelines+0.021
Neg logit contributions
-0.020
eth-0.017
 wish-0.016
 said-0.016
 favorite-0.016
Token you
Token ID345
Feature activation-2.67e-03

Pos logit contributions
ocolate+0.010
 yourselves+0.010
 sidelines+0.009
ados+0.009
ting+0.009
Neg logit contributions
-0.009
 sure-0.007
 said-0.007
 once-0.007
 know-0.007
Token."
Token ID526
Feature activation+1.41e-02

Pos logit contributions
 yourselves+0.004
ocolate+0.004
ados+0.004
ting+0.004
 sidelines+0.004
Neg logit contributions
-0.004
eth-0.003
 said-0.003
 favorite-0.003
 wish-0.003
Token
Token ID198
Feature activation+3.70e-03

Pos logit contributions
+0.011
eth+0.008
 wish+0.008
 favorite+0.008
 printed+0.008
Neg logit contributions
ocolate-0.032
 yourselves-0.032
ados-0.030
ting-0.030
 sidelines-0.029
Token
Token ID198
Feature activation+3.09e-03

Pos logit contributions
+0.003
 wish+0.002
 favorite+0.002
 printed+0.002
 connected+0.002
Neg logit contributions
ocolate-0.008
 yourselves-0.008
ting-0.007
 sidelines-0.007
 course-0.007
TokenThe
Token ID464
Feature activation+4.55e-04

Pos logit contributions
+0.005
eth+0.004
 said+0.004
 sure+0.003
 connected+0.003
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.005
ting-0.005
 sidelines-0.005
Token frog
Token ID21264
Feature activation+1.07e-02

Pos logit contributions
+0.001
eth+0.001
 favorite+0.001
 said+0.001
 wish+0.001
Neg logit contributions
 yourselves-0.001
ocolate-0.001
ados-0.001
ting-0.001
 sidelines-0.001
Token thanked
Token ID26280
Feature activation+1.72e-03

Pos logit contributions
+0.013
eth+0.013
 eng+0.012
 wish+0.011
 Hatt+0.011
Neg logit contributions
 course-0.016
ocolate-0.016
ting-0.015
ados-0.015
 cream-0.014
Token Lucy
Token ID22162
Feature activation+1.89e-03

Pos logit contributions
+0.002
 said+0.001
 once+0.001
 sure+0.001
eth+0.001
Neg logit contributions
 yourselves-0.004
ocolate-0.004
 sidelines-0.003
ados-0.003
ting-0.003
Token,
Token ID11
Feature activation-4.21e-03

Pos logit contributions
+0.002
eth+0.001
 wish+0.001
 favorite+0.001
 printed+0.001
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ados-0.004
ting-0.004
 sidelines-0.004
Token and
Token ID290
Feature activation-5.92e-03

Pos logit contributions
 yourselves+0.010
ocolate+0.010
ados+0.009
 sidelines+0.009
 yourself+0.009
Neg logit contributions
-0.004
 said-0.003
 sure-0.003
eth-0.002
 favorite-0.002
Token
Token ID198
Feature activation+4.36e-03

Pos logit contributions
+0.007
 wish+0.005
 printed+0.005
eth+0.004
 connected+0.004
Neg logit contributions
 yourselves-0.022
ocolate-0.021
ados-0.020
 sidelines-0.019
ting-0.019
Token
Token ID198
Feature activation+3.92e-03

Pos logit contributions
+0.004
 wish+0.004
 printed+0.004
 favorite+0.004
 stuck+0.004
Neg logit contributions
 yourselves-0.009
ocolate-0.008
 yourself-0.008
endor-0.007
 course-0.007
Token"
Token ID1
Feature activation-8.78e-03

Pos logit contributions
+0.006
eth+0.005
 said+0.005
 favorite+0.005
 wish+0.005
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.006
ting-0.006
 sidelines-0.006
TokenNo
Token ID2949
Feature activation-9.04e-03

Pos logit contributions
ocolate+0.012
ados+0.012
 yourselves+0.012
ting+0.011
 sidelines+0.010
Neg logit contributions
-0.016
 favorite-0.013
 said-0.013
 fit-0.013
 sure-0.013
Token,
Token ID11
Feature activation+1.68e-02

Pos logit contributions
 yourselves+0.023
ocolate+0.023
ting+0.023
 sidelines+0.023
ados+0.022
Neg logit contributions
-0.005
 said-0.004
 says-0.003
eth-0.003
 wish-0.003
Token Tom
Token ID4186
Feature activation+3.52e-03

Pos logit contributions
+0.019
 said+0.015
 sure+0.014
 says+0.013
 favorite+0.013
Neg logit contributions
ocolate-0.035
 yourselves-0.034
ting-0.032
 sidelines-0.032
ados-0.032
Token.
Token ID13
Feature activation-6.48e-04

Pos logit contributions
+0.004
eth+0.004
 favorite+0.004
 printed+0.003
 eng+0.003
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.006
 yourself-0.006
ting-0.006
Token It
Token ID632
Feature activation+4.62e-03

Pos logit contributions
 yourselves+0.001
ocolate+0.001
ados+0.001
ting+0.001
 sidelines+0.001
Neg logit contributions
-0.001
eth-0.001
 said-0.001
 wish-0.001
 favorite-0.001
Token looks
Token ID3073
Feature activation+6.38e-03

Pos logit contributions
eth+0.005
 eng+0.005
 printed+0.004
 favourite+0.004
+0.004
Neg logit contributions
 yourselves-0.008
 yourself-0.007
 course-0.007
ting-0.007
ados-0.007
Token scary
Token ID14343
Feature activation+1.67e-04

Pos logit contributions
eth+0.008
+0.008
 printed+0.007
erg+0.007
 Hatt+0.007
Neg logit contributions
 yourselves-0.011
 yourself-0.010
 temptation-0.009
ocolate-0.009
ting-0.009
Token.
Token ID13
Feature activation+3.21e-04

Pos logit contributions
+0.000
eth+0.000
 printed+0.000
 eng+0.000
 wish+0.000
Neg logit contributions
 yourselves-0.000
ting-0.000
 yourself-0.000
ocolate-0.000
ados-0.000
Token well
Token ID880
Feature activation+3.75e-03

Pos logit contributions
ocolate+0.002
 yourselves+0.002
ados+0.002
 sidelines+0.002
ting+0.002
Neg logit contributions
-0.001
 sure-0.001
 favorite-0.001
 once-0.001
 connected-0.001
Token.
Token ID13
Feature activation-5.12e-03

Pos logit contributions
+0.004
eth+0.003
 favorite+0.003
 wish+0.003
 said+0.003
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.007
ting-0.007
 sidelines-0.007
Token But
Token ID887
Feature activation+9.96e-03

Pos logit contributions
 yourselves+0.009
ocolate+0.009
ting+0.008
ados+0.008
 yourself+0.008
Neg logit contributions
-0.007
 wish-0.006
 printed-0.006
 connected-0.006
 said-0.005
Token then
Token ID788
Feature activation+2.84e-03

Pos logit contributions
+0.013
eth+0.013
 printed+0.012
 wish+0.012
 know+0.012
Neg logit contributions
 yourselves-0.015
ting-0.015
ocolate-0.014
 yourself-0.014
ados-0.013
Token she
Token ID673
Feature activation+4.05e-03

Pos logit contributions
+0.004
eth+0.003
 favorite+0.003
 printed+0.003
 connected+0.003
Neg logit contributions
 yourselves-0.005
ting-0.004
ocolate-0.004
 yourself-0.004
ados-0.004
Token started
Token ID2067
Feature activation+5.82e-03

Pos logit contributions
+0.005
eth+0.005
 favorite+0.004
 Hatt+0.004
 wish+0.004
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ting-0.006
 sidelines-0.006
 yourself-0.006
Token to
Token ID284
Feature activation+1.06e-03

Pos logit contributions
+0.006
 said+0.004
 favorite+0.004
eth+0.004
 sure+0.004
Neg logit contributions
ocolate-0.013
 yourselves-0.013
ados-0.012
 sidelines-0.011
ting-0.011
Token feel
Token ID1254
Feature activation+6.01e-03

Pos logit contributions
+0.001
eth+0.001
 wish+0.001
 favorite+0.001
 said+0.001
Neg logit contributions
 yourselves-0.002
ocolate-0.002
ados-0.002
ting-0.002
 sidelines-0.002
Token tired
Token ID10032
Feature activation+1.17e-02

Pos logit contributions
+0.006
eth+0.005
 favorite+0.004
 said+0.004
 wish+0.004
Neg logit contributions
 yourselves-0.013
ocolate-0.012
ados-0.012
ting-0.011
 sidelines-0.011
Token.
Token ID13
Feature activation-4.34e-03

Pos logit contributions
+0.013
eth+0.010
 favorite+0.009
 wish+0.009
 said+0.009
Neg logit contributions
 yourselves-0.024
ocolate-0.023
ados-0.022
ting-0.022
 sidelines-0.021
Token She
Token ID1375
Feature activation-1.17e-02

Pos logit contributions
 yourselves+0.008
ocolate+0.008
ados+0.007
ting+0.007
 yourself+0.007
Neg logit contributions
-0.005
 wish-0.004
 printed-0.004
 connected-0.004
 favorite-0.004
Token It
Token ID632
Feature activation+8.29e-03

Pos logit contributions
+0.003
eth+0.002
 favorite+0.002
 said+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.006
ting-0.006
 sidelines-0.006
Token was
Token ID373
Feature activation+3.31e-03

Pos logit contributions
eth+0.011
+0.010
 eng+0.009
erg+0.009
 wish+0.009
Neg logit contributions
ocolate-0.013
 yourselves-0.013
ting-0.012
ados-0.012
 course-0.012
Token the
Token ID262
Feature activation-7.46e-03

Pos logit contributions
+0.004
eth+0.004
 wish+0.003
 printed+0.003
 favorite+0.003
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.005
ting-0.005
 sidelines-0.005
Token best
Token ID1266
Feature activation-2.39e-02

Pos logit contributions
ocolate+0.017
 yourselves+0.017
ados+0.016
ting+0.016
 sidelines+0.015
Neg logit contributions
-0.007
 favorite-0.006
 wish-0.005
 special-0.005
 said-0.005
Token strawberry
Token ID41236
Feature activation-1.25e-02

Pos logit contributions
 yourselves+0.043
ocolate+0.040
ting+0.039
ados+0.038
 sidelines+0.038
Neg logit contributions
-0.034
 favorite-0.030
 wish-0.029
 once-0.029
 said-0.028
Token she
Token ID673
Feature activation-2.06e-02

Pos logit contributions
 yourselves+0.026
ocolate+0.025
 sidelines+0.024
ting+0.023
ados+0.023
Neg logit contributions
-0.014
 said-0.011
 favorite-0.011
 once-0.010
 sure-0.010
Token had
Token ID550
Feature activation+2.24e-03

Pos logit contributions
 yourselves+0.041
ocolate+0.041
ados+0.038
ting+0.038
 sidelines+0.037
Neg logit contributions
-0.022
eth-0.018
 wish-0.016
 favorite-0.016
 said-0.016
Token ever
Token ID1683
Feature activation-4.46e-03

Pos logit contributions
+0.003
 said+0.002
 favorite+0.002
 printed+0.002
 once+0.002
Neg logit contributions
 yourselves-0.004
ocolate-0.004
ados-0.004
 sidelines-0.004
ting-0.004
Token tasted
Token ID29187
Feature activation-5.04e-03

Pos logit contributions
 yourselves+0.008
ocolate+0.008
ados+0.007
ting+0.007
 sidelines+0.007
Neg logit contributions
-0.006
eth-0.005
 said-0.005
 favorite-0.005
 wish-0.005
Token.
Token ID13
Feature activation+3.49e-03

Pos logit contributions
ocolate+0.011
 yourselves+0.011
 sidelines+0.011
ados+0.010
ting+0.010
Neg logit contributions
-0.005
 once-0.004
 favorite-0.004
 sure-0.004
 special-0.003
Token<|endoftext|>
Token ID50256
Feature activation-9.85e-04

Pos logit contributions
+0.003
 wish+0.003
eth+0.002
 connected+0.002
 printed+0.002
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.007
ting-0.007
 sidelines-0.007
Token He
Token ID679
Feature activation-1.24e-02

Pos logit contributions
ocolate+0.014
 yourselves+0.014
ados+0.013
ting+0.013
 sidelines+0.013
Neg logit contributions
-0.008
 wish-0.007
 printed-0.007
eth-0.007
 favorite-0.007
Token happily
Token ID18177
Feature activation+6.44e-03

Pos logit contributions
ocolate+0.023
 yourselves+0.023
ting+0.021
ados+0.021
 sidelines+0.021
Neg logit contributions
-0.013
eth-0.012
 favorite-0.010
 eng-0.010
 wish-0.010
Token ch
Token ID442
Feature activation+7.00e-03

Pos logit contributions
+0.008
eth+0.007
 favorite+0.007
 wish+0.007
 printed+0.006
Neg logit contributions
ocolate-0.011
 yourselves-0.011
ting-0.010
ados-0.010
 sidelines-0.010
Tokenomp
Token ID3361
Feature activation-5.96e-03

Pos logit contributions
+0.012
eth+0.010
 said+0.010
 wish+0.010
 sure+0.010
Neg logit contributions
 yourselves-0.010
ocolate-0.010
ados-0.008
ting-0.008
 sidelines-0.008
Tokened
Token ID276
Feature activation+1.13e-03

Pos logit contributions
 yourselves+0.011
ocolate+0.010
ados+0.009
 sidelines+0.009
ting+0.009
Neg logit contributions
-0.008
eth-0.007
 said-0.007
 sure-0.006
 favorite-0.006
Token on
Token ID319
Feature activation-7.93e-03

Pos logit contributions
+0.001
eth+0.001
 favorite+0.001
 wish+0.001
 said+0.001
Neg logit contributions
 yourselves-0.002
ocolate-0.002
ados-0.002
ting-0.002
 sidelines-0.002
Token the
Token ID262
Feature activation-5.25e-04

Pos logit contributions
ocolate+0.018
 yourselves+0.018
 sidelines+0.016
ados+0.016
anish+0.015
Neg logit contributions
-0.008
 once-0.006
 favorite-0.006
 special-0.006
 sure-0.006
Token bone
Token ID9970
Feature activation-2.07e-02

Pos logit contributions
ocolate+0.001
 yourselves+0.001
ados+0.001
 yourself+0.001
ting+0.001
Neg logit contributions
-0.001
 favorite-0.001
 said-0.001
 special-0.001
eth-0.001
Token,
Token ID11
Feature activation-5.50e-04

Pos logit contributions
 yourselves+0.043
ocolate+0.043
ados+0.040
ting+0.040
 sidelines+0.039
Neg logit contributions
-0.021
eth-0.016
 favorite-0.015
 said-0.015
 wish-0.015
Token content
Token ID2695
Feature activation+1.07e-02

Pos logit contributions
 yourselves+0.001
ocolate+0.001
ados+0.001
 yourself+0.001
ting+0.001
Neg logit contributions
-0.001
 said-0.001
 sure-0.001
 wrapped-0.001
 connected-0.001
Token to
Token ID284
Feature activation-6.69e-03

Pos logit contributions
+0.009
 once+0.008
 inside+0.008
 said+0.007
 favorite+0.007
Neg logit contributions
ocolate-0.024
 yourselves-0.023
 sidelines-0.022
ados-0.022
 course-0.021
Token up
Token ID510
Feature activation-1.11e-02

Pos logit contributions
 yourselves+0.013
ocolate+0.013
ting+0.012
ados+0.012
 sidelines+0.012
Neg logit contributions
-0.008
eth-0.007
 favorite-0.006
 wish-0.006
 printed-0.006
Token and
Token ID290
Feature activation-2.91e-03

Pos logit contributions
 yourselves+0.024
ocolate+0.024
ados+0.022
ting+0.022
 sidelines+0.022
Neg logit contributions
-0.011
eth-0.008
 favorite-0.007
 said-0.007
 wish-0.007
Token started
Token ID2067
Feature activation+3.87e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ados+0.005
ting+0.005
 sidelines+0.005
Neg logit contributions
-0.004
eth-0.003
 favorite-0.003
 wish-0.003
 said-0.003
Token jumping
Token ID14284
Feature activation-2.48e-03

Pos logit contributions
+0.004
 said+0.003
 sure+0.003
 favorite+0.003
eth+0.003
Neg logit contributions
ocolate-0.008
 yourselves-0.008
ados-0.007
 sidelines-0.007
ting-0.007
Token around
Token ID1088
Feature activation-7.87e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ados+0.005
 sidelines+0.005
ting+0.005
Neg logit contributions
-0.002
eth-0.002
 favorite-0.002
 said-0.002
 sure-0.002
Token the
Token ID262
Feature activation-7.47e-03

Pos logit contributions
 yourselves+0.018
ocolate+0.018
ados+0.016
 sidelines+0.016
 yourself+0.016
Neg logit contributions
-0.008
 once-0.005
 said-0.005
 stuck-0.005
 sure-0.005
Token lawn
Token ID21133
Feature activation-5.69e-03

Pos logit contributions
ocolate+0.009
 yourselves+0.009
ados+0.008
ting+0.007
 sidelines+0.007
Neg logit contributions
-0.015
 favorite-0.012
 said-0.012
eth-0.012
 connected-0.012
Token too
Token ID1165
Feature activation-6.53e-03

Pos logit contributions
 yourselves+0.012
ocolate+0.012
ados+0.011
 sidelines+0.011
ting+0.011
Neg logit contributions
-0.006
 said-0.004
 favorite-0.004
eth-0.004
 once-0.004
Token.
Token ID13
Feature activation-8.07e-03

Pos logit contributions
 yourselves+0.013
ocolate+0.013
ados+0.012
ting+0.012
 sidelines+0.012
Neg logit contributions
-0.007
eth-0.006
 said-0.005
 favorite-0.005
 wish-0.005
Token He
Token ID679
Feature activation-1.38e-02

Pos logit contributions
 yourselves+0.016
ocolate+0.016
ados+0.014
ting+0.014
 sidelines+0.014
Neg logit contributions
-0.009
eth-0.007
 wish-0.007
 favorite-0.007
 printed-0.007
Token chased
Token ID26172
Feature activation-1.72e-02

Pos logit contributions
 yourselves+0.021
ocolate+0.020
ting+0.020
 sidelines+0.019
 course+0.019
Neg logit contributions
-0.018
eth-0.017
 wish-0.015
 eng-0.015
 favorite-0.015
Token seller
Token ID18583
Feature activation+1.50e-02

Pos logit contributions
+0.019
 said+0.016
 favorite+0.015
 wish+0.015
 printed+0.015
Neg logit contributions
 yourselves-0.027
ocolate-0.027
ting-0.024
ados-0.024
 sidelines-0.024
Token said
Token ID531
Feature activation+3.51e-03

Pos logit contributions
eth+0.012
+0.012
 favorite+0.011
erg+0.011
ends+0.010
Neg logit contributions
ocolate-0.028
ados-0.027
 course-0.027
 cream-0.027
ting-0.026
Token.
Token ID13
Feature activation+9.22e-03

Pos logit contributions
+0.004
eth+0.003
 said+0.003
 favorite+0.003
 printed+0.003
Neg logit contributions
 yourselves-0.007
ocolate-0.007
ados-0.007
ting-0.007
 sidelines-0.007
Token "
Token ID366
Feature activation-1.33e-02

Pos logit contributions
+0.012
 wish+0.011
 favorite+0.010
eth+0.010
 printed+0.010
Neg logit contributions
 yourselves-0.016
ocolate-0.016
ados-0.014
ting-0.014
 sidelines-0.014
TokenI
Token ID40
Feature activation+2.16e-03

Pos logit contributions
 yourselves+0.019
ocolate+0.018
ados+0.018
 sidelines+0.017
ting+0.016
Neg logit contributions
-0.023
 printed-0.018
 sure-0.018
 favorite-0.018
eth-0.018
Token will
Token ID481
Feature activation-1.74e-02

Pos logit contributions
+0.002
eth+0.002
 favorite+0.002
 eng+0.002
 dot+0.002
Neg logit contributions
ocolate-0.004
 course-0.003
 yourselves-0.003
 Enjoy-0.003
 cream-0.003
Token give
Token ID1577
Feature activation-6.10e-03

Pos logit contributions
 yourselves+0.034
ocolate+0.033
ados+0.033
ting+0.032
 sidelines+0.031
Neg logit contributions
-0.022
 know-0.017
 wish-0.017
 said-0.016
 sure-0.016
Token you
Token ID345
Feature activation-8.85e-04

Pos logit contributions
ocolate+0.010
 yourselves+0.010
 sidelines+0.009
ados+0.009
ting+0.009
Neg logit contributions
-0.009
 said-0.007
 sure-0.007
 favorite-0.007
 once-0.007
Token five
Token ID1936
Feature activation-1.15e-02

Pos logit contributions
ocolate+0.002
 yourselves+0.002
 sidelines+0.002
ting+0.002
ados+0.002
Neg logit contributions
-0.001
 favorite-0.001
 special-0.001
 sure-0.001
 said-0.001
Token coins
Token ID10796
Feature activation-1.06e-02

Pos logit contributions
 yourselves+0.019
ocolate+0.019
 sidelines+0.017
ting+0.017
ados+0.017
Neg logit contributions
-0.017
 favorite-0.014
 said-0.014
 wish-0.014
eth-0.014
Token for
Token ID329
Feature activation+5.15e-03

Pos logit contributions
 yourselves+0.022
ocolate+0.021
ados+0.020
ting+0.020
 sidelines+0.019
Neg logit contributions
-0.011
eth-0.009
 favorite-0.008
 wish-0.008
 said-0.008
Token playing
Token ID2712
Feature activation-2.24e-03

Pos logit contributions
 yourselves+0.016
ocolate+0.016
ting+0.015
ados+0.014
 sidelines+0.014
Neg logit contributions
-0.016
eth-0.016
 favorite-0.015
 wish-0.015
 printed-0.014
Token on
Token ID319
Feature activation-2.21e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.005
ados+0.004
 sidelines+0.004
ting+0.004
Neg logit contributions
-0.003
 said-0.002
 favorite-0.002
eth-0.002
 printed-0.002
Token some
Token ID617
Feature activation+9.19e-03

Pos logit contributions
ocolate+0.004
 yourselves+0.004
ados+0.004
ting+0.004
 sidelines+0.004
Neg logit contributions
-0.002
 said-0.002
 favorite-0.002
eth-0.002
 printed-0.002
Token equipment
Token ID5112
Feature activation-3.56e-03

Pos logit contributions
+0.012
 favorite+0.009
 said+0.009
 special+0.009
 wish+0.009
Neg logit contributions
ocolate-0.017
 yourselves-0.017
ados-0.016
ting-0.015
 sidelines-0.015
Token.
Token ID13
Feature activation-5.76e-03

Pos logit contributions
 yourselves+0.007
ocolate+0.007
ting+0.006
ados+0.006
 sidelines+0.006
Neg logit contributions
-0.004
eth-0.004
 wish-0.003
 sure-0.003
 favorite-0.003
Token She
Token ID1375
Feature activation-1.48e-02

Pos logit contributions
 yourselves+0.009
ting+0.009
ados+0.009
 sidelines+0.009
ocolate+0.009
Neg logit contributions
 wish-0.008
-0.007
 connected-0.007
 eng-0.007
erg-0.006
Token asked
Token ID1965
Feature activation+9.45e-03

Pos logit contributions
 yourselves+0.026
ocolate+0.025
ting+0.024
 sidelines+0.023
ados+0.023
Neg logit contributions
-0.017
eth-0.015
 wish-0.014
 eng-0.013
 favorite-0.013
Token her
Token ID607
Feature activation+8.96e-03

Pos logit contributions
+0.011
 said+0.009
 favorite+0.008
 sure+0.008
eth+0.008
Neg logit contributions
 yourselves-0.019
ocolate-0.018
 sidelines-0.017
ados-0.017
ting-0.016
Token mom
Token ID1995
Feature activation-1.30e-03

Pos logit contributions
+0.013
eth+0.012
 eng+0.011
 connected+0.011
 dot+0.011
Neg logit contributions
ocolate-0.013
 yourselves-0.012
ados-0.012
ting-0.011
 cream-0.011
Token if
Token ID611
Feature activation-6.81e-03

Pos logit contributions
 yourselves+0.003
ocolate+0.003
ados+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.001
eth-0.001
 favorite-0.001
 wish-0.001
 said-0.001
Token she
Token ID673
Feature activation-4.22e-03

Pos logit contributions
 yourselves+0.015
ocolate+0.015
ados+0.014
ting+0.013
 sidelines+0.013
Neg logit contributions
-0.007
eth-0.005
 said-0.005
 favorite-0.005
 wish-0.004
Token\n
Token ID198
Feature activation-5.22e-03

Pos logit contributions
ocolate+0.010
 yourselves+0.010
 yourself+0.009
ting+0.009
 course+0.009
Neg logit contributions
-0.004
 wish-0.003
 favorite-0.003
 connected-0.003
 printed-0.003
TokenThe
Token ID464
Feature activation+9.84e-03

Pos logit contributions
ocolate+0.010
 yourselves+0.010
ados+0.010
ting+0.009
 sidelines+0.009
Neg logit contributions
-0.007
 said-0.005
eth-0.005
 sure-0.005
 connected-0.005
Token bad
Token ID2089
Feature activation+1.07e-02

Pos logit contributions
+0.012
eth+0.011
 printed+0.009
 said+0.009
 connected+0.009
Neg logit contributions
 yourselves-0.018
ocolate-0.018
ados-0.017
ting-0.016
 sidelines-0.016
Token rh
Token ID9529
Feature activation-1.52e-02

Pos logit contributions
+0.013
eth+0.011
 sure+0.010
 eng+0.010
 favorite+0.010
Neg logit contributions
 yourselves-0.020
ocolate-0.020
ados-0.019
ting-0.019
 sidelines-0.018
Tokenin
Token ID259
Feature activation+1.55e-02

Pos logit contributions
 yourselves+0.028
ocolate+0.028
 sidelines+0.027
ados+0.026
ting+0.025
Neg logit contributions
-0.018
 said-0.014
 once-0.014
 printed-0.014
 favorite-0.014
Tokenoc
Token ID420
Feature activation-4.38e-02

Pos logit contributions
+0.018
eth+0.014
 favorite+0.013
 said+0.013
 wish+0.013
Neg logit contributions
 yourselves-0.031
ocolate-0.030
ados-0.028
ting-0.028
 sidelines-0.028
Tokeneros
Token ID27498
Feature activation-1.20e-02

Pos logit contributions
 sidelines+0.065
 yourselves+0.062
ocolate+0.061
 temptation+0.056
 yourself+0.053
Neg logit contributions
 once-0.055
-0.052
 favorite-0.049
eth-0.048
 stuck-0.047
Token was
Token ID373
Feature activation-9.64e-04

Pos logit contributions
 yourselves+0.024
ocolate+0.024
ting+0.022
ados+0.022
 sidelines+0.021
Neg logit contributions
-0.013
eth-0.010
 favorite-0.009
 wish-0.009
 eng-0.009
Token so
Token ID523
Feature activation+5.80e-03

Pos logit contributions
 yourselves+0.002
ocolate+0.002
 yourself+0.002
ting+0.002
 sidelines+0.002
Neg logit contributions
-0.001
eth-0.001
 wish-0.001
 printed-0.001
 favorite-0.001
Token happy
Token ID3772
Feature activation+5.56e-03

Pos logit contributions
+0.007
eth+0.006
 Georg+0.005
 Hatt+0.005
 favorite+0.005
Neg logit contributions
 yourselves-0.010
ocolate-0.010
 yourself-0.009
ting-0.009
ados-0.009
Token.
Token ID13
Feature activation-9.04e-03

Pos logit contributions
+0.005
 said+0.004
 once+0.003
 sure+0.003
 favorite+0.003
Neg logit contributions
ocolate-0.013
 yourselves-0.013
 sidelines-0.012
ados-0.012
ting-0.011
Token thirty
Token ID12277
Feature activation-5.38e-03

Pos logit contributions
+0.011
eth+0.009
 favorite+0.009
 wish+0.009
 printed+0.008
Neg logit contributions
 yourselves-0.021
ocolate-0.020
ados-0.019
ting-0.019
 sidelines-0.019
Token-
Token ID12
Feature activation-3.70e-02

Pos logit contributions
 yourselves+0.009
ocolate+0.009
ados+0.008
ting+0.008
 sidelines+0.008
Neg logit contributions
-0.007
eth-0.006
 favorite-0.006
 printed-0.006
 wish-0.006
Tokentwo
Token ID11545
Feature activation-6.49e-03

Pos logit contributions
 yourselves+0.047
ocolate+0.047
ados+0.041
ting+0.041
 sidelines+0.040
Neg logit contributions
-0.067
eth-0.059
 favorite-0.057
 wish-0.057
 said-0.057
Token,
Token ID11
Feature activation+7.59e-03

Pos logit contributions
 yourselves+0.012
ocolate+0.012
ados+0.011
ting+0.011
 yourself+0.010
Neg logit contributions
-0.008
eth-0.007
 printed-0.006
 favorite-0.006
 wish-0.006
Token thirty
Token ID12277
Feature activation-9.03e-03

Pos logit contributions
+0.008
eth+0.007
 favorite+0.007
 wish+0.007
 printed+0.006
Neg logit contributions
 yourselves-0.015
ocolate-0.015
ados-0.014
ting-0.013
 sidelines-0.013
Token-
Token ID12
Feature activation-3.80e-02

Pos logit contributions
 yourselves+0.016
ocolate+0.016
ados+0.015
ting+0.014
 sidelines+0.014
Neg logit contributions
-0.012
eth-0.010
 favorite-0.009
 said-0.009
 printed-0.009
Tokenthree
Token ID15542
Feature activation-1.76e-02

Pos logit contributions
 yourselves+0.050
ocolate+0.050
ados+0.044
ting+0.043
 sidelines+0.043
Neg logit contributions
-0.068
eth-0.058
 favorite-0.057
 said-0.057
 wish-0.056
Token,
Token ID11
Feature activation+2.62e-03

Pos logit contributions
 yourselves+0.037
ocolate+0.036
ados+0.034
ting+0.034
 sidelines+0.033
Neg logit contributions
-0.019
eth-0.014
 said-0.013
 favorite-0.013
 wish-0.013
Token thirty
Token ID12277
Feature activation-9.83e-03

Pos logit contributions
+0.003
eth+0.003
 favorite+0.002
 printed+0.002
 wish+0.002
Neg logit contributions
 yourselves-0.005
ocolate-0.005
ados-0.005
ting-0.005
 sidelines-0.005
Token-
Token ID12
Feature activation-3.82e-02

Pos logit contributions
 yourselves+0.018
ocolate+0.017
ados+0.016
ting+0.016
 sidelines+0.016
Neg logit contributions
-0.013
eth-0.010
 favorite-0.010
 said-0.010
 wish-0.010
Tokenfour
Token ID14337
Feature activation-2.08e-02

Pos logit contributions
 yourselves+0.053
ocolate+0.053
ados+0.047
ting+0.046
 sidelines+0.046
Neg logit contributions
-0.066
eth-0.056
 said-0.054
 favorite-0.054
 wish-0.054
Token said
Token ID531
Feature activation-1.15e-02

Pos logit contributions
 yourselves+0.001
ocolate+0.001
ting+0.001
 course+0.001
ados+0.001
Neg logit contributions
-0.001
eth-0.001
 favorite-0.001
 eng-0.001
erg-0.001
Token,
Token ID11
Feature activation-8.57e-05

Pos logit contributions
 yourselves+0.021
ocolate+0.020
 sidelines+0.019
ting+0.018
ados+0.018
Neg logit contributions
-0.017
 sure-0.014
 said-0.014
 favorite-0.013
 once-0.013
Token "
Token ID366
Feature activation-7.97e-03

Pos logit contributions
 yourselves+0.000
ocolate+0.000
ados+0.000
 sidelines+0.000
ting+0.000
Neg logit contributions
-0.000
eth-0.000
 said-0.000
 sure-0.000
 favorite-0.000
TokenWe
Token ID1135
Feature activation+1.69e-04

Pos logit contributions
 yourselves+0.011
ocolate+0.011
ados+0.011
 sidelines+0.010
ting+0.009
Neg logit contributions
-0.014
 printed-0.011
 said-0.011
eth-0.011
 once-0.011
Tokenee
Token ID1453
Feature activation-2.39e-02

Pos logit contributions
eth+0.000
+0.000
 favorite+0.000
 eng+0.000
erg+0.000
Neg logit contributions
ocolate-0.000
 yourselves-0.000
 course-0.000
 yourself-0.000
ados-0.000
Tokene
Token ID68
Feature activation-3.58e-02

Pos logit contributions
 yourselves+0.044
ocolate+0.043
ados+0.040
ting+0.039
 sidelines+0.039
Neg logit contributions
-0.032
eth-0.025
 said-0.025
 sure-0.024
 favorite-0.024
Token!"
Token ID2474
Feature activation+1.12e-02

Pos logit contributions
 yourselves+0.062
ocolate+0.062
ados+0.056
 sidelines+0.056
ting+0.056
Neg logit contributions
-0.051
eth-0.041
 said-0.040
 favorite-0.040
 printed-0.039
Token and
Token ID290
Feature activation-1.19e-02

Pos logit contributions
+0.008
eth+0.005
 favorite+0.004
 said+0.004
 wish+0.004
Neg logit contributions
 yourselves-0.027
ocolate-0.027
ados-0.026
ting-0.025
 sidelines-0.025
Token off
Token ID572
Feature activation+2.47e-03

Pos logit contributions
 yourselves+0.022
ocolate+0.022
ados+0.020
ting+0.020
 sidelines+0.020
Neg logit contributions
-0.015
eth-0.012
 favorite-0.012
 wish-0.011
 said-0.011
Token she
Token ID673
Feature activation+1.75e-03

Pos logit contributions
+0.003
 wish+0.003
erg+0.003
 know+0.003
 prized+0.003
Neg logit contributions
ting-0.004
 yourselves-0.004
ados-0.004
ocolate-0.003
 sidelines-0.003
Token went
Token ID1816
Feature activation-1.74e-05

Pos logit contributions
+0.002
eth+0.002
 favorite+0.002
erg+0.002
 Today+0.002
Neg logit contributions
ocolate-0.003
 yourselves-0.002
ting-0.002
 course-0.002
 cream-0.002
Token suddenly
Token ID6451
Feature activation+1.78e-03

Pos logit contributions
+0.003
eth+0.002
 favorite+0.002
 wish+0.002
 connected+0.002
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ting-0.005
 sidelines-0.005
 yourself-0.005
Token saw
Token ID2497
Feature activation-5.57e-04

Pos logit contributions
+0.002
eth+0.002
 favorite+0.002
 said+0.001
 wish+0.001
Neg logit contributions
 yourselves-0.004
ocolate-0.003
ados-0.003
ting-0.003
 sidelines-0.003
Token a
Token ID257
Feature activation-2.19e-03

Pos logit contributions
ocolate+0.001
 yourselves+0.001
ados+0.001
 sidelines+0.001
ting+0.001
Neg logit contributions
-0.001
 said-0.000
 favorite-0.000
 printed-0.000
 sure-0.000
Token rh
Token ID9529
Feature activation-9.49e-03

Pos logit contributions
 yourselves+0.005
ocolate+0.004
ados+0.004
ting+0.004
 yourself+0.004
Neg logit contributions
-0.002
 wish-0.002
eth-0.002
 said-0.002
 favorite-0.002
Tokenin
Token ID259
Feature activation+2.62e-02

Pos logit contributions
 yourselves+0.018
ocolate+0.017
ados+0.016
 sidelines+0.016
 yourself+0.015
Neg logit contributions
-0.012
eth-0.010
 said-0.009
 printed-0.009
 wish-0.009
Tokenoc
Token ID420
Feature activation-4.89e-02

Pos logit contributions
+0.031
eth+0.026
 favorite+0.024
 said+0.024
 sure+0.024
Neg logit contributions
ocolate-0.049
 yourselves-0.049
ados-0.045
ting-0.045
 sidelines-0.044
Tokeneros
Token ID27498
Feature activation-3.02e-03

Pos logit contributions
 yourselves+0.065
ocolate+0.057
 sidelines+0.055
 yourself+0.054
ting+0.049
Neg logit contributions
-0.082
 once-0.076
eth-0.075
 wish-0.072
 favorite-0.071
Token in
Token ID287
Feature activation+5.28e-03

Pos logit contributions
 yourselves+0.006
ocolate+0.006
ados+0.005
 sidelines+0.005
ting+0.005
Neg logit contributions
-0.004
eth-0.003
 said-0.003
 favorite-0.003
 wish-0.003
Token front
Token ID2166
Feature activation-5.18e-03

Pos logit contributions
+0.006
 said+0.004
 wrapped+0.004
 once+0.004
 wish+0.004
Neg logit contributions
ocolate-0.011
 yourselves-0.011
ados-0.011
ting-0.010
 sidelines-0.010
Token of
Token ID286
Feature activation+3.30e-03

Pos logit contributions
 yourselves+0.012
ocolate+0.012
ados+0.011
ting+0.011
 sidelines+0.011
Neg logit contributions
-0.004
 said-0.003
eth-0.003
 wish-0.003
 favorite-0.002
Token him
Token ID683
Feature activation+3.59e-03

Pos logit contributions
+0.005
 said+0.004
 once+0.004
 printed+0.004
 sure+0.003
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.005
 sidelines-0.005
ting-0.005
Token get
Token ID651
Feature activation-1.35e-02

Pos logit contributions
 yourselves+0.014
ocolate+0.014
ados+0.013
ting+0.013
 sidelines+0.013
Neg logit contributions
-0.009
eth-0.007
 favorite-0.007
 said-0.007
 wish-0.007
Token her
Token ID607
Feature activation+3.66e-03

Pos logit contributions
 yourselves+0.023
ting+0.021
ocolate+0.021
 yourself+0.021
ados+0.020
Neg logit contributions
-0.018
eth-0.017
 printed-0.014
 wish-0.014
 eng-0.014
Token mom
Token ID1995
Feature activation-9.62e-03

Pos logit contributions
+0.005
eth+0.004
 eng+0.004
 connected+0.004
 worked+0.004
Neg logit contributions
 yourselves-0.006
ocolate-0.006
ados-0.005
 cream-0.005
ting-0.005
Tokenmy
Token ID1820
Feature activation+2.62e-03

Pos logit contributions
ting+0.017
ocolate+0.016
 yourselves+0.016
ados+0.015
 yourself+0.015
Neg logit contributions
 wish-0.010
 eng-0.010
eth-0.010
-0.009
 printed-0.009
Token's
Token ID338
Feature activation+8.36e-03

Pos logit contributions
 wish+0.003
 eng+0.003
 printed+0.003
eth+0.003
+0.003
Neg logit contributions
ting-0.005
ocolate-0.004
 yourselves-0.004
ados-0.004
 yourself-0.004
Token attention
Token ID3241
Feature activation-3.71e-02

Pos logit contributions
+0.014
eth+0.012
 eng+0.011
 printed+0.011
 connected+0.011
Neg logit contributions
 yourselves-0.012
ocolate-0.011
ados-0.010
 sidelines-0.010
ting-0.010
Token,
Token ID11
Feature activation+1.22e-02

Pos logit contributions
ocolate+0.075
 yourselves+0.074
ting+0.070
ados+0.069
 sidelines+0.068
Neg logit contributions
-0.038
eth-0.030
 wish-0.029
 favorite-0.029
 printed-0.028
Token but
Token ID475
Feature activation+9.33e-03

Pos logit contributions
+0.015
eth+0.012
 said+0.012
 favorite+0.012
 sure+0.011
Neg logit contributions
 yourselves-0.023
ocolate-0.023
ados-0.021
ting-0.021
 sidelines-0.021
Token she
Token ID673
Feature activation+5.00e-03

Pos logit contributions
+0.012
eth+0.010
 wish+0.010
 favorite+0.010
 printed+0.010
Neg logit contributions
 yourselves-0.016
ocolate-0.016
ados-0.015
ting-0.015
 sidelines-0.014
Token was
Token ID373
Feature activation-3.60e-03

Pos logit contributions
+0.006
eth+0.005
 eng+0.005
 favorite+0.005
 wish+0.005
Neg logit contributions
ocolate-0.009
 yourselves-0.008
ting-0.008
ados-0.008
 course-0.008
Token busy
Token ID8179
Feature activation+1.95e-03

Pos logit contributions
 yourselves+0.006
ocolate+0.006
ting+0.006
 sidelines+0.006
ados+0.006
Neg logit contributions
-0.005
eth-0.004
 wish-0.004
 favorite-0.004
 printed-0.004