TOP ACTIVATIONS
MAX = 6.133

got a big stain on her dress . She started crying
He has an itch on his nose . He rub s
had a big bump on her arm and it really hurt
stub bed her toe on a rock . It hurt a
him . Teddy fell on her shoulder and it hurt .
had a big bump on his head and a cut on
tri pped and fell on her way to school . She
had a big bump on his head . <|endoftext|> Once upon
tower and it fell on her foot . It hurt a
. The books fell on her , and she cried out
saw a big stain on her trumpet . It was brown
Max had a scratch on his foot . She felt bad
had a big itch on his back . He tried to
straight ened the blankets on her bed . She was so
Spot had an itch on his leg . He tried to
his toys and jump on his bed . One day ,
slippery . The soap made Lily 's eyes sting .
got a big patch on her knee . She was very
, brown seeds began to sp urt out through the soft
her toys and jump on her bed . One day ,

MIN ACTIVATIONS
MIN = -5.914

his friend , a wise old owl , and asked for
But then , a friendly tow truck came along and offered
his friend , a wise old owl . Max said ,
his friend , a wise old owl named O ll ie
Just then a wise old fox stepped out from behind
his friend , a wise old owl , " How can
Just then , a wise old owl swoop ed down from
his friend , a wise old owl , sitting on a
her friend , a wise old owl named H oot y
's friend , a wise old owl named O ll ie
her friend , a wise old owl named H oot ie
Just then , a wise old owl flew down and perched
Just then , a wise old owl flew by . "
friend Ralph , a friendly old turtle . Ralph said to
boy called Sam and an old lady called Grace . Sam
his friend , a wise old owl , for help .
Just then , a kind old lady appeared .
Just then , a wise old sk unk came along .
his tutor , a wise old owl , to teach him
his friend , a wise old owl ,

INTERVAL 3.121 - 6.133
CONTAINS 0.571%

day , it started raining very hard outside of her window
day , Jack disappeared ! Jill looked everywhere , but she
her tea . It made her jump in surprise and her
They hung their socks on the chim ney and then added
to play , but he doesn 't know how to be

INTERVAL 0.110 - 3.121
CONTAINS 44.605%

The train starts to move . But the train
boy a gentle smile . The rude boy was
. <|endoftext|> Once upon a time , there was a huge
to touch it . But Lily was stubborn and wanted to
to be more careful next time and not to drink anything

INTERVAL -2.902 - 0.110
CONTAINS 54.308%

stubborn little mouse who lived with her family in the woods
she was walking very slow in the woods . Suddenly ,
water . The brown dog was very happy in
to Ben . " Maybe the dog will go away ."
looked up at the tree and waved goodbye as she started

INTERVAL -5.914 - -2.902
CONTAINS 0.515%

voice . It was a friendly old man . " Hello
shoes to his friends . <|endoftext|> Once upon a time there
rare taps on dresses . <|endoftext|> Once there were two friends
. Then she saw a tiny mouse who was running around
. <|endoftext|> John was a young boy . He was only
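
The four INTERVAL ranges above are consistent with splitting the span from MIN to MAX into four equal-width bins. The dump does not say how the bins were chosen, so equal-width binning is an assumption here, and `interval_edges` is a hypothetical helper name rather than part of any tool. A minimal sketch of the arithmetic:

```python
def interval_edges(lo: float, hi: float, n_bins: int = 4) -> list[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed in the dump (already rounded to 3 decimals).
edges = interval_edges(-5.914, 6.133)

# Print the bins from highest to lowest, matching the order used above.
for low, high in reversed(list(zip(edges[:-1], edges[1:]))):
    print(f"INTERVAL {low:.3f} - {high:.3f}")
```

Because the printed MIN and MAX are themselves rounded, the computed edges can differ from the listed ones in the last digit.
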
Token got
Token ID1392
Feature activation+6.85e-01

Pos logit contributions
cases+0.074
 hardly+0.069
abs+0.067
tered+0.067
late+0.063
Neg logit contributions
smart-0.052
 course-0.046
itty-0.045
 living-0.045
in-0.043
Token a
Token ID257
Feature activation+2.13e+00

Pos logit contributions
 hardly+0.017
cases+0.017
tered+0.016
abs+0.015
 barely+0.014
Neg logit contributions
smart-0.016
itty-0.014
 course-0.014
in-0.014
etchup-0.014
Token big
Token ID1263
Feature activation-7.33e-01

Pos logit contributions
 hardly+0.052
cases+0.049
tered+0.048
 barely+0.043
 nearly+0.043
Neg logit contributions
smart-0.053
itty-0.048
 course-0.047
eros-0.045
in-0.045
Token stain
Token ID18746
Feature activation+1.34e+00

Pos logit contributions
smart+0.020
itty+0.018
ang+0.017
eros+0.017
 Friendship+0.017
Neg logit contributions
 hardly-0.015
tered-0.014
cases-0.014
 nearly-0.013
 floor-0.013
Token on
Token ID319
Feature activation+1.00e+00

Pos logit contributions
 hardly+0.044
cases+0.043
tered+0.040
 barely+0.038
abs+0.038
Neg logit contributions
smart-0.022
itty-0.018
in-0.018
 course-0.018
 living-0.017
Token her
Token ID607
Feature activation+5.86e+00

Pos logit contributions
cases+0.034
 hardly+0.032
abs+0.030
tered+0.030
rated+0.029
Neg logit contributions
smart-0.015
 course-0.015
itty-0.013
 living-0.012
in-0.012
Token dress
Token ID6576
Feature activation+1.12e+00

Pos logit contributions
cases+0.160
 hardly+0.159
tered+0.152
abs+0.142
late+0.140
Neg logit contributions
 living-0.109
smart-0.106
itty-0.101
eros-0.099
in-0.098
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
 hardly+0.037
cases+0.035
tered+0.034
abs+0.033
 barely+0.032
Neg logit contributions
smart-0.017
itty-0.014
 course-0.014
 living-0.013
in-0.013
Token She
Token ID1375
Feature activation+2.68e+00

Pos logit contributions
cases+0.001
 hardly+0.001
tered+0.001
rated+0.001
abs+0.001
Neg logit contributions
smart-0.000
itty-0.000
 course-0.000
 living-0.000
 welcome-0.000
Token started
Token ID2067
Feature activation+1.37e+00

Pos logit contributions
cases+0.085
abs+0.075
 hardly+0.072
late+0.070
ways+0.070
Neg logit contributions
 band-0.045
in-0.045
 living-0.044
smart-0.042
 course-0.041
Token crying
Token ID13774
Feature activation+1.61e+00

Pos logit contributions
cases+0.045
 hardly+0.043
tered+0.042
abs+0.040
rated+0.040
Neg logit contributions
smart-0.023
 living-0.020
itty-0.019
etchup-0.018
 course-0.018
Token He
Token ID679
Feature activation+1.43e+00

Pos logit contributions
 hardly+0.043
cases+0.042
tered+0.041
 barely+0.038
 nearly+0.037
Neg logit contributions
smart-0.024
itty-0.020
 course-0.019
 welcome-0.019
eros-0.019
Token has
Token ID468
Feature activation+7.77e-01

Pos logit contributions
 hardly+0.048
tered+0.047
cases+0.046
appointed+0.044
icing+0.043
Neg logit contributions
etchup-0.019
 living-0.019
smart-0.018
in-0.017
 course-0.017
Token an
Token ID281
Feature activation+3.35e+00

Pos logit contributions
 hardly+0.020
cases+0.019
tered+0.019
 barely+0.017
 nearly+0.017
Neg logit contributions
smart-0.018
itty-0.016
 course-0.016
in-0.015
eros-0.015
Token itch
Token ID49700
Feature activation+1.76e-02

Pos logit contributions
cases+0.085
 hardly+0.083
tered+0.077
abs+0.072
 barely+0.072
Neg logit contributions
smart-0.074
 course-0.069
 welcome-0.069
eros-0.067
itty-0.067
Token on
Token ID319
Feature activation+1.17e+00

Pos logit contributions
 hardly+0.000
cases+0.000
tered+0.000
 barely+0.000
 nearly+0.000
Neg logit contributions
smart-0.000
itty-0.000
 course-0.000
eros-0.000
ang-0.000
Token his
Token ID465
Feature activation+5.84e+00

Pos logit contributions
 hardly+0.043
cases+0.042
tered+0.041
 barely+0.038
abs+0.038
Neg logit contributions
smart-0.014
 course-0.012
itty-0.011
eros-0.010
in-0.010
Token nose
Token ID9686
Feature activation+2.07e+00

Pos logit contributions
 hardly+0.167
cases+0.162
tered+0.157
 nearly+0.144
 barely+0.143
Neg logit contributions
smart-0.111
itty-0.105
 living-0.101
eros-0.098
 course-0.097
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
 hardly+0.060
cases+0.057
tered+0.056
 barely+0.052
 nearly+0.051
Neg logit contributions
smart-0.041
itty-0.035
 course-0.035
in-0.034
 living-0.032
Token He
Token ID679
Feature activation+1.49e+00

Pos logit contributions
 hardly+0.045
cases+0.044
tered+0.043
 barely+0.040
abs+0.039
Neg logit contributions
smart-0.024
itty-0.020
 course-0.020
 welcome-0.019
eros-0.019
Token rub
Token ID6437
Feature activation+1.91e+00

Pos logit contributions
cases+0.046
tered+0.046
 hardly+0.046
icing+0.043
abs+0.043
Neg logit contributions
smart-0.022
etchup-0.022
 living-0.022
 course-0.021
in-0.020
Tokens
Token ID82
Feature activation+3.22e+00

Pos logit contributions
 hardly+0.047
cases+0.046
tered+0.044
 barely+0.040
 nearly+0.039
Neg logit contributions
smart-0.046
itty-0.042
 course-0.041
eros-0.039
in-0.039
Token had
Token ID550
Feature activation+1.08e+00

Pos logit contributions
cases+0.048
abs+0.045
 hardly+0.044
late+0.043
icing+0.042
Neg logit contributions
 living-0.033
smart-0.031
itty-0.029
in-0.028
 course-0.028
Token a
Token ID257
Feature activation+8.65e-01

Pos logit contributions
cases+0.030
 hardly+0.029
tered+0.029
abs+0.028
 closely+0.027
Neg logit contributions
smart-0.022
 living-0.020
itty-0.019
 course-0.018
etchup-0.017
Token big
Token ID1263
Feature activation-1.09e+00

Pos logit contributions
 hardly+0.021
cases+0.019
tered+0.019
 barely+0.018
 nearly+0.017
Neg logit contributions
smart-0.022
 course-0.019
itty-0.019
eros-0.019
ang-0.019
Token bump
Token ID13852
Feature activation+3.57e-01

Pos logit contributions
smart+0.025
itty+0.021
in+0.021
ang+0.021
rob+0.021
Neg logit contributions
 hardly-0.028
cases-0.026
tered-0.025
 barely-0.024
 floor-0.024
Token on
Token ID319
Feature activation+1.30e+00

Pos logit contributions
 hardly+0.011
cases+0.011
tered+0.010
 barely+0.010
 nearly+0.009
Neg logit contributions
smart-0.006
itty-0.006
 course-0.005
in-0.005
eros-0.005
Token her
Token ID607
Feature activation+5.74e+00

Pos logit contributions
cases+0.045
 hardly+0.043
abs+0.041
tered+0.041
rated+0.040
Neg logit contributions
smart-0.018
 course-0.017
 living-0.014
itty-0.013
 welcome-0.013
Token arm
Token ID3211
Feature activation+2.04e+00

Pos logit contributions
cases+0.138
 hardly+0.137
tered+0.133
rated+0.121
abs+0.121
Neg logit contributions
smart-0.124
 living-0.123
eros-0.118
itty-0.117
 course-0.116
Token and
Token ID290
Feature activation+8.04e-01

Pos logit contributions
 hardly+0.052
cases+0.048
tered+0.047
 barely+0.043
abs+0.043
Neg logit contributions
smart-0.048
itty-0.044
 course-0.043
 living-0.041
in-0.040
Token it
Token ID340
Feature activation-2.49e-01

Pos logit contributions
cases+0.019
tered+0.018
 hardly+0.018
abs+0.018
late+0.017
Neg logit contributions
smart-0.019
 living-0.018
 course-0.017
itty-0.016
 welcome-0.016
Token really
Token ID1107
Feature activation+1.44e+00

Pos logit contributions
smart+0.005
 living+0.005
itty+0.004
 course+0.004
in+0.004
Neg logit contributions
cases-0.007
abs-0.006
 hardly-0.006
tered-0.006
late-0.006
Token hurt
Token ID5938
Feature activation+1.28e+00

Pos logit contributions
 hardly+0.033
cases+0.033
late+0.030
abs+0.029
tered+0.029
Neg logit contributions
smart-0.035
in-0.030
itty-0.030
 course-0.030
 living-0.030
Token stub
Token ID17071
Feature activation+1.54e+00

Pos logit contributions
tered+0.054
 hardly+0.054
cases+0.053
late+0.052
appointed+0.049
Neg logit contributions
 living-0.015
P-0.013
in-0.012
o-0.012
-0.012
Tokenbed
Token ID3077
Feature activation+1.97e+00

Pos logit contributions
cases+0.039
 hardly+0.039
tered+0.035
 barely+0.033
 nearly+0.031
Neg logit contributions
smart-0.035
itty-0.034
in-0.033
 course-0.032
eros-0.032
Token her
Token ID607
Feature activation+2.23e+00

Pos logit contributions
 hardly+0.061
cases+0.061
tered+0.057
 barely+0.054
abs+0.052
Neg logit contributions
smart-0.035
itty-0.031
in-0.030
 course-0.030
ang-0.028
Token toe
Token ID21189
Feature activation+2.57e+00

Pos logit contributions
cases+0.065
 hardly+0.065
tered+0.061
rated+0.056
abs+0.056
Neg logit contributions
smart-0.040
itty-0.039
eros-0.038
in-0.037
 living-0.037
Token on
Token ID319
Feature activation+4.52e-01

Pos logit contributions
cases+0.096
 hardly+0.094
tered+0.088
icity+0.085
abs+0.084
Neg logit contributions
smart-0.030
in-0.025
itty-0.025
 course-0.023
ang-0.022
Token a
Token ID257
Feature activation+5.70e+00

Pos logit contributions
cases+0.015
 hardly+0.014
tered+0.013
abs+0.013
rated+0.012
Neg logit contributions
smart-0.008
 course-0.007
itty-0.007
in-0.006
 living-0.006
Token rock
Token ID3881
Feature activation+1.06e+00

Pos logit contributions
cases+0.190
 hardly+0.176
tered+0.172
rated+0.157
abs+0.154
Neg logit contributions
smart-0.092
 course-0.088
itty-0.083
in-0.082
 living-0.081
Token.
Token ID13
Feature activation+9.43e-01

Pos logit contributions
cases+0.031
 hardly+0.031
tered+0.029
 barely+0.026
abs+0.026
Neg logit contributions
smart-0.021
 course-0.019
itty-0.018
in-0.018
ang-0.017
Token It
Token ID632
Feature activation+1.96e+00

Pos logit contributions
cases+0.030
 hardly+0.029
tered+0.028
rated+0.027
abs+0.026
Neg logit contributions
smart-0.016
itty-0.014
 welcome-0.013
 course-0.013
 living-0.012
Token hurt
Token ID5938
Feature activation+2.25e+00

Pos logit contributions
cases+0.055
 hardly+0.049
tered+0.047
abs+0.047
icity+0.047
Neg logit contributions
smart-0.040
in-0.037
 course-0.036
itty-0.036
o-0.034
Token a
Token ID257
Feature activation+3.59e+00

Pos logit contributions
cases+0.078
 hardly+0.074
abs+0.066
tered+0.065
 barely+0.064
Neg logit contributions
smart-0.038
 course-0.035
itty-0.033
in-0.031
 welcome-0.030
Token him
Token ID683
Feature activation+1.96e+00

Pos logit contributions
 hardly+0.007
tered+0.007
cases+0.007
 nearly+0.006
 almost+0.006
Neg logit contributions
smart-0.006
itty-0.006
ang-0.006
eros-0.005
in-0.005
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
 hardly+0.062
cases+0.060
tered+0.058
 barely+0.054
 nearly+0.053
Neg logit contributions
smart-0.034
itty-0.029
 course-0.029
in-0.027
eros-0.027
Token Teddy
Token ID29345
Feature activation+2.52e+00

Pos logit contributions
cases+0.013
 hardly+0.013
tered+0.012
rated+0.012
abs+0.012
Neg logit contributions
smart-0.007
itty-0.006
 living-0.006
 course-0.006
 Friendship-0.005
Token fell
Token ID3214
Feature activation+9.52e-01

Pos logit contributions
cases+0.064
 hardly+0.064
late+0.061
tered+0.060
abs+0.059
Neg logit contributions
in-0.050
smart-0.049
itty-0.049
o-0.048
 living-0.047
Token on
Token ID319
Feature activation+1.53e-01

Pos logit contributions
 hardly+0.030
cases+0.029
tered+0.028
 barely+0.026
late+0.025
Neg logit contributions
smart-0.017
itty-0.015
 course-0.014
in-0.014
ang-0.013
Token her
Token ID607
Feature activation+5.68e+00

Pos logit contributions
cases+0.005
 hardly+0.005
tered+0.005
abs+0.005
rated+0.005
Neg logit contributions
smart-0.002
 course-0.002
itty-0.002
eros-0.002
 living-0.002
Token shoulder
Token ID8163
Feature activation+1.86e+00

Pos logit contributions
cases+0.140
 hardly+0.140
tered+0.135
abs+0.126
rated+0.124
Neg logit contributions
smart-0.123
 living-0.117
 course-0.115
itty-0.114
eros-0.112
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
 hardly+0.049
cases+0.047
tered+0.045
abs+0.042
icity+0.041
Neg logit contributions
smart-0.041
 course-0.037
itty-0.036
in-0.034
 living-0.034
Token it
Token ID340
Feature activation+8.99e-01

Pos logit contributions
cases+0.029
 hardly+0.028
tered+0.027
abs+0.025
late+0.024
Neg logit contributions
smart-0.029
 course-0.026
itty-0.026
in-0.025
eros-0.025
Token hurt
Token ID5938
Feature activation+1.37e+00

Pos logit contributions
cases+0.026
abs+0.025
 hardly+0.024
icity+0.024
late+0.023
Neg logit contributions
smart-0.015
 course-0.014
 band-0.013
 living-0.013
in-0.013
Token.
Token ID13
Feature activation+2.19e-01

Pos logit contributions
 hardly+0.040
cases+0.040
tered+0.036
 barely+0.035
abs+0.034
Neg logit contributions
smart-0.028
itty-0.024
 course-0.023
in-0.023
 living-0.022
Token had
Token ID550
Feature activation+4.51e-01

Pos logit contributions
cases+0.023
 hardly+0.022
tered+0.021
abs+0.020
late+0.019
Neg logit contributions
smart-0.020
 course-0.018
itty-0.018
 living-0.017
in-0.017
Token a
Token ID257
Feature activation+7.05e-01

Pos logit contributions
 hardly+0.010
cases+0.010
tered+0.009
 barely+0.009
 nearly+0.008
Neg logit contributions
smart-0.012
itty-0.011
 course-0.011
eros-0.010
in-0.010
Token big
Token ID1263
Feature activation-1.19e+00

Pos logit contributions
 hardly+0.014
 barely+0.012
 almost+0.011
chy+0.011
tered+0.011
Neg logit contributions
smart-0.021
ang-0.020
rob-0.020
itty-0.020
eros-0.020
Token bump
Token ID13852
Feature activation+2.43e-01

Pos logit contributions
smart+0.032
ang+0.029
itty+0.029
in+0.029
rob+0.028
Neg logit contributions
 hardly-0.023
 floor-0.022
tered-0.021
 almost-0.020
 barely-0.020
Token on
Token ID319
Feature activation+8.05e-01

Pos logit contributions
 hardly+0.007
cases+0.007
tered+0.007
 nearly+0.006
 closely+0.006
Neg logit contributions
smart-0.005
itty-0.004
 course-0.004
eros-0.004
 welcome-0.004
Token his
Token ID465
Feature activation+5.66e+00

Pos logit contributions
cases+0.030
 hardly+0.030
tered+0.028
abs+0.027
rated+0.027
Neg logit contributions
smart-0.009
 course-0.008
eros-0.007
itty-0.007
in-0.006
Token head
Token ID1182
Feature activation+5.78e-01

Pos logit contributions
 hardly+0.146
cases+0.145
tered+0.138
 nearly+0.124
 barely+0.124
Neg logit contributions
smart-0.124
itty-0.113
 course-0.111
eros-0.110
in-0.107
Token and
Token ID290
Feature activation+3.46e-01

Pos logit contributions
 hardly+0.015
cases+0.015
tered+0.014
 barely+0.013
abs+0.013
Neg logit contributions
smart-0.013
itty-0.011
 course-0.011
in-0.011
eros-0.011
Token a
Token ID257
Feature activation+1.68e+00

Pos logit contributions
 hardly+0.009
cases+0.009
tered+0.009
 barely+0.008
 nearly+0.008
Neg logit contributions
smart-0.008
itty-0.007
 course-0.007
in-0.006
eros-0.006
Token cut
Token ID2005
Feature activation+5.73e-01

Pos logit contributions
 hardly+0.035
 barely+0.031
tered+0.029
 floor+0.029
 nearly+0.027
Neg logit contributions
smart-0.048
rob-0.047
ang-0.046
eros-0.046
idd-0.045
Token on
Token ID319
Feature activation+5.56e-01

Pos logit contributions
 hardly+0.017
cases+0.017
tered+0.016
 barely+0.015
 nearly+0.015
Neg logit contributions
smart-0.011
itty-0.009
 course-0.009
in-0.009
eros-0.009
Token tri
Token ID1333
Feature activation+2.31e+00

Pos logit contributions
 hardly+0.024
cases+0.024
tered+0.023
late+0.022
 barely+0.021
Neg logit contributions
smart-0.022
itty-0.020
 living-0.020
 course-0.019
in-0.019
Tokenpped
Token ID1496
Feature activation+1.82e+00

Pos logit contributions
cases+0.059
 hardly+0.057
tered+0.053
abs+0.050
 closely+0.049
Neg logit contributions
smart-0.056
in-0.051
itty-0.050
ang-0.049
idd-0.049
Token and
Token ID290
Feature activation+2.30e+00

Pos logit contributions
 hardly+0.048
cases+0.046
tered+0.045
 barely+0.041
 nearly+0.041
Neg logit contributions
smart-0.041
itty-0.037
 course-0.035
eros-0.035
ang-0.034
Token fell
Token ID3214
Feature activation+9.26e-01

Pos logit contributions
cases+0.044
 hardly+0.042
tered+0.041
abs+0.036
late+0.036
Neg logit contributions
smart-0.067
itty-0.062
 course-0.062
in-0.060
etchup-0.058
Token on
Token ID319
Feature activation-1.27e-01

Pos logit contributions
 hardly+0.027
cases+0.026
tered+0.026
 barely+0.024
 nearly+0.023
Neg logit contributions
smart-0.018
itty-0.016
 course-0.016
eros-0.015
in-0.015
Token her
Token ID607
Feature activation+5.61e+00

Pos logit contributions
smart+0.002
 course+0.002
itty+0.002
eros+0.002
in+0.002
Neg logit contributions
cases-0.004
 hardly-0.004
tered-0.004
abs-0.003
 barely-0.003
Token way
Token ID835
Feature activation+1.61e+00

Pos logit contributions
cases+0.143
 hardly+0.143
tered+0.141
abs+0.127
late+0.126
Neg logit contributions
smart-0.119
itty-0.109
 course-0.109
eros-0.106
 living-0.105
Token to
Token ID284
Feature activation+7.58e-01

Pos logit contributions
 hardly+0.045
cases+0.044
tered+0.043
 barely+0.039
late+0.038
Neg logit contributions
smart-0.033
itty-0.030
 course-0.029
in-0.028
ang-0.027
Token school
Token ID1524
Feature activation+1.94e+00

Pos logit contributions
 hardly+0.019
cases+0.018
tered+0.018
 barely+0.016
 nearly+0.016
Neg logit contributions
smart-0.018
itty-0.016
 course-0.016
in-0.015
eros-0.015
Token.
Token ID13
Feature activation+7.11e-01

Pos logit contributions
 hardly+0.057
cases+0.054
tered+0.053
 barely+0.049
abs+0.049
Neg logit contributions
smart-0.038
 course-0.033
itty-0.032
 living-0.031
in-0.030
Token She
Token ID1375
Feature activation+2.18e+00

Pos logit contributions
 hardly+0.020
cases+0.020
tered+0.019
rated+0.018
 barely+0.017
Neg logit contributions
smart-0.014
itty-0.013
 course-0.012
 living-0.012
 welcome-0.012
Token had
Token ID550
Feature activation-4.43e-01

Pos logit contributions
cases+0.014
 hardly+0.013
tered+0.013
appointed+0.012
abs+0.012
Neg logit contributions
in-0.011
smart-0.010
itty-0.010
o-0.010
 course-0.009
Token a
Token ID257
Feature activation+1.63e-01

Pos logit contributions
smart+0.009
 course+0.008
itty+0.007
 living+0.007
in+0.007
Neg logit contributions
cases-0.013
 hardly-0.013
tered-0.012
abs-0.011
rated-0.011
Token big
Token ID1263
Feature activation-1.58e+00

Pos logit contributions
 hardly+0.004
cases+0.003
 barely+0.003
tered+0.003
 nearly+0.003
Neg logit contributions
smart-0.004
ang-0.004
eros-0.004
rob-0.004
itty-0.004
Token bump
Token ID13852
Feature activation-4.20e-01

Pos logit contributions
smart+0.042
ang+0.038
itty+0.037
in+0.036
rob+0.036
Neg logit contributions
 hardly-0.034
tered-0.031
cases-0.031
 floor-0.030
 barely-0.029
Token on
Token ID319
Feature activation+4.36e-01

Pos logit contributions
smart+0.009
itty+0.008
 course+0.008
eros+0.008
ang+0.008
Neg logit contributions
cases-0.011
 hardly-0.011
tered-0.011
 nearly-0.010
 couldn-0.010
Token his
Token ID465
Feature activation+5.58e+00

Pos logit contributions
 hardly+0.016
cases+0.016
tered+0.016
 barely+0.015
 nearly+0.014
Neg logit contributions
smart-0.005
itty-0.004
 course-0.004
in-0.004
eros-0.004
Token head
Token ID1182
Feature activation-6.28e-01

Pos logit contributions
 hardly+0.158
cases+0.154
tered+0.150
abs+0.138
 nearly+0.137
Neg logit contributions
smart-0.107
 living-0.094
itty-0.094
eros-0.093
 course-0.092
Token.
Token ID13
Feature activation-6.92e-01

Pos logit contributions
smart+0.014
itty+0.013
 course+0.012
eros+0.012
in+0.012
Neg logit contributions
 hardly-0.016
cases-0.016
tered-0.015
 barely-0.014
 nearly-0.014
Token<|endoftext|>
Token ID50256
Feature activation-2.17e+00

Pos logit contributions
smart+0.015
 course+0.014
itty+0.013
in+0.013
eros+0.013
Neg logit contributions
 hardly-0.019
cases-0.018
tered-0.017
 couldn-0.016
 barely-0.016
TokenOnce
Token ID7454
Feature activation+2.99e-02

Pos logit contributions
smart+0.043
itty+0.038
 course+0.037
eros+0.036
ang+0.036
Neg logit contributions
 hardly-0.062
cases-0.059
tered-0.059
 barely-0.054
 nearly-0.054
Token upon
Token ID2402
Feature activation+2.35e+00

Pos logit contributions
 hardly+0.001
cases+0.001
tered+0.001
 barely+0.001
 nearly+0.000
Neg logit contributions
smart-0.001
itty-0.001
 course-0.001
in-0.001
ang-0.001
Token tower
Token ID10580
Feature activation+3.21e-01

Pos logit contributions
 hardly+0.041
cases+0.040
tered+0.038
 barely+0.035
 nearly+0.034
Neg logit contributions
smart-0.034
itty-0.030
 course-0.030
in-0.028
ang-0.028
Token and
Token ID290
Feature activation+1.55e+00

Pos logit contributions
 hardly+0.008
cases+0.008
tered+0.008
 barely+0.007
abs+0.007
Neg logit contributions
smart-0.007
itty-0.007
 course-0.006
in-0.006
ang-0.006
Token it
Token ID340
Feature activation+6.67e-01

Pos logit contributions
cases+0.044
 hardly+0.044
abs+0.043
tered+0.041
rated+0.039
Neg logit contributions
smart-0.029
 course-0.026
 living-0.025
itty-0.024
in-0.024
Token fell
Token ID3214
Feature activation+6.71e-01

Pos logit contributions
abs+0.019
 hardly+0.019
cases+0.019
icity+0.018
tered+0.018
Neg logit contributions
smart-0.011
 course-0.010
in-0.010
 living-0.010
itty-0.009
Token on
Token ID319
Feature activation-2.21e-03

Pos logit contributions
 hardly+0.023
cases+0.022
tered+0.021
 barely+0.020
abs+0.020
Neg logit contributions
smart-0.010
itty-0.009
 course-0.008
in-0.008
 living-0.008
Token her
Token ID607
Feature activation+5.57e+00

Pos logit contributions
smart+0.000
 course+0.000
itty+0.000
 living+0.000
eros+0.000
Neg logit contributions
cases-0.000
 hardly-0.000
tered-0.000
abs-0.000
rated-0.000
Token foot
Token ID2366
Feature activation+1.61e+00

Pos logit contributions
 hardly+0.139
cases+0.137
tered+0.136
abs+0.124
late+0.120
Neg logit contributions
smart-0.122
 living-0.111
 course-0.111
eros-0.110
itty-0.110
Token.
Token ID13
Feature activation+8.05e-01

Pos logit contributions
 hardly+0.046
cases+0.045
tered+0.044
 barely+0.040
late+0.039
Neg logit contributions
smart-0.033
itty-0.029
 course-0.028
in-0.027
ang-0.026
Token It
Token ID632
Feature activation+1.24e+00

Pos logit contributions
cases+0.026
 hardly+0.025
tered+0.024
rated+0.023
abs+0.023
Neg logit contributions
smart-0.013
itty-0.012
 course-0.011
 welcome-0.010
 living-0.010
Token hurt
Token ID5938
Feature activation+1.51e+00

Pos logit contributions
cases+0.036
 hardly+0.034
abs+0.033
icity+0.032
tered+0.031
Neg logit contributions
smart-0.024
itty-0.021
 course-0.021
in-0.021
 living-0.020
Token a
Token ID257
Feature activation+3.12e+00

Pos logit contributions
cases+0.049
 hardly+0.048
tered+0.042
 barely+0.042
abs+0.042
Neg logit contributions
smart-0.028
 course-0.025
itty-0.024
 living-0.022
in-0.022
Token.
Token ID13
Feature activation+8.60e-01

Pos logit contributions
 hardly+0.010
tered+0.009
cases+0.009
 barely+0.009
abs+0.009
Neg logit contributions
smart-0.007
 course-0.007
itty-0.007
in-0.006
 living-0.006
Token The
Token ID383
Feature activation+1.29e+00

Pos logit contributions
 hardly+0.023
cases+0.023
tered+0.022
 barely+0.020
late+0.020
Neg logit contributions
smart-0.018
itty-0.017
 course-0.016
eros-0.015
in-0.015
Token books
Token ID3835
Feature activation+1.66e+00

Pos logit contributions
 hardly+0.035
cases+0.034
tered+0.033
 barely+0.030
 nearly+0.029
Neg logit contributions
smart-0.028
itty-0.025
 course-0.025
in-0.024
eros-0.024
Token fell
Token ID3214
Feature activation+1.26e+00

Pos logit contributions
 hardly+0.043
cases+0.041
abs+0.039
tered+0.039
late+0.038
Neg logit contributions
smart-0.036
 course-0.033
itty-0.032
in-0.031
 living-0.029
Token on
Token ID319
Feature activation+5.32e-01

Pos logit contributions
 hardly+0.040
cases+0.037
tered+0.036
 barely+0.034
abs+0.034
Neg logit contributions
smart-0.022
itty-0.019
 course-0.018
in-0.017
 living-0.017
Token her
Token ID607
Feature activation+5.57e+00

Pos logit contributions
cases+0.018
 hardly+0.017
tered+0.016
abs+0.016
rated+0.015
Neg logit contributions
smart-0.008
 course-0.007
itty-0.007
eros-0.006
in-0.006
Token,
Token ID11
Feature activation+3.70e+00

Pos logit contributions
 hardly+0.138
cases+0.135
tered+0.132
abs+0.124
rated+0.120
Neg logit contributions
smart-0.118
 living-0.113
itty-0.111
 course-0.109
eros-0.108
Token and
Token ID290
Feature activation+1.55e+00

Pos logit contributions
cases+0.086
 hardly+0.086
abs+0.081
tered+0.079
late+0.075
Neg logit contributions
smart-0.087
 living-0.080
 course-0.078
itty-0.074
etchup-0.072
Token she
Token ID673
Feature activation+2.86e+00

Pos logit contributions
 hardly+0.041
cases+0.040
tered+0.038
 barely+0.035
abs+0.035
Neg logit contributions
smart-0.034
itty-0.031
 course-0.030
in-0.029
eros-0.029
Token cried
Token ID16896
Feature activation+7.75e-01

Pos logit contributions
cases+0.067
abs+0.065
rated+0.063
tered+0.062
late+0.062
Neg logit contributions
smart-0.064
 living-0.061
in-0.060
itty-0.060
o-0.058
Token out
Token ID503
Feature activation-7.13e-01

Pos logit contributions
 hardly+0.023
cases+0.022
tered+0.020
abs+0.020
 barely+0.020
Neg logit contributions
smart-0.015
itty-0.013
 course-0.012
 living-0.012
 welcome-0.012
Token saw
Token ID2497
Feature activation-4.13e-01

Pos logit contributions
 hardly+0.036
cases+0.035
appointed+0.034
late+0.034
tered+0.033
Neg logit contributions
 living-0.017
in-0.016
o-0.016
idd-0.014
itty-0.014
Token a
Token ID257
Feature activation+2.51e-01

Pos logit contributions
smart+0.009
itty+0.008
 course+0.008
eros+0.008
in+0.008
Neg logit contributions
 hardly-0.011
cases-0.011
tered-0.010
 barely-0.009
 nearly-0.009
Token big
Token ID1263
Feature activation-1.71e+00

Pos logit contributions
 hardly+0.005
cases+0.005
tered+0.005
 barely+0.004
 floor+0.004
Neg logit contributions
smart-0.007
itty-0.006
 course-0.006
ang-0.006
rob-0.006
Token stain
Token ID18746
Feature activation+4.86e-01

Pos logit contributions
smart+0.043
itty+0.035
 Working+0.035
 Friendship+0.035
ang+0.034
Neg logit contributions
 floor-0.040
 hardly-0.037
 closely-0.036
 walls-0.036
tered-0.035
Token on
Token ID319
Feature activation+9.89e-01

Pos logit contributions
 hardly+0.018
cases+0.017
tered+0.017
 barely+0.016
 nearly+0.016
Neg logit contributions
smart-0.006
itty-0.005
 course-0.005
in-0.004
eros-0.004
Token her
Token ID607
Feature activation+5.56e+00

Pos logit contributions
cases+0.032
 hardly+0.032
tered+0.030
abs+0.029
 barely+0.028
Neg logit contributions
smart-0.016
 course-0.014
itty-0.013
eros-0.012
in-0.012
Token trumpet
Token ID40334
Feature activation+6.35e-01

Pos logit contributions
 hardly+0.143
cases+0.140
tered+0.137
rated+0.123
abs+0.123
Neg logit contributions
smart-0.117
itty-0.111
 living-0.111
 course-0.109
eros-0.108
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
 hardly+0.019
cases+0.018
tered+0.018
 barely+0.017
abs+0.017
Neg logit contributions
smart-0.011
 course-0.010
itty-0.010
 living-0.009
in-0.009
Token It
Token ID632
Feature activation+1.89e-01

Pos logit contributions
cases+0.052
 hardly+0.052
rated+0.047
abs+0.047
tered+0.046
Neg logit contributions
smart-0.028
itty-0.024
 welcome-0.023
 living-0.023
 course-0.022
Token was
Token ID373
Feature activation+2.08e+00

Pos logit contributions
cases+0.005
 hardly+0.005
tered+0.005
abs+0.005
late+0.004
Neg logit contributions
smart-0.004
itty-0.004
 course-0.003
in-0.003
 living-0.003
Token brown
Token ID7586
Feature activation+9.06e-01

Pos logit contributions
cases+0.058
 hardly+0.056
tered+0.052
abs+0.051
rated+0.049
Neg logit contributions
smart-0.044
 living-0.037
itty-0.037
 course-0.037
 welcome-0.036
Token Max
Token ID5436
Feature activation+8.86e-01

Pos logit contributions
smart+0.013
itty+0.012
 course+0.012
eros+0.011
in+0.011
Neg logit contributions
 hardly-0.016
cases-0.015
tered-0.015
 barely-0.013
 nearly-0.013
Token had
Token ID550
Feature activation-5.98e-02

Pos logit contributions
 hardly+0.023
cases+0.023
tered+0.021
abs+0.021
late+0.020
Neg logit contributions
smart-0.019
 living-0.017
itty-0.017
 course-0.017
in-0.017
Token a
Token ID257
Feature activation+9.10e-01

Pos logit contributions
smart+0.001
 living+0.001
 course+0.001
 welcome+0.001
 band+0.001
Neg logit contributions
cases-0.002
tered-0.002
 hardly-0.002
abs-0.002
 closely-0.002
Token scratch
Token ID12692
Feature activation-4.51e-01

Pos logit contributions
 hardly+0.023
cases+0.022
tered+0.021
 barely+0.020
 nearly+0.019
Neg logit contributions
smart-0.022
itty-0.019
 course-0.019
ang-0.018
in-0.018
Token on
Token ID319
Feature activation+3.76e-01

Pos logit contributions
smart+0.008
 course+0.006
itty+0.006
eros+0.006
etchup+0.006
Neg logit contributions
 hardly-0.014
cases-0.014
tered-0.014
 couldn-0.013
 nearly-0.013
Token his
Token ID465
Feature activation+5.55e+00

Pos logit contributions
cases+0.015
 hardly+0.014
tered+0.013
abs+0.013
rated+0.013
Neg logit contributions
smart-0.004
 course-0.003
itty-0.003
 welcome-0.002
eros-0.002
Token foot
Token ID2366
Feature activation+1.19e+00

Pos logit contributions
 hardly+0.141
cases+0.137
tered+0.131
 barely+0.119
 nearly+0.119
Neg logit contributions
smart-0.126
itty-0.114
 course-0.112
eros-0.111
in-0.109
Token.
Token ID13
Feature activation-3.09e-02

Pos logit contributions
 hardly+0.031
cases+0.030
tered+0.029
 barely+0.027
 nearly+0.026
Neg logit contributions
smart-0.027
itty-0.024
 course-0.024
in-0.023
eros-0.022
Token She
Token ID1375
Feature activation+3.71e-01

Pos logit contributions
smart+0.001
itty+0.001
 course+0.001
 living+0.000
eros+0.000
Neg logit contributions
 hardly-0.001
cases-0.001
tered-0.001
 barely-0.001
 nearly-0.001
Token felt
Token ID2936
Feature activation+7.79e-01

Pos logit contributions
cases+0.011
 hardly+0.010
abs+0.010
tered+0.010
late+0.010
Neg logit contributions
in-0.007
o-0.007
smart-0.006
itty-0.006
 living-0.006
Token bad
Token ID2089
Feature activation-2.51e-01

Pos logit contributions
 hardly+0.020
cases+0.020
tered+0.019
 barely+0.017
 nearly+0.017
Neg logit contributions
smart-0.018
itty-0.016
 course-0.016
in-0.015
eros-0.015
Token had
Token ID550
Feature activation+3.31e-01

Pos logit contributions
cases+0.027
 hardly+0.027
tered+0.025
late+0.023
 barely+0.022
Neg logit contributions
smart-0.026
itty-0.024
 course-0.024
 living-0.023
in-0.023
Token a
Token ID257
Feature activation+4.43e-01

Pos logit contributions
cases+0.009
 hardly+0.009
tered+0.008
late+0.007
 closely+0.007
Neg logit contributions
smart-0.007
itty-0.007
 course-0.006
 living-0.006
in-0.006
Token big
Token ID1263
Feature activation-1.40e+00

Pos logit contributions
 hardly+0.010
 barely+0.008
tered+0.008
cases+0.008
 almost+0.008
Neg logit contributions
smart-0.013
 course-0.011
itty-0.011
in-0.011
eros-0.011
Token itch
Token ID49700
Feature activation-7.93e-01

Pos logit contributions
smart+0.036
in+0.030
itty+0.030
etchup+0.029
ang+0.029
Neg logit contributions
 hardly-0.034
cases-0.030
tered-0.030
 floor-0.030
 barely-0.029
Token on
Token ID319
Feature activation+4.02e-01

Pos logit contributions
smart+0.017
itty+0.015
etchup+0.015
eros+0.015
ang+0.015
Neg logit contributions
 hardly-0.022
tered-0.020
cases-0.020
 barely-0.019
 couldn-0.019
Token his
Token ID465
Feature activation+5.52e+00

Pos logit contributions
 hardly+0.015
cases+0.014
tered+0.014
 barely+0.013
 nearly+0.013
Neg logit contributions
smart-0.005
itty-0.004
 course-0.004
eros-0.004
in-0.004
Token back
Token ID736
Feature activation+9.72e-01

Pos logit contributions
 hardly+0.150
cases+0.149
tered+0.144
 nearly+0.129
 barely+0.128
Neg logit contributions
smart-0.113
itty-0.105
 course-0.101
eros-0.098
in-0.096
Token.
Token ID13
Feature activation-8.22e-02

Pos logit contributions
 hardly+0.023
cases+0.022
tered+0.021
 barely+0.019
 nearly+0.019
Neg logit contributions
smart-0.024
 course-0.022
eros-0.021
itty-0.021
etchup-0.021
Token He
Token ID679
Feature activation+1.64e+00

Pos logit contributions
smart+0.002
itty+0.001
 course+0.001
eros+0.001
 welcome+0.001
Neg logit contributions
 hardly-0.002
cases-0.002
tered-0.002
 barely-0.002
 nearly-0.002
Token tried
Token ID3088
Feature activation+1.85e+00

Pos logit contributions
cases+0.047
late+0.040
 hardly+0.040
abs+0.039
tered+0.038
Neg logit contributions
 course-0.032
smart-0.031
itty-0.031
o-0.031
in-0.031
Token to
Token ID284
Feature activation+1.18e+00

Pos logit contributions
 hardly+0.050
cases+0.047
tered+0.046
 barely+0.043
 nearly+0.042
Neg logit contributions
smart-0.041
itty-0.037
 course-0.036
eros-0.035
ang-0.035
Token straight
Token ID3892
Feature activation-7.62e-01

Pos logit contributions
 hardly+0.016
cases+0.015
tered+0.015
abs+0.015
late+0.014
Neg logit contributions
smart-0.014
 living-0.013
 course-0.013
etchup-0.012
itty-0.012
Tokenened
Token ID2945
Feature activation+2.54e-01

Pos logit contributions
smart+0.017
itty+0.015
 course+0.015
in+0.014
eros+0.014
Neg logit contributions
 hardly-0.021
cases-0.020
tered-0.019
 barely-0.018
 nearly-0.017
Token the
Token ID262
Feature activation+2.56e+00

Pos logit contributions
 hardly+0.007
cases+0.007
tered+0.007
 barely+0.006
abs+0.006
Neg logit contributions
smart-0.005
 course-0.005
itty-0.005
in-0.005
eros-0.004
Token blankets
Token ID34794
Feature activation-2.92e-01

Pos logit contributions
 hardly+0.067
cases+0.062
tered+0.061
 barely+0.057
abs+0.055
Neg logit contributions
smart-0.057
 living-0.054
 course-0.053
itty-0.053
 welcome-0.051
Token on
Token ID319
Feature activation+5.46e-01

Pos logit contributions
smart+0.006
 course+0.005
itty+0.005
in+0.005
 living+0.005
Neg logit contributions
 hardly-0.008
cases-0.008
tered-0.008
 barely-0.007
abs-0.007
Token her
Token ID607
Feature activation+5.50e+00

Pos logit contributions
cases+0.020
 hardly+0.019
abs+0.018
rated+0.018
tered+0.018
Neg logit contributions
 course-0.007
smart-0.007
in-0.005
 living-0.005
 welcome-0.005
Token bed
Token ID3996
Feature activation+1.85e+00

Pos logit contributions
 hardly+0.119
tered+0.113
cases+0.112
abs+0.105
late+0.105
Neg logit contributions
 living-0.135
smart-0.132
 welcome-0.120
 course-0.120
eros-0.119
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
 hardly+0.058
cases+0.054
tered+0.053
rated+0.052
abs+0.052
Neg logit contributions
smart-0.030
 living-0.026
 course-0.024
itty-0.023
in-0.023
Token She
Token ID1375
Feature activation+1.06e+00

Pos logit contributions
 hardly+0.008
cases+0.008
tered+0.007
rated+0.007
 barely+0.007
Neg logit contributions
smart-0.006
itty-0.005
 welcome-0.005
 Working-0.005
 course-0.005
Token was
Token ID373
Feature activation+2.94e-01

Pos logit contributions
cases+0.028
 hardly+0.027
tered+0.027
abs+0.027
late+0.026
Neg logit contributions
in-0.020
 living-0.019
smart-0.019
 course-0.018
itty-0.018
Token so
Token ID523
Feature activation+6.21e-01

Pos logit contributions
cases+0.008
 hardly+0.007
tered+0.007
abs+0.007
late+0.007
Neg logit contributions
smart-0.006
 welcome-0.006
 living-0.006
itty-0.005
 course-0.005
Token Spot
Token ID15899
Feature activation+5.89e-01

Pos logit contributions
 hardly+0.006
cases+0.005
tered+0.005
 barely+0.005
 nearly+0.005
Neg logit contributions
smart-0.003
itty-0.003
 course-0.003
in-0.003
eros-0.003
Token had
Token ID550
Feature activation+5.11e-01

Pos logit contributions
 hardly+0.013
cases+0.013
tered+0.012
 barely+0.011
 nearly+0.011
Neg logit contributions
smart-0.015
itty-0.014
 course-0.014
in-0.014
eros-0.013
Token an
Token ID281
Feature activation+2.15e+00

Pos logit contributions
 hardly+0.013
cases+0.013
tered+0.013
late+0.012
 barely+0.011
Neg logit contributions
smart-0.011
itty-0.010
 course-0.010
 living-0.010
in-0.009
Token itch
Token ID49700
Feature activation-4.68e-01

Pos logit contributions
 hardly+0.053
cases+0.052
tered+0.050
 barely+0.045
 nearly+0.044
Neg logit contributions
smart-0.051
itty-0.046
 course-0.046
eros-0.044
in-0.044
Token on
Token ID319
Feature activation+1.69e-01

Pos logit contributions
smart+0.010
itty+0.009
etchup+0.009
eros+0.009
 course+0.009
Neg logit contributions
 hardly-0.012
tered-0.012
cases-0.012
 barely-0.011
 couldn-0.011
Token his
Token ID465
Feature activation+5.49e+00

Pos logit contributions
 hardly+0.006
cases+0.006
tered+0.006
 barely+0.005
 nearly+0.005
Neg logit contributions
smart-0.002
itty-0.002
 course-0.002
in-0.002
etchup-0.002
Token leg
Token ID1232
Feature activation+1.36e+00

Pos logit contributions
 hardly+0.152
cases+0.150
tered+0.146
 nearly+0.131
 barely+0.130
Neg logit contributions
smart-0.111
itty-0.102
 course-0.098
eros-0.095
in-0.094
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
 hardly+0.035
cases+0.034
tered+0.033
 barely+0.030
 nearly+0.029
Neg logit contributions
smart-0.031
itty-0.028
 course-0.028
in-0.027
eros-0.027
Token He
Token ID679
Feature activation+1.95e+00

Pos logit contributions
 hardly+0.009
cases+0.009
tered+0.009
 barely+0.008
 nearly+0.008
Neg logit contributions
smart-0.005
itty-0.005
 course-0.005
eros-0.004
etchup-0.004
Token tried
Token ID3088
Feature activation+1.87e+00

Pos logit contributions
cases+0.054
late+0.047
 hardly+0.046
abs+0.046
tered+0.045
Neg logit contributions
itty-0.038
smart-0.038
 course-0.037
o-0.037
in-0.037
Token to
Token ID284
Feature activation+1.17e+00

Pos logit contributions
 hardly+0.050
cases+0.046
tered+0.046
 barely+0.042
 nearly+0.041
Neg logit contributions
smart-0.042
itty-0.039
 course-0.038
eros-0.037
ang-0.036
Token his
Token ID465
Feature activation+1.26e+00

Pos logit contributions
 hardly+0.037
cases+0.036
tered+0.035
 barely+0.033
 closely+0.032
Neg logit contributions
smart-0.015
itty-0.013
 course-0.012
ang-0.011
eros-0.011
Token toys
Token ID14958
Feature activation+6.19e-01

Pos logit contributions
 hardly+0.029
cases+0.028
 floor+0.025
 barely+0.025
 couldn+0.024
Neg logit contributions
smart-0.037
 course-0.029
itty-0.029
 Working-0.028
 Friendship-0.028
Token and
Token ID290
Feature activation-2.40e-01

Pos logit contributions
 hardly+0.016
cases+0.015
tered+0.015
 barely+0.013
 nearly+0.013
Neg logit contributions
smart-0.015
itty-0.013
 course-0.013
in-0.013
eros-0.013
Token jump
Token ID4391
Feature activation+1.95e+00

Pos logit contributions
smart+0.006
itty+0.005
 course+0.005
in+0.005
eros+0.005
Neg logit contributions
 hardly-0.006
cases-0.006
tered-0.006
 barely-0.005
 nearly-0.005
Token on
Token ID319
Feature activation-2.16e-01

Pos logit contributions
 hardly+0.068
cases+0.064
abs+0.063
rated+0.062
appointed+0.061
Neg logit contributions
smart-0.023
in-0.022
 course-0.022
 living-0.020
 hipp-0.019
Token his
Token ID465
Feature activation+5.49e+00

Pos logit contributions
smart+0.004
 course+0.004
itty+0.003
in+0.003
eros+0.003
Neg logit contributions
 hardly-0.007
cases-0.007
tered-0.006
abs-0.006
 barely-0.006
Token bed
Token ID3996
Feature activation+1.87e+00

Pos logit contributions
 hardly+0.131
cases+0.126
tered+0.126
late+0.112
 nearly+0.112
Neg logit contributions
smart-0.127
itty-0.119
 living-0.116
 course-0.116
eros-0.115
Token.
Token ID13
Feature activation+7.60e-02

Pos logit contributions
 hardly+0.061
abs+0.057
cases+0.057
rated+0.056
appointed+0.055
Neg logit contributions
smart-0.025
 living-0.023
 course-0.021
in-0.020
 passenger-0.017
Token One
Token ID1881
Feature activation+1.34e+00

Pos logit contributions
 hardly+0.002
cases+0.002
tered+0.002
 barely+0.001
 nearly+0.001
Neg logit contributions
smart-0.002
itty-0.002
 course-0.002
in-0.002
eros-0.002
Token day
Token ID1110
Feature activation+3.42e-01

Pos logit contributions
 hardly+0.043
cases+0.041
tered+0.041
 barely+0.038
 nearly+0.037
Neg logit contributions
smart-0.023
itty-0.020
 course-0.019
eros-0.018
in-0.018
Token,
Token ID11
Feature activation+2.48e+00

Pos logit contributions
 hardly+0.009
cases+0.008
tered+0.008
 barely+0.007
 couldn+0.007
Neg logit contributions
smart-0.008
itty-0.007
 course-0.007
eros-0.007
in-0.007
Token slippery
Token ID32911
Feature activation+2.06e+00

Pos logit contributions
 hardly+0.093
cases+0.091
tered+0.087
icity+0.080
abs+0.079
Neg logit contributions
smart-0.089
in-0.075
 living-0.075
 course-0.074
etchup-0.073
Token.
Token ID13
Feature activation+2.36e+00

Pos logit contributions
 hardly+0.045
cases+0.044
tered+0.043
 barely+0.037
 nearly+0.037
Neg logit contributions
smart-0.055
itty-0.051
 course-0.049
eros-0.048
ang-0.048
Token The
Token ID383
Feature activation+1.55e-01

Pos logit contributions
 hardly+0.066
cases+0.065
tered+0.061
 barely+0.057
rated+0.056
Neg logit contributions
smart-0.050
itty-0.044
 course-0.043
etchup-0.041
 welcome-0.040
Token soap
Token ID19533
Feature activation+1.75e+00

Pos logit contributions
 hardly+0.004
cases+0.004
tered+0.004
 barely+0.003
 floor+0.003
Neg logit contributions
smart-0.004
itty-0.003
 course-0.003
eros-0.003
ang-0.003
Token made
Token ID925
Feature activation+4.54e+00

Pos logit contributions
cases+0.040
 hardly+0.039
tered+0.037
late+0.034
icity+0.033
Neg logit contributions
smart-0.044
 course-0.040
in-0.040
itty-0.039
etchup-0.038
Token Lily
Token ID20037
Feature activation+5.47e+00

Pos logit contributions
cases+0.147
 hardly+0.145
rated+0.131
tered+0.131
 barely+0.128
Neg logit contributions
smart-0.073
 course-0.066
in-0.061
 living-0.060
 welcome-0.059
Token's
Token ID338
Feature activation+2.98e+00

Pos logit contributions
cases+0.139
 hardly+0.135
tered+0.128
icity+0.122
late+0.120
Neg logit contributions
smart-0.127
itty-0.111
in-0.110
o-0.108
etchup-0.107
Token eyes
Token ID2951
Feature activation+3.76e+00

Pos logit contributions
cases+0.075
 hardly+0.071
tered+0.069
late+0.062
icing+0.061
Neg logit contributions
smart-0.067
itty-0.062
etchup-0.062
 living-0.061
eros-0.061
Token sting
Token ID24276
Feature activation+7.75e-01

Pos logit contributions
 hardly+0.096
cases+0.093
tered+0.086
icity+0.085
abs+0.083
Neg logit contributions
smart-0.084
in-0.071
 living-0.071
 course-0.070
 welcome-0.070
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
 hardly+0.019
tered+0.019
cases+0.018
 nearly+0.016
 couldn+0.016
Neg logit contributions
smart-0.018
itty-0.017
 course-0.016
eros-0.016
ang-0.016
Token\n
Token ID198
Feature activation+8.47e-02

Pos logit contributions
 hardly+0.051
cases+0.049
tered+0.048
 barely+0.044
 nearly+0.043
Neg logit contributions
smart-0.040
itty-0.036
 course-0.035
in-0.034
eros-0.033
Token got
Token ID1392
Feature activation+8.16e-01

Pos logit contributions
cases+0.073
 hardly+0.070
abs+0.067
tered+0.066
late+0.063
Neg logit contributions
smart-0.055
 course-0.049
 living-0.048
itty-0.048
in-0.046
Token a
Token ID257
Feature activation+2.14e+00

Pos logit contributions
 hardly+0.021
cases+0.020
tered+0.019
abs+0.017
 barely+0.017
Neg logit contributions
smart-0.019
itty-0.017
 course-0.017
in-0.017
etchup-0.017
Token big
Token ID1263
Feature activation-7.11e-01

Pos logit contributions
 hardly+0.053
cases+0.050
tered+0.049
 barely+0.044
 nearly+0.044
Neg logit contributions
smart-0.052
itty-0.047
 course-0.046
eros-0.045
ang-0.044
Token patch
Token ID8529
Feature activation+1.24e-01

Pos logit contributions
smart+0.018
itty+0.016
ang+0.016
 Friendship+0.015
rob+0.015
Neg logit contributions
 hardly-0.016
tered-0.015
cases-0.015
 floor-0.014
 nearly-0.014
Token on
Token ID319
Feature activation+7.26e-01

Pos logit contributions
 hardly+0.004
cases+0.004
tered+0.003
 barely+0.003
 nearly+0.003
Neg logit contributions
smart-0.002
itty-0.002
 course-0.002
in-0.002
ang-0.002
Token her
Token ID607
Feature activation+5.46e+00

Pos logit contributions
cases+0.026
 hardly+0.024
abs+0.023
tered+0.023
rated+0.023
Neg logit contributions
 course-0.010
smart-0.009
 living-0.008
itty-0.007
eros-0.007
Token knee
Token ID10329
Feature activation+6.95e-01

Pos logit contributions
cases+0.138
 hardly+0.137
tered+0.136
late+0.121
rated+0.120
Neg logit contributions
 living-0.110
smart-0.110
itty-0.105
eros-0.104
 welcome-0.101
Token.
Token ID13
Feature activation+9.31e-02

Pos logit contributions
 hardly+0.020
cases+0.020
tered+0.019
 barely+0.017
 nearly+0.017
Neg logit contributions
smart-0.013
itty-0.012
 course-0.011
in-0.011
eros-0.011
Token She
Token ID1375
Feature activation+2.21e+00

Pos logit contributions
cases+0.003
 hardly+0.003
tered+0.003
rated+0.002
 barely+0.002
Neg logit contributions
smart-0.002
itty-0.002
 course-0.001
 welcome-0.001
 living-0.001
Token was
Token ID373
Feature activation+1.05e+00

Pos logit contributions
cases+0.067
abs+0.061
 hardly+0.060
late+0.059
tered+0.058
Neg logit contributions
 band-0.039
in-0.038
 course-0.038
itty-0.036
smart-0.036
Token very
Token ID845
Feature activation+5.08e-01

Pos logit contributions
cases+0.028
 hardly+0.027
tered+0.026
abs+0.024
late+0.023
Neg logit contributions
smart-0.024
itty-0.021
 welcome-0.021
 living-0.020
 course-0.020
Token,
Token ID11
Feature activation+5.83e-01

Pos logit contributions
 hardly+0.056
cases+0.054
tered+0.050
 barely+0.046
late+0.046
Neg logit contributions
smart-0.060
itty-0.056
 course-0.054
in-0.054
eros-0.052
Token brown
Token ID7586
Feature activation-9.60e-01

Pos logit contributions
 hardly+0.015
tered+0.014
cases+0.013
 couldn+0.013
 barely+0.012
Neg logit contributions
smart-0.014
itty-0.013
etchup-0.012
rob-0.012
eros-0.012
Token seeds
Token ID11904
Feature activation-5.23e-01

Pos logit contributions
smart+0.021
itty+0.019
 course+0.018
in+0.017
eros+0.017
Neg logit contributions
 hardly-0.026
cases-0.025
tered-0.025
 barely-0.022
 nearly-0.022
Token began
Token ID2540
Feature activation+1.61e+00

Pos logit contributions
smart+0.013
itty+0.012
 course+0.012
eros+0.012
in+0.011
Neg logit contributions
 hardly-0.012
cases-0.012
tered-0.011
 barely-0.010
 nearly-0.010
Token to
Token ID284
Feature activation+6.21e-01

Pos logit contributions
 hardly+0.043
cases+0.041
tered+0.040
 barely+0.037
 nearly+0.036
Neg logit contributions
smart-0.036
itty-0.033
 course-0.032
eros-0.031
in-0.031
Token sp
Token ID599
Feature activation+5.46e+00

Pos logit contributions
cases+0.014
 hardly+0.014
tered+0.013
abs+0.012
 barely+0.012
Neg logit contributions
smart-0.016
 course-0.015
itty-0.014
in-0.014
 welcome-0.014
Tokenurt
Token ID3325
Feature activation+1.60e+00

Pos logit contributions
 hardly+0.116
 couldn+0.112
 barely+0.104
 forgot+0.102
cases+0.102
Neg logit contributions
in-0.153
smart-0.128
ang-0.128
 course-0.112
P-0.112
Token out
Token ID503
Feature activation+7.48e-01

Pos logit contributions
 hardly+0.040
cases+0.038
tered+0.037
 barely+0.034
 nearly+0.033
Neg logit contributions
smart-0.038
itty-0.035
 course-0.034
in-0.033
eros-0.033
Token through
Token ID832
Feature activation-3.07e-01

Pos logit contributions
 hardly+0.020
cases+0.019
tered+0.019
 barely+0.017
 nearly+0.017
Neg logit contributions
smart-0.016
itty-0.015
 course-0.014
eros-0.014
in-0.014
Token the
Token ID262
Feature activation+1.39e+00

Pos logit contributions
smart+0.006
itty+0.006
 course+0.005
eros+0.005
ang+0.005
Neg logit contributions
 hardly-0.009
cases-0.008
tered-0.008
 barely-0.008
 nearly-0.008
Token soft
Token ID2705
Feature activation-4.90e-01

Pos logit contributions
 hardly+0.032
cases+0.031
tered+0.031
 floor+0.028
 closely+0.028
Neg logit contributions
smart-0.035
itty-0.031
ang-0.031
eros-0.030
rob-0.030
Token her
Token ID607
Feature activation+9.25e-01

Pos logit contributions
 hardly+0.030
cases+0.029
tered+0.028
 barely+0.026
 nearly+0.026
Neg logit contributions
smart-0.020
itty-0.018
 course-0.017
in-0.017
eros-0.016
Token toys
Token ID14958
Feature activation+5.92e-01

Pos logit contributions
 hardly+0.022
cases+0.021
tered+0.020
 barely+0.019
 couldn+0.019
Neg logit contributions
smart-0.025
 course-0.021
itty-0.020
 Friendship-0.020
in-0.020
Token and
Token ID290
Feature activation-2.50e-01

Pos logit contributions
 hardly+0.016
cases+0.015
tered+0.015
 barely+0.014
 nearly+0.013
Neg logit contributions
smart-0.013
 course-0.012
itty-0.012
in-0.011
 living-0.011
Token jump
Token ID4391
Feature activation+1.84e+00

Pos logit contributions
smart+0.005
 course+0.005
itty+0.005
 living+0.005
 welcome+0.005
Neg logit contributions
cases-0.006
 hardly-0.006
tered-0.006
late-0.006
abs-0.006
Token on
Token ID319
Feature activation+2.98e-02

Pos logit contributions
 hardly+0.066
cases+0.062
rated+0.061
abs+0.061
appointed+0.059
Neg logit contributions
smart-0.019
 course-0.019
in-0.019
 living-0.018
 on-0.016
Token her
Token ID607
Feature activation+5.46e+00

Pos logit contributions
 hardly+0.001
cases+0.001
tered+0.001
abs+0.001
rated+0.001
Neg logit contributions
smart-0.000
 course-0.000
 living-0.000
itty-0.000
in-0.000
Token bed
Token ID3996
Feature activation+1.77e+00

Pos logit contributions
 hardly+0.133
tered+0.131
cases+0.127
late+0.119
rated+0.118
Neg logit contributions
smart-0.118
 living-0.116
itty-0.112
eros-0.108
 course-0.107
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
 hardly+0.059
rated+0.056
abs+0.056
cases+0.055
appointed+0.054
Neg logit contributions
 living-0.022
smart-0.021
 course-0.017
in-0.016
 kit-0.014
Token One
Token ID1881
Feature activation+1.21e+00

Pos logit contributions
smart+0.013
itty+0.012
 course+0.011
in+0.011
eros+0.011
Neg logit contributions
 hardly-0.010
cases-0.010
tered-0.009
 barely-0.008
 nearly-0.008
Token day
Token ID1110
Feature activation+2.51e-01

Pos logit contributions
 hardly+0.039
cases+0.038
tered+0.037
 barely+0.034
 nearly+0.034
Neg logit contributions
smart-0.020
itty-0.017
 course-0.017
in-0.016
eros-0.016
Token,
Token ID11
Feature activation+2.18e+00

Pos logit contributions
 hardly+0.007
cases+0.007
tered+0.006
 barely+0.006
 nearly+0.006
Neg logit contributions
smart-0.005
itty-0.005
 course-0.005
in-0.004
eros-0.004
Token his
Token ID465
Feature activation-1.92e-01

Pos logit contributions
smart+0.038
itty+0.036
ang+0.034
in+0.033
 course+0.033
Neg logit contributions
 hardly-0.057
tered-0.050
cases-0.050
 barely-0.049
 closely-0.048
Token friend
Token ID1545
Feature activation-1.81e+00

Pos logit contributions
smart+0.005
itty+0.004
 course+0.004
rob+0.004
ang+0.004
Neg logit contributions
 hardly-0.005
cases-0.005
tered-0.004
 closely-0.004
 barely-0.004
Token,
Token ID11
Feature activation-6.23e-01

Pos logit contributions
smart+0.045
 course+0.040
itty+0.040
eros+0.039
in+0.039
Neg logit contributions
 hardly-0.042
cases-0.041
tered-0.040
 closely-0.036
 couldn-0.035
Token a
Token ID257
Feature activation-2.26e+00

Pos logit contributions
smart+0.016
eros+0.015
rob+0.015
 course+0.015
ite+0.014
Neg logit contributions
 hardly-0.014
tered-0.013
 couldn-0.012
 nearly-0.012
 barely-0.012
Token wise
Token ID10787
Feature activation-3.43e+00

Pos logit contributions
rob+0.068
smart+0.067
eros+0.063
ases+0.063
orns+0.062
Neg logit contributions
 hardly-0.041
 barely-0.036
tered-0.034
 couldn-0.033
 floor-0.031
Token old
Token ID1468
Feature activation-5.91e+00

Pos logit contributions
smart+0.090
 course+0.081
etchup+0.079
rob+0.078
itty+0.078
Neg logit contributions
 hardly-0.074
tered-0.072
cases-0.069
 closely-0.067
 curiously-0.064
Token owl
Token ID39610
Feature activation-4.06e+00

Pos logit contributions
smart+0.140
rob+0.122
etchup+0.121
 course+0.118
ang+0.116
Neg logit contributions
 hardly-0.134
tered-0.128
 curiously-0.128
 closely-0.124
 floor-0.123
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
smart+0.090
 welcome+0.090
rob+0.087
 winners+0.084
 Friendship+0.084
Neg logit contributions
 couldn-0.088
 hardly-0.086
tered-0.083
 closely-0.081
 curiously-0.079
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
smart+0.028
itty+0.026
eros+0.025
in+0.025
 course+0.025
Neg logit contributions
 hardly-0.023
cases-0.021
tered-0.021
 couldn-0.020
 barely-0.019
Token asked
Token ID1965
Feature activation-2.42e+00

Pos logit contributions
smart+0.047
eros+0.045
itty+0.045
 course+0.044
ang+0.043
Neg logit contributions
 hardly-0.032
tered-0.031
 couldn-0.029
cases-0.029
 barely-0.027
Token for
Token ID329
Feature activation-2.55e+00

Pos logit contributions
itty+0.053
smart+0.052
ang+0.052
etchup+0.050
idd+0.050
Neg logit contributions
tered-0.060
 hardly-0.058
cases-0.056
 closely-0.054
 couldn-0.053
TokenBut
Token ID1537
Feature activation-7.10e-01

Pos logit contributions
 hardly+0.011
cases+0.011
tered+0.010
rated+0.010
late+0.010
Neg logit contributions
smart-0.009
itty-0.009
in-0.008
 living-0.008
 course-0.008
Token then
Token ID788
Feature activation-6.59e-01

Pos logit contributions
smart+0.019
itty+0.018
 course+0.017
in+0.017
etchup+0.017
Neg logit contributions
 hardly-0.015
cases-0.015
tered-0.014
 barely-0.013
 nearly-0.012
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
smart+0.015
itty+0.013
 course+0.013
ang+0.013
in+0.013
Neg logit contributions
 hardly-0.017
cases-0.017
tered-0.016
 barely-0.015
 nearly-0.015
Token a
Token ID257
Feature activation-2.06e+00

Pos logit contributions
 hardly+0.033
cases+0.033
tered+0.031
abs+0.029
 barely+0.029
Neg logit contributions
smart-0.024
 course-0.022
itty-0.021
eros-0.020
in-0.020
Token friendly
Token ID8030
Feature activation-3.51e+00

Pos logit contributions
smart+0.063
rob+0.059
ang+0.058
eros+0.057
etchup+0.057
Neg logit contributions
 hardly-0.035
tered-0.032
 barely-0.029
cases-0.029
 floor-0.027
Token tow
Token ID6094
Feature activation-5.88e+00

Pos logit contributions
smart+0.081
etchup+0.069
ang+0.069
rob+0.069
 course+0.067
Neg logit contributions
tered-0.085
 hardly-0.084
 floor-0.077
cases-0.075
 barely-0.074
Token truck
Token ID7779
Feature activation-3.52e+00

Pos logit contributions
smart+0.135
etchup+0.123
ang+0.117
iw+0.116
fing+0.112
Neg logit contributions
tered-0.145
 hardly-0.124
bell-0.123
 couldn-0.123
cases-0.121
Token came
Token ID1625
Feature activation-2.06e-01

Pos logit contributions
smart+0.080
itty+0.073
 course+0.072
ang+0.071
etchup+0.071
Neg logit contributions
 hardly-0.089
cases-0.089
tered-0.086
 couldn-0.079
 barely-0.077
Token along
Token ID1863
Feature activation-2.42e-01

Pos logit contributions
smart+0.005
itty+0.004
 course+0.004
eros+0.004
etchup+0.004
Neg logit contributions
 hardly-0.005
cases-0.005
tered-0.005
 barely-0.005
 nearly-0.004
Token and
Token ID290
Feature activation-9.07e-02

Pos logit contributions
smart+0.005
itty+0.005
 course+0.005
in+0.005
eros+0.004
Neg logit contributions
 hardly-0.007
cases-0.006
tered-0.006
 barely-0.006
 nearly-0.005
Token offered
Token ID4438
Feature activation+8.68e-01

Pos logit contributions
smart+0.002
itty+0.002
 course+0.002
in+0.002
eros+0.002
Neg logit contributions
 hardly-0.002
cases-0.002
tered-0.002
 barely-0.002
 nearly-0.002
Token his
Token ID465
Feature activation-4.49e-02

Pos logit contributions
smart+0.033
itty+0.029
 course+0.028
in+0.026
eros+0.026
Neg logit contributions
 hardly-0.060
cases-0.058
tered-0.056
 barely-0.053
 nearly-0.052
Token friend
Token ID1545
Feature activation-1.42e+00

Pos logit contributions
smart+0.001
itty+0.001
 course+0.001
in+0.001
eros+0.001
Neg logit contributions
 hardly-0.001
cases-0.001
tered-0.001
 barely-0.001
 nearly-0.001
Token,
Token ID11
Feature activation-3.46e-01

Pos logit contributions
smart+0.034
 course+0.030
itty+0.030
in+0.029
eros+0.029
Neg logit contributions
 hardly-0.035
cases-0.034
tered-0.033
 barely-0.030
 nearly-0.029
Token a
Token ID257
Feature activation-1.98e+00

Pos logit contributions
smart+0.009
 course+0.008
eros+0.008
itty+0.008
in+0.008
Neg logit contributions
 hardly-0.008
tered-0.008
cases-0.007
 barely-0.007
 nearly-0.007
Token wise
Token ID10787
Feature activation-3.22e+00

Pos logit contributions
smart+0.064
rob+0.061
 course+0.060
eros+0.058
opl+0.057
Neg logit contributions
 hardly-0.031
 barely-0.026
tered-0.026
cases-0.024
 couldn-0.023
Token old
Token ID1468
Feature activation-5.78e+00

Pos logit contributions
smart+0.081
 course+0.074
etchup+0.070
itty+0.068
rob+0.068
Neg logit contributions
 hardly-0.073
tered-0.071
cases-0.070
 closely-0.065
late-0.063
Token owl
Token ID39610
Feature activation-3.92e+00

Pos logit contributions
smart+0.154
 course+0.135
etchup+0.134
 Working+0.127
ez+0.126
Neg logit contributions
 hardly-0.115
tered-0.108
 floor-0.106
cases-0.105
 barely-0.101
Token.
Token ID13
Feature activation-3.83e-01

Pos logit contributions
smart+0.090
 welcome+0.089
 winners+0.085
rob+0.085
 Good+0.085
Neg logit contributions
 couldn-0.080
 hardly-0.078
tered-0.077
 closely-0.074
 curiously-0.071
Token Max
Token ID5436
Feature activation+4.66e-01

Pos logit contributions
smart+0.008
itty+0.007
 course+0.007
in+0.007
eros+0.007
Neg logit contributions
 hardly-0.011
cases-0.011
tered-0.010
 barely-0.009
 nearly-0.009
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 hardly+0.012
cases+0.012
tered+0.011
late+0.011
abs+0.010
Neg logit contributions
smart-0.010
itty-0.009
 course-0.009
in-0.009
eros-0.009
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
smart+0.025
itty+0.024
ang+0.023
 course+0.023
etchup+0.023
Neg logit contributions
 hardly-0.023
tered-0.022
cases-0.022
 couldn-0.020
 barely-0.020
Token his
Token ID465
Feature activation+1.21e-01

Pos logit contributions
smart+0.022
itty+0.020
 course+0.019
ang+0.019
eros+0.019
Neg logit contributions
 hardly-0.025
cases-0.024
tered-0.023
 barely-0.021
 couldn-0.021
Token friend
Token ID1545
Feature activation-2.50e+00

Pos logit contributions
 hardly+0.004
cases+0.004
tered+0.004
 barely+0.004
 nearly+0.004
Neg logit contributions
smart-0.002
itty-0.002
 course-0.002
eros-0.001
in-0.001
Token,
Token ID11
Feature activation-4.92e-01

Pos logit contributions
smart+0.064
rob+0.061
ang+0.059
 course+0.058
etchup+0.058
Neg logit contributions
 hardly-0.053
cases-0.050
tered-0.049
 couldn-0.048
 closely-0.047
Token a
Token ID257
Feature activation-1.33e+00

Pos logit contributions
smart+0.012
rob+0.011
eros+0.011
ang+0.011
etchup+0.011
Neg logit contributions
 hardly-0.011
tered-0.010
 nearly-0.010
 couldn-0.010
cases-0.010
Token wise
Token ID10787
Feature activation-3.14e+00

Pos logit contributions
smart+0.041
rob+0.037
 course+0.036
eros+0.035
ang+0.034
Neg logit contributions
 hardly-0.024
 barely-0.021
tered-0.020
 floor-0.020
 couldn-0.020
Token old
Token ID1468
Feature activation-5.76e+00

Pos logit contributions
smart+0.082
 course+0.074
etchup+0.072
rob+0.071
itty+0.070
Neg logit contributions
 hardly-0.067
tered-0.067
cases-0.063
 closely-0.059
 floor-0.058
Token owl
Token ID39610
Feature activation-3.95e+00

Pos logit contributions
smart+0.137
rob+0.120
ang+0.119
etchup+0.119
fing+0.116
Neg logit contributions
 hardly-0.129
tered-0.123
 floor-0.122
 curiously-0.118
 not-0.117
Token named
Token ID3706
Feature activation-1.84e+00

Pos logit contributions
smart+0.092
rob+0.092
 welcome+0.089
 Working+0.085
 Friendship+0.085
Neg logit contributions
 hardly-0.077
 couldn-0.077
 closely-0.075
 curiously-0.072
tered-0.071
Token O
Token ID440
Feature activation-3.97e-01

Pos logit contributions
smart+0.041
itty+0.037
in+0.037
ang+0.037
 course+0.037
Neg logit contributions
 hardly-0.047
tered-0.045
cases-0.044
 barely-0.040
 couldn-0.040
Tokenll
Token ID297
Feature activation-6.53e-01

Pos logit contributions
smart+0.011
 welcome+0.009
 passenger+0.009
itty+0.009
 course+0.009
Neg logit contributions
 hardly-0.009
cases-0.008
tered-0.008
 nearly-0.007
 barely-0.007
Tokenie
Token ID494
Feature activation-3.19e+00

Pos logit contributions
smart+0.018
itty+0.016
 course+0.016
etchup+0.016
eros+0.015
Neg logit contributions
 hardly-0.014
cases-0.014
tered-0.013
 barely-0.012
 nearly-0.012
Token\n
Token ID198
Feature activation-1.61e-01

Pos logit contributions
smart+0.007
itty+0.006
in+0.005
 course+0.005
eros+0.005
Neg logit contributions
 hardly-0.015
cases-0.015
tered-0.015
 barely-0.014
 nearly-0.013
TokenJust
Token ID5703
Feature activation+6.34e-01

Pos logit contributions
smart+0.004
in+0.003
itty+0.003
 course+0.003
ang+0.003
Neg logit contributions
 hardly-0.004
cases-0.004
tered-0.004
 barely-0.003
rated-0.003
Token then
Token ID788
Feature activation-1.05e+00

Pos logit contributions
 hardly+0.017
cases+0.017
tered+0.015
abs+0.014
 barely+0.014
Neg logit contributions
smart-0.014
 course-0.013
itty-0.012
in-0.012
 welcome-0.012
Token a
Token ID257
Feature activation-2.58e+00

Pos logit contributions
smart+0.025
itty+0.023
 course+0.022
etchup+0.021
ang+0.021
Neg logit contributions
 hardly-0.027
cases-0.026
tered-0.025
 barely-0.023
 nearly-0.023
Token wise
Token ID10787
Feature activation-2.93e+00

Pos logit contributions
smart+0.067
rob+0.062
etchup+0.062
eros+0.061
itty+0.061
Neg logit contributions
 hardly-0.056
tered-0.051
 barely-0.048
 nearly-0.047
cases-0.047
Token old
Token ID1468
Feature activation-5.74e+00

Pos logit contributions
smart+0.078
 course+0.071
itty+0.071
etchup+0.071
idd+0.068
Neg logit contributions
 hardly-0.061
tered-0.060
cases-0.058
 closely-0.053
 barely-0.051
Token fox
Token ID21831
Feature activation-3.47e+00

Pos logit contributions
smart+0.136
etchup+0.126
ang+0.117
 course+0.116
idd+0.114
Neg logit contributions
 hardly-0.129
tered-0.124
 curiously-0.116
cases-0.116
 barely-0.115
Token stepped
Token ID10764
Feature activation-1.25e+00

Pos logit contributions
smart+0.082
etchup+0.077
 welcome+0.077
rob+0.075
 course+0.074
Neg logit contributions
 hardly-0.075
cases-0.074
tered-0.073
 couldn-0.071
 closely-0.069
Token out
Token ID503
Feature activation-5.75e-01

Pos logit contributions
smart+0.028
itty+0.025
 course+0.025
eros+0.024
etchup+0.024
Neg logit contributions
 hardly-0.031
cases-0.030
tered-0.030
 closely-0.029
 nearly-0.028
Token from
Token ID422
Feature activation-2.66e+00

Pos logit contributions
smart+0.014
itty+0.014
 course+0.013
rob+0.013
eros+0.013
Neg logit contributions
 hardly-0.013
cases-0.012
tered-0.012
 closely-0.011
 nearly-0.011
Token behind
Token ID2157
Feature activation-1.95e+00

Pos logit contributions
smart+0.084
itty+0.080
idd+0.077
in+0.077
etchup+0.076
Neg logit contributions
 hardly-0.041
tered-0.041
 closely-0.038
cases-0.037
 curiously-0.035
Token his
Token ID465
Feature activation-1.50e+00

Pos logit contributions
ang+0.042
etchup+0.042
idd+0.041
itty+0.040
ite+0.040
Neg logit contributions
tered-0.059
 hardly-0.058
 couldn-0.053
 curiously-0.052
 closely-0.052
Token friend
Token ID1545
Feature activation-2.18e+00

Pos logit contributions
smart+0.045
rob+0.043
 course+0.042
itty+0.041
ang+0.040
Neg logit contributions
 hardly-0.029
cases-0.025
 closely-0.024
 couldn-0.024
 barely-0.024
Token,
Token ID11
Feature activation-1.69e+00

Pos logit contributions
smart+0.054
rob+0.052
ang+0.051
 course+0.051
etchup+0.050
Neg logit contributions
 hardly-0.047
cases-0.045
tered-0.045
 couldn-0.042
 closely-0.042
Token a
Token ID257
Feature activation-2.08e+00

Pos logit contributions
opl+0.052
rob+0.051
ite+0.050
ites+0.049
smart+0.048
Neg logit contributions
 hardly-0.029
 couldn-0.027
tered-0.027
 not-0.026
 almost-0.025
Token wise
Token ID10787
Feature activation-3.09e+00

Pos logit contributions
rob+0.064
smart+0.061
opl+0.059
 course+0.058
ites+0.058
Neg logit contributions
 hardly-0.035
 barely-0.030
 couldn-0.029
tered-0.028
 nearly-0.028
Token old
Token ID1468
Feature activation-5.63e+00

Pos logit contributions
smart+0.069
 course+0.063
rob+0.062
etchup+0.062
itty+0.060
Neg logit contributions
 hardly-0.075
tered-0.073
 closely-0.072
cases-0.070
 curiously-0.068
Token owl
Token ID39610
Feature activation-3.73e+00

Pos logit contributions
smart+0.130
rob+0.125
etchup+0.121
 course+0.117
ips+0.114
Neg logit contributions
 curiously-0.119
 hardly-0.117
 closely-0.116
 not-0.115
tered-0.111
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
rob+0.110
 winners+0.100
 crowd+0.099
 passenger+0.097
seat+0.097
Neg logit contributions
 closely-0.061
 couldn-0.060
 curiously-0.055
tered-0.054
 hardly-0.051
Token "
Token ID366
Feature activation-2.69e-01

Pos logit contributions
smart+0.030
eros+0.028
rob+0.028
itty+0.028
ang+0.027
Neg logit contributions
 hardly-0.027
 couldn-0.026
tered-0.025
cases-0.024
 nearly-0.023
TokenHow
Token ID2437
Feature activation+8.96e-01

Pos logit contributions
smart+0.005
itty+0.004
eros+0.004
 course+0.004
 welcome+0.004
Neg logit contributions
 hardly-0.008
tered-0.008
cases-0.008
 barely-0.007
 nearly-0.007
Token can
Token ID460
Feature activation+2.27e+00

Pos logit contributions
cases+0.029
 hardly+0.029
tered+0.029
aned+0.027
late+0.027
Neg logit contributions
smart-0.015
 welcome-0.013
in-0.012
 living-0.011
o-0.011
TokenJust
Token ID5703
Feature activation+8.09e-01

Pos logit contributions
cases+0.012
 hardly+0.012
tered+0.011
 barely+0.010
 nearly+0.010
Neg logit contributions
smart-0.012
in-0.011
 course-0.011
P-0.010
 living-0.010
Token then
Token ID788
Feature activation-1.25e+00

Pos logit contributions
 hardly+0.021
cases+0.021
tered+0.018
abs+0.018
 barely+0.018
Neg logit contributions
smart-0.018
 course-0.017
itty-0.016
in-0.016
eros-0.015
Token,
Token ID11
Feature activation+5.81e-01

Pos logit contributions
smart+0.029
itty+0.028
 course+0.027
ang+0.026
etchup+0.026
Neg logit contributions
 hardly-0.031
cases-0.030
tered-0.029
 barely-0.026
 couldn-0.026
Token a
Token ID257
Feature activation-2.50e+00

Pos logit contributions
 hardly+0.017
cases+0.016
tered+0.015
abs+0.015
late+0.014
Neg logit contributions
smart-0.012
 course-0.010
in-0.010
itty-0.010
 welcome-0.010
Token wise
Token ID10787
Feature activation-3.07e+00

Pos logit contributions
smart+0.067
rob+0.067
etchup+0.062
orns+0.062
ang+0.061
Neg logit contributions
 hardly-0.050
tered-0.044
 barely-0.043
 nearly-0.040
 couldn-0.039
Token old
Token ID1468
Feature activation-5.59e+00

Pos logit contributions
smart+0.081
 course+0.074
itty+0.074
etchup+0.073
rob+0.070
Neg logit contributions
 hardly-0.066
tered-0.064
cases-0.062
 closely-0.056
 barely-0.055
Token owl
Token ID39610
Feature activation-3.54e+00

Pos logit contributions
smart+0.128
etchup+0.115
itty+0.108
ang+0.108
rob+0.106
Neg logit contributions
 hardly-0.134
tered-0.129
 barely-0.121
 curiously-0.118
 closely-0.118
Token swoop
Token ID38527
Feature activation-1.35e+00

Pos logit contributions
 welcome+0.089
smart+0.087
 Friendship+0.083
etchup+0.082
 Working+0.081
Neg logit contributions
 hardly-0.070
tered-0.069
 couldn-0.068
cases-0.068
 closely-0.063
Tokened
Token ID276
Feature activation-2.30e+00

Pos logit contributions
smart+0.031
 course+0.028
itty+0.027
etchup+0.026
eros+0.026
Neg logit contributions
 hardly-0.034
cases-0.034
tered-0.033
 nearly-0.030
 couldn-0.030
Token down
Token ID866
Feature activation-2.50e+00

Pos logit contributions
smart+0.046
itty+0.043
rob+0.043
ang+0.041
etchup+0.041
Neg logit contributions
tered-0.063
cases-0.060
 closely-0.059
 hardly-0.059
 curiously-0.057
Token from
Token ID422
Feature activation-2.58e+00

Pos logit contributions
smart+0.068
itty+0.067
 course+0.064
rob+0.064
eros+0.063
Neg logit contributions
 hardly-0.049
tered-0.047
cases-0.046
 closely-0.042
 nearly-0.041
Token his
Token ID465
Feature activation+6.23e-01

Pos logit contributions
smart+0.024
itty+0.022
ang+0.022
eros+0.021
etchup+0.021
Neg logit contributions
 hardly-0.019
tered-0.018
cases-0.018
 barely-0.016
 couldn-0.016
Token friend
Token ID1545
Feature activation-1.50e+00

Pos logit contributions
 hardly+0.020
cases+0.020
tered+0.019
 barely+0.018
 nearly+0.018
Neg logit contributions
smart-0.010
itty-0.009
 course-0.009
eros-0.008
in-0.008
Token,
Token ID11
Feature activation+1.44e-01

Pos logit contributions
smart+0.038
 course+0.035
rob+0.034
ang+0.034
etchup+0.034
Neg logit contributions
 hardly-0.033
cases-0.031
tered-0.031
 couldn-0.029
 nearly-0.028
Token a
Token ID257
Feature activation-1.01e+00

Pos logit contributions
 hardly+0.003
cases+0.003
tered+0.003
 barely+0.003
 nearly+0.003
Neg logit contributions
smart-0.003
 course-0.003
itty-0.003
eros-0.003
ang-0.003
Token wise
Token ID10787
Feature activation-2.99e+00

Pos logit contributions
smart+0.031
 course+0.027
rob+0.027
 Working+0.026
eros+0.026
Neg logit contributions
 hardly-0.018
tered-0.016
 barely-0.015
cases-0.015
 couldn-0.015
Token old
Token ID1468
Feature activation-5.57e+00

Pos logit contributions
smart+0.066
 course+0.058
etchup+0.057
rob+0.054
itty+0.054
Neg logit contributions
 hardly-0.076
tered-0.075
cases-0.072
 floor-0.067
 closely-0.067
Token owl
Token ID39610
Feature activation-3.47e+00

Pos logit contributions
smart+0.135
etchup+0.120
ang+0.116
 Working+0.115
fing+0.114
Neg logit contributions
 hardly-0.124
 floor-0.119
tered-0.117
 not-0.112
 barely-0.111
Token,
Token ID11
Feature activation-2.40e-01

Pos logit contributions
smart+0.079
 welcome+0.077
 winners+0.076
rob+0.075
 Working+0.073
Neg logit contributions
 couldn-0.069
 hardly-0.069
 closely-0.067
tered-0.066
 curiously-0.062
Token sitting
Token ID5586
Feature activation-1.64e+00

Pos logit contributions
smart+0.005
itty+0.005
 course+0.005
 living+0.004
in+0.004
Neg logit contributions
 hardly-0.006
cases-0.006
tered-0.006
 barely-0.005
late-0.005
Token on
Token ID319
Feature activation-1.32e+00

Pos logit contributions
smart+0.033
 course+0.030
 Friendship+0.028
 Working+0.028
itty+0.028
Neg logit contributions
tered-0.042
cases-0.040
 almost-0.039
 hardly-0.039
 couldn-0.038
Token a
Token ID257
Feature activation+4.14e+00

Pos logit contributions
smart+0.030
itty+0.027
 course+0.027
eros+0.025
in+0.025
Neg logit contributions
 hardly-0.034
cases-0.033
tered-0.032
 barely-0.029
 nearly-0.029
Token her
Token ID607
Feature activation-1.45e+00

Pos logit contributions
smart+0.038
itty+0.036
 course+0.034
ang+0.034
idd+0.033
Neg logit contributions
 hardly-0.063
tered-0.061
cases-0.059
 barely-0.055
 closely-0.055
Token friend
Token ID1545
Feature activation-2.35e+00

Pos logit contributions
smart+0.035
 course+0.033
itty+0.033
rob+0.033
ang+0.031
Neg logit contributions
 hardly-0.035
cases-0.034
tered-0.032
 closely-0.031
 barely-0.030
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
smart+0.052
itty+0.047
 course+0.047
ang+0.044
eros+0.044
Neg logit contributions
 hardly-0.062
cases-0.060
tered-0.058
 barely-0.052
 nearly-0.052
Token a
Token ID257
Feature activation-1.86e+00

Pos logit contributions
rob+0.044
smart+0.044
ite+0.043
eros+0.042
ang+0.042
Neg logit contributions
 hardly-0.027
 not-0.024
 almost-0.024
 couldn-0.023
 nearly-0.023
Token wise
Token ID10787
Feature activation-3.02e+00

Pos logit contributions
smart+0.053
rob+0.053
ang+0.050
 course+0.049
eros+0.049
Neg logit contributions
 hardly-0.035
 barely-0.030
 floor-0.030
tered-0.029
 couldn-0.029
Token old
Token ID1468
Feature activation-5.57e+00

Pos logit contributions
smart+0.079
 course+0.072
etchup+0.070
rob+0.070
itty+0.069
Neg logit contributions
 hardly-0.064
tered-0.062
cases-0.059
 closely-0.056
 floor-0.054
Token owl
Token ID39610
Feature activation-4.07e+00

Pos logit contributions
smart+0.132
rob+0.118
etchup+0.117
ang+0.115
 course+0.114
Neg logit contributions
 hardly-0.124
tered-0.117
 not-0.117
 floor-0.114
 curiously-0.110
Token named
Token ID3706
Feature activation-1.45e+00

Pos logit contributions
rob+0.109
smart+0.100
 Good+0.100
 crowd+0.099
P+0.099
Neg logit contributions
 couldn-0.067
 hardly-0.065
 closely-0.065
tered-0.063
 curiously-0.057
Token H
Token ID367
Feature activation+1.41e+00

Pos logit contributions
smart+0.028
itty+0.025
 course+0.024
in+0.024
ang+0.023
Neg logit contributions
 hardly-0.042
cases-0.041
tered-0.040
 barely-0.037
 couldn-0.036
Tokenoot
Token ID1025
Feature activation-2.44e+00

Pos logit contributions
 hardly+0.038
cases+0.037
tered+0.035
 barely+0.033
 nearly+0.032
Neg logit contributions
smart-0.032
itty-0.030
 course-0.028
in-0.028
ang-0.027
Tokeny
Token ID88
Feature activation-3.62e+00

Pos logit contributions
smart+0.065
 course+0.060
 welcome+0.060
etchup+0.059
 Friendship+0.058
Neg logit contributions
cases-0.048
tered-0.046
 hardly-0.044
 couldn-0.043
 closely-0.040
Token's
Token ID338
Feature activation+2.84e-01

Pos logit contributions
cases+0.079
 hardly+0.072
abs+0.070
tered+0.067
late+0.066
Neg logit contributions
smart-0.054
itty-0.051
 course-0.048
o-0.048
in-0.047
Token friend
Token ID1545
Feature activation-8.73e-01

Pos logit contributions
cases+0.010
 hardly+0.009
tered+0.009
rated+0.008
 nearly+0.008
Neg logit contributions
smart-0.004
itty-0.004
eros-0.003
in-0.003
 course-0.003
Token,
Token ID11
Feature activation-9.47e-01

Pos logit contributions
smart+0.020
 course+0.018
itty+0.018
eros+0.017
etchup+0.017
Neg logit contributions
 hardly-0.023
cases-0.022
tered-0.021
 barely-0.019
 couldn-0.019
Token a
Token ID257
Feature activation-1.79e+00

Pos logit contributions
smart+0.025
ang+0.023
eros+0.022
 course+0.022
itty+0.022
Neg logit contributions
 hardly-0.021
tered-0.020
 couldn-0.019
cases-0.019
 nearly-0.018
Token wise
Token ID10787
Feature activation-3.06e+00

Pos logit contributions
smart+0.053
rob+0.052
 course+0.051
eros+0.049
ang+0.048
Neg logit contributions
 hardly-0.032
 barely-0.027
tered-0.026
 couldn-0.026
cases-0.025
Token old
Token ID1468
Feature activation-5.48e+00

Pos logit contributions
smart+0.069
 course+0.063
etchup+0.060
itty+0.059
rob+0.059
Neg logit contributions
 hardly-0.077
tered-0.075
cases-0.072
 closely-0.069
 curiously-0.067
Token owl
Token ID39610
Feature activation-3.79e+00

Pos logit contributions
smart+0.121
 course+0.106
etchup+0.105
rob+0.104
ang+0.101
Neg logit contributions
 hardly-0.133
tered-0.126
 curiously-0.124
 couldn-0.122
 floor-0.121
Token named
Token ID3706
Feature activation-1.16e+00

Pos logit contributions
 welcome+0.088
 winners+0.087
 Friendship+0.085
rob+0.085
smart+0.084
Neg logit contributions
 couldn-0.079
 hardly-0.073
tered-0.070
 closely-0.069
 curiously-0.066
Token O
Token ID440
Feature activation-1.33e-01

Pos logit contributions
smart+0.021
itty+0.018
 course+0.018
eros+0.017
in+0.017
Neg logit contributions
 hardly-0.036
cases-0.035
tered-0.034
 barely-0.032
 nearly-0.031
Tokenll
Token ID297
Feature activation+4.09e-01

Pos logit contributions
smart+0.003
itty+0.003
 course+0.003
eros+0.003
 welcome+0.003
Neg logit contributions
 hardly-0.003
cases-0.003
tered-0.003
 nearly-0.002
 barely-0.002
Tokenie
Token ID494
Feature activation-2.48e+00

Pos logit contributions
 hardly+0.010
cases+0.010
tered+0.009
 barely+0.009
late+0.008
Neg logit contributions
smart-0.010
itty-0.009
 course-0.009
in-0.009
eros-0.008
Token her
Token ID607
Feature activation+5.13e-01

Pos logit contributions
smart+0.022
itty+0.020
 course+0.020
ang+0.019
eros+0.019
Neg logit contributions
 hardly-0.023
cases-0.022
tered-0.021
 barely-0.019
 nearly-0.019
Token friend
Token ID1545
Feature activation-2.01e+00

Pos logit contributions
cases+0.018
 hardly+0.018
tered+0.018
rated+0.016
late+0.016
Neg logit contributions
smart-0.007
itty-0.006
eros-0.006
 living-0.005
 course-0.005
Token,
Token ID11
Feature activation-5.18e-01

Pos logit contributions
smart+0.048
rob+0.044
 course+0.043
etchup+0.043
ang+0.043
Neg logit contributions
 hardly-0.046
tered-0.044
cases-0.044
 couldn-0.042
 closely-0.041
Token a
Token ID257
Feature activation-1.56e+00

Pos logit contributions
smart+0.013
 course+0.012
rob+0.012
etchup+0.012
ang+0.012
Neg logit contributions
 hardly-0.012
tered-0.011
 couldn-0.010
 nearly-0.010
cases-0.010
Token wise
Token ID10787
Feature activation-2.97e+00

Pos logit contributions
smart+0.047
rob+0.044
 course+0.043
etchup+0.041
eros+0.041
Neg logit contributions
 hardly-0.028
 barely-0.024
tered-0.024
 couldn-0.023
 floor-0.023
Token old
Token ID1468
Feature activation-5.46e+00

Pos logit contributions
smart+0.065
 course+0.058
etchup+0.057
rob+0.056
itty+0.055
Neg logit contributions
 hardly-0.076
tered-0.075
cases-0.071
 closely-0.067
 barely-0.066
Token owl
Token ID39610
Feature activation-3.67e+00

Pos logit contributions
smart+0.118
etchup+0.106
ang+0.104
fing+0.103
 course+0.102
Neg logit contributions
 hardly-0.132
tered-0.128
 floor-0.124
 curiously-0.122
 not-0.121
Token named
Token ID3706
Feature activation-1.50e+00

Pos logit contributions
smart+0.080
 welcome+0.080
rob+0.076
 Working+0.075
 Friendship+0.075
Neg logit contributions
 hardly-0.079
 couldn-0.078
 closely-0.076
tered-0.075
 curiously-0.073
Token H
Token ID367
Feature activation+1.57e+00

Pos logit contributions
smart+0.029
itty+0.026
 course+0.025
in+0.025
ang+0.024
Neg logit contributions
 hardly-0.044
cases-0.042
tered-0.041
 barely-0.038
 nearly-0.037
Tokenoot
Token ID1025
Feature activation-1.72e+00

Pos logit contributions
 hardly+0.043
cases+0.042
tered+0.040
 barely+0.037
 nearly+0.036
Neg logit contributions
smart-0.035
itty-0.033
 course-0.031
in-0.030
ang-0.030
Tokenie
Token ID494
Feature activation-2.93e+00

Pos logit contributions
smart+0.049
 course+0.046
 welcome+0.045
etchup+0.045
itty+0.044
Neg logit contributions
cases-0.031
 hardly-0.031
tered-0.030
 couldn-0.027
 closely-0.026
TokenJust
Token ID5703
Feature activation+6.45e-01

Pos logit contributions
cases+0.008
 hardly+0.008
tered+0.007
rated+0.007
 nearly+0.007
Neg logit contributions
smart-0.005
itty-0.005
in-0.004
 course-0.004
P-0.004
Token then
Token ID788
Feature activation-4.79e-01

Pos logit contributions
 hardly+0.016
cases+0.016
tered+0.014
 barely+0.014
abs+0.013
Neg logit contributions
smart-0.015
 course-0.014
itty-0.014
in-0.013
 welcome-0.013
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
smart+0.011
 course+0.010
itty+0.010
etchup+0.010
eros+0.010
Neg logit contributions
 hardly-0.012
cases-0.011
tered-0.011
 barely-0.010
 nearly-0.010
Token a
Token ID257
Feature activation-2.07e+00

Pos logit contributions
cases+0.032
 hardly+0.032
tered+0.030
abs+0.028
late+0.028
Neg logit contributions
smart-0.021
 course-0.019
itty-0.019
in-0.017
 welcome-0.017
Token wise
Token ID10787
Feature activation-2.81e+00

Pos logit contributions
smart+0.062
rob+0.058
eros+0.055
etchup+0.055
eper+0.055
Neg logit contributions
 hardly-0.036
tered-0.033
 barely-0.030
 couldn-0.029
 nearly-0.029
Token old
Token ID1468
Feature activation-5.46e+00

Pos logit contributions
smart+0.074
 course+0.067
etchup+0.066
itty+0.065
rob+0.064
Neg logit contributions
 hardly-0.060
tered-0.058
cases-0.056
 closely-0.052
late-0.050
Token owl
Token ID39610
Feature activation-3.47e+00

Pos logit contributions
smart+0.143
etchup+0.128
ang+0.123
 course+0.119
ases+0.119
Neg logit contributions
 hardly-0.108
tered-0.106
 floor-0.101
 couldn-0.097
 barely-0.097
Token flew
Token ID13112
Feature activation-1.19e+00

Pos logit contributions
smart+0.086
 welcome+0.086
 Working+0.081
 Friendship+0.081
etchup+0.080
Neg logit contributions
 hardly-0.069
 couldn-0.068
tered-0.066
cases-0.065
 closely-0.062
Token down
Token ID866
Feature activation-1.70e+00

Pos logit contributions
smart+0.023
itty+0.020
 course+0.020
etchup+0.019
rob+0.019
Neg logit contributions
 hardly-0.034
tered-0.034
cases-0.034
 nearly-0.031
 closely-0.031
Token and
Token ID290
Feature activation-3.95e-01

Pos logit contributions
smart+0.041
itty+0.039
eros+0.038
 course+0.038
in+0.037
Neg logit contributions
 hardly-0.039
tered-0.038
cases-0.037
 barely-0.034
 nearly-0.033
Token perched
Token ID49264
Feature activation-1.45e+00

Pos logit contributions
smart+0.011
itty+0.010
 course+0.010
eros+0.010
in+0.009
Neg logit contributions
 hardly-0.009
cases-0.008
tered-0.008
 couldn-0.007
 nearly-0.007
TokenJust
Token ID5703
Feature activation+9.37e-01

Pos logit contributions
 hardly+0.005
cases+0.005
tered+0.005
 barely+0.004
 nearly+0.004
Neg logit contributions
smart-0.006
itty-0.006
 course-0.006
eros-0.006
etchup-0.005
Token then
Token ID788
Feature activation-9.83e-01

Pos logit contributions
cases+0.025
 hardly+0.025
tered+0.022
abs+0.022
 barely+0.021
Neg logit contributions
smart-0.020
 course-0.019
itty-0.018
in-0.017
 welcome-0.017
Token,
Token ID11
Feature activation+3.75e-01

Pos logit contributions
smart+0.023
itty+0.021
 course+0.021
etchup+0.020
ang+0.020
Neg logit contributions
 hardly-0.025
cases-0.024
tered-0.023
 barely-0.021
 nearly-0.021
Token a
Token ID257
Feature activation-2.28e+00

Pos logit contributions
 hardly+0.010
cases+0.010
tered+0.010
 barely+0.009
 nearly+0.009
Neg logit contributions
smart-0.008
itty-0.007
 course-0.007
in-0.007
eros-0.007
Token wise
Token ID10787
Feature activation-2.87e+00

Pos logit contributions
smart+0.064
rob+0.062
orns+0.058
 Friendship+0.057
ases+0.057
Neg logit contributions
 hardly-0.044
 barely-0.038
tered-0.038
 nearly-0.036
 couldn-0.035
Token old
Token ID1468
Feature activation-5.44e+00

Pos logit contributions
smart+0.078
 course+0.070
itty+0.070
etchup+0.070
rob+0.067
Neg logit contributions
 hardly-0.060
tered-0.058
cases-0.055
 closely-0.051
 barely-0.050
Token owl
Token ID39610
Feature activation-3.47e+00

Pos logit contributions
smart+0.131
etchup+0.115
ang+0.111
 course+0.108
rob+0.107
Neg logit contributions
 hardly-0.125
tered-0.118
 barely-0.112
 floor-0.109
 couldn-0.108
Token flew
Token ID13112
Feature activation-1.34e+00

Pos logit contributions
smart+0.084
 welcome+0.082
 Friendship+0.079
etchup+0.076
 Working+0.076
Neg logit contributions
 hardly-0.074
tered-0.071
cases-0.070
 couldn-0.070
 nearly-0.065
Token by
Token ID416
Feature activation-5.69e-01

Pos logit contributions
smart+0.027
itty+0.025
rob+0.024
 course+0.024
etchup+0.024
Neg logit contributions
 hardly-0.037
tered-0.036
cases-0.036
 nearly-0.034
 closely-0.033
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
smart+0.015
itty+0.014
 course+0.013
etchup+0.013
in+0.013
Neg logit contributions
 hardly-0.013
tered-0.012
cases-0.012
 barely-0.011
 nearly-0.011
Token "
Token ID366
Feature activation-1.30e+00

Pos logit contributions
smart+0.007
 course+0.006
itty+0.006
in+0.006
etchup+0.006
Neg logit contributions
 hardly-0.008
cases-0.007
tered-0.007
 barely-0.007
 nearly-0.006
Token friend
Token ID1545
Feature activation-2.35e+00

Pos logit contributions
smart+0.004
itty+0.004
 course+0.004
rob+0.004
etchup+0.004
Neg logit contributions
 hardly-0.005
cases-0.005
tered-0.005
 barely-0.005
 closely-0.004
Token Ralph
Token ID20993
Feature activation-1.76e+00

Pos logit contributions
smart+0.055
 course+0.050
itty+0.050
rob+0.049
etchup+0.049
Neg logit contributions
 hardly-0.057
cases-0.056
tered-0.053
 couldn-0.050
 nearly-0.049
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
smart+0.040
rob+0.037
itty+0.037
 course+0.036
idd+0.035
Neg logit contributions
 hardly-0.043
cases-0.041
tered-0.041
 closely-0.040
 curiously-0.038
Token a
Token ID257
Feature activation-1.79e+00

Pos logit contributions
smart+0.030
 course+0.028
rob+0.028
itty+0.027
etchup+0.026
Neg logit contributions
 hardly-0.035
tered-0.032
 nearly-0.032
 couldn-0.032
cases-0.031
Token friendly
Token ID8030
Feature activation-4.29e+00

Pos logit contributions
smart+0.049
itty+0.045
rob+0.045
 course+0.044
etchup+0.044
Neg logit contributions
 hardly-0.039
 barely-0.033
cases-0.032
 nearly-0.032
tered-0.032
Token old
Token ID1468
Feature activation-5.37e+00

Pos logit contributions
smart+0.092
rob+0.081
etchup+0.080
itty+0.076
ites+0.075
Neg logit contributions
 hardly-0.115
tered-0.108
 curiously-0.105
 closely-0.103
 barely-0.101
Token turtle
Token ID28699
Feature activation-4.36e+00

Pos logit contributions
smart+0.133
etchup+0.113
itty+0.111
 course+0.109
 Working+0.107
Neg logit contributions
 hardly-0.128
tered-0.116
 curiously-0.116
 barely-0.113
 closely-0.112
Token.
Token ID13
Feature activation-9.84e-01

Pos logit contributions
smart+0.102
rob+0.100
aid+0.093
etchup+0.093
 welcome+0.091
Neg logit contributions
 hardly-0.097
 couldn-0.094
tered-0.089
cases-0.088
 curiously-0.086
Token Ralph
Token ID20993
Feature activation+3.60e-02

Pos logit contributions
smart+0.022
 course+0.020
itty+0.020
in+0.019
etchup+0.018
Neg logit contributions
 hardly-0.027
cases-0.025
tered-0.025
 barely-0.023
 nearly-0.022
Token said
Token ID531
Feature activation-1.75e+00

Pos logit contributions
 hardly+0.001
cases+0.001
tered+0.001
 barely+0.001
late+0.001
Neg logit contributions
smart-0.001
itty-0.001
 course-0.001
in-0.001
eros-0.001
Token to
Token ID284
Feature activation-1.62e+00

Pos logit contributions
itty+0.044
etchup+0.043
smart+0.043
rob+0.043
ang+0.042
Neg logit contributions
 hardly-0.041
tered-0.039
cases-0.037
 barely-0.035
 couldn-0.035
Token boy
Token ID2933
Feature activation-8.95e-01

Pos logit contributions
smart+0.102
 course+0.081
opl+0.073
rob+0.073
fing+0.073
Neg logit contributions
tered-0.098
 hardly-0.098
cases-0.095
 floor-0.092
 curiously-0.090
Token called
Token ID1444
Feature activation-5.31e-01

Pos logit contributions
smart+0.025
rob+0.024
 course+0.023
 passenger+0.023
 welcome+0.023
Neg logit contributions
 hardly-0.016
tered-0.015
 couldn-0.014
cases-0.014
 closely-0.013
Token Sam
Token ID3409
Feature activation+8.35e-01

Pos logit contributions
smart+0.013
itty+0.012
 course+0.012
in+0.012
ang+0.012
Neg logit contributions
 hardly-0.012
cases-0.012
tered-0.012
 barely-0.010
 couldn-0.010
Token and
Token ID290
Feature activation-2.40e+00

Pos logit contributions
 hardly+0.014
cases+0.013
tered+0.013
 couldn+0.011
 closely+0.011
Neg logit contributions
smart-0.026
itty-0.024
 course-0.024
eros-0.023
ang-0.023
Token an
Token ID281
Feature activation+1.62e+00

Pos logit contributions
smart+0.059
eros+0.056
ang+0.055
itty+0.054
in+0.053
Neg logit contributions
 hardly-0.058
tered-0.054
cases-0.051
 couldn-0.050
 barely-0.049
Token old
Token ID1468
Feature activation-5.37e+00

Pos logit contributions
 hardly+0.047
cases+0.046
tered+0.045
 barely+0.041
 nearly+0.040
Neg logit contributions
smart-0.032
itty-0.028
 course-0.027
in-0.026
etchup-0.026
Token lady
Token ID10846
Feature activation-3.47e+00

Pos logit contributions
smart+0.137
ang+0.108
 course+0.106
etchup+0.105
ez+0.104
Neg logit contributions
 hardly-0.122
 floor-0.120
cases-0.118
 couldn-0.116
tered-0.114
Token called
Token ID1444
Feature activation-1.20e+00

Pos logit contributions
rob+0.106
eros+0.100
P+0.100
smart+0.099
 course+0.099
Neg logit contributions
 couldn-0.047
tered-0.045
 closely-0.044
cases-0.044
 hardly-0.043
Token Grace
Token ID16156
Feature activation-1.79e+00

Pos logit contributions
smart+0.029
itty+0.027
 course+0.027
ang+0.026
in+0.026
Neg logit contributions
 hardly-0.028
cases-0.027
tered-0.027
 barely-0.023
 couldn-0.023
Token.
Token ID13
Feature activation-9.60e-02

Pos logit contributions
smart+0.049
 course+0.047
rob+0.046
itty+0.045
etchup+0.044
Neg logit contributions
tered-0.036
cases-0.036
 hardly-0.036
 couldn-0.032
 barely-0.030
Token Sam
Token ID3409
Feature activation+1.40e+00

Pos logit contributions
smart+0.002
 course+0.002
itty+0.002
 welcome+0.001
 Friendship+0.001
Neg logit contributions
 hardly-0.003
cases-0.003
tered-0.003
 barely-0.003
 nearly-0.002
Token his
Token ID465
Feature activation-1.53e+00

Pos logit contributions
smart+0.035
itty+0.033
etchup+0.031
ang+0.030
 course+0.030
Neg logit contributions
 hardly-0.068
tered-0.066
cases-0.064
 closely-0.060
 barely-0.060
Token friend
Token ID1545
Feature activation-2.38e+00

Pos logit contributions
smart+0.041
rob+0.039
itty+0.038
 course+0.038
ang+0.036
Neg logit contributions
 hardly-0.034
cases-0.031
 closely-0.029
 couldn-0.029
tered-0.029
Token,
Token ID11
Feature activation-1.81e+00

Pos logit contributions
smart+0.054
itty+0.049
 course+0.049
ang+0.047
eros+0.047
Neg logit contributions
 hardly-0.060
cases-0.058
tered-0.057
 closely-0.051
 barely-0.051
Token a
Token ID257
Feature activation-2.10e+00

Pos logit contributions
rob+0.052
ite+0.048
opl+0.048
smart+0.047
ites+0.047
Neg logit contributions
 hardly-0.038
tered-0.033
 couldn-0.033
 nearly-0.032
 curiously-0.032
Token wise
Token ID10787
Feature activation-3.03e+00

Pos logit contributions
rob+0.053
smart+0.052
eros+0.048
etchup+0.048
ang+0.047
Neg logit contributions
 hardly-0.047
 barely-0.041
tered-0.040
 couldn-0.040
 nearly-0.040
Token old
Token ID1468
Feature activation-5.36e+00

Pos logit contributions
smart+0.078
etchup+0.071
 course+0.070
rob+0.070
itty+0.070
Neg logit contributions
 hardly-0.065
tered-0.063
cases-0.061
 closely-0.059
 curiously-0.057
Token owl
Token ID39610
Feature activation-3.75e+00

Pos logit contributions
smart+0.116
etchup+0.107
rob+0.106
ang+0.098
idd+0.097
Neg logit contributions
 hardly-0.127
 curiously-0.123
tered-0.122
 closely-0.119
 couldn-0.115
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
rob+0.098
smart+0.096
idd+0.093
etchup+0.093
 Friendship+0.092
Neg logit contributions
 closely-0.066
 hardly-0.066
 couldn-0.065
tered-0.064
 curiously-0.062
Token for
Token ID329
Feature activation-1.58e+00

Pos logit contributions
smart+0.031
rob+0.030
itty+0.030
eros+0.030
ang+0.029
Neg logit contributions
 hardly-0.026
cases-0.024
 couldn-0.024
tered-0.024
 nearly-0.022
Token help
Token ID1037
Feature activation+1.24e+00

Pos logit contributions
itty+0.033
ite+0.032
ang+0.031
smart+0.031
rob+0.030
Neg logit contributions
 hardly-0.045
tered-0.041
 barely-0.039
cases-0.038
 nearly-0.037
Token.
Token ID13
Feature activation-6.29e-01

Pos logit contributions
 hardly+0.034
cases+0.033
tered+0.032
 barely+0.029
 nearly+0.029
Neg logit contributions
smart-0.027
itty-0.024
 course-0.024
in-0.022
eros-0.022
Token Just
Token ID2329
Feature activation+7.16e-01

Pos logit contributions
smart+0.004
 living+0.003
itty+0.003
 Friendship+0.003
 Working+0.003
Neg logit contributions
cases-0.006
 hardly-0.006
tered-0.005
abs-0.005
rated-0.005
Token then
Token ID788
Feature activation-1.17e+00

Pos logit contributions
cases+0.017
 hardly+0.017
tered+0.015
abs+0.014
 barely+0.014
Neg logit contributions
smart-0.018
 course-0.016
itty-0.016
in-0.015
eros-0.015
Token,
Token ID11
Feature activation+2.68e-01

Pos logit contributions
smart+0.031
rob+0.030
 course+0.030
itty+0.029
etchup+0.029
Neg logit contributions
 hardly-0.025
tered-0.023
cases-0.023
 couldn-0.022
 barely-0.021
Token a
Token ID257
Feature activation-2.62e+00

Pos logit contributions
 hardly+0.007
cases+0.007
tered+0.007
 barely+0.006
 nearly+0.006
Neg logit contributions
smart-0.006
itty-0.005
 course-0.005
in-0.005
eros-0.005
Token kind
Token ID1611
Feature activation-3.15e+00

Pos logit contributions
rob+0.072
smart+0.070
etchup+0.067
ang+0.066
orns+0.066
Neg logit contributions
 hardly-0.050
tered-0.046
 barely-0.043
 nearly-0.041
 couldn-0.041
Token old
Token ID1468
Feature activation-5.36e+00

Pos logit contributions
smart+0.074
rob+0.070
etchup+0.068
 course+0.066
idd+0.064
Neg logit contributions
tered-0.073
 hardly-0.072
cases-0.066
 closely-0.065
 curiously-0.063
Token lady
Token ID10846
Feature activation-4.16e+00

Pos logit contributions
smart+0.143
ang+0.129
etchup+0.127
rob+0.124
ites+0.122
Neg logit contributions
 hardly-0.102
tered-0.100
 floor-0.099
 curiously-0.093
 barely-0.090
Token appeared
Token ID4120
Feature activation-7.06e-01

Pos logit contributions
rob+0.107
opl+0.104
 winners+0.103
 welcome+0.100
smart+0.100
Neg logit contributions
 couldn-0.075
bell-0.073
 curiously-0.071
 closely-0.071
cases-0.070
Token.
Token ID13
Feature activation-8.35e-01

Pos logit contributions
smart+0.018
itty+0.016
 course+0.016
etchup+0.016
rob+0.016
Neg logit contributions
tered-0.015
cases-0.015
 hardly-0.015
 closely-0.014
 nearly-0.013
Token
Token ID220
Feature activation+7.49e-01

Pos logit contributions
smart+0.021
itty+0.020
 course+0.020
etchup+0.019
eros+0.019
Neg logit contributions
 hardly-0.019
cases-0.019
tered-0.018
 couldn-0.016
 barely-0.016
Token
Token ID198
Feature activation-1.09e+00

Pos logit contributions
 hardly+0.026
cases+0.025
tered+0.024
 barely+0.023
 nearly+0.022
Neg logit contributions
smart-0.011
itty-0.010
 course-0.009
in-0.008
 welcome-0.008
TokenJust
Token ID5703
Feature activation+5.82e-01

Pos logit contributions
 hardly+0.001
cases+0.001
tered+0.001
 barely+0.001
 nearly+0.001
Neg logit contributions
smart-0.002
itty-0.002
 course-0.002
eros-0.002
in-0.002
Token then
Token ID788
Feature activation-9.23e-01

Pos logit contributions
 hardly+0.013
cases+0.013
tered+0.012
 barely+0.011
abs+0.011
Neg logit contributions
smart-0.015
 course-0.014
itty-0.014
in-0.013
eros-0.013
Token,
Token ID11
Feature activation+6.04e-01

Pos logit contributions
smart+0.021
itty+0.019
 course+0.019
ang+0.018
etchup+0.018
Neg logit contributions
 hardly-0.024
cases-0.023
tered-0.022
 barely-0.020
 nearly-0.020
Token a
Token ID257
Feature activation-2.15e+00

Pos logit contributions
 hardly+0.017
cases+0.016
tered+0.016
 barely+0.014
 nearly+0.014
Neg logit contributions
smart-0.013
itty-0.012
 course-0.011
in-0.011
eros-0.011
Token wise
Token ID10787
Feature activation-2.81e+00

Pos logit contributions
smart+0.060
rob+0.058
etchup+0.055
itty+0.055
eros+0.054
Neg logit contributions
 hardly-0.043
tered-0.036
 barely-0.036
 nearly-0.034
cases-0.033
Token old
Token ID1468
Feature activation-5.36e+00

Pos logit contributions
smart+0.070
itty+0.064
 course+0.064
etchup+0.062
rob+0.060
Neg logit contributions
 hardly-0.065
tered-0.062
cases-0.060
 barely-0.055
 closely-0.054
Token sk
Token ID1341
Feature activation-2.11e+00

Pos logit contributions
smart+0.135
etchup+0.123
itty+0.118
ang+0.118
 course+0.116
Neg logit contributions
 hardly-0.117
tered-0.108
 barely-0.103
cases-0.098
 floor-0.098
Tokenunk
Token ID2954
Feature activation-2.39e+00

Pos logit contributions
smart+0.043
 course+0.039
itty+0.037
 welcome+0.036
 Friendship+0.036
Neg logit contributions
 hardly-0.059
cases-0.056
tered-0.055
 barely-0.051
 nearly-0.050
Token came
Token ID1625
Feature activation-4.29e-01

Pos logit contributions
smart+0.056
etchup+0.052
 welcome+0.051
 Friendship+0.051
rob+0.050
Neg logit contributions
 hardly-0.056
cases-0.054
tered-0.052
 couldn-0.050
 nearly-0.049
Token along
Token ID1863
Feature activation-9.27e-01

Pos logit contributions
smart+0.010
 course+0.010
itty+0.009
rob+0.009
etchup+0.009
Neg logit contributions
 hardly-0.010
tered-0.010
cases-0.010
 closely-0.009
 nearly-0.009
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
smart+0.024
itty+0.022
 course+0.022
eros+0.021
etchup+0.021
Neg logit contributions
 hardly-0.021
tered-0.020
cases-0.020
 barely-0.017
 closely-0.017
Token his
Token ID465
Feature activation-1.88e+00

Pos logit contributions
smart+0.036
itty+0.035
ang+0.034
idd+0.033
ite+0.033
Neg logit contributions
 hardly-0.059
tered-0.058
cases-0.055
 couldn-0.052
 barely-0.052
Token tutor
Token ID48680
Feature activation-3.34e+00

Pos logit contributions
smart+0.052
 course+0.050
rob+0.049
itty+0.049
ang+0.047
Neg logit contributions
 hardly-0.040
cases-0.035
 barely-0.033
 couldn-0.032
tered-0.032
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
smart+0.082
rob+0.082
ang+0.082
itty+0.081
idd+0.081
Neg logit contributions
cases-0.072
 hardly-0.068
tered-0.066
 closely-0.066
 curiously-0.061
Token a
Token ID257
Feature activation-1.94e+00

Pos logit contributions
rob+0.043
ite+0.042
ites+0.041
ebra+0.040
eros+0.039
Neg logit contributions
 hardly-0.027
 couldn-0.024
 not-0.024
tered-0.024
 barely-0.023
Token wise
Token ID10787
Feature activation-2.63e+00

Pos logit contributions
rob+0.048
smart+0.048
ang+0.047
ites+0.046
etchup+0.045
Neg logit contributions
 hardly-0.043
 barely-0.037
tered-0.036
 floor-0.035
 nearly-0.035
Token old
Token ID1468
Feature activation-5.34e+00

Pos logit contributions
smart+0.054
 course+0.048
etchup+0.048
itty+0.047
rob+0.046
Neg logit contributions
 hardly-0.072
tered-0.069
cases-0.068
 closely-0.063
 barely-0.062
Token owl
Token ID39610
Feature activation-3.68e+00

Pos logit contributions
smart+0.125
etchup+0.115
ang+0.113
 course+0.110
idd+0.109
Neg logit contributions
 hardly-0.123
cases-0.110
tered-0.110
 floor-0.108
 barely-0.108
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
rob+0.100
idd+0.094
P+0.094
etchup+0.094
 crowd+0.092
Neg logit contributions
 couldn-0.059
 hardly-0.058
 closely-0.057
tered-0.057
cases-0.056
Token to
Token ID284
Feature activation-2.08e+00

Pos logit contributions
smart+0.024
eros+0.023
itty+0.023
in+0.023
rob+0.023
Neg logit contributions
 hardly-0.023
cases-0.021
tered-0.021
 couldn-0.021
 nearly-0.019
Token teach
Token ID4545
Feature activation+1.07e+00

Pos logit contributions
itty+0.047
ang+0.046
smart+0.046
idd+0.045
etchup+0.045
Neg logit contributions
 hardly-0.055
tered-0.048
 barely-0.047
cases-0.045
 almost-0.044
Token him
Token ID683
Feature activation+7.96e-01

Pos logit contributions
 hardly+0.031
tered+0.030
cases+0.030
 couldn+0.027
 closely+0.027
Neg logit contributions
smart-0.020
itty-0.018
ang-0.017
in-0.017
eros-0.017
Token his
Token ID465
Feature activation-1.33e+00

Pos logit contributions
smart+0.024
itty+0.022
ang+0.021
etchup+0.021
 course+0.020
Neg logit contributions
 hardly-0.041
tered-0.040
cases-0.039
 barely-0.036
 closely-0.036
Token friend
Token ID1545
Feature activation-2.06e+00

Pos logit contributions
smart+0.031
rob+0.028
itty+0.028
 course+0.027
etchup+0.026
Neg logit contributions
 hardly-0.035
cases-0.031
 couldn-0.031
 closely-0.031
 barely-0.030
Token,
Token ID11
Feature activation-2.08e+00

Pos logit contributions
smart+0.047
itty+0.043
 course+0.042
ang+0.041
etchup+0.041
Neg logit contributions
 hardly-0.052
cases-0.050
tered-0.049
 couldn-0.044
 closely-0.044
Token a
Token ID257
Feature activation-1.82e+00

Pos logit contributions
rob+0.066
opl+0.065
ite+0.062
ites+0.061
loo+0.060
Neg logit contributions
 hardly-0.032
 not-0.030
 couldn-0.030
tered-0.029
 curiously-0.029
Token wise
Token ID10787
Feature activation-2.77e+00

Pos logit contributions
rob+0.055
smart+0.054
eros+0.050
orns+0.049
etchup+0.049
Neg logit contributions
 hardly-0.031
 couldn-0.027
 barely-0.026
 nearly-0.025
tered-0.025
Token old
Token ID1468
Feature activation-5.33e+00

Pos logit contributions
smart+0.061
etchup+0.055
rob+0.054
 course+0.054
itty+0.053
Neg logit contributions
 hardly-0.069
tered-0.067
cases-0.065
 closely-0.064
 curiously-0.061
Token owl
Token ID39610
Feature activation-3.67e+00

Pos logit contributions
smart+0.120
etchup+0.113
rob+0.110
ips+0.104
ang+0.103
Neg logit contributions
 hardly-0.115
 curiously-0.113
 closely-0.111
 couldn-0.111
 not-0.110
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
rob+0.100
etchup+0.092
smart+0.092
idd+0.091
P+0.091
Neg logit contributions
 couldn-0.065
 closely-0.064
 hardly-0.061
 curiously-0.059
tered-0.059
Token �
Token ID6184
Feature activation-7.76e-01

Pos logit contributions
smart+0.040
rob+0.039
eros+0.038
itty+0.038
ang+0.037
Neg logit contributions
 hardly-0.032
 couldn-0.031
tered-0.029
cases-0.029
 nearly-0.027
Token�
Token ID95
Feature activation-7.10e-01

Pos logit contributions
smart+0.017
itty+0.015
in+0.014
etchup+0.014
 passenger+0.014
Neg logit contributions
 hardly-0.020
tered-0.019
cases-0.019
late-0.018
 closely-0.017
Token€
Token ID26391
Feature activation-3.73e-01

Pos logit contributions
smart+0.016
in+0.015
itty+0.015
etchup+0.015
ang+0.014
Neg logit contributions
 hardly-0.018
tered-0.017
cases-0.016
late-0.015
 barely-0.015
Token day
Token ID1110
Feature activation-2.62e-02

Pos logit contributions
 hardly+0.019
cases+0.018
tered+0.018
 barely+0.016
 nearly+0.016
Neg logit contributions
smart-0.010
itty-0.009
 course-0.008
eros-0.008
in-0.008
Token,
Token ID11
Feature activation+1.65e+00

Pos logit contributions
smart+0.001
itty+0.000
 course+0.000
in+0.000
eros+0.000
Neg logit contributions
 hardly-0.001
cases-0.001
tered-0.001
 barely-0.001
 nearly-0.001
Token it
Token ID340
Feature activation-8.53e-01

Pos logit contributions
cases+0.050
 hardly+0.049
rated+0.048
late+0.048
tered+0.048
Neg logit contributions
smart-0.026
 living-0.024
 course-0.024
itty-0.023
 welcome-0.023
Token started
Token ID2067
Feature activation+1.45e+00

Pos logit contributions
 living+0.014
o+0.013
itty+0.013
in+0.012
smart+0.011
Neg logit contributions
cases-0.028
 hardly-0.025
tered-0.024
abs-0.024
late-0.024
Token raining
Token ID43079
Feature activation+3.17e+00

Pos logit contributions
cases+0.045
 hardly+0.044
tered+0.042
late+0.039
abs+0.039
Neg logit contributions
smart-0.027
 living-0.024
itty-0.023
o-0.022
etchup-0.022
Token very
Token ID845
Feature activation+3.15e+00

Pos logit contributions
 hardly+0.109
cases+0.107
abs+0.098
tered+0.095
 barely+0.095
Neg logit contributions
 living-0.040
smart-0.038
 course-0.037
itty-0.033
o-0.033
Token hard
Token ID1327
Feature activation+7.40e-01

Pos logit contributions
 hardly+0.091
cases+0.091
tered+0.084
arded+0.082
abs+0.080
Neg logit contributions
in-0.054
ang-0.050
 living-0.049
smart-0.045
itty-0.044
Token outside
Token ID2354
Feature activation+1.25e+00

Pos logit contributions
 hardly+0.023
tered+0.021
cases+0.021
abs+0.020
 barely+0.020
Neg logit contributions
smart-0.013
 living-0.012
 course-0.011
itty-0.011
in-0.011
Token of
Token ID286
Feature activation-8.52e-01

Pos logit contributions
 hardly+0.037
cases+0.036
tered+0.034
 barely+0.032
abs+0.032
Neg logit contributions
smart-0.024
itty-0.021
 course-0.021
 living-0.020
in-0.020
Token her
Token ID607
Feature activation+2.64e+00

Pos logit contributions
smart+0.018
itty+0.016
 course+0.016
in+0.015
eros+0.015
Neg logit contributions
 hardly-0.024
cases-0.023
tered-0.022
 barely-0.020
 nearly-0.020
Token window
Token ID4324
Feature activation+3.10e-01

Pos logit contributions
 hardly+0.077
cases+0.074
tered+0.073
 barely+0.067
 nearly+0.066
Neg logit contributions
smart-0.052
itty-0.047
 course-0.045
in-0.044
eros-0.043
Token day
Token ID1110
Feature activation+1.66e-01

Pos logit contributions
smart+0.021
rob+0.019
eros+0.018
 course+0.017
idd+0.017
Neg logit contributions
 hardly-0.022
tered-0.020
 couldn-0.020
cases-0.019
 nearly-0.018
Token,
Token ID11
Feature activation+1.38e+00

Pos logit contributions
 hardly+0.005
cases+0.005
tered+0.005
 barely+0.004
late+0.004
Neg logit contributions
smart-0.003
 course-0.003
itty-0.003
 welcome-0.002
in-0.002
Token Jack
Token ID3619
Feature activation+2.32e+00

Pos logit contributions
cases+0.039
 hardly+0.039
aned+0.035
tered+0.035
late+0.035
Neg logit contributions
smart-0.027
 course-0.026
 welcome-0.024
 Friendship-0.023
 living-0.023
Token disappeared
Token ID12120
Feature activation+8.49e-01

Pos logit contributions
 hardly+0.068
tered+0.066
late+0.064
cases+0.064
appointed+0.060
Neg logit contributions
o-0.040
itty-0.039
in-0.037
 living-0.037
smart-0.036
Token!
Token ID0
Feature activation+1.41e-01

Pos logit contributions
 hardly+0.026
tered+0.024
cases+0.024
abs+0.022
 barely+0.022
Neg logit contributions
smart-0.015
 living-0.013
in-0.013
 welcome-0.012
ang-0.012
Token Jill
Token ID22772
Feature activation+3.43e+00

Pos logit contributions
 hardly+0.004
cases+0.004
tered+0.004
 barely+0.004
late+0.004
Neg logit contributions
smart-0.003
itty-0.002
 course-0.002
 welcome-0.002
in-0.002
Token looked
Token ID3114
Feature activation+9.66e-01

Pos logit contributions
late+0.099
abs+0.099
 hardly+0.098
cases+0.098
tered+0.097
Neg logit contributions
 living-0.056
itty-0.052
in-0.050
smart-0.050
o-0.049
Token everywhere
Token ID8347
Feature activation+2.33e+00

Pos logit contributions
 hardly+0.029
cases+0.027
tered+0.026
abs+0.025
 barely+0.025
Neg logit contributions
smart-0.019
 living-0.017
 welcome-0.016
 course-0.016
in-0.015
Token,
Token ID11
Feature activation+3.92e+00

Pos logit contributions
cases+0.068
 hardly+0.067
tered+0.063
late+0.059
abs+0.058
Neg logit contributions
smart-0.046
 course-0.039
itty-0.038
 welcome-0.037
 living-0.037
Token but
Token ID475
Feature activation+9.21e-01

Pos logit contributions
abs+0.096
cases+0.096
 hardly+0.090
late+0.090
tered+0.086
Neg logit contributions
smart-0.083
 living-0.082
 course-0.079
 waiting-0.074
 welcome-0.070
Token she
Token ID673
Feature activation+1.99e+00

Pos logit contributions
cases+0.024
 hardly+0.024
tered+0.023
abs+0.022
late+0.022
Neg logit contributions
smart-0.019
 course-0.017
 welcome-0.017
in-0.016
 living-0.016
Token her
Token ID607
Feature activation+1.85e+00

Pos logit contributions
 hardly+0.020
cases+0.020
tered+0.019
 barely+0.017
 nearly+0.017
Neg logit contributions
smart-0.015
 course-0.013
itty-0.013
in-0.012
eros-0.012
Token tea
Token ID8887
Feature activation+1.19e+00

Pos logit contributions
 hardly+0.053
cases+0.052
tered+0.051
late+0.047
 barely+0.046
Neg logit contributions
smart-0.034
 living-0.032
 course-0.031
 welcome-0.031
itty-0.031
Token.
Token ID13
Feature activation+5.96e-02

Pos logit contributions
 hardly+0.033
cases+0.032
tered+0.031
 barely+0.028
abs+0.028
Neg logit contributions
smart-0.024
 course-0.022
itty-0.021
 living-0.020
in-0.020
Token It
Token ID632
Feature activation-3.84e-02

Pos logit contributions
cases+0.002
 hardly+0.002
tered+0.002
rated+0.002
abs+0.002
Neg logit contributions
smart-0.001
itty-0.001
 course-0.001
 welcome-0.001
 living-0.001
Token made
Token ID925
Feature activation+3.64e+00

Pos logit contributions
smart+0.001
 course+0.001
 living+0.001
itty+0.001
in+0.001
Neg logit contributions
cases-0.001
 hardly-0.001
tered-0.001
abs-0.001
late-0.001
Token her
Token ID607
Feature activation+3.18e+00

Pos logit contributions
cases+0.132
 hardly+0.123
rated+0.116
tered+0.116
abs+0.115
Neg logit contributions
 course-0.046
smart-0.046
 welcome-0.040
itty-0.037
 living-0.035
Token jump
Token ID4391
Feature activation+3.17e+00

Pos logit contributions
cases+0.085
tered+0.081
 hardly+0.079
late+0.074
abs+0.072
Neg logit contributions
smart-0.065
itty-0.062
 welcome-0.061
 living-0.059
 course-0.058
Token in
Token ID287
Feature activation-4.09e-01

Pos logit contributions
 hardly+0.096
cases+0.090
abs+0.081
icity+0.081
tered+0.080
Neg logit contributions
smart-0.053
in-0.048
 course-0.047
 living-0.046
 back-0.046
Token surprise
Token ID5975
Feature activation+1.13e+00

Pos logit contributions
smart+0.009
 course+0.008
itty+0.008
in+0.008
 welcome+0.008
Neg logit contributions
 hardly-0.011
cases-0.011
tered-0.010
 barely-0.009
abs-0.009
Token and
Token ID290
Feature activation+4.00e-01

Pos logit contributions
 hardly+0.030
cases+0.029
tered+0.028
 barely+0.026
abs+0.025
Neg logit contributions
smart-0.025
 course-0.023
itty-0.022
in-0.021
 living-0.021
Token her
Token ID607
Feature activation+6.92e-01

Pos logit contributions
 hardly+0.010
cases+0.010
tered+0.010
abs+0.009
late+0.009
Neg logit contributions
smart-0.009
 course-0.008
itty-0.008
 living-0.008
 welcome-0.007
TokenThey
Token ID2990
Feature activation+5.40e-01

Pos logit contributions
smart+0.012
itty+0.011
eros+0.011
 course+0.011
etchup+0.010
Neg logit contributions
 hardly-0.012
cases-0.011
tered-0.011
 barely-0.010
 nearly-0.009
Token hung
Token ID9174
Feature activation+7.74e-01

Pos logit contributions
 hardly+0.013
cases+0.013
tered+0.013
abs+0.012
icing+0.012
Neg logit contributions
 course-0.012
smart-0.012
 welcome-0.011
 passenger-0.010
in-0.010
Token their
Token ID511
Feature activation+2.73e+00

Pos logit contributions
 hardly+0.025
cases+0.024
tered+0.023
 barely+0.022
 nearly+0.021
Neg logit contributions
smart-0.013
itty-0.011
 course-0.011
in-0.011
eros-0.011
Token socks
Token ID24359
Feature activation+1.85e+00

Pos logit contributions
 hardly+0.065
cases+0.063
tered+0.061
 barely+0.055
 nearly+0.053
Neg logit contributions
smart-0.070
itty-0.063
 course-0.062
in-0.059
ang-0.059
Token on
Token ID319
Feature activation+4.66e-01

Pos logit contributions
 hardly+0.055
cases+0.051
tered+0.049
 barely+0.047
abs+0.047
Neg logit contributions
smart-0.035
 course-0.031
in-0.031
itty-0.030
 living-0.028
Token the
Token ID262
Feature activation+3.21e+00

Pos logit contributions
cases+0.016
 hardly+0.016
abs+0.015
tered+0.014
rated+0.014
Neg logit contributions
smart-0.006
 course-0.006
eros-0.005
 living-0.005
in-0.005
Token chim
Token ID18205
Feature activation+1.50e+00

Pos logit contributions
 hardly+0.072
cases+0.069
tered+0.067
 barely+0.061
 floor+0.060
Neg logit contributions
smart-0.087
itty-0.078
 course-0.076
ang-0.074
in-0.074
Tokenney
Token ID1681
Feature activation+1.23e+00

Pos logit contributions
 hardly+0.046
cases+0.045
tered+0.044
 barely+0.040
 nearly+0.040
Neg logit contributions
smart-0.027
itty-0.024
 course-0.023
in-0.022
eros-0.022
Token and
Token ID290
Feature activation+6.25e-01

Pos logit contributions
 hardly+0.032
cases+0.030
tered+0.029
 barely+0.027
 nearly+0.026
Neg logit contributions
smart-0.028
itty-0.026
 course-0.025
in-0.024
eros-0.024
Token then
Token ID788
Feature activation+5.13e-02

Pos logit contributions
 hardly+0.014
cases+0.014
tered+0.013
abs+0.012
late+0.012
Neg logit contributions
smart-0.015
 course-0.014
itty-0.014
etchup-0.014
in-0.014
Token added
Token ID2087
Feature activation+6.47e-02

Pos logit contributions
 hardly+0.001
cases+0.001
tered+0.001
 barely+0.001
 nearly+0.001
Neg logit contributions
smart-0.001
itty-0.001
 course-0.001
eros-0.001
ang-0.001
Token to
Token ID284
Feature activation-5.01e-01

Pos logit contributions
 hardly+0.001
cases+0.001
tered+0.001
 barely+0.001
 nearly+0.001
Neg logit contributions
smart-0.000
itty-0.000
ang-0.000
 course-0.000
eros-0.000
Token play
Token ID711
Feature activation+5.12e-01

Pos logit contributions
smart+0.011
itty+0.010
 course+0.010
in+0.010
eros+0.010
Neg logit contributions
 hardly-0.013
cases-0.013
tered-0.012
 barely-0.011
 nearly-0.011
Token,
Token ID11
Feature activation+2.77e+00

Pos logit contributions
 hardly+0.012
tered+0.012
cases+0.011
 closely+0.010
 barely+0.010
Neg logit contributions
smart-0.012
itty-0.012
etchup-0.011
ang-0.011
eros-0.011
Token but
Token ID475
Feature activation+2.71e-01

Pos logit contributions
 hardly+0.058
cases+0.057
tered+0.054
abs+0.047
late+0.047
Neg logit contributions
smart-0.077
 course-0.070
itty-0.069
eros-0.066
etchup-0.066
Token he
Token ID339
Feature activation+6.28e-01

Pos logit contributions
 hardly+0.007
cases+0.006
tered+0.006
 barely+0.006
 nearly+0.006
Neg logit contributions
smart-0.006
itty-0.006
 course-0.006
eros-0.006
in-0.006
Token doesn
Token ID1595
Feature activation+3.72e+00

Pos logit contributions
 hardly+0.017
cases+0.017
tered+0.016
 barely+0.014
abs+0.014
Neg logit contributions
smart-0.014
 course-0.012
itty-0.012
in-0.012
eros-0.012
Token't
Token ID470
Feature activation-2.37e-01

Pos logit contributions
cases+0.078
tered+0.069
rated+0.068
 hardly+0.064
abs+0.063
Neg logit contributions
smart-0.099
 course-0.093
in-0.091
 welcome-0.087
itty-0.085
Token know
Token ID760
Feature activation+2.72e-01

Pos logit contributions
smart+0.006
itty+0.006
 course+0.006
eros+0.006
etchup+0.006
Neg logit contributions
 hardly-0.005
tered-0.005
cases-0.005
 barely-0.004
 nearly-0.004
Token how
Token ID703
Feature activation+6.07e-01

Pos logit contributions
 hardly+0.007
cases+0.007
tered+0.007
 closely+0.007
 couldn+0.007
Neg logit contributions
smart-0.005
itty-0.005
ang-0.005
eros-0.005
 passenger-0.005
Token to
Token ID284
Feature activation-9.06e-01

Pos logit contributions
 hardly+0.016
cases+0.016
tered+0.015
 barely+0.014
 nearly+0.013
Neg logit contributions
smart-0.014
itty-0.012
 course-0.012
eros-0.012
in-0.012
Token be
Token ID307
Feature activation-1.13e+00

Pos logit contributions
smart+0.021
itty+0.019
 course+0.019
in+0.018
eros+0.018
Neg logit contributions
 hardly-0.023
cases-0.023
tered-0.022
 barely-0.020
 nearly-0.019
Token The
Token ID383
Feature activation-6.17e-01

Pos logit contributions
 hardly+0.021
cases+0.020
tered+0.020
 barely+0.018
 nearly+0.017
Neg logit contributions
smart-0.020
itty-0.018
 course-0.018
in-0.017
eros-0.017
Token train
Token ID4512
Feature activation-1.38e+00

Pos logit contributions
smart+0.016
itty+0.014
in+0.013
ang+0.013
eros+0.013
Neg logit contributions
 hardly-0.015
cases-0.015
tered-0.014
 floor-0.013
 barely-0.013
Token starts
Token ID4940
Feature activation+2.00e+00

Pos logit contributions
smart+0.031
itty+0.029
 course+0.027
in+0.027
ang+0.027
Neg logit contributions
 hardly-0.036
cases-0.035
tered-0.034
 barely-0.031
 nearly-0.030
Token to
Token ID284
Feature activation+7.17e-01

Pos logit contributions
 hardly+0.052
cases+0.050
tered+0.048
 barely+0.044
 nearly+0.043
Neg logit contributions
smart-0.046
itty-0.043
 course-0.042
eros-0.040
in-0.040
Token move
Token ID1445
Feature activation+8.23e-01

Pos logit contributions
 hardly+0.018
cases+0.017
tered+0.016
 barely+0.015
 nearly+0.014
Neg logit contributions
smart-0.017
itty-0.016
 course-0.016
in-0.015
eros-0.015
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
 hardly+0.024
cases+0.023
tered+0.022
 barely+0.021
abs+0.020
Neg logit contributions
smart-0.017
itty-0.014
 course-0.014
in-0.014
eros-0.013
Token\n
Token ID198
Feature activation+1.40e-02

Pos logit contributions
 hardly+0.037
cases+0.036
tered+0.034
 barely+0.031
 nearly+0.031
Neg logit contributions
smart-0.030
itty-0.027
 course-0.027
in-0.025
eros-0.025
Token\n
Token ID198
Feature activation-2.23e-01

Pos logit contributions
 hardly+0.000
cases+0.000
tered+0.000
 barely+0.000
 nearly+0.000
Neg logit contributions
smart-0.000
itty-0.000
 course-0.000
eros-0.000
in-0.000
TokenBut
Token ID1537
Feature activation+7.97e-01

Pos logit contributions
smart+0.005
itty+0.004
 course+0.004
eros+0.004
in+0.004
Neg logit contributions
 hardly-0.006
cases-0.006
tered-0.006
 barely-0.005
late-0.005
Token the
Token ID262
Feature activation-1.58e-01

Pos logit contributions
 hardly+0.019
cases+0.019
tered+0.018
 barely+0.016
 nearly+0.016
Neg logit contributions
smart-0.020
itty-0.018
 course-0.018
in-0.017
eros-0.017
Token train
Token ID4512
Feature activation-3.19e-01

Pos logit contributions
smart+0.003
itty+0.003
in+0.003
 course+0.003
eros+0.003
Neg logit contributions
 hardly-0.004
cases-0.004
tered-0.004
 barely-0.004
 floor-0.004
Token boy
Token ID2933
Feature activation-1.07e+00

Pos logit contributions
smart+0.020
itty+0.018
 course+0.017
eros+0.016
in+0.016
Neg logit contributions
 hardly-0.028
cases-0.027
tered-0.027
 barely-0.025
 nearly-0.024
Token a
Token ID257
Feature activation+4.48e-01

Pos logit contributions
rob+0.027
smart+0.027
idd+0.027
itty+0.026
eros+0.026
Neg logit contributions
 hardly-0.022
cases-0.021
tered-0.021
 closely-0.018
 nearly-0.018
Token gentle
Token ID10296
Feature activation-1.88e+00

Pos logit contributions
 hardly+0.011
cases+0.009
 barely+0.009
 nearly+0.009
tered+0.009
Neg logit contributions
smart-0.011
rob-0.010
eros-0.010
eper-0.010
ang-0.010
Token smile
Token ID8212
Feature activation-1.70e+00

Pos logit contributions
smart+0.043
rob+0.039
itty+0.038
ang+0.038
idd+0.037
Neg logit contributions
 hardly-0.048
tered-0.045
 barely-0.042
cases-0.041
 nearly-0.040
Token.
Token ID13
Feature activation-8.26e-01

Pos logit contributions
smart+0.040
 course+0.037
itty+0.037
etchup+0.036
eros+0.035
Neg logit contributions
 hardly-0.043
tered-0.041
cases-0.041
 barely-0.036
 couldn-0.036
Token\n
Token ID198
Feature activation+6.18e-01

Pos logit contributions
smart+0.020
itty+0.017
in+0.017
 course+0.017
ang+0.016
Neg logit contributions
 hardly-0.021
cases-0.020
tered-0.020
 barely-0.018
 couldn-0.018
Token\n
Token ID198
Feature activation+8.82e-01

Pos logit contributions
cases+0.021
 hardly+0.021
tered+0.020
 barely+0.019
rated+0.019
Neg logit contributions
smart-0.009
itty-0.008
 course-0.007
in-0.007
ang-0.007
TokenThe
Token ID464
Feature activation-6.78e-01

Pos logit contributions
cases+0.025
 hardly+0.022
rated+0.021
tered+0.021
 couldn+0.021
Neg logit contributions
smart-0.018
 course-0.018
itty-0.017
in-0.016
P-0.016
Token rude
Token ID22066
Feature activation-1.63e+00

Pos logit contributions
smart+0.012
itty+0.011
 course+0.011
eros+0.010
in+0.010
Neg logit contributions
 hardly-0.021
cases-0.020
tered-0.020
 barely-0.018
 nearly-0.018
Token boy
Token ID2933
Feature activation+1.02e+00

Pos logit contributions
smart+0.020
itty+0.018
 course+0.017
in+0.017
eros+0.016
Neg logit contributions
 hardly-0.060
cases-0.058
tered-0.056
 barely-0.053
 nearly-0.053
Token was
Token ID373
Feature activation-1.29e-02

Pos logit contributions
cases+0.026
 hardly+0.026
tered+0.024
abs+0.023
late+0.022
Neg logit contributions
smart-0.023
 course-0.022
itty-0.021
in-0.020
o-0.019
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
 hardly+0.020
cases+0.019
tered+0.018
 barely+0.017
 nearly+0.017
Neg logit contributions
smart-0.017
itty-0.016
 course-0.015
in-0.015
eros-0.015
Token<|endoftext|>
Token ID50256
Feature activation-2.43e+00

Pos logit contributions
smart+0.030
in+0.028
eper+0.027
eros+0.027
ang+0.026
Neg logit contributions
 hardly-0.033
tered-0.029
cases-0.028
 couldn-0.028
 barely-0.026
TokenOnce
Token ID7454
Feature activation-1.75e-02

Pos logit contributions
smart+0.050
itty+0.043
 course+0.042
eros+0.042
ang+0.042
Neg logit contributions
 hardly-0.067
tered-0.064
cases-0.064
 nearly-0.058
 barely-0.057
Token upon
Token ID2402
Feature activation+2.06e+00

Pos logit contributions
smart+0.001
itty+0.001
 course+0.000
eros+0.000
in+0.000
Neg logit contributions
 hardly-0.000
cases-0.000
tered-0.000
 barely-0.000
 nearly-0.000
Token a
Token ID257
Feature activation+2.03e+00

Pos logit contributions
cases+0.072
 hardly+0.068
late+0.064
aned+0.063
tered+0.063
Neg logit contributions
smart-0.028
 course-0.026
 living-0.026
in-0.025
 welcome-0.025
Token time
Token ID640
Feature activation+4.58e-01

Pos logit contributions
cases+0.049
 hardly+0.046
tered+0.042
 barely+0.039
late+0.038
Neg logit contributions
 course-0.052
smart-0.050
itty-0.050
 welcome-0.048
in-0.048
Token,
Token ID11
Feature activation+3.75e-01

Pos logit contributions
 hardly+0.012
cases+0.012
tered+0.012
 barely+0.011
 nearly+0.010
Neg logit contributions
smart-0.010
itty-0.009
 course-0.009
in-0.008
eros-0.008
Token there
Token ID612
Feature activation-9.14e-01

Pos logit contributions
 hardly+0.011
tered+0.010
cases+0.010
 closely+0.009
 couldn+0.009
Neg logit contributions
smart-0.008
 course-0.007
itty-0.007
eros-0.006
etchup-0.006
Token was
Token ID373
Feature activation+1.93e+00

Pos logit contributions
smart+0.023
itty+0.022
 course+0.022
in+0.021
eros+0.020
Neg logit contributions
 hardly-0.021
cases-0.021
tered-0.019
 barely-0.017
late-0.017
Token a
Token ID257
Feature activation+3.02e-01

Pos logit contributions
cases+0.061
 hardly+0.058
tered+0.053
rated+0.052
abs+0.051
Neg logit contributions
smart-0.032
 course-0.032
itty-0.032
 living-0.030
 welcome-0.030
Token huge
Token ID3236
Feature activation-1.90e+00

Pos logit contributions
 hardly+0.005
cases+0.005
tered+0.005
 barely+0.004
 nearly+0.004
Neg logit contributions
smart-0.010
 course-0.009
itty-0.009
eros-0.008
etchup-0.008
Token to
Token ID284
Feature activation-4.52e-01

Pos logit contributions
smart+0.007
itty+0.006
 course+0.006
rob+0.006
eros+0.006
Neg logit contributions
 hardly-0.006
tered-0.005
cases-0.005
 barely-0.005
 nearly-0.005
Token touch
Token ID3638
Feature activation+7.65e-01

Pos logit contributions
smart+0.010
 welcome+0.009
 course+0.009
itty+0.008
 passenger+0.008
Neg logit contributions
cases-0.013
 hardly-0.012
tered-0.011
abs-0.010
 barely-0.010
Token it
Token ID340
Feature activation+1.02e+00

Pos logit contributions
 hardly+0.021
cases+0.020
tered+0.019
 barely+0.018
 nearly+0.017
Neg logit contributions
smart-0.017
itty-0.015
 course-0.015
in-0.014
eros-0.014
Token.
Token ID13
Feature activation+8.69e-01

Pos logit contributions
 hardly+0.036
cases+0.033
abs+0.032
 barely+0.032
tered+0.032
Neg logit contributions
smart-0.014
 course-0.011
in-0.010
 welcome-0.010
 living-0.010
Token But
Token ID887
Feature activation+4.92e-01

Pos logit contributions
 hardly+0.030
cases+0.029
abs+0.027
 barely+0.027
 footing+0.026
Neg logit contributions
 living-0.011
smart-0.010
 Working-0.009
 Hi-0.009
 welcome-0.009
Token Lily
Token ID20037
Feature activation+2.28e+00

Pos logit contributions
cases+0.016
 hardly+0.015
abs+0.014
tered+0.014
rated+0.013
Neg logit contributions
smart-0.008
 course-0.008
itty-0.007
 welcome-0.007
 living-0.007
Token was
Token ID373
Feature activation-4.58e-01

Pos logit contributions
cases+0.057
 hardly+0.052
tered+0.050
late+0.047
abs+0.047
Neg logit contributions
itty-0.049
in-0.049
 course-0.048
smart-0.047
eper-0.046
Token stubborn
Token ID24444
Feature activation+4.11e-01

Pos logit contributions
smart+0.011
itty+0.010
 course+0.010
eros+0.009
in+0.009
Neg logit contributions
 hardly-0.012
cases-0.011
tered-0.011
 barely-0.010
 nearly-0.010
Token and
Token ID290
Feature activation+1.65e-01

Pos logit contributions
 hardly+0.012
cases+0.011
tered+0.011
 barely+0.010
abs+0.010
Neg logit contributions
smart-0.009
 course-0.008
 living-0.007
itty-0.007
in-0.007
Token wanted
Token ID2227
Feature activation-1.11e-01

Pos logit contributions
cases+0.004
 hardly+0.004
late+0.003
tered+0.003
abs+0.003
Neg logit contributions
smart-0.004
 course-0.004
itty-0.004
 living-0.003
 welcome-0.003
Token to
Token ID284
Feature activation-8.68e-01

Pos logit contributions
smart+0.002
itty+0.002
 course+0.002
in+0.002
eros+0.002
Neg logit contributions
 hardly-0.003
cases-0.003
tered-0.003
 barely-0.003
 nearly-0.003
Token to
Token ID284
Feature activation-9.69e-02

Pos logit contributions
cases+0.046
 hardly+0.045
tered+0.044
rated+0.041
abs+0.041
Neg logit contributions
smart-0.037
 living-0.030
 course-0.030
 welcome-0.030
in-0.030
Token be
Token ID307
Feature activation-1.97e+00

Pos logit contributions
smart+0.002
 course+0.002
 welcome+0.002
itty+0.002
 living+0.002
Neg logit contributions
cases-0.003
 hardly-0.003
tered-0.002
abs-0.002
late-0.002
Token more
Token ID517
Feature activation-2.90e-01

Pos logit contributions
rob+0.053
eros+0.052
itty+0.051
smart+0.051
 course+0.050
Neg logit contributions
 hardly-0.041
tered-0.038
 almost-0.037
 nearly-0.036
 not-0.034
Token careful
Token ID8161
Feature activation+2.35e+00

Pos logit contributions
smart+0.006
itty+0.005
in+0.005
 course+0.005
ang+0.005
Neg logit contributions
 hardly-0.008
cases-0.008
tered-0.008
 barely-0.007
late-0.007
Token next
Token ID1306
Feature activation-2.24e-03

Pos logit contributions
 hardly+0.060
cases+0.059
tered+0.055
 barely+0.051
late+0.051
Neg logit contributions
smart-0.054
 course-0.047
in-0.047
itty-0.046
 living-0.046
Token time
Token ID640
Feature activation+1.29e+00

Pos logit contributions
smart+0.000
 course+0.000
itty+0.000
in+0.000
eros+0.000
Neg logit contributions
 hardly-0.000
cases-0.000
tered-0.000
 barely-0.000
 nearly-0.000
Token and
Token ID290
Feature activation-4.95e-01

Pos logit contributions
 hardly+0.036
cases+0.035
tered+0.034
 barely+0.031
late+0.031
Neg logit contributions
smart-0.027
itty-0.023
 course-0.023
in-0.022
etchup-0.022
Token not
Token ID407
Feature activation+1.88e-01

Pos logit contributions
smart+0.013
itty+0.012
 course+0.012
eros+0.011
ang+0.011
Neg logit contributions
 hardly-0.011
cases-0.011
tered-0.010
 barely-0.009
 couldn-0.009
Token to
Token ID284
Feature activation-5.48e-01

Pos logit contributions
 hardly+0.004
cases+0.004
tered+0.004
 barely+0.004
 couldn+0.004
Neg logit contributions
smart-0.005
itty-0.005
 course-0.004
rob-0.004
eros-0.004
Token drink
Token ID4144
Feature activation+1.13e+00

Pos logit contributions
smart+0.013
 course+0.012
itty+0.012
in+0.011
 welcome+0.011
Neg logit contributions
cases-0.014
 hardly-0.014
tered-0.013
 barely-0.011
 nearly-0.011
Token anything
Token ID1997
Feature activation+1.79e+00

Pos logit contributions
 hardly+0.031
cases+0.030
tered+0.029
 barely+0.026
 nearly+0.026
Neg logit contributions
smart-0.024
itty-0.022
 course-0.021
eros-0.020
in-0.020
Token stubborn
Token ID24444
Feature activation-6.48e-01

Pos logit contributions
 hardly+0.002
tered+0.002
cases+0.002
 barely+0.001
 closely+0.001
Neg logit contributions
smart-0.004
 course-0.003
itty-0.003
rob-0.003
eros-0.003
Token little
Token ID1310
Feature activation-2.84e+00

Pos logit contributions
smart+0.012
itty+0.010
 course+0.010
in+0.010
etchup+0.010
Neg logit contributions
 hardly-0.019
cases-0.019
tered-0.018
 barely-0.017
 nearly-0.017
Token mouse
Token ID10211
Feature activation-9.91e-01

Pos logit contributions
smart+0.068
 Working+0.055
 Friendship+0.054
rob+0.049
fing+0.049
Neg logit contributions
tered-0.066
 floor-0.065
 hardly-0.064
chy-0.062
cases-0.061
Token who
Token ID508
Feature activation-1.54e+00

Pos logit contributions
smart+0.019
rob+0.017
etchup+0.017
 course+0.017
 welcome+0.017
Neg logit contributions
 hardly-0.026
tered-0.025
cases-0.024
 couldn-0.024
 closely-0.023
Token lived
Token ID5615
Feature activation-1.56e+00

Pos logit contributions
smart+0.036
itty+0.032
 course+0.032
in+0.030
eros+0.030
Neg logit contributions
 hardly-0.040
cases-0.039
tered-0.037
 barely-0.034
 nearly-0.033
Token with
Token ID351
Feature activation-5.37e-01

Pos logit contributions
smart+0.038
 course+0.036
rob+0.036
itty+0.035
eros+0.035
Neg logit contributions
 hardly-0.035
tered-0.034
 almost-0.031
 barely-0.030
 curiously-0.030
Token her
Token ID607
Feature activation-5.12e-01

Pos logit contributions
smart+0.014
itty+0.014
ang+0.014
rob+0.013
idd+0.013
Neg logit contributions
 hardly-0.013
tered-0.011
 barely-0.011
 couldn-0.010
cases-0.010
Token family
Token ID1641
Feature activation-1.73e+00

Pos logit contributions
smart+0.012
rob+0.010
 course+0.010
idd+0.010
itty+0.010
Neg logit contributions
 hardly-0.014
tered-0.012
cases-0.012
 couldn-0.012
 barely-0.012
Token in
Token ID287
Feature activation-1.43e+00

Pos logit contributions
smart+0.053
rob+0.051
itty+0.047
idd+0.047
aid+0.047
Neg logit contributions
 hardly-0.025
tered-0.025
 closely-0.025
 floor-0.023
 curiously-0.022
Token the
Token ID262
Feature activation-2.11e-01

Pos logit contributions
rob+0.041
smart+0.040
in+0.040
idd+0.040
ang+0.040
Neg logit contributions
 hardly-0.028
tered-0.024
 barely-0.023
 not-0.022
 almost-0.021
Token woods
Token ID16479
Feature activation-5.89e-02

Pos logit contributions
smart+0.005
ang+0.004
itty+0.004
etchup+0.004
rob+0.004
Neg logit contributions
 hardly-0.006
 floor-0.005
tered-0.005
cases-0.005
 barely-0.005
Token she
Token ID673
Feature activation+6.35e-01

Pos logit contributions
cases+0.059
 hardly+0.056
rated+0.055
tered+0.054
aned+0.053
Neg logit contributions
smart-0.031
itty-0.029
 course-0.028
 living-0.028
 welcome-0.027
Token was
Token ID373
Feature activation+5.46e-01

Pos logit contributions
cases+0.022
 hardly+0.021
late+0.020
tered+0.020
aned+0.019
Neg logit contributions
 living-0.010
o-0.009
in-0.009
P-0.009
smart-0.008
Token walking
Token ID6155
Feature activation-8.25e-01

Pos logit contributions
cases+0.020
 hardly+0.019
tered+0.019
abs+0.018
rated+0.018
Neg logit contributions
smart-0.006
itty-0.006
 living-0.006
 welcome-0.005
ang-0.005
Token very
Token ID845
Feature activation+1.76e+00

Pos logit contributions
smart+0.020
 course+0.018
itty+0.017
etchup+0.017
eros+0.016
Neg logit contributions
 hardly-0.021
tered-0.020
cases-0.019
 closely-0.018
 barely-0.018
Token slow
Token ID3105
Feature activation+3.83e-02

Pos logit contributions
cases+0.057
 hardly+0.054
tered+0.051
abs+0.047
late+0.047
Neg logit contributions
ang-0.029
in-0.028
 living-0.027
smart-0.026
 welcome-0.025
Token in
Token ID287
Feature activation-1.03e+00

Pos logit contributions
 hardly+0.001
cases+0.001
tered+0.001
 barely+0.001
abs+0.001
Neg logit contributions
smart-0.001
itty-0.001
 living-0.001
ang-0.001
in-0.001
Token the
Token ID262
Feature activation+5.28e-01

Pos logit contributions
smart+0.024
itty+0.023
in+0.022
idd+0.022
rob+0.022
Neg logit contributions
 hardly-0.024
tered-0.022
 closely-0.022
 floor-0.021
cases-0.021
Token woods
Token ID16479
Feature activation+1.29e+00

Pos logit contributions
 hardly+0.012
 floor+0.012
cases+0.011
 closely+0.011
 walls+0.011
Neg logit contributions
smart-0.014
etchup-0.011
itty-0.011
eros-0.011
ang-0.011
Token.
Token ID13
Feature activation+8.06e-01

Pos logit contributions
cases+0.041
 hardly+0.039
tered+0.037
abs+0.037
late+0.034
Neg logit contributions
smart-0.023
 living-0.022
 course-0.020
itty-0.020
 welcome-0.018
Token Suddenly
Token ID24975
Feature activation+9.47e-01

Pos logit contributions
cases+0.026
tered+0.025
 hardly+0.025
rated+0.025
icity+0.023
Neg logit contributions
itty-0.011
smart-0.011
 living-0.011
 What-0.010
 welcome-0.010
Token,
Token ID11
Feature activation+2.72e+00

Pos logit contributions
cases+0.034
 hardly+0.034
rated+0.033
tered+0.033
abs+0.032
Neg logit contributions
smart-0.010
itty-0.010
 course-0.009
 living-0.008
in-0.008
Token water
Token ID1660
Feature activation-1.84e+00

Pos logit contributions
smart+0.030
itty+0.026
ang+0.025
 course+0.025
eros+0.025
Neg logit contributions
 hardly-0.034
cases-0.032
tered-0.032
 barely-0.028
 nearly-0.028
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
smart+0.046
itty+0.041
 course+0.040
ang+0.040
in+0.039
Neg logit contributions
cases-0.043
 hardly-0.043
tered-0.041
 couldn-0.037
 nearly-0.037
Token\n
Token ID198
Feature activation+3.12e-02

Pos logit contributions
smart+0.013
itty+0.011
 course+0.011
in+0.011
eros+0.011
Neg logit contributions
 hardly-0.017
cases-0.016
tered-0.016
 barely-0.015
 nearly-0.014
Token\n
Token ID198
Feature activation+2.77e-01

Pos logit contributions
cases+0.001
 hardly+0.001
tered+0.001
rated+0.001
late+0.001
Neg logit contributions
smart-0.000
itty-0.000
 course-0.000
in-0.000
 welcome-0.000
TokenThe
Token ID464
Feature activation-9.39e-01

Pos logit contributions
cases+0.008
 hardly+0.007
tered+0.007
rated+0.007
 couldn+0.007
Neg logit contributions
smart-0.006
 course-0.005
in-0.005
itty-0.005
 welcome-0.005
Token brown
Token ID7586
Feature activation-1.49e+00

Pos logit contributions
smart+0.023
itty+0.020
in+0.019
 course+0.019
ang+0.019
Neg logit contributions
 hardly-0.024
cases-0.023
tered-0.022
 barely-0.021
 floor-0.020
Token dog
Token ID3290
Feature activation-3.70e-01

Pos logit contributions
smart+0.017
itty+0.014
 course+0.013
eros+0.012
in+0.012
Neg logit contributions
 hardly-0.056
cases-0.055
tered-0.054
 barely-0.050
 nearly-0.050
Token was
Token ID373
Feature activation-1.33e-01

Pos logit contributions
smart+0.007
 living+0.007
o+0.007
itty+0.007
 course+0.006
Neg logit contributions
 hardly-0.010
cases-0.010
abs-0.009
tered-0.009
late-0.009
Token very
Token ID845
Feature activation-3.23e-01

Pos logit contributions
smart+0.003
itty+0.003
 course+0.003
 welcome+0.003
in+0.003
Neg logit contributions
 hardly-0.003
cases-0.003
tered-0.003
abs-0.003
 barely-0.003
Token happy
Token ID3772
Feature activation-1.13e+00

Pos logit contributions
smart+0.007
itty+0.006
 course+0.006
in+0.006
eros+0.006
Neg logit contributions
 hardly-0.009
cases-0.009
tered-0.008
 barely-0.008
 nearly-0.008
Token in
Token ID287
Feature activation-1.88e+00

Pos logit contributions
smart+0.030
itty+0.028
 course+0.027
ang+0.026
rob+0.026
Neg logit contributions
 hardly-0.026
cases-0.024
tered-0.023
 couldn-0.022
 barely-0.022
Token to
Token ID284
Feature activation-8.84e-01

Pos logit contributions
smart+0.014
itty+0.013
 course+0.012
in+0.012
eros+0.012
Neg logit contributions
 hardly-0.019
cases-0.018
tered-0.018
 barely-0.016
 nearly-0.016
Token Ben
Token ID3932
Feature activation+5.82e-01

Pos logit contributions
smart+0.020
itty+0.018
 course+0.017
in+0.016
ang+0.016
Neg logit contributions
 hardly-0.024
cases-0.023
tered-0.022
 barely-0.021
 nearly-0.020
Token.
Token ID13
Feature activation-9.41e-01

Pos logit contributions
 hardly+0.013
cases+0.013
tered+0.013
 closely+0.012
 curiously+0.011
Neg logit contributions
smart-0.014
 course-0.014
rob-0.014
itty-0.014
eros-0.013
Token "
Token ID366
Feature activation+1.61e+00

Pos logit contributions
smart+0.019
itty+0.017
 course+0.017
in+0.016
eros+0.016
Neg logit contributions
 hardly-0.027
cases-0.026
tered-0.025
 barely-0.024
 nearly-0.023
TokenMaybe
Token ID13300
Feature activation+7.05e-01

Pos logit contributions
 hardly+0.048
cases+0.046
tered+0.045
 barely+0.042
 nearly+0.041
Neg logit contributions
smart-0.031
itty-0.027
 course-0.027
eros-0.025
in-0.025
Token the
Token ID262
Feature activation-8.32e-01

Pos logit contributions
 hardly+0.019
cases+0.019
tered+0.018
 barely+0.016
 nearly+0.016
Neg logit contributions
smart-0.015
itty-0.014
 course-0.014
in-0.013
eros-0.013
Token dog
Token ID3290
Feature activation-1.33e+00

Pos logit contributions
smart+0.014
itty+0.011
in+0.010
ang+0.010
etchup+0.010
Neg logit contributions
 hardly-0.028
cases-0.027
tered-0.026
 barely-0.025
 floor-0.024
Token will
Token ID481
Feature activation+2.03e-01

Pos logit contributions
smart+0.028
itty+0.025
 course+0.025
in+0.023
eros+0.023
Neg logit contributions
 hardly-0.037
cases-0.036
tered-0.035
 barely-0.032
 nearly-0.031
Token go
Token ID467
Feature activation-4.93e-01

Pos logit contributions
cases+0.006
 hardly+0.005
tered+0.005
abs+0.005
late+0.005
Neg logit contributions
smart-0.004
 welcome-0.004
 course-0.004
itty-0.003
 living-0.003
Token away
Token ID1497
Feature activation+8.30e-01

Pos logit contributions
smart+0.007
itty+0.006
 course+0.005
in+0.005
eros+0.005
Neg logit contributions
 hardly-0.017
cases-0.017
tered-0.017
 barely-0.015
abs-0.015
Token."
Token ID526
Feature activation-1.25e-01

Pos logit contributions
 hardly+0.021
cases+0.020
tered+0.019
 barely+0.018
 nearly+0.017
Neg logit contributions
smart-0.020
itty-0.018
 course-0.017
in-0.017
eros-0.017
Token looked
Token ID3114
Feature activation-1.53e-01

Pos logit contributions
smart+0.010
 course+0.009
itty+0.009
in+0.008
 living+0.008
Neg logit contributions
cases-0.011
 hardly-0.010
tered-0.010
late-0.010
abs-0.009
Token up
Token ID510
Feature activation-2.82e-01

Pos logit contributions
smart+0.003
itty+0.003
 course+0.003
in+0.003
 welcome+0.003
Neg logit contributions
 hardly-0.004
cases-0.004
tered-0.004
 barely-0.004
late-0.003
Token at
Token ID379
Feature activation-5.66e-01

Pos logit contributions
smart+0.005
in+0.004
 course+0.004
itty+0.004
 living+0.004
Neg logit contributions
 hardly-0.009
cases-0.009
tered-0.009
abs-0.008
late-0.008
Token the
Token ID262
Feature activation+2.18e-01

Pos logit contributions
smart+0.012
itty+0.010
ang+0.010
eros+0.010
etchup+0.010
Neg logit contributions
 hardly-0.016
cases-0.015
tered-0.015
 barely-0.014
 closely-0.014
Token tree
Token ID5509
Feature activation-7.33e-01

Pos logit contributions
cases+0.007
 hardly+0.007
tered+0.006
abs+0.006
late+0.006
Neg logit contributions
 course-0.004
smart-0.004
itty-0.004
 living-0.003
 welcome-0.003
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
smart+0.019
itty+0.017
 course+0.017
in+0.017
eros+0.017
Neg logit contributions
 hardly-0.017
cases-0.016
tered-0.015
 barely-0.014
 nearly-0.013
Token waved
Token ID26834
Feature activation-1.08e+00

Pos logit contributions
smart+0.026
itty+0.023
 course+0.023
in+0.022
eros+0.022
Neg logit contributions
 hardly-0.026
cases-0.025
tered-0.024
 barely-0.022
 nearly-0.021
Token goodbye
Token ID24829
Feature activation-3.59e-01

Pos logit contributions
smart+0.025
itty+0.023
 course+0.023
etchup+0.022
eros+0.022
Neg logit contributions
 hardly-0.028
cases-0.026
tered-0.026
 barely-0.024
 closely-0.023
Token as
Token ID355
Feature activation+9.76e-01

Pos logit contributions
smart+0.007
itty+0.006
eros+0.006
 living+0.006
in+0.005
Neg logit contributions
 hardly-0.010
cases-0.010
tered-0.010
abs-0.009
late-0.009
Token she
Token ID673
Feature activation-1.56e+00

Pos logit contributions
 hardly+0.028
cases+0.028
tered+0.026
abs+0.024
late+0.024
Neg logit contributions
smart-0.020
itty-0.017
 course-0.017
in-0.017
 welcome-0.016
Token started
Token ID2067
Feature activation-8.44e-01

Pos logit contributions
smart+0.038
itty+0.035
 course+0.034
in+0.033
eros+0.032
Neg logit contributions
 hardly-0.038
cases-0.037
tered-0.035
 barely-0.032
late-0.031
Token voice
Token ID3809
Feature activation-5.49e-01

Pos logit contributions
 hardly+0.018
cases+0.016
tered+0.016
 barely+0.015
 nearly+0.014
Neg logit contributions
smart-0.026
itty-0.024
rob-0.024
etchup-0.024
 course-0.023
Token.
Token ID13
Feature activation-8.73e-01

Pos logit contributions
smart+0.013
itty+0.013
etchup+0.012
eros+0.012
 course+0.012
Neg logit contributions
 hardly-0.013
cases-0.013
tered-0.012
 closely-0.011
 nearly-0.011
Token It
Token ID632
Feature activation-1.55e+00

Pos logit contributions
smart+0.019
 course+0.018
itty+0.018
in+0.017
rob+0.017
Neg logit contributions
 hardly-0.023
cases-0.022
tered-0.021
 couldn-0.020
 barely-0.019
Token was
Token ID373
Feature activation-4.76e-01

Pos logit contributions
smart+0.045
itty+0.042
 course+0.041
etchup+0.041
eros+0.040
Neg logit contributions
 hardly-0.031
cases-0.029
tered-0.028
 barely-0.025
 couldn-0.025
Token a
Token ID257
Feature activation-4.83e-01

Pos logit contributions
smart+0.011
itty+0.010
 course+0.010
in+0.009
eros+0.009
Neg logit contributions
 hardly-0.013
cases-0.012
tered-0.012
 barely-0.011
 nearly-0.011
Token friendly
Token ID8030
Feature activation-3.08e+00

Pos logit contributions
smart+0.013
rob+0.011
 course+0.011
itty+0.011
 Working+0.010
Neg logit contributions
 hardly-0.011
tered-0.010
 barely-0.010
cases-0.010
 nearly-0.010
Token old
Token ID1468
Feature activation-3.90e+00

Pos logit contributions
smart+0.064
rob+0.055
etchup+0.055
ang+0.053
 course+0.052
Neg logit contributions
 hardly-0.083
tered-0.079
cases-0.075
 closely-0.073
 floor-0.073
Token man
Token ID582
Feature activation-1.62e+00

Pos logit contributions
smart+0.088
etchup+0.073
ang+0.073
itty+0.072
 Working+0.071
Neg logit contributions
 hardly-0.099
cases-0.093
tered-0.092
 floor-0.091
 barely-0.087
Token.
Token ID13
Feature activation-2.67e-01

Pos logit contributions
smart+0.038
rob+0.036
 course+0.035
itty+0.034
eros+0.034
Neg logit contributions
 hardly-0.037
cases-0.035
tered-0.035
 couldn-0.033
 closely-0.033
Token "
Token ID366
Feature activation-7.74e-01

Pos logit contributions
smart+0.006
rob+0.006
 course+0.006
in+0.006
itty+0.006
Neg logit contributions
 hardly-0.006
cases-0.006
tered-0.006
 couldn-0.005
 nearly-0.005
TokenHello
Token ID15496
Feature activation-9.33e-01

Pos logit contributions
smart+0.015
eros+0.014
 course+0.013
etchup+0.013
rob+0.013
Neg logit contributions
 hardly-0.022
tered-0.021
cases-0.020
 nearly-0.019
 barely-0.019
Token shoes
Token ID10012
Feature activation-2.78e-01

Pos logit contributions
smart+0.039
itty+0.035
ang+0.033
 course+0.033
rob+0.032
Neg logit contributions
 hardly-0.042
cases-0.040
tered-0.038
 nearly-0.036
 floor-0.036
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
smart+0.006
itty+0.006
 course+0.005
eros+0.005
in+0.005
Neg logit contributions
 hardly-0.007
cases-0.007
tered-0.007
 barely-0.006
 nearly-0.006
Token his
Token ID465
Feature activation+3.01e-01

Pos logit contributions
itty+0.024
smart+0.024
etchup+0.022
eros+0.022
ang+0.022
Neg logit contributions
 hardly-0.027
tered-0.025
cases-0.023
 barely-0.023
 closely-0.022
Token friends
Token ID2460
Feature activation-7.96e-01

Pos logit contributions
 hardly+0.008
cases+0.007
tered+0.007
 barely+0.007
 couldn+0.007
Neg logit contributions
smart-0.007
itty-0.006
 course-0.006
rob-0.006
etchup-0.006
Token.
Token ID13
Feature activation-8.28e-01

Pos logit contributions
smart+0.020
itty+0.019
eros+0.018
 course+0.018
etchup+0.018
Neg logit contributions
 hardly-0.018
cases-0.018
tered-0.017
 closely-0.016
 nearly-0.015
Token<|endoftext|>
Token ID50256
Feature activation-2.98e+00

Pos logit contributions
smart+0.019
itty+0.018
eros+0.017
in+0.017
 course+0.017
Neg logit contributions
 hardly-0.022
cases-0.021
tered-0.020
 barely-0.018
 couldn-0.018
TokenOnce
Token ID7454
Feature activation-2.58e-01

Pos logit contributions
smart+0.059
eros+0.047
itty+0.047
ang+0.047
 passenger+0.047
Neg logit contributions
 hardly-0.081
cases-0.079
tered-0.078
 nearly-0.074
 couldn-0.074
Token upon
Token ID2402
Feature activation+2.02e+00

Pos logit contributions
smart+0.008
itty+0.007
 course+0.007
in+0.007
eros+0.007
Neg logit contributions
 hardly-0.005
cases-0.005
tered-0.004
 barely-0.004
 nearly-0.004
Token a
Token ID257
Feature activation+1.80e+00

Pos logit contributions
cases+0.069
 hardly+0.065
late+0.064
rated+0.063
aned+0.063
Neg logit contributions
smart-0.027
 living-0.025
 course-0.025
 welcome-0.023
in-0.023
Token time
Token ID640
Feature activation+2.38e-01

Pos logit contributions
cases+0.044
 hardly+0.041
tered+0.038
 barely+0.035
aned+0.034
Neg logit contributions
 course-0.045
smart-0.043
itty-0.043
 welcome-0.042
in-0.041
Token there
Token ID612
Feature activation-1.01e+00

Pos logit contributions
 hardly+0.006
cases+0.006
tered+0.006
 barely+0.005
 nearly+0.005
Neg logit contributions
smart-0.005
itty-0.005
 course-0.005
eros-0.004
in-0.004
Token rare
Token ID4071
Feature activation-1.07e+00

Pos logit contributions
 hardly+0.003
cases+0.003
tered+0.003
 barely+0.003
 nearly+0.003
Neg logit contributions
smart-0.002
itty-0.002
 course-0.002
in-0.002
eros-0.002
Token taps
Token ID34531
Feature activation+4.53e-01

Pos logit contributions
smart+0.025
itty+0.023
 course+0.023
in+0.022
eros+0.021
Neg logit contributions
 hardly-0.027
cases-0.026
tered-0.025
 barely-0.023
 nearly-0.022
Token on
Token ID319
Feature activation+3.47e-01

Pos logit contributions
 hardly+0.011
cases+0.011
tered+0.010
 barely+0.009
 nearly+0.009
Neg logit contributions
smart-0.011
itty-0.010
 course-0.010
eros-0.010
etchup-0.009
Token dresses
Token ID27309
Feature activation+1.78e+00

Pos logit contributions
 hardly+0.010
cases+0.010
tered+0.010
 barely+0.009
 nearly+0.009
Neg logit contributions
smart-0.007
 course-0.006
itty-0.006
in-0.005
eros-0.005
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
 hardly+0.047
cases+0.044
tered+0.043
 barely+0.039
rated+0.039
Neg logit contributions
smart-0.040
 course-0.036
itty-0.036
in-0.034
ang-0.033
Token<|endoftext|>
Token ID50256
Feature activation-3.09e+00

Pos logit contributions
smart+0.035
rob+0.033
eros+0.033
eper+0.033
ang+0.032
Neg logit contributions
 hardly-0.035
tered-0.030
cases-0.029
 couldn-0.029
 barely-0.029
TokenOnce
Token ID7454
Feature activation+7.70e-02

Pos logit contributions
smart+0.057
 passenger+0.045
eros+0.045
ang+0.044
 course+0.043
Neg logit contributions
 hardly-0.088
cases-0.086
tered-0.086
 couldn-0.082
 nearly-0.081
Token there
Token ID612
Feature activation-9.77e-01

Pos logit contributions
 hardly+0.001
cases+0.001
tered+0.001
 barely+0.001
late+0.001
Neg logit contributions
smart-0.002
itty-0.002
 course-0.002
in-0.002
ang-0.002
Token were
Token ID547
Feature activation+3.58e-01

Pos logit contributions
smart+0.029
itty+0.026
 course+0.025
eros+0.025
etchup+0.025
Neg logit contributions
 hardly-0.019
cases-0.018
tered-0.017
 couldn-0.016
 barely-0.016
Token two
Token ID734
Feature activation+9.03e-01

Pos logit contributions
 hardly+0.010
cases+0.010
tered+0.010
 barely+0.009
 nearly+0.009
Neg logit contributions
smart-0.007
itty-0.007
 course-0.006
in-0.006
eros-0.006
Token friends
Token ID2460
Feature activation+1.43e-03

Pos logit contributions
 hardly+0.033
cases+0.032
tered+0.032
 barely+0.030
 nearly+0.030
Neg logit contributions
smart-0.011
itty-0.009
 course-0.009
in-0.008
 living-0.008
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
smart+0.026
 course+0.023
itty+0.023
eros+0.022
in+0.021
Neg logit contributions
 hardly-0.037
cases-0.036
tered-0.034
 barely-0.032
abs-0.031
Token Then
Token ID3244
Feature activation-8.32e-01

Pos logit contributions
smart+0.003
itty+0.003
 welcome+0.003
 course+0.003
 living+0.003
Neg logit contributions
cases-0.005
 hardly-0.005
tered-0.005
rated-0.005
icing-0.005
Token she
Token ID673
Feature activation-7.95e-01

Pos logit contributions
smart+0.019
itty+0.017
 course+0.017
in+0.016
eros+0.016
Neg logit contributions
 hardly-0.022
cases-0.021
tered-0.020
 barely-0.019
 nearly-0.018
Token saw
Token ID2497
Feature activation-9.83e-01

Pos logit contributions
smart+0.018
itty+0.016
in+0.015
 course+0.015
o+0.014
Neg logit contributions
 hardly-0.021
tered-0.020
cases-0.020
late-0.019
abs-0.018
Token a
Token ID257
Feature activation-7.84e-01

Pos logit contributions
smart+0.021
 course+0.018
itty+0.018
in+0.017
eros+0.017
Neg logit contributions
 hardly-0.028
cases-0.027
tered-0.026
abs-0.024
 barely-0.024
Token tiny
Token ID7009
Feature activation-2.91e+00

Pos logit contributions
smart+0.021
 course+0.019
rob+0.019
itty+0.019
etchup+0.018
Neg logit contributions
 hardly-0.017
cases-0.015
tered-0.015
 barely-0.014
 couldn-0.014
Token mouse
Token ID10211
Feature activation-3.23e+00

Pos logit contributions
smart+0.055
rob+0.048
 course+0.048
itty+0.046
 Friendship+0.044
Neg logit contributions
 hardly-0.081
tered-0.079
cases-0.078
 closely-0.076
 curiously-0.074
Token who
Token ID508
Feature activation-1.76e+00

Pos logit contributions
smart+0.079
rob+0.079
etchup+0.072
 Friendship+0.071
 course+0.070
Neg logit contributions
 hardly-0.065
 closely-0.065
 curiously-0.062
tered-0.062
cases-0.060
Token was
Token ID373
Feature activation-1.75e+00

Pos logit contributions
smart+0.045
itty+0.041
 course+0.040
in+0.039
ang+0.038
Neg logit contributions
 hardly-0.041
cases-0.040
tered-0.038
abs-0.034
 barely-0.034
Token running
Token ID2491
Feature activation+2.85e-01

Pos logit contributions
smart+0.036
 course+0.033
itty+0.033
rob+0.031
eros+0.031
Neg logit contributions
 hardly-0.049
tered-0.045
cases-0.045
 barely-0.043
 nearly-0.043
Token around
Token ID1088
Feature activation-5.48e-01

Pos logit contributions
 hardly+0.009
cases+0.008
tered+0.008
 barely+0.007
abs+0.007
Neg logit contributions
smart-0.006
 course-0.005
itty-0.005
in-0.005
ang-0.004
Token.
Token ID13
Feature activation-6.42e-01

Pos logit contributions
smart+0.024
itty+0.023
rob+0.023
eros+0.023
 course+0.023
Neg logit contributions
 hardly-0.021
cases-0.020
tered-0.020
 nearly-0.018
 closely-0.018
Token<|endoftext|>
Token ID50256
Feature activation-3.11e+00

Pos logit contributions
smart+0.015
rob+0.014
in+0.014
 course+0.013
eros+0.013
Neg logit contributions
 hardly-0.016
tered-0.015
cases-0.014
 couldn-0.014
 barely-0.014
TokenJohn
Token ID7554
Feature activation+7.87e-01

Pos logit contributions
smart+0.061
 passenger+0.052
eros+0.051
ang+0.050
eper+0.050
Neg logit contributions
 hardly-0.079
tered-0.077
cases-0.076
 nearly-0.073
case-0.073
Token was
Token ID373
Feature activation+1.25e+00

Pos logit contributions
 hardly+0.020
cases+0.019
tered+0.018
 barely+0.016
late+0.016
Neg logit contributions
smart-0.019
itty-0.018
 course-0.017
in-0.017
etchup-0.016
Token a
Token ID257
Feature activation-9.83e-02

Pos logit contributions
 hardly+0.038
cases+0.037
tered+0.036
 barely+0.033
 nearly+0.032
Neg logit contributions
smart-0.023
itty-0.021
 course-0.020
in-0.019
ang-0.019
Token young
Token ID1862
Feature activation-2.99e+00

Pos logit contributions
smart+0.003
 course+0.002
itty+0.002
rob+0.002
eros+0.002
Neg logit contributions
 hardly-0.002
cases-0.002
tered-0.002
 barely-0.002
 nearly-0.002
Token boy
Token ID2933
Feature activation-8.09e-01

Pos logit contributions
smart+0.067
 course+0.056
itty+0.052
rob+0.051
 Working+0.050
Neg logit contributions
tered-0.075
 hardly-0.075
cases-0.074
 closely-0.070
 almost-0.069
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
smart+0.019
rob+0.017
eros+0.017
 course+0.017
itty+0.016
Neg logit contributions
 hardly-0.018
tered-0.017
 couldn-0.017
cases-0.017
 almost-0.016
Token He
Token ID679
Feature activation-6.86e-01

Pos logit contributions
smart+0.003
itty+0.003
etchup+0.003
ang+0.003
in+0.003
Neg logit contributions
 hardly-0.003
cases-0.003
tered-0.003
 barely-0.003
 closely-0.003
Token was
Token ID373
Feature activation+7.81e-02

Pos logit contributions
smart+0.017
itty+0.016
 course+0.016
in+0.015
 living+0.015
Neg logit contributions
cases-0.016
 hardly-0.016
tered-0.015
late-0.014
abs-0.013
Token only
Token ID691
Feature activation-1.60e+00

Pos logit contributions
 hardly+0.002
tered+0.002
cases+0.002
 barely+0.002
 nearly+0.002
Neg logit contributions
smart-0.002
 course-0.002
itty-0.002
eros-0.002
rob-0.002