TOP ACTIVATIONS
MAX = 4.653

The water from outside started se eping into the house ,
shore and saw lots of se ag ull s flying in
She cut and glued and se wed different fabrics together .
watched as her mom carefully se wed Ted 's arm back
Suddenly , a se ag ull came and landed
After lots of squ ashing , it was
and learned something new about se ag ull s . <|endoftext|>
every time . After lots of repeating , she finally created
. She grabbed the fire extingu isher and put out the
After lots of squ ashing , it was time
After lots of hard work , they proudly
she turned on the f auc et , no water came
watched as his mom carefully se wed the stitch in the
to turn off the f auc et but it was too
. With her new beautiful voice , the opera singer was
When the door opened , bright light shone out
you ." With a kind voice , the snake asked the
. They opened the f auc et and put their letters
When the windows were opened , a faint breeze came
away , they saw a se ag ull fly down and

MIN ACTIVATIONS
MIN = -4.272

lead him around the books he lf . After a while
the bear and the books he lf . She stayed in
It became the happiest books he lf in the world !
lead him around the books he lf . After a while
most perfect present ! Sy lves ter was so happy and
beneath the j ee pt he same je ep that they
showed her friend the books he lf . From then on
and saw the gloomy books he lf .
" Hello , Mister P ige on !" The
to see the rare books he lf . They looked at
storm came and the books he lf fell over . All
, or behind the books he lf , or in the
is hiding behind the books he lf . The cat jumps
tiny pink car ! J ax was about to swim away
and upset that her books he lf was a mess .
red ball behind the books he lf . Tom tried to
flowers and taking in the truly incredible view around him .
ferocious purple snake ! J ax was terrified and he ran
showed her friend the books he lf . But
the books on the books he lf . Billy

INTERVAL 2.422 to 4.653
CONTAINS 0.153%

Suddenly , he heard a voice from above . It was
. They have a lot of fun together . Anna thinks
. After a while the pigeon fl apped
the new world . Using their imag inations , they played
long as we ' re together , we can

INTERVAL 0.190 to 2.422
CONTAINS 41.727%

" That was a close call . You were brave ."
ow !" Ben says . He is very hurt . He
opened the box , and there was his uniform ! Everyone
pizza . Lily wanted cheese pizza , but Ben wanted pepper
" Hello , Lily ," said the bear . "

INTERVAL -2.041 to 0.190
CONTAINS 57.456%

ila , our tower is so tall and pretty . We
cheered for him . The young boy was no
sliding down . Tim and Sue became good friends and played
inside and saw a little kitten hiding there . " Oh
, beyond the leaves and grass . All y

INTERVAL -4.272 to -2.041
CONTAINS 0.665%

Mia and Leo love Sam very much . One
, you must reach me ! But be careful , the
was in charge of making sure there were no distractions .
t eddy bear and ran back to his house . He
basket from Lily and ran away . He laughed and said
Token The
Token ID383
Feature activation+1.29e+00

Pos logit contributions
 lick+0.023
 Sue+0.023
 sill+0.019
 shake+0.018
 nod+0.018
Neg logit contributions
ind-0.040
opter-0.040
us-0.039
 targets-0.039
OO-0.039
Token water
Token ID1660
Feature activation-6.30e-01

Pos logit contributions
 Sue+0.037
 lick+0.035
 sill+0.032
 shake+0.029
 pur+0.029
Neg logit contributions
ind-0.050
eds-0.049
opter-0.049
 targets-0.049
us-0.048
Token from
Token ID422
Feature activation+5.52e-01

Pos logit contributions
ind+0.025
 targets+0.024
opter+0.024
us+0.024
eds+0.024
Neg logit contributions
 Sue-0.018
 lick-0.017
 sill-0.015
 shake-0.015
 pur-0.014
Token outside
Token ID2354
Feature activation+9.38e-02

Pos logit contributions
 Sue+0.014
 lick+0.012
 sill+0.011
 Amy+0.010
 Lucy+0.010
Neg logit contributions
opter-0.024
eds-0.024
opus-0.024
us-0.024
ind-0.024
Token started
Token ID2067
Feature activation-9.00e-02

Pos logit contributions
 Sue+0.003
 lick+0.002
 sill+0.002
 shake+0.002
 Amy+0.002
Neg logit contributions
ind-0.004
 targets-0.004
us-0.004
opus-0.004
eds-0.004
Token se
Token ID384
Feature activation+4.65e+00

Pos logit contributions
ind+0.003
eds+0.003
opter+0.003
 targets+0.003
us+0.003
Neg logit contributions
 Sue-0.003
 lick-0.003
 sill-0.003
 shake-0.003
 pur-0.003
Tokeneping
Token ID7213
Feature activation+1.53e+00

Pos logit contributions
 Sue+0.160
 lick+0.148
 sill+0.137
 Amy+0.130
 shake+0.128
Neg logit contributions
ind-0.160
eds-0.158
us-0.152
 targets-0.151
opter-0.151
Token into
Token ID656
Feature activation+1.25e+00

Pos logit contributions
 Sue+0.051
 lick+0.048
 sill+0.045
 shake+0.042
 pur+0.041
Neg logit contributions
ind-0.052
eds-0.051
opter-0.051
 targets-0.050
us-0.050
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
 Sue+0.031
 lick+0.027
 sill+0.025
 shake+0.023
 Amy+0.023
Neg logit contributions
opus-0.055
ind-0.055
eds-0.054
opter-0.054
us-0.053
Token house
Token ID2156
Feature activation+1.18e+00

Pos logit contributions
ind+0.004
opter+0.004
eds+0.004
us+0.004
 targets+0.004
Neg logit contributions
 Sue-0.003
 lick-0.003
 sill-0.003
 shake-0.003
 wipe-0.003
Token,
Token ID11
Feature activation-5.41e-01

Pos logit contributions
 Sue+0.039
 lick+0.037
 sill+0.034
 shake+0.032
 pur+0.031
Neg logit contributions
ind-0.040
eds-0.039
 targets-0.039
opter-0.039
us-0.038
Token shore
Token ID15191
Feature activation-1.08e+00

Pos logit contributions
OO+0.003
opter+0.003
ino+0.003
us+0.003
oly+0.002
Neg logit contributions
 lick-0.002
 Sue-0.002
 shake-0.002
 sill-0.002
 crunch-0.002
Token and
Token ID290
Feature activation-8.78e-01

Pos logit contributions
ind+0.036
opter+0.035
 targets+0.035
eds+0.035
opus+0.035
Neg logit contributions
 Sue-0.037
 lick-0.035
 sill-0.033
 shake-0.030
 nod-0.030
Token saw
Token ID2497
Feature activation-1.78e-01

Pos logit contributions
ind+0.028
eds+0.028
 targets+0.028
opter+0.027
us+0.027
Neg logit contributions
 Sue-0.030
 lick-0.029
 sill-0.027
 shake-0.025
 nod-0.025
Token lots
Token ID6041
Feature activation-1.22e+00

Pos logit contributions
ind+0.007
eds+0.007
 targets+0.007
opter+0.007
us+0.006
Neg logit contributions
 Sue-0.005
 lick-0.005
 sill-0.005
 shake-0.004
 pur-0.004
Token of
Token ID286
Feature activation+1.77e+00

Pos logit contributions
ind+0.044
eds+0.043
opter+0.043
 targets+0.043
us+0.042
Neg logit contributions
 Sue-0.039
 lick-0.037
 sill-0.033
 shake-0.031
 pur-0.031
Token se
Token ID384
Feature activation+4.40e+00

Pos logit contributions
 Sue+0.054
 lick+0.049
 sill+0.045
 crunch+0.043
 shake+0.043
Neg logit contributions
eds-0.066
OO-0.066
opter-0.065
ind-0.065
opus-0.064
Tokenag
Token ID363
Feature activation-5.27e-01

Pos logit contributions
 Sue+0.149
 lick+0.136
 sill+0.129
 Amy+0.122
 shake+0.120
Neg logit contributions
ind-0.155
eds-0.153
us-0.150
opter-0.146
 targets-0.144
Tokenull
Token ID724
Feature activation+7.88e-01

Pos logit contributions
 targets+0.013
opus+0.013
ind+0.013
opter+0.013
 passport+0.013
Neg logit contributions
 Sue-0.022
 lick-0.019
 sill-0.019
 Amy-0.018
 shake-0.018
Tokens
Token ID82
Feature activation+1.68e-01

Pos logit contributions
 Sue+0.038
 lick+0.037
 sill+0.034
 shake+0.033
 pur+0.033
Neg logit contributions
ind-0.015
 targets-0.014
opter-0.014
eds-0.014
us-0.013
Token flying
Token ID7348
Feature activation-1.24e-01

Pos logit contributions
 Sue+0.005
 lick+0.004
 sill+0.004
 shake+0.004
 pur+0.004
Neg logit contributions
ind-0.007
opter-0.007
eds-0.007
 targets-0.007
us-0.006
Token in
Token ID287
Feature activation+4.54e-01

Pos logit contributions
ind+0.005
eds+0.004
opter+0.004
 targets+0.004
us+0.004
Neg logit contributions
 Sue-0.004
 lick-0.004
 sill-0.003
 shake-0.003
 pur-0.003
Token She
Token ID1375
Feature activation+9.85e-01

Pos logit contributions
 lick+0.005
 shake+0.004
 Sue+0.004
 sill+0.004
 crunch+0.004
Neg logit contributions
ind-0.008
us-0.008
 targets-0.007
utes-0.007
ino-0.007
Token cut
Token ID2005
Feature activation-3.24e-01

Pos logit contributions
 Sue+0.026
 lick+0.025
 sill+0.022
 pur+0.021
 shake+0.021
Neg logit contributions
eds-0.041
ind-0.040
 targets-0.040
opus-0.039
opter-0.039
Token and
Token ID290
Feature activation+3.73e-01

Pos logit contributions
ind+0.012
eds+0.012
 targets+0.011
us+0.011
opter+0.011
Neg logit contributions
 Sue-0.010
 lick-0.010
 sill-0.008
 nod-0.008
 pur-0.008
Token glued
Token ID39783
Feature activation+1.44e+00

Pos logit contributions
 Sue+0.012
 lick+0.011
 sill+0.010
 nod+0.009
 Amy+0.009
Neg logit contributions
 targets-0.013
eds-0.013
ind-0.012
opter-0.012
us-0.012
Token and
Token ID290
Feature activation+1.60e+00

Pos logit contributions
 Sue+0.052
 lick+0.050
 sill+0.045
 nod+0.043
 pur+0.043
Neg logit contributions
ind-0.043
 targets-0.043
eds-0.043
us-0.042
opter-0.041
Token se
Token ID384
Feature activation+4.23e+00

Pos logit contributions
 Sue+0.052
 lick+0.048
 sill+0.044
 nod+0.042
 Amy+0.041
Neg logit contributions
 targets-0.054
us-0.054
eds-0.054
ind-0.054
opter-0.053
Tokenwed
Token ID19103
Feature activation+8.85e-01

Pos logit contributions
 Sue+0.164
 lick+0.155
 sill+0.145
 shake+0.138
 Amy+0.136
Neg logit contributions
ind-0.122
eds-0.122
us-0.119
 targets-0.118
opter-0.118
Token different
Token ID1180
Feature activation+6.40e-01

Pos logit contributions
 Sue+0.031
 lick+0.031
 sill+0.028
 nod+0.027
 pur+0.026
Neg logit contributions
ind-0.027
us-0.027
 targets-0.027
eds-0.027
opter-0.026
Token fabrics
Token ID39533
Feature activation+1.50e-01

Pos logit contributions
 Sue+0.023
 lick+0.022
 sill+0.020
 shake+0.019
 wipe+0.018
Neg logit contributions
ind-0.021
opus-0.020
eds-0.020
opter-0.020
ich-0.020
Token together
Token ID1978
Feature activation+1.16e+00

Pos logit contributions
 Sue+0.004
 lick+0.004
 sill+0.004
 shake+0.003
 pur+0.003
Neg logit contributions
ind-0.006
eds-0.006
 targets-0.006
opter-0.006
us-0.006
Token.
Token ID13
Feature activation+5.02e-01

Pos logit contributions
 Sue+0.034
 lick+0.032
 sill+0.028
 pur+0.026
 nod+0.026
Neg logit contributions
ind-0.043
 targets-0.043
eds-0.043
opter-0.042
us-0.042
Token watched
Token ID7342
Feature activation+8.35e-01

Pos logit contributions
 Sue+0.019
 lick+0.018
 sill+0.017
 shake+0.015
 Amy+0.015
Neg logit contributions
ind-0.016
eds-0.016
 targets-0.016
opter-0.016
us-0.016
Token as
Token ID355
Feature activation-3.60e-01

Pos logit contributions
 Sue+0.026
 lick+0.026
 sill+0.024
 nod+0.021
Deal+0.021
Neg logit contributions
 targets-0.028
ind-0.027
eds-0.027
us-0.027
opter-0.026
Token her
Token ID607
Feature activation+1.64e+00

Pos logit contributions
ind+0.016
eds+0.016
 targets+0.016
opter+0.016
us+0.015
Neg logit contributions
 Sue-0.008
 lick-0.008
 sill-0.007
 shake-0.006
 pur-0.006
Token mom
Token ID1995
Feature activation+7.87e-01

Pos logit contributions
 Sue+0.054
 lick+0.051
 sill+0.048
 nod+0.044
 Amy+0.043
Neg logit contributions
ind-0.054
eds-0.052
opus-0.052
 targets-0.052
us-0.051
Token carefully
Token ID7773
Feature activation-3.19e-01

Pos logit contributions
 Sue+0.026
 lick+0.024
 sill+0.023
 Amy+0.020
 crunch+0.020
Neg logit contributions
 targets-0.024
opter-0.024
ind-0.024
us-0.024
ino-0.023
Token se
Token ID384
Feature activation+4.21e+00

Pos logit contributions
 targets+0.011
us+0.010
opter+0.010
eds+0.010
ind+0.010
Neg logit contributions
 Sue-0.010
 lick-0.009
 sill-0.009
 Amy-0.008
 nod-0.008
Tokenwed
Token ID19103
Feature activation+6.52e-01

Pos logit contributions
 Sue+0.159
 lick+0.150
 sill+0.141
 shake+0.133
 pur+0.132
Neg logit contributions
ind-0.126
eds-0.125
 targets-0.121
us-0.120
opter-0.119
Token Ted
Token ID11396
Feature activation+8.18e-01

Pos logit contributions
 Sue+0.021
 lick+0.021
 sill+0.019
 nod+0.018
 pur+0.018
Neg logit contributions
 targets-0.023
us-0.022
ind-0.022
eds-0.022
opter-0.021
Token's
Token ID338
Feature activation+1.22e+00

Pos logit contributions
 Sue+0.032
 lick+0.030
 sill+0.028
 Amy+0.026
 nod+0.026
Neg logit contributions
eds-0.022
 targets-0.022
ind-0.022
us-0.021
opter-0.021
Token arm
Token ID3211
Feature activation+8.44e-01

Pos logit contributions
 Sue+0.041
 lick+0.038
 sill+0.034
 Amy+0.033
 nod+0.031
Neg logit contributions
ind-0.041
eds-0.041
 targets-0.041
opter-0.039
us-0.039
Token back
Token ID736
Feature activation-1.15e+00

Pos logit contributions
 Sue+0.033
 lick+0.031
 nod+0.028
 sill+0.027
 Amy+0.027
Neg logit contributions
 targets-0.023
eds-0.022
us-0.022
ind-0.022
opter-0.022
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
 Sue+0.008
 lick+0.008
 sill+0.007
 Amy+0.006
 shake+0.006
Neg logit contributions
ind-0.019
eds-0.019
 targets-0.019
opter-0.019
us-0.019
Token
Token ID198
Feature activation+1.16e-01

Pos logit contributions
 Sue+0.002
 lick+0.002
 She+0.002
 Amy+0.002
 Lucy+0.002
Neg logit contributions
 targets-0.005
ind-0.005
opter-0.005
eds-0.005
OO-0.005
TokenSuddenly
Token ID38582
Feature activation+1.00e+00

Pos logit contributions
 Sue+0.003
 lick+0.003
 She+0.003
 Amy+0.003
 pur+0.003
Neg logit contributions
 targets-0.004
eds-0.004
opter-0.004
OO-0.004
 enemy-0.004
Token,
Token ID11
Feature activation+3.20e-01

Pos logit contributions
 Sue+0.030
 lick+0.028
 sill+0.025
 shake+0.023
 pur+0.023
Neg logit contributions
ind-0.038
eds-0.038
 targets-0.037
opter-0.037
us-0.037
Token a
Token ID257
Feature activation+5.28e-02

Pos logit contributions
 Sue+0.009
 lick+0.008
 sill+0.007
 pur+0.007
 shake+0.007
Neg logit contributions
ind-0.013
opter-0.012
eds-0.012
 targets-0.012
us-0.012
Token se
Token ID384
Feature activation+4.08e+00

Pos logit contributions
 Sue+0.002
 lick+0.002
 sill+0.001
 shake+0.001
 pur+0.001
Neg logit contributions
ind-0.002
eds-0.002
 targets-0.002
opter-0.002
us-0.002
Tokenag
Token ID363
Feature activation-5.17e-01

Pos logit contributions
 Sue+0.144
 lick+0.134
 sill+0.123
 Amy+0.118
 pur+0.117
Neg logit contributions
ind-0.134
opter-0.129
eds-0.128
us-0.128
OO-0.124
Tokenull
Token ID724
Feature activation+1.43e+00

Pos logit contributions
 targets+0.012
ind+0.012
eds+0.012
opter+0.012
opus+0.011
Neg logit contributions
 Sue-0.022
 lick-0.021
 sill-0.021
 shake-0.020
 nod-0.019
Token came
Token ID1625
Feature activation+9.37e-01

Pos logit contributions
 Sue+0.041
 lick+0.039
 shake+0.034
 pur+0.033
 sill+0.033
Neg logit contributions
ind-0.055
 targets-0.054
opter-0.053
eds-0.053
us-0.052
Token and
Token ID290
Feature activation-3.48e-01

Pos logit contributions
 Sue+0.029
 lick+0.027
 sill+0.025
 shake+0.024
 pur+0.023
Neg logit contributions
opter-0.034
opus-0.034
ind-0.034
 targets-0.033
eds-0.033
Token landed
Token ID11406
Feature activation-1.26e+00

Pos logit contributions
ind+0.013
eds+0.013
 targets+0.013
opter+0.013
us+0.013
Neg logit contributions
 Sue-0.010
 lick-0.010
 sill-0.008
 shake-0.008
 pur-0.008
Token
Token ID220
Feature activation-8.24e-01

Pos logit contributions
 Sue+0.016
 lick+0.014
 sill+0.013
 shake+0.011
 nod+0.011
Neg logit contributions
ind-0.037
eds-0.037
opter-0.036
us-0.036
 targets-0.036
Token
Token ID198
Feature activation+8.91e-02

Pos logit contributions
ind+0.032
eds+0.031
 targets+0.031
opter+0.031
us+0.031
Neg logit contributions
 Sue-0.023
 lick-0.022
 sill-0.020
 shake-0.018
 pur-0.018
Token
Token ID198
Feature activation+7.76e-02

Pos logit contributions
 Sue+0.001
 She+0.001
 +0.001
 Together+0.001
 lick+0.001
Neg logit contributions
 enemy-0.005
opus-0.005
 targets-0.005
ind-0.005
OO-0.005
TokenAfter
Token ID3260
Feature activation-5.06e-01

Pos logit contributions
 Sue+0.002
 She+0.002
 lick+0.002
 shake+0.002
 Together+0.002
Neg logit contributions
eds-0.003
opus-0.003
OO-0.003
us-0.003
 enemy-0.003
Token lots
Token ID6041
Feature activation+1.66e+00

Pos logit contributions
ind+0.019
 targets+0.019
eds+0.018
opter+0.018
us+0.018
Neg logit contributions
 Sue-0.015
 lick-0.014
 sill-0.013
 shake-0.012
 nod-0.012
Token of
Token ID286
Feature activation+4.07e+00

Pos logit contributions
 Sue+0.054
 lick+0.051
 sill+0.045
 shake+0.043
 pur+0.042
Neg logit contributions
ind-0.059
eds-0.058
 targets-0.058
opter-0.057
us-0.056
Token squ
Token ID2809
Feature activation+3.96e+00

Pos logit contributions
 Sue+0.124
 lick+0.117
 sill+0.106
 shake+0.097
 pur+0.097
Neg logit contributions
ind-0.151
eds-0.149
 targets-0.148
opter-0.147
us-0.145
Tokenashing
Token ID2140
Feature activation+1.97e+00

Pos logit contributions
 Sue+0.143
 lick+0.133
 sill+0.130
 Amy+0.119
 shake+0.117
Neg logit contributions
ind-0.125
eds-0.124
 targets-0.120
us-0.119
OO-0.117
Token,
Token ID11
Feature activation+5.44e-01

Pos logit contributions
 Sue+0.061
 lick+0.060
 sill+0.055
 pur+0.050
 shake+0.049
Neg logit contributions
 targets-0.067
eds-0.067
ind-0.067
opter-0.067
us-0.065
Token it
Token ID340
Feature activation+1.25e-01

Pos logit contributions
 Sue+0.017
 lick+0.015
 sill+0.014
 shake+0.013
 She+0.013
Neg logit contributions
eds-0.021
ind-0.020
us-0.020
opus-0.020
opter-0.019
Token was
Token ID373
Feature activation+7.95e-01

Pos logit contributions
 Sue+0.004
 lick+0.004
 sill+0.003
 shake+0.003
 pur+0.003
Neg logit contributions
ind-0.005
eds-0.004
 targets-0.004
opter-0.004
us-0.004
Token and
Token ID290
Feature activation+1.30e-01

Pos logit contributions
 Sue+0.020
 lick+0.019
 sill+0.017
 shake+0.016
 pur+0.016
Neg logit contributions
ind-0.025
eds-0.025
opter-0.025
 targets-0.025
us-0.024
Token learned
Token ID4499
Feature activation-3.52e-01

Pos logit contributions
 Sue+0.004
 lick+0.004
 sill+0.003
 shake+0.003
 nod+0.003
Neg logit contributions
ind-0.005
eds-0.005
 targets-0.005
opter-0.005
us-0.005
Token something
Token ID1223
Feature activation+3.85e-01

Pos logit contributions
eds+0.013
opter+0.013
ind+0.013
opus+0.013
us+0.012
Neg logit contributions
 Sue-0.012
 lick-0.010
 sill-0.010
 shake-0.009
 Amy-0.009
Token new
Token ID649
Feature activation+7.89e-01

Pos logit contributions
 Sue+0.013
 lick+0.011
 sill+0.010
 shake+0.010
 pur+0.010
Neg logit contributions
eds-0.014
opter-0.013
ind-0.013
 targets-0.013
OO-0.013
Token about
Token ID546
Feature activation-5.96e-01

Pos logit contributions
 Sue+0.018
 sill+0.014
 lick+0.013
 Amy+0.013
 She+0.012
Neg logit contributions
ind-0.039
opter-0.038
eds-0.037
opus-0.037
OO-0.036
Token se
Token ID384
Feature activation+4.03e+00

Pos logit contributions
ind+0.022
opter+0.022
eds+0.022
 targets+0.021
us+0.021
Neg logit contributions
 Sue-0.018
 lick-0.017
 sill-0.015
 shake-0.014
 Amy-0.014
Tokenag
Token ID363
Feature activation-6.67e-01

Pos logit contributions
 Sue+0.120
 lick+0.113
 sill+0.102
 shake+0.094
 pur+0.093
Neg logit contributions
ind-0.152
eds-0.150
 targets-0.148
opter-0.148
us-0.147
Tokenull
Token ID724
Feature activation+1.44e+00

Pos logit contributions
opus+0.019
 targets+0.019
opter+0.019
ind+0.018
 passport+0.018
Neg logit contributions
 Sue-0.026
 sill-0.022
 Amy-0.021
 lick-0.021
 Lucy-0.020
Tokens
Token ID82
Feature activation+7.76e-01

Pos logit contributions
 Sue+0.061
 lick+0.059
 sill+0.055
 pur+0.053
 shake+0.052
Neg logit contributions
ind-0.035
opter-0.035
 targets-0.035
OO-0.034
eds-0.034
Token.
Token ID13
Feature activation+2.61e-01

Pos logit contributions
 Sue+0.020
 lick+0.019
 sill+0.018
 shake+0.016
 nod+0.016
Neg logit contributions
opter-0.033
ind-0.032
opus-0.032
us-0.032
eds-0.032
Token<|endoftext|>
Token ID50256
Feature activation-1.50e+00

Pos logit contributions
 Sue+0.007
 lick+0.007
 sill+0.006
 shake+0.006
 wipe+0.006
Neg logit contributions
ind-0.010
eds-0.010
 targets-0.010
opter-0.010
us-0.010
Token every
Token ID790
Feature activation-2.04e+00

Pos logit contributions
ind+0.036
opter+0.036
opus+0.035
eds+0.035
 targets+0.035
Neg logit contributions
 Sue-0.029
 lick-0.026
 sill-0.025
 shake-0.023
 Amy-0.022
Token time
Token ID640
Feature activation+1.38e-01

Pos logit contributions
ind+0.060
eds+0.059
 targets+0.058
opter+0.058
us+0.057
Neg logit contributions
 Sue-0.077
 lick-0.073
 sill-0.068
 pur-0.064
 shake-0.064
Token.
Token ID13
Feature activation+4.17e-01

Pos logit contributions
 Sue+0.004
 lick+0.003
 sill+0.003
 Amy+0.003
 pur+0.003
Neg logit contributions
ind-0.006
eds-0.006
opter-0.006
opus-0.006
 targets-0.006
Token After
Token ID2293
Feature activation-3.38e-01

Pos logit contributions
 lick+0.008
 Sue+0.006
 shake+0.006
 wipe+0.006
 sill+0.006
Neg logit contributions
 targets-0.019
 perspective-0.019
opter-0.019
OO-0.019
us-0.019
Token lots
Token ID6041
Feature activation+1.54e+00

Pos logit contributions
ind+0.013
eds+0.013
 targets+0.013
opter+0.013
us+0.013
Neg logit contributions
 Sue-0.010
 lick-0.009
 sill-0.008
 shake-0.007
 pur-0.007
Token of
Token ID286
Feature activation+4.00e+00

Pos logit contributions
 Sue+0.051
 lick+0.049
 sill+0.045
 shake+0.042
 pur+0.041
Neg logit contributions
ind-0.052
eds-0.051
 targets-0.051
opter-0.050
us-0.050
Token repeating
Token ID20394
Feature activation+7.79e-02

Pos logit contributions
 Sue+0.135
 lick+0.129
 sill+0.120
 nod+0.107
 pur+0.107
Neg logit contributions
ind-0.132
 targets-0.131
opter-0.129
us-0.128
eds-0.128
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
 Sue+0.002
 lick+0.002
 sill+0.002
 shake+0.002
 Amy+0.002
Neg logit contributions
ind-0.003
eds-0.003
opus-0.003
us-0.003
opter-0.003
Token she
Token ID673
Feature activation-1.00e+00

Pos logit contributions
 lick+0.006
 Sue+0.005
 sill+0.005
 wipe+0.004
 blush+0.004
Neg logit contributions
 targets-0.009
ind-0.008
 perspective-0.008
eds-0.008
opter-0.008
Token finally
Token ID3443
Feature activation-2.26e+00

Pos logit contributions
ind+0.039
eds+0.038
 targets+0.038
OO+0.037
opter+0.037
Neg logit contributions
 Sue-0.028
 lick-0.028
 sill-0.026
 pur-0.025
 shake-0.024
Token created
Token ID2727
Feature activation-2.89e-01

Pos logit contributions
ind+0.091
eds+0.088
 targets+0.086
opus+0.085
ich+0.085
Neg logit contributions
 Sue-0.064
 lick-0.061
 sill-0.055
 shake-0.051
 pur-0.050
Token.
Token ID13
Feature activation+5.08e-01

Pos logit contributions
 Sue+0.016
 lick+0.014
 sill+0.013
 shake+0.012
 pur+0.012
Neg logit contributions
eds-0.024
ind-0.024
opter-0.024
 targets-0.023
opus-0.023
Token She
Token ID1375
Feature activation+6.14e-01

Pos logit contributions
 lick+0.009
 Sue+0.008
 sill+0.006
 wipe+0.006
 shake+0.006
Neg logit contributions
ind-0.024
opter-0.024
OO-0.024
 perspective-0.023
 targets-0.023
Token grabbed
Token ID13176
Feature activation+4.54e-01

Pos logit contributions
 Sue+0.018
 lick+0.017
 sill+0.015
 pur+0.015
 shake+0.014
Neg logit contributions
ind-0.024
eds-0.024
 targets-0.023
opter-0.023
us-0.023
Token the
Token ID262
Feature activation+5.43e-01

Pos logit contributions
 lick+0.014
 Sue+0.014
 sill+0.014
Deal+0.013
 pur+0.013
Neg logit contributions
 targets-0.014
 passport-0.013
ind-0.013
us-0.013
eds-0.012
Token fire
Token ID2046
Feature activation+1.61e+00

Pos logit contributions
 Sue+0.017
 lick+0.017
 sill+0.015
 shake+0.014
 wipe+0.014
Neg logit contributions
ind-0.019
eds-0.019
 targets-0.019
opter-0.019
us-0.019
Token extingu
Token ID24995
Feature activation+3.97e+00

Pos logit contributions
 Sue+0.058
 lick+0.055
 sill+0.050
 pur+0.047
 Amy+0.046
Neg logit contributions
ind-0.050
eds-0.049
 targets-0.049
opter-0.049
us-0.048
Tokenisher
Token ID4828
Feature activation+4.46e-02

Pos logit contributions
 Sue+0.213
 lick+0.198
 sill+0.190
 pur+0.187
 Amy+0.186
Neg logit contributions
ind-0.067
eds-0.058
us-0.055
opter-0.053
 targets-0.050
Token and
Token ID290
Feature activation-3.54e-01

Pos logit contributions
 Sue+0.002
 lick+0.001
 sill+0.001
 shake+0.001
 wipe+0.001
Neg logit contributions
ind-0.001
eds-0.001
opter-0.001
 targets-0.001
us-0.001
Token put
Token ID1234
Feature activation+7.19e-01

Pos logit contributions
ind+0.013
eds+0.013
 targets+0.013
opter+0.013
us+0.013
Neg logit contributions
 Sue-0.010
 lick-0.009
 sill-0.009
 nod-0.008
 shake-0.008
Token out
Token ID503
Feature activation-2.39e-01

Pos logit contributions
 Sue+0.034
 lick+0.033
 sill+0.031
 nod+0.030
 pur+0.030
Neg logit contributions
 targets-0.012
us-0.012
eds-0.012
ind-0.011
opter-0.011
Token the
Token ID262
Feature activation-2.21e-01

Pos logit contributions
ind+0.008
 targets+0.008
eds+0.008
 passport+0.008
us+0.008
Neg logit contributions
 lick-0.007
 Sue-0.006
 sill-0.005
Deal-0.005
 nod-0.005
Token
Token ID198
Feature activation+8.91e-02

Pos logit contributions
ind+0.032
eds+0.031
 targets+0.031
opter+0.031
us+0.031
Neg logit contributions
 Sue-0.023
 lick-0.022
 sill-0.020
 shake-0.018
 pur-0.018
Token
Token ID198
Feature activation+7.76e-02

Pos logit contributions
 Sue+0.001
 She+0.001
 +0.001
 Together+0.001
 lick+0.001
Neg logit contributions
 enemy-0.005
opus-0.005
 targets-0.005
ind-0.005
OO-0.005
TokenAfter
Token ID3260
Feature activation-5.06e-01

Pos logit contributions
 Sue+0.002
 She+0.002
 lick+0.002
 shake+0.002
 Together+0.002
Neg logit contributions
eds-0.003
opus-0.003
OO-0.003
us-0.003
 enemy-0.003
Token lots
Token ID6041
Feature activation+1.66e+00

Pos logit contributions
ind+0.019
 targets+0.019
eds+0.018
opter+0.018
us+0.018
Neg logit contributions
 Sue-0.015
 lick-0.014
 sill-0.013
 shake-0.012
 nod-0.012
Token of
Token ID286
Feature activation+4.07e+00

Pos logit contributions
 Sue+0.054
 lick+0.051
 sill+0.045
 shake+0.043
 pur+0.042
Neg logit contributions
ind-0.059
eds-0.058
 targets-0.058
opter-0.057
us-0.056
Token squ
Token ID2809
Feature activation+3.96e+00

Pos logit contributions
 Sue+0.124
 lick+0.117
 sill+0.106
 shake+0.097
 pur+0.097
Neg logit contributions
ind-0.151
eds-0.149
 targets-0.148
opter-0.147
us-0.145
Tokenashing
Token ID2140
Feature activation+1.97e+00

Pos logit contributions
 Sue+0.143
 lick+0.133
 sill+0.130
 Amy+0.119
 shake+0.117
Neg logit contributions
ind-0.125
eds-0.124
 targets-0.120
us-0.119
OO-0.117
Token,
Token ID11
Feature activation+5.44e-01

Pos logit contributions
 Sue+0.061
 lick+0.060
 sill+0.055
 pur+0.050
 shake+0.049
Neg logit contributions
 targets-0.067
eds-0.067
ind-0.067
opter-0.067
us-0.065
Token it
Token ID340
Feature activation+1.25e-01

Pos logit contributions
 Sue+0.017
 lick+0.015
 sill+0.014
 shake+0.013
 She+0.013
Neg logit contributions
eds-0.021
ind-0.020
us-0.020
opus-0.020
opter-0.019
Token was
Token ID373
Feature activation+7.95e-01

Pos logit contributions
 Sue+0.004
 lick+0.004
 sill+0.003
 shake+0.003
 pur+0.003
Neg logit contributions
ind-0.005
eds-0.004
 targets-0.004
opter-0.004
us-0.004
Token time
Token ID640
Feature activation+1.26e+00

Pos logit contributions
 lick+0.029
 Sue+0.028
 sill+0.026
 nod+0.025
 wipe+0.024
Neg logit contributions
ind-0.022
eds-0.022
 targets-0.022
OO-0.021
us-0.021
Token
Token ID220
Feature activation-5.60e-01

Pos logit contributions
 Sue+0.020
 lick+0.019
 sill+0.017
 shake+0.015
 pur+0.015
Neg logit contributions
ind-0.037
eds-0.036
opter-0.036
 targets-0.036
us-0.036
Token
Token ID198
Feature activation+2.31e-02

Pos logit contributions
ind+0.022
eds+0.022
 targets+0.022
opter+0.022
us+0.021
Neg logit contributions
 Sue-0.015
 lick-0.014
 sill-0.013
 shake-0.012
 pur-0.012
Token
Token ID198
Feature activation+3.18e-03

Pos logit contributions
 Sue+0.001
 lick+0.000
 sill+0.000
 Amy+0.000
 She+0.000
Neg logit contributions
ind-0.001
 targets-0.001
eds-0.001
opter-0.001
opus-0.001
TokenAfter
Token ID3260
Feature activation-5.29e-01

Pos logit contributions
 Sue+0.000
 lick+0.000
 Together+0.000
 She+0.000
 wipe+0.000
Neg logit contributions
 targets-0.000
eds-0.000
ind-0.000
OO-0.000
 enemy-0.000
Token lots
Token ID6041
Feature activation+1.64e+00

Pos logit contributions
ind+0.021
eds+0.020
 targets+0.020
opter+0.020
us+0.020
Neg logit contributions
 Sue-0.015
 lick-0.014
 sill-0.013
 shake-0.012
 pur-0.011
Token of
Token ID286
Feature activation+3.95e+00

Pos logit contributions
 Sue+0.053
 lick+0.050
 sill+0.045
 shake+0.042
 pur+0.042
Neg logit contributions
ind-0.058
eds-0.058
 targets-0.057
opter-0.056
us-0.055
Token hard
Token ID1327
Feature activation+1.51e+00

Pos logit contributions
 Sue+0.130
 lick+0.124
 sill+0.113
 shake+0.104
 pur+0.104
Neg logit contributions
ind-0.136
eds-0.134
 targets-0.133
opter-0.133
us-0.131
Token work
Token ID670
Feature activation+1.60e+00

Pos logit contributions
 Sue+0.052
 lick+0.050
 sill+0.046
 shake+0.043
 pur+0.043
Neg logit contributions
ind-0.049
eds-0.048
 targets-0.048
opter-0.047
us-0.047
Token,
Token ID11
Feature activation+8.33e-01

Pos logit contributions
 Sue+0.045
 lick+0.041
 sill+0.037
 shake+0.035
 Amy+0.035
Neg logit contributions
ind-0.064
eds-0.063
 targets-0.062
opus-0.061
opter-0.061
Token they
Token ID484
Feature activation-4.97e-01

Pos logit contributions
 Sue+0.022
 lick+0.021
 sill+0.018
 shake+0.017
 Amy+0.017
Neg logit contributions
ind-0.034
eds-0.034
 targets-0.033
opter-0.033
us-0.033
Token proudly
Token ID23017
Feature activation-1.59e+00

Pos logit contributions
ind+0.021
eds+0.021
 targets+0.020
opus+0.020
ich+0.020
Neg logit contributions
 Sue-0.013
 lick-0.012
 sill-0.011
 shake-0.010
 wipe-0.010
Token she
Token ID673
Feature activation+5.37e-01

Pos logit contributions
 lick+0.014
 sill+0.014
 Sue+0.014
 nod+0.012
 crunch+0.011
Neg logit contributions
opter-0.023
eds-0.023
ind-0.023
us-0.023
 targets-0.023
Token turned
Token ID2900
Feature activation+1.00e+00

Pos logit contributions
 Sue+0.019
 sill+0.018
 lick+0.018
 nod+0.016
 crunch+0.016
Neg logit contributions
us-0.016
eds-0.016
ind-0.015
opus-0.015
OO-0.015
Token on
Token ID319
Feature activation+1.11e+00

Pos logit contributions
 Sue+0.035
 lick+0.033
 sill+0.030
 shake+0.029
 pur+0.028
Neg logit contributions
ind-0.033
opter-0.033
eds-0.033
 targets-0.033
opus-0.032
Token the
Token ID262
Feature activation+5.32e-01

Pos logit contributions
 Sue+0.033
 lick+0.033
 sill+0.031
 nod+0.028
 shake+0.027
Neg logit contributions
ind-0.039
eds-0.038
 targets-0.038
us-0.036
opter-0.036
Token f
Token ID277
Feature activation+9.38e-01

Pos logit contributions
 Sue+0.016
 lick+0.015
 Amy+0.013
 sill+0.013
 pur+0.012
Neg logit contributions
ind-0.019
eds-0.019
us-0.019
opus-0.019
ich-0.019
Tokenauc
Token ID14272
Feature activation+3.94e+00

Pos logit contributions
 Sue+0.036
 lick+0.036
 pur+0.032
 Amy+0.031
 nod+0.031
Neg logit contributions
us-0.029
eds-0.028
ind-0.027
opus-0.027
opter-0.026
Tokenet
Token ID316
Feature activation+6.30e-01

Pos logit contributions
 Sue+0.154
+0.151
 lick+0.144
 Together+0.139
 nod+0.137
Neg logit contributions
eds-0.108
ind-0.097
OO-0.088
us-0.085
ors-0.081
Token,
Token ID11
Feature activation+6.27e-03

Pos logit contributions
 Sue+0.020
 lick+0.019
 sill+0.017
 pur+0.016
 shake+0.016
Neg logit contributions
ind-0.022
eds-0.022
opter-0.022
 targets-0.022
us-0.022
Token no
Token ID645
Feature activation+4.35e-01

Pos logit contributions
 lick+0.000
 sill+0.000
 Sue+0.000
 nod+0.000
 Together+0.000
Neg logit contributions
 targets-0.000
opter-0.000
 passport-0.000
opus-0.000
ind-0.000
Token water
Token ID1660
Feature activation-7.85e-01

Pos logit contributions
 Sue+0.013
 lick+0.012
 sill+0.011
 pur+0.010
 shake+0.010
Neg logit contributions
ind-0.016
eds-0.016
 targets-0.016
opter-0.016
us-0.016
Token came
Token ID1625
Feature activation+1.96e+00

Pos logit contributions
ind+0.031
opter+0.030
 targets+0.030
eds+0.030
us+0.030
Neg logit contributions
 Sue-0.023
 lick-0.021
 sill-0.019
 shake-0.018
 pur-0.017
Token watched
Token ID7342
Feature activation+4.69e-01

Pos logit contributions
ind+0.004
 targets+0.003
eds+0.003
us+0.003
opter+0.003
Neg logit contributions
 lick-0.002
 Sue-0.002
 pur-0.002
 shake-0.002
 sill-0.002
Token as
Token ID355
Feature activation-2.09e-01

Pos logit contributions
 Sue+0.015
 lick+0.014
 sill+0.013
 shake+0.012
 pur+0.012
Neg logit contributions
ind-0.017
eds-0.017
 targets-0.017
opter-0.017
us-0.016
Token his
Token ID465
Feature activation+1.14e+00

Pos logit contributions
ind+0.008
eds+0.008
 targets+0.008
opter+0.008
us+0.008
Neg logit contributions
 Sue-0.006
 lick-0.006
 sill-0.005
 shake-0.005
 pur-0.005
Token mom
Token ID1995
Feature activation+9.21e-01

Pos logit contributions
 Sue+0.032
 lick+0.031
 sill+0.027
 pur+0.025
 shake+0.025
Neg logit contributions
eds-0.045
ind-0.045
opter-0.045
 targets-0.045
us-0.044
Token carefully
Token ID7773
Feature activation-3.75e-01

Pos logit contributions
 Sue+0.027
 lick+0.025
 sill+0.023
 Amy+0.021
 nod+0.020
Neg logit contributions
ind-0.034
eds-0.034
 targets-0.033
opter-0.033
us-0.033
Token se
Token ID384
Feature activation+3.94e+00

Pos logit contributions
 targets+0.014
ind+0.014
eds+0.014
opter+0.014
us+0.014
Neg logit contributions
 Sue-0.011
 lick-0.009
 sill-0.009
 Amy-0.008
 nod-0.007
Tokenwed
Token ID19103
Feature activation+4.53e-01

Pos logit contributions
 Sue+0.146
 lick+0.137
 sill+0.128
 shake+0.122
 Amy+0.121
Neg logit contributions
ind-0.121
eds-0.120
us-0.115
 targets-0.115
opter-0.114
Token the
Token ID262
Feature activation+2.31e-01

Pos logit contributions
 Sue+0.015
 lick+0.014
 sill+0.013
 nod+0.012
 pur+0.012
Neg logit contributions
ind-0.015
 targets-0.015
us-0.015
eds-0.015
opter-0.015
Token stitch
Token ID24695
Feature activation+6.58e-01

Pos logit contributions
 lick+0.007
 Sue+0.006
 sill+0.006
 shake+0.005
 wipe+0.005
Neg logit contributions
eds-0.009
ind-0.009
us-0.009
opter-0.009
 targets-0.009
Token in
Token ID287
Feature activation+1.52e+00

Pos logit contributions
 Sue+0.023
 lick+0.022
 sill+0.019
 nod+0.018
 pur+0.018
Neg logit contributions
 targets-0.021
eds-0.021
ind-0.021
opter-0.020
us-0.020
Token the
Token ID262
Feature activation-2.92e-01

Pos logit contributions
 Sue+0.045
 lick+0.043
 sill+0.038
 pur+0.035
 nod+0.035
Neg logit contributions
ind-0.057
 targets-0.056
eds-0.056
opter-0.055
us-0.055
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
 Sue+0.009
 lick+0.008
 shake+0.007
 pur+0.007
 Amy+0.007
Neg logit contributions
opter-0.011
opus-0.010
ind-0.010
us-0.010
ino-0.010
Token turn
Token ID1210
Feature activation+8.53e-01

Pos logit contributions
 Sue+0.041
 lick+0.035
 sill+0.032
 Together+0.032
 Amy+0.030
Neg logit contributions
ind-0.065
eds-0.064
 targets-0.063
opter-0.061
 instrument-0.060
Token off
Token ID572
Feature activation-1.36e+00

Pos logit contributions
 Sue+0.024
 lick+0.022
 sill+0.020
 shake+0.020
 wipe+0.019
Neg logit contributions
ind-0.034
opter-0.034
opus-0.034
eds-0.034
ich-0.033
Token the
Token ID262
Feature activation+4.95e-01

Pos logit contributions
 targets+0.043
ind+0.043
eds+0.042
opter+0.040
us+0.040
Neg logit contributions
 lick-0.043
 Sue-0.042
 sill-0.040
 nod-0.038
Deal-0.037
Token f
Token ID277
Feature activation+8.31e-01

Pos logit contributions
 Sue+0.012
 lick+0.011
 sill+0.010
 shake+0.009
 wipe+0.009
Neg logit contributions
ind-0.021
eds-0.021
opter-0.021
 targets-0.021
us-0.021
Tokenauc
Token ID14272
Feature activation+3.93e+00

Pos logit contributions
 Sue+0.030
 lick+0.028
 pur+0.025
 Amy+0.025
 sill+0.024
Neg logit contributions
us-0.027
eds-0.027
ind-0.027
opter-0.026
opus-0.026
Tokenet
Token ID316
Feature activation+3.01e-01

Pos logit contributions
 Sue+0.151
 lick+0.136
+0.128
 Amy+0.127
 She+0.127
Neg logit contributions
eds-0.117
ind-0.110
 targets-0.104
OO-0.102
us-0.101
Token but
Token ID475
Feature activation-2.98e-01

Pos logit contributions
 Sue+0.009
 lick+0.009
 sill+0.008
 shake+0.008
 wipe+0.007
Neg logit contributions
ind-0.011
eds-0.011
opter-0.011
 targets-0.011
us-0.011
Token it
Token ID340
Feature activation+7.46e-01

Pos logit contributions
ind+0.011
eds+0.010
us+0.010
opter+0.010
 targets+0.010
Neg logit contributions
 Sue-0.011
 lick-0.009
 shake-0.009
 Amy-0.008
 wipe-0.008
Token was
Token ID373
Feature activation+1.13e+00

Pos logit contributions
 Sue+0.029
 lick+0.028
 sill+0.026
 nod+0.025
 Amy+0.024
Neg logit contributions
eds-0.020
opter-0.020
ind-0.020
 targets-0.019
OO-0.019
Token too
Token ID1165
Feature activation+1.89e+00

Pos logit contributions
 Sue+0.032
 lick+0.030
 sill+0.027
 nod+0.025
 wipe+0.024
Neg logit contributions
ind-0.043
eds-0.043
 targets-0.042
us-0.041
opter-0.041
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
 Sue+0.002
 lick+0.001
 sill+0.001
 pur+0.001
 shake+0.001
Neg logit contributions
ind-0.002
opter-0.002
eds-0.002
opus-0.002
us-0.002
Token With
Token ID2080
Feature activation+1.08e+00

Pos logit contributions
 Sue+0.007
 lick+0.007
 sill+0.006
 Amy+0.005
 pur+0.005
Neg logit contributions
ind-0.013
eds-0.013
 targets-0.012
opter-0.012
us-0.012
Token her
Token ID607
Feature activation+2.68e+00

Pos logit contributions
 Sue+0.032
 lick+0.030
 sill+0.027
 shake+0.025
 pur+0.025
Neg logit contributions
ind-0.041
eds-0.040
 targets-0.040
opter-0.040
us-0.039
Token new
Token ID649
Feature activation+1.38e+00

Pos logit contributions
 lick+0.078
 Sue+0.077
 pur+0.070
 shake+0.069
 nod+0.069
Neg logit contributions
us-0.102
ind-0.102
opter-0.101
eds-0.100
OO-0.099
Token beautiful
Token ID4950
Feature activation+4.75e-01

Pos logit contributions
 lick+0.043
 Sue+0.041
 nod+0.039
 pur+0.039
 sill+0.038
Neg logit contributions
us-0.050
ind-0.048
opter-0.048
OO-0.048
eds-0.048
Token voice
Token ID3809
Feature activation+3.92e+00

Pos logit contributions
 lick+0.016
 Sue+0.014
 pur+0.014
 nod+0.014
 shake+0.013
Neg logit contributions
us-0.017
ind-0.017
 targets-0.016
opter-0.016
opus-0.016
Token,
Token ID11
Feature activation+1.40e-01

Pos logit contributions
 Sue+0.107
 lick+0.098
 sill+0.087
 shake+0.083
 pur+0.082
Neg logit contributions
ind-0.163
eds-0.161
 targets-0.157
us-0.156
opter-0.155
Token the
Token ID262
Feature activation-1.57e-01

Pos logit contributions
 Sue+0.004
 lick+0.004
 sill+0.003
 shake+0.003
 pur+0.003
Neg logit contributions
ind-0.005
 targets-0.005
opter-0.005
eds-0.005
us-0.005
Token opera
Token ID27296
Feature activation-1.04e+00

Pos logit contributions
eds+0.005
us+0.005
OO+0.005
ind+0.005
opter+0.004
Neg logit contributions
 Sue-0.006
 lick-0.006
 sill-0.005
 shake-0.005
 pur-0.005
Token singer
Token ID14015
Feature activation-4.50e-01

Pos logit contributions
eds+0.038
ind+0.038
us+0.037
 targets+0.037
OO+0.037
Neg logit contributions
 Sue-0.033
 lick-0.031
 sill-0.030
 pur-0.029
 shake-0.028
Token was
Token ID373
Feature activation+4.66e-01

Pos logit contributions
ind+0.020
eds+0.019
 targets+0.019
us+0.019
OO+0.019
Neg logit contributions
 Sue-0.011
 lick-0.010
 pur-0.010
 sill-0.009
 shake-0.009
Token\n
Token ID198
Feature activation+2.23e-01

Pos logit contributions
 lick+0.016
 Sue+0.015
 sill+0.015
 crunch+0.014
 shake+0.013
Neg logit contributions
opter-0.018
us-0.018
OO-0.018
 perspective-0.017
 targets-0.017
Token\n
Token ID198
Feature activation+1.66e-01

Pos logit contributions
 Sue+0.006
 lick+0.005
 sill+0.005
 Amy+0.004
 shake+0.004
Neg logit contributions
ind-0.009
 targets-0.009
eds-0.009
opter-0.009
opus-0.009
TokenWhen
Token ID2215
Feature activation+5.23e-01

Pos logit contributions
 Sue+0.006
 lick+0.006
 sill+0.005
 shake+0.005
 pur+0.005
Neg logit contributions
ind-0.005
eds-0.005
 targets-0.005
opter-0.005
us-0.005
Token the
Token ID262
Feature activation+1.64e+00

Pos logit contributions
 lick+0.013
 sill+0.013
 Sue+0.012
 crunch+0.011
 nod+0.011
Neg logit contributions
opter-0.020
ind-0.019
us-0.019
 targets-0.019
 passport-0.019
Token door
Token ID3420
Feature activation+2.00e+00

Pos logit contributions
 Sue+0.068
 lick+0.068
 sill+0.062
 pur+0.058
 nod+0.058
Neg logit contributions
ind-0.040
opter-0.038
opus-0.037
 targets-0.037
ich-0.036
Token opened
Token ID4721
Feature activation+3.91e+00

Pos logit contributions
 Sue+0.050
 lick+0.049
 sill+0.042
 crunch+0.038
 pur+0.038
Neg logit contributions
opter-0.080
ind-0.079
eds-0.079
 targets-0.078
us-0.077
Token,
Token ID11
Feature activation+7.40e-01

Pos logit contributions
 Sue+0.118
 lick+0.109
 sill+0.099
 shake+0.092
 Amy+0.092
Neg logit contributions
ind-0.147
eds-0.146
 targets-0.145
opter-0.144
us-0.143
Token bright
Token ID6016
Feature activation+1.50e+00

Pos logit contributions
 lick+0.022
 sill+0.020
 Sue+0.020
 crunch+0.018
 nod+0.018
Neg logit contributions
 targets-0.025
opter-0.025
 passport-0.024
ind-0.024
ich-0.024
Token light
Token ID1657
Feature activation+7.81e-01

Pos logit contributions
 Sue+0.050
 lick+0.047
 sill+0.044
 shake+0.041
 pur+0.040
Neg logit contributions
ind-0.052
eds-0.051
us-0.050
 targets-0.050
opter-0.050
Token shone
Token ID44193
Feature activation+1.09e+00

Pos logit contributions
 Sue+0.025
 lick+0.023
 sill+0.022
 shake+0.020
 pur+0.020
Neg logit contributions
ind-0.028
eds-0.027
 targets-0.027
opter-0.027
us-0.027
Token out
Token ID503
Feature activation+3.42e-01

Pos logit contributions
 lick+0.040
 Sue+0.040
 sill+0.034
+0.033
 wipe+0.033
Neg logit contributions
ind-0.032
 targets-0.031
opter-0.030
us-0.030
eds-0.029
Token you
Token ID345
Feature activation-2.62e-01

Pos logit contributions
ind+0.009
eds+0.009
opter+0.009
 targets+0.008
opus+0.008
Neg logit contributions
 Sue-0.012
 lick-0.011
 sill-0.010
 Amy-0.010
 shake-0.010
Token."
Token ID526
Feature activation+3.07e-01

Pos logit contributions
opter+0.011
opus+0.010
ind+0.010
us+0.010
 targets+0.010
Neg logit contributions
 Sue-0.008
 lick-0.007
 sill-0.006
 Amy-0.006
 wipe-0.006
Token With
Token ID2080
Feature activation+8.32e-01

Pos logit contributions
 lick+0.007
 sill+0.007
 shake+0.007
 Sue+0.007
 crunch+0.006
Neg logit contributions
us-0.011
 targets-0.010
eds-0.010
opter-0.010
ables-0.010
Token a
Token ID257
Feature activation+2.20e+00

Pos logit contributions
 Sue+0.025
 lick+0.024
 sill+0.023
 wipe+0.021
 shake+0.020
Neg logit contributions
 targets-0.030
ind-0.028
us-0.027
eds-0.027
opter-0.027
Token kind
Token ID1611
Feature activation+2.46e+00

Pos logit contributions
 Sue+0.070
 sill+0.063
 lick+0.058
 Amy+0.054
 She+0.054
Neg logit contributions
opter-0.075
opus-0.074
us-0.072
eds-0.071
 passport-0.070
Token voice
Token ID3809
Feature activation+3.88e+00

Pos logit contributions
 Sue+0.076
 lick+0.070
 sill+0.065
Deal+0.064
 blush+0.064
Neg logit contributions
opter-0.081
us-0.077
OO-0.077
opus-0.076
 enemy-0.076
Token,
Token ID11
Feature activation-3.35e-01

Pos logit contributions
 lick+0.149
 Sue+0.143
 sill+0.137
+0.135
Deal+0.128
Neg logit contributions
us-0.100
opter-0.093
 targets-0.091
ino-0.089
opus-0.089
Token the
Token ID262
Feature activation-6.23e-01

Pos logit contributions
 targets+0.011
opter+0.010
us+0.010
opus+0.010
ich+0.010
Neg logit contributions
 lick-0.011
 sill-0.010
 Sue-0.010
 wipe-0.009
 crunch-0.009
Token snake
Token ID17522
Feature activation-1.20e-01

Pos logit contributions
opus+0.014
 targets+0.013
ich+0.013
opter+0.013
ind+0.013
Neg logit contributions
 lick-0.025
 Sue-0.025
 sill-0.023
 wipe-0.022
 blush-0.022
Token asked
Token ID1965
Feature activation-4.25e-01

Pos logit contributions
ind+0.004
 targets+0.004
eds+0.004
opus+0.004
opter+0.004
Neg logit contributions
 Sue-0.004
 lick-0.003
 sill-0.003
 wipe-0.003
 Amy-0.003
Token the
Token ID262
Feature activation-6.16e-01

Pos logit contributions
ind+0.015
eds+0.015
 targets+0.015
us+0.015
opter+0.015
Neg logit contributions
 Sue-0.013
 lick-0.013
 sill-0.011
 shake-0.010
 wipe-0.010
Token.
Token ID13
Feature activation+4.26e-01

Pos logit contributions
 Sue+0.024
 lick+0.024
 pur+0.020
 sill+0.020
 Amy+0.020
Neg logit contributions
ind-0.026
eds-0.026
us-0.026
 targets-0.025
opter-0.025
Token They
Token ID1119
Feature activation+1.14e+00

Pos logit contributions
 lick+0.009
 Sue+0.009
 sill+0.007
 pur+0.006
 shake+0.006
Neg logit contributions
 targets-0.019
ind-0.019
eds-0.019
us-0.019
opter-0.019
Token opened
Token ID4721
Feature activation+2.34e+00

Pos logit contributions
 Sue+0.034
 lick+0.031
 sill+0.029
 Amy+0.027
 crunch+0.026
Neg logit contributions
opter-0.040
us-0.039
ind-0.039
 targets-0.039
eds-0.039
Token the
Token ID262
Feature activation+4.45e-01

Pos logit contributions
 Sue+0.069
 lick+0.066
 sill+0.059
 pur+0.055
 shake+0.054
Neg logit contributions
ind-0.087
eds-0.086
 targets-0.086
opter-0.084
us-0.084
Token f
Token ID277
Feature activation+8.91e-01

Pos logit contributions
 Sue+0.015
 lick+0.015
 sill+0.013
 shake+0.012
 wipe+0.012
Neg logit contributions
ind-0.015
eds-0.015
opter-0.015
 targets-0.015
us-0.015
Tokenauc
Token ID14272
Feature activation+3.87e+00

Pos logit contributions
 Sue+0.032
 lick+0.030
 pur+0.027
 sill+0.026
 shake+0.026
Neg logit contributions
us-0.029
ind-0.029
eds-0.029
opter-0.028
opus-0.027
Tokenet
Token ID316
Feature activation+2.26e-01

Pos logit contributions
 Sue+0.153
 lick+0.137
+0.135
 Lucy+0.129
 Amy+0.129
Neg logit contributions
eds-0.108
ind-0.102
us-0.094
OO-0.091
 targets-0.091
Token and
Token ID290
Feature activation+2.55e-02

Pos logit contributions
 Sue+0.009
 lick+0.008
 sill+0.008
 pur+0.007
 shake+0.007
Neg logit contributions
ind-0.006
eds-0.006
 targets-0.006
opter-0.006
us-0.006
Token put
Token ID1234
Feature activation+7.06e-01

Pos logit contributions
 Sue+0.001
 lick+0.001
 sill+0.001
+0.001
 crunch+0.001
Neg logit contributions
 targets-0.001
us-0.001
eds-0.001
ind-0.001
opter-0.001
Token their
Token ID511
Feature activation+1.80e+00

Pos logit contributions
 lick+0.028
 Sue+0.028
 pur+0.026
 nod+0.026
 sill+0.026
Neg logit contributions
 targets-0.016
us-0.016
eds-0.015
ind-0.015
 passport-0.014
Token letters
Token ID7475
Feature activation+3.59e-01

Pos logit contributions
 Sue+0.063
 lick+0.060
 sill+0.054
 shake+0.052
 wipe+0.051
Neg logit contributions
opter-0.059
ind-0.059
eds-0.059
us-0.058
opus-0.058
Token\n
Token ID198
Feature activation+4.49e-02

Pos logit contributions
 targets+0.001
ind+0.001
eds+0.001
opter+0.001
 enemy+0.001
Neg logit contributions
 Sue-0.000
 lick-0.000
 She-0.000
 Amy-0.000
 Together-0.000
TokenWhen
Token ID2215
Feature activation+7.13e-01

Pos logit contributions
 Sue+0.001
 lick+0.001
 sill+0.001
 Amy+0.001
 wipe+0.001
Neg logit contributions
 targets-0.002
eds-0.002
ind-0.002
us-0.002
opter-0.002
Token the
Token ID262
Feature activation+1.49e+00

Pos logit contributions
 Sue+0.021
 lick+0.020
 sill+0.019
 shake+0.017
 nod+0.017
Neg logit contributions
ind-0.026
opter-0.026
eds-0.026
 targets-0.026
us-0.025
Token windows
Token ID9168
Feature activation+1.40e+00

Pos logit contributions
 Sue+0.049
 lick+0.045
 sill+0.041
 Amy+0.039
 Together+0.038
Neg logit contributions
opus-0.049
 passport-0.048
opter-0.046
ich-0.046
us-0.044
Token were
Token ID547
Feature activation+1.84e+00

Pos logit contributions
 Sue+0.041
 lick+0.040
 sill+0.035
 pur+0.033
 crunch+0.033
Neg logit contributions
opter-0.051
eds-0.050
ind-0.050
 targets-0.049
us-0.049
Token opened
Token ID4721
Feature activation+3.87e+00

Pos logit contributions
 Sue+0.054
 lick+0.053
 sill+0.047
 nod+0.044
 pur+0.043
Neg logit contributions
ind-0.064
eds-0.064
opter-0.063
us-0.062
 targets-0.062
Token,
Token ID11
Feature activation+4.47e-01

Pos logit contributions
 Sue+0.131
 lick+0.130
 sill+0.115
 nod+0.107
 wipe+0.107
Neg logit contributions
ind-0.125
opter-0.121
 targets-0.121
eds-0.121
us-0.118
Token a
Token ID257
Feature activation+7.23e-02

Pos logit contributions
 Sue+0.012
 lick+0.012
 sill+0.011
 shake+0.009
 pur+0.009
Neg logit contributions
ind-0.017
 targets-0.017
opter-0.017
eds-0.017
us-0.016
Token faint
Token ID18107
Feature activation-2.40e-02

Pos logit contributions
 Sue+0.002
 lick+0.002
 sill+0.002
 pur+0.002
 shake+0.002
Neg logit contributions
eds-0.003
ind-0.003
 targets-0.003
us-0.003
OO-0.003
Token breeze
Token ID28633
Feature activation-4.95e-01

Pos logit contributions
ind+0.001
eds+0.001
 targets+0.001
opter+0.001
us+0.001
Neg logit contributions
 Sue-0.001
 lick-0.001
 sill-0.001
 shake-0.001
 pur-0.001
Token came
Token ID1625
Feature activation+1.64e+00

Pos logit contributions
ind+0.017
eds+0.017
 targets+0.017
opter+0.017
us+0.017
Neg logit contributions
 Sue-0.016
 lick-0.015
 sill-0.014
 shake-0.013
 pur-0.013
Token away
Token ID1497
Feature activation-1.09e+00

Pos logit contributions
 Sue+0.089
 lick+0.086
 sill+0.078
 pur+0.073
 shake+0.073
Neg logit contributions
ind-0.091
eds-0.089
 targets-0.089
opter-0.088
us-0.088
Token,
Token ID11
Feature activation+3.58e-01

Pos logit contributions
ind+0.045
eds+0.045
 targets+0.044
us+0.043
opter+0.043
Neg logit contributions
 Sue-0.031
 lick-0.028
 sill-0.025
 shake-0.024
 Amy-0.024
Token they
Token ID484
Feature activation-6.84e-01

Pos logit contributions
 Sue+0.008
 lick+0.007
 Lucy+0.007
 Amy+0.007
 sill+0.006
Neg logit contributions
eds-0.017
ind-0.017
opus-0.016
us-0.016
OO-0.016
Token saw
Token ID2497
Feature activation-8.70e-02

Pos logit contributions
ind+0.027
eds+0.027
opter+0.026
opus+0.026
us+0.026
Neg logit contributions
 Sue-0.020
 lick-0.018
 sill-0.016
 shake-0.016
 pur-0.015
Token a
Token ID257
Feature activation-7.60e-01

Pos logit contributions
eds+0.004
ind+0.004
opter+0.004
us+0.004
opus+0.004
Neg logit contributions
 Sue-0.002
 lick-0.002
 sill-0.002
 Amy-0.002
 shake-0.002
Token se
Token ID384
Feature activation+3.85e+00

Pos logit contributions
eds+0.030
ind+0.029
 targets+0.029
ors+0.029
us+0.028
Neg logit contributions
 Sue-0.022
 lick-0.022
 shake-0.019
 crunch-0.019
 pur-0.019
Tokenag
Token ID363
Feature activation-5.77e-01

Pos logit contributions
 Sue+0.131
 lick+0.120
 sill+0.110
 Amy+0.106
 shake+0.106
Neg logit contributions
ind-0.132
eds-0.128
opter-0.127
us-0.127
OO-0.123
Tokenull
Token ID724
Feature activation+1.05e+00

Pos logit contributions
 targets+0.015
ind+0.015
opus+0.014
opter+0.014
us+0.014
Neg logit contributions
 Sue-0.024
 sill-0.022
 lick-0.022
 shake-0.020
 Amy-0.020
Token fly
Token ID6129
Feature activation+4.02e-01

Pos logit contributions
 Sue+0.037
 lick+0.035
 sill+0.032
 shake+0.031
 pur+0.031
Neg logit contributions
ind-0.033
opter-0.033
 targets-0.033
eds-0.032
us-0.032
Token down
Token ID866
Feature activation-2.47e+00

Pos logit contributions
 Sue+0.013
 lick+0.012
 sill+0.011
 shake+0.010
 pur+0.010
Neg logit contributions
ind-0.014
eds-0.014
opter-0.014
 targets-0.014
us-0.014
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
ind+0.084
eds+0.083
 targets+0.083
opter+0.081
us+0.081
Neg logit contributions
 Sue-0.081
 lick-0.078
 sill-0.070
 pur-0.066
 shake-0.065
Token lead
Token ID1085
Feature activation-6.91e-02

Pos logit contributions
us+0.032
eds+0.032
 targets+0.032
ind+0.032
opter+0.031
Neg logit contributions
 Sue-0.024
 lick-0.022
 sill-0.020
 crunch-0.018
 pur-0.018
Token him
Token ID683
Feature activation+9.21e-02

Pos logit contributions
ind+0.003
opter+0.003
eds+0.003
opus+0.003
ich+0.003
Neg logit contributions
 Sue-0.002
 lick-0.002
 sill-0.002
 Amy-0.002
 shake-0.002
Token around
Token ID1088
Feature activation-1.71e+00

Pos logit contributions
 Sue+0.003
 lick+0.003
 sill+0.003
 shake+0.003
 pur+0.003
Neg logit contributions
ind-0.003
eds-0.003
opter-0.003
 targets-0.003
us-0.003
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
opus+0.085
opter+0.084
ind+0.081
eds+0.081
ich+0.080
Neg logit contributions
 Sue-0.037
 lick-0.031
 sill-0.030
 Amy-0.028
 shake-0.028
Token books
Token ID3835
Feature activation-5.73e-01

Pos logit contributions
us+0.055
opter+0.054
eds+0.054
opus+0.052
ind+0.052
Neg logit contributions
 Sue-0.056
 lick-0.053
 sill-0.050
 shake-0.047
 pur-0.046
Tokenhe
Token ID258
Feature activation-4.27e+00

Pos logit contributions
ind+0.022
eds+0.022
 targets+0.021
opter+0.021
us+0.021
Neg logit contributions
 Sue-0.017
 lick-0.016
 sill-0.014
 shake-0.013
 pur-0.013
Tokenlf
Token ID1652
Feature activation+5.94e-01

Pos logit contributions
ind+0.140
us+0.137
OO+0.137
opter+0.135
eds+0.132
Neg logit contributions
 Sue-0.156
 Amy-0.137
 lick-0.136
 sill-0.136
 Lucy-0.128
Token.
Token ID13
Feature activation+3.95e-02

Pos logit contributions
 Sue+0.017
 lick+0.016
 sill+0.014
 shake+0.013
 pur+0.013
Neg logit contributions
ind-0.023
eds-0.022
 targets-0.022
opter-0.022
us-0.022
Token After
Token ID2293
Feature activation-4.88e-01

Pos logit contributions
 Sue+0.001
 lick+0.001
 sill+0.001
 Amy+0.001
 nod+0.001
Neg logit contributions
ind-0.002
eds-0.002
opter-0.002
us-0.002
ich-0.002
Token a
Token ID257
Feature activation+2.88e+00

Pos logit contributions
ind+0.023
eds+0.023
opter+0.022
opus+0.022
 targets+0.022
Neg logit contributions
 Sue-0.011
 lick-0.009
 sill-0.008
 Amy-0.008
 shake-0.008
Token while
Token ID981
Feature activation+1.14e+00

Pos logit contributions
 Sue+0.081
 lick+0.081
 shake+0.066
 crunch+0.064
 sill+0.064
Neg logit contributions
eds-0.117
ind-0.117
 targets-0.113
ich-0.109
us-0.109
Token the
Token ID262
Feature activation-2.54e-01

Pos logit contributions
 Sue+0.044
 lick+0.042
 sill+0.038
 shake+0.035
 pur+0.035
Neg logit contributions
ind-0.054
eds-0.053
 targets-0.052
opter-0.052
us-0.052
Token bear
Token ID6842
Feature activation+1.06e+00

Pos logit contributions
ind+0.009
 targets+0.008
eds+0.008
opter+0.008
us+0.008
Neg logit contributions
 Sue-0.009
 lick-0.008
 sill-0.007
 shake-0.007
 nod-0.007
Token and
Token ID290
Feature activation-7.54e-01

Pos logit contributions
 Sue+0.032
 lick+0.030
 sill+0.027
 shake+0.025
 pur+0.025
Neg logit contributions
ind-0.040
opter-0.040
eds-0.039
us-0.039
 targets-0.038
Token the
Token ID262
Feature activation-6.81e-01

Pos logit contributions
ind+0.031
opter+0.030
eds+0.030
us+0.030
opus+0.029
Neg logit contributions
 Sue-0.022
 lick-0.020
 sill-0.018
 Amy-0.017
 pur-0.017
Token books
Token ID3835
Feature activation-4.42e-01

Pos logit contributions
 targets+0.022
ind+0.022
eds+0.021
opter+0.021
opus+0.021
Neg logit contributions
 Sue-0.023
 lick-0.022
 sill-0.020
 shake-0.019
 nod-0.019
Tokenhe
Token ID258
Feature activation-4.17e+00

Pos logit contributions
 targets+0.016
eds+0.016
ind+0.016
opter+0.016
us+0.015
Neg logit contributions
 Sue-0.013
 lick-0.013
 sill-0.011
 nod-0.011
 pur-0.011
Tokenlf
Token ID1652
Feature activation+5.64e-01

Pos logit contributions
 targets+0.124
ind+0.122
OO+0.121
utes+0.121
us+0.118
Neg logit contributions
 Sue-0.165
 lick-0.146
 Amy-0.142
 Lucy-0.141
 sill-0.131
Token.
Token ID13
Feature activation+8.42e-01

Pos logit contributions
 Sue+0.017
 lick+0.016
 sill+0.014
 nod+0.013
 wipe+0.013
Neg logit contributions
ind-0.020
eds-0.020
 targets-0.020
opter-0.020
us-0.020
Token She
Token ID1375
Feature activation+7.29e-01

Pos logit contributions
 lick+0.016
 Sue+0.014
 pur+0.011
 crunch+0.011
 shake+0.011
Neg logit contributions
 targets-0.039
OO-0.038
 perspective-0.038
opter-0.038
eds-0.038
Token stayed
Token ID9658
Feature activation+1.14e-01

Pos logit contributions
 Sue+0.022
 lick+0.021
 sill+0.019
 pur+0.018
 shake+0.018
Neg logit contributions
ind-0.028
eds-0.027
opter-0.027
us-0.027
 targets-0.026
Token in
Token ID287
Feature activation+1.16e+00

Pos logit contributions
 Sue+0.004
 lick+0.003
 sill+0.003
 shake+0.003
 pur+0.003
Neg logit contributions
ind-0.004
eds-0.004
 targets-0.004
opter-0.004
us-0.004
Token It
Token ID632
Feature activation+1.03e+00

Pos logit contributions
 Sue+0.016
 lick+0.015
 sill+0.013
 shake+0.012
 pur+0.012
Neg logit contributions
ind-0.023
eds-0.023
 targets-0.023
opter-0.023
us-0.022
Token became
Token ID2627
Feature activation+7.89e-01

Pos logit contributions
 Sue+0.034
 lick+0.032
 sill+0.030
 pur+0.028
 shake+0.028
Neg logit contributions
ind-0.035
 targets-0.035
eds-0.035
opter-0.035
us-0.034
Token the
Token ID262
Feature activation+4.21e-01

Pos logit contributions
 Sue+0.023
 lick+0.022
 sill+0.019
 wipe+0.018
 shake+0.018
Neg logit contributions
ind-0.029
 targets-0.029
eds-0.028
us-0.028
opter-0.027
Token happiest
Token ID49414
Feature activation-2.53e+00

Pos logit contributions
 Sue+0.013
 lick+0.012
 sill+0.011
 Amy+0.010
 pur+0.010
Neg logit contributions
ind-0.015
 targets-0.015
opus-0.014
ich-0.014
eds-0.014
Token books
Token ID3835
Feature activation-5.14e-01

Pos logit contributions
eds+0.086
ind+0.085
us+0.084
 targets+0.081
OO+0.080
Neg logit contributions
 Sue-0.092
 lick-0.083
 sill-0.080
 Amy-0.077
 Lucy-0.075
Tokenhe
Token ID258
Feature activation-4.16e+00

Pos logit contributions
ind+0.019
eds+0.019
 targets+0.019
us+0.018
OO+0.018
Neg logit contributions
 Sue-0.016
 lick-0.015
 sill-0.014
 shake-0.013
 Amy-0.013
Tokenlf
Token ID1652
Feature activation+3.71e-01

Pos logit contributions
ind+0.128
 targets+0.125
OO+0.124
opter+0.123
eds+0.120
Neg logit contributions
 Sue-0.164
 Amy-0.146
 lick-0.141
 Lucy-0.137
 sill-0.133
Token in
Token ID287
Feature activation-7.92e-01

Pos logit contributions
 Sue+0.011
 lick+0.010
 sill+0.010
 Amy+0.009
 shake+0.009
Neg logit contributions
ind-0.014
 targets-0.014
eds-0.014
opter-0.013
OO-0.013
Token the
Token ID262
Feature activation-1.70e+00

Pos logit contributions
opus+0.043
us+0.041
opter+0.041
OO+0.041
eds+0.040
Neg logit contributions
 Sue-0.015
 Amy-0.010
 Lucy-0.010
 lick-0.010
 sill-0.009
Token world
Token ID995
Feature activation-8.86e-01

Pos logit contributions
us+0.038
eds+0.038
opter+0.037
 targets+0.037
ind+0.036
Neg logit contributions
 Sue-0.077
 lick-0.074
 sill-0.072
 shake-0.067
 pur-0.067
Token!
Token ID0
Feature activation-5.17e-01

Pos logit contributions
ind+0.035
eds+0.034
 targets+0.034
opter+0.033
us+0.033
Neg logit contributions
 Sue-0.026
 lick-0.024
 sill-0.022
 Amy-0.021
 shake-0.020
Token lead
Token ID1085
Feature activation-1.75e-01

Pos logit contributions
ind+0.037
eds+0.037
 targets+0.037
us+0.037
opter+0.036
Neg logit contributions
 Sue-0.025
 lick-0.023
 sill-0.021
 pur-0.019
 Amy-0.019
Token him
Token ID683
Feature activation-6.75e-02

Pos logit contributions
ind+0.006
opter+0.006
eds+0.006
opus+0.006
 targets+0.006
Neg logit contributions
 Sue-0.006
 lick-0.005
 sill-0.005
 Amy-0.004
 shake-0.004
Token around
Token ID1088
Feature activation-1.70e+00

Pos logit contributions
ind+0.002
opter+0.002
eds+0.002
 targets+0.002
us+0.002
Neg logit contributions
 Sue-0.002
 lick-0.002
 sill-0.002
 shake-0.002
 pur-0.002
Token the
Token ID262
Feature activation-1.67e+00

Pos logit contributions
opus+0.081
opter+0.080
ind+0.078
ich+0.077
eds+0.077
Neg logit contributions
 Sue-0.040
 lick-0.033
 sill-0.032
 Amy-0.031
 shake-0.030
Token books
Token ID3835
Feature activation-5.52e-01

Pos logit contributions
us+0.060
opter+0.060
eds+0.059
utes+0.057
opus+0.057
Neg logit contributions
 Sue-0.054
 lick-0.052
 sill-0.050
 pur-0.046
 shake-0.046
Tokenhe
Token ID258
Feature activation-4.14e+00

Pos logit contributions
ind+0.021
eds+0.021
opter+0.021
 targets+0.021
us+0.021
Neg logit contributions
 Sue-0.016
 lick-0.015
 sill-0.014
 shake-0.012
 Amy-0.012
Tokenlf
Token ID1652
Feature activation+6.03e-01

Pos logit contributions
ind+0.130
OO+0.126
opter+0.124
us+0.124
eds+0.123
Neg logit contributions
 Sue-0.158
 Amy-0.140
 lick-0.140
 sill-0.138
 Lucy-0.131
Token.
Token ID13
Feature activation-7.02e-02

Pos logit contributions
 Sue+0.017
 lick+0.016
 sill+0.014
 shake+0.013
 pur+0.013
Neg logit contributions
ind-0.023
eds-0.023
opter-0.023
 targets-0.023
us-0.023
Token After
Token ID2293
Feature activation-5.36e-01

Pos logit contributions
ind+0.003
eds+0.003
opter+0.003
us+0.003
 targets+0.003
Neg logit contributions
 Sue-0.002
 lick-0.001
 sill-0.001
 Amy-0.001
 nod-0.001
Token a
Token ID257
Feature activation+2.71e+00

Pos logit contributions
ind+0.025
eds+0.024
opter+0.024
opus+0.024
ich+0.023
Neg logit contributions
 Sue-0.013
 lick-0.011
 sill-0.010
 Amy-0.009
 shake-0.009
Token while
Token ID981
Feature activation+1.15e+00

Pos logit contributions
 lick+0.081
 Sue+0.080
 shake+0.066
 crunch+0.066
 sill+0.065
Neg logit contributions
eds-0.107
ind-0.106
 targets-0.103
ich-0.100
us-0.098
Token most
Token ID749
Feature activation-1.08e+00

Pos logit contributions
eds+0.002
ind+0.002
opter+0.002
us+0.002
 targets+0.002
Neg logit contributions
 Sue-0.002
 lick-0.002
 sill-0.001
 shake-0.001
 nod-0.001
Token perfect
Token ID2818
Feature activation-1.94e+00

Pos logit contributions
eds+0.041
ind+0.039
OO+0.039
opter+0.039
us+0.038
Neg logit contributions
 Sue-0.034
 lick-0.031
 sill-0.029
 crunch-0.028
 shake-0.027
Token present
Token ID1944
Feature activation-4.89e-01

Pos logit contributions
ind+0.065
eds+0.065
us+0.064
opter+0.063
 targets+0.063
Neg logit contributions
 Sue-0.068
 lick-0.065
 sill-0.060
 shake-0.056
 Amy-0.054
Token!
Token ID0
Feature activation-1.01e+00

Pos logit contributions
ind+0.019
eds+0.019
 targets+0.018
opter+0.018
us+0.018
Neg logit contributions
 Sue-0.015
 lick-0.013
 sill-0.013
 Amy-0.012
 shake-0.011
Token Sy
Token ID1632
Feature activation-1.47e+00

Pos logit contributions
ind+0.044
eds+0.043
 targets+0.043
opter+0.043
us+0.042
Neg logit contributions
 Sue-0.025
 lick-0.023
 sill-0.020
 Amy-0.018
 shake-0.018
Tokenlves
Token ID31018
Feature activation-4.06e+00

Pos logit contributions
eds+0.031
opter+0.030
utes+0.030
opus+0.030
OO+0.030
Neg logit contributions
 Sue-0.067
 lick-0.061
 sill-0.055
 Lucy-0.055
 She-0.054
Tokenter
Token ID353
Feature activation-1.23e+00

Pos logit contributions
opus+0.168
eds+0.159
OO+0.158
opter+0.156
 targets+0.155
Neg logit contributions
 Sue-0.111
 sill-0.082
 Lucy-0.080
 lick-0.076
 Sally-0.071
Token was
Token ID373
Feature activation-3.18e-01

Pos logit contributions
ind+0.047
 targets+0.047
eds+0.046
opter+0.045
OO+0.045
Neg logit contributions
 Sue-0.038
 lick-0.034
 sill-0.031
 pur-0.030
 shake-0.029
Token so
Token ID523
Feature activation-4.62e-01

Pos logit contributions
ind+0.013
opter+0.013
eds+0.013
 targets+0.013
opus+0.013
Neg logit contributions
 Sue-0.009
 lick-0.008
 sill-0.007
 Amy-0.007
 shake-0.007
Token happy
Token ID3772
Feature activation+4.43e-01

Pos logit contributions
ind+0.017
eds+0.017
opter+0.017
 targets+0.017
us+0.017
Neg logit contributions
 Sue-0.014
 lick-0.013
 sill-0.012
 shake-0.011
 pur-0.011
Token and
Token ID290
Feature activation+1.66e-01

Pos logit contributions
 Sue+0.014
 lick+0.011
 sill+0.011
 Amy+0.011
 She+0.011
Neg logit contributions
ind-0.018
eds-0.018
opter-0.017
 targets-0.017
OO-0.017
Token beneath
Token ID11061
Feature activation-2.06e+00

Pos logit contributions
opus+0.019
opter+0.019
ind+0.018
eds+0.018
 targets+0.018
Neg logit contributions
 Sue-0.012
 lick-0.010
 sill-0.010
 shake-0.009
 Amy-0.009
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
ind+0.086
opter+0.086
eds+0.086
opus+0.085
us+0.084
Neg logit contributions
 Sue-0.055
 lick-0.049
 sill-0.044
 Amy-0.041
 shake-0.041
Token j
Token ID474
Feature activation-1.43e+00

Pos logit contributions
opus+0.039
OO+0.039
eds+0.039
opter+0.039
us+0.038
Neg logit contributions
 Sue-0.040
 lick-0.038
 sill-0.037
 shake-0.035
 crunch-0.033
Tokenee
Token ID1453
Feature activation-2.09e+00

Pos logit contributions
 targets+0.036
eds+0.034
ich+0.034
ind+0.034
 perspective+0.033
Neg logit contributions
 Sue-0.058
 lick-0.054
 sill-0.052
 shake-0.048
 wipe-0.047
Tokenpt
Token ID457
Feature activation-1.34e+00

Pos logit contributions
ind+0.084
 targets+0.082
 culture+0.081
opter+0.081
opus+0.080
Neg logit contributions
 Sue-0.054
 sill-0.050
 lick-0.050
 shake-0.046
 wipe-0.042
Tokenhe
Token ID258
Feature activation-4.06e+00

Pos logit contributions
eds+0.047
ind+0.046
 targets+0.045
opus+0.045
opter+0.044
Neg logit contributions
 Sue-0.042
 sill-0.039
 lick-0.035
 shake-0.035
 She-0.034
Token same
Token ID976
Feature activation-1.35e+00

Pos logit contributions
ind+0.170
opter+0.163
opus+0.163
eds+0.163
us+0.162
Neg logit contributions
 Sue-0.107
 sill-0.090
 lick-0.088
 Amy-0.086
 shake-0.086
Token je
Token ID11223
Feature activation-4.08e-01

Pos logit contributions
opus+0.055
ind+0.055
us+0.054
opter+0.054
eds+0.054
Neg logit contributions
 lick-0.035
 Sue-0.035
 sill-0.033
 shake-0.031
 pur-0.029
Tokenep
Token ID538
Feature activation+6.02e-01

Pos logit contributions
 targets+0.008
 perspective+0.008
eds+0.008
opter+0.008
OO+0.008
Neg logit contributions
 Sue-0.018
 lick-0.018
 sill-0.017
 shake-0.016
 pur-0.016
Token that
Token ID326
Feature activation-2.54e-01

Pos logit contributions
 Sue+0.014
 sill+0.013
 lick+0.013
 shake+0.012
 pur+0.010
Neg logit contributions
ind-0.027
opus-0.026
eds-0.026
opter-0.025
 targets-0.025
Token they
Token ID484
Feature activation-1.24e+00

Pos logit contributions
ind+0.010
eds+0.010
opter+0.009
us+0.009
 targets+0.009
Neg logit contributions
 Sue-0.008
 lick-0.007
 sill-0.006
 shake-0.006
 Amy-0.006
Token showed
Token ID3751
Feature activation+1.01e+00

Pos logit contributions
ind+0.052
eds+0.051
 targets+0.050
opter+0.050
us+0.050
Neg logit contributions
 Sue-0.044
 lick-0.042
 sill-0.038
 shake-0.035
 pur-0.035
Token her
Token ID607
Feature activation+1.84e-01

Pos logit contributions
 lick+0.026
 Sue+0.025
 sill+0.023
 nod+0.021
 shake+0.020
Neg logit contributions
 targets-0.041
ind-0.041
eds-0.040
us-0.038
 perspective-0.037
Token friend
Token ID1545
Feature activation-3.08e-01

Pos logit contributions
 Sue+0.006
 lick+0.006
 sill+0.005
 shake+0.005
 pur+0.005
Neg logit contributions
ind-0.006
eds-0.006
 targets-0.006
opter-0.006
us-0.006
Token the
Token ID262
Feature activation-1.20e+00

Pos logit contributions
eds+0.011
ind+0.011
 targets+0.011
us+0.010
ich+0.010
Neg logit contributions
 lick-0.009
 Sue-0.009
 sill-0.008
 shake-0.007
 crunch-0.007
Token books
Token ID3835
Feature activation-2.24e-02

Pos logit contributions
ind+0.042
eds+0.042
us+0.041
opter+0.041
 targets+0.040
Neg logit contributions
 Sue-0.039
 lick-0.037
 sill-0.034
 shake-0.032
 pur-0.032
Tokenhe
Token ID258
Feature activation-4.02e+00

Pos logit contributions
ind+0.001
eds+0.001
 targets+0.001
opter+0.001
us+0.001
Neg logit contributions
 Sue-0.001
 lick-0.001
 sill-0.001
 shake-0.001
 pur-0.001
Tokenlf
Token ID1652
Feature activation+4.84e-01

Pos logit contributions
ind+0.110
 targets+0.106
us+0.104
eds+0.104
OO+0.102
Neg logit contributions
 Sue-0.173
 lick-0.157
 Amy-0.153
 sill-0.146
 Lucy-0.145
Token.
Token ID13
Feature activation+5.43e-01

Pos logit contributions
 Sue+0.014
 lick+0.014
 sill+0.012
 shake+0.011
 nod+0.011
Neg logit contributions
ind-0.018
eds-0.018
 targets-0.018
opter-0.018
us-0.017
Token From
Token ID3574
Feature activation+1.11e+00

Pos logit contributions
 lick+0.011
 Sue+0.010
 sill+0.009
 shake+0.008
 crunch+0.008
Neg logit contributions
ind-0.024
 targets-0.024
eds-0.024
OO-0.024
ich-0.023
Token then
Token ID788
Feature activation+1.62e-01

Pos logit contributions
 Sue+0.034
 lick+0.029
 Amy+0.026
 shake+0.026
 sill+0.026
Neg logit contributions
opter-0.045
OO-0.044
eds-0.044
us-0.042
 perspective-0.042
Token on
Token ID319
Feature activation-5.69e-01

Pos logit contributions
 Sue+0.005
 lick+0.005
 sill+0.004
 shake+0.004
 wipe+0.004
Neg logit contributions
ind-0.006
eds-0.006
 targets-0.006
opter-0.006
us-0.006
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 Sue+0.044
 lick+0.042
 sill+0.039
 shake+0.036
 pur+0.036
Neg logit contributions
ind-0.039
 targets-0.039
opter-0.039
eds-0.039
us-0.038
Token saw
Token ID2497
Feature activation-5.28e-01

Pos logit contributions
 targets+0.043
ind+0.043
eds+0.043
opus+0.042
opter+0.041
Neg logit contributions
 Sue-0.028
 lick-0.025
 Amy-0.021
 pur-0.021
 Lucy-0.020
Token the
Token ID262
Feature activation-7.97e-01

Pos logit contributions
ind+0.023
eds+0.023
 targets+0.022
opter+0.022
us+0.022
Neg logit contributions
 Sue-0.013
 lick-0.012
 sill-0.010
 shake-0.009
 pur-0.009
Token gloomy
Token ID46400
Feature activation-1.07e+00

Pos logit contributions
ind+0.026
 targets+0.025
eds+0.025
opter+0.025
opus+0.024
Neg logit contributions
 Sue-0.028
 lick-0.026
 sill-0.024
 nod-0.022
 pur-0.022
Token books
Token ID3835
Feature activation-4.09e-01

Pos logit contributions
ind+0.034
eds+0.033
 targets+0.033
opter+0.033
us+0.032
Neg logit contributions
 Sue-0.038
 lick-0.036
 sill-0.033
 shake-0.031
 pur-0.031
Tokenhe
Token ID258
Feature activation-4.02e+00

Pos logit contributions
opter+0.013
eds+0.012
ind+0.012
 targets+0.012
us+0.012
Neg logit contributions
 Sue-0.014
 lick-0.013
 sill-0.012
 nod-0.012
 pur-0.011
Tokenlf
Token ID1652
Feature activation+7.25e-01

Pos logit contributions
ind+0.103
 targets+0.100
us+0.097
eds+0.097
opter+0.096
Neg logit contributions
 Sue-0.176
 lick-0.162
 Amy-0.154
 sill-0.152
 shake-0.147
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 Sue+0.022
 lick+0.022
 sill+0.019
 nod+0.018
 pur+0.018
Neg logit contributions
ind-0.026
eds-0.026
 targets-0.025
opter-0.025
us-0.025
Token
Token ID220
Feature activation-8.23e-01

Pos logit contributions
opter+0.000
opus+0.000
us+0.000
eds+0.000
OO+0.000
Neg logit contributions
 lick-0.000
 sill-0.000
 Sue-0.000
 shake-0.000
 crunch-0.000
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
ind+0.032
eds+0.032
 targets+0.032
opter+0.032
us+0.031
Neg logit contributions
 Sue-0.023
 lick-0.022
 sill-0.019
 shake-0.018
 pur-0.017
Token
Token ID198
Feature activation-9.23e-02

Pos logit contributions
 Sue+0.000
 lick+0.000
 sill+0.000
 Amy+0.000
 shake+0.000
Neg logit contributions
ind-0.000
 targets-0.000
eds-0.000
opter-0.000
opus-0.000
Token "
Token ID366
Feature activation-1.30e-01

Pos logit contributions
ind+0.018
eds+0.018
 targets+0.018
opter+0.018
us+0.018
Neg logit contributions
 Sue-0.014
 lick-0.013
 sill-0.011
 shake-0.011
 pur-0.010
TokenHello
Token ID15496
Feature activation+7.37e-01

Pos logit contributions
ind+0.005
 targets+0.005
eds+0.005
opter+0.005
us+0.005
Neg logit contributions
 Sue-0.004
 lick-0.004
 sill-0.003
 shake-0.003
 Amy-0.003
Token,
Token ID11
Feature activation-7.69e-01

Pos logit contributions
 Sue+0.025
 lick+0.021
 sill+0.019
 Amy+0.019
 Lucy+0.018
Neg logit contributions
eds-0.027
 targets-0.027
ind-0.026
us-0.026
opter-0.025
Token Mister
Token ID43438
Feature activation-1.91e+00

Pos logit contributions
eds+0.028
ind+0.026
 targets+0.026
us+0.026
opter+0.025
Neg logit contributions
 Sue-0.027
 lick-0.024
 sill-0.021
 shake-0.021
 nod-0.021
Token P
Token ID350
Feature activation-9.15e-01

Pos logit contributions
eds+0.057
ind+0.056
 targets+0.056
opter+0.055
us+0.054
Neg logit contributions
 Sue-0.072
 lick-0.067
 sill-0.062
 Amy-0.058
 pur-0.058
Tokenige
Token ID10045
Feature activation-4.02e+00

Pos logit contributions
 targets+0.026
eds+0.025
ind+0.025
us+0.024
opter+0.024
Neg logit contributions
 Sue-0.034
 lick-0.032
 sill-0.029
 wipe-0.028
 nod-0.028
Tokenon
Token ID261
Feature activation-4.83e-01

Pos logit contributions
ind+0.168
 targets+0.163
 culture+0.156
 perspective+0.155
utes+0.155
Neg logit contributions
 Sue-0.112
 lick-0.098
 She-0.078
 pr-0.077
 shake-0.077
Token!"
Token ID2474
Feature activation+5.06e-01

Pos logit contributions
ind+0.013
 targets+0.013
eds+0.013
us+0.013
opter+0.013
Neg logit contributions
 Sue-0.019
 lick-0.018
 sill-0.017
 pur-0.016
 shake-0.016
Token
Token ID198
Feature activation-4.62e-01

Pos logit contributions
 Sue+0.011
 lick+0.010
 sill+0.009
 shake+0.008
 crunch+0.008
Neg logit contributions
 targets-0.022
us-0.022
opter-0.022
ind-0.021
opus-0.021
Token
Token ID198
Feature activation-4.39e-01

Pos logit contributions
ind+0.021
 targets+0.021
eds+0.021
opter+0.020
OO+0.020
Neg logit contributions
 Sue-0.011
 lick-0.009
 sill-0.008
 Amy-0.008
 She-0.008
TokenThe
Token ID464
Feature activation+1.10e+00

Pos logit contributions
eds+0.017
 targets+0.017
ind+0.017
opter+0.017
OO+0.017
Neg logit contributions
 Sue-0.012
 lick-0.012
 sill-0.010
 shake-0.010
 She-0.009
Token to
Token ID284
Feature activation-2.64e-01

Pos logit contributions
 Sue+0.013
 lick+0.012
 sill+0.011
 shake+0.010
 pur+0.010
Neg logit contributions
ind-0.017
eds-0.017
opter-0.017
 targets-0.017
opus-0.017
Token see
Token ID766
Feature activation-5.46e-01

Pos logit contributions
 targets+0.010
ind+0.009
opter+0.009
 passport+0.009
eds+0.009
Neg logit contributions
 Sue-0.007
 lick-0.007
 sill-0.006
 crunch-0.006
 Together-0.006
Token the
Token ID262
Feature activation-8.79e-01

Pos logit contributions
ind+0.020
eds+0.020
opter+0.020
opus+0.020
OO+0.020
Neg logit contributions
 Sue-0.017
 lick-0.015
 sill-0.014
 Amy-0.013
 Lucy-0.013
Token rare
Token ID4071
Feature activation-1.85e+00

Pos logit contributions
eds+0.029
opter+0.029
ind+0.029
us+0.028
OO+0.027
Neg logit contributions
 Sue-0.031
 lick-0.029
 sill-0.027
 pur-0.025
 shake-0.025
Token books
Token ID3835
Feature activation-2.15e-01

Pos logit contributions
ind+0.059
eds+0.058
us+0.057
opter+0.057
 targets+0.055
Neg logit contributions
 Sue-0.068
 lick-0.065
 sill-0.060
 pur-0.056
 shake-0.056
Tokenhe
Token ID258
Feature activation-4.01e+00

Pos logit contributions
ind+0.008
eds+0.008
 targets+0.008
opter+0.008
us+0.008
Neg logit contributions
 Sue-0.006
 lick-0.006
 sill-0.005
 shake-0.005
 pur-0.005
Tokenlf
Token ID1652
Feature activation+7.70e-01

Pos logit contributions
ind+0.118
OO+0.113
opter+0.109
eds+0.109
us+0.106
Neg logit contributions
 Sue-0.167
 Amy-0.147
 lick-0.145
 sill-0.137
 Lucy-0.136
Token.
Token ID13
Feature activation+7.47e-01

Pos logit contributions
 Sue+0.021
 lick+0.019
 sill+0.017
 shake+0.016
 Amy+0.016
Neg logit contributions
ind-0.031
eds-0.031
opter-0.031
 targets-0.030
us-0.030
Token They
Token ID1119
Feature activation+1.49e-01

Pos logit contributions
 Sue+0.016
 lick+0.015
 sill+0.013
 shake+0.012
 pur+0.012
Neg logit contributions
ind-0.034
eds-0.033
 targets-0.033
opter-0.033
us-0.033
Token looked
Token ID3114
Feature activation-2.43e-01

Pos logit contributions
 Sue+0.004
 lick+0.004
 sill+0.003
 shake+0.003
 pur+0.003
Neg logit contributions
ind-0.006
eds-0.006
us-0.006
 targets-0.006
opter-0.006
Token at
Token ID379
Feature activation+1.57e+00

Pos logit contributions
eds+0.009
opter+0.009
ind+0.009
opus+0.009
us+0.009
Neg logit contributions
 Sue-0.008
 lick-0.007
 sill-0.006
 shake-0.006
 Amy-0.006
Token storm
Token ID6388
Feature activation+3.98e-01

Pos logit contributions
ind+0.009
eds+0.008
 targets+0.008
opter+0.008
us+0.008
Neg logit contributions
 Sue-0.009
 lick-0.008
 sill-0.008
 shake-0.007
 pur-0.007
Token came
Token ID1625
Feature activation+1.23e+00

Pos logit contributions
 Sue+0.011
 lick+0.010
 shake+0.009
 sill+0.009
 pur+0.008
Neg logit contributions
ind-0.017
 targets-0.016
eds-0.016
us-0.016
opus-0.016
Token and
Token ID290
Feature activation-2.92e-01

Pos logit contributions
 Sue+0.042
 lick+0.038
 sill+0.035
 shake+0.035
 pur+0.034
Neg logit contributions
opus-0.042
ind-0.042
opter-0.041
 targets-0.041
eds-0.040
Token the
Token ID262
Feature activation-9.98e-01

Pos logit contributions
ind+0.011
eds+0.011
 targets+0.011
opter+0.011
us+0.011
Neg logit contributions
 Sue-0.009
 lick-0.008
 sill-0.008
 shake-0.007
 pur-0.007
Token books
Token ID3835
Feature activation-4.24e-01

Pos logit contributions
eds+0.033
us+0.032
ind+0.032
opter+0.032
opus+0.031
Neg logit contributions
 Sue-0.036
 lick-0.034
 sill-0.031
 shake-0.030
 pur-0.029
Tokenhe
Token ID258
Feature activation-4.01e+00

Pos logit contributions
ind+0.018
eds+0.017
us+0.017
 targets+0.017
opus+0.017
Neg logit contributions
 Sue-0.011
 lick-0.011
 sill-0.009
 shake-0.009
 wipe-0.009
Tokenlf
Token ID1652
Feature activation+6.24e-01

Pos logit contributions
ind+0.123
 targets+0.119
us+0.118
opus+0.113
eds+0.113
Neg logit contributions
 Sue-0.162
 Amy-0.143
 lick-0.139
 Lucy-0.136
 sill-0.131
Token fell
Token ID3214
Feature activation+2.82e-01

Pos logit contributions
 Sue+0.019
 lick+0.018
 sill+0.016
 shake+0.015
 pur+0.015
Neg logit contributions
ind-0.024
eds-0.023
 targets-0.023
us-0.023
opter-0.022
Token over
Token ID625
Feature activation-3.62e-01

Pos logit contributions
 Sue+0.009
 lick+0.009
 sill+0.008
 nod+0.007
 pur+0.007
Neg logit contributions
ind-0.010
eds-0.010
 targets-0.010
opter-0.010
us-0.010
Token.
Token ID13
Feature activation+5.59e-01

Pos logit contributions
ind+0.015
eds+0.015
opter+0.015
 targets+0.015
opus+0.014
Neg logit contributions
 Sue-0.010
 lick-0.009
 sill-0.008
 shake-0.007
 Amy-0.007
Token All
Token ID1439
Feature activation-6.52e-02

Pos logit contributions
 lick+0.011
 Sue+0.010
 sill+0.009
 nod+0.008
 shake+0.008
Neg logit contributions
 targets-0.025
ind-0.025
opter-0.025
eds-0.025
OO-0.025
Token,
Token ID11
Feature activation-7.59e-01

Pos logit contributions
 Sue+0.027
 sill+0.025
 lick+0.024
 Amy+0.021
 shake+0.021
Neg logit contributions
ind-0.042
opter-0.041
opus-0.040
us-0.040
eds-0.040
Token or
Token ID393
Feature activation-7.73e-01

Pos logit contributions
ind+0.031
opter+0.030
us+0.030
eds+0.029
opus+0.029
Neg logit contributions
 Sue-0.024
 lick-0.021
 sill-0.020
 shake-0.019
 Amy-0.018
Token behind
Token ID2157
Feature activation-1.29e+00

Pos logit contributions
ind+0.031
opter+0.031
us+0.030
eds+0.030
opus+0.030
Neg logit contributions
 Sue-0.023
 lick-0.021
 sill-0.019
 shake-0.019
 Amy-0.018
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
opus+0.056
opter+0.056
OO+0.055
ind+0.054
us+0.054
Neg logit contributions
 Sue-0.036
 lick-0.028
 Amy-0.028
 Lucy-0.027
 sill-0.027
Token books
Token ID3835
Feature activation-7.83e-01

Pos logit contributions
ind+0.054
opter+0.053
eds+0.053
us+0.053
 targets+0.052
Neg logit contributions
 Sue-0.046
 lick-0.044
 sill-0.040
 shake-0.038
 pur-0.037
Tokenhe
Token ID258
Feature activation-4.00e+00

Pos logit contributions
ind+0.035
 targets+0.033
eds+0.033
ich+0.033
us+0.032
Neg logit contributions
 Sue-0.020
 lick-0.018
 sill-0.017
 She-0.016
 shake-0.016
Tokenlf
Token ID1652
Feature activation+6.22e-01

Pos logit contributions
ind+0.130
OO+0.120
 targets+0.117
us+0.116
 perspective+0.114
Neg logit contributions
 Sue-0.151
 lick-0.133
 Amy-0.130
 sill-0.124
 Lucy-0.123
Token,
Token ID11
Feature activation-5.44e-01

Pos logit contributions
 Sue+0.016
 sill+0.013
 lick+0.013
 Amy+0.012
 Lucy+0.011
Neg logit contributions
ind-0.027
opter-0.026
opus-0.026
 targets-0.026
OO-0.026
Token or
Token ID393
Feature activation-6.74e-01

Pos logit contributions
ind+0.021
opter+0.021
us+0.021
eds+0.021
opus+0.021
Neg logit contributions
 Sue-0.017
 lick-0.015
 sill-0.014
 shake-0.013
 wipe-0.013
Token in
Token ID287
Feature activation-1.41e-01

Pos logit contributions
ind+0.027
opter+0.027
eds+0.027
us+0.026
opus+0.026
Neg logit contributions
 Sue-0.019
 lick-0.018
 sill-0.016
 shake-0.015
 Amy-0.015
Token the
Token ID262
Feature activation-9.80e-01

Pos logit contributions
opus+0.006
ind+0.006
us+0.006
opter+0.006
eds+0.006
Neg logit contributions
 Sue-0.004
 lick-0.004
 sill-0.003
 shake-0.003
 Amy-0.003
Token is
Token ID318
Feature activation+4.54e-01

Pos logit contributions
 Sue+0.006
 lick+0.005
 sill+0.005
 wipe+0.005
 Amy+0.004
Neg logit contributions
ind-0.005
eds-0.005
 targets-0.005
opus-0.005
OO-0.004
Token hiding
Token ID11816
Feature activation-4.96e-01

Pos logit contributions
 lick+0.014
 Sue+0.014
+0.013
 sill+0.013
 wipe+0.012
Neg logit contributions
ind-0.015
eds-0.014
ich-0.014
 targets-0.014
us-0.014
Token behind
Token ID2157
Feature activation-1.23e+00

Pos logit contributions
opter+0.023
opus+0.023
us+0.022
ind+0.022
eds+0.021
Neg logit contributions
 Sue-0.013
 sill-0.010
 Amy-0.009
 lick-0.009
 Mia-0.008
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
opter+0.057
ind+0.057
opus+0.057
eds+0.056
us+0.056
Neg logit contributions
 Sue-0.029
 lick-0.025
 sill-0.022
 Amy-0.021
 shake-0.020
Token books
Token ID3835
Feature activation-5.30e-01

Pos logit contributions
ind+0.047
us+0.047
opter+0.047
eds+0.046
OO+0.045
Neg logit contributions
 Sue-0.039
 lick-0.037
 sill-0.033
 shake-0.032
 pur-0.031
Tokenhe
Token ID258
Feature activation-3.99e+00

Pos logit contributions
ind+0.023
eds+0.022
opter+0.022
us+0.022
opus+0.022
Neg logit contributions
 Sue-0.014
 lick-0.013
 sill-0.011
 shake-0.011
 Amy-0.010
Tokenlf
Token ID1652
Feature activation+8.26e-01

Pos logit contributions
ind+0.115
us+0.107
opter+0.105
OO+0.105
eds+0.101
Neg logit contributions
 Sue-0.169
 lick-0.150
 Amy-0.146
 sill-0.138
 shake-0.137
Token.
Token ID13
Feature activation-4.96e-01

Pos logit contributions
 Sue+0.020
 lick+0.017
 sill+0.016
 shake+0.014
 Amy+0.014
Neg logit contributions
ind-0.037
opter-0.036
opus-0.036
eds-0.036
us-0.036
Token The
Token ID383
Feature activation+7.34e-01

Pos logit contributions
us+0.017
 targets+0.017
opter+0.017
ind+0.016
eds+0.016
Neg logit contributions
 lick-0.015
 Sue-0.013
 sill-0.012
 wipe-0.012
 shake-0.012
Token cat
Token ID3797
Feature activation+7.66e-02

Pos logit contributions
 Sue+0.025
 lick+0.023
 sill+0.021
 shake+0.020
 pur+0.020
Neg logit contributions
ind-0.025
eds-0.025
opter-0.025
us-0.025
 targets-0.024
Token jumps
Token ID18045
Feature activation-3.75e-01

Pos logit contributions
 Sue+0.003
 lick+0.002
 sill+0.002
 wipe+0.002
 nod+0.002
Neg logit contributions
ind-0.002
eds-0.002
 targets-0.002
opus-0.002
opter-0.002
Token tiny
Token ID7009
Feature activation+1.20e-01

Pos logit contributions
ind+0.015
eds+0.015
us+0.015
 targets+0.015
opter+0.015
Neg logit contributions
 lick-0.012
 Sue-0.011
 shake-0.010
 crunch-0.010
 pur-0.009
Token pink
Token ID11398
Feature activation-5.32e-01

Pos logit contributions
 Sue+0.004
 lick+0.004
 sill+0.003
 shake+0.003
 pur+0.003
Neg logit contributions
ind-0.004
eds-0.004
us-0.004
 targets-0.004
opter-0.004
Token car
Token ID1097
Feature activation+7.46e-01

Pos logit contributions
ind+0.020
us+0.020
opter+0.019
eds+0.019
 targets+0.019
Neg logit contributions
 lick-0.016
 Sue-0.016
 pur-0.013
 sill-0.013
 shake-0.013
Token!
Token ID0
Feature activation-1.25e-02

Pos logit contributions
 Sue+0.017
 lick+0.015
 sill+0.014
 shake+0.012
 pur+0.011
Neg logit contributions
ind-0.034
opter-0.034
eds-0.033
us-0.033
opus-0.033
Token J
Token ID449
Feature activation-1.89e+00

Pos logit contributions
ind+0.001
eds+0.001
opter+0.001
us+0.001
opus+0.001
Neg logit contributions
 Sue-0.000
 lick-0.000
 sill-0.000
 She-0.000
 Together-0.000
Tokenax
Token ID897
Feature activation-3.99e+00

Pos logit contributions
ind+0.065
eds+0.065
opter+0.063
 targets+0.062
 perspective+0.060
Neg logit contributions
 Sue-0.064
 lick-0.059
 sill-0.053
 nod-0.050
 crunch-0.049
Token was
Token ID373
Feature activation+5.55e-02

Pos logit contributions
 targets+0.164
ind+0.162
eds+0.158
 perspective+0.157
opter+0.156
Neg logit contributions
 Sue-0.105
 lick-0.097
 pur-0.089
 shake-0.087
 nod-0.079
Token about
Token ID546
Feature activation-9.93e-01

Pos logit contributions
 lick+0.002
 Sue+0.002
 sill+0.001
 wipe+0.001
 nod+0.001
Neg logit contributions
ind-0.002
 targets-0.002
eds-0.002
us-0.002
ich-0.002
Token to
Token ID284
Feature activation+2.85e-02

Pos logit contributions
opter+0.047
opus+0.043
ind+0.043
eds+0.043
us+0.041
Neg logit contributions
 Sue-0.029
 lick-0.023
 Amy-0.021
 Lucy-0.021
 pur-0.021
Token swim
Token ID9422
Feature activation+9.58e-01

Pos logit contributions
 Sue+0.001
 sill+0.001
 Together+0.001
Deal+0.001
cakes+0.001
Neg logit contributions
 targets-0.001
ind-0.001
eds-0.001
us-0.001
opter-0.001
Token away
Token ID1497
Feature activation-1.80e+00

Pos logit contributions
 Sue+0.028
 lick+0.025
 sill+0.023
 shake+0.022
 Amy+0.022
Neg logit contributions
opter-0.038
ind-0.037
eds-0.037
opus-0.037
ich-0.036
Token and
Token ID290
Feature activation+8.06e-01

Pos logit contributions
 Sue+0.025
 lick+0.022
 She+0.020
 sill+0.020
 shake+0.019
Neg logit contributions
ind-0.039
eds-0.038
opter-0.038
 targets-0.038
opus-0.037
Token upset
Token ID9247
Feature activation-2.12e-01

Pos logit contributions
 Sue+0.027
 lick+0.027
 sill+0.025
 nod+0.023
 wipe+0.022
Neg logit contributions
ind-0.025
eds-0.024
 targets-0.024
opter-0.023
us-0.023
Token that
Token ID326
Feature activation-4.98e-02

Pos logit contributions
ind+0.008
eds+0.008
 targets+0.008
opter+0.008
us+0.008
Neg logit contributions
 Sue-0.006
 lick-0.006
 sill-0.005
 shake-0.005
 pur-0.005
Token her
Token ID607
Feature activation+5.84e-01

Pos logit contributions
ind+0.002
eds+0.002
 targets+0.002
opter+0.002
us+0.002
Neg logit contributions
 Sue-0.001
 lick-0.001
 sill-0.001
 shake-0.001
 pur-0.001
Token books
Token ID3835
Feature activation-6.70e-01

Pos logit contributions
 Sue+0.025
 lick+0.024
 sill+0.023
 wipe+0.022
 nod+0.021
Neg logit contributions
ind-0.014
eds-0.013
 targets-0.013
opter-0.013
ich-0.012
Tokenhe
Token ID258
Feature activation-3.98e+00

Pos logit contributions
ind+0.028
eds+0.027
us+0.027
 targets+0.027
opter+0.026
Neg logit contributions
 Sue-0.018
 lick-0.017
 sill-0.015
 shake-0.014
 wipe-0.014
Tokenlf
Token ID1652
Feature activation+2.34e-01

Pos logit contributions
ind+0.112
 targets+0.107
us+0.105
eds+0.103
opter+0.102
Neg logit contributions
 Sue-0.170
 lick-0.154
 Amy-0.150
 Lucy-0.143
 sill-0.142
Token was
Token ID373
Feature activation+1.03e+00

Pos logit contributions
 Sue+0.008
 lick+0.007
 sill+0.007
 shake+0.006
 pur+0.006
Neg logit contributions
ind-0.008
eds-0.008
 targets-0.008
opter-0.008
us-0.008
Token a
Token ID257
Feature activation+5.28e-01

Pos logit contributions
 Sue+0.035
 lick+0.033
 sill+0.030
 shake+0.028
 pur+0.028
Neg logit contributions
ind-0.035
eds-0.034
opter-0.034
 targets-0.034
us-0.033
Token mess
Token ID2085
Feature activation+1.14e-01

Pos logit contributions
 lick+0.016
 Sue+0.015
 shake+0.013
 crunch+0.013
 pur+0.013
Neg logit contributions
eds-0.022
us-0.021
ind-0.021
 targets-0.020
utes-0.020
Token.
Token ID13
Feature activation+6.50e-01

Pos logit contributions
 Sue+0.003
 lick+0.003
 sill+0.003
 shake+0.003
 Amy+0.002
Neg logit contributions
ind-0.004
eds-0.004
 targets-0.004
opter-0.004
us-0.004
Token red
Token ID2266
Feature activation+1.63e-01

Pos logit contributions
ind+0.006
eds+0.006
us+0.006
 targets+0.006
opter+0.006
Neg logit contributions
 Sue-0.006
 lick-0.006
 sill-0.005
 shake-0.005
 pur-0.005
Token ball
Token ID2613
Feature activation-1.12e-01

Pos logit contributions
 Sue+0.006
 lick+0.006
 sill+0.005
 shake+0.005
 pur+0.005
Neg logit contributions
us-0.005
ind-0.005
eds-0.005
 targets-0.005
opter-0.005
Token behind
Token ID2157
Feature activation-1.14e+00

Pos logit contributions
 targets+0.004
eds+0.004
ind+0.004
opus+0.004
opter+0.004
Neg logit contributions
 Sue-0.004
 lick-0.003
 sill-0.003
 nod-0.003
 pur-0.003
Token the
Token ID262
Feature activation-1.83e+00

Pos logit contributions
ind+0.048
eds+0.047
opter+0.047
 targets+0.047
us+0.046
Neg logit contributions
 Sue-0.030
 lick-0.028
 sill-0.025
 shake-0.022
 Amy-0.022
Token books
Token ID3835
Feature activation-6.22e-01

Pos logit contributions
ind+0.067
eds+0.066
us+0.066
opter+0.066
 targets+0.065
Neg logit contributions
 Sue-0.058
 lick-0.055
 sill-0.050
 shake-0.047
 Amy-0.046
Tokenhe
Token ID258
Feature activation-3.98e+00

Pos logit contributions
ind+0.027
eds+0.026
opter+0.026
us+0.026
 targets+0.026
Neg logit contributions
 Sue-0.016
 lick-0.015
 sill-0.013
 shake-0.012
 Amy-0.012
Tokenlf
Token ID1652
Feature activation+7.31e-01

Pos logit contributions
ind+0.121
us+0.111
 targets+0.110
OO+0.109
eds+0.108
Neg logit contributions
 Sue-0.164
 lick-0.144
 Amy-0.142
 sill-0.137
 Lucy-0.131
Token.
Token ID13
Feature activation+3.91e-01

Pos logit contributions
 Sue+0.019
 lick+0.017
 sill+0.015
 shake+0.014
 Amy+0.013
Neg logit contributions
ind-0.031
opter-0.030
eds-0.030
 targets-0.030
us-0.030
Token Tom
Token ID4186
Feature activation-1.24e+00

Pos logit contributions
 lick+0.009
 Sue+0.009
 sill+0.008
 wipe+0.007
 pur+0.007
Neg logit contributions
 targets-0.016
OO-0.016
opter-0.016
opus-0.015
eds-0.015
Token tried
Token ID3088
Feature activation-4.81e-02

Pos logit contributions
ind+0.053
eds+0.052
 targets+0.052
opter+0.051
 perspective+0.050
Neg logit contributions
 Sue-0.034
 lick-0.030
 shake-0.027
 pur-0.026
 Amy-0.025
Token to
Token ID284
Feature activation+1.58e+00

Pos logit contributions
opter+0.002
ind+0.002
opus+0.002
eds+0.002
 targets+0.002
Neg logit contributions
 Sue-0.002
 lick-0.001
 shake-0.001
 Amy-0.001
 sill-0.001
Token flowers
Token ID12734
Feature activation-5.81e-01

Pos logit contributions
ind+0.031
eds+0.031
 targets+0.031
opter+0.030
opus+0.029
Neg logit contributions
 Sue-0.027
 lick-0.026
 sill-0.025
 nod-0.022
 wipe-0.022
Token and
Token ID290
Feature activation-4.95e-01

Pos logit contributions
 targets+0.018
eds+0.018
ind+0.017
us+0.017
opter+0.017
Neg logit contributions
 Sue-0.020
 lick-0.019
 sill-0.018
 nod-0.017
 pur-0.017
Token taking
Token ID2263
Feature activation-2.36e-01

Pos logit contributions
 targets+0.018
eds+0.017
ind+0.017
opter+0.016
us+0.016
Neg logit contributions
 Sue-0.014
 lick-0.014
 sill-0.013
 nod-0.012
 wipe-0.012
Token in
Token ID287
Feature activation+5.92e-01

Pos logit contributions
ind+0.007
eds+0.007
 targets+0.007
opter+0.007
us+0.007
Neg logit contributions
 Sue-0.009
 lick-0.008
 sill-0.008
 shake-0.007
 pur-0.007
Token the
Token ID262
Feature activation-3.97e-01

Pos logit contributions
 Sue+0.017
 lick+0.016
 sill+0.014
 shake+0.013
 Amy+0.013
Neg logit contributions
ind-0.023
opter-0.023
eds-0.023
opus-0.023
 targets-0.022
Token truly
Token ID4988
Feature activation-3.98e+00

Pos logit contributions
ind+0.012
 targets+0.012
eds+0.011
 perspective+0.011
 passport+0.011
Neg logit contributions
 Sue-0.014
 lick-0.013
 sill-0.012
 nod-0.011
 Together-0.011
Token incredible
Token ID8082
Feature activation-1.43e+00

Pos logit contributions
eds+0.143
ind+0.142
 targets+0.142
opter+0.137
OO+0.132
Neg logit contributions
 Sue-0.114
 lick-0.108
 sill-0.095
 Together-0.089
 pur-0.087
Token view
Token ID1570
Feature activation-8.55e-01

Pos logit contributions
ind+0.048
eds+0.047
 targets+0.046
 perspective+0.045
opter+0.044
Neg logit contributions
 Sue-0.047
 lick-0.045
 sill-0.042
 shake-0.038
 wipe-0.038
Token around
Token ID1088
Feature activation-1.49e+00

Pos logit contributions
ind+0.032
eds+0.031
 targets+0.031
opter+0.031
us+0.031
Neg logit contributions
 Sue-0.025
 lick-0.024
 sill-0.021
 pur-0.020
 nod-0.020
Token him
Token ID683
Feature activation-6.35e-03

Pos logit contributions
opter+0.058
OO+0.057
opus+0.057
ind+0.057
eds+0.056
Neg logit contributions
 Sue-0.045
 lick-0.039
 sill-0.036
 Amy-0.035
 Lucy-0.034
Token.
Token ID13
Feature activation+3.52e-01

Pos logit contributions
ind+0.000
opter+0.000
 targets+0.000
opus+0.000
eds+0.000
Neg logit contributions
 Sue-0.000
 lick-0.000
 sill-0.000
 pur-0.000
 shake-0.000
Token ferocious
Token ID40831
Feature activation-1.30e+00

Pos logit contributions
 targets+0.013
eds+0.013
ind+0.013
us+0.013
opter+0.013
Neg logit contributions
 lick-0.010
 Sue-0.009
 shake-0.008
 pur-0.008
 crunch-0.008
Token purple
Token ID14032
Feature activation-6.90e-01

Pos logit contributions
ind+0.037
eds+0.036
 targets+0.036
opter+0.036
us+0.035
Neg logit contributions
 Sue-0.050
 lick-0.049
 sill-0.045
 shake-0.043
 pur-0.042
Token snake
Token ID17522
Feature activation+9.03e-03

Pos logit contributions
ind+0.024
eds+0.024
us+0.024
opter+0.023
 targets+0.023
Neg logit contributions
 lick-0.023
 Sue-0.023
 sill-0.021
 shake-0.020
 pur-0.019
Token!
Token ID0
Feature activation-4.61e-01

Pos logit contributions
 Sue+0.000
 lick+0.000
 sill+0.000
 shake+0.000
 pur+0.000
Neg logit contributions
ind-0.000
eds-0.000
opter-0.000
 targets-0.000
us-0.000
Token J
Token ID449
Feature activation-1.87e+00

Pos logit contributions
ind+0.021
eds+0.021
 targets+0.020
opter+0.020
opus+0.020
Neg logit contributions
 Sue-0.011
 lick-0.010
 sill-0.009
 shake-0.007
 Amy-0.007
Tokenax
Token ID897
Feature activation-3.94e+00

Pos logit contributions
eds+0.066
ind+0.065
 targets+0.064
opter+0.063
 perspective+0.060
Neg logit contributions
 Sue-0.061
 lick-0.056
 sill-0.051
 nod-0.048
 shake-0.046
Token was
Token ID373
Feature activation+3.03e-03

Pos logit contributions
 targets+0.157
ind+0.154
eds+0.148
opter+0.147
 perspective+0.146
Neg logit contributions
 Sue-0.116
 lick-0.108
 pur-0.097
 shake-0.095
 sill-0.092
Token terrified
Token ID22144
Feature activation-5.49e-01

Pos logit contributions
 Sue+0.000
 lick+0.000
 sill+0.000
 shake+0.000
 pur+0.000
Neg logit contributions
ind-0.000
eds-0.000
 targets-0.000
opter-0.000
us-0.000
Token and
Token ID290
Feature activation-1.88e-02

Pos logit contributions
eds+0.020
ind+0.020
opter+0.020
 targets+0.019
opus+0.019
Neg logit contributions
 Sue-0.018
 lick-0.016
 sill-0.015
 shake-0.015
 Amy-0.014
Token he
Token ID339
Feature activation-1.42e+00

Pos logit contributions
ind+0.001
eds+0.001
 targets+0.001
opter+0.001
us+0.001
Neg logit contributions
 Sue-0.001
 lick-0.001
 sill-0.000
 shake-0.000
 pur-0.000
Token ran
Token ID4966
Feature activation+3.61e-01

Pos logit contributions
ind+0.055
 targets+0.054
eds+0.054
opter+0.054
us+0.053
Neg logit contributions
 Sue-0.042
 lick-0.040
 pur-0.035
 sill-0.035
 shake-0.035
Token showed
Token ID3751
Feature activation+1.11e+00

Pos logit contributions
 lick+0.002
 Sue+0.002
 sill+0.002
 nod+0.002
 crunch+0.002
Neg logit contributions
 targets-0.003
eds-0.003
opter-0.003
ind-0.003
us-0.003
Token her
Token ID607
Feature activation+3.63e-01

Pos logit contributions
 lick+0.026
 Sue+0.025
 sill+0.023
 shake+0.020
 nod+0.020
Neg logit contributions
 targets-0.048
ind-0.047
eds-0.046
opter-0.045
us-0.045
Token friend
Token ID1545
Feature activation-2.68e-01

Pos logit contributions
 lick+0.012
 Sue+0.012
 sill+0.011
 nod+0.011
 shake+0.011
Neg logit contributions
eds-0.011
 targets-0.011
ind-0.011
opter-0.011
us-0.010
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
 targets+0.010
eds+0.010
ind+0.010
us+0.009
opter+0.009
Neg logit contributions
 lick-0.007
 Sue-0.007
 sill-0.007
-0.006
 shake-0.006
Token books
Token ID3835
Feature activation-2.18e-01

Pos logit contributions
ind+0.036
eds+0.035
 targets+0.035
opter+0.035
us+0.034
Neg logit contributions
 Sue-0.036
 lick-0.034
 sill-0.031
 shake-0.029
 pur-0.029
Tokenhe
Token ID258
Feature activation-3.94e+00

Pos logit contributions
 targets+0.007
opter+0.007
ind+0.007
eds+0.007
us+0.007
Neg logit contributions
 Sue-0.007
 lick-0.007
 sill-0.006
 pur-0.006
 nod-0.006
Tokenlf
Token ID1652
Feature activation+3.82e-01

Pos logit contributions
ind+0.105
 targets+0.098
eds+0.098
us+0.097
OO+0.095
Neg logit contributions
 Sue-0.174
 lick-0.159
 Amy-0.152
 sill-0.148
 Lucy-0.145
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
 lick+0.012
 Sue+0.011
 sill+0.010
 nod+0.010
 wipe+0.010
Neg logit contributions
ind-0.013
eds-0.013
 targets-0.013
opter-0.013
us-0.012
Token
Token ID198
Feature activation-1.52e-01

Pos logit contributions
 lick+0.004
 sill+0.004
 Sue+0.004
 shake+0.004
 crunch+0.004
Neg logit contributions
 targets-0.006
us-0.006
eds-0.006
opus-0.006
opter-0.006
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
ind+0.007
 targets+0.007
opter+0.007
eds+0.007
opus+0.007
Neg logit contributions
 Sue-0.004
 lick-0.003
 Amy-0.002
 She-0.002
 sill-0.002
TokenBut
Token ID1537
Feature activation-5.77e-02

Pos logit contributions
eds+0.006
 targets+0.006
ind+0.006
us+0.006
opter+0.006
Neg logit contributions
 Sue-0.004
 lick-0.004
 sill-0.004
 wipe-0.003
 She-0.003
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
ind+0.041
OO+0.041
opus+0.041
opter+0.040
ich+0.040
Neg logit contributions
 Sue-0.026
 lick-0.023
 sill-0.022
 Amy-0.020
 shake-0.019
Token books
Token ID3835
Feature activation+1.96e-01

Pos logit contributions
opus+0.011
ich+0.011
opter+0.010
OO+0.010
ind+0.010
Neg logit contributions
 Sue-0.012
 lick-0.011
 sill-0.010
 shake-0.010
 Amy-0.010
Token on
Token ID319
Feature activation-8.82e-01

Pos logit contributions
 Sue+0.006
 lick+0.005
 sill+0.005
 Amy+0.004
 shake+0.004
Neg logit contributions
ind-0.008
eds-0.008
 targets-0.007
opter-0.007
opus-0.007
Token the
Token ID262
Feature activation-4.52e-01

Pos logit contributions
opus+0.043
OO+0.042
ich+0.042
eds+0.042
opter+0.041
Neg logit contributions
 Sue-0.020
 lick-0.015
 sill-0.015
 Amy-0.014
 Lucy-0.014
Token books
Token ID3835
Feature activation-2.48e-02

Pos logit contributions
us+0.017
opter+0.017
opus+0.017
eds+0.017
ind+0.017
Neg logit contributions
 Sue-0.013
 lick-0.013
 sill-0.012
 shake-0.011
 Amy-0.011
Tokenhe
Token ID258
Feature activation-3.93e+00

Pos logit contributions
ind+0.001
eds+0.001
 targets+0.001
opter+0.001
opus+0.001
Neg logit contributions
 Sue-0.001
 lick-0.001
 sill-0.001
 Amy-0.000
 shake-0.000
Tokenlf
Token ID1652
Feature activation+1.22e+00

Pos logit contributions
ind+0.135
OO+0.124
us+0.123
opter+0.123
 targets+0.120
Neg logit contributions
 Sue-0.144
 Amy-0.127
 lick-0.123
 sill-0.122
 Lucy-0.115
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
 Sue+0.031
 lick+0.027
 sill+0.027
 Amy+0.024
 shake+0.023
Neg logit contributions
ind-0.052
eds-0.051
opter-0.051
opus-0.051
 targets-0.050
Token
Token ID198
Feature activation+2.44e-01

Pos logit contributions
 Sue+0.032
 lick+0.031
 sill+0.027
 shake+0.024
 pur+0.024
Neg logit contributions
ind-0.053
 targets-0.053
eds-0.052
opter-0.052
us-0.052
Token
Token ID198
Feature activation+2.03e-01

Pos logit contributions
 Sue+0.005
 lick+0.004
 Amy+0.003
 She+0.003
 Together+0.003
Neg logit contributions
ind-0.012
 targets-0.012
eds-0.012
 enemy-0.012
opter-0.012
TokenBilly
Token ID49640
Feature activation-4.51e-01

Pos logit contributions
 Sue+0.006
 lick+0.006
 She+0.005
 shake+0.005
 Together+0.005
Neg logit contributions
eds-0.008
ind-0.008
 targets-0.008
OO-0.007
opter-0.007
TokenSuddenly
Token ID38582
Feature activation+7.21e-01

Pos logit contributions
 targets+0.024
eds+0.024
ind+0.024
opter+0.024
OO+0.023
Neg logit contributions
 Sue-0.016
 lick-0.015
 sill-0.013
 She-0.012
 crunch-0.012
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
 Sue+0.018
 lick+0.014
 Amy+0.013
 Lucy+0.012
 shake+0.012
Neg logit contributions
eds-0.035
ind-0.035
 targets-0.033
ich-0.032
 perspective-0.031
Token he
Token ID339
Feature activation-1.35e+00

Pos logit contributions
ind+0.009
eds+0.009
 targets+0.008
opter+0.008
us+0.008
Neg logit contributions
 Sue-0.005
 lick-0.005
 sill-0.004
 shake-0.004
 Amy-0.004
Token heard
Token ID2982
Feature activation+1.00e-01

Pos logit contributions
opter+0.054
 perspective+0.053
ind+0.053
utes+0.053
 targets+0.053
Neg logit contributions
 Sue-0.037
 lick-0.036
 pur-0.035
 shake-0.034
 sill-0.032
Token a
Token ID257
Feature activation-3.89e-01

Pos logit contributions
 Sue+0.002
 lick+0.002
 sill+0.002
 pur+0.002
 Amy+0.002
Neg logit contributions
eds-0.005
ind-0.005
opter-0.005
opus-0.005
 targets-0.005
Token voice
Token ID3809
Feature activation+2.52e+00

Pos logit contributions
ind+0.014
eds+0.014
 targets+0.014
opter+0.014
us+0.014
Neg logit contributions
 Sue-0.012
 lick-0.011
 sill-0.010
 shake-0.009
 pur-0.009
Token from
Token ID422
Feature activation-2.15e-01

Pos logit contributions
 Sue+0.081
 lick+0.078
 sill+0.069
 crunch+0.065
 Amy+0.064
Neg logit contributions
ind-0.085
us-0.083
eds-0.083
 targets-0.081
opter-0.081
Token above
Token ID2029
Feature activation-1.24e+00

Pos logit contributions
ind+0.008
 targets+0.008
eds+0.007
us+0.007
opter+0.007
Neg logit contributions
 Sue-0.007
 lick-0.006
 sill-0.006
 pur-0.005
 nod-0.005
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
opter+0.051
ind+0.051
eds+0.050
opus+0.050
 targets+0.050
Neg logit contributions
 Sue-0.034
 lick-0.031
 sill-0.028
 shake-0.026
 nod-0.025
Token It
Token ID632
Feature activation+9.45e-01

Pos logit contributions
 Sue+0.007
 lick+0.007
 sill+0.006
 shake+0.005
 pur+0.005
Neg logit contributions
ind-0.011
eds-0.011
opter-0.011
 targets-0.011
us-0.011
Token was
Token ID373
Feature activation+1.24e-01

Pos logit contributions
 Sue+0.032
 lick+0.030
 sill+0.028
 shake+0.026
 wipe+0.025
Neg logit contributions
ind-0.032
eds-0.031
opter-0.031
 targets-0.031
us-0.030
Token.
Token ID13
Feature activation+1.71e-01

Pos logit contributions
ind+0.041
eds+0.040
 targets+0.040
opter+0.040
us+0.039
Neg logit contributions
 Sue-0.031
 lick-0.029
 sill-0.026
 shake-0.024
 pur-0.024
Token They
Token ID1119
Feature activation+8.37e-01

Pos logit contributions
 lick+0.004
 Sue+0.004
 sill+0.004
 shake+0.003
 wipe+0.003
Neg logit contributions
ind-0.006
 targets-0.006
us-0.006
eds-0.006
utes-0.006
Token have
Token ID423
Feature activation-3.71e-01

Pos logit contributions
 Sue+0.022
 lick+0.018
 sill+0.017
 Amy+0.016
 Lucy+0.015
Neg logit contributions
 targets-0.035
ind-0.034
opter-0.033
eds-0.033
 passport-0.032
Token a
Token ID257
Feature activation+1.46e+00

Pos logit contributions
ind+0.015
eds+0.015
opter+0.015
us+0.015
 targets+0.015
Neg logit contributions
 Sue-0.010
 lick-0.009
 sill-0.008
 pur-0.008
 shake-0.008
Token lot
Token ID1256
Feature activation-2.19e-01

Pos logit contributions
 Sue+0.048
 lick+0.047
 pur+0.041
 shake+0.040
 nod+0.040
Neg logit contributions
eds-0.052
ind-0.052
us-0.051
 targets-0.050
opter-0.049
Token of
Token ID286
Feature activation+2.68e+00

Pos logit contributions
 targets+0.007
ind+0.007
us+0.007
eds+0.007
opter+0.006
Neg logit contributions
 Sue-0.008
 lick-0.008
 sill-0.007
 nod-0.007
 pur-0.007
Token fun
Token ID1257
Feature activation+1.01e+00

Pos logit contributions
 Sue+0.072
 lick+0.068
 sill+0.060
 shake+0.056
 pur+0.055
Neg logit contributions
ind-0.109
eds-0.108
opter-0.107
opus-0.106
us-0.106
Token together
Token ID1978
Feature activation+1.07e+00

Pos logit contributions
 Sue+0.020
 lick+0.015
 sill+0.015
 shake+0.014
 pur+0.014
Neg logit contributions
opus-0.049
ind-0.048
us-0.048
OO-0.048
asper-0.048
Token.
Token ID13
Feature activation+4.94e-02

Pos logit contributions
 Sue+0.026
 lick+0.024
 sill+0.021
 Amy+0.019
 shake+0.019
Neg logit contributions
ind-0.048
opter-0.047
eds-0.047
opus-0.047
us-0.046
Token Anna
Token ID11735
Feature activation-2.80e-01

Pos logit contributions
 lick+0.001
 Sue+0.001
 sill+0.001
 shake+0.001
 wipe+0.001
Neg logit contributions
ind-0.002
 targets-0.002
eds-0.002
us-0.002
opter-0.002
Token thinks
Token ID6834
Feature activation-7.21e-01

Pos logit contributions
opter+0.011
ind+0.011
us+0.011
eds+0.010
OO+0.010
Neg logit contributions
 lick-0.008
 Sue-0.008
 pur-0.008
 nod-0.007
 sill-0.007
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
ind+0.032
eds+0.031
opter+0.031
 targets+0.031
us+0.031
Neg logit contributions
 Sue-0.026
 lick-0.024
 sill-0.022
 shake-0.020
 pur-0.020
Token
Token ID220
Feature activation-1.13e+00

Pos logit contributions
 lick+0.004
 Sue+0.004
 sill+0.004
 shake+0.004
 wipe+0.003
Neg logit contributions
us-0.007
ind-0.007
eds-0.007
 targets-0.007
opter-0.007
Token
Token ID198
Feature activation-2.02e-01

Pos logit contributions
ind+0.044
eds+0.043
 targets+0.043
opter+0.043
us+0.043
Neg logit contributions
 Sue-0.032
 lick-0.030
 sill-0.027
 shake-0.025
 pur-0.025
Token
Token ID198
Feature activation-2.59e-01

Pos logit contributions
ind+0.008
 targets+0.008
eds+0.008
opter+0.008
OO+0.008
Neg logit contributions
 Sue-0.006
 lick-0.005
 sill-0.005
 pur-0.004
 Amy-0.004
TokenAfter
Token ID3260
Feature activation-8.16e-01

Pos logit contributions
ind+0.010
eds+0.010
 targets+0.010
opter+0.010
OO+0.010
Neg logit contributions
 Sue-0.007
 lick-0.007
 sill-0.006
 nod-0.005
 pur-0.005
Token a
Token ID257
Feature activation+2.59e+00

Pos logit contributions
ind+0.031
eds+0.030
 targets+0.030
opter+0.030
us+0.030
Neg logit contributions
 Sue-0.024
 lick-0.023
 sill-0.021
 shake-0.019
 pur-0.019
Token while
Token ID981
Feature activation+1.36e+00

Pos logit contributions
 Sue+0.083
 lick+0.082
 crunch+0.067
 shake+0.066
 pur+0.066
Neg logit contributions
eds-0.098
ind-0.098
 targets-0.095
opter-0.092
us-0.091
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
 Sue+0.041
 lick+0.039
 sill+0.035
 shake+0.032
 pur+0.032
Neg logit contributions
ind-0.051
eds-0.050
 targets-0.049
opter-0.049
us-0.049
Token pigeon
Token ID46387
Feature activation+7.08e-01

Pos logit contributions
 Sue+0.058
 lick+0.055
 sill+0.051
 shake+0.049
 nod+0.048
Neg logit contributions
eds-0.040
ind-0.039
opter-0.038
us-0.038
 targets-0.038
Token fl
Token ID781
Feature activation-3.80e-01

Pos logit contributions
 Sue+0.022
 lick+0.020
 sill+0.018
 shake+0.017
 pur+0.017
Neg logit contributions
ind-0.027
eds-0.026
 targets-0.026
opter-0.026
us-0.026
Tokenapped
Token ID6320
Feature activation+2.67e+00

Pos logit contributions
ind+0.013
 targets+0.012
eds+0.012
opter+0.012
us+0.012
Neg logit contributions
 Sue-0.013
 lick-0.012
 sill-0.011
 shake-0.010
 nod-0.010
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
opus+0.042
eds+0.042
ind+0.041
opter+0.041
us+0.041
Neg logit contributions
 Sue-0.030
 lick-0.026
 sill-0.024
 shake-0.023
 Amy-0.023
Token new
Token ID649
Feature activation-1.01e+00

Pos logit contributions
us+0.040
OO+0.038
eds+0.038
fer+0.036
ich+0.036
Neg logit contributions
 Sue-0.035
 lick-0.033
 shake-0.032
 sill-0.032
 crunch-0.031
Token world
Token ID995
Feature activation-6.25e-01

Pos logit contributions
us+0.035
opus+0.032
eds+0.032
opter+0.030
ino+0.030
Neg logit contributions
 Sue-0.037
 sill-0.033
 lick-0.033
 shake-0.032
 nod-0.030
Token.
Token ID13
Feature activation+5.72e-01

Pos logit contributions
ind+0.023
eds+0.023
opter+0.023
 targets+0.022
us+0.022
Neg logit contributions
 Sue-0.019
 lick-0.018
 sill-0.016
 shake-0.015
 Amy-0.015
Token Using
Token ID8554
Feature activation-1.39e-01

Pos logit contributions
 Sue+0.011
 lick+0.010
 sill+0.009
 shake+0.008
 pur+0.007
Neg logit contributions
ind-0.027
 targets-0.027
eds-0.027
opter-0.027
us-0.027
Token their
Token ID511
Feature activation+2.50e+00

Pos logit contributions
ind+0.006
eds+0.006
opus+0.005
opter+0.005
OO+0.005
Neg logit contributions
 Sue-0.004
 lick-0.004
 sill-0.003
 Amy-0.003
 shake-0.003
Token imag
Token ID3590
Feature activation+1.74e+00

Pos logit contributions
 Sue+0.080
 lick+0.077
 sill+0.069
 shake+0.067
 nod+0.067
Neg logit contributions
us-0.090
ind-0.090
eds-0.089
opter-0.089
opus-0.089
Tokeninations
Token ID7352
Feature activation+5.15e-01

Pos logit contributions
 Sue+0.083
 lick+0.081
 sill+0.076
 shake+0.073
 nod+0.072
Neg logit contributions
ind-0.032
 targets-0.032
eds-0.031
ich-0.030
opter-0.030
Token,
Token ID11
Feature activation+3.48e-01

Pos logit contributions
 Sue+0.015
 lick+0.014
 sill+0.013
 shake+0.012
 nod+0.012
Neg logit contributions
ind-0.021
eds-0.020
opus-0.020
opter-0.020
OO-0.020
Token they
Token ID484
Feature activation-9.78e-01

Pos logit contributions
 Sue+0.010
 lick+0.010
 sill+0.009
 shake+0.008
 Amy+0.008
Neg logit contributions
ind-0.013
eds-0.013
opter-0.013
 targets-0.013
us-0.013
Token played
Token ID2826
Feature activation+1.26e-01

Pos logit contributions
ind+0.040
eds+0.039
opus+0.038
us+0.038
OO+0.038
Neg logit contributions
 Sue-0.027
 lick-0.026
 sill-0.023
 shake-0.023
 nod-0.022
Token long
Token ID890
Feature activation+1.69e+00

Pos logit contributions
 Sue+0.028
 lick+0.023
 Lucy+0.021
 Amy+0.020
 shake+0.019
Neg logit contributions
eds-0.040
ind-0.039
 targets-0.039
opus-0.038
opter-0.038
Token as
Token ID355
Feature activation+7.70e-01

Pos logit contributions
 Sue+0.048
 lick+0.044
 sill+0.038
 shake+0.036
 Amy+0.036
Neg logit contributions
ind-0.069
eds-0.069
 targets-0.066
opter-0.065
us-0.065
Token we
Token ID356
Feature activation+1.58e+00

Pos logit contributions
 Sue+0.023
 lick+0.020
 sill+0.018
 Amy+0.017
 Lucy+0.017
Neg logit contributions
eds-0.031
ind-0.031
 targets-0.031
OO-0.030
opter-0.030
Tokenâ
Token ID22940
Feature activation+1.42e+00

Pos logit contributions
 Sue+0.040
 lick+0.039
 sill+0.033
 shake+0.033
 nod+0.032
Neg logit contributions
ind-0.067
eds-0.065
 targets-0.064
opter-0.064
us-0.064
Token
Token ID26391
Feature activation+3.61e-01

Pos logit contributions
 Sue+0.046
 lick+0.041
 sill+0.039
 crunch+0.037
 pur+0.037
Neg logit contributions
opter-0.048
ind-0.047
us-0.047
OO-0.047
eds-0.047
Token
Token ID8151
Feature activation+3.22e+00

Pos logit contributions
 Sue+0.012
 lick+0.011
+0.010
 Amy+0.010
 She+0.010
Neg logit contributions
ind-0.013
eds-0.012
 targets-0.012
us-0.012
opter-0.012
Tokenre
Token ID260
Feature activation+3.06e-01

Pos logit contributions
 Sue+0.117
 lick+0.104
 sill+0.093
 She+0.092
 Amy+0.092
Neg logit contributions
ind-0.110
 targets-0.109
eds-0.103
 culture-0.100
opter-0.100
Token together
Token ID1978
Feature activation+1.94e+00

Pos logit contributions
 Sue+0.009
 lick+0.008
 sill+0.007
 shake+0.007
 nod+0.007
Neg logit contributions
ind-0.012
opus-0.012
opter-0.012
 targets-0.012
eds-0.012
Token,
Token ID11
Feature activation+4.83e-02

Pos logit contributions
 Sue+0.048
 lick+0.044
 Lucy+0.037
 wipe+0.036
 nod+0.036
Neg logit contributions
ind-0.087
 targets-0.086
eds-0.086
opter-0.084
opus-0.083
Token we
Token ID356
Feature activation-2.11e-01

Pos logit contributions
 Sue+0.002
 lick+0.002
 sill+0.001
 wipe+0.001
 shake+0.001
Neg logit contributions
ind-0.002
eds-0.002
 targets-0.002
opter-0.002
us-0.002
Token can
Token ID460
Feature activation+1.81e-01

Pos logit contributions
ind+0.008
eds+0.008
ich+0.008
opus+0.008
us+0.008
Neg logit contributions
 lick-0.006
 Sue-0.006
 shake-0.005
 wipe-0.005
 nod-0.005
Token "
Token ID366
Feature activation-4.22e-01

Pos logit contributions
us+0.012
ino+0.011
opter+0.011
 targets+0.011
opus+0.011
Neg logit contributions
 lick-0.014
 Sue-0.013
 wipe-0.012
 sill-0.012
 shake-0.012
TokenThat
Token ID2504
Feature activation-1.30e+00

Pos logit contributions
ind+0.016
eds+0.016
 targets+0.016
opter+0.016
us+0.016
Neg logit contributions
 Sue-0.012
 lick-0.012
 sill-0.010
 pur-0.010
 shake-0.010
Token was
Token ID373
Feature activation+1.02e+00

Pos logit contributions
ind+0.049
us+0.047
eds+0.047
opter+0.044
 targets+0.044
Neg logit contributions
 Sue-0.042
 lick-0.039
 sill-0.037
 shake-0.036
 pur-0.035
Token a
Token ID257
Feature activation-1.51e-03

Pos logit contributions
 Sue+0.030
 lick+0.029
 sill+0.026
 nod+0.024
 shake+0.024
Neg logit contributions
ind-0.038
eds-0.037
 targets-0.037
us-0.036
opter-0.036
Token close
Token ID1969
Feature activation-2.92e-01

Pos logit contributions
eds+0.000
opter+0.000
ind+0.000
us+0.000
 targets+0.000
Neg logit contributions
 Sue-0.000
 lick-0.000
 sill-0.000
 crunch-0.000
 shake-0.000
Token call
Token ID869
Feature activation+2.94e-01

Pos logit contributions
ind+0.010
eds+0.010
us+0.010
opter+0.010
opus+0.010
Neg logit contributions
 Sue-0.010
 lick-0.009
 sill-0.008
 shake-0.008
 nod-0.008
Token.
Token ID13
Feature activation-7.09e-02

Pos logit contributions
 Sue+0.009
 lick+0.008
 sill+0.008
 Amy+0.007
 shake+0.007
Neg logit contributions
ind-0.011
 targets-0.011
eds-0.011
opter-0.011
opus-0.011
Token You
Token ID921
Feature activation-4.60e-01

Pos logit contributions
us+0.002
ind+0.002
opter+0.002
opus+0.002
ich+0.002
Neg logit contributions
 lick-0.002
 Sue-0.002
 shake-0.002
 pur-0.002
 crunch-0.002
Token were
Token ID547
Feature activation+1.13e+00

Pos logit contributions
ind+0.019
eds+0.018
us+0.018
 targets+0.018
opus+0.017
Neg logit contributions
 Sue-0.013
 lick-0.013
 shake-0.011
 sill-0.011
 wipe-0.010
Token brave
Token ID14802
Feature activation-1.02e-01

Pos logit contributions
 Sue+0.033
 lick+0.031
 sill+0.028
 shake+0.026
 pur+0.026
Neg logit contributions
ind-0.043
opter-0.043
eds-0.043
 targets-0.042
opus-0.042
Token."
Token ID526
Feature activation+7.28e-01

Pos logit contributions
opter+0.004
eds+0.004
ind+0.004
opus+0.004
OO+0.004
Neg logit contributions
 Sue-0.003
 sill-0.003
 lick-0.003
 Amy-0.003
 Lucy-0.002
Token ow
Token ID12334
Feature activation+6.88e-01

Pos logit contributions
ind+0.006
eds+0.005
 targets+0.005
opter+0.005
us+0.005
Neg logit contributions
 Sue-0.004
 lick-0.004
 sill-0.003
 shake-0.003
 pur-0.003
Token!"
Token ID2474
Feature activation+1.21e+00

Pos logit contributions
 Sue+0.019
 lick+0.018
 shake+0.015
 sill+0.015
 Amy+0.014
Neg logit contributions
ind-0.028
 targets-0.028
eds-0.027
opter-0.026
us-0.026
Token Ben
Token ID3932
Feature activation-1.28e+00

Pos logit contributions
 Sue+0.029
 lick+0.028
 sill+0.024
 shake+0.022
 pur+0.021
Neg logit contributions
ind-0.051
 targets-0.050
eds-0.050
opter-0.050
us-0.050
Token says
Token ID1139
Feature activation-1.41e+00

Pos logit contributions
eds+0.036
 targets+0.035
ind+0.035
us+0.035
opus+0.034
Neg logit contributions
 Sue-0.048
 lick-0.046
 sill-0.043
 wipe-0.039
-0.039
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
us+0.036
eds+0.033
ind+0.032
 targets+0.031
OO+0.031
Neg logit contributions
 lick-0.053
 Sue-0.051
 ju-0.047
Deal-0.047
 sill-0.046
Token He
Token ID679
Feature activation+3.51e-01

Pos logit contributions
us+0.040
opus+0.038
OO+0.038
eds+0.038
opter+0.038
Neg logit contributions
 lick-0.029
 Sue-0.028
 sill-0.025
 wipe-0.024
 blush-0.023
Token is
Token ID318
Feature activation+4.18e-01

Pos logit contributions
 Sue+0.012
 lick+0.011
 sill+0.011
 Amy+0.010
 Lucy+0.010
Neg logit contributions
eds-0.011
ind-0.011
 targets-0.011
OO-0.010
us-0.010
Token very
Token ID845
Feature activation-1.27e+00

Pos logit contributions
 Sue+0.013
 lick+0.013
 sill+0.012
 nod+0.011
 Together+0.011
Neg logit contributions
eds-0.014
ind-0.014
us-0.013
 targets-0.013
ich-0.013
Token hurt
Token ID5938
Feature activation-8.11e-01

Pos logit contributions
ind+0.041
eds+0.040
us+0.039
ich+0.037
OO+0.036
Neg logit contributions
 Sue-0.044
 lick-0.042
 sill-0.039
-0.037
 Together-0.037
Token.
Token ID13
Feature activation-4.19e-01

Pos logit contributions
ind+0.030
eds+0.029
us+0.029
 targets+0.028
opter+0.028
Neg logit contributions
 lick-0.024
 Sue-0.024
 sill-0.021
 pur-0.020
 nod-0.020
Token He
Token ID679
Feature activation+7.18e-01

Pos logit contributions
us+0.017
eds+0.016
ind+0.016
opter+0.016
opus+0.016
Neg logit contributions
 lick-0.010
 Sue-0.010
 sill-0.009
 shake-0.008
 wipe-0.008
Token opened
Token ID4721
Feature activation+1.93e+00

Pos logit contributions
ind+0.005
opter+0.005
utes+0.004
 targets+0.004
 perspective+0.004
Neg logit contributions
 Sue-0.003
 lick-0.002
 pur-0.002
 shake-0.002
 sill-0.002
Token the
Token ID262
Feature activation+2.83e-01

Pos logit contributions
 Sue+0.044
 lick+0.036
 sill+0.032
 Amy+0.031
 shake+0.030
Neg logit contributions
opus-0.093
OO-0.092
opter-0.091
ind-0.091
eds-0.090
Token box
Token ID3091
Feature activation+1.40e+00

Pos logit contributions
 Sue+0.009
 lick+0.008
 sill+0.008
 shake+0.007
 wipe+0.007
Neg logit contributions
ind-0.011
eds-0.011
us-0.010
opter-0.010
opus-0.010
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 Sue+0.051
 lick+0.048
 sill+0.045
 shake+0.042
 Amy+0.041
Neg logit contributions
ind-0.044
eds-0.044
opter-0.043
 targets-0.043
us-0.042
Token and
Token ID290
Feature activation-3.50e-02

Pos logit contributions
ind+0.036
eds+0.035
 targets+0.035
opter+0.034
us+0.034
Neg logit contributions
 Sue-0.037
 lick-0.035
 sill-0.032
 pur-0.030
 shake-0.030
Token there
Token ID612
Feature activation+1.83e+00

Pos logit contributions
ind+0.001
eds+0.001
opter+0.001
opus+0.001
 targets+0.001
Neg logit contributions
 Sue-0.001
 lick-0.001
 sill-0.001
 Amy-0.001
 shake-0.001
Token was
Token ID373
Feature activation-2.14e-01

Pos logit contributions
 Sue+0.056
 lick+0.052
 sill+0.047
 shake+0.044
 Amy+0.044
Neg logit contributions
ind-0.070
eds-0.068
 targets-0.067
opter-0.066
us-0.066
Token his
Token ID465
Feature activation+9.85e-01

Pos logit contributions
ind+0.009
eds+0.009
 targets+0.009
opter+0.009
us+0.009
Neg logit contributions
 Sue-0.005
 lick-0.005
 sill-0.004
 shake-0.004
 pur-0.004
Token uniform
Token ID8187
Feature activation-1.96e-01

Pos logit contributions
 Sue+0.032
 lick+0.028
 sill+0.027
 shake+0.026
 nod+0.025
Neg logit contributions
opter-0.038
eds-0.037
us-0.037
ind-0.036
OO-0.036
Token!
Token ID0
Feature activation-1.94e-01

Pos logit contributions
ind+0.009
eds+0.008
us+0.008
opter+0.008
opus+0.008
Neg logit contributions
 Sue-0.005
 lick-0.005
 sill-0.004
 shake-0.004
 pur-0.004
Token Everyone
Token ID11075
Feature activation-5.31e-01

Pos logit contributions
opter+0.008
 targets+0.008
us+0.008
OO+0.008
ind+0.008
Neg logit contributions
 lick-0.005
 Sue-0.004
 sill-0.004
 pur-0.004
 wipe-0.004
Token pizza
Token ID14256
Feature activation+4.31e-01

Pos logit contributions
 Sue+0.043
 lick+0.042
 crunch+0.037
 sill+0.036
 milk+0.035
Neg logit contributions
ich-0.048
OO-0.048
opter-0.047
eds-0.047
opus-0.046
Token.
Token ID13
Feature activation+4.80e-01

Pos logit contributions
 Sue+0.011
 lick+0.009
 sill+0.008
 Amy+0.007
 crunch+0.007
Neg logit contributions
ind-0.020
eds-0.019
opter-0.019
OO-0.019
 targets-0.018
Token Lily
Token ID20037
Feature activation+4.05e-01

Pos logit contributions
 lick+0.012
 Sue+0.011
 sill+0.010
 pur+0.010
 shake+0.010
Neg logit contributions
 targets-0.019
us-0.018
ind-0.018
eds-0.018
opus-0.018
Token wanted
Token ID2227
Feature activation-6.63e-01

Pos logit contributions
 Sue+0.011
 lick+0.010
 pur+0.009
 Amy+0.009
 She+0.009
Neg logit contributions
eds-0.017
ind-0.017
opter-0.017
 targets-0.017
 perspective-0.016
Token cheese
Token ID9891
Feature activation+4.56e-01

Pos logit contributions
opter+0.028
ind+0.027
OO+0.026
eds+0.026
opus+0.026
Neg logit contributions
 Sue-0.020
 lick-0.018
 Amy-0.016
 sill-0.016
 milk-0.015
Token pizza
Token ID14256
Feature activation+9.88e-01

Pos logit contributions
 Sue+0.015
 lick+0.014
 sill+0.013
 pur+0.012
 shake+0.012
Neg logit contributions
ind-0.015
eds-0.015
 targets-0.015
us-0.015
opter-0.015
Token,
Token ID11
Feature activation-5.55e-01

Pos logit contributions
 Sue+0.030
 lick+0.029
 sill+0.026
 shake+0.024
 pur+0.024
Neg logit contributions
ind-0.036
eds-0.036
 targets-0.035
opter-0.035
us-0.035
Token but
Token ID475
Feature activation-4.71e-01

Pos logit contributions
 targets+0.020
opus+0.019
eds+0.019
us+0.019
ind+0.019
Neg logit contributions
 Sue-0.015
 lick-0.015
 sill-0.014
 pur-0.012
 wipe-0.012
Token Ben
Token ID3932
Feature activation-4.81e-01

Pos logit contributions
ind+0.021
eds+0.021
opter+0.021
 targets+0.021
us+0.021
Neg logit contributions
 Sue-0.011
 lick-0.010
 sill-0.008
 shake-0.007
 Amy-0.007
Token wanted
Token ID2227
Feature activation-7.04e-01

Pos logit contributions
eds+0.020
ind+0.019
opter+0.019
 targets+0.019
us+0.019
Neg logit contributions
 Sue-0.014
 lick-0.013
 pur-0.011
 shake-0.011
 nod-0.011
Token pepper
Token ID13385
Feature activation+2.24e-01

Pos logit contributions
ind+0.025
opter+0.025
eds+0.025
 targets+0.024
opus+0.024
Neg logit contributions
 Sue-0.023
 lick-0.021
 sill-0.019
 shake-0.018
 Amy-0.018
Token
Token ID198
Feature activation+2.04e-01

Pos logit contributions
 Sue+0.008
 lick+0.007
 sill+0.007
 Amy+0.006
 shake+0.006
Neg logit contributions
ind-0.012
eds-0.012
 targets-0.012
opter-0.012
opus-0.012
Token"
Token ID1
Feature activation+1.93e-01

Pos logit contributions
 Sue+0.006
 lick+0.006
 sill+0.006
 shake+0.005
 pur+0.005
Neg logit contributions
ind-0.007
eds-0.007
 targets-0.007
opter-0.007
us-0.007
TokenHello
Token ID15496
Feature activation+1.02e+00

Pos logit contributions
 Sue+0.007
 lick+0.006
 sill+0.006
 shake+0.006
 Together+0.006
Neg logit contributions
opus-0.006
 musician-0.006
ind-0.006
 targets-0.006
ich-0.006
Token,
Token ID11
Feature activation-3.57e-01

Pos logit contributions
 Sue+0.032
 lick+0.031
 sill+0.028
 shake+0.026
 wipe+0.025
Neg logit contributions
ind-0.036
eds-0.035
opter-0.035
 targets-0.035
us-0.035
Token Lily
Token ID20037
Feature activation+7.11e-01

Pos logit contributions
ind+0.013
opter+0.013
opus+0.013
us+0.012
eds+0.012
Neg logit contributions
 lick-0.010
 Sue-0.010
 sill-0.010
 nod-0.008
 shake-0.008
Token,"
Token ID553
Feature activation+1.20e+00

Pos logit contributions
 Sue+0.027
 lick+0.027
 sill+0.025
 shake+0.023
 nod+0.023
Neg logit contributions
ind-0.020
eds-0.020
opter-0.019
us-0.019
 targets-0.019
Token said
Token ID531
Feature activation-7.21e-01

Pos logit contributions
 sill+0.040
 lick+0.039
+0.038
 Sue+0.037
coat+0.034
Neg logit contributions
 targets-0.032
opter-0.030
ind-0.030
 musician-0.030
opus-0.030
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
 musician+0.015
 targets+0.014
 passport+0.013
us+0.013
 dolphin+0.012
Neg logit contributions
 sill-0.027
 lick-0.027
-0.026
Deal-0.026
drops-0.024
Token bear
Token ID6842
Feature activation+6.04e-01

Pos logit contributions
ind+0.000
opus+0.000
ich+0.000
ino+0.000
us+0.000
Neg logit contributions
 Sue-0.001
 lick-0.001
 sill-0.001
 Amy-0.001
-0.001
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
 Sue+0.022
 lick+0.022
+0.021
Grab+0.020
 sill+0.020
Neg logit contributions
eds-0.014
us-0.014
ind-0.014
ich-0.013
opus-0.013
Token "
Token ID366
Feature activation-5.81e-01

Pos logit contributions
OO+0.002
opus+0.002
us+0.002
 pad+0.002
 ago+0.002
Neg logit contributions
 lick-0.007
 sill-0.007
 wipe-0.007
 shake-0.007
hetically-0.007
Tokenila
Token ID10102
Feature activation+6.09e-01

Pos logit contributions
 Sue+0.025
 lick+0.024
 sill+0.021
 shake+0.020
 wipe+0.020
Neg logit contributions
ind-0.022
 targets-0.021
eds-0.021
opter-0.020
us-0.020
Token,
Token ID11
Feature activation-5.69e-02

Pos logit contributions
 Sue+0.020
 lick+0.018
 sill+0.016
 pur+0.015
 shake+0.015
Neg logit contributions
ind-0.022
 targets-0.022
eds-0.022
us-0.021
opter-0.021
Token our
Token ID674
Feature activation-9.83e-01

Pos logit contributions
ind+0.002
eds+0.002
us+0.002
 targets+0.002
opter+0.002
Neg logit contributions
 Sue-0.002
 lick-0.002
 shake-0.002
 Amy-0.002
 wipe-0.002
Token tower
Token ID10580
Feature activation+2.42e-01

Pos logit contributions
us+0.040
ind+0.039
eds+0.039
opter+0.038
 targets+0.036
Neg logit contributions
 Sue-0.028
 lick-0.027
 sill-0.024
 shake-0.024
 crunch-0.024
Token is
Token ID318
Feature activation+1.01e+00

Pos logit contributions
 Sue+0.009
 lick+0.009
 sill+0.008
 shake+0.008
 pur+0.008
Neg logit contributions
ind-0.007
eds-0.007
 targets-0.007
opter-0.007
us-0.007
Token so
Token ID523
Feature activation-3.73e-01

Pos logit contributions
 Sue+0.027
 lick+0.024
 sill+0.022
 shake+0.021
 crunch+0.020
Neg logit contributions
ind-0.043
eds-0.042
us-0.041
opter-0.041
 targets-0.041
Token tall
Token ID7331
Feature activation+3.04e-01

Pos logit contributions
opter+0.014
ind+0.014
eds+0.014
 targets+0.013
us+0.013
Neg logit contributions
 Sue-0.011
 lick-0.010
 sill-0.009
 nod-0.009
 wipe-0.009
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
 Sue+0.013
 lick+0.012
 sill+0.011
 Amy+0.010
 shake+0.010
Neg logit contributions
ind-0.008
eds-0.008
 targets-0.008
opter-0.008
us-0.008
Token pretty
Token ID2495
Feature activation-6.67e-01

Pos logit contributions
 Sue+0.008
 lick+0.008
 sill+0.007
 nod+0.006
+0.006
Neg logit contributions
opter-0.010
ind-0.010
 targets-0.010
eds-0.010
opus-0.010
Token.
Token ID13
Feature activation-5.51e-01

Pos logit contributions
eds+0.020
ind+0.020
 targets+0.019
us+0.019
OO+0.019
Neg logit contributions
 Sue-0.025
 lick-0.024
 sill-0.022
 shake-0.021
 Amy-0.021
Token We
Token ID775
Feature activation+7.55e-01

Pos logit contributions
opter+0.016
ind+0.016
 targets+0.016
opus+0.016
us+0.016
Neg logit contributions
 lick-0.019
 Sue-0.018
 pur-0.016
 shake-0.016
 sill-0.016
Token cheered
Token ID34767
Feature activation+2.67e-01

Pos logit contributions
ind+0.086
OO+0.084
 targets+0.083
eds+0.082
ich+0.082
Neg logit contributions
 Sue-0.055
 lick-0.051
 pur-0.046
 shake-0.044
 Lucy-0.044
Token for
Token ID329
Feature activation-1.15e+00

Pos logit contributions
 Sue+0.007
 lick+0.006
 sill+0.006
 shake+0.006
 Amy+0.006
Neg logit contributions
eds-0.011
opus-0.011
opter-0.011
ind-0.011
 targets-0.011
Token him
Token ID683
Feature activation+7.30e-01

Pos logit contributions
ind+0.040
eds+0.040
opter+0.039
 targets+0.039
opus+0.039
Neg logit contributions
 Sue-0.038
 lick-0.036
 sill-0.032
 shake-0.030
 Amy-0.030
Token.
Token ID13
Feature activation+4.68e-01

Pos logit contributions
 Sue+0.020
 lick+0.019
 sill+0.017
 shake+0.016
 pur+0.015
Neg logit contributions
ind-0.029
eds-0.029
opter-0.029
 targets-0.029
opus-0.028
Token
Token ID198
Feature activation-4.08e-01

Pos logit contributions
 Sue+0.011
 lick+0.009
 sill+0.008
 Amy+0.008
 She+0.008
Neg logit contributions
ind-0.022
eds-0.021
opter-0.021
 targets-0.021
opus-0.021
Token
Token ID198
Feature activation-4.09e-01

Pos logit contributions
ind+0.019
 targets+0.019
opter+0.019
OO+0.019
 enemy+0.018
Neg logit contributions
 Sue-0.009
 lick-0.007
 Amy-0.007
 She-0.006
 Lucy-0.006
TokenThe
Token ID464
Feature activation+8.29e-01

Pos logit contributions
OO+0.017
ind+0.017
eds+0.017
 targets+0.016
ich+0.016
Neg logit contributions
 Sue-0.010
 lick-0.010
 Amy-0.008
 sill-0.008
 She-0.008
Token young
Token ID1862
Feature activation-1.01e+00

Pos logit contributions
 Sue+0.030
 lick+0.029
 sill+0.026
 shake+0.026
 nod+0.026
Neg logit contributions
eds-0.027
us-0.026
ind-0.026
opter-0.025
OO-0.025
Token boy
Token ID2933
Feature activation-2.03e-01

Pos logit contributions
eds+0.031
OO+0.029
us+0.029
ind+0.028
utes+0.027
Neg logit contributions
 Sue-0.038
 lick-0.036
 nod-0.034
 sill-0.033
 Amy-0.033
Token was
Token ID373
Feature activation+9.61e-03

Pos logit contributions
ind+0.009
eds+0.008
 targets+0.008
utes+0.008
OO+0.008
Neg logit contributions
 Sue-0.005
 lick-0.005
 pur-0.005
 sill-0.005
 shake-0.004
Token no
Token ID645
Feature activation+5.89e-01

Pos logit contributions
 Sue+0.000
 lick+0.000
 sill+0.000
 shake+0.000
 pur+0.000
Neg logit contributions
ind-0.000
eds-0.000
 targets-0.000
opter-0.000
us-0.000
Token sliding
Token ID22292
Feature activation-3.82e-01

Pos logit contributions
ind+0.040
opter+0.039
eds+0.038
opus+0.038
 targets+0.037
Neg logit contributions
 Sue-0.035
 lick-0.032
 sill-0.028
 shake-0.027
 Amy-0.027
Token down
Token ID866
Feature activation-2.20e+00

Pos logit contributions
ind+0.018
opter+0.018
opus+0.018
us+0.018
eds+0.018
Neg logit contributions
 Sue-0.008
 lick-0.007
 sill-0.006
 shake-0.006
 Amy-0.005
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
ind+0.085
eds+0.084
 targets+0.084
opter+0.082
us+0.082
Neg logit contributions
 Sue-0.062
 lick-0.059
 sill-0.053
 pur-0.048
 shake-0.048
Token Tim
Token ID5045
Feature activation-2.51e-01

Pos logit contributions
 lick+0.009
 sill+0.009
 wipe+0.008
 shake+0.008
 blush+0.008
Neg logit contributions
OO-0.011
eds-0.010
 targets-0.010
ich-0.010
opter-0.010
Token and
Token ID290
Feature activation+6.60e-02

Pos logit contributions
ind+0.011
opter+0.010
OO+0.010
 targets+0.010
eds+0.010
Neg logit contributions
 Sue-0.007
 lick-0.006
 pur-0.006
 shake-0.005
 sill-0.005
Token Sue
Token ID26089
Feature activation-8.49e-01

Pos logit contributions
+0.001
 wipe+0.001
 sill+0.001
 blush+0.001
 rinse+0.001
Neg logit contributions
eds-0.003
ich-0.003
 targets-0.003
 perspective-0.003
 musician-0.003
Token became
Token ID2627
Feature activation+2.42e-01

Pos logit contributions
eds+0.031
opter+0.031
ind+0.031
 targets+0.031
us+0.030
Neg logit contributions
 Sue-0.025
 lick-0.023
 sill-0.022
 Amy-0.019
 wipe-0.019
Token good
Token ID922
Feature activation+6.63e-01

Pos logit contributions
 lick+0.009
 Sue+0.008
 sill+0.008
 wipe+0.007
 pur+0.007
Neg logit contributions
 targets-0.007
ind-0.007
ich-0.007
us-0.007
eds-0.007
Token friends
Token ID2460
Feature activation+8.17e-01

Pos logit contributions
 Sue+0.024
 lick+0.023
 sill+0.021
 nod+0.020
 shake+0.020
Neg logit contributions
eds-0.021
opter-0.021
ind-0.021
OO-0.020
us-0.020
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
 Sue+0.027
 lick+0.025
 sill+0.023
 shake+0.021
 pur+0.021
Neg logit contributions
ind-0.028
eds-0.028
opter-0.028
 targets-0.028
us-0.027
Token played
Token ID2826
Feature activation+9.15e-02

Pos logit contributions
 targets+0.004
eds+0.004
ind+0.004
opter+0.004
us+0.004
Neg logit contributions
 lick-0.004
 Sue-0.004
 sill-0.004
 wipe-0.003
 shake-0.003
Token inside
Token ID2641
Feature activation-8.86e-01

Pos logit contributions
 Sue+0.023
 lick+0.022
 sill+0.020
 shake+0.019
 Amy+0.019
Neg logit contributions
ind-0.023
opter-0.023
eds-0.023
opus-0.023
 targets-0.023
Token and
Token ID290
Feature activation-4.34e-01

Pos logit contributions
ind+0.036
opter+0.035
opus+0.035
 targets+0.035
eds+0.035
Neg logit contributions
 Sue-0.027
 lick-0.024
 sill-0.022
 Amy-0.021
 shake-0.021
Token saw
Token ID2497
Feature activation-8.28e-02

Pos logit contributions
ind+0.014
eds+0.013
 targets+0.013
opter+0.013
opus+0.013
Neg logit contributions
 Sue-0.016
 lick-0.015
 sill-0.014
 shake-0.013
 Amy-0.013
Token a
Token ID257
Feature activation-9.01e-01

Pos logit contributions
ind+0.003
 targets+0.003
opter+0.003
eds+0.003
us+0.003
Neg logit contributions
 Sue-0.002
 lick-0.002
 sill-0.002
 nod-0.002
 shake-0.002
Token little
Token ID1310
Feature activation+6.21e-02

Pos logit contributions
eds+0.036
 targets+0.036
ind+0.036
us+0.035
OO+0.033
Neg logit contributions
 Sue-0.026
 lick-0.025
 shake-0.022
 pur-0.021
 crunch-0.021
Token kitten
Token ID42143
Feature activation-2.17e-01

Pos logit contributions
 Sue+0.002
 lick+0.002
 sill+0.002
 shake+0.002
 pur+0.002
Neg logit contributions
ind-0.002
eds-0.002
 targets-0.002
us-0.002
opter-0.002
Token hiding
Token ID11816
Feature activation-5.91e-01

Pos logit contributions
ind+0.007
eds+0.007
 targets+0.007
opter+0.007
us+0.007
Neg logit contributions
 Sue-0.007
 lick-0.007
 sill-0.006
 nod-0.006
 shake-0.006
Token there
Token ID612
Feature activation+1.29e+00

Pos logit contributions
ind+0.021
eds+0.020
 targets+0.020
opter+0.020
us+0.020
Neg logit contributions
 Sue-0.019
 lick-0.018
 sill-0.016
 shake-0.015
 pur-0.015
Token.
Token ID13
Feature activation+3.38e-01

Pos logit contributions
 Sue+0.036
 lick+0.034
 sill+0.030
 shake+0.028
 pur+0.028
Neg logit contributions
ind-0.051
eds-0.050
opter-0.050
 targets-0.050
us-0.049
Token "
Token ID366
Feature activation-3.44e-01

Pos logit contributions
 lick+0.007
 Sue+0.007
 sill+0.006
 shake+0.005
 nod+0.005
Neg logit contributions
ind-0.014
opter-0.014
eds-0.014
 targets-0.014
opus-0.014
TokenOh
Token ID5812
Feature activation-2.08e-01

Pos logit contributions
ind+0.012
eds+0.011
 targets+0.011
opter+0.011
us+0.011
Neg logit contributions
 Sue-0.011
 lick-0.011
 sill-0.010
 nod-0.009
 shake-0.009
Token,
Token ID11
Feature activation-2.62e-01

Pos logit contributions
ind+0.046
eds+0.045
 targets+0.045
opter+0.045
us+0.044
Neg logit contributions
 Sue-0.040
 lick-0.037
 sill-0.034
 shake-0.032
 pur-0.032
Token beyond
Token ID3675
Feature activation-1.51e+00

Pos logit contributions
ind+0.010
eds+0.010
 targets+0.010
opter+0.009
us+0.009
Neg logit contributions
 Sue-0.008
 lick-0.007
 sill-0.007
 shake-0.006
 nod-0.006
Token the
Token ID262
Feature activation-3.02e-01

Pos logit contributions
ind+0.060
eds+0.059
opter+0.059
 targets+0.059
us+0.058
Neg logit contributions
 Sue-0.042
 lick-0.039
 sill-0.035
 shake-0.032
 pur-0.032
Token leaves
Token ID5667
Feature activation-4.88e-01

Pos logit contributions
us+0.012
eds+0.012
opter+0.012
OO+0.011
opus+0.011
Neg logit contributions
 Sue-0.009
 lick-0.009
 sill-0.008
 shake-0.008
 pur-0.007
Token and
Token ID290
Feature activation-2.59e-01

Pos logit contributions
ind+0.017
eds+0.016
 targets+0.016
opter+0.016
us+0.016
Neg logit contributions
 Sue-0.016
 lick-0.015
 sill-0.014
 shake-0.013
 pur-0.013
Token grass
Token ID8701
Feature activation-1.06e+00

Pos logit contributions
 targets+0.009
ind+0.009
eds+0.009
opter+0.009
ich+0.009
Neg logit contributions
 Sue-0.008
 lick-0.007
 sill-0.007
 nod-0.006
 wipe-0.006
Token.
Token ID13
Feature activation+5.72e-01

Pos logit contributions
ind+0.037
eds+0.037
 targets+0.036
opter+0.036
us+0.035
Neg logit contributions
 Sue-0.033
 lick-0.032
 sill-0.028
 wipe-0.026
 Amy-0.026
Token
Token ID198
Feature activation-6.88e-02

Pos logit contributions
 lick+0.017
 Sue+0.016
 sill+0.015
 crunch+0.014
 wipe+0.014
Neg logit contributions
 targets-0.021
opter-0.020
ind-0.020
eds-0.020
us-0.020
Token
Token ID198
Feature activation-1.45e-01

Pos logit contributions
ind+0.003
 targets+0.003
eds+0.003
opter+0.003
opus+0.003
Neg logit contributions
 Sue-0.002
 lick-0.002
 sill-0.001
 Amy-0.001
 She-0.001
TokenAll
Token ID3237
Feature activation+4.39e-01

Pos logit contributions
eds+0.005
ind+0.005
 targets+0.005
opus+0.005
opter+0.005
Neg logit contributions
 Sue-0.005
 lick-0.004
 sill-0.004
 She-0.004
 shake-0.004
Tokeny
Token ID88
Feature activation-5.31e-01

Pos logit contributions
 Sue+0.015
 lick+0.015
 sill+0.013
 pur+0.012
 wipe+0.012
Neg logit contributions
ind-0.014
eds-0.014
 targets-0.014
opter-0.014
us-0.014
Token Mia
Token ID32189
Feature activation-5.00e-01

Pos logit contributions
us+0.006
eds+0.006
opus+0.006
ind+0.006
ino+0.006
Neg logit contributions
 lick-0.006
 wipe-0.006
 blush-0.006
 shake-0.005
 crunch-0.005
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
opter+0.018
eds+0.017
ind+0.017
us+0.017
 targets+0.017
Neg logit contributions
 Sue-0.016
 lick-0.016
 pur-0.014
 sill-0.014
 nod-0.014
Token Leo
Token ID19632
Feature activation-1.22e+00

Pos logit contributions
 lick+0.009
 wipe+0.008
 Sue+0.008
 sill+0.008
+0.007
Neg logit contributions
 targets-0.010
ico-0.010
opus-0.009
 enemy-0.009
us-0.009
Token love
Token ID1842
Feature activation-4.88e-01

Pos logit contributions
ind+0.048
 targets+0.048
eds+0.047
opter+0.047
us+0.046
Neg logit contributions
 Sue-0.033
 lick-0.029
 sill-0.028
 Amy-0.024
 wipe-0.023
Token Sam
Token ID3409
Feature activation-2.53e-01

Pos logit contributions
opter+0.019
ind+0.018
opus+0.018
OO+0.018
us+0.018
Neg logit contributions
 Sue-0.016
 lick-0.014
 sill-0.013
 Amy-0.012
 pur-0.012
Token very
Token ID845
Feature activation-2.12e+00

Pos logit contributions
ind+0.009
eds+0.009
 targets+0.009
us+0.009
opter+0.009
Neg logit contributions
 Sue-0.008
 lick-0.008
 sill-0.007
 shake-0.006
 wipe-0.006
Token much
Token ID881
Feature activation-5.70e-01

Pos logit contributions
ind+0.066
us+0.062
opter+0.061
eds+0.060
 targets+0.058
Neg logit contributions
 lick-0.076
 Sue-0.074
 sill-0.069
 wipe-0.067
 nod-0.065
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
ind+0.022
eds+0.022
 targets+0.022
opter+0.022
us+0.022
Neg logit contributions
 Sue-0.016
 lick-0.015
 sill-0.013
 shake-0.012
 pur-0.012
Token
Token ID198
Feature activation-1.76e-01

Pos logit contributions
eds+0.010
 targets+0.010
ind+0.010
us+0.010
opus+0.010
Neg logit contributions
 lick-0.009
 wipe-0.009
 sill-0.008
 Sue-0.008
 blush-0.008
Token
Token ID198
Feature activation-7.86e-02

Pos logit contributions
ind+0.007
eds+0.007
 targets+0.007
opter+0.007
us+0.007
Neg logit contributions
 Sue-0.005
 lick-0.004
 sill-0.004
 shake-0.004
 Amy-0.004
TokenOne
Token ID3198
Feature activation+7.25e-01

Pos logit contributions
ind+0.003
eds+0.003
 targets+0.003
us+0.003
opter+0.003
Neg logit contributions
 Sue-0.002
 lick-0.002
 sill-0.002
 wipe-0.002
 pur-0.002
Token,
Token ID11
Feature activation-1.12e+00

Pos logit contributions
 targets+0.009
ind+0.009
opter+0.008
eds+0.008
 perspective+0.008
Neg logit contributions
 Sue-0.008
 lick-0.007
 sill-0.006
 shake-0.006
 Amy-0.006
Token you
Token ID345
Feature activation-1.06e+00

Pos logit contributions
ind+0.047
 targets+0.046
eds+0.046
us+0.045
opter+0.045
Neg logit contributions
 Sue-0.033
 lick-0.029
 sill-0.026
 shake-0.025
 Amy-0.024
Token must
Token ID1276
Feature activation-1.77e+00

Pos logit contributions
ind+0.049
eds+0.047
 targets+0.047
us+0.046
ich+0.046
Neg logit contributions
 Sue-0.025
 lick-0.024
 shake-0.021
 nod-0.019
 sill-0.018
Token reach
Token ID3151
Feature activation-7.76e-01

Pos logit contributions
ind+0.070
opter+0.069
eds+0.069
 targets+0.068
us+0.066
Neg logit contributions
 Sue-0.045
 lick-0.042
 sill-0.039
 blush-0.035
 pur-0.034
Token me
Token ID502
Feature activation+5.40e-01

Pos logit contributions
opter+0.038
opus+0.036
OO+0.034
ind+0.034
onda+0.034
Neg logit contributions
 Sue-0.020
 lick-0.015
 shake-0.013
 Lucy-0.013
 Amy-0.013
Token!
Token ID0
Feature activation-2.22e+00

Pos logit contributions
 Sue+0.021
 lick+0.020
 sill+0.019
 shake+0.018
 wipe+0.017
Neg logit contributions
opter-0.015
 targets-0.015
ind-0.015
opus-0.014
eds-0.014
Token But
Token ID887
Feature activation-6.68e-01

Pos logit contributions
ind+0.072
us+0.071
opter+0.070
opus+0.070
eds+0.069
Neg logit contributions
 lick-0.071
 Sue-0.071
 sill-0.063
 pur-0.062
 crunch-0.061
Token be
Token ID307
Feature activation+4.62e-01

Pos logit contributions
ind+0.024
eds+0.023
 targets+0.023
opter+0.023
us+0.023
Neg logit contributions
 Sue-0.021
 lick-0.020
 sill-0.018
 shake-0.017
 pur-0.017
Token careful
Token ID8161
Feature activation-2.21e-01

Pos logit contributions
 Sue+0.016
 lick+0.016
 sill+0.014
 shake+0.013
 pur+0.013
Neg logit contributions
ind-0.015
eds-0.014
 targets-0.014
opter-0.014
us-0.014
Token,
Token ID11
Feature activation+5.41e-01

Pos logit contributions
 targets+0.007
ind+0.007
opter+0.007
eds+0.007
opus+0.007
Neg logit contributions
 Sue-0.008
 lick-0.007
 sill-0.007
 shake-0.006
 Amy-0.006
Token the
Token ID262
Feature activation-4.22e-01

Pos logit contributions
 Sue+0.018
 lick+0.016
 sill+0.015
 shake+0.015
 Amy+0.014
Neg logit contributions
ind-0.020
opter-0.020
eds-0.019
us-0.019
 targets-0.019
Token was
Token ID373
Feature activation+3.02e-01

Pos logit contributions
ind+0.101
eds+0.097
ich+0.096
 targets+0.096
us+0.092
Neg logit contributions
 lick-0.053
 Sue-0.051
 pur-0.045
 wipe-0.043
 shake-0.043
Token in
Token ID287
Feature activation+7.65e-01

Pos logit contributions
 Sue+0.009
 lick+0.009
 sill+0.008
 shake+0.007
 nod+0.007
Neg logit contributions
ind-0.011
eds-0.011
 targets-0.011
opter-0.010
us-0.010
Token charge
Token ID3877
Feature activation-1.03e+00

Pos logit contributions
 Sue+0.022
 lick+0.021
 sill+0.018
 shake+0.017
 pur+0.017
Neg logit contributions
ind-0.029
 targets-0.029
eds-0.029
opter-0.029
us-0.028
Token of
Token ID286
Feature activation+1.03e+00

Pos logit contributions
ind+0.040
eds+0.040
opter+0.039
opus+0.039
ich+0.039
Neg logit contributions
 Sue-0.031
 lick-0.028
 sill-0.026
 Amy-0.024
 shake-0.024
Token making
Token ID1642
Feature activation+2.90e-01

Pos logit contributions
 Sue+0.033
 lick+0.032
 sill+0.028
 shake+0.027
 Amy+0.027
Neg logit contributions
opter-0.038
opus-0.037
ind-0.037
eds-0.036
us-0.036
Token sure
Token ID1654
Feature activation-2.30e+00

Pos logit contributions
 Sue+0.009
 lick+0.008
 sill+0.007
 pur+0.007
 Amy+0.007
Neg logit contributions
ind-0.011
opter-0.011
eds-0.011
opus-0.011
ich-0.011
Token there
Token ID612
Feature activation+3.80e-01

Pos logit contributions
ind+0.090
opter+0.089
opus+0.089
OO+0.088
eds+0.087
Neg logit contributions
 Sue-0.074
 lick-0.063
 sill-0.059
 Amy-0.059
 Lucy-0.058
Token were
Token ID547
Feature activation+3.88e-01

Pos logit contributions
 Sue+0.012
 lick+0.012
 sill+0.011
 shake+0.010
 pur+0.010
Neg logit contributions
ind-0.013
eds-0.013
 targets-0.013
opter-0.013
us-0.013
Token no
Token ID645
Feature activation+4.80e-01

Pos logit contributions
 Sue+0.012
 lick+0.011
 sill+0.010
 shake+0.009
 pur+0.009
Neg logit contributions
ind-0.015
opter-0.015
eds-0.014
 targets-0.014
opus-0.014
Token distractions
Token ID38085
Feature activation-1.85e+00

Pos logit contributions
 Sue+0.016
 lick+0.015
 sill+0.014
 shake+0.013
 pur+0.013
Neg logit contributions
ind-0.016
opter-0.016
eds-0.016
us-0.016
opus-0.016
Token.
Token ID13
Feature activation+6.76e-01

Pos logit contributions
ind+0.087
opter+0.083
opus+0.083
eds+0.083
us+0.082
Neg logit contributions
 Sue-0.042
 lick-0.036
 sill-0.035
 shake-0.030
 She-0.030
Token t
Token ID256
Feature activation-6.22e-01

Pos logit contributions
 Sue+0.001
 lick+0.001
 sill+0.001
 shake+0.001
 pur+0.001
Neg logit contributions
ind-0.002
eds-0.002
opter-0.002
 targets-0.002
us-0.002
Tokeneddy
Token ID21874
Feature activation-5.89e-01

Pos logit contributions
 targets+0.015
 culture+0.014
OO+0.013
 perspective+0.013
opter+0.013
Neg logit contributions
 lick-0.025
 Sue-0.025
 sill-0.024
 She-0.023
 wipe-0.022
Token bear
Token ID6842
Feature activation+6.85e-01

Pos logit contributions
ind+0.023
eds+0.023
 targets+0.023
opter+0.023
us+0.022
Neg logit contributions
 Sue-0.017
 lick-0.016
 sill-0.014
 shake-0.013
 pur-0.013
Token and
Token ID290
Feature activation-6.15e-01

Pos logit contributions
 Sue+0.025
 lick+0.024
 sill+0.022
 shake+0.021
 pur+0.021
Neg logit contributions
ind-0.021
eds-0.020
 targets-0.020
opter-0.020
us-0.020
Token ran
Token ID4966
Feature activation+1.50e-02

Pos logit contributions
eds+0.023
 targets+0.022
ind+0.022
opter+0.022
us+0.022
Neg logit contributions
 Sue-0.018
 lick-0.017
 sill-0.016
 nod-0.014
 shake-0.014
Token back
Token ID736
Feature activation-2.66e+00

Pos logit contributions
 Sue+0.000
 lick+0.000
 sill+0.000
 shake+0.000
 Amy+0.000
Neg logit contributions
ind-0.001
opter-0.001
eds-0.001
opus-0.001
 targets-0.001
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
opter+0.081
ind+0.080
opus+0.080
 targets+0.079
eds+0.078
Neg logit contributions
 Sue-0.102
 lick-0.095
 sill-0.087
 Amy-0.083
 shake-0.082
Token his
Token ID465
Feature activation-5.59e-01

Pos logit contributions
eds+0.037
ind+0.036
 targets+0.036
us+0.035
opter+0.034
Neg logit contributions
 lick-0.028
 Sue-0.028
 sill-0.027
 crunch-0.024
 nod-0.023
Token house
Token ID2156
Feature activation-1.04e-01

Pos logit contributions
opter+0.021
us+0.020
OO+0.020
opus+0.019
eds+0.019
Neg logit contributions
 Sue-0.018
 lick-0.018
 sill-0.015
 pur-0.015
 shake-0.015
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
eds+0.004
ind+0.004
 targets+0.004
us+0.004
opter+0.004
Neg logit contributions
 Sue-0.003
 lick-0.003
 sill-0.003
 nod-0.002
 pur-0.002
Token He
Token ID679
Feature activation+2.11e-01

Pos logit contributions
ind+0.005
us+0.005
eds+0.005
 targets+0.005
opter+0.005
Neg logit contributions
 lick-0.003
 Sue-0.002
 sill-0.002
 shake-0.002
 nod-0.002
Token basket
Token ID7988
Feature activation+4.59e-02

Pos logit contributions
 Sue+0.023
 lick+0.022
 sill+0.019
 shake+0.017
 pur+0.017
Neg logit contributions
ind-0.040
eds-0.040
 targets-0.039
opter-0.039
us-0.039
Token from
Token ID422
Feature activation+5.42e-01

Pos logit contributions
 Sue+0.002
 lick+0.002
 pur+0.002
 blush+0.002
 nod+0.002
Neg logit contributions
eds-0.001
ind-0.001
 targets-0.001
us-0.001
opter-0.001
Token Lily
Token ID20037
Feature activation+1.47e+00

Pos logit contributions
 lick+0.013
 Sue+0.013
 sill+0.012
 nod+0.011
 pur+0.011
Neg logit contributions
 targets-0.022
ind-0.022
eds-0.021
us-0.021
ich-0.021
Token and
Token ID290
Feature activation-1.42e-01

Pos logit contributions
 Sue+0.046
 lick+0.043
 sill+0.039
 shake+0.036
 Amy+0.036
Neg logit contributions
opter-0.057
ind-0.055
opus-0.054
us-0.053
eds-0.053
Token ran
Token ID4966
Feature activation+1.63e-03

Pos logit contributions
 targets+0.005
eds+0.005
us+0.005
ich+0.004
 passport+0.004
Neg logit contributions
 sill-0.004
 lick-0.004
 Sue-0.004
 nod-0.004
-0.003
Token away
Token ID1497
Feature activation-3.16e+00

Pos logit contributions
 Sue+0.000
 lick+0.000
 sill+0.000
 shake+0.000
 Amy+0.000
Neg logit contributions
ind-0.000
opter-0.000
eds-0.000
opus-0.000
 targets-0.000
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
ind+0.126
eds+0.124
 targets+0.123
opter+0.123
us+0.121
Neg logit contributions
 Sue-0.087
 lick-0.082
 sill-0.073
 shake-0.067
 pur-0.067
Token He
Token ID679
Feature activation+2.78e-01

Pos logit contributions
 targets+0.011
us+0.011
ind+0.011
eds+0.011
opter+0.011
Neg logit contributions
 lick-0.007
 Sue-0.006
 sill-0.006
 shake-0.005
 pur-0.005
Token laughed
Token ID13818
Feature activation-6.29e-01

Pos logit contributions
 Sue+0.008
 lick+0.008
 sill+0.007
 shake+0.006
 pur+0.006
Neg logit contributions
ind-0.011
eds-0.010
 targets-0.010
opter-0.010
us-0.010
Token and
Token ID290
Feature activation-3.20e-02

Pos logit contributions
ind+0.020
eds+0.020
opter+0.020
 targets+0.020
us+0.019
Neg logit contributions
 Sue-0.023
 lick-0.021
 sill-0.020
 shake-0.019
 pur-0.018
Token said
Token ID531
Feature activation-1.36e+00

Pos logit contributions
 targets+0.001
eds+0.001
us+0.001
opter+0.001
ich+0.001
Neg logit contributions
 Sue-0.001
 sill-0.001
 lick-0.001
 nod-0.001
 wipe-0.001