TOP ACTIVATIONS
MAX = 5.582

and up the grand stair ways . Everywhere she looked ,
with glasses and a pony tail . She is reading a
and the tires and the mirrors . They saw their faces
adventure . <|endoftext|> John and Tara were playing in the park
seek . Tom was the seeker and Lily was the h
. The bike rider stopped . He was angry
garden . Lily is the seeker and Ben is the h
that in your mouth ? Is that a bone ?"
Tim became the best bike rider in the whole town .
the people cl apped in amaz ement . The singing rock
jumped up and grinned in amaz ement .
." The bike rider calmed down . He saw
and buck led her seat belt . The plane took off
park . Lily was the seeker and Tom was the h
park . Anna is the seeker and Ben is the h
nap . <|endoftext|> Eric and Alison liked to play together .
said sorry to the bike rider . They cleaned up the
izz ed off back into outer space . Earth was an
park . Tim was the seeker and Lily was the h
park . Tom was the seeker and Mia was the h

MIN ACTIVATIONS
MIN = -5.589

continued to scatter the bread cr umbs . <|endoftext|> Once upon
it was difficult finding r ags and water . But Bob
mum gave him some bread cr umbs . Jack smiled as
and make them go " cr unch ".
is stronger . He dr ags Lily into the pond .
love you " and " cr acker ". He also likes
" You can have a bite ". The mouse tasted the
face and the coat dr ags on the floor .
They offered it some bread cr umbs and it happily ate
opens it . He dr ags Sue to the street .
, " we use our tails to help us balance when
if she could have a bite . Her mom
" Can I have a bite ?" Ann
L ily tried a bite and said , " E
. You can have a bite if you want ."
if she could have a bite . Her mom said �
if she could have a bite , and it was very
the arms . She dr ags them to the office .
him to turn the r ags she kept in the closet
mom bought some new r ags . When they got back

INTERVAL 2.789 - 5.582
CONTAINS 0.257%

day , the mighty k ang aroo and his friends went
He felt so happy , faster and faster he went until
The girl was very curious about the wreck , but she
� � That â € ™ s the gas from the
was too expensive and not meant for him .

INTERVAL -0.003 - 2.789
CONTAINS 50.353%

great idea and they both marched around in the rain ,
, 5 rocks . He was very proud of what he
do not stop . They do not listen to him .
to camp with his friends . He could hardly guess what
" But I want to know . Maybe it 's a

INTERVAL -2.796 - -0.003
CONTAINS 49.074%

first before you use my things , okay ? And now
upcoming festival . Everyone was excited but Billy was a bit
her mom my 's hand . She was curious to know
for helping their little one . From that day on ,
while exploring , Jack found a little bird who was built

INTERVAL -5.589 - -2.796
CONTAINS 0.316%

" I saw a big rad ish in the garden today
old , y ucky cr umbs . But the mouse did
was so happy and w agged his tail . He and
mom to get her some clay . Mom said yes and
, the whale saw a little fish swimming nearby . The
Token and
Token ID290
Feature activation+6.71e-02

Pos logit contributions
Paul+0.168
 York+0.156
abella+0.152
Grand+0.150
Mad+0.145
Neg logit contributions
owers-0.156
 beans-0.147
 lettuce-0.145
 Freddy-0.143
sed-0.143
Token up
Token ID510
Feature activation+2.04e+00

Pos logit contributions
Paul+0.003
 tow+0.003
 York+0.003
ind+0.003
Mad+0.003
Neg logit contributions
 Freddy-0.002
 shovel-0.002
sed-0.002
 beans-0.002
 lettuce-0.002
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
Paul+0.087
 York+0.079
Grand+0.078
Mad+0.076
Offic+0.076
Neg logit contributions
 beans-0.074
owers-0.072
 Freddy-0.072
 lettuce-0.067
 grapes-0.066
Token grand
Token ID4490
Feature activation+4.72e+00

Pos logit contributions
Paul+0.047
abella+0.044
Grand+0.042
 York+0.041
Daniel+0.040
Neg logit contributions
owers-0.059
 lettuce-0.059
 beans-0.058
 mushroom-0.055
ados-0.055
Token stair
Token ID18761
Feature activation+3.32e+00

Pos logit contributions
Paul+0.167
abella+0.153
Grand+0.150
 much+0.148
 security+0.143
Neg logit contributions
owers-0.203
ados-0.196
 beans-0.196
 lettuce-0.196
 Freddy-0.185
Tokenways
Token ID1322
Feature activation+5.50e+00

Pos logit contributions
Paul+0.127
Grand+0.117
 York+0.115
Daniel+0.113
Mad+0.112
Neg logit contributions
owers-0.134
 beans-0.125
 lettuce-0.122
 Freddy-0.120
sed-0.119
Token.
Token ID13
Feature activation-3.70e-01

Pos logit contributions
Paul+0.223
 York+0.202
Grand+0.197
ind+0.194
Daniel+0.192
Neg logit contributions
owers-0.211
 beans-0.194
 cheeks-0.186
 lettuce-0.186
cr-0.185
Token Everywhere
Token ID48201
Feature activation-2.10e-01

Pos logit contributions
owers+0.014
 lettuce+0.014
 beans+0.014
ados+0.014
 Freddy+0.013
Neg logit contributions
Paul-0.014
 York-0.013
 security-0.013
abella-0.013
Grand-0.013
Token she
Token ID673
Feature activation+2.20e+00

Pos logit contributions
owers+0.007
 Freddy+0.007
 beans+0.007
ados+0.007
 grapes+0.007
Neg logit contributions
Paul-0.009
 York-0.009
Grand-0.008
abella-0.008
ris-0.008
Token looked
Token ID3114
Feature activation+2.62e-01

Pos logit contributions
Paul+0.100
 much+0.090
 tow+0.090
Grand+0.089
 York+0.088
Neg logit contributions
 beans-0.069
owers-0.068
 lettuce-0.067
 mushroom-0.063
 Freddy-0.061
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
Paul+0.011
ris+0.010
Grand+0.010
 tow+0.010
Offic+0.010
Neg logit contributions
owers-0.010
 beans-0.010
 Freddy-0.009
 squash-0.009
 lettuce-0.009
Token with
Token ID351
Feature activation-1.16e+00

Pos logit contributions
Paul+0.006
abella+0.006
Daniel+0.005
Grand+0.005
 York+0.005
Neg logit contributions
owers-0.005
 Freddy-0.005
 beans-0.005
 lettuce-0.005
 mushroom-0.005
Token glasses
Token ID15232
Feature activation+4.97e-01

Pos logit contributions
 lettuce+0.052
 beans+0.050
 grapes+0.049
 Freddy+0.048
owers+0.048
Neg logit contributions
Paul-0.040
abella-0.038
 York-0.038
Jo-0.035
ris-0.035
Token and
Token ID290
Feature activation-5.40e-01

Pos logit contributions
Paul+0.017
abella+0.015
 York+0.015
Grand+0.015
 security+0.014
Neg logit contributions
 lettuce-0.022
owers-0.022
 beans-0.022
 Freddy-0.021
ados-0.021
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
 lettuce+0.026
owers+0.025
 beans+0.025
 Freddy+0.024
ados+0.024
Neg logit contributions
Paul-0.016
 York-0.014
abella-0.014
 security-0.014
Grand-0.014
Token pony
Token ID26902
Feature activation+1.57e+00

Pos logit contributions
Paul+0.041
abella+0.038
 York+0.038
 security+0.037
Grand+0.036
Neg logit contributions
 lettuce-0.045
owers-0.044
 beans-0.043
 Freddy-0.042
ados-0.042
Tokentail
Token ID13199
Feature activation+5.39e+00

Pos logit contributions
Paul+0.048
abella+0.043
Grand+0.042
 York+0.042
 security+0.042
Neg logit contributions
owers-0.074
 lettuce-0.074
 beans-0.072
 Freddy-0.071
ados-0.071
Token.
Token ID13
Feature activation+1.79e-03

Pos logit contributions
Paul+0.235
abella+0.210
 York+0.205
Grand+0.204
Jo+0.203
Neg logit contributions
sed-0.191
owers-0.185
 tongue-0.169
ses-0.168
cr-0.166
Token She
Token ID1375
Feature activation+5.52e-01

Pos logit contributions
Paul+0.000
 York+0.000
Grand+0.000
 security+0.000
abella+0.000
Neg logit contributions
owers-0.000
 lettuce-0.000
 beans-0.000
ados-0.000
 Freddy-0.000
Token is
Token ID318
Feature activation+1.20e+00

Pos logit contributions
Paul+0.023
Grand+0.020
abella+0.019
Mad+0.019
Jo+0.019
Neg logit contributions
 lettuce-0.021
 beans-0.021
owers-0.020
sed-0.020
 mushroom-0.020
Token reading
Token ID3555
Feature activation+1.59e+00

Pos logit contributions
Paul+0.047
 York+0.047
ris+0.044
Jo+0.043
 airport+0.042
Neg logit contributions
 Freddy-0.044
 lettuce-0.044
 beans-0.042
sed-0.041
 grapes-0.040
Token a
Token ID257
Feature activation+3.97e-01

Pos logit contributions
Paul+0.063
 York+0.061
ris+0.060
abella+0.059
Mad+0.058
Neg logit contributions
 beans-0.062
 grapes-0.060
 lettuce-0.060
 Freddy-0.058
 mushroom-0.056
Token and
Token ID290
Feature activation+2.09e-01

Pos logit contributions
Paul+0.075
abella+0.069
 York+0.068
 security+0.066
Grand+0.065
Neg logit contributions
owers-0.089
 lettuce-0.086
 beans-0.084
cr-0.084
 Freddy-0.083
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
Paul+0.007
abella+0.007
 York+0.007
Jo+0.006
 security+0.006
Neg logit contributions
 lettuce-0.009
 beans-0.009
 shovel-0.008
 Freddy-0.008
 grapes-0.008
Token tires
Token ID17920
Feature activation+1.95e+00

Pos logit contributions
Paul+0.035
abella+0.033
 York+0.032
Grand+0.032
 security+0.031
Neg logit contributions
 lettuce-0.046
owers-0.046
 beans-0.045
ados-0.044
 Freddy-0.043
Token and
Token ID290
Feature activation+8.61e-01

Pos logit contributions
Paul+0.068
 York+0.063
abella+0.062
Grand+0.062
 security+0.059
Neg logit contributions
owers-0.083
 lettuce-0.081
 beans-0.079
 Freddy-0.078
cr-0.078
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
Paul+0.028
 York+0.025
abella+0.025
Grand+0.024
 security+0.024
Neg logit contributions
 lettuce-0.040
 beans-0.038
owers-0.037
 Freddy-0.037
ados-0.037
Token mirrors
Token ID22353
Feature activation+5.26e+00

Pos logit contributions
Paul+0.048
 York+0.044
 security+0.043
abella+0.043
Grand+0.043
Neg logit contributions
 lettuce-0.062
owers-0.062
 beans-0.060
 Freddy-0.059
ados-0.059
Token.
Token ID13
Feature activation+5.98e-02

Pos logit contributions
Paul+0.203
Grand+0.186
abella+0.186
 York+0.181
Mad+0.175
Neg logit contributions
owers-0.213
 beans-0.196
 lettuce-0.195
 shovel-0.195
cr-0.194
Token They
Token ID1119
Feature activation-2.34e-01

Pos logit contributions
Paul+0.002
 security+0.002
 York+0.002
Grand+0.002
abella+0.002
Neg logit contributions
 lettuce-0.002
owers-0.002
 beans-0.002
 Freddy-0.002
ados-0.002
Token saw
Token ID2497
Feature activation+3.87e-01

Pos logit contributions
 lettuce+0.008
owers+0.008
 beans+0.008
 Freddy+0.008
cr+0.007
Neg logit contributions
Paul-0.010
 York-0.009
 security-0.009
abella-0.009
Grand-0.009
Token their
Token ID511
Feature activation-3.00e-01

Pos logit contributions
Paul+0.014
ris+0.014
Grand+0.013
abella+0.013
 York+0.013
Neg logit contributions
 lettuce-0.017
 beans-0.016
 grapes-0.016
 squash-0.015
 Freddy-0.015
Token faces
Token ID6698
Feature activation-2.13e+00

Pos logit contributions
 lettuce+0.012
owers+0.012
 beans+0.012
 Freddy+0.012
ados+0.012
Neg logit contributions
Paul-0.011
 security-0.010
 York-0.010
Grand-0.010
abella-0.010
Token adventure
Token ID8855
Feature activation+2.40e-01

Pos logit contributions
Paul+0.067
 York+0.062
abella+0.061
 security+0.061
Grand+0.061
Neg logit contributions
 lettuce-0.072
owers-0.070
 beans-0.069
 Freddy-0.069
ados-0.067
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
Paul+0.009
 York+0.009
Grand+0.009
abella+0.008
bell+0.008
Neg logit contributions
 Freddy-0.010
 lettuce-0.009
owers-0.009
 beans-0.009
cr-0.009
Token<|endoftext|>
Token ID50256
Feature activation-1.76e+00

Pos logit contributions
 lettuce+0.022
owers+0.022
 Freddy+0.022
 beans+0.021
ados+0.021
Neg logit contributions
Paul-0.011
 York-0.010
 security-0.010
abella-0.010
Grand-0.010
TokenJohn
Token ID7554
Feature activation+1.47e+00

Pos logit contributions
owers+0.075
 lettuce+0.073
 Freddy+0.072
 beans+0.071
ados+0.069
Neg logit contributions
Paul-0.061
 security-0.057
 York-0.057
Grand-0.056
abella-0.056
Token and
Token ID290
Feature activation+2.66e-01

Pos logit contributions
Paul+0.054
 security+0.051
 York+0.049
abella+0.049
Grand+0.048
Neg logit contributions
 lettuce-0.060
 Freddy-0.058
owers-0.058
 beans-0.057
 mushroom-0.054
Token Tara
Token ID37723
Feature activation+5.22e+00

Pos logit contributions
Paul+0.009
 York+0.008
abella+0.008
 security+0.008
Grand+0.008
Neg logit contributions
 lettuce-0.012
 Freddy-0.012
owers-0.012
 beans-0.011
ados-0.011
Token were
Token ID547
Feature activation+3.02e-01

Pos logit contributions
Paul+0.240
 security+0.227
Grand+0.227
abella+0.225
 York+0.223
Neg logit contributions
owers-0.164
 Freddy-0.154
 lettuce-0.152
 beans-0.151
 squash-0.140
Token playing
Token ID2712
Feature activation+5.05e-02

Pos logit contributions
Paul+0.012
 York+0.012
 security+0.012
Grand+0.011
bell+0.011
Neg logit contributions
 Freddy-0.011
 lettuce-0.010
owers-0.010
 beans-0.010
ados-0.010
Token in
Token ID287
Feature activation-7.29e-01

Pos logit contributions
Paul+0.001
 much+0.001
 York+0.001
 security+0.001
 airport+0.001
Neg logit contributions
owers-0.003
 lettuce-0.003
ados-0.002
sed-0.002
mons-0.002
Token the
Token ID262
Feature activation+6.42e-01

Pos logit contributions
 Freddy+0.028
 beans+0.028
 lettuce+0.028
 squash+0.027
 shovel+0.027
Neg logit contributions
abella-0.028
bell-0.026
Paul-0.025
ris-0.025
Grand-0.024
Token park
Token ID3952
Feature activation+6.25e-01

Pos logit contributions
Paul+0.026
abella+0.026
bell+0.024
 security+0.024
 York+0.024
Neg logit contributions
 lettuce-0.024
 beans-0.024
 squash-0.023
 Freddy-0.022
 grapes-0.022
Token seek
Token ID5380
Feature activation+1.99e+00

Pos logit contributions
Paul+0.009
Grand+0.009
 much+0.009
 security+0.009
 tow+0.008
Neg logit contributions
owers-0.014
ados-0.013
sed-0.012
mons-0.012
 lettuce-0.012
Token.
Token ID13
Feature activation+6.44e-01

Pos logit contributions
Paul+0.073
abella+0.066
 York+0.065
Grand+0.065
 tow+0.064
Neg logit contributions
 beans-0.085
 lettuce-0.083
owers-0.082
 Freddy-0.079
ados-0.078
Token Tom
Token ID4186
Feature activation+8.35e-01

Pos logit contributions
 security+0.026
Paul+0.026
ind+0.025
 York+0.024
Jo+0.024
Neg logit contributions
 Freddy-0.024
 lettuce-0.023
 grapes-0.022
 beans-0.022
owers-0.022
Token was
Token ID373
Feature activation+1.06e+00

Pos logit contributions
Paul+0.033
 York+0.031
Mad+0.031
Grand+0.029
 security+0.029
Neg logit contributions
 lettuce-0.033
 Freddy-0.031
 beans-0.031
 mushroom-0.030
sed-0.029
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
 York+0.043
bell+0.042
Paul+0.041
ris+0.039
 airport+0.039
Neg logit contributions
 Freddy-0.039
 lettuce-0.037
 grapes-0.035
 beans-0.035
sed-0.034
Token seeker
Token ID45993
Feature activation+5.18e+00

Pos logit contributions
Paul+0.042
 security+0.041
Grand+0.039
 airport+0.039
 much+0.038
Neg logit contributions
owers-0.056
ados-0.056
 lettuce-0.055
 Freddy-0.053
 beans-0.053
Token and
Token ID290
Feature activation+2.98e-02

Pos logit contributions
Paul+0.159
 York+0.144
 security+0.140
Grand+0.137
abella+0.134
Neg logit contributions
 lettuce-0.240
owers-0.240
 beans-0.239
 Freddy-0.236
ados-0.232
Token Lily
Token ID20037
Feature activation+7.72e-01

Pos logit contributions
 York+0.001
Paul+0.001
 airport+0.001
bell+0.001
 security+0.001
Neg logit contributions
 Freddy-0.001
 lettuce-0.001
 beans-0.001
 grapes-0.001
mons-0.001
Token was
Token ID373
Feature activation+1.59e+00

Pos logit contributions
Paul+0.027
 York+0.025
Grand+0.024
 security+0.023
bell+0.023
Neg logit contributions
 lettuce-0.033
 beans-0.033
 Freddy-0.032
owers-0.032
 mushroom-0.031
Token the
Token ID262
Feature activation+9.17e-01

Pos logit contributions
bell+0.067
 York+0.065
ris+0.061
Paul+0.061
 airport+0.061
Neg logit contributions
 Freddy-0.056
 grapes-0.050
mons-0.048
 beans-0.048
cr-0.048
Token h
Token ID289
Feature activation+2.81e+00

Pos logit contributions
Paul+0.032
 York+0.030
abella+0.028
Grand+0.028
bell+0.028
Neg logit contributions
 lettuce-0.038
 beans-0.038
owers-0.037
 Freddy-0.037
 mushroom-0.036
Token.
Token ID13
Feature activation+1.83e-01

Pos logit contributions
 lettuce+0.068
owers+0.068
 beans+0.066
 Freddy+0.065
ados+0.065
Neg logit contributions
Paul-0.037
 York-0.033
 security-0.033
Grand-0.033
abella-0.032
Token
Token ID198
Feature activation+5.08e-01

Pos logit contributions
Paul+0.005
 York+0.005
 security+0.005
abella+0.004
Grand+0.004
Neg logit contributions
 lettuce-0.009
owers-0.009
 beans-0.009
 Freddy-0.009
ados-0.009
Token
Token ID198
Feature activation-9.02e-02

Pos logit contributions
 airport+0.018
 York+0.018
 security+0.017
Paul+0.017
bell+0.017
Neg logit contributions
 Freddy-0.019
 lettuce-0.019
 beans-0.018
owers-0.017
sed-0.017
TokenThe
Token ID464
Feature activation-2.64e-01

Pos logit contributions
 lettuce+0.003
 Freddy+0.003
mons+0.003
 beans+0.003
 squash+0.003
Neg logit contributions
 airport-0.003
bell-0.003
 York-0.003
 security-0.003
Paul-0.003
Token bike
Token ID7161
Feature activation+1.62e+00

Pos logit contributions
 lettuce+0.012
owers+0.011
 beans+0.011
 Freddy+0.011
ados+0.011
Neg logit contributions
Paul-0.009
 York-0.008
abella-0.008
Grand-0.008
 security-0.008
Token rider
Token ID17876
Feature activation+5.15e+00

Pos logit contributions
Paul+0.060
 York+0.054
abella+0.053
Daniel+0.052
Grand+0.052
Neg logit contributions
 lettuce-0.065
sed-0.063
owers-0.063
 beans-0.062
 Freddy-0.061
Token stopped
Token ID5025
Feature activation-3.51e-01

Pos logit contributions
Paul+0.201
 York+0.181
Grand+0.180
abella+0.176
Daniel+0.172
Neg logit contributions
 lettuce-0.193
sed-0.191
owers-0.191
 beans-0.187
 Freddy-0.186
Token.
Token ID13
Feature activation-1.75e-02

Pos logit contributions
 lettuce+0.017
owers+0.017
 beans+0.016
 Freddy+0.016
ados+0.016
Neg logit contributions
Paul-0.011
 York-0.010
 security-0.009
Grand-0.009
abella-0.009
Token He
Token ID679
Feature activation+1.33e-01

Pos logit contributions
 lettuce+0.001
 Freddy+0.001
 beans+0.001
ados+0.001
sed+0.001
Neg logit contributions
Paul-0.001
ind-0.001
Jo-0.001
 York-0.001
abella-0.001
Token was
Token ID373
Feature activation+9.01e-01

Pos logit contributions
Paul+0.006
 York+0.005
 elev+0.005
Mad+0.005
abella+0.005
Neg logit contributions
sed-0.005
 lettuce-0.005
 beans-0.004
 spat-0.004
 Freddy-0.004
Token angry
Token ID7954
Feature activation+1.35e-02

Pos logit contributions
Paul+0.038
 York+0.038
Jo+0.036
ris+0.035
abella+0.035
Neg logit contributions
 lettuce-0.030
sed-0.030
 Freddy-0.030
 tongue-0.027
 beans-0.027
Token garden
Token ID11376
Feature activation-1.31e+00

Pos logit contributions
owers+0.008
 lettuce+0.007
mons+0.007
ados+0.007
cr+0.007
Neg logit contributions
Paul-0.005
 much-0.005
abella-0.005
 security-0.004
Grand-0.004
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
 lettuce+0.067
owers+0.066
ados+0.065
 Freddy+0.064
 beans+0.064
Neg logit contributions
Paul-0.030
 much-0.028
 York-0.027
 airport-0.027
 security-0.026
Token Lily
Token ID20037
Feature activation+2.89e-01

Pos logit contributions
 lettuce+0.009
 Freddy+0.009
owers+0.009
 beans+0.009
ados+0.008
Neg logit contributions
Paul-0.007
 security-0.007
 York-0.007
abella-0.007
Grand-0.007
Token is
Token ID318
Feature activation+1.21e+00

Pos logit contributions
Paul+0.010
 security+0.009
 York+0.009
Grand+0.009
 tow+0.008
Neg logit contributions
 lettuce-0.013
owers-0.013
 beans-0.013
 Freddy-0.012
 mushroom-0.012
Token the
Token ID262
Feature activation+3.62e-01

Pos logit contributions
Paul+0.040
 York+0.038
 security+0.035
Grand+0.035
abella+0.034
Neg logit contributions
 lettuce-0.054
 Freddy-0.053
 beans-0.053
owers-0.052
sed-0.051
Token seeker
Token ID45993
Feature activation+5.14e+00

Pos logit contributions
Paul+0.012
 security+0.012
 airport+0.012
Grand+0.012
 much+0.011
Neg logit contributions
owers-0.015
 lettuce-0.015
ados-0.015
 Freddy-0.014
 beans-0.014
Token and
Token ID290
Feature activation-1.76e-01

Pos logit contributions
Paul+0.143
 York+0.127
 security+0.126
Grand+0.125
abella+0.124
Neg logit contributions
 lettuce-0.252
owers-0.252
 beans-0.246
 Freddy-0.243
ados-0.242
Token Ben
Token ID3932
Feature activation+7.05e-01

Pos logit contributions
 lettuce+0.008
owers+0.008
 beans+0.007
 Freddy+0.007
ados+0.007
Neg logit contributions
Paul-0.006
 York-0.005
 security-0.005
Grand-0.005
abella-0.005
Token is
Token ID318
Feature activation+2.03e+00

Pos logit contributions
Paul+0.023
 security+0.021
 York+0.020
abella+0.020
Grand+0.020
Neg logit contributions
 lettuce-0.032
owers-0.031
 beans-0.031
 Freddy-0.030
ados-0.030
Token the
Token ID262
Feature activation+5.59e-01

Pos logit contributions
Paul+0.080
 York+0.077
Daniel+0.074
Grand+0.073
Mad+0.071
Neg logit contributions
 Freddy-0.077
 mushroom-0.072
 beans-0.071
sed-0.070
 lettuce-0.070
Token h
Token ID289
Feature activation+2.88e+00

Pos logit contributions
Paul+0.019
 security+0.019
 airport+0.018
 much+0.018
Grand+0.017
Neg logit contributions
owers-0.024
ados-0.023
 lettuce-0.023
 Freddy-0.023
 beans-0.022
Token that
Token ID326
Feature activation+1.14e+00

Pos logit contributions
Paul+0.074
 York+0.069
Jo+0.069
Grand+0.069
bell+0.068
Neg logit contributions
 lettuce-0.078
 Freddy-0.076
 beans-0.076
ados-0.072
owers-0.072
Token in
Token ID287
Feature activation-1.55e+00

Pos logit contributions
Paul+0.035
 security+0.032
 much+0.032
 York+0.032
Grand+0.032
Neg logit contributions
owers-0.052
 lettuce-0.051
 beans-0.050
ados-0.049
 Freddy-0.049
Token your
Token ID534
Feature activation+8.21e-01

Pos logit contributions
 lettuce+0.067
 beans+0.066
ados+0.062
 Freddy+0.062
 squash+0.062
Neg logit contributions
Paul-0.054
abella-0.052
bell-0.050
 York-0.049
Grand-0.049
Token mouth
Token ID5422
Feature activation-1.94e+00

Pos logit contributions
Paul+0.032
abella+0.029
Grand+0.029
 York+0.029
 security+0.028
Neg logit contributions
 lettuce-0.032
owers-0.031
 beans-0.031
 Freddy-0.030
ados-0.030
Token?
Token ID30
Feature activation+5.56e-01

Pos logit contributions
 lettuce+0.088
owers+0.088
 beans+0.086
 Freddy+0.084
ados+0.083
Neg logit contributions
Paul-0.062
 York-0.056
 security-0.055
abella-0.055
Grand-0.054
Token Is
Token ID1148
Feature activation+5.12e+00

Pos logit contributions
Paul+0.023
 security+0.022
 airport+0.021
abella+0.021
 York+0.021
Neg logit contributions
 lettuce-0.020
owers-0.020
 Freddy-0.019
 beans-0.019
ados-0.019
Token that
Token ID326
Feature activation+3.48e-01

Pos logit contributions
Paul+0.162
 York+0.149
Grand+0.144
 tow+0.141
Jo+0.137
Neg logit contributions
 lettuce-0.244
 beans-0.240
 Freddy-0.235
 grapes-0.230
owers-0.230
Token a
Token ID257
Feature activation-2.01e-01

Pos logit contributions
Paul+0.011
 York+0.010
 security+0.010
abella+0.010
Grand+0.010
Neg logit contributions
 lettuce-0.016
owers-0.016
 beans-0.015
 Freddy-0.015
ados-0.015
Token bone
Token ID9970
Feature activation-1.27e+00

Pos logit contributions
 lettuce+0.008
 beans+0.008
owers+0.008
 Freddy+0.007
 mushroom+0.007
Neg logit contributions
Paul-0.008
 York-0.007
abella-0.007
Grand-0.007
 security-0.007
Token?"
Token ID1701
Feature activation+5.08e-01

Pos logit contributions
 lettuce+0.058
owers+0.058
 beans+0.056
 Freddy+0.056
ados+0.055
Neg logit contributions
Paul-0.039
 York-0.035
 security-0.035
Grand-0.035
abella-0.035
Token
Token ID198
Feature activation+5.39e-01

Pos logit contributions
Paul+0.015
 York+0.014
 security+0.014
abella+0.013
 airport+0.013
Neg logit contributions
 Freddy-0.024
 lettuce-0.023
owers-0.023
 beans-0.023
ados-0.022
Token Tim
Token ID5045
Feature activation-8.24e-01

Pos logit contributions
 lettuce+0.001
 Freddy+0.001
 beans+0.001
owers+0.001
ados+0.001
Neg logit contributions
Paul-0.001
 York-0.001
Grand-0.001
bell-0.001
 security-0.001
Token became
Token ID2627
Feature activation-1.22e-01

Pos logit contributions
 lettuce+0.030
 Freddy+0.029
owers+0.029
 beans+0.029
ados+0.027
Neg logit contributions
Paul-0.033
 York-0.032
 security-0.031
Grand-0.031
bell-0.030
Token the
Token ID262
Feature activation+7.68e-01

Pos logit contributions
 lettuce+0.005
 Freddy+0.005
 beans+0.005
 squash+0.005
 grapes+0.005
Neg logit contributions
Paul-0.004
bell-0.004
 York-0.004
Grand-0.004
abella-0.004
Token best
Token ID1266
Feature activation+1.48e+00

Pos logit contributions
Paul+0.025
 security+0.024
 much+0.023
Grand+0.023
 airport+0.023
Neg logit contributions
owers-0.034
 lettuce-0.033
ados-0.033
 Freddy-0.032
 beans-0.032
Token bike
Token ID7161
Feature activation+1.72e+00

Pos logit contributions
Paul+0.046
 York+0.042
Grand+0.041
abella+0.041
 security+0.039
Neg logit contributions
 lettuce-0.070
 beans-0.069
owers-0.069
 Freddy-0.068
ados-0.066
Token rider
Token ID17876
Feature activation+5.08e+00

Pos logit contributions
Paul+0.068
 York+0.067
Grand+0.065
abella+0.065
Daniel+0.063
Neg logit contributions
owers-0.062
 Freddy-0.061
 lettuce-0.060
 beans-0.059
ados-0.058
Token in
Token ID287
Feature activation-9.00e-01

Pos logit contributions
Paul+0.181
 York+0.172
Grand+0.169
abella+0.164
 security+0.163
Neg logit contributions
owers-0.207
 Freddy-0.204
 lettuce-0.204
 beans-0.200
ados-0.198
Token the
Token ID262
Feature activation+1.75e-01

Pos logit contributions
 Freddy+0.038
 beans+0.036
 lettuce+0.036
 squash+0.034
 mushroom+0.034
Neg logit contributions
abella-0.030
bell-0.030
 York-0.030
Paul-0.030
Grand-0.029
Token whole
Token ID2187
Feature activation-4.48e-01

Pos logit contributions
abella+0.007
Paul+0.007
Jo+0.006
bell+0.006
Grand+0.006
Neg logit contributions
 lettuce-0.007
 mushroom-0.006
 beans-0.006
 squash-0.006
 grapes-0.006
Token town
Token ID3240
Feature activation+5.57e-01

Pos logit contributions
 squash+0.013
 beans+0.012
 mushroom+0.012
 lettuce+0.012
 Freddy+0.012
Neg logit contributions
abella-0.022
bell-0.018
Paul-0.018
year-0.018
ves-0.018
Token.
Token ID13
Feature activation-2.68e-01

Pos logit contributions
Paul+0.022
abella+0.021
 York+0.021
Grand+0.020
V+0.020
Neg logit contributions
owers-0.020
 Freddy-0.019
 beans-0.019
cr-0.019
 lettuce-0.019
Token the
Token ID262
Feature activation+8.75e-01

Pos logit contributions
Paul+0.008
 York+0.008
Grand+0.007
abella+0.007
Daniel+0.007
Neg logit contributions
 lettuce-0.013
 Freddy-0.013
 beans-0.013
owers-0.012
 squash-0.012
Token people
Token ID661
Feature activation+2.45e+00

Pos logit contributions
Paul+0.033
 York+0.031
 security+0.031
Grand+0.030
abella+0.030
Neg logit contributions
owers-0.034
 lettuce-0.034
 beans-0.033
 Freddy-0.033
ados-0.032
Token cl
Token ID537
Feature activation+4.53e-02

Pos logit contributions
Paul+0.090
 York+0.085
abella+0.083
Grand+0.083
 security+0.082
Neg logit contributions
owers-0.096
 lettuce-0.096
 beans-0.092
 Freddy-0.092
ados-0.089
Tokenapped
Token ID6320
Feature activation-8.63e-01

Pos logit contributions
Paul+0.002
 security+0.002
Grand+0.002
abella+0.002
 York+0.002
Neg logit contributions
owers-0.001
 lettuce-0.001
 Freddy-0.001
ados-0.001
 beans-0.001
Token in
Token ID287
Feature activation-4.51e-01

Pos logit contributions
 lettuce+0.035
 Freddy+0.035
 beans+0.035
owers+0.033
cr+0.033
Neg logit contributions
Paul-0.031
 York-0.029
abella-0.029
Grand-0.028
 security-0.028
Token amaz
Token ID40642
Feature activation+5.06e+00

Pos logit contributions
 lettuce+0.017
 beans+0.017
 Freddy+0.017
owers+0.017
ados+0.016
Neg logit contributions
Paul-0.017
abella-0.016
 York-0.016
Grand-0.016
 security-0.016
Tokenement
Token ID972
Feature activation+2.47e+00

Pos logit contributions
Paul+0.181
Grand+0.176
abella+0.171
 York+0.165
 Africa+0.164
Neg logit contributions
owers-0.204
ados-0.202
 beans-0.190
 lettuce-0.187
 Freddy-0.183
Token.
Token ID13
Feature activation-2.55e-01

Pos logit contributions
Paul+0.075
 York+0.068
 security+0.067
abella+0.067
Grand+0.067
Neg logit contributions
owers-0.115
 lettuce-0.115
 beans-0.112
 Freddy-0.111
ados-0.110
Token The
Token ID383
Feature activation-4.93e-01

Pos logit contributions
 lettuce+0.012
owers+0.012
 beans+0.012
 Freddy+0.012
ados+0.011
Neg logit contributions
Paul-0.008
 York-0.007
 security-0.007
abella-0.007
Grand-0.007
Token singing
Token ID13777
Feature activation-3.67e-01

Pos logit contributions
 lettuce+0.019
owers+0.019
 beans+0.018
 Freddy+0.018
ados+0.018
Neg logit contributions
Paul-0.019
abella-0.018
 York-0.018
Grand-0.018
 security-0.018
Token rock
Token ID3881
Feature activation+7.78e-01

Pos logit contributions
 lettuce+0.017
owers+0.017
 beans+0.016
 Freddy+0.016
ados+0.016
Neg logit contributions
Paul-0.011
 York-0.010
 security-0.010
abella-0.010
Grand-0.010
Token jumped
Token ID11687
Feature activation-7.27e-01

Pos logit contributions
Paul+0.001
 York+0.001
 security+0.001
Grand+0.001
 much+0.001
Neg logit contributions
 lettuce-0.001
owers-0.001
 Freddy-0.001
 beans-0.001
cr-0.001
Token up
Token ID510
Feature activation+1.96e+00

Pos logit contributions
 lettuce+0.028
 Freddy+0.028
 beans+0.028
owers+0.027
cr+0.026
Neg logit contributions
Paul-0.028
 York-0.027
Grand-0.026
 security-0.025
abella-0.025
Token and
Token ID290
Feature activation+7.03e-01

Pos logit contributions
Paul+0.063
 York+0.058
abella+0.056
Grand+0.056
 security+0.055
Neg logit contributions
 lettuce-0.089
 Freddy-0.088
 beans-0.088
owers-0.087
ados-0.084
Token grinned
Token ID36268
Feature activation-7.42e-01

Pos logit contributions
Paul+0.027
 York+0.025
 security+0.025
abella+0.025
Grand+0.025
Neg logit contributions
owers-0.027
 lettuce-0.027
 beans-0.026
ados-0.025
 Freddy-0.025
Token in
Token ID287
Feature activation-3.05e-01

Pos logit contributions
 Freddy+0.031
 lettuce+0.030
 beans+0.030
owers+0.030
cr+0.029
Neg logit contributions
Paul-0.027
abella-0.025
 York-0.025
Grand-0.024
 airport-0.024
Token amaz
Token ID40642
Feature activation+5.06e+00

Pos logit contributions
 lettuce+0.012
owers+0.012
 beans+0.012
 Freddy+0.012
ados+0.011
Neg logit contributions
Paul-0.011
abella-0.011
 York-0.010
 security-0.010
Grand-0.010
Tokenement
Token ID972
Feature activation+2.64e+00

Pos logit contributions
Paul+0.181
Grand+0.178
 York+0.168
abella+0.165
Offic+0.165
Neg logit contributions
ados-0.201
owers-0.198
 beans-0.192
 Freddy-0.191
 lettuce-0.189
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
Paul+0.080
 York+0.073
Grand+0.071
abella+0.071
 security+0.070
Neg logit contributions
owers-0.123
 lettuce-0.123
 beans-0.121
 Freddy-0.120
ados-0.119
Token
Token ID220
Feature activation-1.82e-01

Pos logit contributions
 lettuce+0.010
owers+0.009
 Freddy+0.009
 beans+0.009
ados+0.009
Neg logit contributions
Paul-0.008
 York-0.007
 security-0.007
Grand-0.007
abella-0.007
Token
Token ID198
Feature activation+6.10e-01

Pos logit contributions
owers+0.009
 lettuce+0.009
 Freddy+0.009
 beans+0.008
ados+0.008
Neg logit contributions
Paul-0.005
 security-0.005
 York-0.005
Grand-0.004
bell-0.004
Token
Token ID198
Feature activation-1.54e-02

Pos logit contributions
 airport+0.023
 York+0.022
 security+0.022
bell+0.021
 elev+0.021
Neg logit contributions
 Freddy-0.024
 lettuce-0.023
cr-0.022
owers-0.021
 mushroom-0.021
Token."
Token ID526
Feature activation-2.91e-02

Pos logit contributions
 lettuce+0.026
owers+0.026
 beans+0.026
 Freddy+0.025
ados+0.025
Neg logit contributions
Paul-0.018
 York-0.017
 security-0.016
abella-0.016
Grand-0.016
Token
Token ID198
Feature activation+7.33e-01

Pos logit contributions
 lettuce+0.002
owers+0.002
 beans+0.002
ados+0.002
 Freddy+0.002
Neg logit contributions
Paul-0.001
Grand-0.001
 York-0.001
 security-0.001
abella-0.001
Token
Token ID198
Feature activation+1.32e-01

Pos logit contributions
Paul+0.025
 York+0.025
 airport+0.025
 security+0.024
bell+0.024
Neg logit contributions
 Freddy-0.030
 lettuce-0.029
owers-0.028
 beans-0.028
 squash-0.027
TokenThe
Token ID464
Feature activation-3.29e-01

Pos logit contributions
bell+0.004
 airport+0.004
 York+0.004
 elev+0.004
abella+0.004
Neg logit contributions
 lettuce-0.005
 beans-0.005
 Freddy-0.005
 squash-0.005
mons-0.005
Token bike
Token ID7161
Feature activation+1.60e+00

Pos logit contributions
 beans+0.010
 grapes+0.010
 lettuce+0.010
sed+0.009
 shovel+0.009
Neg logit contributions
Paul-0.014
Jo-0.014
abella-0.013
bell-0.013
 York-0.013
Token rider
Token ID17876
Feature activation+5.06e+00

Pos logit contributions
Paul+0.063
 York+0.058
abella+0.056
Grand+0.055
Daniel+0.055
Neg logit contributions
 lettuce-0.060
sed-0.058
owers-0.058
 Freddy-0.058
 beans-0.057
Token calmed
Token ID49566
Feature activation+1.08e+00

Pos logit contributions
Paul+0.195
 York+0.178
Grand+0.175
 security+0.172
abella+0.171
Neg logit contributions
 lettuce-0.195
owers-0.191
 Freddy-0.191
 beans-0.190
sed-0.184
Token down
Token ID866
Feature activation+7.93e-01

Pos logit contributions
Paul+0.050
 security+0.047
 York+0.047
Grand+0.047
abella+0.047
Neg logit contributions
 lettuce-0.033
owers-0.033
 beans-0.032
 Freddy-0.031
ados-0.031
Token.
Token ID13
Feature activation+1.12e-01

Pos logit contributions
Paul+0.036
Jo+0.034
 York+0.033
abella+0.033
Mad+0.032
Neg logit contributions
 beans-0.027
 lettuce-0.026
 Freddy-0.025
 grapes-0.024
ados-0.024
Token He
Token ID679
Feature activation+2.56e-01

Pos logit contributions
Paul+0.004
 York+0.004
Jo+0.004
ind+0.004
 security+0.004
Neg logit contributions
 lettuce-0.004
 Freddy-0.004
 beans-0.004
ados-0.004
sed-0.004
Token saw
Token ID2497
Feature activation+4.01e-01

Pos logit contributions
Paul+0.011
 York+0.010
Grand+0.010
Jo+0.010
 much+0.010
Neg logit contributions
 lettuce-0.009
sed-0.008
 beans-0.008
 Freddy-0.008
 grapes-0.008
Token and
Token ID290
Feature activation+4.30e-01

Pos logit contributions
Paul+0.076
 York+0.069
abella+0.069
Grand+0.069
Mad+0.067
Neg logit contributions
owers-0.078
 beans-0.072
ados-0.072
 lettuce-0.071
 spine-0.071
Token buck
Token ID10230
Feature activation+3.76e+00

Pos logit contributions
Paul+0.017
 York+0.016
 tow+0.016
Grand+0.016
ind+0.016
Neg logit contributions
 lettuce-0.015
 beans-0.015
 Freddy-0.014
 shovel-0.014
sed-0.014
Tokenled
Token ID992
Feature activation-1.62e+00

Pos logit contributions
 tow+0.241
Grand+0.235
 Africa+0.232
 much+0.230
Paul+0.229
Neg logit contributions
owers-0.050
ped-0.046
ados-0.045
mons-0.033
 beans-0.032
Token her
Token ID607
Feature activation+9.25e-01

Pos logit contributions
 beans+0.054
 lettuce+0.054
 Freddy+0.053
ados+0.051
owers+0.050
Neg logit contributions
Paul-0.070
Grand-0.068
 York-0.065
Offic-0.065
ris-0.064
Token seat
Token ID5852
Feature activation+1.85e+00

Pos logit contributions
Paul+0.039
abella+0.037
Grand+0.036
 York+0.035
ris+0.034
Neg logit contributions
 lettuce-0.032
owers-0.031
 beans-0.031
sed-0.030
ados-0.030
Tokenbelt
Token ID37976
Feature activation+5.06e+00

Pos logit contributions
Paul+0.060
 York+0.056
Grand+0.056
abella+0.055
Offic+0.054
Neg logit contributions
owers-0.077
sed-0.074
 cheeks-0.074
mons-0.073
cr-0.073
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
Paul+0.216
 York+0.214
Grand+0.207
abella+0.205
Offic+0.203
Neg logit contributions
 cheek-0.146
 spine-0.145
owers-0.141
 forehead-0.139
 cheeks-0.139
Token The
Token ID383
Feature activation-4.61e-01

Pos logit contributions
 lettuce+0.009
owers+0.009
 Freddy+0.009
 beans+0.009
ados+0.008
Neg logit contributions
Paul-0.009
 security-0.008
 York-0.008
Grand-0.008
 airport-0.008
Token plane
Token ID6614
Feature activation+2.08e+00

Pos logit contributions
 beans+0.018
owers+0.017
 lettuce+0.017
 cheek+0.017
 grapes+0.017
Neg logit contributions
Paul-0.016
abella-0.015
ris-0.015
ook-0.014
Jo-0.014
Token took
Token ID1718
Feature activation-1.10e+00

Pos logit contributions
Paul+0.079
Grand+0.073
abella+0.072
 York+0.072
Daniel+0.071
Neg logit contributions
sed-0.074
owers-0.074
 lettuce-0.074
cr-0.072
 Freddy-0.071
Token off
Token ID572
Feature activation+9.16e-01

Pos logit contributions
 Freddy+0.032
ados+0.031
 beans+0.031
 lettuce+0.030
 grapes+0.029
Neg logit contributions
Paul-0.049
Grand-0.049
 York-0.048
Offic-0.048
Daniel-0.047
Token park
Token ID3952
Feature activation+1.16e+00

Pos logit contributions
Paul+0.039
abella+0.037
 York+0.035
Grand+0.035
Jo+0.034
Neg logit contributions
 lettuce-0.039
 beans-0.038
 mushroom-0.037
 Freddy-0.036
 grapes-0.036
Token.
Token ID13
Feature activation+7.90e-01

Pos logit contributions
Paul+0.037
 York+0.034
abella+0.033
Grand+0.033
 security+0.033
Neg logit contributions
owers-0.053
 lettuce-0.052
 beans-0.051
 Freddy-0.051
ados-0.050
Token Lily
Token ID20037
Feature activation+2.71e-01

Pos logit contributions
Paul+0.034
 security+0.032
 York+0.032
Jo+0.032
Grand+0.032
Neg logit contributions
 Freddy-0.028
 lettuce-0.028
 beans-0.026
 grapes-0.026
owers-0.025
Token was
Token ID373
Feature activation+8.13e-01

Pos logit contributions
Paul+0.011
 York+0.010
Grand+0.010
Mad+0.009
 tow+0.009
Neg logit contributions
 lettuce-0.010
 beans-0.010
 Freddy-0.010
sed-0.010
 mushroom-0.010
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
 York+0.033
bell+0.031
Paul+0.031
oc+0.030
ris+0.030
Neg logit contributions
 Freddy-0.029
 grapes-0.027
 beans-0.027
 lettuce-0.027
sed-0.026
Token seeker
Token ID45993
Feature activation+5.05e+00

Pos logit contributions
Paul+0.041
 security+0.038
 York+0.037
Grand+0.037
abella+0.037
Neg logit contributions
 lettuce-0.045
owers-0.045
 beans-0.044
ados-0.043
 Freddy-0.043
Token and
Token ID290
Feature activation-1.29e-01

Pos logit contributions
Paul+0.152
 York+0.137
 security+0.134
Grand+0.131
abella+0.129
Neg logit contributions
owers-0.239
 lettuce-0.238
 beans-0.236
 Freddy-0.233
ados-0.230
Token Tom
Token ID4186
Feature activation+1.14e+00

Pos logit contributions
 Freddy+0.005
 lettuce+0.005
owers+0.005
 beans+0.005
 grapes+0.004
Neg logit contributions
 York-0.005
Paul-0.005
bell-0.005
 airport-0.005
 security-0.005
Token was
Token ID373
Feature activation+1.54e+00

Pos logit contributions
Paul+0.038
 York+0.036
Grand+0.035
 airport+0.034
bell+0.034
Neg logit contributions
 Freddy-0.049
 lettuce-0.049
 beans-0.049
owers-0.047
 mushroom-0.047
Token the
Token ID262
Feature activation+8.81e-01

Pos logit contributions
bell+0.065
 York+0.063
ris+0.059
 airport+0.059
Paul+0.058
Neg logit contributions
 Freddy-0.054
 grapes-0.047
 beans-0.045
cr-0.045
ados-0.045
Token h
Token ID289
Feature activation+2.78e+00

Pos logit contributions
Paul+0.032
 York+0.030
bell+0.028
abella+0.028
Grand+0.028
Neg logit contributions
 lettuce-0.036
 beans-0.035
 Freddy-0.035
owers-0.034
 mushroom-0.033
Token park
Token ID3952
Feature activation+1.23e+00

Pos logit contributions
Paul+0.035
abella+0.033
 York+0.032
bell+0.032
Grand+0.032
Neg logit contributions
 lettuce-0.035
 beans-0.034
 mushroom-0.033
 Freddy-0.033
 grapes-0.032
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
Paul+0.040
 York+0.036
abella+0.035
Grand+0.035
 security+0.034
Neg logit contributions
owers-0.056
 lettuce-0.056
 beans-0.055
 Freddy-0.054
ados-0.053
Token Anna
Token ID11735
Feature activation+1.63e+00

Pos logit contributions
Paul+0.039
 York+0.036
 security+0.036
Grand+0.036
abella+0.036
Neg logit contributions
 lettuce-0.045
owers-0.045
 beans-0.044
 Freddy-0.044
ados-0.043
Token is
Token ID318
Feature activation+1.40e+00

Pos logit contributions
Paul+0.067
abella+0.058
 security+0.058
Grand+0.058
 tow+0.057
Neg logit contributions
 beans-0.060
owers-0.060
 lettuce-0.059
sed-0.059
 mushroom-0.058
Token the
Token ID262
Feature activation+7.82e-01

Pos logit contributions
Paul+0.052
 York+0.050
Jo+0.047
ris+0.046
bell+0.046
Neg logit contributions
 Freddy-0.054
sed-0.054
 beans-0.054
 lettuce-0.054
 grapes-0.052
Token seeker
Token ID45993
Feature activation+5.04e+00

Pos logit contributions
Paul+0.028
 security+0.027
Grand+0.026
abella+0.026
 much+0.026
Neg logit contributions
owers-0.032
 lettuce-0.032
ados-0.031
 beans-0.031
 Freddy-0.031
Token and
Token ID290
Feature activation-9.40e-02

Pos logit contributions
Paul+0.148
 York+0.132
 security+0.130
Grand+0.128
abella+0.126
Neg logit contributions
owers-0.242
 lettuce-0.241
 beans-0.238
 Freddy-0.234
ados-0.232
Token Ben
Token ID3932
Feature activation+7.38e-01

Pos logit contributions
 lettuce+0.004
owers+0.004
 beans+0.004
 Freddy+0.004
ados+0.003
Neg logit contributions
Paul-0.004
 York-0.003
 security-0.003
Grand-0.003
bell-0.003
Token is
Token ID318
Feature activation+2.07e+00

Pos logit contributions
Paul+0.022
 York+0.019
Grand+0.019
 security+0.019
abella+0.019
Neg logit contributions
 lettuce-0.035
owers-0.035
 beans-0.035
 Freddy-0.034
ados-0.033
Token the
Token ID262
Feature activation+7.06e-01

Pos logit contributions
Paul+0.081
bell+0.078
Daniel+0.077
 York+0.077
ris+0.075
Neg logit contributions
 Freddy-0.070
sed-0.069
mons-0.067
owers-0.067
 mushroom-0.066
Token h
Token ID289
Feature activation+2.52e+00

Pos logit contributions
Paul+0.023
 security+0.021
Grand+0.020
abella+0.020
 York+0.020
Neg logit contributions
owers-0.032
 lettuce-0.032
 beans-0.031
ados-0.031
 Freddy-0.031
Token nap
Token ID25422
Feature activation-2.00e+00

Pos logit contributions
 lettuce+0.152
owers+0.151
 beans+0.148
 Freddy+0.146
ados+0.144
Neg logit contributions
Paul-0.124
 York-0.113
 security-0.113
abella-0.112
Grand-0.112
Token.
Token ID13
Feature activation-5.49e-01

Pos logit contributions
 beans+0.085
 lettuce+0.085
owers+0.084
 Freddy+0.082
ados+0.079
Neg logit contributions
Paul-0.070
 York-0.065
Grand-0.063
abella-0.063
bell-0.062
Token<|endoftext|>
Token ID50256
Feature activation-1.85e+00

Pos logit contributions
 lettuce+0.026
owers+0.025
 Freddy+0.025
 beans+0.025
ados+0.024
Neg logit contributions
Paul-0.017
 York-0.015
 security-0.015
Grand-0.015
abella-0.015
TokenEric
Token ID25004
Feature activation+3.71e+00

Pos logit contributions
owers+0.078
 lettuce+0.076
 Freddy+0.075
 beans+0.074
ados+0.071
Neg logit contributions
Paul-0.064
 security-0.060
 York-0.060
Grand-0.059
abella-0.059
Token and
Token ID290
Feature activation+1.57e-01

Pos logit contributions
Paul+0.141
 security+0.132
 elev+0.130
 York+0.127
Mad+0.125
Neg logit contributions
 lettuce-0.148
 Freddy-0.145
 beans-0.144
owers-0.137
 mushroom-0.133
Token Alison
Token ID33629
Feature activation+5.04e+00

Pos logit contributions
Paul+0.005
 York+0.004
abella+0.004
 security+0.004
Grand+0.004
Neg logit contributions
 lettuce-0.007
 Freddy-0.007
owers-0.007
 beans-0.007
ados-0.007
Token liked
Token ID8288
Feature activation+8.46e-01

Pos logit contributions
Paul+0.208
 York+0.194
Grand+0.192
 security+0.192
abella+0.188
Neg logit contributions
owers-0.180
 lettuce-0.177
 beans-0.176
 Freddy-0.174
 squash-0.164
Token to
Token ID284
Feature activation+5.32e-01

Pos logit contributions
Paul+0.027
 York+0.025
 security+0.025
abella+0.024
Grand+0.024
Neg logit contributions
 lettuce-0.038
owers-0.038
 beans-0.037
 Freddy-0.036
ados-0.036
Token play
Token ID711
Feature activation+7.33e-01

Pos logit contributions
Paul+0.021
 York+0.019
Grand+0.019
 security+0.019
abella+0.019
Neg logit contributions
owers-0.020
 lettuce-0.020
 beans-0.019
ados-0.019
 Freddy-0.019
Token together
Token ID1978
Feature activation-1.10e+00

Pos logit contributions
Paul+0.023
 York+0.021
 security+0.021
abella+0.021
Grand+0.021
Neg logit contributions
 lettuce-0.033
owers-0.033
 beans-0.032
 Freddy-0.031
ados-0.031
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 lettuce+0.052
owers+0.052
 beans+0.051
 Freddy+0.050
ados+0.050
Neg logit contributions
Paul-0.032
 York-0.029
 security-0.029
Grand-0.028
abella-0.028
Token said
Token ID531
Feature activation+3.08e-01

Pos logit contributions
Paul+0.028
 security+0.027
bell+0.027
 much+0.027
 airport+0.027
Neg logit contributions
 lettuce-0.035
ados-0.035
 beans-0.033
owers-0.033
 Freddy-0.032
Token sorry
Token ID7926
Feature activation-1.06e+00

Pos logit contributions
Paul+0.012
 York+0.011
Grand+0.011
abella+0.010
 security+0.010
Neg logit contributions
 lettuce-0.013
 beans-0.012
 Freddy-0.012
owers-0.012
ados-0.012
Token to
Token ID284
Feature activation+4.18e-01

Pos logit contributions
owers+0.053
 lettuce+0.053
 beans+0.051
 Freddy+0.051
ados+0.050
Neg logit contributions
Paul-0.029
 security-0.026
 York-0.026
Grand-0.026
 much-0.025
Token the
Token ID262
Feature activation-3.21e-01

Pos logit contributions
Paul+0.015
 security+0.013
 York+0.013
Grand+0.013
abella+0.013
Neg logit contributions
 lettuce-0.018
owers-0.017
 Freddy-0.017
 beans-0.017
ados-0.017
Token bike
Token ID7161
Feature activation+1.49e+00

Pos logit contributions
 beans+0.011
 lettuce+0.010
 mushroom+0.010
 squash+0.009
 berries+0.009
Neg logit contributions
Paul-0.015
abella-0.014
bell-0.014
 York-0.014
ris-0.013
Token rider
Token ID17876
Feature activation+5.04e+00

Pos logit contributions
Paul+0.055
abella+0.050
 York+0.050
Grand+0.049
Daniel+0.048
Neg logit contributions
 Freddy-0.059
owers-0.059
 beans-0.059
 lettuce-0.059
ados-0.057
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
Paul+0.163
 York+0.146
Grand+0.145
abella+0.143
 security+0.141
Neg logit contributions
owers-0.226
 lettuce-0.225
 beans-0.221
 Freddy-0.220
ados-0.216
Token They
Token ID1119
Feature activation-4.91e-02

Pos logit contributions
Paul+0.001
abella+0.001
 security+0.001
 York+0.001
 airport+0.001
Neg logit contributions
 lettuce-0.001
owers-0.001
 Freddy-0.001
 beans-0.001
ados-0.000
Token cleaned
Token ID20750
Feature activation-6.00e-01

Pos logit contributions
 lettuce+0.002
owers+0.002
 beans+0.002
 Freddy+0.002
ados+0.002
Neg logit contributions
Paul-0.002
 York-0.002
 security-0.002
abella-0.002
Grand-0.002
Token up
Token ID510
Feature activation+8.82e-01

Pos logit contributions
ados+0.023
 Freddy+0.022
 lettuce+0.022
 beans+0.022
 squash+0.022
Neg logit contributions
Paul-0.024
Grand-0.023
bell-0.023
 airport-0.022
abella-0.022
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
Paul+0.034
Jo+0.031
abella+0.031
Grand+0.030
Mad+0.030
Neg logit contributions
 beans-0.036
 lettuce-0.035
 grapes-0.034
ados-0.034
 shovel-0.034
Tokenizz
Token ID6457
Feature activation+2.61e+00

Pos logit contributions
Paul+0.003
 York+0.003
 security+0.003
Grand+0.002
 airport+0.002
Neg logit contributions
owers-0.003
 lettuce-0.003
 Freddy-0.003
 beans-0.003
ados-0.003
Tokened
Token ID276
Feature activation-5.33e-01

Pos logit contributions
Paul+0.109
Grand+0.101
 York+0.101
 security+0.100
abella+0.098
Neg logit contributions
 lettuce-0.092
owers-0.092
 beans-0.089
 Freddy-0.089
ados-0.088
Token off
Token ID572
Feature activation+1.19e+00

Pos logit contributions
 Freddy+0.018
 lettuce+0.018
 beans+0.017
ados+0.017
 mushroom+0.016
Neg logit contributions
Paul-0.023
Grand-0.022
abella-0.022
 York-0.021
Mad-0.021
Token back
Token ID736
Feature activation-2.61e-01

Pos logit contributions
Paul+0.046
Grand+0.042
abella+0.042
 York+0.042
Daniel+0.039
Neg logit contributions
 Freddy-0.047
 lettuce-0.047
 beans-0.046
owers-0.045
ados-0.044
Token into
Token ID656
Feature activation-3.51e-01

Pos logit contributions
owers+0.009
ados+0.009
 lettuce+0.009
 beans+0.009
 Freddy+0.009
Neg logit contributions
abella-0.011
Paul-0.011
Grand-0.011
Daniel-0.010
Mad-0.010
Token outer
Token ID12076
Feature activation+5.04e+00

Pos logit contributions
 lettuce+0.015
 beans+0.015
 Freddy+0.015
 shovel+0.014
 squash+0.014
Neg logit contributions
abella-0.012
Paul-0.012
Grand-0.011
bell-0.011
 York-0.011
Token space
Token ID2272
Feature activation+1.78e+00

Pos logit contributions
abella+0.210
Paul+0.205
bell+0.192
Jo+0.190
Grand+0.183
Neg logit contributions
 lettuce-0.190
 beans-0.185
 mushroom-0.172
 shovel-0.162
 mushrooms-0.159
Token.
Token ID13
Feature activation-5.12e-01

Pos logit contributions
Paul+0.069
abella+0.067
Grand+0.065
V+0.063
Mad+0.062
Neg logit contributions
 beans-0.067
owers-0.067
 lettuce-0.065
 Freddy-0.063
ados-0.062
Token Earth
Token ID3668
Feature activation+1.66e-01

Pos logit contributions
 Freddy+0.020
owers+0.020
 lettuce+0.020
ados+0.019
 beans+0.019
Neg logit contributions
Paul-0.019
 security-0.018
Grand-0.018
 York-0.018
 airport-0.018
Token was
Token ID373
Feature activation+7.37e-01

Pos logit contributions
Paul+0.005
Grand+0.005
abella+0.005
 York+0.005
 security+0.004
Neg logit contributions
owers-0.008
 lettuce-0.008
 beans-0.007
 Freddy-0.007
 mushroom-0.007
Token an
Token ID281
Feature activation+7.71e-01

Pos logit contributions
 York+0.031
Paul+0.028
Offic+0.028
Mad+0.027
abella+0.027
Neg logit contributions
 Freddy-0.026
 lettuce-0.024
cr-0.023
 beans-0.021
 Freddie-0.021
Token park
Token ID3952
Feature activation+1.13e+00

Pos logit contributions
Paul+0.035
abella+0.033
 York+0.031
Grand+0.031
bell+0.031
Neg logit contributions
 lettuce-0.035
 beans-0.034
 Freddy-0.033
 mushroom-0.032
owers-0.031
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
Paul+0.034
 York+0.031
abella+0.031
Grand+0.030
 security+0.030
Neg logit contributions
owers-0.053
 lettuce-0.053
 beans-0.052
 Freddy-0.051
ados-0.051
Token Tim
Token ID5045
Feature activation-6.00e-01

Pos logit contributions
Paul+0.040
 York+0.037
abella+0.036
 security+0.036
Grand+0.036
Neg logit contributions
 lettuce-0.044
owers-0.044
 beans-0.042
ados-0.042
 Freddy-0.041
Token was
Token ID373
Feature activation+8.76e-01

Pos logit contributions
 lettuce+0.025
 Freddy+0.025
 beans+0.024
owers+0.023
 mushroom+0.023
Neg logit contributions
Paul-0.021
 York-0.020
 security-0.019
Grand-0.019
abella-0.019
Token the
Token ID262
Feature activation+8.44e-01

Pos logit contributions
 York+0.035
bell+0.034
Paul+0.033
ris+0.033
Jo+0.031
Neg logit contributions
 Freddy-0.033
 lettuce-0.030
 beans-0.030
sed-0.029
 grapes-0.029
Token seeker
Token ID45993
Feature activation+5.01e+00

Pos logit contributions
Paul+0.030
 security+0.028
Grand+0.027
 much+0.027
abella+0.027
Neg logit contributions
owers-0.036
 lettuce-0.035
ados-0.035
 beans-0.034
 Freddy-0.034
Token and
Token ID290
Feature activation-2.19e-01

Pos logit contributions
Paul+0.147
 York+0.133
 security+0.131
Grand+0.128
abella+0.126
Neg logit contributions
 lettuce-0.239
owers-0.239
 beans-0.236
 Freddy-0.233
ados-0.230
Token Lily
Token ID20037
Feature activation+6.05e-01

Pos logit contributions
 Freddy+0.009
 lettuce+0.009
 beans+0.008
owers+0.008
ados+0.008
Neg logit contributions
Paul-0.008
 York-0.008
bell-0.008
 airport-0.008
 security-0.008
Token was
Token ID373
Feature activation+1.45e+00

Pos logit contributions
Paul+0.019
 York+0.018
Grand+0.017
 security+0.017
abella+0.017
Neg logit contributions
 lettuce-0.027
owers-0.027
 beans-0.027
 Freddy-0.027
 mushroom-0.026
Token the
Token ID262
Feature activation+7.04e-01

Pos logit contributions
bell+0.060
 York+0.058
ris+0.057
Paul+0.055
Daniel+0.054
Neg logit contributions
 Freddy-0.052
 grapes-0.045
ados-0.044
 beans-0.044
 mushroom-0.044
Token h
Token ID289
Feature activation+2.50e+00

Pos logit contributions
Paul+0.024
 York+0.022
 security+0.022
abella+0.022
Grand+0.022
Neg logit contributions
 lettuce-0.030
owers-0.030
 beans-0.029
 Freddy-0.029
ados-0.028
Token park
Token ID3952
Feature activation+1.19e+00

Pos logit contributions
Paul+0.039
abella+0.037
 York+0.035
Jo+0.034
bell+0.034
Neg logit contributions
 lettuce-0.036
 beans-0.035
 mushroom-0.034
 grapes-0.033
 Freddy-0.033
Token.
Token ID13
Feature activation+9.79e-01

Pos logit contributions
Paul+0.043
 York+0.040
abella+0.039
Grand+0.038
Mad+0.037
Neg logit contributions
owers-0.050
 lettuce-0.048
 Freddy-0.048
 beans-0.048
ados-0.046
Token Tom
Token ID4186
Feature activation+8.45e-01

Pos logit contributions
Paul+0.043
 York+0.040
 security+0.040
Grand+0.039
Mad+0.038
Neg logit contributions
 lettuce-0.033
owers-0.033
 Freddy-0.033
 beans-0.031
 grapes-0.031
Token was
Token ID373
Feature activation+9.46e-01

Pos logit contributions
Paul+0.032
 York+0.030
Mad+0.030
Grand+0.029
abella+0.029
Neg logit contributions
 lettuce-0.032
 Freddy-0.032
 beans-0.031
 mushroom-0.031
sed-0.031
Token the
Token ID262
Feature activation+1.03e+00

Pos logit contributions
 York+0.038
Paul+0.036
bell+0.035
ris+0.034
Jo+0.034
Neg logit contributions
 Freddy-0.035
 lettuce-0.033
 beans-0.032
sed-0.031
 grapes-0.031
Token seeker
Token ID45993
Feature activation+5.00e+00

Pos logit contributions
Paul+0.037
 security+0.035
Grand+0.034
abella+0.034
 York+0.034
Neg logit contributions
owers-0.042
 lettuce-0.042
 beans-0.041
ados-0.040
 Freddy-0.040
Token and
Token ID290
Feature activation-1.48e-01

Pos logit contributions
Paul+0.153
 York+0.139
 security+0.136
Grand+0.133
 much+0.130
Neg logit contributions
owers-0.232
 lettuce-0.231
 beans-0.229
 Freddy-0.227
ados-0.223
Token Mia
Token ID32189
Feature activation+1.08e+00

Pos logit contributions
 Freddy+0.005
 lettuce+0.005
 beans+0.005
owers+0.005
 grapes+0.005
Neg logit contributions
 York-0.006
Paul-0.006
bell-0.006
 airport-0.006
Grand-0.006
Token was
Token ID373
Feature activation+1.53e+00

Pos logit contributions
Paul+0.036
 York+0.033
Grand+0.032
 security+0.031
abella+0.031
Neg logit contributions
 lettuce-0.047
owers-0.047
 beans-0.047
 Freddy-0.047
 mushroom-0.045
Token the
Token ID262
Feature activation+8.54e-01

Pos logit contributions
bell+0.063
 York+0.063
ris+0.060
Paul+0.059
Grand+0.058
Neg logit contributions
 Freddy-0.052
sed-0.045
 grapes-0.045
 beans-0.044
 mushroom-0.044
Token h
Token ID289
Feature activation+2.64e+00

Pos logit contributions
Paul+0.030
 York+0.029
abella+0.027
Grand+0.027
bell+0.027
Neg logit contributions
 lettuce-0.035
 beans-0.034
 Freddy-0.034
owers-0.034
 mushroom-0.033
Token continued
Token ID3767
Feature activation+1.17e+00

Pos logit contributions
Paul+0.008
 much+0.008
Grand+0.007
abella+0.007
 security+0.007
Neg logit contributions
 lettuce-0.020
owers-0.020
ados-0.020
 beans-0.019
 Freddy-0.019
Token to
Token ID284
Feature activation+4.98e-01

Pos logit contributions
Paul+0.041
 York+0.037
Grand+0.037
 security+0.037
abella+0.037
Neg logit contributions
 lettuce-0.050
owers-0.049
 beans-0.049
 Freddy-0.048
ados-0.047
Token scatter
Token ID41058
Feature activation+1.59e-01

Pos logit contributions
Paul+0.021
abella+0.020
Grand+0.020
 York+0.020
 security+0.020
Neg logit contributions
 lettuce-0.017
owers-0.017
 beans-0.017
 Freddy-0.017
ados-0.016
Token the
Token ID262
Feature activation+1.78e-01

Pos logit contributions
Paul+0.005
 security+0.004
 York+0.004
 much+0.004
abella+0.004
Neg logit contributions
owers-0.007
 lettuce-0.007
 beans-0.007
 Freddy-0.007
ados-0.007
Token bread
Token ID8509
Feature activation-2.47e+00

Pos logit contributions
 security+0.007
Paul+0.007
 much+0.007
 airport+0.007
 York+0.006
Neg logit contributions
owers-0.006
 Freddy-0.006
ados-0.006
 lettuce-0.006
sed-0.006
Tokencr
Token ID6098
Feature activation-5.18e+00

Pos logit contributions
owers+0.122
sed+0.122
ses+0.119
ados+0.117
 lettuce+0.115
Neg logit contributions
 much-0.059
--0.051
 soon-0.049
 security-0.045
bell-0.043
Tokenumbs
Token ID18146
Feature activation-3.63e+00

Pos logit contributions
sed+0.174
ses+0.166
ados+0.143
 mushroom+0.140
 shovel+0.139
Neg logit contributions
 much-0.240
ris-0.213
bell-0.213
�-0.212
--0.207
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
owers+0.190
sed+0.189
ados+0.186
 lettuce+0.184
mons+0.184
Neg logit contributions
 much-0.087
bell-0.067
 security-0.066
 soon-0.066
Paul-0.064
Token<|endoftext|>
Token ID50256
Feature activation-1.75e+00

Pos logit contributions
 lettuce+0.007
owers+0.007
 beans+0.007
 Freddy+0.007
ados+0.006
Neg logit contributions
Paul-0.005
 security-0.005
 York-0.005
Grand-0.005
abella-0.005
TokenOnce
Token ID7454
Feature activation+8.14e-01

Pos logit contributions
owers+0.080
 lettuce+0.078
 Freddy+0.078
 beans+0.076
 mushroom+0.073
Neg logit contributions
Paul-0.054
 security-0.051
 York-0.050
abella-0.050
Grand-0.049
Token upon
Token ID2402
Feature activation-9.69e-01

Pos logit contributions
Paul+0.038
 York+0.037
 security+0.035
Grand+0.035
abella+0.034
Neg logit contributions
 lettuce-0.025
 Freddy-0.025
owers-0.024
 beans-0.024
 mushroom-0.022
Token it
Token ID340
Feature activation-1.94e-01

Pos logit contributions
owers+0.030
 lettuce+0.030
 beans+0.029
ados+0.028
sed+0.027
Neg logit contributions
Paul-0.024
 security-0.023
 much-0.022
abella-0.021
Grand-0.021
Token was
Token ID373
Feature activation+5.74e-01

Pos logit contributions
 lettuce+0.008
 beans+0.008
 Freddy+0.008
cr+0.008
 shovel+0.007
Neg logit contributions
Paul-0.007
 York-0.007
 France-0.006
Mad-0.006
 security-0.006
Token difficult
Token ID2408
Feature activation+6.45e-01

Pos logit contributions
Paul+0.019
 York+0.018
 security+0.017
abella+0.016
Grand+0.016
Neg logit contributions
 lettuce-0.026
 beans-0.026
 Freddy-0.025
owers-0.024
ados-0.024
Token finding
Token ID4917
Feature activation-7.39e-02

Pos logit contributions
Paul+0.024
 York+0.022
Grand+0.022
abella+0.022
 security+0.021
Neg logit contributions
 beans-0.026
 lettuce-0.026
 Freddy-0.025
owers-0.025
ados-0.025
Token r
Token ID374
Feature activation-2.04e+00

Pos logit contributions
 beans+0.003
 lettuce+0.003
 shovel+0.003
 Freddy+0.003
 squash+0.003
Neg logit contributions
Grand-0.003
Paul-0.003
abella-0.003
Offic-0.003
bell-0.003
Tokenags
Token ID3775
Feature activation-5.08e+00

Pos logit contributions
 lettuce+0.075
owers+0.074
 beans+0.072
 Freddy+0.071
ados+0.070
Neg logit contributions
Paul-0.083
 York-0.077
 security-0.077
abella-0.076
Grand-0.076
Token and
Token ID290
Feature activation+5.90e-01

Pos logit contributions
 beans+0.199
 Freddy+0.199
 lettuce+0.197
 shovel+0.193
cr+0.193
Neg logit contributions
Paul-0.182
Grand-0.181
 York-0.178
Offic-0.172
Mad-0.171
Token water
Token ID1660
Feature activation-8.30e-02

Pos logit contributions
Paul+0.019
 York+0.018
Grand+0.017
abella+0.017
 security+0.017
Neg logit contributions
 lettuce-0.027
 Freddy-0.026
 beans-0.026
owers-0.026
ados-0.025
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
 lettuce+0.004
owers+0.004
 beans+0.004
 Freddy+0.004
ados+0.004
Neg logit contributions
Paul-0.002
 York-0.002
 security-0.002
abella-0.002
Grand-0.002
Token But
Token ID887
Feature activation+9.79e-01

Pos logit contributions
owers+0.005
 beans+0.005
 lettuce+0.005
ados+0.005
 mushroom+0.005
Neg logit contributions
Paul-0.005
 much-0.004
 security-0.004
 York-0.004
abella-0.004
Token Bob
Token ID5811
Feature activation-9.46e-01

Pos logit contributions
Paul+0.040
 York+0.038
 security+0.037
abella+0.037
Grand+0.037
Neg logit contributions
 lettuce-0.035
owers-0.035
 beans-0.034
 Freddy-0.033
ados-0.033
Token mum
Token ID25682
Feature activation-1.32e-01

Pos logit contributions
owers+0.047
 lettuce+0.046
 beans+0.044
ados+0.044
 Freddy+0.044
Neg logit contributions
Paul-0.055
 security-0.051
abella-0.051
 York-0.051
Grand-0.051
Token gave
Token ID2921
Feature activation-6.89e-01

Pos logit contributions
 lettuce+0.005
owers+0.005
 Freddy+0.005
 beans+0.005
ados+0.004
Neg logit contributions
Paul-0.005
 York-0.005
 security-0.005
Grand-0.005
abella-0.005
Token him
Token ID683
Feature activation-1.96e+00

Pos logit contributions
owers+0.033
 lettuce+0.031
sed+0.031
mons+0.031
 beans+0.030
Neg logit contributions
Paul-0.021
abella-0.019
 much-0.018
 security-0.018
Grand-0.018
Token some
Token ID617
Feature activation-7.85e-01

Pos logit contributions
 lettuce+0.092
 beans+0.090
 Freddy+0.089
owers+0.087
ados+0.086
Neg logit contributions
Paul-0.062
 York-0.057
Grand-0.055
 security-0.055
bell-0.054
Token bread
Token ID8509
Feature activation-3.04e+00

Pos logit contributions
owers+0.025
ados+0.024
sed+0.023
 Freddy+0.023
 lettuce+0.022
Neg logit contributions
 much-0.033
 security-0.032
Paul-0.032
 tow-0.032
 York-0.031
Tokencr
Token ID6098
Feature activation-5.06e+00

Pos logit contributions
owers+0.156
ados+0.151
sed+0.149
 lettuce+0.147
 Freddy+0.147
Neg logit contributions
 much-0.064
Paul-0.058
--0.058
Grand-0.055
 tow-0.054
Tokenumbs
Token ID18146
Feature activation-4.06e+00

Pos logit contributions
sed+0.155
ses+0.145
ados+0.138
owers+0.133
mons+0.129
Neg logit contributions
 much-0.232
ris-0.220
Paul-0.215
bell-0.211
abella-0.208
Token.
Token ID13
Feature activation-6.46e-01

Pos logit contributions
owers+0.223
ados+0.217
sed+0.216
 lettuce+0.214
mons+0.211
Neg logit contributions
 much-0.080
Paul-0.067
 security-0.065
bell-0.063
abella-0.063
Token Jack
Token ID3619
Feature activation-1.28e+00

Pos logit contributions
 lettuce+0.027
owers+0.027
 beans+0.026
 Freddy+0.026
ados+0.025
Neg logit contributions
Paul-0.023
 York-0.021
 security-0.021
abella-0.021
Grand-0.021
Token smiled
Token ID13541
Feature activation-8.35e-01

Pos logit contributions
 lettuce+0.058
owers+0.058
 beans+0.057
 Freddy+0.056
ados+0.055
Neg logit contributions
Paul-0.040
 York-0.036
 security-0.036
abella-0.036
Grand-0.036
Token as
Token ID355
Feature activation-6.50e-01

Pos logit contributions
 lettuce+0.035
 Freddy+0.034
 beans+0.034
owers+0.033
ados+0.032
Neg logit contributions
Paul-0.030
 York-0.028
abella-0.028
 security-0.027
 elev-0.026
Token and
Token ID290
Feature activation+5.09e-01

Pos logit contributions
owers+0.062
 lettuce+0.062
sed+0.060
ados+0.060
mons+0.060
Neg logit contributions
 much-0.023
Paul-0.022
 security-0.020
 York-0.020
 airport-0.019
Token make
Token ID787
Feature activation+6.01e-01

Pos logit contributions
Paul+0.018
 much+0.017
 York+0.017
 security+0.017
bell+0.017
Neg logit contributions
owers-0.021
ados-0.019
 lettuce-0.019
 beans-0.019
mons-0.019
Token them
Token ID606
Feature activation-1.94e+00

Pos logit contributions
Paul+0.021
Grand+0.020
bell+0.020
 York+0.019
 security+0.019
Neg logit contributions
 lettuce-0.026
 beans-0.025
 Freddy-0.025
ados-0.024
owers-0.024
Token go
Token ID467
Feature activation+8.58e-01

Pos logit contributions
owers+0.080
 lettuce+0.076
sed+0.075
ados+0.075
mons+0.074
Neg logit contributions
 much-0.070
Paul-0.066
 security-0.064
 York-0.062
bell-0.061
Token "
Token ID366
Feature activation+6.16e-01

Pos logit contributions
Paul+0.030
abella+0.029
Grand+0.029
 York+0.028
 security+0.028
Neg logit contributions
 Freddy-0.036
 beans-0.035
 lettuce-0.035
 mushroom-0.033
cr-0.033
Tokencr
Token ID6098
Feature activation-5.06e+00

Pos logit contributions
Paul+0.020
 security+0.019
abella+0.018
 York+0.018
 airport+0.018
Neg logit contributions
owers-0.027
 lettuce-0.026
ados-0.026
 beans-0.026
 Freddy-0.026
Tokenunch
Token ID3316
Feature activation-1.06e+00

Pos logit contributions
sed+0.204
ses+0.195
ados+0.190
 lettuce+0.188
 beans+0.185
Neg logit contributions
Paul-0.189
 much-0.185
ris-0.184
bell-0.176
Grand-0.174
Token".
Token ID1911
Feature activation-2.01e-01

Pos logit contributions
 lettuce+0.051
owers+0.050
 beans+0.049
 Freddy+0.049
sed+0.048
Neg logit contributions
Paul-0.030
 York-0.027
 much-0.027
abella-0.026
Grand-0.026
Token
Token ID220
Feature activation-3.60e-01

Pos logit contributions
 lettuce+0.010
owers+0.010
 beans+0.009
ados+0.009
 Freddy+0.009
Neg logit contributions
Paul-0.006
 York-0.005
Grand-0.005
abella-0.005
 security-0.005
Token
Token ID198
Feature activation+4.70e-01

Pos logit contributions
owers+0.017
 lettuce+0.017
 Freddy+0.017
 beans+0.016
ados+0.016
Neg logit contributions
Paul-0.010
 security-0.010
 York-0.010
 airport-0.009
bell-0.009
Token
Token ID198
Feature activation-8.92e-02

Pos logit contributions
 security+0.017
 York+0.016
 airport+0.016
bell+0.015
 elev+0.015
Neg logit contributions
 Freddy-0.019
 lettuce-0.018
owers-0.017
cr-0.017
 beans-0.016
Token is
Token ID318
Feature activation+1.31e+00

Pos logit contributions
owers+0.066
 lettuce+0.065
ados+0.064
 beans+0.063
 Freddy+0.062
Neg logit contributions
 security-0.020
 much-0.020
 York-0.019
--0.018
bell-0.018
Token stronger
Token ID7387
Feature activation+2.17e+00

Pos logit contributions
Paul+0.039
 security+0.036
 much+0.035
Grand+0.035
abella+0.035
Neg logit contributions
owers-0.063
 lettuce-0.061
ados-0.060
 beans-0.060
 Freddy-0.059
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
Paul+0.075
 York+0.068
abella+0.066
Grand+0.065
 airport+0.064
Neg logit contributions
 lettuce-0.093
owers-0.092
 beans-0.091
 Freddy-0.090
ados-0.089
Token He
Token ID679
Feature activation-1.58e-01

Pos logit contributions
 lettuce+0.005
owers+0.005
 beans+0.005
 Freddy+0.005
ados+0.005
Neg logit contributions
Paul-0.003
 York-0.003
 security-0.003
Grand-0.003
abella-0.003
Token dr
Token ID1553
Feature activation-1.34e-02

Pos logit contributions
owers+0.006
 lettuce+0.006
 beans+0.006
ados+0.006
 Freddy+0.006
Neg logit contributions
Paul-0.006
 York-0.005
 security-0.005
abella-0.005
Grand-0.005
Tokenags
Token ID3775
Feature activation-4.98e+00

Pos logit contributions
owers+0.000
 lettuce+0.000
sed+0.000
 beans+0.000
ados+0.000
Neg logit contributions
Paul-0.001
 much-0.001
abella-0.001
 security-0.001
 tow-0.001
Token Lily
Token ID20037
Feature activation-8.16e-01

Pos logit contributions
 Freddy+0.201
 lettuce+0.200
 beans+0.197
 grapes+0.190
ados+0.190
Neg logit contributions
Paul-0.177
Grand-0.174
 York-0.173
Mad-0.168
bell-0.166
Token into
Token ID656
Feature activation-1.55e+00

Pos logit contributions
owers+0.034
 lettuce+0.033
 beans+0.033
 Freddy+0.032
ados+0.032
Neg logit contributions
Paul-0.029
 security-0.027
 York-0.026
 much-0.026
Grand-0.026
Token the
Token ID262
Feature activation-1.05e-01

Pos logit contributions
 lettuce+0.070
 beans+0.069
owers+0.068
 Freddy+0.068
ados+0.067
Neg logit contributions
Paul-0.050
 York-0.045
abella-0.044
 security-0.044
Grand-0.044
Token pond
Token ID16723
Feature activation-1.14e+00

Pos logit contributions
owers+0.004
 lettuce+0.004
ados+0.003
 Freddy+0.003
 beans+0.003
Neg logit contributions
Paul-0.004
 security-0.004
 York-0.004
 airport-0.004
Grand-0.004
Token.
Token ID13
Feature activation-7.21e-02

Pos logit contributions
 lettuce+0.055
owers+0.055
 beans+0.054
 Freddy+0.053
ados+0.053
Neg logit contributions
Paul-0.033
 York-0.029
abella-0.029
 security-0.029
Grand-0.029
Token love
Token ID1842
Feature activation+8.38e-01

Pos logit contributions
Paul+0.022
 York+0.020
 security+0.020
Grand+0.020
abella+0.019
Neg logit contributions
owers-0.017
 lettuce-0.016
 beans-0.016
 Freddy-0.016
cr-0.015
Token you
Token ID345
Feature activation+3.81e-02

Pos logit contributions
Paul+0.031
 security+0.029
 much+0.029
 York+0.029
abella+0.028
Neg logit contributions
owers-0.033
 lettuce-0.032
mons-0.031
 beans-0.030
sed-0.030
Token"
Token ID1
Feature activation-3.64e-01

Pos logit contributions
Paul+0.001
 York+0.001
 security+0.001
Grand+0.001
abella+0.001
Neg logit contributions
 lettuce-0.002
owers-0.002
 beans-0.002
 Freddy-0.002
ados-0.002
Token and
Token ID290
Feature activation-1.60e-01

Pos logit contributions
owers+0.016
 lettuce+0.016
 beans+0.016
 Freddy+0.015
ados+0.015
Neg logit contributions
Paul-0.012
 York-0.011
 security-0.011
abella-0.010
 tow-0.010
Token "
Token ID366
Feature activation+9.16e-01

Pos logit contributions
 lettuce+0.007
 Freddy+0.006
 beans+0.006
owers+0.006
ados+0.006
Neg logit contributions
Paul-0.006
 York-0.005
Grand-0.005
 security-0.005
bell-0.005
Tokencr
Token ID6098
Feature activation-4.97e+00

Pos logit contributions
Paul+0.035
 security+0.034
 York+0.034
abella+0.033
 airport+0.032
Neg logit contributions
owers-0.035
 lettuce-0.034
 beans-0.033
 Freddy-0.032
ados-0.032
Tokenacker
Token ID10735
Feature activation-1.73e+00

Pos logit contributions
 lettuce+0.198
owers+0.193
ados+0.193
 beans+0.192
sed+0.191
Neg logit contributions
Paul-0.183
 much-0.168
abella-0.168
Grand-0.166
 York-0.164
Token".
Token ID1911
Feature activation-1.26e-01

Pos logit contributions
owers+0.088
 lettuce+0.087
 beans+0.085
ados+0.083
 Freddy+0.083
Neg logit contributions
Paul-0.047
 security-0.041
Grand-0.041
 York-0.041
abella-0.040
Token He
Token ID679
Feature activation-4.35e-01

Pos logit contributions
 lettuce+0.005
 Freddy+0.005
 beans+0.005
owers+0.005
ados+0.005
Neg logit contributions
Paul-0.005
 security-0.004
 York-0.004
Grand-0.004
abella-0.004
Token also
Token ID635
Feature activation-4.69e-02

Pos logit contributions
 lettuce+0.017
owers+0.017
 beans+0.017
 Freddy+0.016
sed+0.016
Neg logit contributions
Paul-0.017
Grand-0.015
 security-0.015
 York-0.015
abella-0.014
Token likes
Token ID7832
Feature activation-2.21e-02

Pos logit contributions
 lettuce+0.002
owers+0.002
 beans+0.002
 Freddy+0.002
sed+0.002
Neg logit contributions
Paul-0.002
 York-0.002
Grand-0.002
 security-0.002
abella-0.002
Token "
Token ID366
Feature activation+2.94e-01

Pos logit contributions
 lettuce+0.020
owers+0.019
 Freddy+0.019
 beans+0.019
ados+0.019
Neg logit contributions
Paul-0.014
 York-0.013
 security-0.013
abella-0.013
 airport-0.013
TokenYou
Token ID1639
Feature activation-5.34e-01

Pos logit contributions
Paul+0.011
 York+0.010
 security+0.010
abella+0.010
Grand+0.010
Neg logit contributions
 lettuce-0.012
owers-0.012
 beans-0.011
 Freddy-0.011
ados-0.011
Token can
Token ID460
Feature activation-3.91e-01

Pos logit contributions
owers+0.018
ados+0.018
 lettuce+0.018
mons+0.017
 beans+0.017
Neg logit contributions
Paul-0.022
 much-0.022
abella-0.021
 security-0.021
 York-0.020
Token have
Token ID423
Feature activation-1.13e+00

Pos logit contributions
 lettuce+0.016
owers+0.015
 beans+0.015
ados+0.015
sed+0.015
Neg logit contributions
Paul-0.014
 much-0.013
Grand-0.013
abella-0.013
 York-0.013
Token a
Token ID257
Feature activation-1.14e+00

Pos logit contributions
owers+0.055
 lettuce+0.053
ados+0.052
 beans+0.051
sed+0.051
Neg logit contributions
Paul-0.033
 much-0.030
abella-0.030
Grand-0.029
 security-0.029
Token bite
Token ID13197
Feature activation-4.95e+00

Pos logit contributions
owers+0.043
 lettuce+0.042
ados+0.041
 beans+0.040
 Freddy+0.040
Neg logit contributions
Paul-0.044
 security-0.042
 much-0.041
 York-0.041
Grand-0.041
Token".
Token ID1911
Feature activation-6.66e-01

Pos logit contributions
owers+0.246
 lettuce+0.242
 beans+0.235
ados+0.234
sed+0.234
Neg logit contributions
Paul-0.127
 much-0.121
abella-0.115
Grand-0.115
 security-0.115
Token The
Token ID383
Feature activation-7.57e-01

Pos logit contributions
 lettuce+0.037
 beans+0.037
sed+0.036
owers+0.036
 pulp+0.035
Neg logit contributions
Paul-0.014
 much-0.012
Jo-0.011
Grand-0.011
abella-0.011
Token mouse
Token ID10211
Feature activation-2.32e-01

Pos logit contributions
owers+0.028
cr+0.026
 Freddy+0.026
mons+0.026
ados+0.026
Neg logit contributions
 security-0.029
 much-0.028
Paul-0.028
Grand-0.028
 tow-0.028
Token tasted
Token ID29187
Feature activation-1.87e+00

Pos logit contributions
owers+0.011
ados+0.011
 lettuce+0.011
 beans+0.011
mons+0.011
Neg logit contributions
Paul-0.006
 much-0.006
abella-0.006
bell-0.006
Jo-0.006
Token the
Token ID262
Feature activation+4.03e-01

Pos logit contributions
owers+0.099
 lettuce+0.099
 beans+0.096
 Freddy+0.095
ados+0.095
Neg logit contributions
Paul-0.045
 security-0.039
 York-0.039
abella-0.039
Grand-0.038
Token face
Token ID1986
Feature activation-1.35e+00

Pos logit contributions
Paul+0.020
abella+0.019
 York+0.019
 security+0.019
Grand+0.019
Neg logit contributions
 lettuce-0.011
owers-0.011
 beans-0.010
ados-0.010
 Freddy-0.010
Token and
Token ID290
Feature activation+1.30e-01

Pos logit contributions
 lettuce+0.052
 beans+0.051
owers+0.051
cr+0.049
 grapes+0.047
Neg logit contributions
Paul-0.057
 York-0.049
Mad-0.049
abella-0.049
Daniel-0.047
Token the
Token ID262
Feature activation+8.69e-01

Pos logit contributions
Paul+0.004
 security+0.004
 York+0.004
Grand+0.004
 much+0.004
Neg logit contributions
owers-0.005
 lettuce-0.005
 Freddy-0.005
 beans-0.005
ados-0.005
Token coat
Token ID13209
Feature activation+3.72e+00

Pos logit contributions
Paul+0.031
 security+0.031
 York+0.030
Grand+0.030
 airport+0.029
Neg logit contributions
owers-0.035
 Freddy-0.034
 lettuce-0.033
ados-0.033
 beans-0.032
Token dr
Token ID1553
Feature activation+1.23e-01

Pos logit contributions
Paul+0.151
abella+0.130
Mad+0.129
 York+0.128
Grand+0.128
Neg logit contributions
owers-0.147
 lettuce-0.143
 beans-0.140
mons-0.137
sed-0.136
Tokenags
Token ID3775
Feature activation-4.89e+00

Pos logit contributions
Paul+0.007
 York+0.007
 security+0.007
Grand+0.007
abella+0.007
Neg logit contributions
 lettuce-0.002
owers-0.002
 beans-0.002
 Freddy-0.002
ados-0.002
Token on
Token ID319
Feature activation+1.24e+00

Pos logit contributions
 beans+0.170
cr+0.166
 lettuce+0.165
 Freddy+0.164
 grapes+0.161
Neg logit contributions
Paul-0.201
Mad-0.197
Grand-0.190
 York-0.190
Offic-0.187
Token the
Token ID262
Feature activation-6.27e-02

Pos logit contributions
Paul+0.043
 York+0.038
Grand+0.038
Mad+0.037
abella+0.037
Neg logit contributions
 lettuce-0.055
 beans-0.055
 Freddy-0.054
ados-0.053
 grapes-0.052
Token floor
Token ID4314
Feature activation+3.88e-01

Pos logit contributions
owers+0.002
 lettuce+0.002
 Freddy+0.002
 beans+0.002
ados+0.002
Neg logit contributions
Paul-0.002
 security-0.002
 York-0.002
Grand-0.002
 airport-0.002
Token.
Token ID13
Feature activation+3.27e-01

Pos logit contributions
Paul+0.016
 York+0.014
abella+0.014
Mad+0.014
Grand+0.014
Neg logit contributions
 beans-0.015
 lettuce-0.015
owers-0.014
cr-0.014
 shovel-0.014
Token
Token ID198
Feature activation+7.17e-01

Pos logit contributions
Paul+0.010
abella+0.009
 much+0.009
Grand+0.009
 York+0.009
Neg logit contributions
owers-0.015
 lettuce-0.015
 beans-0.015
ados-0.015
 Freddy-0.014
Token They
Token ID1119
Feature activation-5.51e-01

Pos logit contributions
owers+0.018
 lettuce+0.018
 beans+0.017
ados+0.017
 Freddy+0.017
Neg logit contributions
Paul-0.010
 York-0.009
abella-0.009
Grand-0.008
 much-0.008
Token offered
Token ID4438
Feature activation+1.62e-01

Pos logit contributions
owers+0.023
 lettuce+0.023
ados+0.023
 beans+0.023
mons+0.022
Neg logit contributions
Paul-0.019
 security-0.017
Grand-0.017
 much-0.017
 York-0.017
Token it
Token ID340
Feature activation-9.99e-01

Pos logit contributions
Paul+0.004
 security+0.004
 much+0.004
abella+0.004
 York+0.004
Neg logit contributions
owers-0.008
 lettuce-0.008
 beans-0.008
ados-0.008
 Freddy-0.008
Token some
Token ID617
Feature activation-8.47e-01

Pos logit contributions
owers+0.049
 lettuce+0.048
 beans+0.047
ados+0.047
 Freddy+0.047
Neg logit contributions
Paul-0.027
 much-0.025
 security-0.024
 York-0.024
abella-0.024
Token bread
Token ID8509
Feature activation-2.88e+00

Pos logit contributions
owers+0.027
ados+0.027
sed+0.026
 Freddy+0.025
 lettuce+0.025
Neg logit contributions
 much-0.036
 security-0.033
 tow-0.033
--0.032
Paul-0.032
Tokencr
Token ID6098
Feature activation-4.86e+00

Pos logit contributions
owers+0.141
ados+0.137
sed+0.133
 lettuce+0.132
 Freddy+0.132
Neg logit contributions
 much-0.067
--0.062
 soon-0.057
 tow-0.052
Paul-0.051
Tokenumbs
Token ID18146
Feature activation-3.86e+00

Pos logit contributions
sed+0.128
ses+0.118
ados+0.114
 beans+0.109
 Freddy+0.109
Neg logit contributions
 much-0.241
ris-0.230
bell-0.228
Paul-0.228
ind-0.221
Token and
Token ID290
Feature activation+1.41e-01

Pos logit contributions
owers+0.219
ados+0.214
sed+0.209
 lettuce+0.207
mons+0.204
Neg logit contributions
 much-0.076
--0.056
 soon-0.056
 security-0.053
 tow-0.049
Token it
Token ID340
Feature activation+7.10e-01

Pos logit contributions
 much+0.005
Paul+0.005
 security+0.005
 York+0.005
 tow+0.005
Neg logit contributions
owers-0.006
ados-0.005
 lettuce-0.005
 Freddy-0.005
mons-0.005
Token happily
Token ID18177
Feature activation-8.84e-01

Pos logit contributions
 much+0.021
Paul+0.020
 security+0.020
abella+0.020
Grand+0.019
Neg logit contributions
owers-0.032
ados-0.031
 lettuce-0.031
 beans-0.030
 Freddy-0.029
Token ate
Token ID15063
Feature activation-1.55e+00

Pos logit contributions
owers+0.037
ados+0.037
 lettuce+0.036
 beans+0.036
 Freddy+0.035
Neg logit contributions
 much-0.028
--0.027
 security-0.026
Paul-0.026
ris-0.025
Token opens
Token ID9808
Feature activation+4.88e-01

Pos logit contributions
Paul+0.006
Grand+0.006
abella+0.006
 York+0.006
 much+0.006
Neg logit contributions
 lettuce-0.006
 beans-0.006
 Freddy-0.006
sed-0.006
ados-0.006
Token it
Token ID340
Feature activation-2.75e-01

Pos logit contributions
Paul+0.019
Grand+0.018
abella+0.018
 York+0.017
Mad+0.017
Neg logit contributions
 Freddy-0.017
ados-0.017
 lettuce-0.016
 shovel-0.016
 beans-0.016
Token.
Token ID13
Feature activation-3.13e-01

Pos logit contributions
 lettuce+0.009
 beans+0.009
 mushroom+0.009
owers+0.009
ados+0.008
Neg logit contributions
Paul-0.013
abella-0.012
Mad-0.011
Daniel-0.011
 York-0.011
Token He
Token ID679
Feature activation-2.79e-01

Pos logit contributions
 Freddy+0.011
 lettuce+0.011
 shovel+0.010
sed+0.010
ados+0.010
Neg logit contributions
Paul-0.013
ind-0.013
 York-0.013
 airport-0.013
abella-0.012
Token dr
Token ID1553
Feature activation-5.62e-03

Pos logit contributions
 lettuce+0.010
sed+0.009
 beans+0.009
ses+0.009
 berries+0.009
Neg logit contributions
Paul-0.013
Grand-0.011
Mad-0.011
Jo-0.011
 much-0.011
Tokenags
Token ID3775
Feature activation-4.84e+00

Pos logit contributions
owers+0.000
 lettuce+0.000
 beans+0.000
 Freddy+0.000
ados+0.000
Neg logit contributions
Paul-0.000
 security-0.000
 York-0.000
abella-0.000
 much-0.000
Token Sue
Token ID26089
Feature activation-4.97e-01

Pos logit contributions
 lettuce+0.170
 Freddy+0.169
 shovel+0.167
 beans+0.165
ados+0.165
Neg logit contributions
Grand-0.184
Mad-0.183
Paul-0.178
bell-0.176
 York-0.173
Token to
Token ID284
Feature activation-1.65e-01

Pos logit contributions
 lettuce+0.021
owers+0.021
 beans+0.021
 Freddy+0.020
ados+0.020
Neg logit contributions
Paul-0.017
 York-0.015
abella-0.015
 security-0.015
Grand-0.015
Token the
Token ID262
Feature activation-2.51e-01

Pos logit contributions
 lettuce+0.008
 Freddy+0.007
 beans+0.007
owers+0.007
ados+0.007
Neg logit contributions
Paul-0.005
 York-0.005
abella-0.005
Grand-0.005
 security-0.004
Token street
Token ID4675
Feature activation+1.50e-01

Pos logit contributions
 lettuce+0.009
 shovel+0.009
 beans+0.008
sed+0.008
 mushroom+0.008
Neg logit contributions
Paul-0.011
abella-0.010
ris-0.010
Jo-0.010
 York-0.009
Token.
Token ID13
Feature activation-8.39e-02

Pos logit contributions
Paul+0.008
abella+0.007
Mad+0.007
 York+0.007
Jo+0.007
Neg logit contributions
 shovel-0.004
 spine-0.004
 heads-0.004
 beans-0.003
 cheek-0.003
Token,
Token ID11
Feature activation-9.05e-01

Pos logit contributions
 lettuce+0.014
owers+0.014
 Freddy+0.014
 beans+0.013
ados+0.013
Neg logit contributions
Paul-0.010
 York-0.009
 security-0.009
Grand-0.009
abella-0.008
Token "
Token ID366
Feature activation+2.53e-01

Pos logit contributions
 lettuce+0.040
owers+0.040
 beans+0.039
 Freddy+0.039
ados+0.038
Neg logit contributions
Paul-0.029
 York-0.027
 security-0.027
abella-0.026
Grand-0.026
Tokenwe
Token ID732
Feature activation+1.49e+00

Pos logit contributions
Paul+0.008
 York+0.008
 security+0.008
abella+0.007
Grand+0.007
Neg logit contributions
owers-0.011
 lettuce-0.011
 beans-0.011
 Freddy-0.011
ados-0.011
Token use
Token ID779
Feature activation-4.30e-01

Pos logit contributions
Paul+0.058
 York+0.054
 security+0.053
abella+0.053
Grand+0.053
Neg logit contributions
 lettuce-0.056
owers-0.056
 beans-0.054
 Freddy-0.054
ados-0.053
Token our
Token ID674
Feature activation-1.67e+00

Pos logit contributions
 lettuce+0.020
owers+0.020
 beans+0.019
 Freddy+0.019
ados+0.019
Neg logit contributions
Paul-0.013
 York-0.012
 security-0.012
abella-0.012
Grand-0.012
Token tails
Token ID30514
Feature activation-4.83e+00

Pos logit contributions
owers+0.056
ados+0.054
 Freddy+0.054
mons+0.053
 beans+0.052
Neg logit contributions
 much-0.067
 security-0.063
 York-0.063
 tow-0.063
Paul-0.062
Token to
Token ID284
Feature activation-7.85e-01

Pos logit contributions
 lettuce+0.254
owers+0.253
 Freddy+0.249
ados+0.248
 beans+0.243
Neg logit contributions
 much-0.090
Paul-0.089
 York-0.081
bell-0.081
 security-0.080
Token help
Token ID1037
Feature activation-1.37e+00

Pos logit contributions
owers+0.028
 lettuce+0.027
ados+0.027
 beans+0.026
mons+0.026
Neg logit contributions
Paul-0.031
 much-0.030
 security-0.030
 York-0.030
 tow-0.030
Token us
Token ID514
Feature activation-1.80e+00

Pos logit contributions
owers+0.062
 lettuce+0.062
 beans+0.060
 Freddy+0.059
ados+0.059
Neg logit contributions
Paul-0.043
 York-0.040
Grand-0.040
 security-0.039
 much-0.039
Token balance
Token ID5236
Feature activation-1.40e+00

Pos logit contributions
owers+0.087
sed+0.084
 lettuce+0.084
ados+0.083
mons+0.083
Neg logit contributions
 much-0.047
Paul-0.046
 York-0.045
bell-0.043
Grand-0.043
Token when
Token ID618
Feature activation-6.25e-01

Pos logit contributions
 lettuce+0.064
owers+0.063
 beans+0.063
 Freddy+0.061
ados+0.061
Neg logit contributions
Paul-0.045
 security-0.041
abella-0.040
 York-0.040
Grand-0.040
Token if
Token ID611
Feature activation-5.62e-01

Pos logit contributions
Paul+0.038
 York+0.036
ris+0.034
Daniel+0.033
Jo+0.033
Neg logit contributions
 lettuce-0.033
 beans-0.033
 Freddy-0.033
ados-0.033
 grapes-0.032
Token she
Token ID673
Feature activation-9.61e-01

Pos logit contributions
 beans+0.020
 lettuce+0.020
 Freddy+0.020
owers+0.019
ados+0.019
Neg logit contributions
Paul-0.024
ris-0.023
 York-0.023
bell-0.022
Grand-0.022
Token could
Token ID714
Feature activation-9.36e-01

Pos logit contributions
 lettuce+0.032
 Freddy+0.031
owers+0.030
 beans+0.030
 mushroom+0.029
Neg logit contributions
Paul-0.043
 York-0.042
 airport-0.040
 security-0.039
 elev-0.038
Token have
Token ID423
Feature activation-1.17e+00

Pos logit contributions
 lettuce+0.036
owers+0.036
 beans+0.035
 Freddy+0.035
ados+0.035
Neg logit contributions
Paul-0.035
 York-0.033
 security-0.033
Grand-0.032
abella-0.032
Token a
Token ID257
Feature activation-5.98e-01

Pos logit contributions
 lettuce+0.058
 beans+0.056
 Freddy+0.055
ados+0.054
 grapes+0.054
Neg logit contributions
Paul-0.035
 York-0.031
abella-0.030
bell-0.030
Grand-0.030
Token bite
Token ID13197
Feature activation-4.83e+00

Pos logit contributions
 lettuce+0.022
owers+0.022
 beans+0.021
 Freddy+0.021
ados+0.021
Neg logit contributions
Paul-0.024
 York-0.022
 security-0.022
abella-0.022
Grand-0.022
Token.
Token ID13
Feature activation-3.42e-01

Pos logit contributions
owers+0.252
 lettuce+0.248
sed+0.243
 beans+0.241
ados+0.240
Neg logit contributions
Paul-0.109
 much-0.107
Grand-0.100
 security-0.099
abella-0.097
Token
Token ID198
Feature activation+7.33e-01

Pos logit contributions
 lettuce+0.016
 beans+0.016
owers+0.016
ados+0.015
 Freddy+0.015
Neg logit contributions
Paul-0.011
Grand-0.010
 York-0.009
abella-0.009
 much-0.009
Token
Token ID198
Feature activation+1.65e-01

Pos logit contributions
 York+0.022
 security+0.022
Paul+0.022
 airport+0.022
abella+0.021
Neg logit contributions
owers-0.032
 Freddy-0.032
 lettuce-0.031
 beans-0.030
 squash-0.030
TokenHer
Token ID9360
Feature activation-6.86e-01

Pos logit contributions
Paul+0.006
abella+0.006
 York+0.005
 security+0.005
 airport+0.005
Neg logit contributions
owers-0.007
 lettuce-0.006
 squash-0.006
 beans-0.006
ados-0.006
Token mom
Token ID1995
Feature activation+6.76e-01

Pos logit contributions
owers+0.033
 lettuce+0.033
 beans+0.032
 Freddy+0.031
ados+0.031
Neg logit contributions
Paul-0.020
 security-0.019
Grand-0.019
 York-0.018
 much-0.018
Token "
Token ID366
Feature activation+7.90e-01

Pos logit contributions
 Freddy+0.002
 lettuce+0.002
 beans+0.002
owers+0.002
ados+0.002
Neg logit contributions
Paul-0.001
 York-0.001
ris-0.001
 security-0.001
 airport-0.001
TokenCan
Token ID6090
Feature activation+5.42e-01

Pos logit contributions
Paul+0.028
 York+0.025
 security+0.025
abella+0.025
Grand+0.025
Neg logit contributions
 lettuce-0.033
owers-0.033
 beans-0.032
 Freddy-0.032
ados-0.032
Token I
Token ID314
Feature activation-1.80e-01

Pos logit contributions
Paul+0.024
 York+0.022
 security+0.022
abella+0.022
Grand+0.022
Neg logit contributions
 lettuce-0.018
owers-0.018
 beans-0.017
 Freddy-0.017
ados-0.017
Token have
Token ID423
Feature activation-9.49e-01

Pos logit contributions
owers+0.008
 lettuce+0.008
ados+0.007
 beans+0.007
sed+0.007
Neg logit contributions
Paul-0.006
 much-0.006
 tow-0.006
bell-0.005
Grand-0.005
Token a
Token ID257
Feature activation-5.55e-01

Pos logit contributions
 lettuce+0.047
 beans+0.045
owers+0.045
 Freddy+0.045
ados+0.044
Neg logit contributions
Paul-0.027
 York-0.025
 security-0.024
Grand-0.024
abella-0.024
Token bite
Token ID13197
Feature activation-4.82e+00

Pos logit contributions
owers+0.019
ados+0.017
 beans+0.017
mons+0.016
 lettuce+0.016
Neg logit contributions
 much-0.024
Paul-0.023
 tow-0.023
bell-0.023
 security-0.023
Token?"
Token ID1701
Feature activation+9.60e-01

Pos logit contributions
owers+0.232
 lettuce+0.230
 beans+0.224
sed+0.220
ados+0.220
Neg logit contributions
Paul-0.131
 much-0.125
Grand-0.121
abella-0.120
 security-0.116
Token
Token ID220
Feature activation+2.02e-01

Pos logit contributions
Paul+0.040
abella+0.036
 York+0.036
 security+0.036
Grand+0.036
Neg logit contributions
 lettuce-0.035
owers-0.034
 beans-0.034
ados-0.033
 Freddy-0.032
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
 York+0.007
 security+0.007
Paul+0.007
 airport+0.007
ind+0.006
Neg logit contributions
 Freddy-0.008
owers-0.008
 lettuce-0.008
cr-0.008
ados-0.008
Token
Token ID198
Feature activation+7.17e-01

Pos logit contributions
 airport+0.046
 York+0.046
 security+0.043
Paul+0.042
bell+0.041
Neg logit contributions
 Freddy-0.055
owers-0.053
cr-0.053
 lettuce-0.050
mons-0.049
TokenAnn
Token ID18858
Feature activation+2.90e+00

Pos logit contributions
 airport+0.031
 York+0.030
 security+0.029
bell+0.029
Paul+0.029
Neg logit contributions
 Freddy-0.023
cr-0.023
owers-0.022
 lettuce-0.021
mons-0.021
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
 security+0.010
 York+0.010
Paul+0.010
 airport+0.010
bell+0.010
Neg logit contributions
 lettuce-0.017
 Freddy-0.017
owers-0.016
 beans-0.016
 grapes-0.016
TokenL
Token ID43
Feature activation-4.20e-01

Pos logit contributions
 beans+0.006
 lettuce+0.006
 Freddy+0.006
 grapes+0.006
owers+0.005
Neg logit contributions
 York-0.005
 security-0.005
 airport-0.005
bell-0.005
abella-0.005
Tokenily
Token ID813
Feature activation-7.37e-01

Pos logit contributions
 lettuce+0.017
owers+0.017
 beans+0.016
 Freddy+0.016
ados+0.016
Neg logit contributions
Paul-0.016
abella-0.014
Grand-0.014
 security-0.014
 York-0.014
Token tried
Token ID3088
Feature activation-2.19e+00

Pos logit contributions
owers+0.032
 lettuce+0.032
 beans+0.031
ados+0.030
 Freddy+0.030
Neg logit contributions
Paul-0.025
 York-0.022
Grand-0.022
abella-0.022
 security-0.022
Token a
Token ID257
Feature activation-1.25e+00

Pos logit contributions
 lettuce+0.101
 beans+0.099
 Freddy+0.098
owers+0.097
ados+0.096
Neg logit contributions
Paul-0.069
 York-0.064
 security-0.062
Grand-0.061
abella-0.061
Token bite
Token ID13197
Feature activation-4.82e+00

Pos logit contributions
owers+0.059
ados+0.055
 beans+0.053
 lettuce+0.053
mons+0.053
Neg logit contributions
 much-0.038
Paul-0.037
 security-0.037
 tow-0.035
Grand-0.035
Token and
Token ID290
Feature activation-5.15e-01

Pos logit contributions
owers+0.252
 lettuce+0.250
 beans+0.243
ados+0.241
 Freddy+0.240
Neg logit contributions
Paul-0.115
 much-0.103
 security-0.103
Grand-0.102
abella-0.102
Token said
Token ID531
Feature activation+5.38e-01

Pos logit contributions
owers+0.028
ados+0.026
 lettuce+0.025
 beans+0.025
mons+0.025
Neg logit contributions
 much-0.012
abella-0.012
Paul-0.012
Grand-0.011
 security-0.010
Token,
Token ID11
Feature activation-5.03e-01

Pos logit contributions
Paul+0.013
abella+0.011
 security+0.011
 much+0.011
 York+0.011
Neg logit contributions
owers-0.028
 lettuce-0.028
 beans-0.027
 Freddy-0.027
ados-0.027
Token "
Token ID366
Feature activation+4.54e-01

Pos logit contributions
 lettuce+0.025
owers+0.025
 beans+0.024
 Freddy+0.024
ados+0.024
Neg logit contributions
Paul-0.014
 York-0.012
 security-0.012
Grand-0.012
abella-0.012
TokenE
Token ID36
Feature activation+3.03e-01

Pos logit contributions
Grand+0.013
Paul+0.012
 much+0.011
abella+0.011
One+0.010
Neg logit contributions
 lettuce-0.022
 beans-0.022
sed-0.022
 Freddy-0.022
ados-0.022
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
Paul+0.018
 York+0.017
 security+0.016
abella+0.016
Grand+0.016
Neg logit contributions
owers-0.025
 lettuce-0.025
 beans-0.024
 Freddy-0.024
ados-0.024
Token You
Token ID921
Feature activation-3.07e-01

Pos logit contributions
 lettuce+0.008
owers+0.008
 Freddy+0.008
 beans+0.008
ados+0.007
Neg logit contributions
Paul-0.008
 security-0.008
 York-0.007
abella-0.007
 airport-0.007
Token can
Token ID460
Feature activation-2.53e-01

Pos logit contributions
 lettuce+0.013
owers+0.013
ados+0.013
 beans+0.013
 Freddy+0.012
Neg logit contributions
Paul-0.010
 much-0.010
 York-0.010
 security-0.009
abella-0.009
Token have
Token ID423
Feature activation-8.60e-01

Pos logit contributions
 lettuce+0.010
owers+0.010
 beans+0.010
ados+0.010
 Freddy+0.009
Neg logit contributions
Paul-0.009
 York-0.009
Grand-0.009
 security-0.009
abella-0.009
Token a
Token ID257
Feature activation-9.86e-01

Pos logit contributions
 lettuce+0.043
owers+0.042
 beans+0.041
 Freddy+0.041
ados+0.040
Neg logit contributions
Paul-0.024
 York-0.022
 security-0.022
Grand-0.021
abella-0.021
Token bite
Token ID13197
Feature activation-4.79e+00

Pos logit contributions
 lettuce+0.039
owers+0.038
 beans+0.038
 Freddy+0.037
ados+0.036
Neg logit contributions
Paul-0.037
 York-0.034
 security-0.034
abella-0.034
Grand-0.034
Token if
Token ID611
Feature activation-8.08e-01

Pos logit contributions
owers+0.222
 lettuce+0.222
 beans+0.216
 Freddy+0.214
ados+0.213
Neg logit contributions
Paul-0.142
 York-0.128
Grand-0.127
abella-0.127
 security-0.127
Token you
Token ID345
Feature activation-1.71e+00

Pos logit contributions
owers+0.032
 lettuce+0.032
 beans+0.031
 Freddy+0.030
ados+0.030
Neg logit contributions
Paul-0.030
 security-0.028
 York-0.028
abella-0.027
 much-0.027
Token want
Token ID765
Feature activation-6.47e-01

Pos logit contributions
ados+0.070
 lettuce+0.068
owers+0.065
 beans+0.064
sed+0.064
Neg logit contributions
 much-0.057
bell-0.055
ris-0.054
abella-0.054
Paul-0.052
Token."
Token ID526
Feature activation-5.52e-01

Pos logit contributions
 lettuce+0.032
owers+0.032
 beans+0.031
 Freddy+0.031
ados+0.031
Neg logit contributions
Paul-0.018
 York-0.016
 security-0.016
Grand-0.016
abella-0.016
Token
Token ID198
Feature activation+4.42e-01

Pos logit contributions
 lettuce+0.031
 beans+0.031
owers+0.030
sed+0.030
mons+0.029
Neg logit contributions
Paul-0.012
 much-0.010
Grand-0.010
Jo-0.009
abella-0.009
Token if
Token ID611
Feature activation-3.15e-01

Pos logit contributions
Paul+0.054
 York+0.052
ris+0.050
Jo+0.049
abella+0.048
Neg logit contributions
 beans-0.042
 lettuce-0.042
 Freddy-0.042
ados-0.041
 grapes-0.039
Token she
Token ID673
Feature activation-7.38e-01

Pos logit contributions
 beans+0.011
 Freddy+0.010
 lettuce+0.010
ados+0.009
 squash+0.009
Neg logit contributions
Paul-0.014
 York-0.014
ris-0.014
Jo-0.013
 tow-0.013
Token could
Token ID714
Feature activation-7.26e-01

Pos logit contributions
 lettuce+0.028
 Freddy+0.027
 beans+0.027
owers+0.027
 mushroom+0.026
Neg logit contributions
Paul-0.030
 York-0.028
 security-0.027
 airport-0.026
Grand-0.026
Token have
Token ID423
Feature activation-9.79e-01

Pos logit contributions
owers+0.027
 lettuce+0.027
 beans+0.026
ados+0.025
 Freddy+0.025
Neg logit contributions
Paul-0.029
 York-0.027
 security-0.027
Grand-0.027
abella-0.027
Token a
Token ID257
Feature activation-4.66e-01

Pos logit contributions
 lettuce+0.047
 beans+0.046
 Freddy+0.045
owers+0.044
ados+0.044
Neg logit contributions
Paul-0.031
 York-0.028
 security-0.027
bell-0.026
Grand-0.026
Token bite
Token ID13197
Feature activation-4.79e+00

Pos logit contributions
owers+0.016
 lettuce+0.015
ados+0.015
 beans+0.014
 Freddy+0.014
Neg logit contributions
Paul-0.020
 much-0.019
 security-0.019
 York-0.019
Grand-0.019
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
owers+0.256
 lettuce+0.250
sed+0.248
ados+0.244
 beans+0.239
Neg logit contributions
 much-0.105
Paul-0.099
Grand-0.095
abella-0.095
 security-0.091
Token Her
Token ID2332
Feature activation-4.79e-01

Pos logit contributions
 lettuce+0.004
owers+0.004
 beans+0.004
 Freddy+0.004
ados+0.004
Neg logit contributions
Paul-0.004
 York-0.004
 security-0.004
Grand-0.004
abella-0.004
Token mom
Token ID1995
Feature activation+7.21e-01

Pos logit contributions
owers+0.028
ados+0.027
 lettuce+0.026
mons+0.026
cr+0.026
Neg logit contributions
Grand-0.009
 much-0.009
 security-0.009
Paul-0.008
 tow-0.008
Token said
Token ID531
Feature activation+2.24e-01

Pos logit contributions
Paul+0.030
 York+0.029
 security+0.028
Grand+0.027
abella+0.027
Neg logit contributions
 lettuce-0.025
owers-0.025
 beans-0.024
 Freddy-0.024
ados-0.023
Token �
Token ID6184
Feature activation+9.13e-01

Pos logit contributions
Paul+0.009
 York+0.008
 security+0.008
Grand+0.008
 airport+0.007
Neg logit contributions
 lettuce-0.009
 beans-0.009
 Freddy-0.009
owers-0.009
ados-0.009
Token if
Token ID611
Feature activation-6.19e-01

Pos logit contributions
 Freddy+0.109
 lettuce+0.108
 beans+0.105
 mushroom+0.102
ados+0.101
Neg logit contributions
Paul-0.082
 York-0.075
Grand-0.073
 security-0.072
 airport-0.071
Token she
Token ID673
Feature activation-1.03e+00

Pos logit contributions
 Freddy+0.022
 beans+0.020
 lettuce+0.018
 shovel+0.018
ados+0.017
Neg logit contributions
 York-0.027
ris-0.027
Paul-0.026
ind-0.026
Grand-0.026
Token could
Token ID714
Feature activation-1.00e+00

Pos logit contributions
owers+0.044
 lettuce+0.043
 beans+0.042
ados+0.042
 Freddy+0.041
Neg logit contributions
Paul-0.035
 security-0.032
 York-0.032
abella-0.032
Grand-0.032
Token have
Token ID423
Feature activation-1.15e+00

Pos logit contributions
owers+0.040
ados+0.039
 lettuce+0.039
 beans+0.038
sed+0.037
Neg logit contributions
 much-0.034
Paul-0.034
 tow-0.034
bell-0.033
Grand-0.032
Token a
Token ID257
Feature activation-6.14e-01

Pos logit contributions
 lettuce+0.054
 beans+0.051
 Freddy+0.051
 grapes+0.050
ados+0.050
Neg logit contributions
Paul-0.039
 York-0.035
 security-0.035
Grand-0.034
bell-0.034
Token bite
Token ID13197
Feature activation-4.77e+00

Pos logit contributions
owers+0.016
 lettuce+0.015
ados+0.015
 beans+0.015
 Freddy+0.015
Neg logit contributions
Paul-0.030
 much-0.029
 security-0.029
 York-0.029
Grand-0.029
Token,
Token ID11
Feature activation-6.63e-01

Pos logit contributions
owers+0.268
sed+0.263
 lettuce+0.257
ses+0.252
ados+0.250
Neg logit contributions
 much-0.099
--0.080
Paul-0.075
Grand-0.075
abella-0.073
Token and
Token ID290
Feature activation+9.41e-03

Pos logit contributions
owers+0.025
 lettuce+0.024
 beans+0.023
ados+0.023
 Freddy+0.023
Neg logit contributions
Paul-0.026
Grand-0.024
 York-0.024
 much-0.024
 security-0.024
Token it
Token ID340
Feature activation+7.35e-01

Pos logit contributions
 York+0.000
 security+0.000
Paul+0.000
ind+0.000
ris+0.000
Neg logit contributions
 Freddy-0.000
 beans-0.000
 lettuce-0.000
ados-0.000
 squash-0.000
Token was
Token ID373
Feature activation+1.04e+00

Pos logit contributions
 much+0.015
abella+0.014
 security+0.014
bell+0.014
Grand+0.014
Neg logit contributions
owers-0.041
ados-0.040
 lettuce-0.037
 spine-0.037
mons-0.037
Token very
Token ID845
Feature activation-1.28e-01

Pos logit contributions
 much+0.028
Paul+0.025
Grand+0.024
abella+0.023
 security+0.023
Neg logit contributions
owers-0.055
ados-0.051
mons-0.050
 lettuce-0.050
 beans-0.049
Token the
Token ID262
Feature activation-2.88e-01

Pos logit contributions
owers+0.031
 lettuce+0.030
 beans+0.029
ados+0.028
 Freddy+0.028
Neg logit contributions
Paul-0.013
 security-0.012
Grand-0.011
abella-0.011
 York-0.011
Token arms
Token ID5101
Feature activation-3.59e-01

Pos logit contributions
 lettuce+0.012
owers+0.012
 beans+0.012
 Freddy+0.011
ados+0.011
Neg logit contributions
Paul-0.010
 York-0.009
 security-0.009
abella-0.009
Grand-0.009
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
 lettuce+0.015
owers+0.015
 Freddy+0.015
 beans+0.015
ados+0.015
Neg logit contributions
Paul-0.012
 York-0.012
abella-0.011
 security-0.011
Grand-0.011
Token She
Token ID1375
Feature activation+6.48e-01

Pos logit contributions
Paul+0.000
Grand+0.000
 security+0.000
 York+0.000
 much+0.000
Neg logit contributions
 lettuce-0.001
owers-0.001
 beans-0.001
ados-0.001
 Freddy-0.001
Token dr
Token ID1553
Feature activation+1.29e-01

Pos logit contributions
Paul+0.022
 York+0.020
 security+0.020
abella+0.020
Grand+0.020
Neg logit contributions
owers-0.028
 lettuce-0.028
 beans-0.027
 Freddy-0.026
ados-0.026
Tokenags
Token ID3775
Feature activation-4.75e+00

Pos logit contributions
Paul+0.008
 security+0.008
 York+0.008
abella+0.008
Grand+0.008
Neg logit contributions
owers-0.002
 lettuce-0.002
 beans-0.002
 Freddy-0.002
ados-0.001
Token them
Token ID606
Feature activation-1.50e+00

Pos logit contributions
 beans+0.180
 Freddy+0.177
 lettuce+0.173
 grapes+0.172
ados+0.171
Neg logit contributions
Paul-0.176
 York-0.173
Grand-0.171
Mad-0.167
bell-0.165
Token to
Token ID284
Feature activation-2.43e-01

Pos logit contributions
owers+0.068
 lettuce+0.067
 beans+0.065
 Freddy+0.064
ados+0.064
Neg logit contributions
Paul-0.048
 security-0.045
 much-0.043
 York-0.043
Grand-0.043
Token the
Token ID262
Feature activation-3.08e-01

Pos logit contributions
 lettuce+0.011
 Freddy+0.010
 beans+0.010
owers+0.010
ados+0.010
Neg logit contributions
Paul-0.008
 York-0.007
abella-0.007
Grand-0.007
 security-0.007
Token office
Token ID2607
Feature activation+3.41e+00

Pos logit contributions
 lettuce+0.012
 beans+0.012
owers+0.011
 grapes+0.011
sed+0.011
Neg logit contributions
Paul-0.012
 York-0.011
abella-0.010
Jo-0.010
Grand-0.010
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
Paul+0.150
 York+0.141
abella+0.137
Daniel+0.130
ind+0.128
Neg logit contributions
ados-0.104
 cheek-0.101
 grapes-0.100
 spine-0.100
 shovel-0.099
Token him
Token ID683
Feature activation-1.93e+00

Pos logit contributions
Paul+0.004
 much+0.004
abella+0.004
Grand+0.003
 security+0.003
Neg logit contributions
owers-0.009
sed-0.009
mons-0.009
ses-0.008
 lettuce-0.008
Token to
Token ID284
Feature activation-3.09e-01

Pos logit contributions
 lettuce+0.089
owers+0.088
 beans+0.087
 Freddy+0.086
ados+0.085
Neg logit contributions
Paul-0.060
 York-0.054
Grand-0.053
 security-0.053
abella-0.053
Token turn
Token ID1210
Feature activation+4.77e-01

Pos logit contributions
 lettuce+0.010
 beans+0.009
owers+0.009
 Freddy+0.009
 squash+0.009
Neg logit contributions
Paul-0.014
 York-0.013
abella-0.013
 security-0.013
Grand-0.012
Token the
Token ID262
Feature activation-3.57e-01

Pos logit contributions
Paul+0.019
 York+0.017
abella+0.017
Grand+0.017
Offic+0.017
Neg logit contributions
 beans-0.019
 lettuce-0.018
ados-0.018
 Freddy-0.017
 squash-0.017
Token r
Token ID374
Feature activation-1.50e+00

Pos logit contributions
owers+0.015
 lettuce+0.015
 Freddy+0.014
ados+0.014
 beans+0.014
Neg logit contributions
Paul-0.012
 security-0.012
 York-0.011
 much-0.011
Grand-0.011
Tokenags
Token ID3775
Feature activation-4.75e+00

Pos logit contributions
 lettuce+0.067
 Freddy+0.066
 mushroom+0.063
 beans+0.063
owers+0.061
Neg logit contributions
Paul-0.047
 York-0.045
 office-0.045
 airport-0.044
abella-0.044
Token she
Token ID673
Feature activation+1.01e+00

Pos logit contributions
 beans+0.169
owers+0.168
 Freddy+0.167
 lettuce+0.166
ados+0.164
Neg logit contributions
Paul-0.186
 York-0.180
Grand-0.178
abella-0.176
Mad-0.173
Token kept
Token ID4030
Feature activation+7.28e-01

Pos logit contributions
Paul+0.044
 York+0.040
Daniel+0.040
 security+0.040
abella+0.039
Neg logit contributions
owers-0.033
sed-0.032
 beans-0.031
mons-0.031
 lettuce-0.030
Token in
Token ID287
Feature activation-1.48e+00

Pos logit contributions
Paul+0.029
abella+0.028
 York+0.028
Grand+0.027
ris+0.027
Neg logit contributions
 beans-0.025
ados-0.025
 Freddy-0.025
 squash-0.025
 lettuce-0.024
Token the
Token ID262
Feature activation+1.90e-01

Pos logit contributions
 beans+0.060
 lettuce+0.058
 Freddy+0.056
ados+0.055
 shovel+0.055
Neg logit contributions
abella-0.053
Paul-0.053
 York-0.052
bell-0.050
Grand-0.047
Token closet
Token ID21615
Feature activation+2.54e+00

Pos logit contributions
Paul+0.007
 York+0.006
 security+0.006
Grand+0.006
abella+0.006
Neg logit contributions
 lettuce-0.008
owers-0.008
 beans-0.008
 Freddy-0.008
ados-0.007
Token mom
Token ID1995
Feature activation+4.57e-01

Pos logit contributions
Paul+0.015
 York+0.014
abella+0.014
 security+0.014
Grand+0.014
Neg logit contributions
 lettuce-0.019
owers-0.019
 beans-0.019
 Freddy-0.018
ados-0.018
Token bought
Token ID5839
Feature activation-4.54e-01

Pos logit contributions
 outer+0.021
 York+0.021
Paul+0.021
 France+0.020
 Africa+0.020
Neg logit contributions
 Freddy-0.012
 beans-0.012
 lettuce-0.012
 shovel-0.011
 squash-0.011
Token some
Token ID617
Feature activation-1.02e+00

Pos logit contributions
 beans+0.018
 lettuce+0.017
 grapes+0.017
 Freddy+0.017
ados+0.016
Neg logit contributions
Paul-0.019
 York-0.018
ris-0.017
bell-0.017
 tow-0.017
Token new
Token ID649
Feature activation+2.56e-01

Pos logit contributions
owers+0.035
 lettuce+0.034
 Freddy+0.033
ados+0.033
 beans+0.032
Neg logit contributions
Paul-0.042
 much-0.040
 security-0.040
 York-0.040
Grand-0.040
Token r
Token ID374
Feature activation-2.02e+00

Pos logit contributions
Paul+0.011
 York+0.010
 security+0.010
abella+0.010
Grand+0.010
Neg logit contributions
 lettuce-0.008
owers-0.008
 beans-0.008
 Freddy-0.008
ados-0.008
Tokenags
Token ID3775
Feature activation-4.71e+00

Pos logit contributions
 lettuce+0.077
 Freddy+0.074
 mushroom+0.073
 beans+0.070
sed+0.066
Neg logit contributions
Paul-0.081
abella-0.075
Grand-0.075
 York-0.074
 office-0.073
Token.
Token ID13
Feature activation-2.26e-01

Pos logit contributions
owers+0.195
 beans+0.188
 Freddy+0.187
 squash+0.183
mons+0.183
Neg logit contributions
Paul-0.161
 York-0.155
 security-0.153
Grand-0.152
abella-0.148
Token When
Token ID1649
Feature activation-7.79e-01

Pos logit contributions
 beans+0.007
owers+0.007
 lettuce+0.007
ados+0.007
 Freddy+0.007
Neg logit contributions
ris-0.010
Paul-0.010
 security-0.010
ep-0.009
abella-0.009
Token they
Token ID484
Feature activation-1.38e+00

Pos logit contributions
 beans+0.026
 Freddy+0.021
owers+0.021
 shovel+0.020
 lettuce+0.020
Neg logit contributions
ris-0.035
old-0.034
ind-0.033
oc-0.033
Paul-0.033
Token got
Token ID1392
Feature activation-1.18e+00

Pos logit contributions
 Freddy+0.050
 lettuce+0.049
 beans+0.049
owers+0.047
 shovel+0.045
Neg logit contributions
Paul-0.056
 York-0.054
Grand-0.050
 security-0.050
ind-0.049
Token back
Token ID736
Feature activation-5.71e-01

Pos logit contributions
 beans+0.042
 Freddy+0.041
 lettuce+0.040
 shovel+0.039
 squash+0.038
Neg logit contributions
Paul-0.050
 York-0.049
Offic-0.048
Grand-0.048
Jo-0.047
Token day
Token ID1110
Feature activation-1.30e-01

Pos logit contributions
 lettuce+0.017
owers+0.017
ados+0.016
 beans+0.016
 Freddy+0.016
Neg logit contributions
Paul-0.011
 much-0.010
 security-0.010
Grand-0.010
 airport-0.009
Token,
Token ID11
Feature activation-6.09e-01

Pos logit contributions
owers+0.005
 Freddy+0.005
 beans+0.004
mons+0.004
 lettuce+0.004
Neg logit contributions
Paul-0.005
Grand-0.005
ind-0.005
Offic-0.005
abella-0.005
Token the
Token ID262
Feature activation+7.51e-02

Pos logit contributions
 Freddy+0.022
 shovel+0.021
cr+0.020
owers+0.020
 berries+0.020
Neg logit contributions
bell-0.023
Grand-0.023
ind-0.022
 York-0.022
abella-0.021
Token mighty
Token ID18680
Feature activation+1.36e+00

Pos logit contributions
Paul+0.003
 York+0.003
 security+0.003
Grand+0.003
abella+0.003
Neg logit contributions
 lettuce-0.003
owers-0.003
 beans-0.003
 Freddy-0.003
ados-0.003
Token k
Token ID479
Feature activation-3.91e-01

Pos logit contributions
Paul+0.060
abella+0.054
 security+0.054
bell+0.053
 York+0.053
Neg logit contributions
 lettuce-0.049
 beans-0.048
owers-0.047
 grapes-0.045
 Freddy-0.045
Tokenang
Token ID648
Feature activation+2.94e+00

Pos logit contributions
owers+0.018
 lettuce+0.018
 beans+0.017
ados+0.017
 Freddy+0.017
Neg logit contributions
Paul-0.012
 York-0.011
 security-0.011
abella-0.011
bell-0.011
Tokenaroo
Token ID38049
Feature activation-4.07e-01

Pos logit contributions
Paul+0.138
 York+0.130
 security+0.129
Grand+0.128
abella+0.128
Neg logit contributions
owers-0.088
 lettuce-0.088
 beans-0.085
 Freddy-0.083
ados-0.082
Token and
Token ID290
Feature activation-2.52e-01

Pos logit contributions
 lettuce+0.018
owers+0.017
 beans+0.017
 Freddy+0.017
ados+0.017
Neg logit contributions
Paul-0.014
 York-0.013
 security-0.012
Grand-0.012
abella-0.012
Token his
Token ID465
Feature activation+6.76e-01

Pos logit contributions
 lettuce+0.010
 Freddy+0.010
ados+0.010
 beans+0.010
 mushroom+0.010
Neg logit contributions
Paul-0.009
 York-0.008
bell-0.008
Grand-0.008
 airport-0.008
Token friends
Token ID2460
Feature activation-1.13e+00

Pos logit contributions
Paul+0.026
 security+0.025
 much+0.024
 York+0.024
abella+0.024
Neg logit contributions
owers-0.025
 lettuce-0.025
ados-0.024
 Freddy-0.024
 beans-0.024
Token went
Token ID1816
Feature activation+8.83e-02

Pos logit contributions
 lettuce+0.043
owers+0.043
 beans+0.042
 Freddy+0.042
ados+0.041
Neg logit contributions
Paul-0.043
 York-0.040
 security-0.039
Grand-0.039
abella-0.039
Token He
Token ID679
Feature activation-1.91e-01

Pos logit contributions
owers+0.026
 lettuce+0.025
 beans+0.025
ados+0.025
mons+0.024
Neg logit contributions
Paul-0.015
 much-0.013
abella-0.013
Grand-0.013
 York-0.013
Token felt
Token ID2936
Feature activation+3.12e-01

Pos logit contributions
cr+0.006
 Freddy+0.006
 lettuce+0.006
sed+0.006
 shovel+0.005
Neg logit contributions
Grand-0.008
Paul-0.008
 York-0.008
 elev-0.008
 office-0.007
Token so
Token ID523
Feature activation-1.48e+00

Pos logit contributions
 York+0.015
Grand+0.014
bell+0.014
Paul+0.014
Offic+0.014
Neg logit contributions
 Freddy-0.010
 lettuce-0.009
 beans-0.008
cr-0.008
're-0.008
Token happy
Token ID3772
Feature activation+6.39e-02

Pos logit contributions
 Freddy+0.053
're+0.049
 lettuce+0.048
owers+0.048
cr+0.047
Neg logit contributions
 York-0.060
Paul-0.058
abella-0.057
Grand-0.057
 airport-0.056
Token,
Token ID11
Feature activation-4.14e-01

Pos logit contributions
Paul+0.002
 York+0.002
Grand+0.002
abella+0.002
bell+0.002
Neg logit contributions
 lettuce-0.003
 Freddy-0.003
owers-0.003
 beans-0.003
ados-0.003
Token faster
Token ID5443
Feature activation+3.35e+00

Pos logit contributions
 Freddy+0.017
 lettuce+0.017
 beans+0.016
 squash+0.015
ados+0.015
Neg logit contributions
Paul-0.015
 York-0.015
abella-0.014
Grand-0.014
bell-0.014
Token and
Token ID290
Feature activation-1.43e-02

Pos logit contributions
Paul+0.119
Grand+0.119
bell+0.115
 York+0.114
abella+0.110
Neg logit contributions
 Freddy-0.130
owers-0.128
 beans-0.127
ados-0.127
 lettuce-0.126
Token faster
Token ID5443
Feature activation+3.58e+00

Pos logit contributions
 Freddy+0.001
 lettuce+0.001
cr+0.000
 beans+0.000
 Fred+0.000
Neg logit contributions
 York-0.001
ind-0.001
Paul-0.001
Grand-0.001
bell-0.001
Token he
Token ID339
Feature activation+3.93e-01

Pos logit contributions
Grand+0.129
Paul+0.128
 York+0.123
bell+0.123
abella+0.119
Neg logit contributions
 Freddy-0.140
owers-0.137
 lettuce-0.136
 beans-0.136
ados-0.136
Token went
Token ID1816
Feature activation-2.99e-02

Pos logit contributions
Paul+0.017
Grand+0.017
 York+0.016
 operating+0.016
 tow+0.016
Neg logit contributions
cr-0.012
 Freddy-0.012
sed-0.012
 lettuce-0.011
owers-0.011
Token until
Token ID1566
Feature activation-1.40e-01

Pos logit contributions
 Freddy+0.001
 beans+0.001
 lettuce+0.001
 shovel+0.001
cr+0.001
Neg logit contributions
Grand-0.001
Paul-0.001
abella-0.001
 York-0.001
Offic-0.001
TokenThe
Token ID464
Feature activation-2.90e-01

Pos logit contributions
 North+0.014
 York+0.014
abella+0.014
 airport+0.014
 security+0.013
Neg logit contributions
 squash-0.019
 Freddy-0.018
owers-0.018
 grapes-0.018
 beans-0.018
Token girl
Token ID2576
Feature activation+5.31e-01

Pos logit contributions
 beans+0.010
 lettuce+0.010
owers+0.010
 squash+0.009
 Freddy+0.009
Neg logit contributions
Paul-0.013
abella-0.012
 York-0.011
Jo-0.011
Grand-0.011
Token was
Token ID373
Feature activation+5.40e-01

Pos logit contributions
Paul+0.019
 York+0.017
abella+0.017
 security+0.017
Grand+0.017
Neg logit contributions
 lettuce-0.022
owers-0.022
 beans-0.021
 Freddy-0.021
sed-0.021
Token very
Token ID845
Feature activation-5.85e-01

Pos logit contributions
Paul+0.020
 York+0.019
Grand+0.018
abella+0.018
ris+0.018
Neg logit contributions
 Freddy-0.021
 lettuce-0.021
sed-0.021
 beans-0.020
 squash-0.019
Token curious
Token ID11040
Feature activation+1.19e+00

Pos logit contributions
 lettuce+0.025
owers+0.025
 Freddy+0.024
 beans+0.024
ados+0.024
Neg logit contributions
Paul-0.020
 York-0.019
abella-0.018
Grand-0.018
 security-0.018
Token about
Token ID546
Feature activation+2.80e+00

Pos logit contributions
Paul+0.040
 York+0.036
abella+0.035
Grand+0.035
 tow+0.034
Neg logit contributions
 beans-0.054
 lettuce-0.053
 Freddy-0.052
owers-0.051
ados-0.051
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
Paul+0.082
abella+0.075
Grand+0.074
 York+0.072
bell+0.071
Neg logit contributions
 lettuce-0.137
 beans-0.135
 Freddy-0.133
owers-0.130
ados-0.130
Token wreck
Token ID15228
Feature activation-1.01e+00

Pos logit contributions
Paul+0.009
abella+0.009
Grand+0.008
Jo+0.008
 York+0.008
Neg logit contributions
 lettuce-0.009
 beans-0.009
 squash-0.008
owers-0.008
 shovel-0.008
Token,
Token ID11
Feature activation-2.12e-02

Pos logit contributions
owers+0.045
 lettuce+0.045
 beans+0.045
 Freddy+0.043
ados+0.043
Neg logit contributions
Paul-0.034
abella-0.031
Grand-0.031
 York-0.030
Daniel-0.029
Token but
Token ID475
Feature activation+3.93e-01

Pos logit contributions
 lettuce+0.001
 beans+0.001
 Freddy+0.001
owers+0.001
ados+0.001
Neg logit contributions
Paul-0.001
 York-0.001
abella-0.001
Grand-0.001
 security-0.001
Token she
Token ID673
Feature activation+4.57e-01

Pos logit contributions
Paul+0.017
 York+0.016
ris+0.016
 tow+0.015
Daniel+0.015
Neg logit contributions
 Freddy-0.014
 beans-0.013
 lettuce-0.013
owers-0.013
 mushroom-0.013
Token�
Token ID129
Feature activation+1.51e+00

Pos logit contributions
Paul+0.058
 York+0.053
 security+0.053
Grand+0.052
abella+0.052
Neg logit contributions
 lettuce-0.076
owers-0.076
 beans-0.074
 Freddy-0.073
ados-0.072
Token�
Token ID241
Feature activation+1.86e+00

Pos logit contributions
 security+0.059
Paul+0.058
 tow+0.057
 York+0.056
bell+0.055
Neg logit contributions
 Freddy-0.053
owers-0.053
ados-0.051
 lettuce-0.049
sed-0.049
TokenThat
Token ID2504
Feature activation+4.78e-01

Pos logit contributions
 York+0.065
 security+0.064
Paul+0.064
 airport+0.060
bell+0.060
Neg logit contributions
owers-0.076
 Freddy-0.073
 lettuce-0.072
mons-0.070
 beans-0.070
Tokenâ
Token ID22940
Feature activation+7.39e-02

Pos logit contributions
Paul+0.016
 security+0.014
Grand+0.014
 York+0.014
abella+0.014
Neg logit contributions
 beans-0.021
 lettuce-0.021
owers-0.021
 Freddy-0.020
 squash-0.020
Token€
Token ID26391
Feature activation+2.07e+00

Pos logit contributions
Paul+0.003
 York+0.003
 security+0.003
abella+0.003
Grand+0.003
Neg logit contributions
 lettuce-0.003
owers-0.003
 beans-0.003
 Freddy-0.003
ados-0.003
Tokenâ„¢
Token ID8151
Feature activation+3.26e+00

Pos logit contributions
 security+0.077
Paul+0.073
 York+0.072
 airport+0.071
bell+0.069
Neg logit contributions
owers-0.084
 Freddy-0.079
mons-0.076
ados-0.074
cr-0.074
Tokens
Token ID82
Feature activation-4.87e-02

Pos logit contributions
Paul+0.138
 York+0.135
 security+0.134
Grand+0.130
 airport+0.127
Neg logit contributions
owers-0.115
 Freddy-0.108
 lettuce-0.105
mons-0.103
 beans-0.102
Token the
Token ID262
Feature activation+2.27e-01

Pos logit contributions
owers+0.002
 lettuce+0.002
 beans+0.002
 Freddy+0.002
ados+0.002
Neg logit contributions
Paul-0.002
 security-0.002
Grand-0.002
 York-0.002
abella-0.002
Token gas
Token ID3623
Feature activation+1.21e+00

Pos logit contributions
Paul+0.008
 York+0.008
 security+0.007
abella+0.007
Grand+0.007
Neg logit contributions
 lettuce-0.009
owers-0.009
 beans-0.009
 Freddy-0.009
ados-0.009
Token from
Token ID422
Feature activation-4.02e-01

Pos logit contributions
Paul+0.042
Grand+0.038
abella+0.037
 York+0.036
 security+0.036
Neg logit contributions
owers-0.052
 lettuce-0.052
 beans-0.051
 Freddy-0.049
ados-0.049
Token the
Token ID262
Feature activation+3.69e-01

Pos logit contributions
 lettuce+0.017
 Freddy+0.017
 beans+0.017
ados+0.017
 squash+0.017
Neg logit contributions
Paul-0.013
 York-0.012
bell-0.012
Grand-0.012
Jo-0.012
Token was
Token ID373
Feature activation+2.87e-01

Pos logit contributions
 beans+0.030
owers+0.030
 lettuce+0.030
 Freddy+0.030
ados+0.029
Neg logit contributions
Paul-0.021
 York-0.019
Grand-0.019
Daniel-0.018
 security-0.018
Token too
Token ID1165
Feature activation-6.67e-01

Pos logit contributions
Paul+0.009
 York+0.009
 security+0.008
abella+0.008
Grand+0.008
Neg logit contributions
 lettuce-0.013
owers-0.013
 beans-0.012
 Freddy-0.012
ados-0.012
Token expensive
Token ID5789
Feature activation+7.54e-01

Pos logit contributions
 beans+0.030
 lettuce+0.028
 squash+0.028
 Freddy+0.028
 mushroom+0.027
Neg logit contributions
Paul-0.025
 York-0.023
abella-0.022
Jo-0.021
 tow-0.020
Token and
Token ID290
Feature activation-1.41e-01

Pos logit contributions
Paul+0.025
Grand+0.023
 York+0.022
abella+0.022
 security+0.021
Neg logit contributions
 lettuce-0.034
 beans-0.034
owers-0.033
 Freddy-0.033
ados-0.033
Token not
Token ID407
Feature activation+8.31e-01

Pos logit contributions
 Freddy+0.006
 lettuce+0.006
 beans+0.006
ados+0.005
 berries+0.005
Neg logit contributions
Paul-0.005
 York-0.005
Grand-0.005
Daniel-0.005
 security-0.005
Token meant
Token ID4001
Feature activation+3.10e+00

Pos logit contributions
Paul+0.027
 York+0.025
 security+0.025
abella+0.024
Grand+0.024
Neg logit contributions
 lettuce-0.037
owers-0.037
 beans-0.036
 Freddy-0.035
ados-0.035
Token for
Token ID329
Feature activation+7.93e-02

Pos logit contributions
Paul+0.109
Grand+0.100
 York+0.100
abella+0.096
 security+0.094
Neg logit contributions
 lettuce-0.134
 beans-0.131
 Freddy-0.130
ados-0.127
owers-0.127
Token him
Token ID683
Feature activation-1.95e+00

Pos logit contributions
Paul+0.003
Grand+0.003
 York+0.003
bell+0.003
abella+0.003
Neg logit contributions
 lettuce-0.003
 beans-0.003
 Freddy-0.003
owers-0.003
ados-0.003
Token.
Token ID13
Feature activation-3.45e-01

Pos logit contributions
 lettuce+0.084
 beans+0.082
owers+0.082
 Freddy+0.082
ados+0.079
Neg logit contributions
Paul-0.068
 York-0.063
Grand-0.062
abella-0.060
 security-0.060
Token
Token ID198
Feature activation+5.79e-01

Pos logit contributions
 lettuce+0.014
owers+0.014
 beans+0.013
 Freddy+0.013
ados+0.013
Neg logit contributions
Paul-0.013
 York-0.012
 security-0.012
abella-0.012
Grand-0.012
Token
Token ID198
Feature activation+4.59e-02

Pos logit contributions
Paul+0.017
 York+0.016
 security+0.016
 airport+0.016
bell+0.016
Neg logit contributions
 Freddy-0.027
owers-0.026
 lettuce-0.026
 beans-0.025
cr-0.025
Token great
Token ID1049
Feature activation+5.21e-02

Pos logit contributions
Paul+0.004
abella+0.004
 York+0.004
Jo+0.004
Grand+0.004
Neg logit contributions
 beans-0.005
 lettuce-0.005
 squash-0.005
 mushroom-0.005
 Freddy-0.005
Token idea
Token ID2126
Feature activation-9.46e-01

Pos logit contributions
Paul+0.003
Grand+0.003
 York+0.003
abella+0.003
 security+0.003
Neg logit contributions
 Freddy-0.001
 beans-0.001
 squash-0.001
 mushroom-0.001
 lettuce-0.001
Token and
Token ID290
Feature activation-1.21e-01

Pos logit contributions
owers+0.042
 lettuce+0.042
 Freddy+0.042
 beans+0.042
ados+0.041
Neg logit contributions
Paul-0.030
 York-0.028
abella-0.027
Grand-0.027
 security-0.026
Token they
Token ID484
Feature activation-9.73e-02

Pos logit contributions
 Freddy+0.004
 lettuce+0.004
 beans+0.004
 shovel+0.004
 squash+0.004
Neg logit contributions
Paul-0.005
 York-0.005
Grand-0.005
ind-0.005
 elev-0.005
Token both
Token ID1111
Feature activation-2.52e+00

Pos logit contributions
 beans+0.003
 lettuce+0.003
 Freddy+0.003
owers+0.003
cr+0.003
Neg logit contributions
Paul-0.004
 York-0.004
Grand-0.004
 security-0.004
 elev-0.004
Token marched
Token ID23558
Feature activation+6.82e-01

Pos logit contributions
 shovel+0.075
 Freddy+0.075
cr+0.074
're+0.071
 beans+0.071
Neg logit contributions
Paul-0.112
 York-0.111
Grand-0.106
 security-0.104
 elev-0.104
Token around
Token ID1088
Feature activation+1.31e+00

Pos logit contributions
Grand+0.030
Offic+0.029
Paul+0.029
 York+0.029
abella+0.028
Neg logit contributions
 Freddy-0.021
 beans-0.020
 shovel-0.020
cr-0.019
 lettuce-0.018
Token in
Token ID287
Feature activation-8.04e-01

Pos logit contributions
Grand+0.052
Paul+0.052
bell+0.050
abella+0.049
 York+0.047
Neg logit contributions
 beans-0.050
 Freddy-0.050
 squash-0.046
 lettuce-0.044
 shovel-0.044
Token the
Token ID262
Feature activation+3.69e-01

Pos logit contributions
 shovel+0.024
 beans+0.023
 Freddy+0.022
 squash+0.021
 hands+0.020
Neg logit contributions
abella-0.038
Grand-0.036
Paul-0.035
ind-0.034
Offic-0.034
Token rain
Token ID6290
Feature activation-5.56e-01

Pos logit contributions
Paul+0.018
abella+0.018
Grand+0.017
Jo+0.016
Daniel+0.016
Neg logit contributions
 beans-0.012
 lettuce-0.011
 squash-0.011
 Freddy-0.011
 shovel-0.010
Token,
Token ID11
Feature activation+5.35e-02

Pos logit contributions
owers+0.022
 beans+0.021
cr+0.021
 Freddy+0.020
mons+0.020
Neg logit contributions
Paul-0.021
Grand-0.020
abella-0.019
 security-0.019
 York-0.019
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
Paul+0.075
Grand+0.067
 York+0.066
abella+0.066
Mad+0.065
Neg logit contributions
 lettuce-0.088
 beans-0.087
owers-0.087
ados-0.084
 grapes-0.084
Token 5
Token ID642
Feature activation+2.24e+00

Pos logit contributions
 lettuce+0.045
 grapes+0.042
 beans+0.041
 shovel+0.041
ados+0.041
Neg logit contributions
Paul-0.039
abella-0.037
 York-0.036
 security-0.035
Mad-0.035
Token rocks
Token ID12586
Feature activation+1.14e+00

Pos logit contributions
Paul+0.081
 York+0.074
Grand+0.072
abella+0.072
Mad+0.071
Neg logit contributions
 lettuce-0.097
 beans-0.094
owers-0.093
 grapes-0.091
 Freddy-0.091
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
Paul+0.034
 York+0.031
Grand+0.031
abella+0.031
 security+0.030
Neg logit contributions
owers-0.054
 lettuce-0.054
 beans-0.052
 Freddy-0.052
ados-0.052
Token He
Token ID679
Feature activation-4.53e-01

Pos logit contributions
 lettuce+0.027
owers+0.027
 Freddy+0.026
 beans+0.026
ados+0.025
Neg logit contributions
Paul-0.020
 security-0.018
 York-0.018
Grand-0.018
abella-0.018
Token was
Token ID373
Feature activation+2.60e-01

Pos logit contributions
 lettuce+0.017
 Freddy+0.015
cr+0.015
owers+0.015
 beans+0.015
Neg logit contributions
Paul-0.018
 York-0.018
Mad-0.017
 security-0.017
Grand-0.016
Token very
Token ID845
Feature activation-7.23e-01

Pos logit contributions
 York+0.010
Paul+0.010
bell+0.009
ind+0.009
Grand+0.009
Neg logit contributions
 lettuce-0.009
 Freddy-0.009
 grapes-0.009
 beans-0.009
 mushroom-0.009
Token proud
Token ID6613
Feature activation+2.39e-01

Pos logit contributions
 lettuce+0.029
 Freddy+0.029
 shovel+0.027
 squash+0.027
 beans+0.027
Neg logit contributions
 York-0.028
Paul-0.026
 airport-0.026
abella-0.023
Mad-0.023
Token of
Token ID286
Feature activation-7.02e-01

Pos logit contributions
Paul+0.006
 York+0.006
 security+0.005
abella+0.005
Grand+0.005
Neg logit contributions
 lettuce-0.012
owers-0.012
 beans-0.012
 Freddy-0.012
ados-0.012
Token what
Token ID644
Feature activation+1.81e+00

Pos logit contributions
 Freddy+0.028
ados+0.026
 lettuce+0.026
 beans+0.025
 mushroom+0.025
Neg logit contributions
Grand-0.027
 York-0.027
bell-0.026
Paul-0.026
 airport-0.025
Token he
Token ID339
Feature activation-7.05e-01

Pos logit contributions
Paul+0.059
 York+0.056
Grand+0.054
bell+0.054
abella+0.053
Neg logit contributions
 lettuce-0.081
 beans-0.079
 Freddy-0.079
owers-0.077
ados-0.076
Token do
Token ID466
Feature activation-3.74e-02

Pos logit contributions
 lettuce+0.005
 beans+0.004
ados+0.004
owers+0.004
 grapes+0.004
Neg logit contributions
 security-0.003
 much-0.003
Paul-0.003
 tow-0.003
Daniel-0.003
Token not
Token ID407
Feature activation-4.71e-02

Pos logit contributions
 Freddy+0.001
 beans+0.001
 squash+0.001
 lettuce+0.001
sed+0.001
Neg logit contributions
Paul-0.002
 York-0.002
Grand-0.002
Jo-0.002
bell-0.002
Token stop
Token ID2245
Feature activation-1.54e-01

Pos logit contributions
owers+0.002
 lettuce+0.002
 beans+0.002
ados+0.002
 Freddy+0.002
Neg logit contributions
Paul-0.002
 security-0.002
Grand-0.002
 much-0.002
 York-0.002
Token.
Token ID13
Feature activation+1.14e-01

Pos logit contributions
 lettuce+0.007
owers+0.007
 Freddy+0.007
 beans+0.007
ados+0.007
Neg logit contributions
Paul-0.005
abella-0.004
 York-0.004
Grand-0.004
 security-0.004
Token They
Token ID1119
Feature activation-1.23e-01

Pos logit contributions
Paul+0.004
 York+0.004
 security+0.004
Grand+0.004
abella+0.004
Neg logit contributions
 lettuce-0.004
owers-0.004
 beans-0.004
 Freddy-0.004
ados-0.004
Token do
Token ID466
Feature activation+4.09e-02

Pos logit contributions
 lettuce+0.004
owers+0.004
 beans+0.004
ados+0.004
 Freddy+0.004
Neg logit contributions
Paul-0.005
 security-0.005
 York-0.005
abella-0.005
Grand-0.005
Token not
Token ID407
Feature activation+3.04e-01

Pos logit contributions
Paul+0.002
 York+0.002
Jo+0.002
Grand+0.002
 tow+0.002
Neg logit contributions
 Freddy-0.001
 beans-0.001
 lettuce-0.001
 squash-0.001
sed-0.001
Token listen
Token ID6004
Feature activation+9.06e-01

Pos logit contributions
Paul+0.013
 York+0.012
abella+0.012
 security+0.012
Grand+0.011
Neg logit contributions
 lettuce-0.011
 Freddy-0.010
owers-0.010
 beans-0.010
ados-0.010
Token to
Token ID284
Feature activation+2.01e-01

Pos logit contributions
 York+0.045
Paul+0.044
 airport+0.042
Offic+0.041
 tow+0.041
Neg logit contributions
 Freddy-0.023
 beans-0.022
cr-0.022
 cheek-0.020
 cheeks-0.020
Token him
Token ID683
Feature activation-1.42e+00

Pos logit contributions
Paul+0.007
Grand+0.006
 York+0.006
abella+0.006
bell+0.006
Neg logit contributions
 Freddy-0.008
 lettuce-0.008
owers-0.008
 beans-0.008
ados-0.008
Token.
Token ID13
Feature activation-8.34e-02

Pos logit contributions
 Freddy+0.055
 lettuce+0.054
owers+0.054
 beans+0.054
ados+0.054
Neg logit contributions
Paul-0.054
 York-0.049
Grand-0.049
abella-0.048
Daniel-0.046
Token to
Token ID284
Feature activation+7.11e-01

Pos logit contributions
Paul+0.037
abella+0.035
Grand+0.034
 security+0.034
 York+0.034
Neg logit contributions
 lettuce-0.041
 beans-0.040
owers-0.039
 Freddy-0.039
ados-0.037
Token camp
Token ID1413
Feature activation+9.65e-01

Pos logit contributions
Paul+0.026
abella+0.024
bell+0.024
Grand+0.023
ris+0.022
Neg logit contributions
 lettuce-0.030
 beans-0.029
 Freddy-0.029
 squash-0.027
 mushroom-0.027
Token with
Token ID351
Feature activation-1.58e-01

Pos logit contributions
Paul+0.039
 York+0.035
abella+0.035
 tow+0.035
Grand+0.034
Neg logit contributions
owers-0.034
 beans-0.034
 Freddy-0.033
cr-0.033
 lettuce-0.033
Token his
Token ID465
Feature activation+1.19e-01

Pos logit contributions
 lettuce+0.007
 beans+0.006
 Freddy+0.006
 salad+0.006
ados+0.006
Neg logit contributions
Paul-0.006
ris-0.006
abella-0.006
bell-0.005
 York-0.005
Token friends
Token ID2460
Feature activation-1.75e+00

Pos logit contributions
Paul+0.005
 York+0.004
abella+0.004
Grand+0.004
 security+0.004
Neg logit contributions
 lettuce-0.005
owers-0.005
 beans-0.005
 Freddy-0.004
ados-0.004
Token.
Token ID13
Feature activation+2.67e-01

Pos logit contributions
 lettuce+0.079
owers+0.079
 beans+0.078
 Freddy+0.077
ados+0.075
Neg logit contributions
Paul-0.055
 York-0.050
 security-0.050
abella-0.049
Grand-0.049
Token He
Token ID679
Feature activation+2.50e-03

Pos logit contributions
Paul+0.010
 security+0.009
 York+0.009
abella+0.009
Grand+0.009
Neg logit contributions
 lettuce-0.011
owers-0.011
 beans-0.010
 Freddy-0.010
ados-0.010
Token could
Token ID714
Feature activation-4.87e-01

Pos logit contributions
Paul+0.000
 York+0.000
 elev+0.000
 security+0.000
 tow+0.000
Neg logit contributions
 lettuce-0.000
 beans-0.000
 Freddy-0.000
sed-0.000
 shovel-0.000
Token hardly
Token ID8941
Feature activation+3.41e-01

Pos logit contributions
 lettuce+0.020
owers+0.020
 beans+0.019
 Freddy+0.019
ados+0.018
Neg logit contributions
Paul-0.018
 York-0.016
 security-0.016
abella-0.016
Grand-0.016
Token guess
Token ID4724
Feature activation+1.05e-01

Pos logit contributions
Paul+0.015
bell+0.014
 security+0.014
 much+0.014
Grand+0.014
Neg logit contributions
owers-0.011
 lettuce-0.011
ados-0.011
 beans-0.011
 grapes-0.010
Token what
Token ID644
Feature activation+1.55e+00

Pos logit contributions
Paul+0.004
Grand+0.004
Offic+0.004
Mad+0.004
 York+0.004
Neg logit contributions
 lettuce-0.004
 beans-0.004
 Freddy-0.004
 shovel-0.004
 berries-0.004
Token"
Token ID1
Feature activation-3.00e-02

Pos logit contributions
 York+0.016
bell+0.015
 airport+0.015
Paul+0.015
abella+0.015
Neg logit contributions
mons-0.023
owers-0.023
 grapes-0.023
 lettuce-0.022
 squash-0.022
TokenBut
Token ID1537
Feature activation+1.28e+00

Pos logit contributions
owers+0.001
mons+0.001
 beans+0.001
 lettuce+0.001
 grapes+0.001
Neg logit contributions
Paul-0.001
 York-0.001
bell-0.001
 airport-0.001
 security-0.001
Token I
Token ID314
Feature activation-1.52e-02

Pos logit contributions
Paul+0.050
Grand+0.046
 security+0.046
 York+0.046
abella+0.045
Neg logit contributions
 lettuce-0.049
owers-0.048
 beans-0.048
 Freddy-0.047
ados-0.047
Token want
Token ID765
Feature activation-2.24e-01

Pos logit contributions
 Freddy+0.000
're+0.000
sed+0.000
cr+0.000
ses+0.000
Neg logit contributions
Paul-0.001
 York-0.001
 security-0.001
 airport-0.001
 elev-0.001
Token to
Token ID284
Feature activation-5.11e-01

Pos logit contributions
owers+0.011
 lettuce+0.010
 beans+0.010
ados+0.010
 Freddy+0.010
Neg logit contributions
Paul-0.007
 security-0.006
 York-0.006
Grand-0.006
 much-0.006
Token know
Token ID760
Feature activation+5.57e-01

Pos logit contributions
 lettuce+0.019
owers+0.018
 Freddy+0.018
 beans+0.018
ados+0.017
Neg logit contributions
Paul-0.021
abella-0.019
 York-0.019
Grand-0.019
 security-0.019
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
Paul+0.020
Daniel+0.019
Jo+0.018
abella+0.018
 York+0.018
Neg logit contributions
 lettuce-0.022
 berries-0.022
 beans-0.021
 grapes-0.021
 Freddy-0.021
Token Maybe
Token ID6674
Feature activation+1.53e+00

Pos logit contributions
Paul+0.001
 security+0.001
abella+0.001
ris+0.001
Daniel+0.001
Neg logit contributions
 lettuce-0.001
owers-0.001
 Freddy-0.001
ados-0.001
sed-0.001
Token it
Token ID340
Feature activation-1.62e-01

Pos logit contributions
Paul+0.064
ris+0.057
Jo+0.057
abella+0.056
Daniel+0.056
Neg logit contributions
 lettuce-0.059
 beans-0.058
 Freddy-0.056
 squash-0.056
 shovel-0.056
Token's
Token ID338
Feature activation+7.34e-01

Pos logit contributions
owers+0.007
 lettuce+0.007
 beans+0.007
 Freddy+0.007
ados+0.007
Neg logit contributions
Paul-0.005
Grand-0.005
abella-0.005
 York-0.005
 security-0.004
Token a
Token ID257
Feature activation+4.87e-01

Pos logit contributions
Paul+0.027
Jo+0.024
 York+0.024
Grand+0.024
abella+0.024
Neg logit contributions
 lettuce-0.031
 beans-0.030
 Freddy-0.030
 grapes-0.029
 mushroom-0.029
Token first
Token ID717
Feature activation+5.88e-01

Pos logit contributions
 lettuce+0.051
 beans+0.051
 Freddy+0.049
owers+0.048
 grapes+0.046
Neg logit contributions
Paul-0.055
 York-0.049
 security-0.049
Jo-0.048
Grand-0.048
Token before
Token ID878
Feature activation+7.68e-02

Pos logit contributions
Paul+0.023
 security+0.020
 York+0.020
abella+0.020
Grand+0.019
Neg logit contributions
 beans-0.024
 lettuce-0.023
owers-0.023
 Freddy-0.022
 grapes-0.022
Token you
Token ID345
Feature activation-1.52e+00

Pos logit contributions
Paul+0.003
 York+0.003
 security+0.003
abella+0.003
Grand+0.003
Neg logit contributions
 lettuce-0.003
owers-0.003
 beans-0.002
 Freddy-0.002
ados-0.002
Token use
Token ID779
Feature activation-8.72e-01

Pos logit contributions
 lettuce+0.060
ados+0.058
owers+0.058
 beans+0.056
 Freddy+0.056
Neg logit contributions
Paul-0.055
 York-0.053
bell-0.053
 much-0.053
 security-0.052
Token my
Token ID616
Feature activation-1.21e+00

Pos logit contributions
 beans+0.038
 lettuce+0.038
ados+0.035
 grapes+0.035
 Freddy+0.035
Neg logit contributions
Paul-0.032
Grand-0.028
Jo-0.028
 security-0.028
abella-0.028
Token things
Token ID1243
Feature activation-5.17e-01

Pos logit contributions
 lettuce+0.042
owers+0.042
 beans+0.041
 Freddy+0.040
ados+0.040
Neg logit contributions
Paul-0.051
 York-0.047
 security-0.047
abella-0.047
Grand-0.046
Token,
Token ID11
Feature activation-6.06e-01

Pos logit contributions
 lettuce+0.023
owers+0.023
 beans+0.023
 Freddy+0.022
ados+0.022
Neg logit contributions
Paul-0.017
 York-0.015
abella-0.015
 security-0.015
Grand-0.015
Token okay
Token ID8788
Feature activation-5.93e-01

Pos logit contributions
 lettuce+0.023
 beans+0.022
owers+0.022
 Freddy+0.022
ados+0.021
Neg logit contributions
Paul-0.025
 York-0.023
 security-0.022
abella-0.022
Grand-0.022
Token?
Token ID30
Feature activation+6.83e-01

Pos logit contributions
 lettuce+0.026
owers+0.026
 beans+0.025
 Freddy+0.025
ados+0.025
Neg logit contributions
Paul-0.020
 York-0.018
 security-0.018
abella-0.018
Grand-0.018
Token And
Token ID843
Feature activation-5.22e-01

Pos logit contributions
Paul+0.028
 security+0.027
abella+0.026
 airport+0.026
 York+0.026
Neg logit contributions
 lettuce-0.024
owers-0.024
 Freddy-0.023
 beans-0.023
ados-0.023
Token now
Token ID783
Feature activation+2.23e+00

Pos logit contributions
 lettuce+0.018
 beans+0.017
owers+0.017
 Freddy+0.017
ados+0.017
Neg logit contributions
Paul-0.023
 York-0.021
 security-0.021
abella-0.021
Grand-0.020
Token upcoming
Token ID7865
Feature activation+1.28e+00

Pos logit contributions
owers+0.018
 lettuce+0.018
 beans+0.018
ados+0.017
 Freddy+0.017
Neg logit contributions
Paul-0.016
 security-0.015
 York-0.015
Grand-0.015
abella-0.015
Token festival
Token ID10876
Feature activation+4.78e-01

Pos logit contributions
Paul+0.048
Grand+0.045
 York+0.045
abella+0.044
 security+0.044
Neg logit contributions
owers-0.049
 lettuce-0.049
 beans-0.049
 Freddy-0.048
 grapes-0.046
Token.
Token ID13
Feature activation-3.57e-01

Pos logit contributions
Paul+0.016
 York+0.015
Grand+0.015
abella+0.015
 security+0.014
Neg logit contributions
owers-0.020
 Freddy-0.020
 lettuce-0.020
 beans-0.020
cr-0.019
Token Everyone
Token ID11075
Feature activation-1.03e-01

Pos logit contributions
 Freddy+0.016
 lettuce+0.015
owers+0.015
 beans+0.015
ados+0.014
Neg logit contributions
 security-0.012
Paul-0.011
 York-0.011
Grand-0.011
 airport-0.010
Token was
Token ID373
Feature activation+9.65e-01

Pos logit contributions
 Freddy+0.004
owers+0.004
 lettuce+0.004
 beans+0.004
 squash+0.004
Neg logit contributions
Paul-0.004
Grand-0.004
 York-0.003
 tow-0.003
 security-0.003
Token excited
Token ID6568
Feature activation-9.46e-02

Pos logit contributions
 York+0.043
Paul+0.040
Offic+0.039
Grand+0.039
bell+0.039
Neg logit contributions
 Freddy-0.029
cr-0.025
melon-0.025
owers-0.024
 lettuce-0.024
Token but
Token ID475
Feature activation+4.77e-01

Pos logit contributions
 lettuce+0.004
owers+0.004
 beans+0.004
 Freddy+0.004
ados+0.004
Neg logit contributions
Paul-0.003
 York-0.003
 security-0.003
abella-0.003
Grand-0.003
Token Billy
Token ID15890
Feature activation-1.19e+00

Pos logit contributions
 York+0.017
Paul+0.017
Grand+0.016
 airport+0.015
 security+0.015
Neg logit contributions
 Freddy-0.020
 lettuce-0.018
 beans-0.018
 squash-0.017
cr-0.017
Token was
Token ID373
Feature activation+2.79e-01

Pos logit contributions
 Freddy+0.046
owers+0.045
 lettuce+0.045
 beans+0.044
sed+0.042
Neg logit contributions
Paul-0.045
 York-0.043
Grand-0.042
 security-0.042
 tow-0.041
Token a
Token ID257
Feature activation-1.40e-01

Pos logit contributions
 York+0.013
oc+0.012
Paul+0.012
Offic+0.011
Grand+0.011
Neg logit contributions
 Freddy-0.009
 Freddie-0.008
 beans-0.008
 squash-0.008
 lettuce-0.008
Token bit
Token ID1643
Feature activation-3.30e+00

Pos logit contributions
 beans+0.004
 squash+0.004
 lettuce+0.004
 Freddy+0.004
 mushroom+0.004
Neg logit contributions
Paul-0.007
 York-0.007
abella-0.006
Grand-0.006
 elev-0.006
Token her
Token ID607
Feature activation+7.24e-01

Pos logit contributions
owers+0.073
 lettuce+0.071
 beans+0.068
ados+0.068
 Freddy+0.068
Neg logit contributions
Paul-0.037
 security-0.034
 York-0.033
Grand-0.033
 much-0.033
Token mom
Token ID1995
Feature activation+1.41e-01

Pos logit contributions
 airport+0.020
 office+0.019
 security+0.019
Paul+0.019
 York+0.019
Neg logit contributions
 Freddy-0.034
 lettuce-0.034
owers-0.033
ados-0.033
cr-0.031
Tokenmy
Token ID1820
Feature activation-9.40e-01

Pos logit contributions
Paul+0.004
 York+0.003
 security+0.003
Grand+0.003
abella+0.003
Neg logit contributions
 lettuce-0.007
owers-0.007
 beans-0.007
 Freddy-0.007
ados-0.007
Token's
Token ID338
Feature activation+2.16e-01

Pos logit contributions
owers+0.048
 lettuce+0.048
 beans+0.046
 Freddy+0.046
ados+0.046
Neg logit contributions
Paul-0.024
 York-0.022
Grand-0.021
 security-0.021
abella-0.021
Token hand
Token ID1021
Feature activation-1.83e+00

Pos logit contributions
Paul+0.007
 security+0.006
 York+0.006
Grand+0.006
 airport+0.006
Neg logit contributions
owers-0.010
 lettuce-0.009
 Freddy-0.009
 beans-0.009
ados-0.009
Token.
Token ID13
Feature activation-2.95e-01

Pos logit contributions
 lettuce+0.088
owers+0.087
 beans+0.086
 Freddy+0.084
ados+0.084
Neg logit contributions
Paul-0.054
 York-0.048
 security-0.047
abella-0.047
Grand-0.046
Token She
Token ID1375
Feature activation+1.37e-01

Pos logit contributions
owers+0.012
 lettuce+0.012
 beans+0.011
 Freddy+0.011
ados+0.011
Neg logit contributions
Paul-0.011
abella-0.010
 security-0.010
 York-0.010
Grand-0.010
Token was
Token ID373
Feature activation+3.31e-01

Pos logit contributions
Paul+0.005
 York+0.005
 airport+0.005
 security+0.005
abella+0.005
Neg logit contributions
 lettuce-0.005
 Freddy-0.005
 beans-0.005
 mushroom-0.005
owers-0.005
Token curious
Token ID11040
Feature activation+1.09e+00

Pos logit contributions
Paul+0.012
 York+0.011
abella+0.011
Grand+0.010
bell+0.010
Neg logit contributions
 lettuce-0.013
 Freddy-0.013
 beans-0.013
owers-0.013
 squash-0.013
Token to
Token ID284
Feature activation+9.50e-03

Pos logit contributions
Paul+0.039
abella+0.035
 York+0.035
Grand+0.034
 tow+0.033
Neg logit contributions
 beans-0.047
 lettuce-0.046
 Freddy-0.045
ados-0.045
 mushroom-0.044
Token know
Token ID760
Feature activation+7.75e-01

Pos logit contributions
Paul+0.000
 York+0.000
 security+0.000
abella+0.000
Grand+0.000
Neg logit contributions
 lettuce-0.000
owers-0.000
 Freddy-0.000
 beans-0.000
ados-0.000
Token for
Token ID329
Feature activation-2.31e-01

Pos logit contributions
 lettuce+0.055
 beans+0.053
 Freddy+0.053
owers+0.052
ados+0.052
Neg logit contributions
Paul-0.043
 York-0.039
Grand-0.039
abella-0.038
 security-0.037
Token helping
Token ID5742
Feature activation-9.18e-01

Pos logit contributions
 lettuce+0.009
owers+0.009
 beans+0.008
 Freddy+0.008
ados+0.008
Neg logit contributions
Paul-0.009
 York-0.008
 security-0.008
Grand-0.008
abella-0.008
Token their
Token ID511
Feature activation-1.59e+00

Pos logit contributions
 lettuce+0.039
owers+0.039
 beans+0.038
 Freddy+0.037
ados+0.037
Neg logit contributions
Paul-0.032
 York-0.029
 security-0.029
abella-0.029
Grand-0.029
Token little
Token ID1310
Feature activation-3.51e+00

Pos logit contributions
 lettuce+0.064
owers+0.064
 beans+0.062
 Freddy+0.061
ados+0.060
Neg logit contributions
Paul-0.059
 York-0.055
 security-0.054
abella-0.054
Grand-0.054
Token one
Token ID530
Feature activation+2.15e-01

Pos logit contributions
 lettuce+0.131
owers+0.131
 beans+0.127
 Freddy+0.125
ados+0.124
Neg logit contributions
Paul-0.138
 York-0.128
 security-0.128
Grand-0.126
abella-0.126
Token.
Token ID13
Feature activation-5.84e-01

Pos logit contributions
Paul+0.005
 security+0.005
 much+0.005
 York+0.005
 tow+0.005
Neg logit contributions
owers-0.011
 lettuce-0.011
 beans-0.011
ados-0.011
mons-0.011
Token From
Token ID3574
Feature activation-8.81e-01

Pos logit contributions
 lettuce+0.024
 Freddy+0.024
owers+0.024
 beans+0.023
ados+0.023
Neg logit contributions
Paul-0.021
 security-0.019
 York-0.019
abella-0.019
Grand-0.019
Token that
Token ID326
Feature activation+7.58e-01

Pos logit contributions
 Freddy+0.042
 lettuce+0.041
 beans+0.040
owers+0.039
 squash+0.039
Neg logit contributions
Paul-0.026
 York-0.024
bell-0.024
Jo-0.024
abella-0.024
Token day
Token ID1110
Feature activation+2.75e-01

Pos logit contributions
Paul+0.030
 York+0.029
 security+0.028
abella+0.028
bell+0.027
Neg logit contributions
 lettuce-0.028
owers-0.028
 Freddy-0.027
 beans-0.027
ados-0.026
Token on
Token ID319
Feature activation+6.74e-01

Pos logit contributions
Paul+0.009
 York+0.008
 security+0.008
abella+0.008
Grand+0.008
Neg logit contributions
 lettuce-0.012
owers-0.012
 beans-0.012
 Freddy-0.012
ados-0.012
Token,
Token ID11
Feature activation-2.17e-01

Pos logit contributions
Paul+0.018
 York+0.016
 security+0.016
abella+0.016
Grand+0.016
Neg logit contributions
 lettuce-0.034
owers-0.034
 beans-0.033
 Freddy-0.033
ados-0.033
Token while
Token ID981
Feature activation-1.58e+00

Pos logit contributions
owers+0.027
ses+0.025
sed+0.025
ados+0.025
mons+0.024
Neg logit contributions
 much-0.014
 it-0.012
Paul-0.012
 soon-0.012
Grand-0.011
Token exploring
Token ID13504
Feature activation+8.23e-01

Pos logit contributions
owers+0.072
 lettuce+0.072
 beans+0.069
ados+0.069
 Freddy+0.067
Neg logit contributions
Paul-0.050
 much-0.044
 security-0.044
Grand-0.044
 York-0.044
Token,
Token ID11
Feature activation+2.66e-01

Pos logit contributions
Paul+0.026
Grand+0.023
abella+0.023
 York+0.022
bell+0.022
Neg logit contributions
 lettuce-0.039
 beans-0.038
 Freddy-0.038
owers-0.038
ados-0.037
Token Jack
Token ID3619
Feature activation-7.72e-01

Pos logit contributions
 much+0.009
Paul+0.009
abella+0.008
 security+0.008
Grand+0.008
Neg logit contributions
owers-0.012
sed-0.011
ados-0.011
 lettuce-0.011
ses-0.010
Token found
Token ID1043
Feature activation-8.52e-01

Pos logit contributions
 lettuce+0.033
 beans+0.032
owers+0.031
 Freddy+0.031
 mushroom+0.030
Neg logit contributions
Paul-0.028
 York-0.025
 security-0.025
 tow-0.025
abella-0.024
Token a
Token ID257
Feature activation-9.00e-02

Pos logit contributions
 lettuce+0.043
owers+0.042
 beans+0.042
 Freddy+0.041
ados+0.041
Neg logit contributions
Paul-0.023
 York-0.021
 security-0.021
Grand-0.020
abella-0.020
Token little
Token ID1310
Feature activation-3.17e+00

Pos logit contributions
 lettuce+0.004
 beans+0.004
owers+0.004
 Freddy+0.004
 mushroom+0.004
Neg logit contributions
Paul-0.003
 York-0.003
abella-0.003
Grand-0.003
 security-0.002
Token bird
Token ID6512
Feature activation-1.81e+00

Pos logit contributions
 lettuce+0.104
owers+0.102
 beans+0.100
 Freddy+0.098
ados+0.097
Neg logit contributions
Paul-0.140
 York-0.131
abella-0.130
 security-0.130
Grand-0.129
Token who
Token ID508
Feature activation+2.80e-01

Pos logit contributions
 lettuce+0.083
owers+0.082
 beans+0.081
 Freddy+0.080
ados+0.079
Neg logit contributions
Paul-0.057
 York-0.051
abella-0.051
 security-0.051
Grand-0.050
Token was
Token ID373
Feature activation+3.79e-01

Pos logit contributions
Paul+0.009
 York+0.009
abella+0.009
 airport+0.009
 security+0.008
Neg logit contributions
 Freddy-0.012
 lettuce-0.012
owers-0.012
 mushroom-0.011
 beans-0.011
Token built
Token ID3170
Feature activation-1.12e+00

Pos logit contributions
Paul+0.013
 York+0.012
abella+0.012
Grand+0.012
 security+0.012
Neg logit contributions
 lettuce-0.016
owers-0.016
 beans-0.016
 Freddy-0.016
ados-0.015
Token "
Token ID366
Feature activation+4.03e-01

Pos logit contributions
 lettuce+0.036
owers+0.036
 beans+0.035
 Freddy+0.035
ados+0.034
Neg logit contributions
Paul-0.021
 York-0.019
 security-0.018
abella-0.018
Grand-0.018
TokenI
Token ID40
Feature activation-8.29e-02

Pos logit contributions
Paul+0.013
 York+0.012
 security+0.012
Grand+0.012
abella+0.012
Neg logit contributions
 lettuce-0.018
owers-0.018
 beans-0.018
 Freddy-0.017
ados-0.017
Token saw
Token ID2497
Feature activation-5.14e-01

Pos logit contributions
 Freddy+0.003
 lettuce+0.003
owers+0.003
 beans+0.003
 mushroom+0.003
Neg logit contributions
Paul-0.004
 York-0.003
 security-0.003
Grand-0.003
abella-0.003
Token a
Token ID257
Feature activation-2.23e-01

Pos logit contributions
owers+0.027
 lettuce+0.027
 beans+0.026
 Freddy+0.026
ados+0.025
Neg logit contributions
Paul-0.012
 much-0.011
 security-0.011
 York-0.011
abella-0.010
Token big
Token ID1263
Feature activation-2.83e-01

Pos logit contributions
owers+0.008
 lettuce+0.007
ados+0.007
 beans+0.007
 Freddy+0.007
Neg logit contributions
Paul-0.009
 security-0.009
 much-0.009
Grand-0.009
 York-0.009
Token rad
Token ID2511
Feature activation-4.16e+00

Pos logit contributions
owers+0.009
mons+0.008
 lettuce+0.008
 beans+0.008
ados+0.008
Neg logit contributions
Paul-0.012
 security-0.012
 much-0.012
 tow-0.012
 York-0.012
Tokenish
Token ID680
Feature activation-2.73e+00

Pos logit contributions
owers+0.095
sed+0.094
 Freddy+0.093
ses+0.091
 lettuce+0.089
Neg logit contributions
--0.197
ind-0.195
Paul-0.192
 York-0.189
 security-0.186
Token in
Token ID287
Feature activation-1.86e+00

Pos logit contributions
owers+0.134
 lettuce+0.128
sed+0.127
 Freddy+0.125
ados+0.124
Neg logit contributions
 much-0.065
 security-0.062
Paul-0.061
 airport-0.061
--0.060
Token the
Token ID262
Feature activation-4.20e-02

Pos logit contributions
owers+0.099
 lettuce+0.098
 beans+0.095
 Freddy+0.094
ados+0.094
Neg logit contributions
Paul-0.044
 York-0.039
 security-0.039
Grand-0.038
abella-0.038
Token garden
Token ID11376
Feature activation+1.34e-01

Pos logit contributions
owers+0.002
mons+0.002
sed+0.002
ados+0.002
 Freddy+0.002
Neg logit contributions
 airport-0.001
 much-0.001
Paul-0.001
 security-0.001
 York-0.001
Token today
Token ID1909
Feature activation+1.03e+00

Pos logit contributions
Paul+0.004
 York+0.004
abella+0.004
 security+0.004
Grand+0.004
Neg logit contributions
 lettuce-0.006
owers-0.006
 beans-0.006
 Freddy-0.006
ados-0.006
Token old
Token ID1468
Feature activation+2.84e-01

Pos logit contributions
 lettuce+0.049
owers+0.048
 beans+0.048
 Freddy+0.047
ados+0.046
Neg logit contributions
Paul-0.039
 York-0.036
Grand-0.035
 security-0.035
abella-0.035
Token,
Token ID11
Feature activation-4.64e-01

Pos logit contributions
Paul+0.008
 security+0.008
 York+0.007
Grand+0.007
abella+0.007
Neg logit contributions
owers-0.013
 lettuce-0.013
 beans-0.013
 Freddy-0.013
ados-0.013
Token y
Token ID331
Feature activation-2.29e+00

Pos logit contributions
 lettuce+0.020
 beans+0.020
owers+0.020
 Freddy+0.019
ados+0.019
Neg logit contributions
Paul-0.016
 York-0.014
abella-0.014
Grand-0.014
 security-0.014
Tokenucky
Token ID5309
Feature activation-2.12e+00

Pos logit contributions
sed+0.073
owers+0.072
 lettuce+0.071
ses+0.069
 Freddy+0.067
Neg logit contributions
 much-0.095
Paul-0.094
 security-0.091
 York-0.091
Grand-0.089
Token cr
Token ID1067
Feature activation-2.06e+00

Pos logit contributions
owers+0.109
 lettuce+0.107
 beans+0.104
 Freddy+0.104
ados+0.103
Neg logit contributions
Paul-0.052
 security-0.048
 much-0.047
 York-0.046
abella-0.045
Tokenumbs
Token ID18146
Feature activation-3.62e+00

Pos logit contributions
 lettuce+0.063
 beans+0.063
 Freddy+0.061
owers+0.061
sed+0.059
Neg logit contributions
Paul-0.095
Grand-0.090
 much-0.087
bell-0.087
 tow-0.086
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
owers+0.197
ados+0.187
 lettuce+0.187
sed+0.184
mons+0.183
Neg logit contributions
 much-0.079
 security-0.073
Paul-0.070
abella-0.066
 tow-0.064
Token But
Token ID887
Feature activation+9.39e-01

Pos logit contributions
owers+0.013
 lettuce+0.013
 beans+0.013
ados+0.013
 Freddy+0.012
Neg logit contributions
Paul-0.008
 York-0.007
 much-0.007
 security-0.007
abella-0.007
Token the
Token ID262
Feature activation-2.86e-01

Pos logit contributions
Paul+0.033
 York+0.030
Grand+0.029
abella+0.029
Daniel+0.029
Neg logit contributions
 Freddy-0.042
 lettuce-0.041
 beans-0.039
ados-0.038
owers-0.038
Token mouse
Token ID10211
Feature activation-6.01e-01

Pos logit contributions
owers+0.012
ados+0.011
mons+0.011
cr+0.011
 Freddy+0.011
Neg logit contributions
 much-0.010
 security-0.010
Grand-0.009
 tow-0.009
 airport-0.009
Token did
Token ID750
Feature activation-6.74e-01

Pos logit contributions
owers+0.031
 lettuce+0.031
 beans+0.030
ados+0.030
 Freddy+0.029
Neg logit contributions
Paul-0.015
 security-0.014
abella-0.014
 York-0.013
 much-0.013
Token was
Token ID373
Feature activation+5.57e-01

Pos logit contributions
Paul+0.002
 York+0.002
 security+0.002
Grand+0.002
abella+0.002
Neg logit contributions
 lettuce-0.003
owers-0.003
 beans-0.003
 Freddy-0.003
ados-0.003
Token so
Token ID523
Feature activation-1.63e+00

Pos logit contributions
 York+0.024
Paul+0.023
 airport+0.021
abella+0.021
bell+0.021
Neg logit contributions
 Freddy-0.020
 lettuce-0.019
cr-0.018
sed-0.018
 beans-0.017
Token happy
Token ID3772
Feature activation+1.95e-01

Pos logit contributions
 lettuce+0.076
 Freddy+0.076
owers+0.074
 beans+0.073
cr+0.071
Neg logit contributions
Paul-0.050
 York-0.049
abella-0.046
 airport-0.045
 security-0.044
Token and
Token ID290
Feature activation+1.91e-01

Pos logit contributions
Paul+0.005
 security+0.004
 York+0.004
abella+0.004
 much+0.004
Neg logit contributions
owers-0.010
 lettuce-0.010
 beans-0.010
ados-0.010
 Freddy-0.010
Token w
Token ID266
Feature activation-4.64e-01

Pos logit contributions
Paul+0.006
 security+0.006
 much+0.006
abella+0.005
Grand+0.005
Neg logit contributions
owers-0.009
 lettuce-0.009
 beans-0.008
ados-0.008
mons-0.008
Tokenagged
Token ID14655
Feature activation-3.25e+00

Pos logit contributions
 lettuce+0.013
 beans+0.012
 mushroom+0.012
owers+0.012
ados+0.011
Neg logit contributions
Paul-0.023
 much-0.022
abella-0.022
 security-0.021
ris-0.021
Token his
Token ID465
Feature activation+1.89e+00

Pos logit contributions
 lettuce+0.158
 Freddy+0.155
 beans+0.153
owers+0.152
ados+0.152
Neg logit contributions
Paul-0.094
 York-0.086
Grand-0.084
abella-0.082
 security-0.082
Token tail
Token ID7894
Feature activation-2.81e-01

Pos logit contributions
 much+0.080
 tow+0.077
 security+0.074
Paul+0.073
-+0.073
Neg logit contributions
owers-0.072
 beans-0.066
mons-0.065
ados-0.064
 lettuce-0.063
Token.
Token ID13
Feature activation-9.88e-02

Pos logit contributions
owers+0.014
 lettuce+0.014
 beans+0.014
ados+0.013
 Freddy+0.013
Neg logit contributions
Paul-0.007
 security-0.007
 much-0.007
 York-0.007
Grand-0.006
Token He
Token ID679
Feature activation-1.15e-01

Pos logit contributions
owers+0.005
 beans+0.005
 lettuce+0.005
cr+0.005
ados+0.005
Neg logit contributions
Paul-0.002
 much-0.002
abella-0.002
 security-0.002
Grand-0.002
Token and
Token ID290
Feature activation-2.67e-01

Pos logit contributions
 lettuce+0.005
owers+0.005
 beans+0.005
 Freddy+0.005
ados+0.005
Neg logit contributions
Paul-0.004
 York-0.004
 security-0.004
abella-0.004
Grand-0.004
Token mom
Token ID1995
Feature activation+5.83e-02

Pos logit contributions
 lettuce+0.005
 beans+0.005
 Freddy+0.005
 mushroom+0.005
 shovel+0.005
Neg logit contributions
Paul-0.006
 York-0.006
Grand-0.005
abella-0.005
Jo-0.005
Token to
Token ID284
Feature activation-7.28e-01

Pos logit contributions
Paul+0.002
 York+0.002
 security+0.002
Jo+0.002
Mad+0.002
Neg logit contributions
 beans-0.002
 lettuce-0.002
 Freddy-0.002
 grapes-0.002
 mushroom-0.002
Token get
Token ID651
Feature activation-5.14e-03

Pos logit contributions
owers+0.027
 lettuce+0.027
 beans+0.026
ados+0.025
 Freddy+0.025
Neg logit contributions
Paul-0.029
 security-0.027
 York-0.027
Grand-0.027
abella-0.027
Token her
Token ID607
Feature activation+3.30e-01

Pos logit contributions
 beans+0.000
 lettuce+0.000
 Freddy+0.000
 squash+0.000
 grapes+0.000
Neg logit contributions
Paul-0.000
 York-0.000
Grand-0.000
Jo-0.000
 security-0.000
Token some
Token ID617
Feature activation-5.73e-01

Pos logit contributions
Paul+0.010
 security+0.009
 much+0.009
 York+0.009
bell+0.009
Neg logit contributions
owers-0.015
 lettuce-0.015
 Freddy-0.015
ados-0.014
 beans-0.014
Token clay
Token ID21558
Feature activation-3.60e+00

Pos logit contributions
owers+0.021
sed+0.020
 Freddy+0.020
ados+0.020
cr+0.020
Neg logit contributions
 much-0.022
 security-0.021
 tow-0.020
 airport-0.020
Paul-0.020
Token.
Token ID13
Feature activation-1.87e-02

Pos logit contributions
owers+0.208
 lettuce+0.208
sed+0.207
 Freddy+0.203
ses+0.199
Neg logit contributions
 security-0.042
 much-0.042
--0.041
 office-0.041
bell-0.040
Token Mom
Token ID11254
Feature activation+3.97e-01

Pos logit contributions
 lettuce+0.001
owers+0.001
 beans+0.001
 Freddy+0.001
ados+0.001
Neg logit contributions
Paul-0.001
 York-0.001
 security-0.001
Grand-0.001
abella-0.001
Token said
Token ID531
Feature activation+3.82e-01

Pos logit contributions
Paul+0.017
 York+0.017
 security+0.016
 much+0.016
 airport+0.016
Neg logit contributions
 lettuce-0.013
 Freddy-0.013
 beans-0.013
owers-0.012
 squash-0.012
Token yes
Token ID3763
Feature activation-4.42e-01

Pos logit contributions
Paul+0.014
 York+0.014
 security+0.013
ris+0.013
Grand+0.013
Neg logit contributions
 Freddy-0.016
 lettuce-0.016
 beans-0.016
owers-0.015
ados-0.015
Token and
Token ID290
Feature activation-4.79e-01

Pos logit contributions
 Freddy+0.017
owers+0.016
 beans+0.016
 lettuce+0.016
ados+0.015
Neg logit contributions
Paul-0.017
 York-0.016
abella-0.015
 tow-0.015
 security-0.015
Token,
Token ID11
Feature activation-7.15e-01

Pos logit contributions
 lettuce+0.022
owers+0.022
 beans+0.022
 Freddy+0.022
ados+0.021
Neg logit contributions
Paul-0.014
Grand-0.013
abella-0.013
 York-0.012
 security-0.012
Token the
Token ID262
Feature activation+2.25e-01

Pos logit contributions
owers+0.039
 lettuce+0.039
 beans+0.038
ados+0.038
 Freddy+0.038
Neg logit contributions
Paul-0.015
 security-0.014
 York-0.013
 much-0.013
Grand-0.013
Token whale
Token ID22206
Feature activation-1.05e+00

Pos logit contributions
 airport+0.008
 security+0.008
Paul+0.007
Grand+0.007
 much+0.007
Neg logit contributions
ados-0.009
owers-0.009
 Freddy-0.009
 lettuce-0.009
mons-0.009
Token saw
Token ID2497
Feature activation+3.78e-02

Pos logit contributions
owers+0.051
 lettuce+0.050
ados+0.049
 beans+0.049
 Freddy+0.048
Neg logit contributions
Paul-0.029
 security-0.027
 York-0.027
Grand-0.027
bell-0.026
Token a
Token ID257
Feature activation+1.31e-01

Pos logit contributions
Paul+0.001
 security+0.001
 much+0.001
 York+0.001
abella+0.001
Neg logit contributions
owers-0.002
 lettuce-0.002
 beans-0.002
ados-0.002
 Freddy-0.002
Token little
Token ID1310
Feature activation-2.90e+00

Pos logit contributions
Paul+0.004
 York+0.004
abella+0.003
 security+0.003
Grand+0.003
Neg logit contributions
 lettuce-0.006
owers-0.006
 beans-0.006
 Freddy-0.006
ados-0.006
Token fish
Token ID5916
Feature activation-2.13e+00

Pos logit contributions
owers+0.103
 lettuce+0.098
ados+0.095
 Freddy+0.094
 beans+0.093
Neg logit contributions
Paul-0.117
 security-0.115
 York-0.114
 much-0.111
 airport-0.110
Token swimming
Token ID14899
Feature activation-1.14e+00

Pos logit contributions
owers+0.103
 lettuce+0.101
ados+0.101
 beans+0.098
mons+0.096
Neg logit contributions
Paul-0.056
 much-0.055
 security-0.055
 airport-0.053
 York-0.052
Token nearby
Token ID6716
Feature activation-7.31e-01

Pos logit contributions
owers+0.058
ados+0.056
 lettuce+0.056
mons+0.053
 beans+0.053
Neg logit contributions
 much-0.029
Paul-0.027
--0.027
 security-0.027
bell-0.025
Token.
Token ID13
Feature activation-1.30e-03

Pos logit contributions
 lettuce+0.034
owers+0.034
 beans+0.033
 Freddy+0.033
ados+0.032
Neg logit contributions
Paul-0.023
abella-0.020
 York-0.020
Grand-0.020
 security-0.020
Token The
Token ID383
Feature activation-5.84e-01

Pos logit contributions
owers+0.000
 lettuce+0.000
 beans+0.000
ados+0.000
mons+0.000
Neg logit contributions
Paul-0.000
 York-0.000
Grand-0.000
 much-0.000
abella-0.000