TOP ACTIVATIONS
MAX = 16.785

not available yet ," Grand ad said as he wiped the
frog ! T ad smiled and carefully picked up
never pin ched the to ad 's toe again , and
words like " ab rac ad ab ra " and "
and say " ab rac ad ab ra " all the
, " A br ac ad ab ra !" Suddenly ,
, " A br ac ad ab ra ! I want
, " A br ac ad ab ra , I turn
It was his grand ad . He said to Jamie
, " A br ac ad ab ra !" Suddenly ,
assignment . Ch ad always works hard at school
big tree . Bl ossom tried to climb up
say " A br ac ad ab ra !" and everyone
. " A br ac ad ab ra !"
them on and gave Grand ad a playful punch . He
the tree . Bl ossom touched the pet als
. " A br ac ad ab ra !"
slept in her bow - ad orned crib . Every night
, " A br ac ad ab ra ! I 'm
seen before . But Grand ad warned him that he must

MIN ACTIVATIONS
MIN = -17.556

We have different likes and disl ikes . We can be
sad . He said , "" I hope something will change
should be compassionate . Comp assion ate means you care about
they arrived they were ast ounded , the land was very
the bird and said , "' Hello , my name is
Billy was ast ounded ! He used the magnet
and said excited ly , "' Wow , the buildings here
my smiled and said , "' Let 's go and do
Sam . He said , "' We should share our special
ally , that 's not ours . Leave it where it
Its brain works differently than ours . ”
car . It 's not ours . It 's our parents
found it . It 's ours ." The big
" No , it is ours . We found it .
. " They are not ours . They belong to the
it ’ s ours ” .
says . " It is ours !" The woman
money . It 's not ours ." In the
not yours . They are ours . We made them .
But you have to try ours too . We can help

INTERVAL 8.200 - 16.785
CONTAINS 3.733%

Tim . Tim loved to go fishing with his dad .
. ” Tim my just stared in amaz
that appear risky . <|endoftext|> Sam was feeling upset . He
and pushed the button to go down . It took him
Max at all . <|endoftext|> Ben and Lily were playing in

INTERVAL -0.385 - 8.200
CONTAINS 54.144%

One day , Tim met a sad girl named
. Your ball is not slow or dumb . It was
too tight . He could not wear it anymore . Tim
old man to do . The old man then
the jazz music back on and danced around the living room

INTERVAL -8.971 - -0.385
CONTAINS 39.906%

make new paper animals and shapes . But this time ,
turned it around , inspecting it closely . Then , before
forest , there was a small bunny named Ben . Ben
try to find their way through again soon . From that
candy . She smiled and enjoyed her treats . She had

INTERVAL -17.556 - -8.971
CONTAINS 2.217%

the bull . The bull sees Mia and sn orts .
the bird or make it sad . She wants to be
Lily 's mom said , " No more cartoons today ,
mouse . Sam was very sad and angry . He did
The girl smiled . " I love it !" she
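
The four INTERVAL ranges above appear to divide the span from MIN (-17.556) to MAX (16.785) into equal-width bins, and the CONTAINS percentages sum to 100%. A minimal sketch of that arithmetic, assuming equal-width binning; the function name `equal_width_edges` is hypothetical and is not taken from the tool that produced this dump:

```python
def equal_width_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return the n_bins + 1 edges that split [lo, hi] into equal-width bins."""
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins + 1)]


# MIN and MAX as printed in this dump (assumed to be rounded to 3 decimals).
edges = equal_width_edges(-17.556, 16.785, 4)

# Percentages as printed under each INTERVAL header, highest bin first.
contains = [3.733, 54.144, 39.906, 2.217]
total_percent = sum(contains)

for left, right in zip(edges[:-1], edges[1:]):
    print(f"INTERVAL {left:.3f} - {right:.3f}")
```

The computed edges agree with the printed ones to within rounding of the displayed MIN and MAX, which is consistent with (but does not prove) equal-width binning.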
Token not
Token ID407
Feature activation+4.70e+00

Pos logit contributions
ometer+0.187
ounded+0.186
oke+0.179
rier+0.175
 understands+0.174
Neg logit contributions
aster-0.121
e-0.112
One-0.107
 rest-0.105
 spring-0.097
Token available
Token ID1695
Feature activation-2.38e+00

Pos logit contributions
aster+0.188
e+0.174
 rest+0.170
 spring+0.154
One+0.153
Neg logit contributions
ounded-0.344
ometer-0.340
oke-0.330
keeper-0.322
 understands-0.317
Token yet
Token ID1865
Feature activation-5.58e+00

Pos logit contributions
ometer+0.160
ounded+0.153
rier+0.151
oke+0.151
keeper+0.145
Neg logit contributions
aster-0.108
e-0.101
One-0.097
 rest-0.092
wered-0.084
Token,"
Token ID553
Feature activation-7.61e+00

Pos logit contributions
ometer+0.413
ounded+0.407
oke+0.396
rier+0.391
 understands+0.386
Neg logit contributions
aster-0.212
e-0.193
One-0.184
 rest-0.179
 play-0.160
Token Grand
Token ID5675
Feature activation-5.19e-01

Pos logit contributions
ometer+0.502
ounded+0.493
oke+0.477
rier+0.471
 understands+0.462
Neg logit contributions
aster-0.354
e-0.328
One-0.316
 rest-0.308
 play-0.280
Tokenad
Token ID324
Feature activation+1.62e+01

Pos logit contributions
ometer+0.035
ounded+0.034
oke+0.033
rier+0.033
 understands+0.032
Neg logit contributions
aster-0.024
e-0.022
One-0.021
 rest-0.020
 play-0.019
Token said
Token ID531
Feature activation-5.96e+00

Pos logit contributions
e+1.085
aster+1.054
One+1.015
he+0.937
 read+0.937
Neg logit contributions
ometer-0.772
ounded-0.694
rier-0.653
keeper-0.604
sed-0.574
Token as
Token ID355
Feature activation-2.54e+00

Pos logit contributions
ounded+0.460
ometer+0.460
oke+0.446
 understands+0.435
rier+0.427
Neg logit contributions
aster-0.204
e-0.193
 rest-0.170
One-0.170
 play-0.152
Token he
Token ID339
Feature activation+1.01e+00

Pos logit contributions
ometer+0.177
ounded+0.173
oke+0.168
rier+0.167
 understands+0.164
Neg logit contributions
aster-0.108
e-0.099
One-0.096
 rest-0.093
 play-0.083
Token wiped
Token ID21122
Feature activation-1.80e+00

Pos logit contributions
aster+0.059
e+0.056
One+0.055
 rest+0.053
 read+0.051
Neg logit contributions
ometer-0.055
ounded-0.054
rier-0.053
oke-0.051
 understands-0.048
Token the
Token ID262
Feature activation-7.66e-01

Pos logit contributions
ometer+0.139
ounded+0.138
oke+0.134
rier+0.132
 understands+0.130
Neg logit contributions
aster-0.064
e-0.058
One-0.054
 rest-0.052
 play-0.046
Token frog
Token ID21264
Feature activation+1.10e+00

Pos logit contributions
ometer+0.103
ounded+0.101
oke+0.098
rier+0.096
 understands+0.095
Neg logit contributions
aster-0.083
e-0.078
One-0.075
 rest-0.073
 play-0.067
Token!
Token ID0
Feature activation-4.48e-01

Pos logit contributions
aster+0.045
e+0.041
One+0.040
 rest+0.038
 play+0.034
Neg logit contributions
ometer-0.078
ounded-0.076
oke-0.074
rier-0.073
 understands-0.072
Token\n
Token ID198
Feature activation+9.78e-01

Pos logit contributions
ometer+0.026
ounded+0.026
rier+0.025
oke+0.025
 understands+0.024
Neg logit contributions
aster-0.024
e-0.023
One-0.022
 rest-0.021
 play-0.020
Token\n
Token ID198
Feature activation+8.67e-01

Pos logit contributions
aster+0.044
e+0.041
 rest+0.040
 arrive+0.037
One+0.037
Neg logit contributions
ometer-0.063
oke-0.061
ounded-0.061
 understands-0.059
keeper-0.058
TokenT
Token ID51
Feature activation+5.85e+00

Pos logit contributions
aster+0.032
 rest+0.032
e+0.029
 recess+0.029
 arrive+0.028
Neg logit contributions
keeper-0.056
oke-0.056
ounded-0.056
ometer-0.056
 puzzled-0.054
Tokenad
Token ID324
Feature activation+1.62e+01

Pos logit contributions
aster+0.416
e+0.396
One+0.387
 rest+0.380
 play+0.359
Neg logit contributions
ometer-0.243
ounded-0.235
oke-0.224
rier-0.219
 understands-0.213
Token smiled
Token ID13541
Feature activation-4.02e+00

Pos logit contributions
e+1.017
aster+0.974
One+0.928
 rest+0.888
 play+0.858
Neg logit contributions
ometer-0.848
ounded-0.793
rier-0.785
keeper-0.705
oke-0.701
Token and
Token ID290
Feature activation+1.89e+00

Pos logit contributions
oke+0.329
ometer+0.323
ounded+0.320
 understands+0.297
ours+0.294
Neg logit contributions
e-0.109
 play-0.108
One-0.108
 go-0.106
 spring-0.104
Token carefully
Token ID7773
Feature activation+2.92e+00

Pos logit contributions
e+0.090
aster+0.089
 rest+0.087
 play+0.086
 spring+0.086
Neg logit contributions
ometer-0.120
oke-0.116
ounded-0.115
rier-0.114
keeper-0.103
Token picked
Token ID6497
Feature activation+5.39e-01

Pos logit contributions
aster+0.158
e+0.148
One+0.144
 rest+0.140
 play+0.129
Neg logit contributions
ometer-0.169
ounded-0.166
oke-0.161
rier-0.158
 understands-0.156
Token up
Token ID510
Feature activation-3.95e+00

Pos logit contributions
aster+0.025
e+0.024
 rest+0.023
 spring+0.021
One+0.021
Neg logit contributions
ounded-0.036
ometer-0.035
oke-0.035
rier-0.033
 puzzled-0.032
Token never
Token ID1239
Feature activation+1.21e+01

Pos logit contributions
e+0.047
aster+0.046
One+0.042
 rest+0.042
 play+0.040
Neg logit contributions
ometer-0.055
ounded-0.054
rier-0.054
oke-0.051
keeper-0.048
Token pin
Token ID6757
Feature activation+2.12e+00

Pos logit contributions
aster+0.616
 rest+0.598
e+0.588
 play+0.582
One+0.581
Neg logit contributions
ometer-0.765
rier-0.725
ounded-0.683
oke-0.677
keeper-0.641
Tokenched
Token ID1740
Feature activation-2.19e-01

Pos logit contributions
aster+0.115
e+0.108
One+0.104
 rest+0.102
 play+0.094
Neg logit contributions
ometer-0.124
ounded-0.121
oke-0.117
rier-0.115
 understands-0.113
Token the
Token ID262
Feature activation-4.12e-01

Pos logit contributions
ometer+0.015
ounded+0.014
oke+0.014
rier+0.014
 understands+0.014
Neg logit contributions
aster-0.010
e-0.009
One-0.009
 rest-0.009
 play-0.008
Token to
Token ID284
Feature activation-3.74e+00

Pos logit contributions
ounded+0.023
ometer+0.023
oke+0.022
 understands+0.022
rier+0.021
Neg logit contributions
aster-0.023
e-0.021
 rest-0.021
One-0.020
 play-0.019
Tokenad
Token ID324
Feature activation+1.62e+01

Pos logit contributions
 ashamed+0.220
oke+0.217
rier+0.215
 teeth+0.213
ometer+0.213
Neg logit contributions
One-0.134
P-0.130
e-0.126
aster-0.120
 arrive-0.116
Token's
Token ID338
Feature activation-4.98e+00

Pos logit contributions
aster+0.884
e+0.829
One+0.800
 rest+0.783
 play+0.724
Neg logit contributions
ometer-0.932
ounded-0.912
oke-0.877
rier-0.865
 understands-0.852
Token toe
Token ID21189
Feature activation+1.21e+00

Pos logit contributions
ounded+0.255
ometer+0.248
oke+0.245
 understands+0.225
rier+0.225
Neg logit contributions
aster-0.307
e-0.296
 rest-0.288
One-0.280
 play-0.273
Token again
Token ID757
Feature activation-2.18e+00

Pos logit contributions
aster+0.058
e+0.055
One+0.051
 rest+0.051
 play+0.047
Neg logit contributions
ometer-0.077
ounded-0.077
oke-0.074
 understands-0.072
rier-0.072
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
ometer+0.153
ounded+0.150
oke+0.147
rier+0.145
 understands+0.144
Neg logit contributions
aster-0.090
e-0.081
One-0.078
 rest-0.075
 arrive-0.068
Token and
Token ID290
Feature activation+1.78e+00

Pos logit contributions
aster+0.057
 rest+0.055
One+0.054
e+0.054
 play+0.051
Neg logit contributions
ometer-0.091
oke-0.088
ounded-0.087
rier-0.085
 understands-0.081
Token words
Token ID2456
Feature activation-4.76e+00

Pos logit contributions
ounded+0.139
ometer+0.131
rier+0.126
 puzzled+0.125
oke+0.121
Neg logit contributions
aster-0.096
e-0.093
 rest-0.087
 play-0.086
 go-0.081
Token like
Token ID588
Feature activation+1.53e+00

Pos logit contributions
ometer+0.323
ounded+0.317
oke+0.307
rier+0.303
 understands+0.299
Neg logit contributions
aster-0.210
e-0.194
One-0.187
 rest-0.182
 play-0.164
Token "
Token ID366
Feature activation-1.01e+01

Pos logit contributions
aster+0.076
e+0.071
One+0.068
 rest+0.067
 play+0.062
Neg logit contributions
ometer-0.095
ounded-0.094
oke-0.091
rier-0.089
 understands-0.088
Tokenab
Token ID397
Feature activation-1.26e-01

Pos logit contributions
ometer+0.638
ounded+0.594
oke+0.590
keeper+0.573
rier+0.562
Neg logit contributions
aster-0.490
 rest-0.451
e-0.434
One-0.420
 arrive-0.414
Tokenrac
Token ID11510
Feature activation-5.29e+00

Pos logit contributions
ometer+0.007
ounded+0.007
oke+0.007
rier+0.007
 understands+0.007
Neg logit contributions
aster-0.007
One-0.006
e-0.006
 rest-0.006
 play-0.006
Tokenad
Token ID324
Feature activation+1.60e+01

Pos logit contributions
oke+0.388
ometer+0.365
 understands+0.360
ounded+0.357
rier+0.357
Neg logit contributions
aster-0.170
 rest-0.163
 walk-0.154
 recess-0.151
One-0.149
Tokenab
Token ID397
Feature activation-2.55e-01

Pos logit contributions
aster+0.914
e+0.881
One+0.814
 rest+0.797
 spring+0.744
Neg logit contributions
ounded-0.886
ometer-0.884
rier-0.828
keeper-0.811
 puzzled-0.807
Tokenra
Token ID430
Feature activation+3.59e+00

Pos logit contributions
ometer+0.015
ounded+0.014
oke+0.014
rier+0.014
 understands+0.013
Neg logit contributions
aster-0.014
e-0.013
One-0.013
 rest-0.012
 play-0.011
Token"
Token ID1
Feature activation-2.76e+00

Pos logit contributions
aster+0.199
e+0.185
One+0.181
 rest+0.177
 play+0.164
Neg logit contributions
ometer-0.203
ounded-0.197
oke-0.192
rier-0.189
 understands-0.185
Token and
Token ID290
Feature activation+1.71e+00

Pos logit contributions
ometer+0.165
ounded+0.164
oke+0.158
rier+0.155
 understands+0.154
Neg logit contributions
aster-0.141
e-0.132
One-0.129
 rest-0.124
 play-0.115
Token "
Token ID366
Feature activation-1.05e+01

Pos logit contributions
aster+0.096
e+0.089
 rest+0.084
One+0.082
 play+0.081
Neg logit contributions
ounded-0.100
ometer-0.098
rier-0.092
oke-0.092
keeper-0.088
Token and
Token ID290
Feature activation+1.80e+00

Pos logit contributions
ometer+0.171
ounded+0.168
oke+0.163
rier+0.162
 understands+0.159
Neg logit contributions
aster-0.108
e-0.099
One-0.095
 rest-0.093
 play-0.083
Token say
Token ID910
Feature activation+9.98e-01

Pos logit contributions
aster+0.055
e+0.049
One+0.047
 rest+0.043
 arrive+0.037
Neg logit contributions
ometer-0.146
ounded-0.144
oke-0.140
rier-0.139
 understands-0.137
Token "
Token ID366
Feature activation-1.04e+01

Pos logit contributions
aster+0.048
e+0.046
 rest+0.044
One+0.044
 spring+0.043
Neg logit contributions
ounded-0.063
ometer-0.061
oke-0.059
keeper-0.058
outed-0.057
Tokenab
Token ID397
Feature activation-2.62e-01

Pos logit contributions
ometer+0.608
ounded+0.589
keeper+0.554
rier+0.549
azed+0.532
Neg logit contributions
aster-0.503
 rest-0.435
 recess-0.404
wered-0.402
 end-0.393
Tokenrac
Token ID11510
Feature activation-5.37e+00

Pos logit contributions
ometer+0.012
ounded+0.012
oke+0.012
rier+0.011
 understands+0.011
Neg logit contributions
aster-0.017
e-0.016
One-0.015
 rest-0.015
 play-0.015
Tokenad
Token ID324
Feature activation+1.60e+01

Pos logit contributions
oke+0.398
 understands+0.388
atted+0.387
ounded+0.386
keeper+0.383
Neg logit contributions
aster-0.140
 rest-0.123
 recess-0.123
 walk-0.120
 wait-0.120
Tokenab
Token ID397
Feature activation-2.44e-01

Pos logit contributions
aster+0.954
e+0.918
One+0.867
 rest+0.855
 arrive+0.794
Neg logit contributions
ometer-0.837
ounded-0.823
rier-0.772
oke-0.771
 understands-0.767
Tokenra
Token ID430
Feature activation+3.63e+00

Pos logit contributions
ometer+0.014
ounded+0.014
oke+0.013
rier+0.013
 understands+0.013
Neg logit contributions
aster-0.013
e-0.013
One-0.012
 rest-0.012
 play-0.011
Token"
Token ID1
Feature activation-2.66e+00

Pos logit contributions
aster+0.193
e+0.188
One+0.175
 rest+0.173
 arrive+0.160
Neg logit contributions
ometer-0.212
ounded-0.209
oke-0.200
rier-0.197
 understands-0.194
Token all
Token ID477
Feature activation+3.94e+00

Pos logit contributions
ometer+0.170
ounded+0.166
oke+0.161
rier+0.159
 understands+0.156
Neg logit contributions
aster-0.130
e-0.119
One-0.116
 rest-0.113
 arrive-0.102
Token the
Token ID262
Feature activation-6.89e-01

Pos logit contributions
aster+0.152
e+0.138
One+0.136
 rest+0.128
 arrive+0.117
Neg logit contributions
ometer-0.281
ounded-0.279
oke-0.275
 understands-0.267
rier-0.267
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
ometer+0.499
ounded+0.499
oke+0.482
 puzzled+0.463
 understands+0.462
Neg logit contributions
aster-0.166
e-0.161
 rest-0.152
One-0.140
 play-0.131
Token "
Token ID366
Feature activation-1.03e+01

Pos logit contributions
aster+0.094
One+0.091
e+0.089
 rest+0.089
 arrive+0.086
Neg logit contributions
ometer-0.049
ounded-0.048
oke-0.045
 understands-0.045
rier-0.043
TokenA
Token ID32
Feature activation+5.15e+00

Pos logit contributions
keeper+0.590
ounded+0.581
ometer+0.575
 understands+0.567
rier+0.563
Neg logit contributions
aster-0.493
 rest-0.440
 recess-0.432
e-0.428
 arrive-0.409
Tokenbr
Token ID1671
Feature activation+7.64e+00

Pos logit contributions
aster+0.243
e+0.225
One+0.219
 rest+0.213
 play+0.194
Neg logit contributions
ometer-0.336
ounded-0.331
oke-0.320
rier-0.317
 understands-0.311
Tokenac
Token ID330
Feature activation-7.37e+00

Pos logit contributions
e+0.454
aster+0.451
One+0.435
 end+0.391
 rest+0.389
Neg logit contributions
ometer-0.377
ounded-0.377
 puzzled-0.357
rier-0.353
keeper-0.353
Tokenad
Token ID324
Feature activation+1.60e+01

Pos logit contributions
ometer+0.625
oke+0.624
ounded+0.616
 understands+0.612
rier+0.612
Neg logit contributions
aster-0.175
One-0.142
 rest-0.142
e-0.125
 arrive-0.125
Tokenab
Token ID397
Feature activation-2.44e-01

Pos logit contributions
aster+0.949
e+0.907
One+0.858
 rest+0.849
 play+0.788
Neg logit contributions
ometer-0.852
ounded-0.836
rier-0.788
oke-0.781
 understands-0.771
Tokenra
Token ID430
Feature activation+3.63e+00

Pos logit contributions
ometer+0.014
ounded+0.014
oke+0.013
rier+0.013
 understands+0.013
Neg logit contributions
aster-0.013
e-0.012
One-0.012
 rest-0.012
 play-0.011
Token!"
Token ID2474
Feature activation-6.56e+00

Pos logit contributions
aster+0.185
One+0.170
e+0.168
 rest+0.163
 play+0.151
Neg logit contributions
ometer-0.220
ounded-0.216
oke-0.212
rier-0.206
 understands-0.204
Token Suddenly
Token ID24975
Feature activation-1.18e+00

Pos logit contributions
ometer+0.396
ounded+0.388
oke+0.375
rier+0.369
 understands+0.363
Neg logit contributions
aster-0.342
e-0.319
One-0.309
 rest-0.302
 play-0.278
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
ometer+0.088
ounded+0.087
oke+0.084
rier+0.083
 understands+0.082
Neg logit contributions
aster-0.045
e-0.041
 rest-0.039
One-0.039
 spring-0.034
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
ounded+0.541
ometer+0.531
oke+0.514
outed+0.499
keeper+0.499
Neg logit contributions
aster-0.121
e-0.117
 rest-0.111
 go-0.106
One-0.103
Token "
Token ID366
Feature activation-1.04e+01

Pos logit contributions
aster+0.088
One+0.085
 rest+0.083
e+0.082
 spring+0.081
Neg logit contributions
ometer-0.049
ounded-0.048
oke-0.046
 understands-0.045
rier-0.044
TokenA
Token ID32
Feature activation+5.13e+00

Pos logit contributions
ounded+0.496
ometer+0.494
keeper+0.480
 understands+0.456
ener+0.452
Neg logit contributions
aster-0.557
 recess-0.512
 rest-0.507
 parties-0.484
 jog-0.474
Tokenbr
Token ID1671
Feature activation+7.63e+00

Pos logit contributions
aster+0.252
e+0.236
One+0.229
 rest+0.225
 play+0.206
Neg logit contributions
ometer-0.323
ounded-0.317
oke-0.307
rier-0.303
 understands-0.298
Tokenac
Token ID330
Feature activation-7.39e+00

Pos logit contributions
aster+0.459
e+0.457
One+0.445
 end+0.400
 rest+0.398
Neg logit contributions
ounded-0.364
ometer-0.364
rier-0.353
 puzzled-0.348
keeper-0.347
Tokenad
Token ID324
Feature activation+1.60e+01

Pos logit contributions
ometer+0.617
ounded+0.609
oke+0.606
 understands+0.597
rier+0.594
Neg logit contributions
aster-0.193
One-0.157
 rest-0.156
e-0.147
 arrive-0.135
Tokenab
Token ID397
Feature activation-2.72e-01

Pos logit contributions
aster+0.978
e+0.927
One+0.897
 rest+0.881
 play+0.822
Neg logit contributions
ometer-0.815
ounded-0.796
oke-0.760
rier-0.752
 understands-0.735
Tokenra
Token ID430
Feature activation+3.61e+00

Pos logit contributions
ometer+0.016
ounded+0.015
oke+0.015
rier+0.015
 understands+0.014
Neg logit contributions
aster-0.015
e-0.014
One-0.013
 rest-0.013
 play-0.012
Token!
Token ID0
Feature activation-9.96e-01

Pos logit contributions
aster+0.190
e+0.176
One+0.173
 rest+0.168
 play+0.155
Neg logit contributions
ometer-0.215
ounded-0.210
oke-0.204
rier-0.200
 understands-0.197
Token I
Token ID314
Feature activation-1.49e+00

Pos logit contributions
ometer+0.044
ounded+0.044
oke+0.042
rier+0.042
 understands+0.041
Neg logit contributions
aster-0.066
e-0.064
One-0.063
 rest-0.061
 play-0.058
Token want
Token ID765
Feature activation-2.71e+00

Pos logit contributions
ometer+0.088
ounded+0.087
 understands+0.082
oke+0.082
 puzzled+0.079
Neg logit contributions
aster-0.080
One-0.071
e-0.069
 rest-0.066
 recess-0.066
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
ounded+0.507
ometer+0.500
oke+0.472
 understands+0.468
 puzzled+0.465
Neg logit contributions
aster-0.171
e-0.163
One-0.148
 rest-0.145
 play-0.132
Token "
Token ID366
Feature activation-1.04e+01

Pos logit contributions
aster+0.085
One+0.084
 rest+0.081
e+0.080
 spring+0.079
Neg logit contributions
ounded-0.051
ometer-0.051
oke-0.047
 understands-0.046
rier-0.045
TokenA
Token ID32
Feature activation+5.13e+00

Pos logit contributions
ounded+0.625
ometer+0.605
keeper+0.604
rier+0.597
 understands+0.594
Neg logit contributions
aster-0.518
 rest-0.452
e-0.435
 arrive-0.418
 recess-0.412
Tokenbr
Token ID1671
Feature activation+7.63e+00

Pos logit contributions
aster+0.269
e+0.251
One+0.245
 rest+0.238
 play+0.220
Neg logit contributions
ometer-0.308
ounded-0.303
oke-0.291
rier-0.288
 understands-0.283
Tokenac
Token ID330
Feature activation-7.39e+00

Pos logit contributions
aster+0.454
e+0.449
One+0.436
 end+0.391
he+0.387
Neg logit contributions
ometer-0.373
ounded-0.371
 puzzled-0.356
rier-0.352
keeper-0.352
Tokenad
Token ID324
Feature activation+1.60e+01

Pos logit contributions
oke+0.624
ometer+0.617
ounded+0.609
rier+0.606
 understands+0.604
Neg logit contributions
aster-0.183
 rest-0.157
One-0.154
 arrive-0.131
e-0.130
Tokenab
Token ID397
Feature activation-2.72e-01

Pos logit contributions
aster+0.930
e+0.896
One+0.836
 rest+0.825
 play+0.771
Neg logit contributions
ometer-0.871
ounded-0.856
rier-0.810
 puzzled-0.787
keeper-0.784
Tokenra
Token ID430
Feature activation+3.61e+00

Pos logit contributions
ometer+0.015
ounded+0.015
oke+0.015
rier+0.014
 understands+0.014
Neg logit contributions
aster-0.015
e-0.014
One-0.014
 rest-0.013
 play-0.012
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
aster+0.178
e+0.163
One+0.161
 rest+0.156
 play+0.143
Neg logit contributions
ometer-0.226
ounded-0.222
oke-0.217
rier-0.212
 understands-0.210
Token I
Token ID314
Feature activation-1.49e+00

Pos logit contributions
aster+0.064
e+0.060
One+0.059
 rest+0.056
 play+0.051
Neg logit contributions
ometer-0.083
ounded-0.082
oke-0.080
rier-0.078
 understands-0.077
Token turn
Token ID1210
Feature activation+6.49e+00

Pos logit contributions
ometer+0.093
ounded+0.093
oke+0.090
 understands+0.090
 puzzled+0.086
Neg logit contributions
aster-0.074
One-0.064
e-0.063
 rest-0.060
wered-0.059
Token\n
Token ID198
Feature activation+7.39e-01

Pos logit contributions
ounded+0.301
ometer+0.297
rier+0.289
oke+0.280
 puzzled+0.278
Neg logit contributions
aster-0.135
e-0.120
One-0.119
 rest-0.112
 play-0.097
TokenIt
Token ID1026
Feature activation+1.06e+00

Pos logit contributions
aster+0.036
 rest+0.034
e+0.033
 wait+0.032
 arrive+0.032
Neg logit contributions
oke-0.042
keeper-0.042
ometer-0.042
 understands-0.040
ounded-0.039
Token was
Token ID373
Feature activation-1.31e+00

Pos logit contributions
e+0.042
aster+0.042
One+0.041
 rest+0.039
 play+0.039
Neg logit contributions
ometer-0.075
rier-0.074
ounded-0.074
oke-0.072
keeper-0.066
Token his
Token ID465
Feature activation+1.09e+00

Pos logit contributions
ometer+0.092
ounded+0.090
oke+0.088
rier+0.087
 understands+0.086
Neg logit contributions
aster-0.055
e-0.050
One-0.048
 rest-0.047
 play-0.042
Token grand
Token ID4490
Feature activation-5.93e+00

Pos logit contributions
aster+0.063
e+0.060
 rest+0.059
One+0.058
 play+0.055
Neg logit contributions
ounded-0.059
ometer-0.058
oke-0.056
rier-0.055
 understands-0.053
Tokenad
Token ID324
Feature activation+1.59e+01

Pos logit contributions
ometer+0.367
ounded+0.360
oke+0.348
rier+0.343
 understands+0.337
Neg logit contributions
aster-0.300
e-0.280
One-0.270
 rest-0.264
 play-0.242
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
aster+0.715
e+0.648
One+0.646
 rest+0.619
 play+0.569
Neg logit contributions
ometer-1.057
ounded-1.039
oke-1.025
rier-0.996
 understands-0.987
Token He
Token ID679
Feature activation-2.03e+00

Pos logit contributions
aster+0.088
e+0.082
One+0.082
 rest+0.077
 play+0.074
Neg logit contributions
ounded-0.085
ometer-0.085
rier-0.081
oke-0.080
 understands-0.079
Token said
Token ID531
Feature activation-6.20e+00

Pos logit contributions
ometer+0.122
ounded+0.120
rier+0.118
oke+0.114
 understands+0.108
Neg logit contributions
aster-0.108
e-0.103
One-0.099
 rest-0.096
 play-0.092
Token to
Token ID284
Feature activation-4.12e+00

Pos logit contributions
ounded+0.514
ometer+0.500
oke+0.481
ishes+0.472
 understands+0.470
Neg logit contributions
aster-0.178
e-0.166
 rest-0.142
One-0.136
 go-0.134
Token Jamie
Token ID17826
Feature activation+3.75e-01

Pos logit contributions
ometer+0.203
keeper+0.196
oke+0.192
rier+0.192
 understands+0.189
Neg logit contributions
One-0.214
aster-0.209
e-0.206
 rest-0.200
P-0.199
Token,
Token ID11
Feature activation+1.25e+00

Pos logit contributions
ounded+0.532
ometer+0.521
oke+0.502
keeper+0.486
 puzzled+0.484
Neg logit contributions
aster-0.143
e-0.135
 rest-0.124
One-0.119
 go-0.118
Token "
Token ID366
Feature activation-1.04e+01

Pos logit contributions
aster+0.088
One+0.085
 rest+0.084
e+0.083
 spring+0.080
Neg logit contributions
ometer-0.048
ounded-0.047
oke-0.045
 understands-0.044
rier-0.043
TokenA
Token ID32
Feature activation+5.10e+00

Pos logit contributions
ometer+0.565
ounded+0.549
keeper+0.548
rier+0.521
 understands+0.512
Neg logit contributions
aster-0.519
 rest-0.479
 recess-0.463
 arrive-0.449
e-0.444
Tokenbr
Token ID1671
Feature activation+7.59e+00

Pos logit contributions
aster+0.238
e+0.224
One+0.218
 rest+0.213
 spring+0.195
Neg logit contributions
ometer-0.333
ounded-0.328
oke-0.317
rier-0.314
 understands-0.310
Tokenac
Token ID330
Feature activation-7.44e+00

Pos logit contributions
aster+0.439
e+0.438
One+0.424
 end+0.385
he+0.374
Neg logit contributions
ounded-0.377
ometer-0.372
 puzzled-0.363
rier-0.360
keeper-0.360
Tokenad
Token ID324
Feature activation+1.59e+01

Pos logit contributions
ometer+0.626
oke+0.617
ounded+0.614
 understands+0.605
rier+0.605
Neg logit contributions
aster-0.185
One-0.156
 rest-0.153
e-0.139
 arrive-0.134
Tokenab
Token ID397
Feature activation-2.89e-01

Pos logit contributions
aster+0.958
e+0.909
One+0.874
 rest+0.860
 play+0.801
Neg logit contributions
ometer-0.831
ounded-0.813
oke-0.773
rier-0.768
 understands-0.752
Tokenra
Token ID430
Feature activation+3.60e+00

Pos logit contributions
ometer+0.017
ounded+0.016
oke+0.016
rier+0.016
 understands+0.015
Neg logit contributions
aster-0.016
e-0.015
One-0.014
 rest-0.014
 play-0.013
Token!"
Token ID2474
Feature activation-6.61e+00

Pos logit contributions
aster+0.184
e+0.169
One+0.168
 rest+0.162
 play+0.149
Neg logit contributions
ometer-0.219
ounded-0.214
oke-0.209
rier-0.204
 understands-0.202
Token Suddenly
Token ID24975
Feature activation-1.22e+00

Pos logit contributions
ometer+0.422
ounded+0.413
oke+0.400
rier+0.394
 understands+0.389
Neg logit contributions
aster-0.324
e-0.302
One-0.291
 rest-0.282
 arrive-0.258
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
ometer+0.093
ounded+0.092
oke+0.089
rier+0.088
 understands+0.086
Neg logit contributions
aster-0.045
e-0.041
 rest-0.039
One-0.039
 spring-0.034
Token assignment
Token ID16237
Feature activation-3.57e+00

Pos logit contributions
aster+0.057
e+0.053
One+0.051
 rest+0.050
 play+0.046
Neg logit contributions
ometer-0.074
ounded-0.072
oke-0.070
rier-0.069
 understands-0.068
Token.
Token ID13
Feature activation+1.54e+00

Pos logit contributions
ometer+0.241
ounded+0.237
oke+0.230
rier+0.227
 understands+0.223
Neg logit contributions
aster-0.158
e-0.146
One-0.141
 rest-0.137
 play-0.124
Token\n
Token ID198
Feature activation+4.97e-01

Pos logit contributions
aster+0.079
One+0.078
e+0.078
 rest+0.075
 play+0.072
Neg logit contributions
rier-0.090
ounded-0.089
ometer-0.085
oke-0.082
keeper-0.081
Token\n
Token ID198
Feature activation+4.86e-01

Pos logit contributions
aster+0.021
e+0.019
One+0.018
 rest+0.017
 arrive+0.016
Neg logit contributions
ometer-0.034
ounded-0.033
oke-0.033
 understands-0.032
 puzzled-0.031
TokenCh
Token ID1925
Feature activation+1.03e+01

Pos logit contributions
aster+0.021
 rest+0.018
e+0.018
 recess+0.017
 walk+0.016
Neg logit contributions
ometer-0.032
ounded-0.032
oke-0.031
 understands-0.030
keeper-0.029
Tokenad
Token ID324
Feature activation+1.59e+01

Pos logit contributions
aster+0.415
 rest+0.378
One+0.377
e+0.351
 walk+0.343
Neg logit contributions
oke-0.750
ometer-0.727
ounded-0.712
atted-0.692
 understands-0.683
Token always
Token ID1464
Feature activation+9.73e+00

Pos logit contributions
aster+0.815
e+0.804
One+0.738
 rest+0.718
 play+0.690
Neg logit contributions
ometer-1.002
ounded-0.940
rier-0.932
oke-0.877
 puzzled-0.852
Token works
Token ID2499
Feature activation-2.17e+00

Pos logit contributions
aster+0.475
e+0.461
 rest+0.428
One+0.417
 play+0.416
Neg logit contributions
ometer-0.637
rier-0.620
ounded-0.610
oke-0.583
keeper-0.574
Token hard
Token ID1327
Feature activation+4.61e+00

Pos logit contributions
ometer+0.147
ounded+0.143
oke+0.139
 understands+0.134
rier+0.133
Neg logit contributions
aster-0.095
e-0.093
One-0.088
 rest-0.083
 play-0.077
Token at
Token ID379
Feature activation-3.48e+00

Pos logit contributions
aster+0.209
e+0.194
One+0.188
 rest+0.181
 arrive+0.165
Neg logit contributions
ometer-0.307
ounded-0.301
oke-0.294
rier-0.289
 understands-0.285
Token school
Token ID1524
Feature activation+6.51e+00

Pos logit contributions
ometer+0.218
oke+0.211
 understands+0.210
ounded+0.210
rier+0.207
Neg logit contributions
aster-0.143
One-0.138
e-0.137
P-0.122
 rest-0.121
Token big
Token ID1263
Feature activation+7.48e-01

Pos logit contributions
aster+0.145
One+0.141
 rest+0.140
 spring+0.135
 end+0.131
Neg logit contributions
ounded-0.188
ometer-0.188
oke-0.184
atted-0.184
rier-0.177
Token tree
Token ID5509
Feature activation+1.29e+00

Pos logit contributions
aster+0.034
e+0.031
 rest+0.030
One+0.030
 spring+0.029
Neg logit contributions
ometer-0.051
ounded-0.050
oke-0.048
rier-0.047
 understands-0.047
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
aster+0.060
e+0.053
One+0.051
 recess+0.050
 rest+0.050
Neg logit contributions
ometer-0.083
ounded-0.081
keeper-0.081
oke-0.079
 puzzled-0.079
Token\n
Token ID198
Feature activation+4.31e-01

Pos logit contributions
aster+0.060
One+0.059
e+0.057
 rest+0.056
 spring+0.053
Neg logit contributions
ounded-0.105
oke-0.101
rier-0.099
ometer-0.099
 puzzled-0.093
Token\n
Token ID198
Feature activation+4.35e-01

Pos logit contributions
aster+0.022
e+0.020
 rest+0.018
 recess+0.018
One+0.017
Neg logit contributions
ometer-0.027
ounded-0.026
oke-0.026
 understands-0.025
keeper-0.024
TokenBl
Token ID3629
Feature activation+1.58e+01

Pos logit contributions
aster+0.018
e+0.016
 recess+0.015
 rest+0.015
 play+0.014
Neg logit contributions
ometer-0.029
ounded-0.029
 understands-0.028
rier-0.028
keeper-0.028
Tokenossom
Token ID25548
Feature activation-9.49e+00

Pos logit contributions
aster+0.979
e+0.938
One+0.905
 rest+0.875
 arrive+0.823
Neg logit contributions
ometer-0.785
ounded-0.771
oke-0.742
rier-0.733
 understands-0.722
Token tried
Token ID3088
Feature activation-2.84e-01

Pos logit contributions
ometer+0.619
ounded+0.600
rier+0.589
oke+0.581
 understands+0.554
Neg logit contributions
aster-0.440
e-0.429
One-0.411
 rest-0.405
 spring-0.382
Token to
Token ID284
Feature activation-4.09e+00

Pos logit contributions
ometer+0.017
ounded+0.016
oke+0.016
rier+0.016
keeper+0.015
Neg logit contributions
aster-0.015
e-0.014
One-0.014
 rest-0.014
 spring-0.013
Token climb
Token ID12080
Feature activation+1.15e+01

Pos logit contributions
ounded+0.254
 puzzled+0.244
 fist+0.237
keeper+0.234
 confused+0.232
Neg logit contributions
aster-0.158
P-0.155
 recess-0.153
One-0.145
e-0.141
Token up
Token ID510
Feature activation-3.99e+00

Pos logit contributions
aster+0.616
e+0.573
One+0.549
 rest+0.537
 arrive+0.499
Neg logit contributions
ometer-0.677
ounded-0.663
rier-0.631
oke-0.629
 understands-0.615
Token say
Token ID910
Feature activation+9.66e-01

Pos logit contributions
aster+0.058
e+0.052
One+0.049
 rest+0.047
 play+0.041
Neg logit contributions
ometer-0.146
ounded-0.143
oke-0.140
rier-0.138
 understands-0.136
Token "
Token ID366
Feature activation-1.04e+01

Pos logit contributions
aster+0.050
e+0.047
One+0.045
 rest+0.045
 play+0.043
Neg logit contributions
ounded-0.058
ometer-0.058
oke-0.054
keeper-0.052
 understands-0.052
TokenA
Token ID32
Feature activation+5.06e+00

Pos logit contributions
ometer+0.630
ounded+0.624
keeper+0.603
rier+0.591
oke+0.582
Neg logit contributions
aster-0.496
 rest-0.426
e-0.403
wered-0.400
 arrive-0.387
Tokenbr
Token ID1671
Feature activation+7.53e+00

Pos logit contributions
aster+0.226
e+0.209
One+0.204
 rest+0.198
 spring+0.182
Neg logit contributions
ometer-0.340
ounded-0.338
oke-0.325
rier-0.325
 understands-0.318
Tokenac
Token ID330
Feature activation-7.51e+00

Pos logit contributions
aster+0.488
e+0.479
One+0.469
 rest+0.428
 end+0.413
Neg logit contributions
ometer-0.341
ounded-0.340
 puzzled-0.313
 understands-0.312
rier-0.312
Tokenad
Token ID324
Feature activation+1.58e+01

Pos logit contributions
ometer+0.620
oke+0.617
ounded+0.611
rier+0.607
 understands+0.606
Neg logit contributions
aster-0.186
 rest-0.162
One-0.153
 arrive-0.137
 play-0.136
Tokenab
Token ID397
Feature activation-4.44e-01

Pos logit contributions
aster+0.949
e+0.903
One+0.863
 rest+0.847
 arrive+0.790
Neg logit contributions
ometer-0.828
ounded-0.812
oke-0.765
rier-0.764
 understands-0.751
Tokenra
Token ID430
Feature activation+3.42e+00

Pos logit contributions
ometer+0.026
ounded+0.025
oke+0.024
rier+0.024
 understands+0.023
Neg logit contributions
aster-0.024
e-0.023
One-0.022
 rest-0.022
 play-0.020
Token!"
Token ID2474
Feature activation-6.80e+00

Pos logit contributions
aster+0.190
e+0.176
One+0.174
 rest+0.170
 play+0.157
Neg logit contributions
ometer-0.193
ounded-0.188
oke-0.183
rier-0.179
 understands-0.176
Token and
Token ID290
Feature activation+1.58e+00

Pos logit contributions
ometer+0.380
ounded+0.366
oke+0.357
rier+0.352
keeper+0.346
Neg logit contributions
aster-0.380
e-0.359
One-0.349
 rest-0.341
 arrive-0.315
Token everyone
Token ID2506
Feature activation+2.29e+00

Pos logit contributions
aster+0.067
e+0.062
 rest+0.057
One+0.056
 play+0.055
Neg logit contributions
ometer-0.113
ounded-0.111
oke-0.108
rier-0.106
 understands-0.103
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
ometer+0.193
ounded+0.190
oke+0.185
rier+0.182
 understands+0.181
Neg logit contributions
aster-0.106
e-0.098
One-0.093
 rest-0.090
 play-0.081
Token "
Token ID366
Feature activation-1.05e+01

Pos logit contributions
One+0.043
aster+0.042
Once+0.038
 rest+0.037
 play+0.035
Neg logit contributions
ounded-0.115
ometer-0.110
ener-0.108
rier-0.108
ience-0.107
TokenA
Token ID32
Feature activation+4.99e+00

Pos logit contributions
 understands+0.488
ounded+0.483
ometer+0.477
ksh+0.454
 Isn+0.453
Neg logit contributions
 rest-0.505
aster-0.495
 parties-0.472
 walk-0.461
 recess-0.460
Tokenbr
Token ID1671
Feature activation+7.47e+00

Pos logit contributions
aster+0.206
e+0.191
One+0.181
 rest+0.175
 arrive+0.157
Neg logit contributions
ometer-0.353
ounded-0.344
oke-0.339
rier-0.331
 understands-0.328
Tokenac
Token ID330
Feature activation-7.53e+00

Pos logit contributions
aster+0.477
e+0.469
One+0.454
 rest+0.419
 recess+0.403
Neg logit contributions
ometer-0.344
ounded-0.342
rier-0.326
keeper-0.322
 puzzled-0.317
Tokenad
Token ID324
Feature activation+1.58e+01

Pos logit contributions
oke+0.597
atted+0.590
ometer+0.588
rier+0.581
 understands+0.581
Neg logit contributions
aster-0.203
One-0.191
 rest-0.183
 arrive-0.161
 walk-0.160
Tokenab
Token ID397
Feature activation-4.14e-01

Pos logit contributions
aster+0.932
e+0.903
One+0.833
 rest+0.829
 arrive+0.774
Neg logit contributions
ometer-0.851
ounded-0.837
rier-0.797
keeper-0.772
 puzzled-0.767
Tokenra
Token ID430
Feature activation+3.47e+00

Pos logit contributions
ometer+0.024
ounded+0.023
oke+0.022
rier+0.022
 understands+0.022
Neg logit contributions
aster-0.023
e-0.021
One-0.021
 rest-0.020
 play-0.019
Token!"
Token ID2474
Feature activation-6.73e+00

Pos logit contributions
aster+0.188
One+0.174
e+0.173
 rest+0.167
 play+0.155
Neg logit contributions
ometer-0.198
ounded-0.194
oke-0.190
rier-0.185
 understands-0.182
Token\n
Token ID198
Feature activation+4.11e-01

Pos logit contributions
ometer+0.506
ounded+0.498
rier+0.481
oke+0.480
 understands+0.459
Neg logit contributions
aster-0.267
One-0.225
 rest-0.223
e-0.216
 spring-0.205
Token\n
Token ID198
Feature activation+4.08e-01

Pos logit contributions
aster+0.016
e+0.015
One+0.014
 rest+0.014
 play+0.012
Neg logit contributions
ometer-0.030
ounded-0.029
oke-0.028
rier-0.028
 understands-0.028
Token them
Token ID606
Feature activation+2.30e+00

Pos logit contributions
aster+0.189
e+0.169
One+0.166
 rest+0.156
 arrive+0.140
Neg logit contributions
ometer-0.370
ounded-0.361
rier-0.353
oke-0.350
 understands-0.345
Token on
Token ID319
Feature activation+4.32e+00

Pos logit contributions
aster+0.120
One+0.102
e+0.102
 rest+0.094
wered+0.089
Neg logit contributions
ometer-0.143
ounded-0.143
rier-0.135
 puzzled-0.134
keeper-0.132
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
aster+0.215
e+0.198
One+0.190
 rest+0.184
 recess+0.173
Neg logit contributions
ometer-0.254
rier-0.246
keeper-0.241
 puzzled-0.235
ounded-0.235
Token gave
Token ID2921
Feature activation+8.43e-02

Pos logit contributions
aster+0.077
e+0.071
 rest+0.069
One+0.067
 play+0.064
Neg logit contributions
ometer-0.106
ounded-0.103
oke-0.103
rier-0.100
 understands-0.096
Token Grand
Token ID5675
Feature activation-9.39e-01

Pos logit contributions
aster+0.003
One+0.003
e+0.003
 rest+0.002
 arrive+0.002
Neg logit contributions
ometer-0.006
ounded-0.006
rier-0.006
keeper-0.006
oke-0.006
Tokenad
Token ID324
Feature activation+1.58e+01

Pos logit contributions
ometer+0.061
oke+0.060
ounded+0.059
 understands+0.058
rier+0.057
Neg logit contributions
aster-0.044
e-0.039
One-0.039
 rest-0.038
 arrive-0.035
Token a
Token ID257
Feature activation+3.02e+00

Pos logit contributions
aster+0.843
e+0.801
One+0.763
 rest+0.753
 play+0.690
Neg logit contributions
ometer-0.926
ounded-0.912
oke-0.881
rier-0.861
 understands-0.848
Token playful
Token ID34264
Feature activation-5.00e+00

Pos logit contributions
aster+0.135
 rest+0.125
e+0.122
One+0.118
 play+0.114
Neg logit contributions
ometer-0.201
ounded-0.199
oke-0.198
rier-0.197
 understands-0.188
Token punch
Token ID10862
Feature activation+2.20e+00

Pos logit contributions
ometer+0.321
ounded+0.312
oke+0.299
rier+0.299
 understands+0.295
Neg logit contributions
aster-0.238
e-0.220
One-0.217
 rest-0.199
 arrive-0.187
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
aster+0.092
e+0.084
One+0.082
 rest+0.078
 arrive+0.070
Neg logit contributions
ometer-0.155
ounded-0.150
oke-0.147
rier-0.146
keeper-0.143
Token He
Token ID679
Feature activation-2.06e+00

Pos logit contributions
aster+0.084
e+0.078
One+0.078
 rest+0.076
 play+0.072
Neg logit contributions
ometer-0.079
ounded-0.078
oke-0.076
rier-0.075
 understands-0.073
Token the
Token ID262
Feature activation-7.66e-01

Pos logit contributions
aster+0.193
e+0.174
One+0.172
 rest+0.169
 arrive+0.160
Neg logit contributions
ometer-0.284
rier-0.281
 puzzled-0.271
keeper-0.270
 understands-0.266
Token tree
Token ID5509
Feature activation+1.36e+00

Pos logit contributions
ometer+0.044
ounded+0.042
rier+0.042
keeper+0.041
oke+0.039
Neg logit contributions
aster-0.041
e-0.039
One-0.038
 rest-0.034
 recess-0.034
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
aster+0.056
e+0.051
One+0.049
 rest+0.047
 arrive+0.042
Neg logit contributions
ometer-0.096
ounded-0.094
oke-0.092
rier-0.090
 understands-0.089
Token\n
Token ID198
Feature activation+4.24e-01

Pos logit contributions
aster+0.073
One+0.068
e+0.067
 rest+0.063
 play+0.059
Neg logit contributions
ounded-0.097
ometer-0.096
rier-0.093
oke-0.091
 understands-0.089
Token\n
Token ID198
Feature activation+4.09e-01

Pos logit contributions
aster+0.020
e+0.019
 rest+0.018
 arrive+0.017
 recess+0.017
Neg logit contributions
ometer-0.026
oke-0.026
ounded-0.025
 understands-0.025
keeper-0.024
TokenBl
Token ID3629
Feature activation+1.58e+01

Pos logit contributions
aster+0.015
 rest+0.015
e+0.014
 recess+0.014
 arrive+0.014
Neg logit contributions
oke-0.027
keeper-0.026
ounded-0.026
ometer-0.025
 understands-0.025
Tokenossom
Token ID25548
Feature activation-9.58e+00

Pos logit contributions
aster+0.936
e+0.896
One+0.857
 rest+0.821
 arrive+0.773
Neg logit contributions
ometer-0.818
ounded-0.811
rier-0.778
oke-0.767
 understands-0.757
Token touched
Token ID12615
Feature activation-1.48e+00

Pos logit contributions
ometer+0.599
ounded+0.587
oke+0.565
rier+0.560
 understands+0.542
Neg logit contributions
aster-0.474
e-0.454
One-0.430
 rest-0.424
 play-0.392
Token the
Token ID262
Feature activation-9.44e-01

Pos logit contributions
ometer+0.116
ounded+0.115
oke+0.111
rier+0.108
 understands+0.107
Neg logit contributions
aster-0.052
e-0.049
One-0.045
 rest-0.044
 spring-0.040
Token pet
Token ID4273
Feature activation+5.40e+00

Pos logit contributions
ometer+0.060
ounded+0.060
oke+0.059
rier+0.056
 understands+0.055
Neg logit contributions
aster-0.044
e-0.043
 rest-0.042
One-0.040
 spring-0.038
Tokenals
Token ID874
Feature activation-5.08e+00

Pos logit contributions
aster+0.404
e+0.388
One+0.376
 rest+0.373
 play+0.353
Neg logit contributions
ometer-0.200
ounded-0.198
oke-0.185
rier-0.179
 understands-0.175
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
ometer+0.037
ounded+0.037
oke+0.036
rier+0.036
 understands+0.035
Neg logit contributions
aster-0.018
e-0.016
One-0.016
 rest-0.015
 play-0.013
Token "
Token ID366
Feature activation-1.05e+01

Pos logit contributions
aster+0.072
One+0.068
 spring+0.065
 rest+0.063
 play+0.063
Neg logit contributions
ounded-0.086
ometer-0.081
rier-0.080
ience-0.078
oke-0.078
TokenA
Token ID32
Feature activation+4.96e+00

Pos logit contributions
ounded+0.564
ometer+0.553
 understands+0.514
rier+0.498
keeper+0.490
Neg logit contributions
aster-0.512
 rest-0.498
 arrive-0.490
e-0.472
 recess-0.464
Tokenbr
Token ID1671
Feature activation+7.43e+00

Pos logit contributions
aster+0.230
e+0.213
One+0.204
 rest+0.198
 play+0.179
Neg logit contributions
ometer-0.331
ounded-0.323
oke-0.314
rier-0.310
 understands-0.305
Tokenac
Token ID330
Feature activation-7.57e+00

Pos logit contributions
aster+0.491
e+0.477
One+0.466
 rest+0.429
 end+0.417
Neg logit contributions
ometer-0.333
ounded-0.329
rier-0.315
keeper-0.311
 puzzled-0.301
Tokenad
Token ID324
Feature activation+1.58e+01

Pos logit contributions
oke+0.603
 understands+0.592
atted+0.586
ometer+0.585
ounded+0.581
Neg logit contributions
aster-0.201
 rest-0.186
One-0.186
 arrive-0.169
 jog-0.169
Tokenab
Token ID397
Feature activation-4.26e-01

Pos logit contributions
aster+0.946
e+0.908
One+0.852
 rest+0.843
 play+0.786
Neg logit contributions
ometer-0.836
ounded-0.819
rier-0.778
keeper-0.755
oke-0.755
Tokenra
Token ID430
Feature activation+3.44e+00

Pos logit contributions
ometer+0.025
ounded+0.024
oke+0.023
rier+0.023
 understands+0.022
Neg logit contributions
aster-0.023
e-0.022
One-0.021
 rest-0.021
 play-0.019
Token!"
Token ID2474
Feature activation-6.77e+00

Pos logit contributions
aster+0.192
One+0.178
e+0.177
 rest+0.172
 play+0.160
Neg logit contributions
ometer-0.191
ounded-0.187
oke-0.184
rier-0.178
 understands-0.177
Token\n
Token ID198
Feature activation+3.82e-01

Pos logit contributions
ometer+0.489
ounded+0.482
rier+0.463
oke+0.462
 understands+0.444
Neg logit contributions
aster-0.288
One-0.253
e-0.252
 rest-0.241
 spring-0.223
Token\n
Token ID198
Feature activation+3.85e-01

Pos logit contributions
aster+0.016
e+0.015
One+0.014
 rest+0.014
 arrive+0.013
Neg logit contributions
ometer-0.027
ounded-0.026
oke-0.026
rier-0.025
 understands-0.025
Token slept
Token ID21256
Feature activation+6.93e+00

Pos logit contributions
aster+0.236
e+0.221
One+0.216
 rest+0.211
 arrive+0.197
Neg logit contributions
ometer-0.198
ounded-0.193
oke-0.186
rier-0.181
 understands-0.180
Token in
Token ID287
Feature activation+4.33e+00

Pos logit contributions
aster+0.300
e+0.271
One+0.260
 rest+0.249
 play+0.227
Neg logit contributions
ometer-0.481
ounded-0.474
rier-0.466
oke-0.455
 understands-0.447
Token her
Token ID607
Feature activation+4.82e+00

Pos logit contributions
aster+0.202
P+0.186
e+0.185
One+0.175
 race+0.154
Neg logit contributions
 understands-0.243
 puzzled-0.243
 teeth-0.238
 eyes-0.238
ometer-0.235
Token bow
Token ID9563
Feature activation+2.26e+00

Pos logit contributions
aster+0.245
 rest+0.237
e+0.235
One+0.230
 play+0.219
Neg logit contributions
oke-0.289
ounded-0.288
ometer-0.287
rier-0.269
 understands-0.260
Token-
Token ID12
Feature activation-6.74e-01

Pos logit contributions
aster+0.091
e+0.082
One+0.078
 rest+0.075
 arrive+0.067
Neg logit contributions
ometer-0.164
ounded-0.161
rier-0.155
oke-0.155
 understands-0.152
Tokenad
Token ID324
Feature activation+1.58e+01

Pos logit contributions
ometer+0.041
ounded+0.040
keeper+0.038
 understands+0.038
oke+0.036
Neg logit contributions
aster-0.033
One-0.029
 arrive-0.029
e-0.029
 rest-0.027
Tokenorned
Token ID26994
Feature activation-1.65e+00

Pos logit contributions
aster+0.985
e+0.927
One+0.907
 rest+0.887
 play+0.830
Neg logit contributions
ometer-0.783
ounded-0.763
oke-0.735
rier-0.717
 understands-0.705
Token crib
Token ID48083
Feature activation-1.54e+00

Pos logit contributions
ometer+0.115
ounded+0.110
rier+0.110
oke+0.107
keeper+0.107
Neg logit contributions
aster-0.071
e-0.061
One-0.061
 rest-0.057
 arrive-0.053
Token.
Token ID13
Feature activation+1.44e+00

Pos logit contributions
ounded+0.114
ometer+0.114
oke+0.111
rier+0.108
 understands+0.108
Neg logit contributions
aster-0.056
e-0.052
One-0.049
 rest-0.048
 play-0.044
Token Every
Token ID3887
Feature activation-1.91e+00

Pos logit contributions
aster+0.067
e+0.062
One+0.061
 rest+0.059
 play+0.054
Neg logit contributions
ometer-0.094
ounded-0.093
oke-0.090
rier-0.089
 understands-0.087
Tokennight
Token ID3847
Feature activation-4.46e-03

Pos logit contributions
ometer+0.139
 understands+0.132
ounded+0.131
 puzzled+0.128
keeper+0.126
Neg logit contributions
aster-0.066
e-0.061
One-0.053
 arrive-0.051
 recess-0.051
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
ometer+0.515
ounded+0.513
oke+0.492
 puzzled+0.476
outed+0.473
Neg logit contributions
e-0.165
aster-0.162
One-0.153
 rest-0.149
 play-0.147
Token "
Token ID366
Feature activation-1.05e+01

Pos logit contributions
aster+0.074
One+0.074
 rest+0.070
e+0.070
 spring+0.070
Neg logit contributions
ometer-0.044
ounded-0.043
oke-0.040
 understands-0.039
rier-0.039
TokenA
Token ID32
Feature activation+4.95e+00

Pos logit contributions
ounded+0.555
keeper+0.522
 understands+0.519
ometer+0.517
oke+0.505
Neg logit contributions
aster-0.596
 rest-0.516
e-0.475
 recess-0.464
 walk-0.463
Tokenbr
Token ID1671
Feature activation+7.45e+00

Pos logit contributions
aster+0.265
e+0.248
One+0.240
 rest+0.234
 play+0.216
Neg logit contributions
ometer-0.292
ounded-0.285
oke-0.276
rier-0.271
 understands-0.267
Tokenac
Token ID330
Feature activation-7.59e+00

Pos logit contributions
aster+0.463
e+0.461
One+0.451
 rest+0.405
 end+0.399
Neg logit contributions
ometer-0.353
ounded-0.350
rier-0.331
keeper-0.326
 puzzled-0.325
Tokenad
Token ID324
Feature activation+1.58e+01

Pos logit contributions
oke+0.634
ometer+0.630
ounded+0.628
 understands+0.622
rier+0.618
Neg logit contributions
aster-0.189
 rest-0.154
One-0.151
e-0.128
 arrive-0.126
Tokenab
Token ID397
Feature activation-4.48e-01

Pos logit contributions
aster+0.928
e+0.897
One+0.836
 rest+0.828
 arrive+0.773
Neg logit contributions
ometer-0.849
ounded-0.831
rier-0.789
oke-0.766
 puzzled-0.765
Tokenra
Token ID430
Feature activation+3.45e+00

Pos logit contributions
ometer+0.025
ounded+0.025
oke+0.024
rier+0.023
 understands+0.023
Neg logit contributions
aster-0.025
e-0.023
One-0.023
 rest-0.022
 play-0.021
Token!
Token ID0
Feature activation-1.17e+00

Pos logit contributions
aster+0.188
e+0.175
One+0.171
 rest+0.167
 play+0.154
Neg logit contributions
ometer-0.199
ounded-0.194
oke-0.189
rier-0.185
 understands-0.182
Token I
Token ID314
Feature activation-1.70e+00

Pos logit contributions
ometer+0.052
ounded+0.050
oke+0.048
rier+0.047
 understands+0.046
Neg logit contributions
aster-0.080
e-0.076
One-0.074
 rest-0.073
 play-0.069
Token'm
Token ID1101
Feature activation-1.21e+00

Pos logit contributions
ounded+0.106
ometer+0.105
 understands+0.102
oke+0.101
 puzzled+0.098
Neg logit contributions
aster-0.084
One-0.072
e-0.069
 rest-0.066
wered-0.066
Token seen
Token ID1775
Feature activation-9.24e+00

Pos logit contributions
aster+0.574
e+0.547
 rest+0.525
One+0.512
 play+0.481
Neg logit contributions
ometer-0.713
ounded-0.660
rier-0.660
oke-0.643
keeper-0.619
Token before
Token ID878
Feature activation-6.32e-01

Pos logit contributions
ometer+0.561
ounded+0.550
oke+0.540
rier+0.536
 understands+0.520
Neg logit contributions
aster-0.459
e-0.422
One-0.415
 rest-0.398
 play-0.369
Token.
Token ID13
Feature activation+1.38e+00

Pos logit contributions
ometer+0.043
ounded+0.042
oke+0.041
rier+0.040
 puzzled+0.040
Neg logit contributions
aster-0.028
e-0.025
One-0.024
 rest-0.023
wered-0.021
Token But
Token ID887
Feature activation-1.56e+00

Pos logit contributions
aster+0.065
One+0.065
 play+0.057
 rest+0.054
 spring+0.054
Neg logit contributions
rier-0.099
ounded-0.089
rily-0.087
ometer-0.086
ener-0.085
Token Grand
Token ID5675
Feature activation-9.95e-01

Pos logit contributions
ometer+0.115
ounded+0.113
rier+0.113
oke+0.111
 understands+0.107
Neg logit contributions
aster-0.062
e-0.057
One-0.054
 rest-0.053
 play-0.049
Tokenad
Token ID324
Feature activation+1.57e+01

Pos logit contributions
oke+0.069
ometer+0.068
ounded+0.067
 understands+0.066
keeper+0.064
Neg logit contributions
aster-0.041
e-0.036
 rest-0.036
One-0.036
 skip-0.034
Token warned
Token ID7728
Feature activation-4.92e+00

Pos logit contributions
e+0.793
aster+0.779
One+0.743
 rest+0.690
 read+0.666
Neg logit contributions
ometer-1.006
rier-0.948
ounded-0.929
oke-0.865
keeper-0.852
Token him
Token ID683
Feature activation+6.42e+00

Pos logit contributions
ometer+0.319
ounded+0.303
oke+0.300
 puzzled+0.294
keeper+0.294
Neg logit contributions
aster-0.210
One-0.197
e-0.190
 rest-0.182
P-0.178
Token that
Token ID326
Feature activation-2.29e+00

Pos logit contributions
aster+0.256
e+0.237
 rest+0.232
One+0.221
 play+0.205
Neg logit contributions
ounded-0.462
ometer-0.450
oke-0.446
rier-0.435
 puzzled-0.425
Token he
Token ID339
Feature activation+7.67e-01

Pos logit contributions
ometer+0.150
ounded+0.149
oke+0.143
rier+0.142
 understands+0.139
Neg logit contributions
aster-0.108
e-0.100
One-0.096
 rest-0.095
 play-0.087
Token must
Token ID1276
Feature activation-8.57e+00

Pos logit contributions
e+0.042
aster+0.040
 rest+0.039
One+0.038
 play+0.037
Neg logit contributions
rier-0.045
ounded-0.043
ometer-0.042
oke-0.041
 puzzled-0.038
Token We
Token ID775
Feature activation-5.86e+00

Pos logit contributions
aster+0.085
e+0.081
One+0.079
 rest+0.077
 play+0.073
Neg logit contributions
ometer-0.054
ounded-0.053
oke-0.050
rier-0.049
 understands-0.048
Token have
Token ID423
Feature activation+4.10e+00

Pos logit contributions
ometer+0.386
ounded+0.376
 understands+0.370
oke+0.369
 ashamed+0.354
Neg logit contributions
aster-0.262
One-0.224
 recess-0.201
wered-0.199
e-0.197
Token different
Token ID1180
Feature activation-3.51e+00

Pos logit contributions
aster+0.198
e+0.183
 rest+0.181
One+0.181
 play+0.170
Neg logit contributions
ounded-0.270
ometer-0.269
oke-0.259
rier-0.257
keeper-0.242
Token likes
Token ID7832
Feature activation-1.05e+01

Pos logit contributions
ometer+0.213
ounded+0.210
oke+0.199
rier+0.195
keeper+0.188
Neg logit contributions
aster-0.188
e-0.184
 rest-0.176
One-0.174
 play-0.168
Token and
Token ID290
Feature activation+1.47e+00

Pos logit contributions
ometer+0.653
ounded+0.637
oke+0.620
rier+0.611
 understands+0.601
Neg logit contributions
aster-0.518
e-0.481
One-0.466
 rest-0.453
 arrive-0.414
Token disl
Token ID19621
Feature activation-1.76e+01

Pos logit contributions
aster+0.065
e+0.060
 rest+0.058
One+0.057
 play+0.054
Neg logit contributions
ometer-0.101
ounded-0.099
oke-0.096
rier-0.095
keeper-0.092
Tokenikes
Token ID7938
Feature activation-4.58e+00

Pos logit contributions
ometer+0.756
ounded+0.732
oke+0.700
rier+0.687
 understands+0.670
Neg logit contributions
aster-1.222
e-1.155
One-1.123
 rest-1.108
 arrive-1.044
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
ometer+0.335
ounded+0.327
oke+0.321
rier+0.316
 understands+0.312
Neg logit contributions
aster-0.179
e-0.162
One-0.155
 rest-0.149
 arrive-0.133
Token We
Token ID775
Feature activation-5.85e+00

Pos logit contributions
aster+0.087
e+0.083
One+0.081
 rest+0.080
 play+0.075
Neg logit contributions
ometer-0.058
ounded-0.056
oke-0.053
rier-0.052
 understands-0.051
Token can
Token ID460
Feature activation-5.69e+00

Pos logit contributions
ometer+0.371
ounded+0.364
 understands+0.354
oke+0.353
 puzzled+0.338
Neg logit contributions
aster-0.282
One-0.236
e-0.220
 recess-0.219
wered-0.215
Token be
Token ID307
Feature activation+1.77e+00

Pos logit contributions
ometer+0.346
keeper+0.335
ounded+0.335
rier+0.332
 puzzled+0.329
Neg logit contributions
aster-0.260
P-0.218
 recess-0.203
e-0.203
 Saturday-0.180
Token sad
Token ID6507
Feature activation-9.95e+00

Pos logit contributions
ometer+0.146
ounded+0.141
oke+0.133
rier+0.133
keeper+0.126
Neg logit contributions
aster-0.247
e-0.234
One-0.230
 rest-0.227
 play-0.216
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
ometer+0.768
ounded+0.768
oke+0.739
rier+0.737
 understands+0.722
Neg logit contributions
aster-0.340
e-0.299
One-0.299
 rest-0.295
 play-0.267
Token He
Token ID679
Feature activation-2.30e+00

Pos logit contributions
One+0.050
 play+0.047
 rest+0.043
aster+0.043
Once+0.042
Neg logit contributions
oke-0.091
ounded-0.086
rier-0.085
ounced-0.084
ishes-0.083
Token said
Token ID531
Feature activation-6.46e+00

Pos logit contributions
rier+0.148
ometer+0.141
oke+0.140
ounded+0.137
keeper+0.124
Neg logit contributions
One-0.112
e-0.111
aster-0.111
 play-0.110
 read-0.109
Token,
Token ID11
Feature activation+9.77e-01

Pos logit contributions
ounded+0.571
ometer+0.541
oke+0.538
rier+0.519
 understands+0.517
Neg logit contributions
aster-0.152
 rest-0.133
e-0.130
 play-0.121
One-0.119
Token ""
Token ID13538
Feature activation-1.70e+01

Pos logit contributions
One+0.061
aster+0.060
 rest+0.058
Once+0.056
 play+0.056
Neg logit contributions
ounded-0.046
oke-0.043
ometer-0.043
rier-0.042
azed-0.041
TokenI
Token ID40
Feature activation+7.67e+00

Pos logit contributions
ometer+1.102
ounded+1.102
keeper+1.067
 understands+1.059
 puzzled+1.047
Neg logit contributions
aster-0.708
e-0.671
 rest-0.617
 arrive-0.592
 recess-0.592
Token hope
Token ID2911
Feature activation+1.13e+00

Pos logit contributions
aster+0.359
e+0.345
 rest+0.333
One+0.328
 play+0.313
Neg logit contributions
ometer-0.495
rier-0.494
ounded-0.487
oke-0.469
keeper-0.457
Token something
Token ID1223
Feature activation-8.04e+00

Pos logit contributions
aster+0.049
e+0.045
One+0.043
 rest+0.042
 play+0.038
Neg logit contributions
ometer-0.079
ounded-0.077
oke-0.075
rier-0.074
 understands-0.073
Token will
Token ID481
Feature activation-2.79e+00

Pos logit contributions
ometer+0.565
ounded+0.536
rier+0.535
oke+0.523
keeper+0.509
Neg logit contributions
 rest-0.348
aster-0.334
e-0.323
 play-0.315
One-0.315
Token change
Token ID1487
Feature activation+2.93e+00

Pos logit contributions
ounded+0.182
ometer+0.179
rier+0.177
keeper+0.173
 isn+0.171
Neg logit contributions
aster-0.122
e-0.115
P-0.105
One-0.101
 recess-0.099
Token should
Token ID815
Feature activation-7.90e+00

Pos logit contributions
ometer+0.132
ounded+0.130
oke+0.128
 understands+0.126
 ashamed+0.120
Neg logit contributions
aster-0.087
One-0.076
e-0.071
 recess-0.067
 rest-0.066
Token be
Token ID307
Feature activation+1.71e+00

Pos logit contributions
 understands+0.378
 ashamed+0.360
 dear+0.352
 doesn+0.347
 teeth+0.334
Neg logit contributions
he-0.380
aster-0.379
e-0.364
 recess-0.359
date-0.354
Token compassionate
Token ID32533
Feature activation-3.86e+00

Pos logit contributions
aster+0.080
e+0.080
One+0.077
 rest+0.074
 play+0.073
Neg logit contributions
ometer-0.116
ounded-0.109
oke-0.107
rier-0.106
keeper-0.102
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
ounded+0.313
ometer+0.308
oke+0.298
 puzzled+0.293
rier+0.289
Neg logit contributions
e-0.119
aster-0.114
One-0.106
 rest-0.100
 play-0.096
Token Comp
Token ID3082
Feature activation+2.66e+00

Pos logit contributions
aster+0.083
e+0.079
One+0.077
 rest+0.076
 arrive+0.071
Neg logit contributions
ometer-0.055
ounded-0.053
oke-0.051
rier-0.050
 understands-0.049
Tokenassion
Token ID11857
Feature activation-1.69e+01

Pos logit contributions
One+0.196
e+0.195
aster+0.192
 play+0.179
 end+0.175
Neg logit contributions
rier-0.084
 puzzled-0.083
ounded-0.079
ometer-0.078
 understands-0.077
Tokenate
Token ID378
Feature activation+1.45e+00

Pos logit contributions
ounded+1.126
ometer+1.116
 puzzled+1.043
oke+1.036
rier+1.032
Neg logit contributions
e-0.768
One-0.739
aster-0.739
 rest-0.663
 end-0.661
Token means
Token ID1724
Feature activation-1.41e+01

Pos logit contributions
e+0.072
aster+0.069
 rest+0.065
One+0.063
 end+0.061
Neg logit contributions
ounded-0.092
ometer-0.088
 puzzled-0.084
rier-0.083
oke-0.081
Token you
Token ID345
Feature activation+4.91e+00

Pos logit contributions
ometer+1.045
ounded+1.026
oke+1.003
rier+0.988
 understands+0.978
Neg logit contributions
aster-0.540
e-0.492
One-0.468
 rest-0.450
 play-0.398
Token care
Token ID1337
Feature activation+2.83e+00

Pos logit contributions
e+0.212
aster+0.212
 rest+0.205
One+0.197
 play+0.188
Neg logit contributions
ometer-0.328
ounded-0.327
rier-0.322
oke-0.310
 puzzled-0.305
Token about
Token ID546
Feature activation-7.19e+00

Pos logit contributions
e+0.114
 rest+0.114
aster+0.112
One+0.111
 end+0.111
Neg logit contributions
ounded-0.191
ometer-0.189
keeper-0.183
 puzzled-0.177
rier-0.177
Token they
Token ID484
Feature activation+1.43e+00

Pos logit contributions
ometer+0.083
ounded+0.081
oke+0.078
 understands+0.077
rier+0.077
Neg logit contributions
aster-0.065
e-0.062
One-0.058
 rest-0.056
 skip-0.050
Token arrived
Token ID5284
Feature activation+2.43e+00

Pos logit contributions
aster+0.076
e+0.070
One+0.069
 rest+0.065
 recess+0.060
Neg logit contributions
ounded-0.083
oke-0.078
ometer-0.078
 understands-0.077
 puzzled-0.075
Token they
Token ID484
Feature activation+1.42e+00

Pos logit contributions
 rest+0.061
aster+0.061
 play+0.059
e+0.058
 recess+0.057
Neg logit contributions
ometer-0.198
oke-0.190
rier-0.187
ounded-0.185
 understands-0.180
Token were
Token ID547
Feature activation+1.83e+00

Pos logit contributions
aster+0.078
e+0.074
 rest+0.072
One+0.072
 play+0.069
Neg logit contributions
ometer-0.085
rier-0.081
ounded-0.079
oke-0.078
 understands-0.073
Token ast
Token ID6468
Feature activation+4.74e+00

Pos logit contributions
aster+0.090
e+0.087
 rest+0.081
One+0.081
 arrive+0.076
Neg logit contributions
ometer-0.116
ounded-0.112
oke-0.109
rier-0.108
 understands-0.105
Tokenounded
Token ID6302
Feature activation-1.66e+01

Pos logit contributions
One+0.366
 walk+0.361
 play+0.360
 go+0.359
 rest+0.358
Neg logit contributions
 teeth-0.121
rier-0.097
 understands-0.095
 claws-0.091
rily-0.090
Token,
Token ID11
Feature activation+8.70e-01

Pos logit contributions
ometer+1.403
oke+1.321
ounded+1.316
rier+1.296
 understands+1.256
Neg logit contributions
e-0.455
One-0.436
aster-0.408
 play-0.393
 rest-0.393
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
aster+0.034
 rest+0.031
e+0.031
One+0.030
 play+0.028
Neg logit contributions
ometer-0.065
oke-0.062
ounded-0.061
rier-0.060
keeper-0.058
Token land
Token ID1956
Feature activation+7.93e+00

Pos logit contributions
ometer+0.074
ounded+0.073
oke+0.070
rier+0.069
 understands+0.068
Neg logit contributions
aster-0.059
e-0.055
One-0.053
 rest-0.052
 play-0.048
Token was
Token ID373
Feature activation-1.84e+00

Pos logit contributions
e+0.305
aster+0.305
 rest+0.282
 end+0.273
 arrive+0.270
Neg logit contributions
ometer-0.546
ounded-0.527
rier-0.520
 understands-0.493
oke-0.492
Token very
Token ID845
Feature activation+2.46e+00

Pos logit contributions
ometer+0.125
ounded+0.122
oke+0.120
rier+0.116
 understands+0.115
Neg logit contributions
aster-0.082
e-0.077
One-0.075
 rest-0.073
 spring-0.068
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
ometer+0.301
ounded+0.298
rier+0.289
 understands+0.286
oke+0.285
Neg logit contributions
aster-0.180
e-0.169
One-0.164
 rest-0.147
 recess-0.142
Token bird
Token ID6512
Feature activation-1.87e+00

Pos logit contributions
ometer+0.073
ounded+0.073
oke+0.072
 understands+0.066
atted+0.066
Neg logit contributions
aster-0.059
 rest-0.058
e-0.056
One-0.056
 spring-0.052
Token and
Token ID290
Feature activation+1.36e+00

Pos logit contributions
ometer+0.125
ounded+0.123
oke+0.120
rier+0.119
 understands+0.116
Neg logit contributions
aster-0.085
e-0.079
One-0.075
 rest-0.074
 play-0.068
Token said
Token ID531
Feature activation-6.53e+00

Pos logit contributions
aster+0.056
 rest+0.054
e+0.054
 read+0.053
 play+0.052
Neg logit contributions
ometer-0.098
oke-0.096
rier-0.095
ounded-0.091
 understands-0.087
Token,
Token ID11
Feature activation+8.79e-01

Pos logit contributions
ounded+0.537
ometer+0.533
oke+0.513
outed+0.511
ishes+0.503
Neg logit contributions
 rest-0.159
aster-0.158
One-0.152
e-0.150
 go-0.144
Token "'
Token ID24018
Feature activation-1.64e+01

Pos logit contributions
One+0.058
aster+0.057
 rest+0.055
 spring+0.054
e+0.053
Neg logit contributions
ounded-0.036
ometer-0.036
 understands-0.034
oke-0.034
ishes-0.033
TokenHello
Token ID15496
Feature activation+8.46e-01

Pos logit contributions
ounded+0.952
keeper+0.925
ometer+0.925
oke+0.886
 understands+0.879
Neg logit contributions
aster-0.760
 rest-0.695
 recess-0.683
e-0.672
 arrive-0.651
Token,
Token ID11
Feature activation+8.72e-01

Pos logit contributions
One+0.031
aster+0.031
e+0.031
 rest+0.030
 spring+0.029
Neg logit contributions
ounded-0.062
ometer-0.061
oke-0.058
 understands-0.057
outed-0.057
Token my
Token ID616
Feature activation-7.89e+00

Pos logit contributions
 rest+0.044
aster+0.043
e+0.041
One+0.041
 spring+0.041
Neg logit contributions
ometer-0.053
ounded-0.052
outed-0.048
atted-0.048
rier-0.047
Token name
Token ID1438
Feature activation-4.26e+00

Pos logit contributions
ounded+0.438
oke+0.419
ometer+0.405
rier+0.389
 puzzled+0.386
Neg logit contributions
 rest-0.438
e-0.434
aster-0.416
One-0.404
 play-0.403
Token is
Token ID318
Feature activation-4.01e+00

Pos logit contributions
ometer+0.226
ounded+0.221
oke+0.213
rier+0.209
 understands+0.205
Neg logit contributions
aster-0.252
e-0.237
One-0.230
 rest-0.226
 play-0.210
Token\n
Token ID198
Feature activation+2.96e-01

Pos logit contributions
ometer+0.361
ounded+0.353
oke+0.344
rier+0.338
 understands+0.335
Neg logit contributions
aster-0.224
e-0.207
One-0.198
 rest-0.192
 arrive-0.173
Token\n
Token ID198
Feature activation+3.08e-01

Pos logit contributions
aster+0.014
e+0.013
 rest+0.012
One+0.012
 recess+0.011
Neg logit contributions
ometer-0.019
ounded-0.019
oke-0.019
 understands-0.018
keeper-0.017
TokenBilly
Token ID49640
Feature activation+4.03e+00

Pos logit contributions
aster+0.013
e+0.013
 rest+0.012
 recess+0.012
 arrive+0.011
Neg logit contributions
ounded-0.021
ometer-0.020
oke-0.019
 understands-0.019
keeper-0.019
Token was
Token ID373
Feature activation-1.62e+00

Pos logit contributions
e+0.196
aster+0.195
One+0.181
 rest+0.179
 play+0.171
Neg logit contributions
rier-0.251
ometer-0.250
ounded-0.244
oke-0.242
 understands-0.222
Token ast
Token ID6468
Feature activation+4.95e+00

Pos logit contributions
ometer+0.101
oke+0.096
ounded+0.095
rier+0.091
keeper+0.088
Neg logit contributions
e-0.085
aster-0.083
One-0.076
 rest-0.076
 play-0.072
Tokenounded
Token ID6302
Feature activation-1.64e+01

Pos logit contributions
One+0.383
aster+0.379
he+0.375
 rest+0.361
 walk+0.361
Neg logit contributions
 teeth-0.136
 eyes-0.113
sed-0.111
 understands-0.104
rier-0.100
Token!
Token ID0
Feature activation-1.21e+00

Pos logit contributions
ometer+1.330
ounded+1.304
oke+1.277
rier+1.226
 understands+1.221
Neg logit contributions
aster-0.486
e-0.470
One-0.451
 rest-0.422
 play-0.390
Token He
Token ID679
Feature activation-2.14e+00

Pos logit contributions
ounded+0.073
rier+0.072
oke+0.068
ener+0.068
ometer+0.068
Neg logit contributions
aster-0.057
One-0.055
Once-0.051
 rest-0.051
 play-0.049
Token used
Token ID973
Feature activation+3.41e+00

Pos logit contributions
rier+0.147
ometer+0.140
ounded+0.133
oke+0.132
ours+0.121
Neg logit contributions
e-0.101
aster-0.096
 read-0.095
One-0.094
 play-0.091
Token the
Token ID262
Feature activation-9.50e-01

Pos logit contributions
aster+0.107
e+0.093
One+0.090
 rest+0.089
 play+0.079
Neg logit contributions
ometer-0.280
ounded-0.277
oke-0.274
rier-0.268
 understands-0.260
Token magnet
Token ID19972
Feature activation-3.55e+00

Pos logit contributions
oke+0.057
ometer+0.057
ounded+0.056
atted+0.054
rier+0.054
Neg logit contributions
 rest-0.049
One-0.047
aster-0.047
 end-0.045
e-0.043
Token and
Token ID290
Feature activation+1.43e+00

Pos logit contributions
ometer+0.487
ounded+0.479
oke+0.466
rier+0.461
 understands+0.455
Neg logit contributions
aster-0.261
e-0.238
One-0.228
 rest-0.220
 play-0.195
Token said
Token ID531
Feature activation-6.43e+00

Pos logit contributions
aster+0.064
 rest+0.062
 play+0.061
 spring+0.058
e+0.057
Neg logit contributions
ometer-0.100
oke-0.095
rier-0.093
ounded-0.090
 understands-0.085
Token excited
Token ID6568
Feature activation-2.15e+00

Pos logit contributions
ounded+0.540
ometer+0.524
oke+0.518
ishes+0.503
outed+0.503
Neg logit contributions
aster-0.148
 rest-0.139
e-0.138
 go-0.135
 play-0.129
Tokenly
Token ID306
Feature activation-8.55e+00

Pos logit contributions
ometer+0.090
ounded+0.088
oke+0.082
 understands+0.082
 puzzled+0.079
Neg logit contributions
aster-0.148
 rest-0.143
e-0.140
 play-0.139
One-0.139
Token,
Token ID11
Feature activation+9.64e-01

Pos logit contributions
ounded+0.693
ometer+0.663
oke+0.648
 understands+0.644
rier+0.640
Neg logit contributions
aster-0.255
e-0.243
 rest-0.212
One-0.204
 play-0.197
Token "'
Token ID24018
Feature activation-1.63e+01

Pos logit contributions
aster+0.066
One+0.064
 rest+0.063
 spring+0.061
 arrive+0.061
Neg logit contributions
ometer-0.040
ounded-0.038
oke-0.037
 understands-0.035
rier-0.035
TokenWow
Token ID22017
Feature activation-9.19e-01

Pos logit contributions
ounded+0.890
ometer+0.837
oke+0.814
keeper+0.812
 understands+0.788
Neg logit contributions
aster-0.833
 rest-0.778
e-0.764
 recess-0.725
 end-0.719
Token,
Token ID11
Feature activation+9.68e-01

Pos logit contributions
ometer+0.064
ounded+0.063
oke+0.061
rier+0.060
 understands+0.060
Neg logit contributions
aster-0.038
e-0.036
 rest-0.034
One-0.034
 play-0.031
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
aster+0.045
 rest+0.041
e+0.041
One+0.038
 play+0.038
Neg logit contributions
ometer-0.066
ounded-0.064
oke-0.062
rier-0.061
 understands-0.059
Token buildings
Token ID6832
Feature activation-3.20e+00

Pos logit contributions
ometer+0.071
ounded+0.068
oke+0.068
rier+0.066
 understands+0.064
Neg logit contributions
aster-0.048
 rest-0.045
One-0.045
e-0.044
 arrive-0.041
Token here
Token ID994
Feature activation-4.38e+00

Pos logit contributions
ometer+0.234
ounded+0.230
oke+0.224
rier+0.221
 understands+0.218
Neg logit contributions
aster-0.125
e-0.114
One-0.109
 rest-0.106
 play-0.094
Tokenmy
Token ID1820
Feature activation-6.77e-01

Pos logit contributions
ometer+0.048
ounded+0.047
rier+0.046
oke+0.045
 understands+0.044
Neg logit contributions
aster-0.030
e-0.028
One-0.027
 rest-0.026
 play-0.024
Token smiled
Token ID13541
Feature activation-4.49e+00

Pos logit contributions
ometer+0.042
rier+0.041
ounded+0.041
oke+0.040
 understands+0.039
Neg logit contributions
aster-0.033
e-0.032
One-0.030
 rest-0.030
 play-0.028
Token and
Token ID290
Feature activation+1.46e+00

Pos logit contributions
oke+0.384
ometer+0.375
ounded+0.363
 understands+0.343
rier+0.340
Neg logit contributions
One-0.118
 play-0.114
 spring-0.108
 go-0.106
 rest-0.105
Token said
Token ID531
Feature activation-6.41e+00

Pos logit contributions
aster+0.075
 rest+0.071
 play+0.070
 read+0.069
 spring+0.069
Neg logit contributions
ometer-0.089
rier-0.088
oke-0.083
ounded-0.083
 understands-0.076
Token,
Token ID11
Feature activation+9.94e-01

Pos logit contributions
ounded+0.540
ometer+0.534
oke+0.515
 understands+0.502
rier+0.501
Neg logit contributions
aster-0.158
e-0.151
 rest-0.146
One-0.131
 play-0.127
Token "'
Token ID24018
Feature activation-1.63e+01

Pos logit contributions
One+0.066
aster+0.066
 rest+0.063
 spring+0.062
 arrive+0.061
Neg logit contributions
ometer-0.041
ounded-0.040
oke-0.038
 understands-0.038
rier-0.037
TokenLet
Token ID5756
Feature activation-1.33e+00

Pos logit contributions
ounded+0.858
ometer+0.835
oke+0.794
 understands+0.792
keeper+0.772
Neg logit contributions
aster-0.895
 rest-0.834
e-0.813
 arrive-0.785
 end-0.783
Token's
Token ID338
Feature activation-5.42e+00

Pos logit contributions
ometer+0.065
ounded+0.064
oke+0.061
rier+0.060
 understands+0.059
Neg logit contributions
aster-0.085
e-0.080
One-0.078
 rest-0.077
 play-0.072
Token go
Token ID467
Feature activation+1.13e+01

Pos logit contributions
ometer+0.353
ounded+0.351
 puzzled+0.348
 ashamed+0.336
 isn+0.335
Neg logit contributions
aster-0.220
One-0.191
 recess-0.176
he-0.173
e-0.173
Token and
Token ID290
Feature activation+1.50e+00

Pos logit contributions
aster+0.518
e+0.458
One+0.452
wered+0.413
 rest+0.392
Neg logit contributions
ounded-0.757
ometer-0.731
oke-0.687
 puzzled-0.675
 understands-0.665
Token do
Token ID466
Feature activation+2.06e+00

Pos logit contributions
aster+0.042
e+0.038
One+0.036
 rest+0.032
 arrive+0.027
Neg logit contributions
ometer-0.124
ounded-0.123
oke-0.119
 understands-0.117
rier-0.117
Token Sam
Token ID3409
Feature activation+3.65e-01

Pos logit contributions
ometer+0.247
oke+0.242
ounded+0.239
rier+0.237
 understands+0.236
Neg logit contributions
One-0.190
aster-0.188
e-0.178
P-0.172
 rest-0.161
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
aster+0.015
e+0.014
One+0.013
 rest+0.013
 play+0.012
Neg logit contributions
ometer-0.026
ounded-0.026
rier-0.025
oke-0.025
 understands-0.024
Token He
Token ID679
Feature activation-2.25e+00

Pos logit contributions
aster+0.051
One+0.048
e+0.045
 play+0.043
 spring+0.042
Neg logit contributions
rier-0.095
ounded-0.092
ometer-0.089
oke-0.086
ener-0.085
Token said
Token ID531
Feature activation-6.40e+00

Pos logit contributions
ometer+0.139
rier+0.137
ounded+0.135
oke+0.130
 understands+0.122
Neg logit contributions
aster-0.113
e-0.111
One-0.107
 rest-0.103
 read-0.102
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
ounded+0.484
ometer+0.474
oke+0.449
 understands+0.439
rier+0.436
Neg logit contributions
aster-0.248
e-0.223
 rest-0.206
One-0.204
 play-0.192
Token "'
Token ID24018
Feature activation-1.62e+01

Pos logit contributions
aster+0.074
One+0.071
e+0.069
 rest+0.068
 spring+0.066
Neg logit contributions
ounded-0.039
ometer-0.038
rier-0.035
oke-0.035
 understands-0.035
TokenWe
Token ID1135
Feature activation+3.93e+00

Pos logit contributions
ounded+0.939
ometer+0.938
oke+0.913
 understands+0.907
keeper+0.906
Neg logit contributions
aster-0.717
 rest-0.696
 arrive-0.679
e-0.646
 end-0.638
Token should
Token ID815
Feature activation-7.82e+00

Pos logit contributions
aster+0.234
e+0.213
One+0.211
 rest+0.203
 arrive+0.191
Neg logit contributions
ometer-0.210
ounded-0.205
oke-0.197
 understands-0.192
rier-0.191
Token share
Token ID2648
Feature activation+5.38e+00

Pos logit contributions
ometer+0.496
 isn+0.490
 understands+0.487
 doesn+0.482
 ashamed+0.481
Neg logit contributions
aster-0.322
 recess-0.272
e-0.262
One-0.261
P-0.252
Token our
Token ID674
Feature activation-6.73e+00

Pos logit contributions
aster+0.226
e+0.218
One+0.202
 rest+0.200
 play+0.188
Neg logit contributions
ometer-0.388
ounded-0.383
oke-0.371
rier-0.365
keeper-0.353
Token special
Token ID2041
Feature activation-4.67e+00

Pos logit contributions
ometer+0.408
ounded+0.399
oke+0.387
rier+0.382
 understands+0.373
Neg logit contributions
aster-0.349
e-0.328
One-0.316
 rest-0.311
 play-0.287
Tokenally
Token ID453
Feature activation+7.74e-01

Pos logit contributions
aster+0.513
e+0.491
One+0.464
 rest+0.456
 arrive+0.431
Neg logit contributions
ometer-0.474
ounded-0.473
oke-0.461
 understands-0.440
rier-0.436
Token,
Token ID11
Feature activation+8.74e-01

Pos logit contributions
e+0.022
aster+0.021
 rest+0.020
 play+0.018
 spring+0.017
Neg logit contributions
ounded-0.066
rier-0.063
ometer-0.062
oke-0.061
 puzzled-0.060
Token that
Token ID326
Feature activation-2.52e+00

Pos logit contributions
aster+0.041
e+0.038
 rest+0.037
One+0.036
 play+0.033
Neg logit contributions
ometer-0.058
ounded-0.056
oke-0.055
rier-0.054
 understands-0.053
Token's
Token ID338
Feature activation-5.54e+00

Pos logit contributions
ounded+0.143
ometer+0.143
rier+0.141
oke+0.138
 puzzled+0.132
Neg logit contributions
 rest-0.134
aster-0.132
e-0.130
One-0.127
 play-0.124
Token not
Token ID407
Feature activation+3.55e+00

Pos logit contributions
ometer+0.389
ounded+0.380
oke+0.367
rier+0.365
 puzzled+0.358
Neg logit contributions
aster-0.234
e-0.216
One-0.205
 rest-0.198
 play-0.179
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
aster+0.172
e+0.161
One+0.155
 rest+0.154
 play+0.141
Neg logit contributions
ometer-0.228
ounded-0.224
oke-0.218
rier-0.213
 understands-0.211
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
ometer+1.144
ounded+1.124
oke+1.093
rier+1.078
 understands+1.062
Neg logit contributions
aster-0.673
e-0.622
One-0.595
 rest-0.583
 play-0.524
Token Leave
Token ID17446
Feature activation+7.91e-01

Pos logit contributions
e+0.066
aster+0.064
One+0.061
 play+0.059
 rest+0.059
Neg logit contributions
ounded-0.060
 understands-0.056
rier-0.056
oke-0.054
 puzzled-0.054
Token it
Token ID340
Feature activation-4.07e+00

Pos logit contributions
aster+0.035
e+0.032
 rest+0.031
One+0.031
 play+0.028
Neg logit contributions
ometer-0.055
ounded-0.054
oke-0.053
rier-0.052
 puzzled-0.050
Token where
Token ID810
Feature activation-5.75e+00

Pos logit contributions
ounded+0.217
ometer+0.213
rier+0.200
oke+0.196
keeper+0.196
Neg logit contributions
aster-0.240
e-0.219
One-0.214
wered-0.199
 rest-0.197
Token it
Token ID340
Feature activation-4.08e+00

Pos logit contributions
ometer+0.461
ounded+0.450
oke+0.441
rier+0.441
 puzzled+0.428
Neg logit contributions
aster-0.188
 rest-0.178
e-0.168
 play-0.155
One-0.153
Token Its
Token ID6363
Feature activation-8.38e+00

Pos logit contributions
ometer+0.100
ounded+0.100
rier+0.098
oke+0.095
 understands+0.093
Neg logit contributions
aster-0.061
One-0.057
e-0.056
 rest-0.052
Once-0.048
Token brain
Token ID3632
Feature activation-4.51e+00

Pos logit contributions
ounded+0.497
ometer+0.479
oke+0.475
rier+0.454
 understands+0.426
Neg logit contributions
e-0.447
aster-0.429
 rest-0.428
One-0.419
 play-0.412
Token works
Token ID2499
Feature activation-2.58e+00

Pos logit contributions
ometer+0.271
ounded+0.267
rier+0.259
oke+0.255
keeper+0.241
Neg logit contributions
aster-0.228
e-0.225
One-0.211
 rest-0.210
 play-0.199
Token differently
Token ID10338
Feature activation-6.24e+00

Pos logit contributions
ometer+0.175
ounded+0.171
oke+0.165
 understands+0.161
rier+0.160
Neg logit contributions
aster-0.110
e-0.109
One-0.102
 rest-0.097
 time-0.091
Token than
Token ID621
Feature activation-5.54e+00

Pos logit contributions
ometer+0.476
ounded+0.476
oke+0.454
 understands+0.448
 puzzled+0.439
Neg logit contributions
aster-0.210
e-0.207
One-0.196
 rest-0.183
 time-0.174
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
ometer+0.368
ounded+0.362
oke+0.351
rier+0.346
 understands+0.341
Neg logit contributions
aster-0.253
e-0.234
One-0.225
 rest-0.219
 play-0.199
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ometer+1.248
ounded+1.230
oke+1.195
rier+1.177
 understands+1.164
Neg logit contributions
aster-0.565
e-0.520
One-0.492
 rest-0.479
 play-0.421
Tokenâ
Token ID22940
Feature activation+3.23e+00

Pos logit contributions
aster+0.050
One+0.047
e+0.047
 rest+0.043
Once+0.040
Neg logit contributions
ounded-0.081
ometer-0.080
rier-0.078
oke-0.076
 understands-0.075
Token€
Token ID26391
Feature activation+8.94e-01

Pos logit contributions
aster+0.152
e+0.137
One+0.136
 rest+0.132
 arrive+0.121
Neg logit contributions
ometer-0.210
ounded-0.209
oke-0.203
 understands-0.195
rier-0.194
Token
Token ID220
Feature activation-4.58e+00

Pos logit contributions
aster+0.044
e+0.043
One+0.040
 rest+0.039
 play+0.036
Neg logit contributions
ounded-0.056
ometer-0.056
rier-0.055
oke-0.053
keeper-0.052
Token
Token ID198
Feature activation+8.97e-02

Pos logit contributions
ometer+0.333
ounded+0.331
rier+0.319
oke+0.317
 understands+0.312
Neg logit contributions
aster-0.175
e-0.161
One-0.158
 rest-0.150
 play-0.132
Token car
Token ID1097
Feature activation+4.19e-01

Pos logit contributions
ometer+0.291
ounded+0.288
oke+0.279
rier+0.274
 understands+0.271
Neg logit contributions
aster-0.199
e-0.186
One-0.177
 rest-0.174
 play-0.159
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
aster+0.019
e+0.018
One+0.017
 rest+0.017
 play+0.015
Neg logit contributions
ometer-0.028
ounded-0.027
oke-0.027
rier-0.026
 understands-0.026
Token It
Token ID632
Feature activation-7.69e+00

Pos logit contributions
aster+0.082
e+0.078
One+0.076
 rest+0.074
 play+0.071
Neg logit contributions
ometer-0.049
ounded-0.049
oke-0.046
rier-0.045
 understands-0.044
Token's
Token ID338
Feature activation-5.55e+00

Pos logit contributions
ounded+0.413
ometer+0.409
rier+0.403
oke+0.389
 puzzled+0.385
Neg logit contributions
e-0.424
aster-0.419
 rest-0.409
One-0.404
 play-0.387
Token not
Token ID407
Feature activation+3.54e+00

Pos logit contributions
ometer+0.322
ounded+0.315
oke+0.304
rier+0.300
 understands+0.294
Neg logit contributions
aster-0.302
e-0.283
One-0.274
 rest-0.267
 play-0.247
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
aster+0.157
e+0.152
 rest+0.152
One+0.140
 play+0.138
Neg logit contributions
ometer-0.238
ounded-0.232
oke-0.229
 understands-0.221
rier-0.221
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ometer+1.084
ounded+1.065
oke+1.033
rier+1.018
 understands+1.003
Neg logit contributions
aster-0.731
e-0.679
One-0.653
 rest-0.637
 play-0.578
Token It
Token ID632
Feature activation-7.71e+00

Pos logit contributions
aster+0.081
e+0.078
One+0.075
 rest+0.073
 play+0.070
Neg logit contributions
ounded-0.050
ometer-0.049
oke-0.046
rier-0.046
 understands-0.045
Token's
Token ID338
Feature activation-5.55e+00

Pos logit contributions
ounded+0.406
ometer+0.402
rier+0.401
oke+0.376
 puzzled+0.372
Neg logit contributions
e-0.445
aster-0.431
 rest-0.420
One-0.414
 play-0.396
Token our
Token ID674
Feature activation-6.88e+00

Pos logit contributions
ometer+0.325
ounded+0.323
oke+0.312
rier+0.304
 understands+0.300
Neg logit contributions
aster-0.299
e-0.284
One-0.272
 rest-0.272
 spring-0.252
Token parents
Token ID3397
Feature activation-1.81e+00

Pos logit contributions
ounded+0.367
ometer+0.364
oke+0.355
rier+0.342
 understands+0.333
Neg logit contributions
aster-0.399
e-0.380
 rest-0.376
One-0.370
 play-0.354
Token found
Token ID1043
Feature activation-4.26e+00

Pos logit contributions
aster+0.197
One+0.176
e+0.174
 rest+0.166
 recess+0.158
Neg logit contributions
ometer-0.229
ounded-0.224
oke-0.219
 understands-0.212
 puzzled-0.206
Token it
Token ID340
Feature activation-4.07e+00

Pos logit contributions
ounded+0.336
ometer+0.332
oke+0.328
 puzzled+0.312
rier+0.304
Neg logit contributions
aster-0.153
e-0.145
 rest-0.137
One-0.126
 spring-0.115
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
ometer+0.288
ounded+0.283
oke+0.275
rier+0.272
keeper+0.266
Neg logit contributions
aster-0.171
e-0.154
One-0.146
 rest-0.142
 arrive-0.127
Token It
Token ID632
Feature activation-7.70e+00

Pos logit contributions
aster+0.080
e+0.079
One+0.075
 rest+0.072
 play+0.070
Neg logit contributions
ounded-0.052
ometer-0.050
rier-0.048
oke-0.047
 understands-0.046
Token's
Token ID338
Feature activation-5.56e+00

Pos logit contributions
ometer+0.422
ounded+0.421
rier+0.413
oke+0.395
 puzzled+0.388
Neg logit contributions
e-0.426
aster-0.405
One-0.404
 rest-0.403
 play-0.384
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
ometer+0.315
ounded+0.310
oke+0.300
rier+0.291
 understands+0.288
Neg logit contributions
aster-0.312
e-0.294
One-0.286
 rest-0.281
 play-0.261
Token."
Token ID526
Feature activation-5.41e+00

Pos logit contributions
ometer+1.131
ounded+1.112
oke+1.082
rier+1.052
 understands+1.043
Neg logit contributions
aster-0.688
e-0.648
One-0.618
 rest-0.610
 play-0.555
Token
Token ID198
Feature activation+9.52e-02

Pos logit contributions
ometer+0.383
ounded+0.376
oke+0.366
rier+0.361
 understands+0.356
Neg logit contributions
aster-0.226
e-0.208
One-0.198
 rest-0.193
 play-0.173
Token
Token ID198
Feature activation+9.89e-02

Pos logit contributions
aster+0.004
e+0.004
 rest+0.004
One+0.004
 arrive+0.004
Neg logit contributions
ometer-0.006
ounded-0.006
oke-0.006
 understands-0.006
keeper-0.006
TokenThe
Token ID464
Feature activation+2.54e+00

Pos logit contributions
aster+0.004
 rest+0.003
e+0.003
 arrive+0.003
wered+0.003
Neg logit contributions
keeper-0.007
ounded-0.007
ometer-0.007
 understands-0.007
oke-0.007
Token big
Token ID1263
Feature activation+4.51e-01

Pos logit contributions
aster+0.179
 rest+0.175
One+0.172
 end+0.168
 play+0.165
Neg logit contributions
ometer-0.106
ounded-0.099
oke-0.099
rier-0.091
atted-0.090
Token "
Token ID366
Feature activation-1.08e+01

Pos logit contributions
aster+0.060
One+0.058
 rest+0.055
 spring+0.055
e+0.054
Neg logit contributions
ounded-0.038
rier-0.035
ometer-0.035
keeper-0.033
azed-0.032
TokenNo
Token ID2949
Feature activation+3.45e+00

Pos logit contributions
ometer+0.700
ounded+0.690
 understands+0.652
keeper+0.651
rier+0.644
Neg logit contributions
aster-0.462
 rest-0.417
e-0.414
 arrive-0.392
 recess-0.391
Token,
Token ID11
Feature activation+8.70e-01

Pos logit contributions
aster+0.132
e+0.118
One+0.116
 rest+0.115
 play+0.101
Neg logit contributions
ometer-0.259
ounded-0.257
rier-0.246
oke-0.245
keeper-0.241
Token it
Token ID340
Feature activation-4.06e+00

Pos logit contributions
aster+0.039
e+0.036
One+0.034
 rest+0.034
 arrive+0.030
Neg logit contributions
ometer-0.060
ounded-0.059
oke-0.057
rier-0.056
 understands-0.055
Token is
Token ID318
Feature activation-4.01e+00

Pos logit contributions
ometer+0.212
ounded+0.212
rier+0.204
oke+0.198
keeper+0.193
Neg logit contributions
aster-0.235
e-0.235
One-0.221
 rest-0.220
 play-0.207
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
ometer+0.222
ounded+0.218
oke+0.212
rier+0.206
 understands+0.202
Neg logit contributions
aster-0.227
e-0.220
One-0.216
 rest-0.214
 spring-0.201
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ometer+1.192
ounded+1.170
oke+1.142
rier+1.131
 understands+1.114
Neg logit contributions
aster-0.629
e-0.566
One-0.543
 rest-0.522
 arrive-0.461
Token We
Token ID775
Feature activation-5.95e+00

Pos logit contributions
aster+0.081
e+0.080
One+0.075
 rest+0.074
 play+0.071
Neg logit contributions
ounded-0.049
ometer-0.047
oke-0.045
rier-0.045
 understands-0.044
Token found
Token ID1043
Feature activation-4.24e+00

Pos logit contributions
ometer+0.363
ounded+0.354
oke+0.344
 understands+0.343
 ashamed+0.325
Neg logit contributions
aster-0.297
One-0.264
e-0.250
 recess-0.242
wered-0.239
Token it
Token ID340
Feature activation-4.05e+00

Pos logit contributions
ounded+0.324
ometer+0.322
oke+0.318
 puzzled+0.301
rier+0.293
Neg logit contributions
aster-0.161
e-0.156
 rest-0.147
One-0.134
 end-0.127
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
ometer+0.280
ounded+0.274
oke+0.266
rier+0.263
keeper+0.257
Neg logit contributions
aster-0.179
e-0.159
One-0.152
 rest-0.147
wered-0.136
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ometer+0.477
ounded+0.465
rier+0.458
oke+0.453
keeper+0.442
Neg logit contributions
aster-0.263
e-0.234
One-0.234
 rest-0.215
 arrive-0.196
Token "
Token ID366
Feature activation-1.08e+01

Pos logit contributions
One+0.075
aster+0.073
 spring+0.069
e+0.068
Once+0.068
Neg logit contributions
ounded-0.052
ometer-0.049
rier-0.047
oke-0.046
ience-0.046
TokenThey
Token ID2990
Feature activation+6.26e+00

Pos logit contributions
ometer+0.668
ounded+0.649
rier+0.643
keeper+0.620
 understands+0.608
Neg logit contributions
aster-0.484
 rest-0.460
 recess-0.459
e-0.439
 arrive-0.423
Token are
Token ID389
Feature activation+1.08e+00

Pos logit contributions
aster+0.283
One+0.245
e+0.238
 recess+0.223
 rest+0.217
Neg logit contributions
ometer-0.425
ounded-0.419
 understands-0.402
oke-0.400
 puzzled-0.393
Token not
Token ID407
Feature activation+3.56e+00

Pos logit contributions
aster+0.054
e+0.051
One+0.049
 rest+0.048
 spring+0.046
Neg logit contributions
ometer-0.069
ounded-0.068
oke-0.065
 understands-0.063
rier-0.063
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
aster+0.175
e+0.170
 rest+0.164
One+0.154
 spring+0.152
Neg logit contributions
ometer-0.227
ounded-0.225
oke-0.214
keeper-0.208
 understands-0.207
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ometer+1.188
ounded+1.169
oke+1.135
rier+1.117
 understands+1.105
Neg logit contributions
aster-0.628
e-0.580
One-0.555
 rest-0.542
 play-0.485
Token They
Token ID1119
Feature activation+1.96e-01

Pos logit contributions
aster+0.068
e+0.067
One+0.064
 rest+0.061
 play+0.059
Neg logit contributions
ounded-0.061
ometer-0.060
oke-0.057
rier-0.057
 understands-0.056
Token belong
Token ID5594
Feature activation+4.69e-01

Pos logit contributions
aster+0.007
One+0.006
e+0.006
 rest+0.006
 recess+0.005
Neg logit contributions
ometer-0.015
ounded-0.014
oke-0.014
 understands-0.014
rier-0.014
Token to
Token ID284
Feature activation-4.42e+00

Pos logit contributions
aster+0.020
e+0.020
 rest+0.020
One+0.019
 play+0.018
Neg logit contributions
ometer-0.031
ounded-0.031
oke-0.031
rier-0.029
 puzzled-0.029
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
ometer+0.301
ounded+0.292
oke+0.289
rier+0.285
 understands+0.282
Neg logit contributions
aster-0.187
e-0.173
One-0.166
 rest-0.158
 arrive-0.144
Token it
Token ID340
Feature activation-4.08e+00

Pos logit contributions
ometer+0.002
ounded+0.002
oke+0.002
rier+0.002
 understands+0.002
Neg logit contributions
aster-0.001
e-0.001
One-0.001
 rest-0.001
wered-0.001
Tokenâ
Token ID22940
Feature activation+3.23e+00

Pos logit contributions
ometer+0.235
ounded+0.230
oke+0.224
rier+0.223
 puzzled+0.214
Neg logit contributions
aster-0.212
e-0.206
 rest-0.201
One-0.200
 play-0.186
Token€
Token ID26391
Feature activation+8.85e-01

Pos logit contributions
aster+0.156
One+0.145
e+0.139
 rest+0.135
wered+0.131
Neg logit contributions
ometer-0.195
ounded-0.192
oke-0.189
 understands-0.177
rier-0.175
Tokenâ„¢
Token ID8151
Feature activation+1.72e+00

Pos logit contributions
aster+0.052
e+0.049
One+0.048
 rest+0.047
 play+0.043
Neg logit contributions
ometer-0.047
ounded-0.046
oke-0.044
rier-0.044
 understands-0.043
Tokens
Token ID82
Feature activation-1.12e+01

Pos logit contributions
aster+0.096
e+0.090
One+0.088
 rest+0.086
 play+0.079
Neg logit contributions
ometer-0.096
ounded-0.094
oke-0.090
rier-0.089
 understands-0.088
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
ometer+0.721
ounded+0.710
oke+0.693
rier+0.677
 understands+0.666
Neg logit contributions
aster-0.529
e-0.495
One-0.478
 rest-0.474
 play-0.432
Tokenâ
Token ID22940
Feature activation+3.23e+00

Pos logit contributions
ometer+1.161
ounded+1.145
oke+1.118
rier+1.089
 understands+1.073
Neg logit contributions
aster-0.647
e-0.603
One-0.573
 rest-0.572
 play-0.514
Token€
Token ID26391
Feature activation+8.76e-01

Pos logit contributions
aster+0.152
One+0.144
e+0.138
wered+0.134
 rest+0.134
Neg logit contributions
ometer-0.191
ounded-0.186
oke-0.184
 understands-0.171
rier-0.169
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
aster+0.050
e+0.047
One+0.045
 rest+0.044
 play+0.041
Neg logit contributions
ometer-0.049
ounded-0.048
oke-0.046
rier-0.045
 understands-0.044
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
aster+0.072
e+0.069
One+0.068
 rest+0.065
 play+0.061
Neg logit contributions
ometer-0.058
ounded-0.057
rier-0.054
oke-0.054
 understands-0.052
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
aster+0.005
e+0.005
 rest+0.005
 wait+0.005
 walk+0.005
Neg logit contributions
oke-0.006
ometer-0.006
 understands-0.005
ounded-0.005
keeper-0.005
Token says
Token ID1139
Feature activation-9.41e+00

Pos logit contributions
aster+0.147
e+0.143
One+0.141
 rest+0.138
 play+0.134
Neg logit contributions
ometer-0.063
ounded-0.061
rier-0.057
oke-0.054
keeper-0.050
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ometer+0.704
ounded+0.691
oke+0.675
rier+0.670
 understands+0.656
Neg logit contributions
aster-0.355
e-0.319
One-0.312
 rest-0.294
 arrive-0.259
Token "
Token ID366
Feature activation-1.08e+01

Pos logit contributions
aster+0.076
One+0.075
e+0.072
 rest+0.070
 spring+0.069
Neg logit contributions
ounded-0.053
ometer-0.052
rier-0.049
oke-0.048
 understands-0.045
TokenIt
Token ID1026
Feature activation+4.83e-01

Pos logit contributions
ometer+0.673
ounded+0.639
keeper+0.629
rier+0.622
 understands+0.602
Neg logit contributions
aster-0.506
 rest-0.479
e-0.462
 recess-0.434
 walk-0.427
Token is
Token ID318
Feature activation-4.01e+00

Pos logit contributions
e+0.025
 rest+0.023
One+0.022
aster+0.022
 play+0.022
Neg logit contributions
rier-0.029
ounded-0.029
ometer-0.028
oke-0.027
 puzzled-0.027
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
ounded+0.265
ometer+0.263
oke+0.257
rier+0.247
 understands+0.245
Neg logit contributions
aster-0.188
e-0.177
One-0.171
 rest-0.171
 spring-0.161
Token!"
Token ID2474
Feature activation-7.04e+00

Pos logit contributions
ometer+1.160
ounded+1.143
oke+1.111
rier+1.087
 understands+1.077
Neg logit contributions
aster-0.658
e-0.611
One-0.586
 rest-0.573
 play-0.517
Token
Token ID198
Feature activation+9.29e-02

Pos logit contributions
ometer+0.457
ounded+0.447
oke+0.441
 understands+0.432
rier+0.424
Neg logit contributions
aster-0.329
e-0.312
One-0.283
 rest-0.280
 arrive-0.264
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
aster+0.004
e+0.004
 rest+0.004
One+0.004
 arrive+0.003
Neg logit contributions
ometer-0.006
ounded-0.006
oke-0.006
 understands-0.006
keeper-0.006
TokenThe
Token ID464
Feature activation+2.55e+00

Pos logit contributions
aster+0.005
 rest+0.005
e+0.005
 arrive+0.004
 wait+0.004
Neg logit contributions
oke-0.007
keeper-0.007
ounded-0.007
ometer-0.007
 understands-0.007
Token woman
Token ID2415
Feature activation-6.12e+00

Pos logit contributions
aster+0.183
e+0.174
One+0.171
 rest+0.168
 play+0.159
Neg logit contributions
ometer-0.105
ounded-0.102
oke-0.097
rier-0.095
 understands-0.092
Token money
Token ID1637
Feature activation-1.21e+00

Pos logit contributions
 rest+0.003
aster+0.003
e+0.002
One+0.002
 play+0.002
Neg logit contributions
ometer-0.003
oke-0.003
ounded-0.003
 understands-0.003
rier-0.003
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ometer+0.077
ounded+0.075
oke+0.074
 understands+0.073
rier+0.071
Neg logit contributions
aster-0.058
e-0.054
One-0.052
 rest-0.052
 play-0.047
Token It
Token ID632
Feature activation-7.71e+00

Pos logit contributions
aster+0.074
e+0.073
One+0.072
 rest+0.068
 play+0.066
Neg logit contributions
ounded-0.052
rier-0.051
ometer-0.050
 understands-0.049
oke-0.047
Token's
Token ID338
Feature activation-5.56e+00

Pos logit contributions
rier+0.472
ometer+0.470
joy+0.463
ounded+0.462
oke+0.449
Neg logit contributions
 rest-0.354
e-0.341
 go-0.340
One-0.321
 play-0.314
Token not
Token ID407
Feature activation+3.55e+00

Pos logit contributions
ometer+0.349
ounded+0.345
oke+0.336
rier+0.327
 understands+0.325
Neg logit contributions
aster-0.277
e-0.257
One-0.252
 rest-0.246
 spring-0.225
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
aster+0.171
 rest+0.167
e+0.157
One+0.156
 go+0.151
Neg logit contributions
ometer-0.224
oke-0.218
ounded-0.216
 understands-0.210
 puzzled-0.205
Token."
Token ID526
Feature activation-5.40e+00

Pos logit contributions
ometer+1.064
ounded+1.048
oke+1.013
 puzzled+0.988
rier+0.977
Neg logit contributions
aster-0.735
e-0.712
 rest-0.695
One-0.680
 play-0.643
Token
Token ID198
Feature activation+9.29e-02

Pos logit contributions
ometer+0.336
ounded+0.330
oke+0.318
rier+0.316
 understands+0.309
Neg logit contributions
aster-0.271
e-0.253
One-0.245
 rest-0.239
 play-0.220
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
aster+0.004
e+0.004
 rest+0.004
One+0.004
 arrive+0.004
Neg logit contributions
ometer-0.006
oke-0.006
ounded-0.006
 understands-0.005
keeper-0.005
TokenIn
Token ID818
Feature activation+8.86e+00

Pos logit contributions
aster+0.005
 rest+0.004
 recess+0.004
 arrive+0.004
e+0.004
Neg logit contributions
keeper-0.007
oke-0.007
ounded-0.007
 understands-0.007
ometer-0.007
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
aster+0.367
One+0.341
e+0.341
 rest+0.330
 play+0.293
Neg logit contributions
ometer-0.580
 puzzled-0.570
ounded-0.569
 understands-0.557
oke-0.556
Token not
Token ID407
Feature activation+3.55e+00

Pos logit contributions
aster+0.056
e+0.053
One+0.051
 rest+0.050
 play+0.046
Neg logit contributions
ometer-0.065
ounded-0.063
oke-0.061
rier-0.060
 understands-0.059
Token yours
Token ID12431
Feature activation-7.34e+00

Pos logit contributions
aster+0.175
e+0.169
One+0.160
 rest+0.159
 play+0.149
Neg logit contributions
ometer-0.230
ounded-0.222
oke-0.212
rier-0.211
keeper-0.208
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
ometer+0.548
ounded+0.538
oke+0.525
rier+0.519
 understands+0.512
Neg logit contributions
aster-0.277
e-0.250
One-0.239
 rest-0.231
 play-0.203
Token They
Token ID1119
Feature activation+1.83e-01

Pos logit contributions
aster+0.073
e+0.071
One+0.068
 rest+0.065
 play+0.063
Neg logit contributions
ounded-0.059
ometer-0.058
rier-0.055
oke-0.055
 understands-0.054
Token are
Token ID389
Feature activation+1.07e+00

Pos logit contributions
aster+0.007
One+0.006
e+0.006
 rest+0.005
 recess+0.005
Neg logit contributions
ometer-0.013
ounded-0.013
oke-0.013
 understands-0.013
rier-0.013
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
aster+0.059
e+0.056
One+0.054
 rest+0.053
 play+0.050
Neg logit contributions
ometer-0.061
ounded-0.060
oke-0.058
rier-0.056
 understands-0.055
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
ometer+1.221
ounded+1.207
oke+1.170
rier+1.149
 understands+1.139
Neg logit contributions
aster-0.588
e-0.547
One-0.520
 rest-0.508
 play-0.454
Token We
Token ID775
Feature activation-5.95e+00

Pos logit contributions
aster+0.073
e+0.071
One+0.069
 rest+0.066
 play+0.063
Neg logit contributions
ounded-0.059
ometer-0.058
rier-0.055
oke-0.055
 understands-0.053
Token made
Token ID925
Feature activation+1.12e+00

Pos logit contributions
ometer+0.361
ounded+0.357
oke+0.347
 understands+0.344
 ashamed+0.327
Neg logit contributions
aster-0.299
One-0.261
e-0.249
 recess-0.239
wered-0.237
Token them
Token ID606
Feature activation+2.07e+00

Pos logit contributions
aster+0.045
e+0.041
One+0.039
 rest+0.039
 play+0.035
Neg logit contributions
ometer-0.082
ounded-0.081
oke-0.079
rier-0.077
 understands-0.076
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
aster+0.103
e+0.095
One+0.093
 rest+0.085
wered+0.081
Neg logit contributions
ometer-0.128
ounded-0.127
rier-0.123
keeper-0.120
oke-0.120
Token But
Token ID887
Feature activation-1.78e+00

Pos logit contributions
aster+0.083
e+0.078
One+0.075
 rest+0.075
 arrive+0.071
Neg logit contributions
ometer-0.049
ounded-0.046
oke-0.044
rier-0.043
 understands-0.042
Token you
Token ID345
Feature activation+4.87e+00

Pos logit contributions
ometer+0.119
ounded+0.116
oke+0.113
rier+0.112
 understands+0.110
Neg logit contributions
aster-0.081
e-0.074
One-0.071
 rest-0.067
 recess-0.062
Token have
Token ID423
Feature activation+4.01e+00

Pos logit contributions
aster+0.235
e+0.210
One+0.208
 rest+0.195
 recess+0.188
Neg logit contributions
ometer-0.312
ounded-0.304
oke-0.298
 understands-0.287
 puzzled-0.283
Token to
Token ID284
Feature activation-4.43e+00

Pos logit contributions
aster+0.178
One+0.173
 rest+0.169
e+0.163
 spring+0.158
Neg logit contributions
rier-0.281
ounded-0.280
ometer-0.274
oke-0.273
rily-0.253
Token try
Token ID1949
Feature activation+1.37e+00

Pos logit contributions
 ashamed+0.205
 darling+0.203
 puzzled+0.202
rier+0.201
 dear+0.200
Neg logit contributions
aster-0.188
he-0.185
P-0.184
 recess-0.179
B-0.172
Token ours
Token ID16903
Feature activation-1.62e+01

Pos logit contributions
aster+0.066
One+0.061
e+0.060
 rest+0.059
 play+0.055
Neg logit contributions
ounded-0.090
ometer-0.089
oke-0.086
rier-0.084
keeper-0.082
Token too
Token ID1165
Feature activation-2.88e+00

Pos logit contributions
ometer+1.089
ounded+1.071
oke+1.043
rier+1.015
 understands+1.000
Neg logit contributions
aster-0.733
e-0.687
One-0.662
 rest-0.651
 play-0.593
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
ometer+0.192
ounded+0.188
oke+0.183
rier+0.180
 understands+0.178
Neg logit contributions
aster-0.132
e-0.122
One-0.118
 rest-0.115
 play-0.104
Token We
Token ID775
Feature activation-5.97e+00

Pos logit contributions
aster+0.082
e+0.077
 rest+0.076
One+0.075
 walk+0.071
Neg logit contributions
ometer-0.049
ounded-0.046
oke-0.045
rier-0.043
 understands-0.042
Token can
Token ID460
Feature activation-5.80e+00

Pos logit contributions
ometer+0.350
ounded+0.337
oke+0.327
 understands+0.319
 puzzled+0.311
Neg logit contributions
aster-0.319
One-0.284
e-0.278
 rest-0.265
 recess-0.261
Token help
Token ID1037
Feature activation+8.31e+00

Pos logit contributions
keeper+0.316
ometer+0.311
ounded+0.307
 puzzled+0.307
rier+0.304
Neg logit contributions
aster-0.269
 recess-0.226
P-0.215
Time-0.200
e-0.199
Token Tim
Token ID5045
Feature activation+2.80e+00

Pos logit contributions
aster+0.428
e+0.402
One+0.381
 rest+0.361
 skip+0.339
Neg logit contributions
ometer-0.525
ounded-0.508
oke-0.498
keeper-0.470
 understands-0.470
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
aster+0.165
e+0.147
 recess+0.146
wered+0.144
 arrive+0.142
Neg logit contributions
 understands-0.133
ometer-0.133
oke-0.132
ounded-0.126
 puzzled-0.120
Token Tim
Token ID5045
Feature activation+2.80e+00

Pos logit contributions
aster+0.069
e+0.064
 rest+0.060
One+0.058
 recess+0.057
Neg logit contributions
ometer-0.063
oke-0.059
ounded-0.058
 understands-0.055
 puzzled-0.055
Token loved
Token ID6151
Feature activation-2.60e+00

Pos logit contributions
aster+0.169
One+0.150
e+0.147
 rest+0.145
 recess+0.144
Neg logit contributions
ometer-0.141
 understands-0.136
ounded-0.134
oke-0.132
 puzzled-0.127
Token to
Token ID284
Feature activation-4.43e+00

Pos logit contributions
ometer+0.132
ounded+0.125
oke+0.121
 understands+0.118
rier+0.117
Neg logit contributions
aster-0.158
e-0.152
One-0.147
 rest-0.142
 arrive-0.133
Token go
Token ID467
Feature activation+1.12e+01

Pos logit contributions
 teeth+0.280
ometer+0.272
 eyes+0.265
 dear+0.255
 son+0.253
Neg logit contributions
One-0.150
e-0.137
he-0.133
P-0.130
pped-0.128
Token fishing
Token ID12478
Feature activation+5.49e-01

Pos logit contributions
aster+0.649
e+0.615
One+0.571
wered+0.529
 end+0.510
Neg logit contributions
ometer-0.601
ounded-0.599
oke-0.531
 understands-0.514
 puzzled-0.513
Token with
Token ID351
Feature activation+3.73e+00

Pos logit contributions
aster+0.026
e+0.022
One+0.021
 arrive+0.021
 rest+0.021
Neg logit contributions
ometer-0.035
ounded-0.034
rier-0.033
keeper-0.032
oke-0.032
Token his
Token ID465
Feature activation+5.98e-01

Pos logit contributions
aster+0.174
e+0.171
One+0.159
 rest+0.153
 arrive+0.152
Neg logit contributions
ometer-0.225
 teeth-0.212
 understands-0.212
rier-0.211
ounded-0.208
Token dad
Token ID9955
Feature activation-3.83e+00

Pos logit contributions
 rest+0.030
aster+0.029
 play+0.028
e+0.028
One+0.028
Neg logit contributions
ounded-0.037
oke-0.036
rier-0.036
ometer-0.035
keeper-0.033
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ometer+0.263
oke+0.247
ounded+0.247
keeper+0.239
rier+0.237
Neg logit contributions
aster-0.169
e-0.146
One-0.143
wered-0.139
 rest-0.138
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
aster+0.019
e+0.018
One+0.017
 rest+0.017
 play+0.015
Neg logit contributions
ometer-0.028
ounded-0.027
oke-0.027
rier-0.026
 understands-0.026
Tokenâ
Token ID22940
Feature activation+3.22e+00

Pos logit contributions
aster+0.078
e+0.074
One+0.072
 rest+0.071
 play+0.067
Neg logit contributions
ometer-0.053
ounded-0.051
oke-0.049
rier-0.048
 understands-0.047
Token€
Token ID26391
Feature activation+8.82e-01

Pos logit contributions
aster+0.151
One+0.144
e+0.135
 rest+0.132
 recess+0.127
Neg logit contributions
ounded-0.204
oke-0.197
ometer-0.197
 understands-0.183
keeper-0.177
Token\n
Token ID198
Feature activation+9.29e-02

Pos logit contributions
aster+0.047
e+0.045
One+0.043
 rest+0.042
 play+0.039
Neg logit contributions
ometer-0.052
ounded-0.051
rier-0.051
oke-0.049
 puzzled-0.047
Token\n
Token ID198
Feature activation+1.11e-01

Pos logit contributions
aster+0.005
e+0.005
 recess+0.004
 wait+0.004
 rest+0.004
Neg logit contributions
oke-0.005
 understands-0.005
ometer-0.005
ounded-0.005
 puzzled-0.005
TokenTim
Token ID14967
Feature activation+9.32e+00

Pos logit contributions
aster+0.005
e+0.004
 recess+0.004
 rest+0.004
 walk+0.004
Neg logit contributions
oke-0.007
ounded-0.007
 understands-0.007
keeper-0.006
 pleased-0.006
Tokenmy
Token ID1820
Feature activation-7.86e-01

Pos logit contributions
e+0.364
aster+0.360
 rest+0.320
One+0.317
 play+0.305
Neg logit contributions
rier-0.676
ounded-0.674
ometer-0.669
oke-0.635
 puzzled-0.621
Token just
Token ID655
Feature activation+4.28e+00

Pos logit contributions
rier+0.051
ometer+0.050
ounded+0.050
oke+0.048
 puzzled+0.045
Neg logit contributions
e-0.037
aster-0.037
 rest-0.034
One-0.032
 play-0.032
Token stared
Token ID18484
Feature activation-1.61e+00

Pos logit contributions
aster+0.221
e+0.220
 rest+0.212
 play+0.204
 walk+0.192
Neg logit contributions
ometer-0.260
rier-0.258
ounded-0.245
oke-0.241
keeper-0.233
Token in
Token ID287
Feature activation+4.11e+00

Pos logit contributions
ometer+0.107
ounded+0.105
oke+0.104
 understands+0.100
keeper+0.093
Neg logit contributions
aster-0.072
e-0.069
 rest-0.066
One-0.064
 play-0.062
Token amaz
Token ID40642
Feature activation-1.18e+01

Pos logit contributions
aster+0.243
e+0.229
One+0.223
 rest+0.217
 arrive+0.201
Neg logit contributions
ometer-0.212
ounded-0.208
rier-0.201
oke-0.200
 understands-0.198
Token that
Token ID326
Feature activation-2.19e+00

Pos logit contributions
ometer+0.396
ounded+0.389
oke+0.377
rier+0.372
 understands+0.367
Neg logit contributions
aster-0.260
e-0.239
One-0.231
 rest-0.224
 play-0.202
Token appear
Token ID1656
Feature activation+5.92e+00

Pos logit contributions
ometer+0.163
ounded+0.159
rier+0.157
oke+0.155
keeper+0.151
Neg logit contributions
aster-0.082
e-0.078
 rest-0.077
One-0.070
 play-0.067
Token risky
Token ID17564
Feature activation-9.50e+00

Pos logit contributions
e+0.259
 rest+0.258
aster+0.254
One+0.246
 time+0.238
Neg logit contributions
ometer-0.381
ounded-0.363
oke-0.356
rier-0.350
 puzzled-0.338
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
ometer+0.675
ounded+0.668
oke+0.650
rier+0.633
 understands+0.627
Neg logit contributions
aster-0.375
e-0.357
 rest-0.341
One-0.337
 play-0.306
Token<|endoftext|>
Token ID50256
Feature activation+4.39e-01

Pos logit contributions
aster+0.060
e+0.057
One+0.055
 rest+0.052
Once+0.046
Neg logit contributions
ometer-0.109
ounded-0.108
rier-0.105
oke-0.103
 understands-0.100
TokenSam
Token ID16305
Feature activation+8.24e+00

Pos logit contributions
aster+0.025
e+0.023
One+0.021
 rest+0.021
 wait+0.021
Neg logit contributions
ometer-0.024
oke-0.023
ounded-0.023
 understands-0.022
keeper-0.022
Token was
Token ID373
Feature activation-1.46e+00

Pos logit contributions
aster+0.382
e+0.361
One+0.343
 rest+0.338
 play+0.315
Neg logit contributions
ometer-0.546
ounded-0.538
rier-0.526
oke-0.516
keeper-0.494
Token feeling
Token ID4203
Feature activation+6.49e-01

Pos logit contributions
ometer+0.094
ounded+0.092
 puzzled+0.091
rier+0.089
oke+0.089
Neg logit contributions
aster-0.067
One-0.060
e-0.060
 rest-0.058
 recess-0.053
Token upset
Token ID9247
Feature activation-5.17e+00

Pos logit contributions
aster+0.036
e+0.034
One+0.033
 rest+0.032
 play+0.030
Neg logit contributions
ometer-0.038
ounded-0.037
oke-0.036
rier-0.034
 understands-0.034
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
ometer+0.366
ounded+0.361
oke+0.350
rier+0.344
 understands+0.341
Neg logit contributions
aster-0.213
e-0.197
One-0.190
 rest-0.185
 play-0.167
Token He
Token ID679
Feature activation-1.99e+00

Pos logit contributions
aster+0.097
One+0.092
e+0.091
 rest+0.087
 play+0.084
Neg logit contributions
ounded-0.078
ometer-0.077
oke-0.073
rier-0.073
 understands-0.071
Token and
Token ID290
Feature activation+1.74e+00

Pos logit contributions
aster+0.233
e+0.216
One+0.209
 rest+0.202
 play+0.183
Neg logit contributions
ometer-0.322
ounded-0.316
oke-0.305
rier-0.302
 understands-0.296
Token pushed
Token ID7121
Feature activation+4.42e+00

Pos logit contributions
aster+0.077
e+0.073
 rest+0.072
 play+0.067
One+0.065
Neg logit contributions
ometer-0.120
oke-0.117
rier-0.115
ounded-0.114
 understands-0.109
Token the
Token ID262
Feature activation-7.82e-01

Pos logit contributions
aster+0.230
One+0.210
e+0.202
 arrive+0.191
 rest+0.189
Neg logit contributions
ounded-0.256
ometer-0.255
rier-0.252
 puzzled-0.238
keeper-0.237
Token button
Token ID4936
Feature activation-3.72e-01

Pos logit contributions
ometer+0.049
oke+0.046
ounded+0.046
rier+0.045
 understands+0.044
Neg logit contributions
aster-0.039
 rest-0.037
e-0.035
One-0.035
 end-0.033
Token to
Token ID284
Feature activation-4.04e+00

Pos logit contributions
ometer+0.022
ounded+0.022
rier+0.021
oke+0.020
keeper+0.020
Neg logit contributions
aster-0.020
e-0.018
One-0.018
 rest-0.017
wered-0.016
Token go
Token ID467
Feature activation+1.16e+01

Pos logit contributions
rier+0.230
ounded+0.228
ometer+0.221
keeper+0.218
 understands+0.213
Neg logit contributions
P-0.183
One-0.176
e-0.170
aster-0.167
Once-0.156
Token down
Token ID866
Feature activation-9.42e+00

Pos logit contributions
aster+0.655
e+0.609
One+0.578
 rest+0.551
wered+0.513
Neg logit contributions
ounded-0.660
ometer-0.659
rier-0.629
keeper-0.605
oke-0.605
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
ometer+0.560
rier+0.554
ounded+0.549
keeper+0.541
 puzzled+0.517
Neg logit contributions
aster-0.464
e-0.430
One-0.407
 rest-0.393
P-0.390
Token It
Token ID632
Feature activation-7.28e+00

Pos logit contributions
aster+0.082
 rest+0.080
One+0.075
e+0.075
 play+0.075
Neg logit contributions
rier-0.094
ounded-0.091
oke-0.091
ometer-0.088
ener-0.087
Token took
Token ID1718
Feature activation+4.32e+00

Pos logit contributions
ometer+0.468
ounded+0.449
oke+0.445
rier+0.443
 understands+0.423
Neg logit contributions
aster-0.345
e-0.332
One-0.320
 rest-0.317
 play-0.293
Token him
Token ID683
Feature activation+6.62e+00

Pos logit contributions
aster+0.205
One+0.184
e+0.182
 rest+0.170
 arrive+0.159
Neg logit contributions
ometer-0.279
ounded-0.276
rier-0.266
oke-0.264
 understands-0.262
Token Max
Token ID5436
Feature activation+1.82e+00

Pos logit contributions
aster+0.100
e+0.092
 rest+0.085
One+0.084
 play+0.076
Neg logit contributions
ounded-0.205
ometer-0.205
oke-0.198
rier-0.193
 understands-0.192
Token at
Token ID379
Feature activation-3.92e+00

Pos logit contributions
aster+0.088
One+0.079
e+0.077
 rest+0.074
wered+0.071
Neg logit contributions
ometer-0.116
oke-0.112
ounded-0.112
rier-0.109
keeper-0.108
Token all
Token ID477
Feature activation+3.46e+00

Pos logit contributions
ometer+0.254
ounded+0.243
oke+0.241
rier+0.240
 understands+0.238
Neg logit contributions
aster-0.172
e-0.159
One-0.157
 rest-0.144
P-0.136
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
aster+0.139
e+0.125
One+0.121
 rest+0.117
 arrive+0.105
Neg logit contributions
ometer-0.243
oke-0.240
ounded-0.237
rier-0.234
 understands-0.230
Token<|endoftext|>
Token ID50256
Feature activation+8.51e-02

Pos logit contributions
aster+0.051
One+0.048
e+0.048
 rest+0.045
 play+0.043
Neg logit contributions
ounded-0.080
ometer-0.079
rier-0.078
oke-0.077
keeper-0.073
TokenBen
Token ID11696
Feature activation+9.21e+00

Pos logit contributions
aster+0.005
 wait+0.005
e+0.005
 rest+0.005
wered+0.005
Neg logit contributions
ometer-0.004
oke-0.004
 understands-0.004
keeper-0.003
ounded-0.003
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
aster+0.436
e+0.404
One+0.390
 rest+0.380
 play+0.345
Neg logit contributions
ometer-0.597
ounded-0.585
oke-0.568
rier-0.559
 understands-0.553
Token Lily
Token ID20037
Feature activation+1.27e+00

Pos logit contributions
aster+0.051
e+0.046
One+0.045
 rest+0.043
 arrive+0.037
Neg logit contributions
ometer-0.103
ounded-0.101
oke-0.098
rier-0.097
 understands-0.096
Token were
Token ID547
Feature activation+1.81e+00

Pos logit contributions
aster+0.073
One+0.065
e+0.060
 recess+0.060
wered+0.060
Neg logit contributions
 understands-0.070
ounded-0.065
ometer-0.064
oke-0.062
 puzzled-0.062
Token playing
Token ID2712
Feature activation+4.54e+00

Pos logit contributions
aster+0.101
e+0.092
One+0.091
 rest+0.086
 arrive+0.082
Neg logit contributions
ometer-0.102
ounded-0.099
oke-0.098
 puzzled-0.096
 understands-0.096
Token in
Token ID287
Feature activation+4.10e+00

Pos logit contributions
aster+0.239
One+0.224
 arrive+0.208
K+0.190
 rest+0.189
Neg logit contributions
rier-0.240
ometer-0.231
keeper-0.220
 puzzled-0.220
ounded-0.213
Token\n
Token ID198
Feature activation+2.41e-01

Pos logit contributions
aster+0.059
e+0.055
One+0.054
 rest+0.051
 play+0.047
Neg logit contributions
ometer-0.088
ounded-0.088
rier-0.085
oke-0.083
 understands-0.082
Token\n
Token ID198
Feature activation+2.40e-01

Pos logit contributions
aster+0.011
e+0.010
 rest+0.009
 arrive+0.009
 wait+0.009
Neg logit contributions
ometer-0.015
oke-0.015
ounded-0.015
 understands-0.015
keeper-0.014
TokenOne
Token ID3198
Feature activation+9.63e+00

Pos logit contributions
e+0.006
 recess+0.006
 arrive+0.006
aster+0.006
 rest+0.005
Neg logit contributions
oke-0.018
ounded-0.017
keeper-0.017
ometer-0.017
 understands-0.016
Token day
Token ID1110
Feature activation+6.18e+00

Pos logit contributions
aster+0.259
e+0.232
One+0.224
 rest+0.200
 arrive+0.185
Neg logit contributions
ometer-0.809
ounded-0.782
oke-0.777
 understands-0.756
 puzzled-0.750
Token,
Token ID11
Feature activation+9.71e-01

Pos logit contributions
aster+0.281
e+0.259
One+0.252
 rest+0.239
 arrive+0.219
Neg logit contributions
ometer-0.412
ounded-0.402
oke-0.393
rier-0.383
 understands-0.380
Token Tim
Token ID5045
Feature activation+2.90e+00

Pos logit contributions
aster+0.048
e+0.045
One+0.043
 rest+0.041
 arrive+0.038
Neg logit contributions
ometer-0.060
ounded-0.059
rier-0.056
keeper-0.056
oke-0.056
Token met
Token ID1138
Feature activation-3.30e+00

Pos logit contributions
aster+0.143
e+0.140
 rest+0.131
One+0.131
 play+0.125
Neg logit contributions
ometer-0.184
rier-0.182
ounded-0.180
oke-0.172
keeper-0.164
Token a
Token ID257
Feature activation+2.82e+00

Pos logit contributions
ometer+0.276
ounded+0.271
oke+0.266
rier+0.263
 understands+0.256
Neg logit contributions
aster-0.098
e-0.090
One-0.084
 rest-0.081
 play-0.070
Token sad
Token ID6507
Feature activation-9.96e+00

Pos logit contributions
aster+0.150
 rest+0.150
 play+0.146
One+0.144
 spring+0.141
Neg logit contributions
ometer-0.158
ounded-0.158
atted-0.157
rier-0.154
oke-0.151
Token girl
Token ID2576
Feature activation-7.27e+00

Pos logit contributions
ounded+0.517
ometer+0.506
oke+0.477
 understands+0.470
rier+0.467
Neg logit contributions
aster-0.610
e-0.555
 rest-0.554
One-0.544
 play-0.527
Token named
Token ID3706
Feature activation+8.54e+00

Pos logit contributions
ometer+0.474
oke+0.472
ounded+0.465
 puzzled+0.460
keeper+0.454
Neg logit contributions
aster-0.316
One-0.283
e-0.267
 rest-0.264
wered-0.257
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
aster+0.198
e+0.192
 rest+0.187
One+0.185
 play+0.169
Neg logit contributions
ometer-0.369
ounded-0.361
rier-0.350
oke-0.347
 understands-0.337
Token Your
Token ID3406
Feature activation-6.59e+00

Pos logit contributions
aster+0.079
e+0.077
One+0.072
 rest+0.070
 play+0.068
Neg logit contributions
ounded-0.084
ometer-0.078
rier-0.077
oke-0.076
keeper-0.073
Token ball
Token ID2613
Feature activation-1.06e+00

Pos logit contributions
ounded+0.430
oke+0.427
ometer+0.418
rier+0.417
keeper+0.385
Neg logit contributions
 rest-0.304
aster-0.300
 play-0.294
e-0.287
 spring-0.278
Token is
Token ID318
Feature activation-3.76e+00

Pos logit contributions
ometer+0.055
ounded+0.055
oke+0.053
rier+0.052
 puzzled+0.050
Neg logit contributions
aster-0.061
e-0.060
 rest-0.058
One-0.057
 play-0.055
Token not
Token ID407
Feature activation+3.83e+00

Pos logit contributions
ometer+0.260
ounded+0.257
oke+0.250
rier+0.239
 understands+0.236
Neg logit contributions
aster-0.165
e-0.158
One-0.151
 rest-0.149
 spring-0.138
Token slow
Token ID3105
Feature activation+5.75e+00

Pos logit contributions
 rest+0.158
e+0.154
aster+0.153
 play+0.146
One+0.142
Neg logit contributions
ometer-0.265
ounded-0.264
oke-0.257
keeper-0.245
 understands-0.242
Token or
Token ID393
Feature activation+3.22e+00

Pos logit contributions
e+0.157
aster+0.153
 rest+0.152
One+0.144
 play+0.140
Neg logit contributions
ometer-0.459
ounded-0.457
oke-0.436
 understands-0.433
keeper-0.431
Token dumb
Token ID13526
Feature activation-5.43e+00

Pos logit contributions
aster+0.153
 rest+0.146
e+0.144
 play+0.141
One+0.140
Neg logit contributions
ometer-0.208
ounded-0.207
oke-0.204
rier-0.195
keeper-0.190
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
ounded+0.438
ometer+0.435
 puzzled+0.409
oke+0.406
rier+0.403
Neg logit contributions
aster-0.162
e-0.153
One-0.148
 rest-0.146
 play-0.133
Token It
Token ID632
Feature activation-7.42e+00

Pos logit contributions
aster+0.090
e+0.089
One+0.084
 rest+0.080
 play+0.078
Neg logit contributions
ounded-0.075
ometer-0.068
oke-0.067
rier-0.067
keeper-0.066
Token was
Token ID373
Feature activation-1.54e+00

Pos logit contributions
ounded+0.427
ometer+0.425
rier+0.424
oke+0.395
 puzzled+0.383
Neg logit contributions
e-0.394
 rest-0.373
aster-0.366
One-0.361
 play-0.355
Token too
Token ID1165
Feature activation-2.29e+00

Pos logit contributions
ometer+0.061
 puzzled+0.060
ounded+0.059
rier+0.059
 ashamed+0.056
Neg logit contributions
aster-0.069
e-0.065
One-0.060
 rest-0.058
 recess-0.057
Token tight
Token ID5381
Feature activation+5.84e-01

Pos logit contributions
ometer+0.137
ounded+0.134
oke+0.129
rier+0.126
 understands+0.125
Neg logit contributions
aster-0.123
e-0.115
One-0.108
 rest-0.106
 arrive-0.099
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
aster+0.025
e+0.023
One+0.022
 rest+0.022
 play+0.020
Neg logit contributions
ometer-0.040
ounded-0.040
oke-0.039
rier-0.038
 understands-0.037
Token He
Token ID679
Feature activation-1.93e+00

Pos logit contributions
aster+0.075
One+0.074
 play+0.068
e+0.067
Once+0.067
Neg logit contributions
ounded-0.108
rier-0.107
oke-0.101
ometer-0.101
 understands-0.100
Token could
Token ID714
Feature activation-6.21e+00

Pos logit contributions
rier+0.135
ometer+0.132
ounded+0.129
oke+0.125
keeper+0.115
Neg logit contributions
aster-0.085
e-0.083
One-0.083
 play-0.078
 read-0.077
Token not
Token ID407
Feature activation+3.94e+00

Pos logit contributions
ometer+0.409
ounded+0.403
 puzzled+0.399
 ashamed+0.383
 confused+0.373
Neg logit contributions
aster-0.265
One-0.242
e-0.236
 recess-0.231
P-0.216
Token wear
Token ID5806
Feature activation+8.48e+00

Pos logit contributions
P+0.172
 Sundays+0.167
 recess+0.165
One+0.164
 course+0.163
Neg logit contributions
ounded-0.208
 ashamed-0.205
 puzzled-0.198
 confused-0.198
ener-0.188
Token it
Token ID340
Feature activation-3.74e+00

Pos logit contributions
aster+0.401
e+0.376
One+0.360
 rest+0.358
 play+0.328
Neg logit contributions
ometer-0.554
ounded-0.545
oke-0.535
rier-0.518
 understands-0.511
Token anymore
Token ID7471
Feature activation-5.53e+00

Pos logit contributions
ometer+0.232
ounded+0.229
rier+0.216
oke+0.212
keeper+0.210
Neg logit contributions
aster-0.193
e-0.172
One-0.164
 rest-0.154
wered-0.153
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
ometer+0.401
ounded+0.390
oke+0.383
rier+0.374
 understands+0.374
Neg logit contributions
aster-0.221
e-0.199
One-0.192
 rest-0.183
 arrive-0.165
Token Tim
Token ID5045
Feature activation+3.06e+00

Pos logit contributions
One+0.055
aster+0.054
e+0.048
 rest+0.048
Once+0.047
Neg logit contributions
rier-0.109
ounded-0.108
ometer-0.105
 puzzled-0.101
ener-0.100
Token old
Token ID1468
Feature activation-9.59e+00

Pos logit contributions
aster+0.019
e+0.018
One+0.017
 rest+0.016
 arrive+0.016
Neg logit contributions
ounded-0.021
ometer-0.020
oke-0.020
rier-0.020
 puzzled-0.019
Token man
Token ID582
Feature activation-3.73e+00

Pos logit contributions
ounded+0.504
ometer+0.501
oke+0.489
rier+0.469
 understands+0.457
Neg logit contributions
aster-0.576
 rest-0.537
e-0.527
One-0.520
 play-0.504
Token to
Token ID284
Feature activation-4.05e+00

Pos logit contributions
ometer+0.250
ounded+0.244
oke+0.241
rier+0.233
 understands+0.231
Neg logit contributions
aster-0.165
e-0.149
One-0.148
 rest-0.142
wered-0.129
Token do
Token ID466
Feature activation+2.32e+00

Pos logit contributions
ounded+0.279
 ashamed+0.277
 understands+0.277
oke+0.274
ometer+0.273
Neg logit contributions
aster-0.108
P-0.101
e-0.101
One-0.101
Once-0.083
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
aster+0.098
e+0.088
One+0.085
 rest+0.082
 arrive+0.075
Neg logit contributions
ometer-0.159
ounded-0.158
oke-0.153
rier-0.151
 understands-0.150
Token\n
Token ID198
Feature activation+5.26e-01

Pos logit contributions
aster+0.068
One+0.066
 rest+0.060
e+0.059
 play+0.059
Neg logit contributions
ometer-0.106
ounded-0.105
rier-0.103
oke-0.100
 understands-0.099
Token\n
Token ID198
Feature activation+5.29e-01

Pos logit contributions
aster+0.027
e+0.026
 rest+0.023
 arrive+0.023
 wait+0.022
Neg logit contributions
ounded-0.031
oke-0.031
ometer-0.031
 understands-0.029
keeper-0.029
TokenThe
Token ID464
Feature activation+2.99e+00

Pos logit contributions
aster+0.020
e+0.019
 rest+0.018
 arrive+0.017
 recess+0.017
Neg logit contributions
ounded-0.036
oke-0.035
keeper-0.035
ometer-0.034
 puzzled-0.034
Token old
Token ID1468
Feature activation-9.55e+00

Pos logit contributions
aster+0.218
 rest+0.206
One+0.199
e+0.196
 end+0.194
Neg logit contributions
ometer-0.130
ounded-0.126
oke-0.116
rier-0.113
atted-0.108
Token man
Token ID582
Feature activation-3.68e+00

Pos logit contributions
ounded+0.542
ometer+0.539
oke+0.508
rier+0.503
azed+0.483
Neg logit contributions
aster-0.561
 rest-0.506
One-0.492
e-0.478
 read-0.477
Token then
Token ID788
Feature activation+3.89e-01

Pos logit contributions
ometer+0.217
rier+0.208
ounded+0.205
oke+0.189
 understands+0.188
Neg logit contributions
aster-0.198
e-0.194
One-0.182
 rest-0.180
 play-0.174
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
aster+0.181
e+0.167
 rest+0.167
One+0.163
 spring+0.158
Neg logit contributions
ometer-0.255
oke-0.253
ounded-0.249
rier-0.237
 understands-0.236
Token jazz
Token ID21274
Feature activation+4.18e+00

Pos logit contributions
ometer+0.070
ounded+0.069
oke+0.067
rier+0.065
 understands+0.064
Neg logit contributions
aster-0.061
e-0.057
One-0.055
 rest-0.055
 play-0.050
Token music
Token ID2647
Feature activation+2.34e-01

Pos logit contributions
aster+0.210
e+0.195
One+0.194
 rest+0.181
wered+0.169
Neg logit contributions
ometer-0.259
ounded-0.252
oke-0.242
rier-0.240
 understands-0.237
Token back
Token ID736
Feature activation-4.59e+00

Pos logit contributions
aster+0.012
One+0.011
e+0.011
 rest+0.010
wered+0.009
Neg logit contributions
ometer-0.015
ounded-0.014
oke-0.014
rier-0.014
keeper-0.014
Token on
Token ID319
Feature activation+4.08e+00

Pos logit contributions
ometer+0.300
ounded+0.298
oke+0.296
rier+0.287
 understands+0.282
Neg logit contributions
aster-0.202
e-0.192
One-0.185
 rest-0.182
 play-0.171
Token and
Token ID290
Feature activation+1.38e+00

Pos logit contributions
aster+0.205
One+0.187
e+0.183
 rest+0.182
P+0.173
Neg logit contributions
rier-0.223
ometer-0.219
keeper-0.213
 eyes-0.208
 teeth-0.205
Token danced
Token ID39480
Feature activation+7.05e+00

Pos logit contributions
aster+0.059
e+0.055
 rest+0.053
 play+0.050
One+0.050
Neg logit contributions
ometer-0.096
ounded-0.094
oke-0.093
rier-0.091
 understands-0.088
Token around
Token ID1088
Feature activation-6.72e+00

Pos logit contributions
aster+0.334
e+0.311
One+0.300
 rest+0.293
 play+0.268
Neg logit contributions
ometer-0.458
ounded-0.448
oke-0.436
rier-0.429
 understands-0.423
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
ometer+0.431
rier+0.422
ounded+0.421
oke+0.411
 understands+0.409
Neg logit contributions
aster-0.303
One-0.273
e-0.256
 rest-0.247
wered-0.233
Token living
Token ID2877
Feature activation-1.74e+00

Pos logit contributions
ometer+0.068
ounded+0.066
oke+0.064
rier+0.063
 understands+0.063
Neg logit contributions
aster-0.064
e-0.059
One-0.058
 rest-0.055
 arrive-0.051
Token room
Token ID2119
Feature activation-3.03e+00

Pos logit contributions
ometer+0.114
ounded+0.112
oke+0.110
keeper+0.101
rier+0.101
Neg logit contributions
e-0.086
aster-0.084
 rest-0.074
 play-0.073
 recess-0.073
Token make
Token ID787
Feature activation+2.01e+00

Pos logit contributions
ometer+0.318
keeper+0.293
endant+0.282
 understands+0.275
rier+0.269
Neg logit contributions
aster-0.240
 recess-0.225
P-0.218
Time-0.206
 Saturday-0.204
Token new
Token ID649
Feature activation-6.57e+00

Pos logit contributions
aster+0.082
e+0.076
One+0.071
 rest+0.070
 play+0.064
Neg logit contributions
ounded-0.146
ometer-0.146
oke-0.141
rier-0.136
 understands-0.135
Token paper
Token ID3348
Feature activation-6.90e-01

Pos logit contributions
ounded+0.421
ometer+0.414
oke+0.397
rier+0.375
keeper+0.363
Neg logit contributions
e-0.330
aster-0.322
 rest-0.312
 play-0.308
One-0.296
Token animals
Token ID4695
Feature activation+6.50e-01

Pos logit contributions
ounded+0.047
ometer+0.047
oke+0.045
 understands+0.043
rier+0.043
Neg logit contributions
aster-0.030
e-0.029
One-0.027
 rest-0.027
 play-0.025
Token and
Token ID290
Feature activation+1.43e+00

Pos logit contributions
aster+0.029
e+0.026
One+0.026
 rest+0.025
wered+0.022
Neg logit contributions
ometer-0.044
ounded-0.043
oke-0.042
rier-0.042
 understands-0.040
Token shapes
Token ID15268
Feature activation-7.72e+00

Pos logit contributions
aster+0.070
e+0.065
 rest+0.064
One+0.062
 play+0.061
Neg logit contributions
ometer-0.092
ounded-0.091
oke-0.088
rier-0.086
keeper-0.083
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
ometer+0.515
rier+0.499
oke+0.492
ounded+0.489
 understands+0.478
Neg logit contributions
aster-0.351
e-0.310
One-0.302
 rest-0.294
wered-0.273
Token But
Token ID887
Feature activation-1.74e+00

Pos logit contributions
aster+0.084
e+0.079
 rest+0.076
One+0.076
 walk+0.072
Neg logit contributions
ometer-0.054
ounded-0.050
oke-0.050
rier-0.048
 understands-0.047
Token this
Token ID428
Feature activation-3.88e+00

Pos logit contributions
ometer+0.121
ounded+0.118
oke+0.116
rier+0.115
 understands+0.113
Neg logit contributions
aster-0.073
e-0.067
One-0.063
 rest-0.060
 arrive-0.054
Token time
Token ID640
Feature activation+9.28e+00

Pos logit contributions
ometer+0.230
ounded+0.226
oke+0.218
rier+0.215
 understands+0.210
Neg logit contributions
aster-0.208
e-0.194
One-0.188
 rest-0.185
 play-0.170
Token,
Token ID11
Feature activation+9.06e-01

Pos logit contributions
aster+0.369
e+0.344
 rest+0.333
One+0.329
 play+0.300
Neg logit contributions
ometer-0.673
ounded-0.663
rier-0.639
oke-0.633
 puzzled-0.618
Token turned
Token ID2900
Feature activation+4.30e+00

Pos logit contributions
aster+0.082
e+0.075
 rest+0.074
 play+0.069
 read+0.069
Neg logit contributions
ometer-0.120
ounded-0.117
oke-0.116
rier-0.115
 puzzled-0.106
Token it
Token ID340
Feature activation-3.68e+00

Pos logit contributions
aster+0.198
e+0.182
 rest+0.174
One+0.170
 spring+0.161
Neg logit contributions
ometer-0.293
ounded-0.291
oke-0.283
rier-0.273
keeper-0.267
Token around
Token ID1088
Feature activation-6.32e+00

Pos logit contributions
ometer+0.257
ounded+0.252
oke+0.245
rier+0.243
 understands+0.239
Neg logit contributions
aster-0.158
e-0.144
One-0.138
 rest-0.134
 play-0.120
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
ometer+0.452
ounded+0.442
oke+0.431
rier+0.430
 understands+0.423
Neg logit contributions
aster-0.255
e-0.229
One-0.223
 rest-0.213
 arrive-0.191
Token inspecting
Token ID49889
Feature activation-4.11e-01

Pos logit contributions
aster+0.056
 rest+0.052
e+0.051
 play+0.048
 spring+0.047
Neg logit contributions
ometer-0.094
oke-0.090
ounded-0.089
rier-0.085
keeper-0.083
Token it
Token ID340
Feature activation-3.62e+00

Pos logit contributions
ometer+0.030
ounded+0.030
oke+0.030
rier+0.028
keeper+0.028
Neg logit contributions
aster-0.016
e-0.016
One-0.015
 rest-0.014
 spring-0.013
Token closely
Token ID7173
Feature activation-2.82e+00

Pos logit contributions
ometer+0.247
ounded+0.242
oke+0.235
rier+0.233
 understands+0.229
Neg logit contributions
aster-0.161
e-0.147
One-0.141
 rest-0.138
 arrive-0.124
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
ometer+0.234
ounded+0.233
oke+0.226
rier+0.218
 understands+0.217
Neg logit contributions
aster-0.080
e-0.075
One-0.072
 rest-0.068
 spring-0.058
Token Then
Token ID3244
Feature activation-9.85e-01

Pos logit contributions
aster+0.074
 rest+0.073
One+0.072
e+0.070
 spring+0.068
Neg logit contributions
oke-0.104
ounded-0.103
rier-0.103
ometer-0.101
keeper-0.093
Token,
Token ID11
Feature activation+1.35e+00

Pos logit contributions
ometer+0.072
ounded+0.071
oke+0.069
rier+0.068
 understands+0.067
Neg logit contributions
aster-0.039
e-0.036
One-0.034
 rest-0.034
 play-0.030
Token before
Token ID878
Feature activation-3.75e-01

Pos logit contributions
aster+0.051
e+0.047
One+0.044
 rest+0.043
 play+0.038
Neg logit contributions
ometer-0.100
ounded-0.099
oke-0.096
rier-0.095
 understands-0.094
Token forest
Token ID8222
Feature activation+5.63e+00

Pos logit contributions
aster+0.081
e+0.075
 rest+0.072
One+0.072
 play+0.068
Neg logit contributions
ometer-0.100
ounded-0.098
oke-0.095
rier-0.093
 understands-0.090
Token,
Token ID11
Feature activation+1.76e+00

Pos logit contributions
aster+0.207
e+0.190
One+0.181
 rest+0.176
 play+0.156
Neg logit contributions
ometer-0.426
ounded-0.417
oke-0.408
rier-0.402
 understands-0.396
Token there
Token ID612
Feature activation-3.58e+00

Pos logit contributions
aster+0.078
e+0.073
One+0.070
 rest+0.069
 play+0.062
Neg logit contributions
ometer-0.119
ounded-0.116
oke-0.114
rier-0.112
 understands-0.110
Token was
Token ID373
Feature activation-1.14e+00

Pos logit contributions
ometer+0.216
ounded+0.212
oke+0.206
 understands+0.202
rier+0.201
Neg logit contributions
aster-0.185
e-0.170
One-0.166
 rest-0.161
 arrive-0.148
Token a
Token ID257
Feature activation+3.38e+00

Pos logit contributions
ometer+0.076
ounded+0.075
 puzzled+0.074
 understands+0.073
keeper+0.073
Neg logit contributions
aster-0.047
e-0.043
One-0.038
 rest-0.037
 recess-0.036
Token small
Token ID1402
Feature activation-4.02e+00

Pos logit contributions
aster+0.195
e+0.183
One+0.180
 rest+0.178
 play+0.167
Neg logit contributions
ometer-0.182
ounded-0.180
oke-0.173
rier-0.169
 understands-0.167
Token bunny
Token ID44915
Feature activation-9.22e-01

Pos logit contributions
ounded+0.228
ometer+0.224
oke+0.216
 understands+0.208
rier+0.208
Neg logit contributions
aster-0.222
e-0.210
 rest-0.207
One-0.204
 play-0.196
Token named
Token ID3706
Feature activation+8.96e+00

Pos logit contributions
ometer+0.067
ounded+0.064
oke+0.064
keeper+0.063
rier+0.062
Neg logit contributions
aster-0.036
One-0.030
e-0.030
 rest-0.029
 recess-0.028
Token Ben
Token ID3932
Feature activation+5.05e+00

Pos logit contributions
aster+0.483
e+0.444
One+0.428
 rest+0.416
 arrive+0.391
Neg logit contributions
ometer-0.524
ounded-0.505
oke-0.495
rier-0.483
 understands-0.477
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
aster+0.169
e+0.155
One+0.147
 rest+0.141
 play+0.125
Neg logit contributions
ometer-0.398
ounded-0.394
rier-0.381
oke-0.381
 understands-0.371
Token Ben
Token ID3932
Feature activation+4.99e+00

Pos logit contributions
aster+0.068
One+0.064
e+0.062
 rest+0.059
 play+0.055
Neg logit contributions
ounded-0.112
ometer-0.111
rier-0.107
oke-0.105
keeper-0.103
Token try
Token ID1949
Feature activation+1.59e+00

Pos logit contributions
aster+0.045
e+0.040
One+0.037
 rest+0.028
P+0.027
Neg logit contributions
ounded-0.133
ometer-0.133
keeper-0.128
 understands-0.126
 puzzled-0.124
Token to
Token ID284
Feature activation-4.19e+00

Pos logit contributions
aster+0.076
One+0.070
e+0.070
 rest+0.070
 play+0.065
Neg logit contributions
ometer-0.104
ounded-0.102
oke-0.101
rier-0.099
keeper-0.095
Token find
Token ID1064
Feature activation+8.75e-01

Pos logit contributions
ounded+0.221
keeper+0.206
 puzzled+0.203
 son+0.203
 understands+0.198
Neg logit contributions
aster-0.202
P-0.186
One-0.169
e-0.160
 recess-0.155
Token their
Token ID511
Feature activation+2.91e-01

Pos logit contributions
aster+0.037
e+0.034
 rest+0.032
One+0.032
 play+0.029
Neg logit contributions
ometer-0.063
ounded-0.061
oke-0.060
rier-0.059
 understands-0.057
Token way
Token ID835
Feature activation+4.57e-01

Pos logit contributions
 rest+0.013
aster+0.012
One+0.012
e+0.012
 play+0.012
Neg logit contributions
ometer-0.019
oke-0.019
rier-0.019
ounded-0.019
keeper-0.018
Token through
Token ID832
Feature activation-6.68e-01

Pos logit contributions
aster+0.025
e+0.024
One+0.023
 rest+0.023
 play+0.021
Neg logit contributions
ometer-0.026
ounded-0.025
oke-0.024
rier-0.024
 understands-0.024
Token again
Token ID757
Feature activation-2.40e+00

Pos logit contributions
ometer+0.044
ounded+0.044
oke+0.043
 understands+0.043
rier+0.043
Neg logit contributions
aster-0.028
e-0.025
One-0.025
 rest-0.024
P-0.021
Token soon
Token ID2582
Feature activation+4.96e+00

Pos logit contributions
ometer+0.168
ounded+0.165
oke+0.159
rier+0.159
 understands+0.156
Neg logit contributions
aster-0.100
e-0.090
One-0.085
 rest-0.083
wered-0.075
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
aster+0.194
e+0.175
One+0.170
 rest+0.161
 play+0.142
Neg logit contributions
ometer-0.363
ounded-0.357
oke-0.349
rier-0.345
 understands-0.340
Token From
Token ID3574
Feature activation-1.09e+00

Pos logit contributions
One+0.061
aster+0.060
e+0.057
 rest+0.057
 play+0.056
Neg logit contributions
rier-0.105
ener-0.096
ounded-0.095
oke-0.095
ometer-0.095
Token that
Token ID326
Feature activation-2.24e+00

Pos logit contributions
ometer+0.078
ounded+0.076
oke+0.075
rier+0.074
 understands+0.072
Neg logit contributions
aster-0.045
e-0.041
One-0.040
 rest-0.039
 play-0.035
Token candy
Token ID18550
Feature activation-1.63e-01

Pos logit contributions
ometer+0.277
ounded+0.264
rier+0.261
oke+0.258
keeper+0.256
Neg logit contributions
aster-0.287
One-0.270
e-0.267
 rest-0.241
wered-0.240
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
ometer+0.010
ounded+0.010
oke+0.009
keeper+0.009
rier+0.009
Neg logit contributions
aster-0.008
e-0.008
One-0.008
wered-0.007
 rest-0.007
Token She
Token ID1375
Feature activation+2.06e+00

Pos logit contributions
aster+0.053
One+0.049
 rest+0.048
e+0.047
Once+0.044
Neg logit contributions
ometer-0.084
rier-0.081
ounded-0.081
oke-0.078
ener-0.074
Token smiled
Token ID13541
Feature activation-4.54e+00

Pos logit contributions
aster+0.100
e+0.098
One+0.091
 rest+0.091
 play+0.087
Neg logit contributions
ometer-0.131
rier-0.128
ounded-0.127
oke-0.124
 understands-0.115
Token and
Token ID290
Feature activation+1.42e+00

Pos logit contributions
oke+0.388
ometer+0.368
ounded+0.361
 understands+0.343
ours+0.338
Neg logit contributions
 play-0.115
e-0.111
One-0.111
 her-0.109
 spring-0.107
Token enjoyed
Token ID8359
Feature activation-4.00e+00

Pos logit contributions
aster+0.074
e+0.073
 rest+0.070
 play+0.067
 read+0.065
Neg logit contributions
ometer-0.086
rier-0.082
oke-0.082
ounded-0.081
keeper-0.074
Token her
Token ID607
Feature activation+4.64e+00

Pos logit contributions
 understands+0.229
rier+0.227
 puzzled+0.227
ounded+0.221
ometer+0.220
Neg logit contributions
aster-0.189
e-0.173
One-0.173
 rest-0.160
P-0.150
Token treats
Token ID18432
Feature activation-3.26e+00

Pos logit contributions
aster+0.246
e+0.230
One+0.222
 rest+0.210
 arrive+0.198
Neg logit contributions
ometer-0.274
ounded-0.270
oke-0.257
 understands-0.255
rier-0.253
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
ometer+0.198
keeper+0.195
rier+0.195
ounded+0.192
 puzzled+0.180
Neg logit contributions
aster-0.167
One-0.151
e-0.144
wered-0.139
 recess-0.130
Token She
Token ID1375
Feature activation+2.07e+00

Pos logit contributions
aster+0.058
One+0.054
e+0.054
 rest+0.053
 play+0.048
Neg logit contributions
ometer-0.081
ounded-0.079
rier-0.078
oke-0.076
 puzzled-0.072
Token had
Token ID550
Feature activation-7.66e-01

Pos logit contributions
aster+0.112
e+0.105
One+0.102
 rest+0.100
 play+0.092
Neg logit contributions
ometer-0.121
ounded-0.118
oke-0.115
rier-0.113
 understands-0.110
Token the
Token ID262
Feature activation-6.90e-01

Pos logit contributions
ometer+0.227
ounded+0.216
 understands+0.214
keeper+0.213
rier+0.213
Neg logit contributions
One-0.134
aster-0.130
e-0.123
 arrive-0.112
 rest-0.111
Token bull
Token ID6473
Feature activation-7.29e+00

Pos logit contributions
ometer+0.040
ounded+0.040
oke+0.039
rier+0.037
 understands+0.037
Neg logit contributions
aster-0.037
e-0.035
 rest-0.035
One-0.034
 spring-0.032
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
ometer+0.519
ounded+0.505
oke+0.495
rier+0.486
keeper+0.483
Neg logit contributions
aster-0.296
e-0.261
One-0.261
 rest-0.243
 recess-0.227
Token The
Token ID383
Feature activation-3.97e+00

Pos logit contributions
aster+0.078
e+0.071
One+0.070
 rest+0.067
 play+0.063
Neg logit contributions
ounded-0.101
ometer-0.100
rier-0.096
oke-0.096
 understands-0.092
Token bull
Token ID6473
Feature activation-7.40e+00

Pos logit contributions
ounded+0.183
ometer+0.179
oke+0.174
rier+0.163
atted+0.161
Neg logit contributions
aster-0.266
 rest-0.250
One-0.247
 end-0.242
e-0.237
Token sees
Token ID7224
Feature activation-9.99e+00

Pos logit contributions
ometer+0.368
ounded+0.359
oke+0.340
rier+0.338
keeper+0.323
Neg logit contributions
aster-0.463
e-0.444
One-0.428
 rest-0.423
 play-0.398
Token Mia
Token ID32189
Feature activation+2.18e+00

Pos logit contributions
ometer+0.826
ounded+0.814
oke+0.776
rier+0.774
 puzzled+0.752
Neg logit contributions
aster-0.324
e-0.287
One-0.269
 rest-0.261
 spring-0.229
Token and
Token ID290
Feature activation+1.66e+00

Pos logit contributions
aster+0.105
One+0.095
e+0.094
 rest+0.090
 recess+0.084
Neg logit contributions
ometer-0.139
ounded-0.135
oke-0.133
rier-0.131
 understands-0.130
Token sn
Token ID3013
Feature activation+1.12e+01

Pos logit contributions
aster+0.067
e+0.060
 rest+0.058
 run+0.056
 play+0.056
Neg logit contributions
ounded-0.126
ometer-0.124
oke-0.119
rier-0.117
keeper-0.114
Tokenorts
Token ID2096
Feature activation-4.20e-01

Pos logit contributions
aster+0.721
One+0.674
e+0.672
 rest+0.655
 arrive+0.634
Neg logit contributions
ometer-0.567
ounded-0.523
oke-0.508
rier-0.505
atted-0.494
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
ometer+0.028
ounded+0.028
oke+0.027
rier+0.027
 understands+0.027
Neg logit contributions
aster-0.019
e-0.017
One-0.017
 rest-0.016
 recess-0.015
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
aster+0.024
One+0.021
e+0.021
 rest+0.021
 arrive+0.019
Neg logit contributions
ometer-0.038
rier-0.038
keeper-0.037
oke-0.037
 understands-0.036
Token bird
Token ID6512
Feature activation-1.76e+00

Pos logit contributions
ometer+0.059
ounded+0.058
oke+0.056
rier+0.054
 understands+0.052
Neg logit contributions
aster-0.060
e-0.057
One-0.056
 rest-0.056
 play-0.052
Token or
Token ID393
Feature activation+3.06e+00

Pos logit contributions
ometer+0.120
ounded+0.116
oke+0.115
rier+0.114
keeper+0.111
Neg logit contributions
aster-0.076
One-0.068
e-0.068
 rest-0.063
 recess-0.060
Token make
Token ID787
Feature activation+2.05e+00

Pos logit contributions
aster+0.115
e+0.102
 rest+0.099
 play+0.093
One+0.093
Neg logit contributions
ometer-0.238
ounded-0.235
oke-0.230
rier-0.225
 understands-0.217
Token it
Token ID340
Feature activation-3.94e+00

Pos logit contributions
aster+0.082
e+0.075
 rest+0.070
One+0.070
 play+0.063
Neg logit contributions
ometer-0.149
ounded-0.148
oke-0.143
rier-0.139
 understands-0.138
Token sad
Token ID6507
Feature activation-9.89e+00

Pos logit contributions
ometer+0.259
ounded+0.255
oke+0.248
rier+0.242
 understands+0.239
Neg logit contributions
aster-0.183
e-0.170
One-0.163
 rest-0.162
 play-0.149
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
ometer+0.771
ounded+0.764
oke+0.737
rier+0.726
 understands+0.720
Neg logit contributions
aster-0.340
e-0.307
One-0.293
 rest-0.286
 play-0.256
Token She
Token ID1375
Feature activation+2.15e+00

Pos logit contributions
aster+0.053
One+0.050
e+0.048
 rest+0.046
 spring+0.042
Neg logit contributions
ounded-0.093
ometer-0.091
rier-0.089
oke-0.088
keeper-0.085
Token wants
Token ID3382
Feature activation-1.17e+01

Pos logit contributions
aster+0.153
e+0.148
One+0.147
 rest+0.145
 play+0.138
Neg logit contributions
ometer-0.088
ounded-0.085
rier-0.080
oke-0.079
keeper-0.074
Token to
Token ID284
Feature activation-4.30e+00

Pos logit contributions
ounded+0.704
ometer+0.704
oke+0.676
rier+0.662
 understands+0.635
Neg logit contributions
aster-0.627
e-0.578
One-0.560
 rest-0.559
 play-0.521
Token be
Token ID307
Feature activation+1.80e+00

Pos logit contributions
 understands+0.237
 ashamed+0.236
 teeth+0.228
keeper+0.224
ometer+0.219
Neg logit contributions
aster-0.189
One-0.187
P-0.184
 recess-0.163
he-0.160
Token Lily
Token ID20037
Feature activation+1.28e+00

Pos logit contributions
aster+0.045
e+0.043
One+0.040
 rest+0.038
 arrive+0.037
Neg logit contributions
ometer-0.052
ounded-0.051
rier-0.049
 puzzled-0.049
 understands-0.048
Token's
Token ID338
Feature activation-5.54e+00

Pos logit contributions
aster+0.076
e+0.071
One+0.070
 rest+0.068
 arrive+0.064
Neg logit contributions
ometer-0.067
ounded-0.065
oke-0.063
 understands-0.061
rier-0.061
Token mom
Token ID1995
Feature activation-7.88e-01

Pos logit contributions
ometer+0.302
ounded+0.296
oke+0.285
rier+0.280
 understands+0.275
Neg logit contributions
aster-0.321
e-0.302
One-0.294
 rest-0.288
 play-0.268
Token said
Token ID531
Feature activation-6.53e+00

Pos logit contributions
ometer+0.048
ounded+0.047
oke+0.045
rier+0.045
 understands+0.044
Neg logit contributions
aster-0.041
e-0.038
One-0.037
 rest-0.036
 play-0.033
Token,
Token ID11
Feature activation+8.80e-01

Pos logit contributions
ounded+0.528
ometer+0.516
rier+0.501
oke+0.494
 understands+0.494
Neg logit contributions
aster-0.208
 rest-0.184
e-0.178
One-0.165
 arrive-0.155
Token "
Token ID366
Feature activation-1.08e+01

Pos logit contributions
aster+0.053
One+0.052
 spring+0.050
 play+0.050
 rest+0.050
Neg logit contributions
ounded-0.044
rier-0.043
azed-0.040
 understands-0.039
ometer-0.039
TokenNo
Token ID2949
Feature activation+3.45e+00

Pos logit contributions
ounded+0.728
ometer+0.722
keeper+0.698
oke+0.696
 understands+0.696
Neg logit contributions
aster-0.428
e-0.398
 arrive-0.366
 rest-0.353
wered-0.342
Token more
Token ID517
Feature activation-3.18e-01

Pos logit contributions
aster+0.155
e+0.144
One+0.138
 rest+0.129
 arrive+0.122
Neg logit contributions
ometer-0.232
ounded-0.227
oke-0.225
 understands-0.217
rier-0.216
Token cartoons
Token ID30070
Feature activation+2.29e+00

Pos logit contributions
ometer+0.020
ounded+0.020
oke+0.019
rier+0.019
 understands+0.018
Neg logit contributions
aster-0.016
e-0.015
One-0.014
 rest-0.014
 play-0.013
Token today
Token ID1909
Feature activation+5.43e-01

Pos logit contributions
aster+0.102
One+0.095
e+0.090
 rest+0.086
wered+0.083
Neg logit contributions
ometer-0.153
oke-0.148
ounded-0.147
rier-0.142
 understands-0.142
Token,
Token ID11
Feature activation+8.72e-01

Pos logit contributions
aster+0.024
e+0.022
One+0.021
 rest+0.020
wered+0.019
Neg logit contributions
ometer-0.037
ounded-0.036
oke-0.035
rier-0.034
 understands-0.034
Token mouse
Token ID10211
Feature activation-4.78e+00

Pos logit contributions
ometer+0.396
ounded+0.391
oke+0.373
rier+0.369
 understands+0.366
Neg logit contributions
aster-0.255
e-0.236
One-0.225
 rest-0.219
 play-0.201
Token.
Token ID13
Feature activation+1.38e+00

Pos logit contributions
ometer+0.337
ounded+0.330
oke+0.323
rier+0.317
 understands+0.313
Neg logit contributions
aster-0.199
e-0.180
One-0.175
 rest-0.168
 arrive-0.150
Token Sam
Token ID3409
Feature activation+4.47e-01

Pos logit contributions
aster+0.064
e+0.058
One+0.058
 play+0.053
 spring+0.052
Neg logit contributions
ounded-0.093
rier-0.093
ometer-0.089
oke-0.084
rily-0.083
Token was
Token ID373
Feature activation-1.62e+00

Pos logit contributions
e+0.022
aster+0.022
One+0.020
 rest+0.020
 play+0.019
Neg logit contributions
ometer-0.028
ounded-0.027
rier-0.027
oke-0.026
keeper-0.025
Token very
Token ID845
Feature activation+2.68e+00

Pos logit contributions
ometer+0.092
ounded+0.090
oke+0.087
rier+0.085
 understands+0.083
Neg logit contributions
aster-0.091
e-0.086
One-0.082
 rest-0.081
 play-0.076
Token sad
Token ID6507
Feature activation-9.81e+00

Pos logit contributions
aster+0.170
e+0.166
 rest+0.159
One+0.156
 play+0.153
Neg logit contributions
ometer-0.132
ounded-0.126
oke-0.118
rier-0.115
keeper-0.112
Token and
Token ID290
Feature activation+1.60e+00

Pos logit contributions
ometer+0.745
ounded+0.741
oke+0.712
rier+0.706
 understands+0.696
Neg logit contributions
aster-0.352
e-0.320
One-0.311
 rest-0.303
 play-0.274
Token angry
Token ID7954
Feature activation-7.05e+00

Pos logit contributions
aster+0.090
e+0.086
 rest+0.084
 play+0.084
 run+0.081
Neg logit contributions
ometer-0.092
ounded-0.088
oke-0.086
rier-0.084
keeper-0.078
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
ounded+0.602
ometer+0.595
oke+0.567
 understands+0.558
 puzzled+0.555
Neg logit contributions
aster-0.186
e-0.176
One-0.161
 rest-0.153
 play-0.143
Token He
Token ID679
Feature activation-2.14e+00

Pos logit contributions
aster+0.068
One+0.067
e+0.064
 play+0.060
Once+0.058
Neg logit contributions
rier-0.091
ounded-0.091
ometer-0.086
rily-0.081
oke-0.081
Token did
Token ID750
Feature activation-1.84e+00

Pos logit contributions
ometer+0.131
ounded+0.127
rier+0.127
oke+0.124
keeper+0.116
Neg logit contributions
aster-0.108
e-0.108
One-0.103
 rest-0.100
 play-0.098
Token\n
Token ID198
Feature activation+1.03e-01

Pos logit contributions
aster+0.006
e+0.005
 rest+0.005
wered+0.005
 end+0.005
Neg logit contributions
ometer-0.006
oke-0.006
ounded-0.006
 understands-0.006
keeper-0.006
TokenThe
Token ID464
Feature activation+2.54e+00

Pos logit contributions
aster+0.005
 rest+0.004
e+0.004
 recess+0.004
wered+0.004
Neg logit contributions
ounded-0.007
keeper-0.006
ometer-0.006
oke-0.006
 understands-0.006
Token girl
Token ID2576
Feature activation-7.33e+00

Pos logit contributions
aster+0.172
e+0.163
One+0.160
 rest+0.159
 play+0.150
Neg logit contributions
ometer-0.114
ounded-0.110
oke-0.107
rier-0.104
 understands-0.101
Token smiled
Token ID13541
Feature activation-4.58e+00

Pos logit contributions
ometer+0.386
rier+0.374
ounded+0.370
oke+0.356
 understands+0.330
Neg logit contributions
aster-0.438
e-0.433
One-0.409
 rest-0.403
 read-0.394
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
oke+0.375
ometer+0.372
ounded+0.362
 understands+0.342
rier+0.332
Neg logit contributions
One-0.126
e-0.121
 play-0.118
aster-0.116
 end-0.114
Token "
Token ID366
Feature activation-1.08e+01

Pos logit contributions
One+0.053
aster+0.051
Once+0.049
 spring+0.047
 play+0.047
Neg logit contributions
ounded-0.076
rier-0.075
oke-0.071
ometer-0.070
ishes-0.069
TokenI
Token ID40
Feature activation+7.58e+00

Pos logit contributions
ounded+0.646
ometer+0.636
keeper+0.612
 understands+0.608
oke+0.599
Neg logit contributions
aster-0.534
e-0.474
 rest-0.468
 arrive-0.441
 walk-0.435
Token love
Token ID1842
Feature activation+9.61e-01

Pos logit contributions
aster+0.338
e+0.309
One+0.299
 rest+0.287
 arrive+0.260
Neg logit contributions
ometer-0.514
ounded-0.507
oke-0.491
 understands-0.480
rier-0.479
Token it
Token ID340
Feature activation-4.06e+00

Pos logit contributions
aster+0.046
e+0.043
One+0.041
 rest+0.040
 play+0.037
Neg logit contributions
ometer-0.063
ounded-0.062
oke-0.060
rier-0.059
 understands-0.058
Token!"
Token ID2474
Feature activation-7.03e+00

Pos logit contributions
ometer+0.257
ounded+0.254
oke+0.247
 understands+0.240
rier+0.238
Neg logit contributions
aster-0.195
e-0.187
One-0.179
 rest-0.176
 play-0.162
Token she
Token ID673
Feature activation+3.66e+00

Pos logit contributions
ometer+0.499
ounded+0.489
oke+0.476
rier+0.467
 understands+0.465
Neg logit contributions
aster-0.291
e-0.268
One-0.257
 rest-0.248
 arrive-0.224