TOP ACTIVATIONS
MAX = 18.639

We have different likes and disl ikes . We can be
should be compassionate . Comp assion ate means you care about
sad . He said , "" I hope something will change
your tomatoes . They are ours . We planted them .
Billy was ast ounded ! He used the magnet
. The juice is not ours . We have to give
ally , that 's not ours . Leave it where it
" No , this is ours . Go away !"
they arrived they were ast ounded , the land was very
not yours . They are ours . We made them .
. That balloon is not ours . We must not take
not . It is not ours . It is dangerous ."
. The things are not ours . We should not take
. The coins are not ours . They are for the
" No , it is ours now !" one of the
" No , it is ours . We found it .
. Those flowers are not ours . They belong to someone
yelled . " Bob is ours !" The dog
the bird and said , "' Hello , my name is
only on things that are ours ." <|endoftext|> Anna and Ben

MIN ACTIVATIONS
MIN = -17.078

not available yet ," Grand ad said as he wiped the
words like " ab rac ad ab ra " and "
and say " ab rac ad ab ra " all the
big tree . Bl ossom tried to climb up
never pin ched the to ad 's toe again , and
frog ! T ad smiled and carefully picked up
, " A br ac ad ab ra , I turn
, " A br ac ad ab ra ! I want
, " A br ac ad ab ra !" Suddenly ,
It was his grand ad . He said to Jamie
, " A br ac ad ab ra !" Suddenly ,
assignment . Ch ad always works hard at school
say " A br ac ad ab ra !" and everyone
. " A br ac ad ab ra !"
. " A br ac ad ab ra !"
the tree . Bl ossom touched the pet als
, " A br ac ad ab ra ! I 'm
snake . " Bl ub ! You are very
them on and gave Grand ad a playful punch . He
seen before . But Grand ad warned him that he must

INTERVAL 9.709 - 18.639
CONTAINS 1.659%

his way ! <|endoftext|> Once upon a time , there was
person smiled and said , " That 's great ! I
. The old man his sed ,
belt he found . The old man was very happy and
blocks ' Adventure ' and ' Amazing ' instead . That

INTERVAL 0.780 - 9.709
CONTAINS 37.807%

Mia saw a ghost . The ghost was very friendly .
Tom is a cat . He likes to play with his
. She quickly recognised the bottle of poison .
it to the poor and needy people ". Anna
to find a bone somewhere else . Bob rushed

INTERVAL -8.149 - 0.780
CONTAINS 56.797%

not much space . Lu cy said , "
day , a big storm came and broke the hall .
hook and throw the line in the water .
play with my ball !" Lily was happy and said ,
got up early and put on his mail man hat and

INTERVAL -17.078 - -8.149
CONTAINS 3.737%

men and snow balls and slide on the ice . But
saw a big black beetle crawling on a leaf . It
some food . Ch ad finished lunch and he
. The end . <|endoftext|> Once upon a time , there
the magical statue . <|endoftext|> Once upon a time there was
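
The four INTERVAL ranges above look like equal-width slices of the span from MIN to MAX, and the CONTAINS figures look like the share of activations falling in each slice. Both readings are inferences from the printed numbers, not something the dashboard states. The sketch below checks them. The names `interval_edges`, `ACT_MIN`, `ACT_MAX`, and `N_BINS` are illustrative, and the numbers are copied from the text above. The comparison needs a small tolerance because the printed edges are shown to three decimals only.

```python
# Values copied from the MIN and MAX lines above.
ACT_MIN = -17.078
ACT_MAX = 18.639
N_BINS = 4  # assumption: four equal-width intervals


def interval_edges(lo, hi, n_bins):
    """Return n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


edges = interval_edges(ACT_MIN, ACT_MAX, N_BINS)

# Edges as printed in the INTERVAL headers, lowest first.
listed_edges = [-17.078, -8.149, 0.780, 9.709, 18.639]

# Shares from the CONTAINS lines, lowest interval first (percent).
shares = [3.737, 56.797, 37.807, 1.659]

# Largest gap between a computed edge and the printed one.
max_edge_error = max(abs(a - b) for a, b in zip(edges, listed_edges))

# The shares should account for all activations if the bins cover the range.
total_share = sum(shares)

print("computed edges:", [f"{e:.4f}" for e in edges])
print("max edge error:", max_edge_error)
print("total share (%):", total_share)
```

If the equal-width reading is right, the largest edge error stays below the three-decimal display precision and the shares add up to roughly 100%. On that reading, over 94% of activations fall in the two middle intervals, so the top and bottom samples listed earlier come from thin tails.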
Token We
Token ID775
Feature activation+5.98e+00

Pos logit contributions
aster+0.096
e+0.092
One+0.088
 rest+0.086
 spring+0.083
Neg logit contributions
ometer-0.061
ounded-0.060
oke-0.058
keeper-0.055
rier-0.055
Token have
Token ID423
Feature activation-4.05e+00

Pos logit contributions
ometer+0.385
ounded+0.380
oke+0.373
 understands+0.369
 ashamed+0.356
Neg logit contributions
aster-0.263
One-0.223
e-0.207
wered-0.204
 recess-0.202
Token different
Token ID1180
Feature activation+3.32e+00

Pos logit contributions
aster+0.193
e+0.184
One+0.175
 rest+0.175
 spring+0.165
Neg logit contributions
ounded-0.263
ometer-0.259
oke-0.253
rier-0.245
keeper-0.239
Token likes
Token ID7832
Feature activation+1.04e+01

Pos logit contributions
ounded+0.199
ometer+0.198
oke+0.188
keeper+0.181
rier+0.181
Neg logit contributions
e-0.175
aster-0.174
 rest-0.162
One-0.159
 play-0.152
Token and
Token ID290
Feature activation-1.86e+00

Pos logit contributions
ometer+0.635
ounded+0.629
oke+0.614
keeper+0.594
rier+0.590
Neg logit contributions
aster-0.509
e-0.488
One-0.454
 rest-0.439
 spring-0.413
Token disl
Token ID19621
Feature activation+1.86e+01

Pos logit contributions
aster+0.081
e+0.078
 rest+0.071
One+0.070
 spring+0.066
Neg logit contributions
ometer-0.125
ounded-0.125
oke-0.121
rier-0.117
keeper-0.117
Tokenikes
Token ID7938
Feature activation+4.54e+00

Pos logit contributions
ometer+0.790
ounded+0.780
oke+0.750
keeper+0.713
rier+0.711
Neg logit contributions
aster-1.274
e-1.227
One-1.158
 rest-1.141
 spring-1.100
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
ometer+0.322
ounded+0.317
oke+0.314
rier+0.302
keeper+0.302
Neg logit contributions
aster-0.177
e-0.166
One-0.151
 rest-0.144
wered-0.133
Token We
Token ID775
Feature activation+5.95e+00

Pos logit contributions
aster+0.099
e+0.096
One+0.091
 rest+0.090
 spring+0.086
Neg logit contributions
ometer-0.065
ounded-0.065
oke-0.062
keeper-0.059
rier-0.059
Token can
Token ID460
Feature activation+5.32e+00

Pos logit contributions
ometer+0.368
ounded+0.366
oke+0.356
 understands+0.351
 puzzled+0.338
Neg logit contributions
aster-0.284
One-0.236
e-0.231
 recess-0.220
wered-0.220
Token be
Token ID307
Feature activation-1.52e+00

Pos logit contributions
ometer+0.316
keeper+0.314
ounded+0.311
 puzzled+0.303
rier+0.301
Neg logit contributions
aster-0.241
P-0.204
e-0.197
 recess-0.188
 Saturday-0.172
Token should
Token ID815
Feature activation+7.43e+00

Pos logit contributions
ometer+0.114
ounded+0.114
oke+0.112
 understands+0.108
keeper+0.106
Neg logit contributions
aster-0.074
One-0.064
e-0.063
 recess-0.057
wered-0.056
Token be
Token ID307
Feature activation-1.46e+00

Pos logit contributions
 understands+0.345
 dear+0.332
 ashamed+0.332
 doesn+0.310
 wants+0.304
Neg logit contributions
he-0.365
aster-0.356
e-0.352
 recess-0.336
date-0.334
Token compassionate
Token ID32533
Feature activation+4.32e+00

Pos logit contributions
e+0.070
aster+0.068
One+0.065
 rest+0.062
 play+0.060
Neg logit contributions
ometer-0.097
ounded-0.092
oke-0.091
keeper-0.088
rier-0.087
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
ounded+0.347
ometer+0.338
oke+0.331
 puzzled+0.323
 understands+0.316
Neg logit contributions
e-0.137
aster-0.127
One-0.115
 rest-0.108
 end-0.104
Token Comp
Token ID3082
Feature activation-2.64e+00

Pos logit contributions
aster+0.093
e+0.089
One+0.085
 rest+0.083
 spring+0.080
Neg logit contributions
ometer-0.063
ounded-0.062
oke-0.060
keeper-0.057
rier-0.057
Tokenassion
Token ID11857
Feature activation+1.74e+01

Pos logit contributions
e+0.195
One+0.190
aster+0.188
he+0.175
 play+0.172
Neg logit contributions
 puzzled-0.081
rier-0.080
ounded-0.079
 ashamed-0.076
ometer-0.075
Tokenate
Token ID378
Feature activation-9.52e-01

Pos logit contributions
ounded+1.145
ometer+1.122
oke+1.061
 puzzled+1.056
keeper+1.055
Neg logit contributions
e-0.796
aster-0.747
One-0.736
 end-0.664
 rest-0.659
Token means
Token ID1724
Feature activation+1.41e+01

Pos logit contributions
e+0.047
aster+0.044
 rest+0.041
One+0.040
 end+0.039
Neg logit contributions
ounded-0.060
ometer-0.057
 puzzled-0.055
oke-0.054
rier-0.053
Token you
Token ID345
Feature activation-4.91e+00

Pos logit contributions
ometer+1.023
ounded+1.018
oke+0.995
keeper+0.967
rier+0.962
Neg logit contributions
aster-0.534
e-0.507
One-0.460
 rest-0.440
 spring-0.406
Token care
Token ID1337
Feature activation-2.78e+00

Pos logit contributions
e+0.214
aster+0.206
 rest+0.198
One+0.191
 play+0.178
Neg logit contributions
ounded-0.325
ometer-0.322
rier-0.315
oke-0.310
keeper-0.307
Token about
Token ID546
Feature activation+7.21e+00

Pos logit contributions
e+0.113
aster+0.108
 rest+0.108
 end+0.108
One+0.106
Neg logit contributions
ounded-0.186
ometer-0.181
keeper-0.180
 puzzled-0.172
rier-0.169
Token sad
Token ID6507
Feature activation+9.73e+00

Pos logit contributions
ometer+0.126
ounded+0.124
oke+0.117
keeper+0.113
rier+0.113
Neg logit contributions
aster-0.212
e-0.206
One-0.197
 rest-0.194
 spring-0.186
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
ounded+0.749
ometer+0.740
oke+0.722
rier+0.707
keeper+0.704
Neg logit contributions
aster-0.322
e-0.296
One-0.279
 rest-0.274
 spring-0.252
Token He
Token ID679
Feature activation+2.44e+00

Pos logit contributions
One+0.042
 play+0.038
aster+0.036
 time+0.036
 rest+0.036
Neg logit contributions
oke-0.077
ounded-0.074
rier-0.071
ounced-0.071
ishes-0.070
Token said
Token ID531
Feature activation+6.59e+00

Pos logit contributions
rier+0.150
oke+0.145
ometer+0.144
ounded+0.142
keeper+0.131
Neg logit contributions
e-0.121
One-0.118
aster-0.117
 play-0.114
 rest-0.114
Token,
Token ID11
Feature activation-8.79e-01

Pos logit contributions
ounded+0.579
oke+0.546
ometer+0.542
 understands+0.518
rier+0.517
Neg logit contributions
aster-0.150
e-0.138
 rest-0.128
One-0.114
 play-0.113
Token ""
Token ID13538
Feature activation+1.68e+01

Pos logit contributions
One+0.054
aster+0.053
 rest+0.051
 spring+0.051
Once+0.050
Neg logit contributions
ounded-0.041
oke-0.039
ometer-0.038
endant-0.037
rier-0.037
TokenI
Token ID40
Feature activation-7.66e+00

Pos logit contributions
ounded+1.090
ometer+1.076
keeper+1.068
 understands+1.034
oke+1.033
Neg logit contributions
aster-0.687
e-0.674
 rest-0.588
 recess-0.571
 arrive-0.553
Token hope
Token ID2911
Feature activation-1.18e+00

Pos logit contributions
aster+0.353
e+0.352
 rest+0.326
One+0.321
 play+0.304
Neg logit contributions
rier-0.482
ometer-0.481
ounded-0.480
oke-0.463
keeper-0.457
Token something
Token ID1223
Feature activation+8.14e+00

Pos logit contributions
aster+0.050
e+0.048
One+0.044
 rest+0.043
 spring+0.040
Neg logit contributions
ometer-0.080
ounded-0.080
oke-0.078
keeper-0.075
rier-0.075
Token will
Token ID481
Feature activation+2.55e+00

Pos logit contributions
ometer+0.559
ounded+0.538
rier+0.526
oke+0.526
keeper+0.518
Neg logit contributions
 rest-0.342
e-0.334
aster-0.334
One-0.312
 play-0.305
Token change
Token ID1487
Feature activation-2.75e+00

Pos logit contributions
ounded+0.166
ometer+0.162
keeper+0.160
rier+0.158
 ashamed+0.153
Neg logit contributions
aster-0.110
e-0.107
P-0.094
One-0.091
 recess-0.089
Token your
Token ID534
Feature activation+4.80e+00

Pos logit contributions
aster+0.174
e+0.167
One+0.155
 rest+0.153
 spring+0.145
Neg logit contributions
ounded-0.196
ometer-0.196
oke-0.188
keeper-0.182
rier-0.180
Token tomatoes
Token ID23972
Feature activation+6.00e+00

Pos logit contributions
ounded+0.280
ometer+0.265
oke+0.264
keeper+0.246
rier+0.242
Neg logit contributions
aster-0.262
 rest-0.255
 time-0.244
e-0.242
 spring-0.242
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
ometer+0.400
oke+0.394
ounded+0.388
rier+0.380
keeper+0.375
Neg logit contributions
aster-0.258
e-0.240
One-0.229
wered-0.210
 rest-0.204
Token They
Token ID1119
Feature activation-4.59e-01

Pos logit contributions
aster+0.091
e+0.088
One+0.083
 rest+0.081
 spring+0.077
Neg logit contributions
ometer-0.077
ounded-0.077
oke-0.074
keeper-0.071
rier-0.071
Token are
Token ID389
Feature activation-1.24e+00

Pos logit contributions
aster+0.018
One+0.016
e+0.015
 recess+0.015
wered+0.014
Neg logit contributions
ometer-0.031
oke-0.031
ounded-0.031
 understands-0.030
keeper-0.029
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
aster+0.059
e+0.057
One+0.052
 rest+0.052
 spring+0.050
Neg logit contributions
ounded-0.080
ometer-0.078
oke-0.075
keeper-0.074
 understands-0.072
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
ounded+1.221
ometer+1.220
oke+1.186
keeper+1.140
 puzzled+1.138
Neg logit contributions
aster-0.581
e-0.564
One-0.506
 rest-0.496
 spring-0.450
Token We
Token ID775
Feature activation+6.14e+00

Pos logit contributions
aster+0.100
e+0.098
One+0.092
 rest+0.089
 spring+0.086
Neg logit contributions
ounded-0.069
ometer-0.068
oke-0.065
keeper-0.063
rier-0.063
Token planted
Token ID18969
Feature activation+9.15e-01

Pos logit contributions
ounded+0.375
oke+0.374
ometer+0.372
 understands+0.357
 ashamed+0.351
Neg logit contributions
aster-0.281
One-0.259
e-0.237
wered-0.231
 recess-0.229
Token them
Token ID606
Feature activation-2.27e+00

Pos logit contributions
ounded+0.067
ometer+0.067
oke+0.065
keeper+0.063
 puzzled+0.062
Neg logit contributions
aster-0.035
e-0.033
One-0.030
 rest-0.029
 spring-0.027
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
aster+0.098
e+0.093
One+0.086
 rest+0.082
 spring+0.076
Neg logit contributions
ometer-0.152
ounded-0.151
oke-0.147
keeper-0.144
rier-0.143
Token
Token ID198
Feature activation-2.23e-01

Pos logit contributions
ometer+0.335
ounded+0.332
oke+0.324
keeper+0.315
 understands+0.311
Neg logit contributions
aster-0.211
e-0.202
One-0.184
 rest-0.178
 spring-0.166
Token
Token ID198
Feature activation-2.31e-01

Pos logit contributions
aster+0.010
e+0.010
 rest+0.009
One+0.009
 recess+0.008
Neg logit contributions
ometer-0.014
ounded-0.014
oke-0.014
 understands-0.013
keeper-0.013
TokenBilly
Token ID49640
Feature activation-3.87e+00

Pos logit contributions
aster+0.010
e+0.010
 rest+0.009
 recess+0.009
 spring+0.008
Neg logit contributions
ounded-0.015
ometer-0.015
oke-0.014
 understands-0.014
keeper-0.014
Token was
Token ID373
Feature activation+1.55e+00

Pos logit contributions
e+0.193
aster+0.186
One+0.172
 rest+0.169
 spring+0.160
Neg logit contributions
ometer-0.233
rier-0.232
ounded-0.231
oke-0.230
keeper-0.211
Token ast
Token ID6468
Feature activation-5.38e+00

Pos logit contributions
ometer+0.094
oke+0.091
ounded+0.090
keeper+0.085
rier+0.084
Neg logit contributions
e-0.082
aster-0.079
One-0.072
 rest-0.071
 spring-0.069
Tokenounded
Token ID6302
Feature activation+1.64e+01

Pos logit contributions
he+0.410
One+0.408
aster+0.405
 go+0.383
 rest+0.383
Neg logit contributions
 teeth-0.142
 eyes-0.116
sed-0.116
 understands-0.108
rier-0.102
Token!
Token ID0
Feature activation+1.19e+00

Pos logit contributions
ometer+1.311
ounded+1.299
oke+1.276
 understands+1.204
rier+1.201
Neg logit contributions
e-0.483
aster-0.474
One-0.435
 rest-0.402
 time-0.384
Token He
Token ID679
Feature activation+2.19e+00

Pos logit contributions
ounded+0.071
rier+0.068
oke+0.066
ener+0.066
ometer+0.065
Neg logit contributions
aster-0.055
One-0.053
Once-0.049
 rest-0.049
e-0.047
Token used
Token ID973
Feature activation-3.33e+00

Pos logit contributions
rier+0.147
ometer+0.140
ounded+0.135
oke+0.134
keeper+0.122
Neg logit contributions
e-0.106
aster-0.097
One-0.095
 read-0.093
 play-0.090
Token the
Token ID262
Feature activation+8.17e-01

Pos logit contributions
aster+0.103
e+0.095
One+0.086
 rest+0.084
 spring+0.078
Neg logit contributions
ounded-0.268
ometer-0.268
oke-0.266
rier-0.255
keeper-0.251
Token magnet
Token ID19972
Feature activation+3.63e+00

Pos logit contributions
oke+0.049
ounded+0.048
ometer+0.048
atted+0.045
rier+0.045
Neg logit contributions
 rest-0.041
One-0.039
aster-0.039
 end-0.038
 time-0.037
Token.
Token ID13
Feature activation-1.52e+00

Pos logit contributions
aster+0.151
e+0.145
One+0.134
 rest+0.128
 spring+0.120
Neg logit contributions
ometer-0.200
ounded-0.197
oke-0.191
keeper-0.188
rier-0.187
Token The
Token ID383
Feature activation+3.82e+00

Pos logit contributions
aster+0.103
e+0.101
One+0.095
 rest+0.093
 spring+0.090
Neg logit contributions
ounded-0.065
ometer-0.064
oke-0.062
keeper-0.059
rier-0.058
Token juice
Token ID13135
Feature activation-3.11e+00

Pos logit contributions
ometer+0.232
ounded+0.227
oke+0.222
keeper+0.218
rier+0.216
Neg logit contributions
aster-0.189
e-0.183
One-0.168
 rest-0.158
 spring-0.149
Token is
Token ID318
Feature activation+3.52e+00

Pos logit contributions
aster+0.177
e+0.172
One+0.160
 rest+0.159
 spring+0.152
Neg logit contributions
ounded-0.163
ometer-0.163
oke-0.157
keeper-0.150
rier-0.150
Token not
Token ID407
Feature activation-3.36e+00

Pos logit contributions
ounded+0.223
ometer+0.222
oke+0.215
keeper+0.207
rier+0.204
Neg logit contributions
aster-0.168
e-0.161
One-0.150
 rest-0.148
 spring-0.143
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
aster+0.157
e+0.152
 rest+0.150
 spring+0.139
 time+0.138
Neg logit contributions
ounded-0.216
ometer-0.211
oke-0.206
keeper-0.200
 understands-0.195
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
ounded+1.134
ometer+1.133
oke+1.095
keeper+1.059
 puzzled+1.053
Neg logit contributions
aster-0.668
e-0.649
One-0.593
 rest-0.583
 spring-0.533
Token We
Token ID775
Feature activation+6.12e+00

Pos logit contributions
aster+0.100
e+0.098
One+0.092
 rest+0.090
 spring+0.087
Neg logit contributions
ounded-0.063
ometer-0.062
oke-0.060
keeper-0.057
rier-0.057
Token have
Token ID423
Feature activation-3.91e+00

Pos logit contributions
ometer+0.374
ounded+0.363
oke+0.359
 understands+0.354
keeper+0.341
Neg logit contributions
aster-0.291
One-0.261
e-0.253
 recess-0.241
wered-0.240
Token to
Token ID284
Feature activation+4.31e+00

Pos logit contributions
aster+0.166
e+0.155
 rest+0.150
One+0.144
 spring+0.134
Neg logit contributions
ounded-0.286
ometer-0.281
oke-0.281
rier-0.269
keeper-0.257
Token give
Token ID1577
Feature activation-3.93e+00

Pos logit contributions
 ashamed+0.213
 dear+0.212
 son+0.208
 darling+0.204
 mouth+0.198
Neg logit contributions
he-0.187
One-0.184
P-0.175
 recess-0.173
K-0.168
Tokenally
Token ID453
Feature activation-6.60e-01

Pos logit contributions
aster+0.487
e+0.478
One+0.436
 rest+0.427
 spring+0.408
Neg logit contributions
ounded-0.443
ometer-0.435
oke-0.433
 understands-0.405
keeper-0.404
Token,
Token ID11
Feature activation-7.95e-01

Pos logit contributions
e+0.020
aster+0.018
 rest+0.017
 spring+0.015
 play+0.015
Neg logit contributions
ounded-0.055
rier-0.052
ometer-0.052
oke-0.051
 puzzled-0.050
Token that
Token ID326
Feature activation+2.84e+00

Pos logit contributions
aster+0.037
e+0.035
 rest+0.033
One+0.032
 spring+0.031
Neg logit contributions
ometer-0.051
ounded-0.051
oke-0.049
rier-0.048
keeper-0.048
Token's
Token ID338
Feature activation+5.51e+00

Pos logit contributions
ounded+0.163
ometer+0.160
oke+0.158
rier+0.157
 puzzled+0.149
Neg logit contributions
e-0.147
 rest-0.146
aster-0.144
One-0.138
 spring-0.136
Token not
Token ID407
Feature activation-3.20e+00

Pos logit contributions
ometer+0.375
ounded+0.371
oke+0.359
keeper+0.352
rier+0.350
Neg logit contributions
aster-0.231
e-0.222
One-0.201
 rest-0.193
 spring-0.175
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
aster+0.153
e+0.147
One+0.137
 rest+0.135
 spring+0.127
Neg logit contributions
ometer-0.201
ounded-0.200
oke-0.196
keeper-0.188
 understands-0.187
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
ometer+1.140
ounded+1.135
oke+1.107
keeper+1.071
rier+1.067
Neg logit contributions
aster-0.661
e-0.635
One-0.578
 rest-0.563
 spring-0.519
Token Leave
Token ID17446
Feature activation-6.65e-01

Pos logit contributions
e+0.058
aster+0.055
One+0.052
 rest+0.050
 play+0.050
Neg logit contributions
ounded-0.053
 understands-0.049
rier-0.049
oke-0.049
 puzzled-0.048
Token it
Token ID340
Feature activation+3.68e+00

Pos logit contributions
aster+0.029
e+0.028
One+0.025
 rest+0.025
 spring+0.023
Neg logit contributions
ometer-0.045
ounded-0.045
oke-0.044
rier-0.042
keeper-0.042
Token where
Token ID810
Feature activation+6.22e+00

Pos logit contributions
ounded+0.196
ometer+0.189
keeper+0.180
oke+0.179
rier+0.177
Neg logit contributions
aster-0.212
e-0.198
One-0.188
wered-0.178
 rest-0.172
Token it
Token ID340
Feature activation+3.68e+00

Pos logit contributions
ometer+0.487
ounded+0.480
oke+0.473
rier+0.463
 puzzled+0.455
Neg logit contributions
aster-0.202
e-0.189
 rest-0.188
One-0.163
 spring-0.162
Token "
Token ID366
Feature activation+1.06e+01

Pos logit contributions
aster+0.075
One+0.071
e+0.069
 spring+0.069
 rest+0.068
Neg logit contributions
ounded-0.052
ometer-0.047
rier-0.047
ishes-0.046
keeper-0.045
TokenNo
Token ID2949
Feature activation-3.49e+00

Pos logit contributions
ounded+0.690
ometer+0.679
keeper+0.662
oke+0.647
 understands+0.642
Neg logit contributions
aster-0.455
e-0.423
 rest-0.388
 recess-0.381
 spring-0.361
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
aster+0.123
e+0.114
 rest+0.109
One+0.108
 spring+0.098
Neg logit contributions
ounded-0.266
ometer-0.265
keeper-0.254
oke-0.253
rier-0.252
Token this
Token ID428
Feature activation+3.97e+00

Pos logit contributions
aster+0.052
e+0.049
One+0.045
 rest+0.045
 spring+0.042
Neg logit contributions
ometer-0.074
ounded-0.074
oke-0.071
keeper-0.069
rier-0.069
Token is
Token ID318
Feature activation+3.60e+00

Pos logit contributions
ounded+0.198
ometer+0.197
oke+0.190
keeper+0.183
rier+0.182
Neg logit contributions
aster-0.240
e-0.232
One-0.220
 rest-0.215
 spring-0.206
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
ometer+0.212
ounded+0.211
oke+0.205
keeper+0.198
rier+0.196
Neg logit contributions
aster-0.187
e-0.181
One-0.170
 rest-0.165
 spring-0.158
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
ometer+1.178
ounded+1.171
oke+1.144
keeper+1.114
rier+1.109
Neg logit contributions
aster-0.627
e-0.592
One-0.539
 rest-0.515
 spring-0.476
Token Go
Token ID1514
Feature activation-4.62e+00

Pos logit contributions
aster+0.098
e+0.098
One+0.090
 rest+0.088
 spring+0.084
Neg logit contributions
ounded-0.069
ometer-0.066
oke-0.064
rier-0.062
keeper-0.062
Token away
Token ID1497
Feature activation+4.24e+00

Pos logit contributions
aster+0.265
wered+0.239
e+0.231
One+0.227
 rest+0.219
Neg logit contributions
ounded-0.245
ometer-0.240
keeper-0.228
 understands-0.205
oke-0.205
Token!"
Token ID2474
Feature activation+6.72e+00

Pos logit contributions
ounded+0.272
ometer+0.271
oke+0.263
 puzzled+0.252
 understands+0.252
Neg logit contributions
aster-0.195
e-0.190
One-0.176
 rest-0.171
 spring-0.163
Token
Token ID198
Feature activation-3.43e-01

Pos logit contributions
ometer+0.452
ounded+0.449
oke+0.438
keeper+0.425
rier+0.422
Neg logit contributions
aster-0.292
e-0.280
One-0.257
 rest-0.248
 spring-0.231
Token they
Token ID484
Feature activation-9.41e-01

Pos logit contributions
ometer+0.078
ounded+0.078
oke+0.075
keeper+0.074
 understands+0.073
Neg logit contributions
aster-0.064
e-0.063
One-0.057
 rest-0.054
 spring-0.051
Token arrived
Token ID5284
Feature activation-2.49e+00

Pos logit contributions
aster+0.049
e+0.047
One+0.044
 rest+0.041
 spring+0.039
Neg logit contributions
ounded-0.054
oke-0.051
ometer-0.050
 understands-0.050
keeper-0.050
Token they
Token ID484
Feature activation-9.58e-01

Pos logit contributions
e+0.060
aster+0.060
 rest+0.059
 recess+0.056
 play+0.055
Neg logit contributions
ometer-0.200
oke-0.194
ounded-0.188
rier-0.187
 understands-0.181
Token were
Token ID547
Feature activation-1.85e+00

Pos logit contributions
aster+0.052
e+0.051
 rest+0.048
One+0.048
 play+0.046
Neg logit contributions
ometer-0.056
rier-0.053
ounded-0.053
oke-0.052
keeper-0.049
Token ast
Token ID6468
Feature activation-5.23e+00

Pos logit contributions
aster+0.090
e+0.089
One+0.081
 rest+0.080
 spring+0.077
Neg logit contributions
ometer-0.115
ounded-0.112
oke-0.110
keeper-0.106
rier-0.106
Tokenounded
Token ID6302
Feature activation+1.64e+01

Pos logit contributions
One+0.396
he+0.390
 go+0.390
 walk+0.386
 play+0.386
Neg logit contributions
 teeth-0.128
 understands-0.101
rier-0.101
 claws-0.098
sed-0.094
Token,
Token ID11
Feature activation-6.80e-01

Pos logit contributions
ometer+1.359
oke+1.297
ounded+1.290
rier+1.250
 understands+1.218
Neg logit contributions
e-0.464
One-0.416
 time-0.402
aster-0.395
 rest-0.370
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
aster+0.026
e+0.025
 rest+0.024
One+0.024
 spring+0.023
Neg logit contributions
ometer-0.050
oke-0.048
ounded-0.047
rier-0.046
keeper-0.045
Token land
Token ID1956
Feature activation-7.88e+00

Pos logit contributions
ometer+0.080
ounded+0.080
oke+0.078
keeper+0.075
rier+0.075
Neg logit contributions
aster-0.064
e-0.061
One-0.058
 rest-0.056
 spring-0.053
Token was
Token ID373
Feature activation+1.92e+00

Pos logit contributions
e+0.311
aster+0.299
 rest+0.273
 spring+0.271
 end+0.270
Neg logit contributions
ometer-0.530
ounded-0.518
rier-0.502
oke-0.484
 understands-0.479
Token very
Token ID845
Feature activation-2.92e+00

Pos logit contributions
ometer+0.128
ounded+0.126
oke+0.125
rier+0.118
 understands+0.118
Neg logit contributions
aster-0.084
e-0.082
One-0.077
 rest-0.074
 spring-0.072
Token not
Token ID407
Feature activation-3.41e+00

Pos logit contributions
aster+0.061
e+0.059
One+0.055
 rest+0.054
 spring+0.051
Neg logit contributions
ometer-0.071
ounded-0.070
oke-0.068
keeper-0.066
rier-0.066
Token yours
Token ID12431
Feature activation+7.34e+00

Pos logit contributions
aster+0.165
e+0.164
One+0.149
 rest+0.148
 spring+0.139
Neg logit contributions
ometer-0.216
ounded-0.212
oke-0.203
keeper-0.201
rier-0.197
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
ometer+0.540
ounded+0.536
oke+0.525
keeper+0.510
rier+0.508
Neg logit contributions
aster-0.269
e-0.254
One-0.230
 rest-0.220
 spring-0.202
Token They
Token ID1119
Feature activation-3.25e-01

Pos logit contributions
aster+0.080
e+0.079
One+0.074
 rest+0.071
 spring+0.068
Neg logit contributions
ounded-0.067
ometer-0.065
oke-0.063
rier-0.061
keeper-0.061
Token are
Token ID389
Feature activation-1.20e+00

Pos logit contributions
aster+0.012
e+0.011
One+0.010
 rest+0.009
wered+0.009
Neg logit contributions
ounded-0.024
ometer-0.024
oke-0.023
 understands-0.023
keeper-0.022
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
aster+0.065
e+0.063
One+0.059
 rest+0.058
 spring+0.055
Neg logit contributions
ometer-0.068
ounded-0.067
oke-0.065
keeper-0.062
rier-0.062
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
ounded+1.223
ometer+1.222
oke+1.188
keeper+1.143
rier+1.142
Neg logit contributions
aster-0.571
e-0.556
One-0.499
 rest-0.484
 spring-0.440
Token We
Token ID775
Feature activation+6.05e+00

Pos logit contributions
aster+0.080
e+0.080
One+0.075
 rest+0.072
 spring+0.069
Neg logit contributions
ounded-0.067
ometer-0.065
oke-0.063
rier-0.061
keeper-0.060
Token made
Token ID925
Feature activation-1.19e+00

Pos logit contributions
ounded+0.359
ometer+0.358
oke+0.350
 understands+0.341
keeper+0.331
Neg logit contributions
aster-0.300
One-0.260
e-0.259
wered-0.241
 recess-0.239
Token them
Token ID606
Feature activation-2.28e+00

Pos logit contributions
aster+0.047
e+0.045
One+0.041
 rest+0.040
 spring+0.036
Neg logit contributions
ometer-0.085
ounded-0.084
oke-0.082
keeper-0.079
rier-0.079
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
aster+0.111
e+0.105
One+0.099
 rest+0.091
wered+0.089
Neg logit contributions
ounded-0.139
ometer-0.138
keeper-0.133
rier-0.132
oke-0.132
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
e+0.049
aster+0.047
 rest+0.040
One+0.039
 spring+0.036
Neg logit contributions
ounded-0.118
ometer-0.113
oke-0.111
rier-0.109
keeper-0.109
Token That
Token ID1320
Feature activation+6.85e+00

Pos logit contributions
e+0.077
aster+0.076
One+0.071
 rest+0.069
 spring+0.067
Neg logit contributions
ounded-0.060
ometer-0.056
oke-0.056
rier-0.055
keeper-0.054
Token balloon
Token ID21190
Feature activation+2.97e-01

Pos logit contributions
ounded+0.390
oke+0.379
ometer+0.369
rier+0.365
keeper+0.353
Neg logit contributions
aster-0.366
e-0.365
 rest-0.346
One-0.343
 spring-0.334
Token is
Token ID318
Feature activation+3.74e+00

Pos logit contributions
ounded+0.017
ometer+0.016
rier+0.016
 puzzled+0.016
oke+0.016
Neg logit contributions
e-0.016
aster-0.015
 rest-0.014
 spring-0.014
 play-0.014
Token not
Token ID407
Feature activation-3.30e+00

Pos logit contributions
ounded+0.249
ometer+0.246
oke+0.244
rier+0.236
keeper+0.235
Neg logit contributions
aster-0.164
e-0.158
One-0.151
 rest-0.146
 spring-0.144
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
aster+0.149
e+0.146
 rest+0.138
One+0.135
 spring+0.132
Neg logit contributions
ounded-0.212
ometer-0.211
oke-0.209
keeper-0.204
 understands-0.198
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
ounded+1.148
ometer+1.145
oke+1.112
keeper+1.074
rier+1.074
Neg logit contributions
aster-0.640
e-0.629
One-0.574
 rest-0.565
 spring-0.515
Token We
Token ID775
Feature activation+6.18e+00

Pos logit contributions
e+0.078
aster+0.077
One+0.073
 rest+0.069
 spring+0.067
Neg logit contributions
ounded-0.056
ometer-0.052
oke-0.051
rier-0.050
keeper-0.050
Token must
Token ID1276
Feature activation+8.39e+00

Pos logit contributions
ometer+0.351
ounded+0.347
oke+0.338
keeper+0.323
 understands+0.320
Neg logit contributions
aster-0.332
e-0.314
One-0.298
 rest-0.285
 spring-0.272
Token not
Token ID407
Feature activation-3.29e+00

Pos logit contributions
ounded+0.409
 ashamed+0.409
 dear+0.408
ometer+0.405
 understands+0.394
Neg logit contributions
 recess-0.370
aster-0.363
P-0.351
e-0.347
One-0.343
Token take
Token ID1011
Feature activation-7.23e+00

Pos logit contributions
 recess+0.134
wered+0.132
aster+0.131
 weekends+0.122
 course+0.121
Neg logit contributions
ometer-0.191
ounded-0.185
 ashamed-0.176
oke-0.175
keeper-0.175
Token not
Token ID407
Feature activation-3.37e+00

Pos logit contributions
ometer+0.423
ounded+0.400
 ashamed+0.386
 dear+0.383
 understands+0.381
Neg logit contributions
aster-0.345
e-0.341
P-0.327
 recess-0.324
he-0.315
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
aster+0.123
e+0.111
 recess+0.105
 course+0.100
P+0.093
Neg logit contributions
ometer-0.239
ounded-0.236
oke-0.218
 teeth-0.212
keeper-0.212
Token It
Token ID632
Feature activation+6.89e+00

Pos logit contributions
e+0.090
aster+0.090
One+0.080
 rest+0.079
 spring+0.077
Neg logit contributions
ounded-0.081
oke-0.077
rier-0.074
ometer-0.074
 understands-0.074
Token is
Token ID318
Feature activation+3.50e+00

Pos logit contributions
ounded+0.391
rier+0.388
ometer+0.379
oke+0.370
 puzzled+0.363
Neg logit contributions
e-0.365
aster-0.334
 rest-0.329
One-0.326
 spring-0.316
Token not
Token ID407
Feature activation-3.39e+00

Pos logit contributions
ounded+0.226
ometer+0.223
oke+0.221
keeper+0.212
rier+0.210
Neg logit contributions
aster-0.164
e-0.157
One-0.146
 rest-0.143
 spring-0.139
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
aster+0.160
e+0.155
 rest+0.147
One+0.141
 spring+0.141
Neg logit contributions
ounded-0.211
ometer-0.210
oke-0.210
keeper-0.203
 understands-0.200
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
ometer+1.153
ounded+1.152
oke+1.120
keeper+1.081
 puzzled+1.078
Neg logit contributions
aster-0.636
e-0.623
One-0.566
 rest-0.558
 spring-0.506
Token It
Token ID632
Feature activation+6.86e+00

Pos logit contributions
e+0.097
aster+0.096
One+0.089
 rest+0.085
 spring+0.083
Neg logit contributions
ounded-0.072
ometer-0.068
oke-0.067
rier-0.067
 understands-0.065
Token is
Token ID318
Feature activation+3.48e+00

Pos logit contributions
ounded+0.364
rier+0.362
ometer+0.356
oke+0.341
 puzzled+0.335
Neg logit contributions
e-0.392
aster-0.360
 rest-0.355
One-0.348
 spring-0.338
Token dangerous
Token ID4923
Feature activation+8.29e+00

Pos logit contributions
ounded+0.223
ometer+0.220
oke+0.217
keeper+0.207
rier+0.206
Neg logit contributions
aster-0.165
e-0.159
One-0.147
 rest-0.145
 spring-0.141
Token."
Token ID526
Feature activation+4.93e+00

Pos logit contributions
ounded+0.600
ometer+0.598
oke+0.581
 understands+0.565
keeper+0.565
Neg logit contributions
aster-0.307
e-0.301
One-0.275
 rest-0.266
 spring-0.246
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
aster+0.140
e+0.134
One+0.124
 rest+0.122
 spring+0.112
Neg logit contributions
ounded-0.245
ometer-0.243
oke-0.234
keeper-0.229
 puzzled-0.228
Token The
Token ID383
Feature activation+3.74e+00

Pos logit contributions
aster+0.102
e+0.100
One+0.093
 rest+0.090
 spring+0.086
Neg logit contributions
ounded-0.083
ometer-0.078
oke-0.076
 puzzled-0.074
keeper-0.074
Token things
Token ID1243
Feature activation+6.21e+00

Pos logit contributions
ometer+0.185
ounded+0.182
oke+0.175
keeper+0.171
rier+0.169
Neg logit contributions
aster-0.230
e-0.222
One-0.207
 rest-0.200
 spring-0.192
Token are
Token ID389
Feature activation-1.40e+00

Pos logit contributions
ounded+0.436
ometer+0.435
oke+0.431
 understands+0.416
keeper+0.411
Neg logit contributions
aster-0.239
e-0.223
One-0.219
 rest-0.183
wered-0.181
Token not
Token ID407
Feature activation-3.46e+00

Pos logit contributions
aster+0.064
e+0.061
One+0.057
 rest+0.056
 spring+0.053
Neg logit contributions
ometer-0.090
ounded-0.090
oke-0.087
keeper-0.084
rier-0.083
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
aster+0.189
e+0.182
 rest+0.175
One+0.170
 spring+0.163
Neg logit contributions
ounded-0.193
ometer-0.192
oke-0.186
keeper-0.179
 understands-0.178
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
ometer+1.180
ounded+1.174
oke+1.148
keeper+1.117
rier+1.111
Neg logit contributions
aster-0.620
e-0.585
One-0.532
 rest-0.508
 spring-0.470
Token We
Token ID775
Feature activation+6.00e+00

Pos logit contributions
aster+0.101
e+0.099
One+0.093
 rest+0.091
 spring+0.086
Neg logit contributions
ounded-0.082
ometer-0.082
oke-0.078
keeper-0.075
rier-0.075
Token should
Token ID815
Feature activation+7.46e+00

Pos logit contributions
ometer+0.400
ounded+0.387
oke+0.384
 understands+0.371
keeper+0.368
Neg logit contributions
aster-0.255
One-0.224
e-0.222
 recess-0.202
wered-0.201
Token not
Token ID407
Feature activation-3.47e+00

Pos logit contributions
 dear+0.395
 understands+0.390
 ashamed+0.387
keeper+0.383
ometer+0.375
Neg logit contributions
aster-0.336
e-0.320
 recess-0.309
he-0.308
P-0.294
Token take
Token ID1011
Feature activation-7.44e+00

Pos logit contributions
 recess+0.134
aster+0.131
Now+0.121
 course+0.120
wered+0.118
Neg logit contributions
ometer-0.201
keeper-0.191
 teeth-0.186
 ashamed-0.183
ounded-0.179
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
aster+0.125
e+0.118
One+0.110
 rest+0.110
 spring+0.100
Neg logit contributions
ounded-0.257
ometer-0.252
oke-0.243
 puzzled-0.240
keeper-0.239
Token The
Token ID383
Feature activation+3.89e+00

Pos logit contributions
aster+0.095
e+0.092
One+0.087
 rest+0.086
 spring+0.082
Neg logit contributions
ometer-0.067
ounded-0.066
oke-0.064
keeper-0.061
rier-0.060
Token coins
Token ID10796
Feature activation+2.96e+00

Pos logit contributions
ometer+0.212
ounded+0.212
oke+0.205
keeper+0.200
rier+0.200
Neg logit contributions
aster-0.216
e-0.209
One-0.193
 rest-0.185
 spring-0.177
Token are
Token ID389
Feature activation-1.26e+00

Pos logit contributions
ometer+0.222
ounded+0.221
oke+0.214
keeper+0.210
rier+0.209
Neg logit contributions
aster-0.103
e-0.099
 rest-0.087
One-0.086
 spring-0.078
Token not
Token ID407
Feature activation-3.37e+00

Pos logit contributions
aster+0.065
e+0.063
One+0.059
 rest+0.058
 spring+0.056
Neg logit contributions
ounded-0.075
ometer-0.074
oke-0.072
 understands-0.069
keeper-0.068
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
aster+0.178
e+0.176
 rest+0.170
One+0.161
 spring+0.159
Neg logit contributions
ounded-0.189
ometer-0.187
oke-0.181
 understands-0.175
keeper-0.173
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
ounded+1.217
ometer+1.217
oke+1.181
keeper+1.144
rier+1.141
Neg logit contributions
aster-0.573
e-0.558
One-0.503
 rest-0.492
 spring-0.442
Token They
Token ID1119
Feature activation-4.33e-01

Pos logit contributions
aster+0.092
e+0.090
One+0.085
 rest+0.082
 spring+0.079
Neg logit contributions
ounded-0.068
ometer-0.068
oke-0.065
keeper-0.062
rier-0.062
Token are
Token ID389
Feature activation-1.28e+00

Pos logit contributions
aster+0.016
e+0.014
One+0.013
 rest+0.012
 recess+0.011
Neg logit contributions
ometer-0.032
ounded-0.032
oke-0.031
keeper-0.030
 understands-0.030
Token for
Token ID329
Feature activation+5.42e+00

Pos logit contributions
aster+0.071
e+0.070
 rest+0.065
One+0.064
 spring+0.063
Neg logit contributions
ounded-0.071
ometer-0.070
oke-0.067
keeper-0.063
 understands-0.063
Token the
Token ID262
Feature activation+8.25e-01

Pos logit contributions
ometer+0.328
ounded+0.325
oke+0.321
keeper+0.310
rier+0.308
Neg logit contributions
aster-0.261
e-0.251
One-0.232
 rest-0.221
he-0.212
Token"
Token ID1
Feature activation+3.00e+00

Pos logit contributions
aster+0.018
e+0.017
 rest+0.016
wered+0.014
 spring+0.014
Neg logit contributions
ounded-0.031
keeper-0.030
ometer-0.030
oke-0.030
 understands-0.029
TokenNo
Token ID2949
Feature activation-3.34e+00

Pos logit contributions
ounded+0.191
keeper+0.181
ometer+0.180
 understands+0.180
oke+0.180
Neg logit contributions
aster-0.136
e-0.134
 rest-0.120
One-0.109
 arrive-0.106
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
aster+0.129
e+0.121
One+0.112
 rest+0.110
 spring+0.101
Neg logit contributions
ometer-0.244
ounded-0.243
oke-0.234
keeper-0.231
rier-0.230
Token it
Token ID340
Feature activation+3.46e+00

Pos logit contributions
aster+0.055
e+0.051
 rest+0.047
One+0.045
 spring+0.044
Neg logit contributions
ometer-0.084
ounded-0.081
oke-0.078
keeper-0.077
 puzzled-0.076
Token is
Token ID318
Feature activation+3.46e+00

Pos logit contributions
ounded+0.189
ometer+0.187
rier+0.178
oke+0.177
keeper+0.176
Neg logit contributions
e-0.193
aster-0.185
 rest-0.174
One-0.173
 spring-0.166
Token ours
Token ID16903
Feature activation+1.64e+01

Pos logit contributions
ounded+0.197
ometer+0.196
oke+0.192
keeper+0.182
rier+0.181
Neg logit contributions
aster-0.186
e-0.184
One-0.175
 rest-0.172
 spring-0.167
Token now
Token ID783
Feature activation+6.07e-02

Pos logit contributions
ometer+1.092
ounded+1.087
oke+1.057
keeper+1.015
rier+1.011
Neg logit contributions
aster-0.705
e-0.686
One-0.630
 rest-0.611
 spring-0.567
Token!"
Token ID2474
Feature activation+6.58e+00

Pos logit contributions
ometer+0.004
ounded+0.004
oke+0.003
keeper+0.003
rier+0.003
Neg logit contributions
aster-0.003
e-0.003
One-0.003
 rest-0.003
 spring-0.003
Token one
Token ID530
Feature activation+1.54e+00

Pos logit contributions
ometer+0.452
ounded+0.447
oke+0.432
rier+0.421
keeper+0.415
Neg logit contributions
aster-0.289
e-0.263
One-0.252
 rest-0.242
 spring-0.227
Token of
Token ID286
Feature activation-1.91e+00

Pos logit contributions
ometer+0.103
ounded+0.103
oke+0.099
keeper+0.097
rier+0.097
Neg logit contributions
aster-0.067
e-0.064
One-0.059
 rest-0.057
 spring-0.053
Token the
Token ID262
Feature activation+7.42e-01

Pos logit contributions
aster+0.084
e+0.083
One+0.078
 rest+0.074
 walk+0.070
Neg logit contributions
rier-0.109
oke-0.108
ounded-0.106
 understands-0.106
keeper-0.106
Token "
Token ID366
Feature activation+1.06e+01

Pos logit contributions
aster+0.071
One+0.069
 spring+0.067
e+0.067
 rest+0.065
Neg logit contributions
ounded-0.044
ometer-0.040
rier-0.040
keeper-0.040
oke-0.037
TokenNo
Token ID2949
Feature activation-3.39e+00

Pos logit contributions
ometer+0.671
ounded+0.670
keeper+0.641
 understands+0.625
oke+0.624
Neg logit contributions
aster-0.446
e-0.415
 rest-0.397
 recess-0.379
wered-0.377
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
aster+0.130
e+0.122
One+0.113
 rest+0.111
 spring+0.102
Neg logit contributions
ounded-0.249
ometer-0.248
oke-0.238
keeper-0.236
rier-0.234
Token it
Token ID340
Feature activation+3.50e+00

Pos logit contributions
aster+0.046
e+0.043
One+0.040
 rest+0.039
 spring+0.036
Neg logit contributions
ometer-0.070
ounded-0.070
oke-0.067
keeper-0.065
rier-0.065
Token is
Token ID318
Feature activation+3.58e+00

Pos logit contributions
ounded+0.183
ometer+0.181
oke+0.173
rier+0.172
keeper+0.170
Neg logit contributions
e-0.202
aster-0.199
One-0.185
 rest-0.183
 spring-0.175
Token ours
Token ID16903
Feature activation+1.63e+01

Pos logit contributions
ounded+0.195
ometer+0.195
oke+0.190
keeper+0.181
rier+0.179
Neg logit contributions
aster-0.198
e-0.197
One-0.187
 rest-0.184
 spring-0.179
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
ometer+1.188
ounded+1.180
oke+1.155
keeper+1.126
rier+1.120
Neg logit contributions
aster-0.614
e-0.578
One-0.525
 rest-0.501
 spring-0.462
Token We
Token ID775
Feature activation+6.04e+00

Pos logit contributions
e+0.100
aster+0.099
One+0.092
 rest+0.090
 spring+0.086
Neg logit contributions
ounded-0.061
ometer-0.058
oke-0.057
rier-0.055
 puzzled-0.054
Token found
Token ID1043
Feature activation+4.35e+00

Pos logit contributions
ometer+0.359
ounded+0.356
oke+0.346
 understands+0.339
 ashamed+0.324
Neg logit contributions
aster-0.298
One-0.263
e-0.260
wered-0.244
 recess-0.243
Token it
Token ID340
Feature activation+3.48e+00

Pos logit contributions
ounded+0.326
oke+0.321
ometer+0.321
 puzzled+0.302
keeper+0.293
Neg logit contributions
e-0.165
aster-0.164
 rest-0.147
One-0.135
 spring-0.132
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
ometer+0.235
ounded+0.233
oke+0.227
keeper+0.222
rier+0.220
Neg logit contributions
aster-0.152
e-0.139
One-0.128
 rest-0.122
wered-0.118
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
aster+0.036
e+0.036
One+0.031
 rest+0.030
 spring+0.027
Neg logit contributions
ounded-0.093
ometer-0.092
oke-0.090
rier-0.088
keeper-0.088
Token Those
Token ID5845
Feature activation+6.28e+00

Pos logit contributions
e+0.060
aster+0.058
One+0.054
 rest+0.054
 spring+0.052
Neg logit contributions
ounded-0.058
oke-0.055
 understands-0.054
rier-0.054
ometer-0.053
Token flowers
Token ID12734
Feature activation+2.98e-01

Pos logit contributions
ometer+0.447
ounded+0.444
oke+0.434
keeper+0.421
rier+0.420
Neg logit contributions
aster-0.244
e-0.234
One-0.213
 rest-0.205
 spring-0.190
Token are
Token ID389
Feature activation-1.10e+00

Pos logit contributions
ometer+0.021
ounded+0.021
oke+0.021
 understands+0.020
keeper+0.020
Neg logit contributions
aster-0.011
e-0.010
One-0.010
 rest-0.008
 recess-0.008
Token not
Token ID407
Feature activation-3.33e+00

Pos logit contributions
aster+0.049
e+0.048
One+0.045
 rest+0.044
 spring+0.043
Neg logit contributions
ounded-0.072
ometer-0.072
oke-0.070
keeper-0.067
rier-0.067
Token ours
Token ID16903
Feature activation+1.63e+01

Pos logit contributions
aster+0.173
e+0.169
 rest+0.158
One+0.156
 spring+0.151
Neg logit contributions
ometer-0.195
ounded-0.194
oke-0.188
keeper-0.180
 understands-0.178
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
ometer+1.154
ounded+1.146
oke+1.120
keeper+1.095
rier+1.088
Neg logit contributions
aster-0.650
e-0.607
One-0.554
 rest-0.522
 spring-0.490
Token They
Token ID1119
Feature activation-9.56e-02

Pos logit contributions
e+0.055
aster+0.053
One+0.051
 rest+0.048
 play+0.046
Neg logit contributions
ounded-0.063
ometer-0.059
 understands-0.059
oke-0.058
rier-0.057
Token belong
Token ID5594
Feature activation-3.32e-01

Pos logit contributions
aster+0.004
e+0.003
One+0.003
 rest+0.003
 spring+0.003
Neg logit contributions
ometer-0.007
ounded-0.007
oke-0.007
keeper-0.006
rier-0.006
Token to
Token ID284
Feature activation+4.35e+00

Pos logit contributions
e+0.015
aster+0.015
 rest+0.014
One+0.014
 spring+0.013
Neg logit contributions
ometer-0.021
oke-0.021
ounded-0.021
rier-0.020
 puzzled-0.020
Token someone
Token ID2130
Feature activation+2.51e+00

Pos logit contributions
ometer+0.304
ounded+0.302
oke+0.295
keeper+0.287
rier+0.285
Neg logit contributions
aster-0.174
e-0.165
One-0.151
 rest-0.145
 spring-0.134
Token yelled
Token ID22760
Feature activation+5.96e-01

Pos logit contributions
e+0.260
aster+0.243
 rest+0.222
One+0.221
 run+0.218
Neg logit contributions
ometer-0.291
ounded-0.282
rier-0.276
oke-0.270
keeper-0.258
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
ometer+0.047
ounded+0.047
oke+0.046
keeper+0.044
 understands+0.044
Neg logit contributions
aster-0.018
e-0.018
 rest-0.015
One-0.015
 spring-0.014
Token "
Token ID366
Feature activation+1.08e+01

Pos logit contributions
aster+0.095
e+0.091
One+0.090
 spring+0.085
 rest+0.084
Neg logit contributions
ounded-0.065
ometer-0.061
oke-0.058
keeper-0.054
rier-0.054
TokenBob
Token ID18861
Feature activation-8.15e+00

Pos logit contributions
ounded+0.600
oke+0.573
ometer+0.570
rier+0.569
 puzzled+0.560
Neg logit contributions
aster-0.486
 recess-0.461
 parties-0.456
 rest-0.442
e-0.418
Token is
Token ID318
Feature activation+3.57e+00

Pos logit contributions
e+0.419
aster+0.413
 rest+0.369
One+0.368
 spring+0.352
Neg logit contributions
ounded-0.481
ometer-0.479
oke-0.452
rier-0.449
keeper-0.449
Token ours
Token ID16903
Feature activation+1.63e+01

Pos logit contributions
ometer+0.201
ounded+0.200
oke+0.195
keeper+0.187
rier+0.187
Neg logit contributions
aster-0.191
e-0.183
One-0.173
 rest-0.167
 spring-0.157
Token!"
Token ID2474
Feature activation+6.75e+00

Pos logit contributions
ometer+1.007
ounded+1.002
oke+0.973
keeper+0.940
rier+0.935
Neg logit contributions
aster-0.791
e-0.762
One-0.707
 rest-0.687
 spring-0.646
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
ometer+0.472
ounded+0.470
oke+0.458
keeper+0.444
rier+0.443
Neg logit contributions
aster-0.273
e-0.260
One-0.239
 rest-0.229
 spring-0.213
Token
Token ID198
Feature activation-1.28e-01

Pos logit contributions
aster+0.006
e+0.006
 rest+0.006
One+0.005
 arrive+0.005
Neg logit contributions
ometer-0.008
oke-0.008
ounded-0.008
keeper-0.007
 understands-0.007
TokenThe
Token ID464
Feature activation-2.36e+00

Pos logit contributions
aster+0.005
 rest+0.004
e+0.004
 arrive+0.004
 recess+0.004
Neg logit contributions
keeper-0.009
ometer-0.009
oke-0.008
ounded-0.008
rier-0.008
Token dog
Token ID3290
Feature activation+2.07e+00

Pos logit contributions
aster+0.158
e+0.155
One+0.141
 rest+0.136
wered+0.133
Neg logit contributions
ometer-0.103
ounded-0.099
oke-0.097
 understands-0.096
keeper-0.095
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
ounded+0.299
ometer+0.298
keeper+0.288
oke+0.287
rier+0.284
Neg logit contributions
aster-0.177
e-0.172
One-0.160
 rest-0.143
 recess-0.139
Token bird
Token ID6512
Feature activation+1.76e+00

Pos logit contributions
ounded+0.066
ometer+0.065
oke+0.065
 understands+0.059
atted+0.059
Neg logit contributions
aster-0.053
e-0.053
 rest-0.052
One-0.050
 spring-0.049
Token and
Token ID290
Feature activation-1.49e+00

Pos logit contributions
ometer+0.117
ounded+0.116
oke+0.113
rier+0.110
keeper+0.109
Neg logit contributions
aster-0.078
e-0.075
One-0.069
 rest-0.067
 spring-0.063
Token said
Token ID531
Feature activation+6.59e+00

Pos logit contributions
aster+0.061
e+0.060
 rest+0.058
 spring+0.056
 read+0.056
Neg logit contributions
ometer-0.105
oke-0.104
rier-0.101
ounded-0.098
 understands-0.093
Token,
Token ID11
Feature activation-8.20e-01

Pos logit contributions
ounded+0.536
ometer+0.527
oke+0.513
outed+0.502
ishes+0.498
Neg logit contributions
e-0.157
aster-0.156
 rest-0.154
One-0.148
 go-0.145
Token "'
Token ID24018
Feature activation+1.63e+01

Pos logit contributions
One+0.053
aster+0.053
 spring+0.051
e+0.051
 rest+0.050
Neg logit contributions
ounded-0.033
ometer-0.033
oke-0.031
 understands-0.031
ishes-0.030
TokenHello
Token ID15496
Feature activation-1.00e+00

Pos logit contributions
ounded+0.946
keeper+0.933
ometer+0.905
oke+0.883
 understands+0.861
Neg logit contributions
aster-0.743
e-0.679
 rest-0.669
 recess-0.666
wered-0.642
Token,
Token ID11
Feature activation-7.90e-01

Pos logit contributions
e+0.038
aster+0.037
One+0.036
 spring+0.034
 rest+0.034
Neg logit contributions
ounded-0.073
ometer-0.071
oke-0.068
 understands-0.066
 puzzled-0.066
Token my
Token ID616
Feature activation+8.11e+00

Pos logit contributions
 rest+0.039
aster+0.039
e+0.038
 spring+0.038
One+0.037
Neg logit contributions
ometer-0.047
ounded-0.047
atted-0.042
outed-0.042
rier-0.041
Token name
Token ID1438
Feature activation+4.40e+00

Pos logit contributions
ounded+0.438
oke+0.420
ometer+0.397
 puzzled+0.382
rier+0.378
Neg logit contributions
e-0.459
 rest-0.447
aster-0.430
 spring-0.420
One-0.415
Token is
Token ID318
Feature activation+3.84e+00

Pos logit contributions
ometer+0.228
ounded+0.226
oke+0.220
keeper+0.211
rier+0.209
Neg logit contributions
aster-0.256
e-0.245
One-0.232
 rest-0.225
 spring-0.214
Token only
Token ID691
Feature activation-9.41e+00

Pos logit contributions
ometer+0.118
ounded+0.118
oke+0.115
keeper+0.112
rier+0.111
Neg logit contributions
aster-0.074
e-0.071
One-0.063
 rest-0.058
he-0.056
Token on
Token ID319
Feature activation-4.51e+00

Pos logit contributions
aster+0.463
e+0.446
One+0.416
 rest+0.408
 spring+0.384
Neg logit contributions
ometer-0.575
ounded-0.571
oke-0.556
keeper-0.537
rier-0.533
Token things
Token ID1243
Feature activation+6.26e+00

Pos logit contributions
aster+0.201
e+0.192
One+0.179
 rest+0.169
wered+0.159
Neg logit contributions
ometer-0.291
ounded-0.287
oke-0.282
rier-0.273
keeper-0.272
Token that
Token ID326
Feature activation+2.85e+00

Pos logit contributions
ometer+0.361
ounded+0.361
oke+0.355
 understands+0.337
keeper+0.332
Neg logit contributions
aster-0.331
e-0.311
One-0.293
wered-0.271
 rest-0.269
Token are
Token ID389
Feature activation-1.21e+00

Pos logit contributions
ometer+0.197
ounded+0.194
oke+0.191
 understands+0.185
keeper+0.184
Neg logit contributions
aster-0.116
e-0.107
One-0.103
 rest-0.090
wered-0.088
Token ours
Token ID16903
Feature activation+1.63e+01

Pos logit contributions
aster+0.053
e+0.050
One+0.048
 rest+0.046
 spring+0.044
Neg logit contributions
ometer-0.081
ounded-0.081
oke-0.079
keeper-0.076
rier-0.075
Token."
Token ID526
Feature activation+5.15e+00

Pos logit contributions
ounded+1.191
ometer+1.191
oke+1.160
rier+1.113
keeper+1.111
Neg logit contributions
aster-0.589
e-0.587
One-0.532
 rest-0.530
 spring-0.479
Token<|endoftext|>
Token ID50256
Feature activation-2.03e-01

Pos logit contributions
ounded+0.398
ometer+0.390
oke+0.379
rier+0.378
keeper+0.365
Neg logit contributions
aster-0.178
e-0.163
One-0.158
 rest-0.150
 spring-0.135
TokenAnna
Token ID31160
Feature activation-7.80e+00

Pos logit contributions
aster+0.013
e+0.012
 wait+0.011
wered+0.011
 rest+0.011
Neg logit contributions
oke-0.009
ometer-0.009
 understands-0.009
keeper-0.008
ounded-0.008
Token and
Token ID290
Feature activation-1.38e+00

Pos logit contributions
aster+0.350
e+0.341
One+0.310
 rest+0.303
 spring+0.285
Neg logit contributions
ometer-0.506
ounded-0.505
oke-0.490
rier-0.479
keeper-0.474
Token Ben
Token ID3932
Feature activation-4.66e+00

Pos logit contributions
aster+0.058
e+0.056
One+0.054
 rest+0.048
wered+0.046
Neg logit contributions
ometer-0.090
ounded-0.089
 understands-0.087
oke-0.087
keeper-0.086
Token not
Token ID407
Feature activation-4.52e+00

Pos logit contributions
ounded+0.156
ometer+0.155
oke+0.151
keeper+0.146
rier+0.144
Neg logit contributions
aster-0.101
e-0.097
One-0.088
 rest-0.086
 spring-0.083
Token available
Token ID1695
Feature activation+2.45e+00

Pos logit contributions
aster+0.178
e+0.171
 rest+0.158
 spring+0.150
 time+0.143
Neg logit contributions
ounded-0.328
ometer-0.320
oke-0.315
keeper-0.310
 understands-0.299
Token yet
Token ID1865
Feature activation+4.85e+00

Pos logit contributions
ometer+0.161
ounded+0.156
oke+0.154
rier+0.152
keeper+0.150
Neg logit contributions
aster-0.110
e-0.106
One-0.097
 rest-0.091
wered-0.086
Token,"
Token ID553
Feature activation+7.52e+00

Pos logit contributions
ounded+0.353
ometer+0.353
oke+0.344
keeper+0.334
rier+0.333
Neg logit contributions
aster-0.180
e-0.170
One-0.154
 rest-0.149
 spring-0.138
Token Grand
Token ID5675
Feature activation+3.18e-01

Pos logit contributions
ometer+0.486
ounded+0.485
oke+0.470
rier+0.454
keeper+0.453
Neg logit contributions
aster-0.345
e-0.330
One-0.304
 rest-0.295
 spring-0.276
Tokenad
Token ID324
Feature activation-1.66e+01

Pos logit contributions
ometer+0.021
ounded+0.021
oke+0.020
keeper+0.020
rier+0.019
Neg logit contributions
aster-0.014
e-0.014
One-0.013
 rest-0.012
 spring-0.011
Token said
Token ID531
Feature activation+5.92e+00

Pos logit contributions
e+1.125
aster+1.074
One+1.027
he+0.976
 end+0.944
Neg logit contributions
ometer-0.765
ounded-0.703
rier-0.641
keeper-0.626
keepers-0.582
Token as
Token ID355
Feature activation+2.63e+00

Pos logit contributions
ounded+0.453
ometer+0.447
oke+0.441
 understands+0.424
keeper+0.415
Neg logit contributions
aster-0.199
e-0.197
One-0.164
 rest-0.163
 spring-0.146
Token he
Token ID339
Feature activation-6.30e-01

Pos logit contributions
ometer+0.177
ounded+0.175
oke+0.171
keeper+0.167
rier+0.166
Neg logit contributions
aster-0.112
e-0.108
One-0.099
 rest-0.095
 spring-0.089
Token wiped
Token ID21122
Feature activation+1.75e+00

Pos logit contributions
aster+0.036
e+0.036
One+0.034
 rest+0.032
 spring+0.032
Neg logit contributions
ounded-0.033
ometer-0.033
rier-0.032
oke-0.031
keeper-0.029
Token the
Token ID262
Feature activation+7.71e-01

Pos logit contributions
ometer+0.131
ounded+0.131
oke+0.128
keeper+0.124
rier+0.123
Neg logit contributions
aster-0.063
e-0.060
One-0.053
 rest-0.051
 spring-0.047
Token words
Token ID2456
Feature activation+4.71e+00

Pos logit contributions
ounded+0.135
ometer+0.125
 puzzled+0.120
rier+0.119
oke+0.118
Neg logit contributions
aster-0.093
e-0.092
 rest-0.082
 play-0.080
 spring-0.079
Token like
Token ID588
Feature activation-1.63e+00

Pos logit contributions
ometer+0.313
ounded+0.312
oke+0.304
keeper+0.294
rier+0.293
Neg logit contributions
aster-0.204
e-0.195
One-0.180
 rest-0.174
 spring-0.162
Token "
Token ID366
Feature activation+1.01e+01

Pos logit contributions
aster+0.080
e+0.077
One+0.071
 rest+0.069
 spring+0.066
Neg logit contributions
ounded-0.100
ometer-0.099
oke-0.096
keeper-0.093
rier-0.092
Tokenab
Token ID397
Feature activation-5.36e-02

Pos logit contributions
ometer+0.625
ounded+0.588
oke+0.587
keeper+0.577
rier+0.545
Neg logit contributions
aster-0.479
e-0.437
 rest-0.437
wered-0.413
One-0.407
Tokenrac
Token ID11510
Feature activation+4.94e+00

Pos logit contributions
aster+0.003
e+0.003
One+0.003
 rest+0.003
 spring+0.002
Neg logit contributions
ometer-0.003
ounded-0.003
oke-0.003
rier-0.003
keeper-0.003
Tokenad
Token ID324
Feature activation-1.64e+01

Pos logit contributions
oke+0.362
ometer+0.335
keeper+0.334
ounded+0.332
 understands+0.331
Neg logit contributions
aster-0.154
 rest-0.145
 recess-0.137
e-0.137
 walk-0.134
Tokenab
Token ID397
Feature activation+5.11e-04

Pos logit contributions
aster+0.907
e+0.894
One+0.804
 rest+0.782
 spring+0.753
Neg logit contributions
ounded-0.913
ometer-0.900
keeper-0.851
oke-0.842
rier-0.837
Tokenra
Token ID430
Feature activation-3.93e+00

Pos logit contributions
ounded+0.000
ometer+0.000
oke+0.000
keeper+0.000
rier+0.000
Neg logit contributions
aster-0.000
e-0.000
One-0.000
 rest-0.000
 spring-0.000
Token"
Token ID1
Feature activation+2.68e+00

Pos logit contributions
aster+0.213
e+0.202
One+0.193
 rest+0.188
 spring+0.177
Neg logit contributions
ometer-0.218
ounded-0.215
oke-0.211
rier-0.201
keeper-0.201
Token and
Token ID290
Feature activation-2.26e+00

Pos logit contributions
ounded+0.158
ometer+0.157
oke+0.153
 understands+0.146
rier+0.146
Neg logit contributions
aster-0.135
e-0.130
One-0.123
 rest-0.117
 spring-0.112
Token "
Token ID366
Feature activation+1.03e+01

Pos logit contributions
aster+0.125
e+0.119
 rest+0.108
 spring+0.107
One+0.107
Neg logit contributions
ounded-0.131
ometer-0.127
oke-0.120
rier-0.117
keeper-0.117
Token and
Token ID290
Feature activation-1.95e+00

Pos logit contributions
ometer+0.165
ounded+0.164
oke+0.159
keeper+0.155
rier+0.155
Neg logit contributions
aster-0.104
e-0.099
One-0.091
 rest-0.087
 spring-0.081
Token say
Token ID910
Feature activation-7.44e-01

Pos logit contributions
aster+0.059
e+0.056
One+0.049
 rest+0.047
 spring+0.042
Neg logit contributions
ometer-0.155
ounded-0.155
oke-0.151
keeper-0.148
rier-0.147
Token "
Token ID366
Feature activation+1.03e+01

Pos logit contributions
aster+0.035
e+0.035
 spring+0.032
One+0.032
 rest+0.032
Neg logit contributions
ounded-0.046
ometer-0.044
oke-0.044
keeper-0.043
outed-0.041
Tokenab
Token ID397
Feature activation-5.50e-02

Pos logit contributions
ometer+0.593
ounded+0.582
keeper+0.556
rier+0.530
ener+0.525
Neg logit contributions
aster-0.495
 rest-0.422
wered-0.402
 recess-0.397
 end-0.389
Tokenrac
Token ID11510
Feature activation+4.79e+00

Pos logit contributions
aster+0.003
e+0.003
One+0.003
 rest+0.003
 spring+0.003
Neg logit contributions
ounded-0.003
ometer-0.003
oke-0.003
rier-0.002
keeper-0.002
Tokenad
Token ID324
Feature activation-1.64e+01

Pos logit contributions
oke+0.356
keeper+0.345
ounded+0.345
 understands+0.342
atted+0.340
Neg logit contributions
aster-0.120
 recess-0.105
 rest-0.102
e-0.102
 wait-0.100
Tokenab
Token ID397
Feature activation-7.66e-02

Pos logit contributions
aster+0.941
e+0.925
One+0.849
 rest+0.832
 spring+0.792
Neg logit contributions
ometer-0.860
ounded-0.859
oke-0.813
keeper-0.801
 puzzled-0.790
Tokenra
Token ID430
Feature activation-3.98e+00

Pos logit contributions
aster+0.004
e+0.004
One+0.004
 rest+0.004
 spring+0.003
Neg logit contributions
ometer-0.004
ounded-0.004
oke-0.004
keeper-0.004
rier-0.004
Token"
Token ID1
Feature activation+2.89e+00

Pos logit contributions
aster+0.207
e+0.206
One+0.186
 rest+0.183
 spring+0.173
Neg logit contributions
ounded-0.229
ometer-0.229
oke-0.220
keeper-0.215
rier-0.211
Token all
Token ID477
Feature activation-4.07e+00

Pos logit contributions
ometer+0.180
ounded+0.178
oke+0.172
keeper+0.169
rier+0.167
Neg logit contributions
aster-0.140
e-0.132
One-0.123
 rest-0.120
 spring-0.111
Token the
Token ID262
Feature activation+5.98e-01

Pos logit contributions
aster+0.155
e+0.146
One+0.137
 rest+0.128
 spring+0.116
Neg logit contributions
ounded-0.285
ometer-0.283
oke-0.282
 understands-0.270
keeper-0.268
Token big
Token ID1263
Feature activation-4.94e-01

Pos logit contributions
aster+0.143
One+0.139
 spring+0.137
 rest+0.137
e+0.133
Neg logit contributions
ounded-0.187
ometer-0.184
oke-0.183
atted-0.180
 understands-0.173
Token tree
Token ID5509
Feature activation-1.57e+00

Pos logit contributions
aster+0.022
e+0.021
One+0.019
 rest+0.019
 spring+0.019
Neg logit contributions
ounded-0.033
ometer-0.033
oke-0.032
rier-0.030
keeper-0.030
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
aster+0.072
e+0.066
One+0.060
 recess+0.060
 rest+0.058
Neg logit contributions
ometer-0.099
keeper-0.099
ounded-0.098
oke-0.096
 puzzled-0.094
Token
Token ID198
Feature activation-3.98e-01

Pos logit contributions
aster+0.057
e+0.056
One+0.055
 rest+0.053
 spring+0.051
Neg logit contributions
ounded-0.102
oke-0.099
ometer-0.095
rier-0.095
keeper-0.092
Token
Token ID198
Feature activation-3.88e-01

Pos logit contributions
aster+0.020
e+0.018
 rest+0.016
 recess+0.016
One+0.016
Neg logit contributions
ometer-0.024
ounded-0.024
oke-0.024
 understands-0.023
keeper-0.023
TokenBl
Token ID3629
Feature activation-1.64e+01

Pos logit contributions
aster+0.016
e+0.014
 recess+0.013
 rest+0.013
 play+0.011
Neg logit contributions
ounded-0.026
ometer-0.026
keeper-0.025
 understands-0.025
oke-0.024
Tokenossom
Token ID25548
Feature activation+9.39e+00

Pos logit contributions
aster+0.990
e+0.974
One+0.911
 rest+0.875
 spring+0.841
Neg logit contributions
ounded-0.799
ometer-0.799
oke-0.772
keeper-0.742
rier-0.741
Token tried
Token ID3088
Feature activation+1.75e-01

Pos logit contributions
ometer+0.600
ounded+0.590
oke+0.573
rier+0.568
 understands+0.537
Neg logit contributions
e-0.431
aster-0.427
One-0.397
 rest-0.389
 spring-0.382
Token to
Token ID284
Feature activation+4.12e+00

Pos logit contributions
ounded+0.010
ometer+0.010
oke+0.010
rier+0.009
keeper+0.009
Neg logit contributions
aster-0.009
e-0.009
One-0.009
 rest-0.009
 spring-0.008
Token climb
Token ID12080
Feature activation-1.14e+01

Pos logit contributions
ounded+0.254
 puzzled+0.242
keeper+0.238
 fist+0.233
 confused+0.230
Neg logit contributions
aster-0.159
P-0.156
 recess-0.152
e-0.147
One-0.143
Token up
Token ID510
Feature activation+3.82e+00

Pos logit contributions
aster+0.607
e+0.580
One+0.533
 rest+0.518
wered+0.479
Neg logit contributions
ometer-0.654
ounded-0.651
oke-0.614
keeper-0.611
rier-0.605
Token never
Token ID1239
Feature activation-1.18e+01

Pos logit contributions
e+0.046
aster+0.044
One+0.040
 rest+0.039
 spring+0.038
Neg logit contributions
ometer-0.052
ounded-0.051
rier-0.049
oke-0.049
keeper-0.046
Token pin
Token ID6757
Feature activation-1.99e+00

Pos logit contributions
aster+0.594
e+0.585
 rest+0.574
One+0.558
 play+0.556
Neg logit contributions
ometer-0.733
rier-0.691
ounded-0.658
oke-0.656
keeper-0.628
Tokenched
Token ID1740
Feature activation+1.01e-01

Pos logit contributions
aster+0.103
e+0.100
One+0.093
 rest+0.090
 spring+0.086
Neg logit contributions
ometer-0.115
ounded-0.114
oke-0.111
keeper-0.107
rier-0.107
Token the
Token ID262
Feature activation+5.80e-01

Pos logit contributions
ometer+0.007
ounded+0.007
oke+0.006
keeper+0.006
rier+0.006
Neg logit contributions
aster-0.004
e-0.004
One-0.004
 rest-0.004
 spring-0.004
Token to
Token ID284
Feature activation+3.95e+00

Pos logit contributions
ounded+0.033
ometer+0.032
oke+0.031
 understands+0.030
keeper+0.029
Neg logit contributions
aster-0.031
e-0.030
 rest-0.029
One-0.028
 spring-0.027
Tokenad
Token ID324
Feature activation-1.63e+01

Pos logit contributions
 ashamed+0.228
oke+0.226
rier+0.219
 puzzled+0.219
ometer+0.218
Neg logit contributions
One-0.140
P-0.139
e-0.138
aster-0.126
 recess-0.121
Token's
Token ID338
Feature activation+5.13e+00

Pos logit contributions
aster+0.870
e+0.839
One+0.783
 rest+0.762
 spring+0.722
Neg logit contributions
ometer-0.927
ounded-0.921
oke-0.892
keeper-0.861
rier-0.856
Token toe
Token ID21189
Feature activation-1.16e+00

Pos logit contributions
ounded+0.262
oke+0.253
ometer+0.251
 understands+0.228
rier+0.225
Neg logit contributions
aster-0.310
e-0.306
 rest-0.289
One-0.281
 spring-0.276
Token again
Token ID757
Feature activation+2.11e+00

Pos logit contributions
aster+0.054
e+0.053
One+0.048
 rest+0.047
 spring+0.044
Neg logit contributions
ounded-0.074
ometer-0.073
oke-0.071
 understands-0.068
keeper-0.068
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
ometer+0.146
ounded+0.146
oke+0.143
keeper+0.139
rier+0.138
Neg logit contributions
aster-0.084
e-0.079
One-0.072
 rest-0.069
 spring-0.064
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
aster+0.043
e+0.043
 rest+0.041
One+0.041
 spring+0.039
Neg logit contributions
ometer-0.069
oke-0.068
ounded-0.067
rier-0.064
keeper-0.061
Token frog
Token ID21264
Feature activation-1.02e+00

Pos logit contributions
ometer+0.100
ounded+0.100
oke+0.097
keeper+0.094
rier+0.093
Neg logit contributions
aster-0.082
e-0.079
One-0.074
 rest-0.071
 spring-0.067
Token!
Token ID0
Feature activation+3.20e-01

Pos logit contributions
aster+0.041
e+0.039
One+0.036
 rest+0.034
 spring+0.032
Neg logit contributions
ometer-0.071
ounded-0.070
oke-0.069
keeper-0.067
rier-0.066
Token
Token ID198
Feature activation-7.84e-01

Pos logit contributions
ounded+0.019
ometer+0.018
oke+0.018
rier+0.017
 understands+0.017
Neg logit contributions
aster-0.017
e-0.016
One-0.015
 rest-0.015
 spring-0.014
Token
Token ID198
Feature activation-6.72e-01

Pos logit contributions
aster+0.035
e+0.034
 rest+0.031
One+0.029
 recess+0.028
Neg logit contributions
ometer-0.049
oke-0.049
ounded-0.048
keeper-0.047
 understands-0.047
TokenT
Token ID51
Feature activation-5.75e+00

Pos logit contributions
aster+0.025
 rest+0.024
e+0.023
 recess+0.022
 end+0.021
Neg logit contributions
keeper-0.044
oke-0.043
ounded-0.043
ometer-0.042
 puzzled-0.042
Tokenad
Token ID324
Feature activation-1.63e+01

Pos logit contributions
aster+0.404
e+0.392
One+0.373
 rest+0.365
 spring+0.351
Neg logit contributions
ometer-0.232
ounded-0.229
oke-0.220
keeper-0.209
rier-0.207
Token smiled
Token ID13541
Feature activation+3.96e+00

Pos logit contributions
e+1.028
aster+0.970
One+0.916
 rest+0.873
 spring+0.862
Neg logit contributions
ometer-0.833
ounded-0.795
rier-0.764
keeper-0.721
oke-0.711
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
oke+0.322
ounded+0.313
ometer+0.312
 understands+0.288
ours+0.281
Neg logit contributions
e-0.110
 spring-0.103
One-0.102
 go-0.102
 end-0.099
Token carefully
Token ID7773
Feature activation-2.55e+00

Pos logit contributions
e+0.087
aster+0.084
 spring+0.083
 rest+0.082
 play+0.080
Neg logit contributions
ometer-0.110
oke-0.109
ounded-0.107
rier-0.104
keeper-0.097
Token picked
Token ID6497
Feature activation-2.87e-01

Pos logit contributions
aster+0.137
e+0.132
One+0.124
 rest+0.120
 spring+0.113
Neg logit contributions
ometer-0.144
ounded-0.143
oke-0.139
keeper-0.134
rier-0.133
Token up
Token ID510
Feature activation+3.92e+00

Pos logit contributions
e+0.013
aster+0.013
 rest+0.012
 spring+0.011
One+0.011
Neg logit contributions
ounded-0.019
ometer-0.018
oke-0.018
keeper-0.017
 puzzled-0.017
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
ounded+0.502
ometer+0.490
oke+0.468
keeper+0.463
 puzzled+0.458
Neg logit contributions
e-0.169
aster-0.167
One-0.143
 rest-0.139
 spring-0.127
Token "
Token ID366
Feature activation+1.04e+01

Pos logit contributions
aster+0.091
One+0.089
e+0.087
 rest+0.085
 spring+0.085
Neg logit contributions
ounded-0.054
ometer-0.052
oke-0.049
 understands-0.048
keeper-0.048
TokenA
Token ID32
Feature activation-5.26e+00

Pos logit contributions
ounded+0.620
keeper+0.609
ometer+0.591
oke+0.586
 understands+0.581
Neg logit contributions
aster-0.505
e-0.438
 rest-0.435
 recess-0.401
 arrive-0.396
Tokenbr
Token ID1671
Feature activation-7.61e+00

Pos logit contributions
aster+0.272
e+0.261
One+0.245
 rest+0.238
 spring+0.226
Neg logit contributions
ounded-0.309
ometer-0.308
oke-0.296
keeper-0.288
rier-0.287
Tokenac
Token ID330
Feature activation+7.50e+00

Pos logit contributions
e+0.450
aster+0.444
One+0.423
he+0.389
 end+0.383
Neg logit contributions
ounded-0.371
ometer-0.366
keeper-0.358
 puzzled-0.354
rier-0.343
Tokenad
Token ID324
Feature activation-1.63e+01

Pos logit contributions
oke+0.630
ometer+0.614
ounded+0.613
 understands+0.603
rier+0.601
Neg logit contributions
aster-0.181
 rest-0.152
One-0.150
e-0.137
 arrive-0.122
Tokenab
Token ID397
Feature activation+4.32e-02

Pos logit contributions
aster+0.913
e+0.901
One+0.814
 rest+0.798
 spring+0.763
Neg logit contributions
ometer-0.889
ounded-0.888
keeper-0.829
rier-0.822
oke-0.820
Tokenra
Token ID430
Feature activation-3.84e+00

Pos logit contributions
ometer+0.002
ounded+0.002
oke+0.002
keeper+0.002
rier+0.002
Neg logit contributions
aster-0.002
e-0.002
One-0.002
 rest-0.002
 spring-0.002
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
aster+0.186
e+0.175
One+0.167
 rest+0.161
 spring+0.151
Neg logit contributions
ometer-0.235
ounded-0.234
oke-0.230
keeper-0.219
rier-0.219
Token I
Token ID314
Feature activation+1.63e+00

Pos logit contributions
aster+0.069
e+0.066
One+0.062
 rest+0.059
 spring+0.056
Neg logit contributions
ometer-0.088
ounded-0.087
oke-0.085
keeper-0.082
rier-0.082
Token turn
Token ID1210
Feature activation-6.28e+00

Pos logit contributions
ounded+0.100
ometer+0.099
oke+0.097
 understands+0.095
 puzzled+0.092
Neg logit contributions
aster-0.080
e-0.070
One-0.069
wered-0.064
 rest-0.064
Token,
Token ID11
Feature activation-1.23e+00

Pos logit contributions
ounded+0.540
ometer+0.524
oke+0.514
keeper+0.502
ener+0.494
Neg logit contributions
e-0.126
aster-0.121
 go-0.108
 rest-0.107
 spring-0.104
Token "
Token ID366
Feature activation+1.04e+01

Pos logit contributions
aster+0.085
One+0.082
e+0.081
 rest+0.080
 spring+0.079
Neg logit contributions
ounded-0.046
ometer-0.046
oke-0.044
 understands-0.043
keeper-0.042
TokenA
Token ID32
Feature activation-5.24e+00

Pos logit contributions
ounded+0.499
keeper+0.493
ometer+0.488
ener+0.455
 understands+0.451
Neg logit contributions
aster-0.544
 recess-0.501
 rest-0.489
e-0.476
 parties-0.473
Tokenbr
Token ID1671
Feature activation-7.67e+00

Pos logit contributions
aster+0.254
e+0.245
One+0.229
 rest+0.223
 spring+0.211
Neg logit contributions
ometer-0.323
ounded-0.321
oke-0.312
keeper-0.302
rier-0.301
Tokenac
Token ID330
Feature activation+7.50e+00

Pos logit contributions
e+0.462
aster+0.452
One+0.436
 end+0.395
 time+0.392
Neg logit contributions
ounded-0.366
ometer-0.358
keeper-0.355
 puzzled-0.347
rier-0.345
Tokenad
Token ID324
Feature activation-1.63e+01

Pos logit contributions
ometer+0.615
ounded+0.614
oke+0.613
 understands+0.596
keeper+0.595
Neg logit contributions
aster-0.190
e-0.154
One-0.152
 rest-0.149
 arrive-0.124
Tokenab
Token ID397
Feature activation+2.42e-02

Pos logit contributions
aster+0.957
e+0.928
One+0.871
 rest+0.850
 spring+0.810
Neg logit contributions
ometer-0.837
ounded-0.832
oke-0.800
keeper-0.772
rier-0.767
Tokenra
Token ID430
Feature activation-3.92e+00

Pos logit contributions
ometer+0.001
ounded+0.001
oke+0.001
keeper+0.001
rier+0.001
Neg logit contributions
aster-0.001
e-0.001
One-0.001
 rest-0.001
 spring-0.001
Token!
Token ID0
Feature activation+9.30e-01

Pos logit contributions
aster+0.202
e+0.192
One+0.182
 rest+0.176
 spring+0.166
Neg logit contributions
ometer-0.229
ounded-0.227
oke-0.222
keeper-0.212
rier-0.211
Token I
Token ID314
Feature activation+1.67e+00

Pos logit contributions
ounded+0.041
ometer+0.040
oke+0.039
rier+0.038
 understands+0.037
Neg logit contributions
aster-0.061
e-0.060
One-0.057
 rest-0.055
 spring-0.053
Token want
Token ID765
Feature activation+2.95e+00

Pos logit contributions
ounded+0.096
ometer+0.096
oke+0.091
 understands+0.090
keeper+0.087
Neg logit contributions
aster-0.088
e-0.079
One-0.078
 recess-0.072
wered-0.072
Token,
Token ID11
Feature activation-1.18e+00

Pos logit contributions
ounded+0.501
ometer+0.496
oke+0.485
keeper+0.466
 puzzled+0.463
Neg logit contributions
e-0.172
aster-0.168
 rest-0.151
One-0.140
 spring-0.133
Token "
Token ID366
Feature activation+1.04e+01

Pos logit contributions
aster+0.084
e+0.081
One+0.081
 rest+0.079
 spring+0.078
Neg logit contributions
ounded-0.043
ometer-0.043
oke-0.040
 understands-0.039
keeper-0.038
TokenA
Token ID32
Feature activation-5.20e+00

Pos logit contributions
keeper+0.602
ounded+0.584
ometer+0.567
 understands+0.562
rier+0.553
Neg logit contributions
aster-0.483
e-0.433
 rest-0.425
 recess-0.424
 fall-0.400
Tokenbr
Token ID1671
Feature activation-7.66e+00

Pos logit contributions
aster+0.242
e+0.232
One+0.216
 rest+0.209
 spring+0.198
Neg logit contributions
ometer-0.333
ounded-0.332
oke-0.321
keeper-0.312
rier-0.311
Tokenac
Token ID330
Feature activation+7.46e+00

Pos logit contributions
e+0.457
aster+0.443
One+0.425
 end+0.385
he+0.384
Neg logit contributions
ounded-0.378
ometer-0.371
keeper-0.360
 puzzled-0.355
rier-0.345
Tokenad
Token ID324
Feature activation-1.63e+01

Pos logit contributions
oke+0.628
ometer+0.620
ounded+0.617
 understands+0.609
keeper+0.606
Neg logit contributions
aster-0.171
One-0.137
 rest-0.135
e-0.132
 arrive-0.115
Tokenab
Token ID397
Feature activation-3.40e-02

Pos logit contributions
aster+0.926
e+0.907
One+0.829
 rest+0.816
 spring+0.776
Neg logit contributions
ometer-0.872
ounded-0.871
oke-0.818
keeper-0.810
rier-0.802
Tokenra
Token ID430
Feature activation-3.95e+00

Pos logit contributions
aster+0.002
e+0.002
One+0.002
 rest+0.002
 spring+0.002
Neg logit contributions
ometer-0.002
ounded-0.002
oke-0.002
keeper-0.002
rier-0.002
Token!"
Token ID2474
Feature activation+6.45e+00

Pos logit contributions
aster+0.196
e+0.184
One+0.179
 rest+0.170
 spring+0.161
Neg logit contributions
ometer-0.235
ounded-0.234
oke-0.231
keeper-0.219
rier-0.218
Token Suddenly
Token ID24975
Feature activation+1.03e+00

Pos logit contributions
ometer+0.380
ounded+0.377
oke+0.366
keeper+0.355
rier+0.351
Neg logit contributions
aster-0.331
e-0.319
One-0.298
 rest-0.288
 spring-0.273
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
ometer+0.075
ounded+0.075
oke+0.073
rier+0.070
keeper+0.070
Neg logit contributions
aster-0.039
e-0.037
One-0.034
 rest-0.033
 spring-0.031
Token
Token ID198
Feature activation-3.21e-01

Pos logit contributions
ounded+0.279
ometer+0.272
rier+0.264
oke+0.261
keeper+0.259
Neg logit contributions
aster-0.127
e-0.118
One-0.110
 rest-0.104
 spring-0.094
TokenIt
Token ID1026
Feature activation-1.55e+00

Pos logit contributions
aster+0.016
e+0.015
 rest+0.014
wered+0.014
 end+0.014
Neg logit contributions
keeper-0.018
oke-0.018
ometer-0.018
 understands-0.017
ounded-0.017
Token was
Token ID373
Feature activation+1.32e+00

Pos logit contributions
e+0.064
aster+0.061
One+0.058
 spring+0.057
 rest+0.056
Neg logit contributions
ometer-0.106
ounded-0.106
rier-0.105
oke-0.104
keeper-0.097
Token his
Token ID465
Feature activation-1.01e+00

Pos logit contributions
ometer+0.091
ounded+0.090
oke+0.088
keeper+0.086
rier+0.085
Neg logit contributions
aster-0.054
e-0.052
One-0.047
 rest-0.046
 spring-0.042
Token grand
Token ID4490
Feature activation+6.41e+00

Pos logit contributions
aster+0.059
e+0.057
 rest+0.054
One+0.054
 spring+0.051
Neg logit contributions
ounded-0.053
ometer-0.051
oke-0.051
rier-0.048
 understands-0.047
Tokenad
Token ID324
Feature activation-1.62e+01

Pos logit contributions
ometer+0.385
ounded+0.383
oke+0.371
keeper+0.358
rier+0.357
Neg logit contributions
aster-0.323
e-0.311
One-0.289
 rest-0.281
 spring-0.264
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
aster+0.710
e+0.665
One+0.635
 rest+0.604
 spring+0.575
Neg logit contributions
ometer-1.058
ounded-1.055
oke-1.044
keeper-1.004
rier-0.992
Token He
Token ID679
Feature activation+2.28e+00

Pos logit contributions
aster+0.066
e+0.064
One+0.061
 rest+0.058
 spring+0.056
Neg logit contributions
ounded-0.065
ometer-0.064
oke-0.061
rier-0.060
 understands-0.059
Token said
Token ID531
Feature activation+6.46e+00

Pos logit contributions
ounded+0.134
ometer+0.133
rier+0.128
oke+0.127
keeper+0.119
Neg logit contributions
aster-0.120
e-0.118
One-0.109
 rest-0.105
 spring-0.102
Token to
Token ID284
Feature activation+4.09e+00

Pos logit contributions
ounded+0.530
ometer+0.508
oke+0.496
ishes+0.482
 understands+0.478
Neg logit contributions
aster-0.182
e-0.179
 rest-0.141
 go-0.139
One-0.137
Token Jamie
Token ID17826
Feature activation-4.42e-01

Pos logit contributions
ometer+0.196
keeper+0.196
oke+0.190
rier+0.183
 son+0.183
Neg logit contributions
e-0.208
One-0.208
aster-0.205
P-0.197
 rest-0.194
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
ounded+0.531
ometer+0.514
oke+0.502
keeper+0.489
ener+0.482
Neg logit contributions
e-0.144
aster-0.143
 rest-0.121
 go-0.120
One-0.117
Token "
Token ID366
Feature activation+1.04e+01

Pos logit contributions
aster+0.088
One+0.084
e+0.084
 rest+0.082
 spring+0.081
Neg logit contributions
ounded-0.047
ometer-0.046
oke-0.045
 understands-0.043
keeper-0.042
TokenA
Token ID32
Feature activation-5.18e+00

Pos logit contributions
keeper+0.558
ometer+0.556
ounded+0.549
rier+0.508
ener+0.506
Neg logit contributions
aster-0.507
 rest-0.462
 recess-0.453
e-0.447
 run-0.427
Tokenbr
Token ID1671
Feature activation-7.60e+00

Pos logit contributions
aster+0.237
e+0.230
One+0.215
 rest+0.209
 spring+0.200
Neg logit contributions
ounded-0.331
ometer-0.331
oke-0.321
keeper-0.313
rier-0.310
Tokenac
Token ID330
Feature activation+7.55e+00

Pos logit contributions
e+0.439
aster+0.428
One+0.411
 end+0.377
he+0.376
Neg logit contributions
ounded-0.378
keeper-0.367
ometer-0.366
 puzzled-0.362
rier-0.352
Tokenad
Token ID324
Feature activation-1.62e+01

Pos logit contributions
oke+0.624
ometer+0.624
ounded+0.619
 understands+0.605
keeper+0.604
Neg logit contributions
aster-0.182
One-0.150
 rest-0.145
e-0.145
 arrive-0.123
Tokenab
Token ID397
Feature activation+5.62e-02

Pos logit contributions
aster+0.936
e+0.910
One+0.847
 rest+0.828
 spring+0.789
Neg logit contributions
ometer-0.851
ounded-0.847
oke-0.810
keeper-0.788
rier-0.782
Tokenra
Token ID430
Feature activation-3.88e+00

Pos logit contributions
ometer+0.003
ounded+0.003
oke+0.003
keeper+0.003
rier+0.003
Neg logit contributions
aster-0.003
e-0.003
One-0.003
 rest-0.003
 spring-0.002
Token!"
Token ID2474
Feature activation+6.50e+00

Pos logit contributions
aster+0.193
e+0.183
One+0.176
 rest+0.168
 spring+0.159
Neg logit contributions
ometer-0.232
ounded-0.230
oke-0.225
keeper-0.215
rier-0.214
Token Suddenly
Token ID24975
Feature activation+1.04e+00

Pos logit contributions
ometer+0.402
ounded+0.398
oke+0.385
keeper+0.377
 understands+0.370
Neg logit contributions
aster-0.318
e-0.306
One-0.283
 rest-0.271
 spring-0.256
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
ounded+0.077
ometer+0.077
oke+0.075
rier+0.073
keeper+0.072
Neg logit contributions
aster-0.038
e-0.036
 rest-0.032
One-0.032
 spring-0.030
Token assignment
Token ID16237
Feature activation+3.24e+00

Pos logit contributions
aster+0.050
e+0.048
One+0.044
 rest+0.043
 spring+0.041
Neg logit contributions
ometer-0.064
ounded-0.063
oke-0.062
keeper-0.059
rier-0.059
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
ometer+0.214
ounded+0.213
oke+0.207
keeper+0.201
rier+0.200
Neg logit contributions
aster-0.141
e-0.135
One-0.125
 rest-0.121
 spring-0.113
Token
Token ID198
Feature activation-3.86e-01

Pos logit contributions
e+0.071
aster+0.070
One+0.069
 rest+0.065
 spring+0.063
Neg logit contributions
ounded-0.080
rier-0.079
ometer-0.075
keeper-0.074
oke-0.074
Token
Token ID198
Feature activation-3.64e-01

Pos logit contributions
aster+0.016
e+0.015
One+0.013
 rest+0.013
 recess+0.012
Neg logit contributions
ometer-0.026
ounded-0.026
oke-0.026
 understands-0.025
keeper-0.024
TokenCh
Token ID1925
Feature activation-1.07e+01

Pos logit contributions
aster+0.016
e+0.014
 rest+0.013
 recess+0.012
 walk+0.011
Neg logit contributions
ometer-0.024
ounded-0.024
oke-0.023
keeper-0.022
 understands-0.022
Tokenad
Token ID324
Feature activation-1.62e+01

Pos logit contributions
aster+0.419
One+0.377
 rest+0.375
e+0.368
 spring+0.338
Neg logit contributions
oke-0.781
ometer-0.745
ounded-0.739
atted-0.708
 understands-0.700
Token always
Token ID1464
Feature activation-9.46e+00

Pos logit contributions
e+0.833
aster+0.821
One+0.738
 rest+0.714
 end+0.683
Neg logit contributions
ometer-0.992
ounded-0.944
rier-0.916
oke-0.884
 puzzled-0.851
Token works
Token ID2499
Feature activation+2.03e+00

Pos logit contributions
e+0.460
aster+0.460
 rest+0.409
One+0.401
 play+0.394
Neg logit contributions
ometer-0.601
ounded-0.584
rier-0.582
oke-0.559
keeper-0.557
Token hard
Token ID1327
Feature activation-5.00e+00

Pos logit contributions
ometer+0.135
ounded+0.133
oke+0.129
keeper+0.125
 understands+0.123
Neg logit contributions
e-0.088
aster-0.087
One-0.080
 rest-0.075
 time-0.072
Token at
Token ID379
Feature activation+3.56e+00

Pos logit contributions
aster+0.225
e+0.217
One+0.201
 rest+0.192
 spring+0.179
Neg logit contributions
ometer-0.324
ounded-0.321
oke-0.316
rier-0.302
keeper-0.301
Token school
Token ID1524
Feature activation-6.40e+00

Pos logit contributions
ometer+0.216
oke+0.213
ounded+0.212
 understands+0.209
rier+0.204
Neg logit contributions
aster-0.146
e-0.145
One-0.139
P-0.126
 rest-0.121
Token say
Token ID910
Feature activation-6.16e-01

Pos logit contributions
aster+0.063
e+0.059
One+0.052
 rest+0.051
 spring+0.046
Neg logit contributions
ometer-0.154
ounded-0.154
oke-0.151
rier-0.146
keeper-0.146
Token "
Token ID366
Feature activation+1.06e+01

Pos logit contributions
aster+0.031
e+0.031
One+0.029
 rest+0.028
 spring+0.028
Neg logit contributions
ounded-0.037
ometer-0.036
oke-0.034
keeper-0.033
 puzzled-0.032
TokenA
Token ID32
Feature activation-4.93e+00

Pos logit contributions
ounded+0.631
ometer+0.627
keeper+0.618
oke+0.589
rier+0.584
Neg logit contributions
aster-0.497
 rest-0.421
e-0.417
wered-0.408
 spring-0.388
Tokenbr
Token ID1671
Feature activation-7.51e+00

Pos logit contributions
aster+0.217
e+0.208
One+0.194
 rest+0.187
 spring+0.179
Neg logit contributions
ounded-0.327
ometer-0.324
oke-0.315
rier-0.308
keeper-0.307
Tokenac
Token ID330
Feature activation+7.66e+00

Pos logit contributions
e+0.479
aster+0.476
One+0.456
 rest+0.412
he+0.410
Neg logit contributions
ounded-0.340
ometer-0.334
keeper-0.312
 puzzled-0.311
 understands-0.307
Tokenad
Token ID324
Feature activation-1.62e+01

Pos logit contributions
oke+0.626
ometer+0.619
ounded+0.617
 understands+0.607
keeper+0.606
Neg logit contributions
aster-0.184
 rest-0.157
One-0.149
e-0.137
 spring-0.130
Tokenab
Token ID397
Feature activation+1.88e-01

Pos logit contributions
aster+0.933
e+0.908
One+0.842
 rest+0.822
 spring+0.784
Neg logit contributions
ometer-0.848
ounded-0.846
oke-0.804
keeper-0.786
rier-0.778
Tokenra
Token ID430
Feature activation-3.66e+00

Pos logit contributions
ometer+0.011
ounded+0.011
oke+0.010
keeper+0.010
rier+0.010
Neg logit contributions
aster-0.010
e-0.010
One-0.009
 rest-0.009
 spring-0.008
Token!"
Token ID2474
Feature activation+6.53e+00

Pos logit contributions
aster+0.199
e+0.189
One+0.180
 rest+0.175
 spring+0.166
Neg logit contributions
ometer-0.202
ounded-0.200
oke-0.196
rier-0.187
keeper-0.187
Token and
Token ID290
Feature activation-1.77e+00

Pos logit contributions
ometer+0.356
ounded+0.348
oke+0.341
keeper+0.335
rier+0.326
Neg logit contributions
aster-0.359
e-0.349
One-0.328
 rest-0.319
 spring-0.302
Token everyone
Token ID2506
Feature activation-2.18e+00

Pos logit contributions
aster+0.074
e+0.071
 rest+0.062
One+0.061
 spring+0.060
Neg logit contributions
ometer-0.123
ounded-0.122
oke-0.119
rier-0.115
keeper-0.112
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
aster+0.001
e+0.001
One+0.001
 rest+0.001
 spring+0.001
Neg logit contributions
ometer-0.002
ounded-0.002
oke-0.002
keeper-0.002
rier-0.002
Token "
Token ID366
Feature activation+1.05e+01

Pos logit contributions
aster+0.072
One+0.067
 spring+0.066
e+0.064
 rest+0.062
Neg logit contributions
ounded-0.085
ometer-0.079
oke-0.078
rier-0.077
ener-0.077
TokenA
Token ID32
Feature activation-5.06e+00

Pos logit contributions
ounded+0.556
ometer+0.535
 understands+0.499
keeper+0.491
oke+0.482
Neg logit contributions
aster-0.502
 rest-0.484
e-0.478
 arrive-0.471
 recess-0.457
Tokenbr
Token ID1671
Feature activation-7.43e+00

Pos logit contributions
aster+0.229
e+0.219
One+0.201
 rest+0.194
 spring+0.180
Neg logit contributions
ometer-0.331
ounded-0.328
oke-0.320
keeper-0.310
rier-0.308
Tokenac
Token ID330
Feature activation+7.74e+00

Pos logit contributions
aster+0.483
e+0.479
One+0.455
 rest+0.417
 time+0.413
Neg logit contributions
ounded-0.328
ometer-0.326
keeper-0.316
rier-0.304
 puzzled-0.297
Tokenad
Token ID324
Feature activation-1.61e+01

Pos logit contributions
oke+0.612
 understands+0.593
atted+0.587
ounded+0.585
ometer+0.583
Neg logit contributions
aster-0.200
One-0.184
 rest-0.183
 jog-0.163
 arrive-0.162
Tokenab
Token ID397
Feature activation+1.93e-01

Pos logit contributions
aster+0.932
e+0.914
One+0.834
 rest+0.819
 spring+0.784
Neg logit contributions
ometer-0.854
ounded-0.851
keeper-0.798
oke-0.793
rier-0.790
Tokenra
Token ID430
Feature activation-3.64e+00

Pos logit contributions
ometer+0.011
ounded+0.011
oke+0.011
keeper+0.010
rier+0.010
Neg logit contributions
aster-0.010
e-0.010
One-0.009
 rest-0.009
 spring-0.009
Token!"
Token ID2474
Feature activation+6.44e+00

Pos logit contributions
aster+0.199
e+0.188
One+0.183
 rest+0.175
 spring+0.167
Neg logit contributions
ometer-0.198
ounded-0.197
oke-0.195
 understands-0.184
rier-0.183
Token
Token ID198
Feature activation-2.60e-01

Pos logit contributions
ometer+0.454
ounded+0.453
oke+0.436
rier+0.427
keeper+0.418
Neg logit contributions
aster-0.270
e-0.246
One-0.236
 rest-0.223
 spring-0.216
Token
Token ID198
Feature activation-2.55e-01

Pos logit contributions
aster+0.011
e+0.010
One+0.009
 rest+0.009
 spring+0.008
Neg logit contributions
ometer-0.018
ounded-0.018
oke-0.017
keeper-0.017
 understands-0.017
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
ometer+0.200
ounded+0.199
oke+0.194
keeper+0.188
rier+0.187
Neg logit contributions
aster-0.110
e-0.105
One-0.095
 rest-0.092
 spring-0.085
Token "
Token ID366
Feature activation+1.06e+01

Pos logit contributions
aster+0.043
One+0.043
Once+0.038
 spring+0.037
 rest+0.036
Neg logit contributions
ounded-0.117
ometer-0.111
ener-0.110
oke-0.108
rier-0.108
TokenA
Token ID32
Feature activation-5.08e+00

Pos logit contributions
ounded+0.484
 understands+0.480
ometer+0.467
keeper+0.459
 Isn+0.447
Neg logit contributions
 rest-0.490
aster-0.485
 parties-0.462
 recess-0.452
 end-0.441
Tokenbr
Token ID1671
Feature activation-7.45e+00

Pos logit contributions
aster+0.202
e+0.193
One+0.175
 rest+0.167
wered+0.154
Neg logit contributions
ometer-0.357
ounded-0.353
oke-0.348
keeper-0.335
rier-0.333
Tokenac
Token ID330
Feature activation+7.72e+00

Pos logit contributions
e+0.469
aster+0.466
One+0.442
 rest+0.404
 time+0.398
Neg logit contributions
ounded-0.341
ometer-0.337
keeper-0.328
rier-0.316
 puzzled-0.314
Tokenad
Token ID324
Feature activation-1.61e+01

Pos logit contributions
oke+0.608
atted+0.593
ometer+0.589
 understands+0.584
ounded+0.584
Neg logit contributions
aster-0.202
One-0.189
 rest-0.178
 arrive-0.153
 jog-0.152
Tokenab
Token ID397
Feature activation+2.04e-01

Pos logit contributions
aster+0.910
e+0.904
One+0.806
 rest+0.797
 spring+0.760
Neg logit contributions
ometer-0.875
ounded-0.875
keeper-0.821
rier-0.814
oke-0.806
Tokenra
Token ID430
Feature activation-3.61e+00

Pos logit contributions
ometer+0.012
ounded+0.011
oke+0.011
keeper+0.011
rier+0.011
Neg logit contributions
aster-0.011
e-0.010
One-0.010
 rest-0.010
 spring-0.009
Token!"
Token ID2474
Feature activation+6.43e+00

Pos logit contributions
aster+0.192
e+0.182
One+0.176
 rest+0.169
 spring+0.160
Neg logit contributions
ometer-0.203
ounded-0.202
oke-0.199
keeper-0.188
rier-0.188
Token
Token ID198
Feature activation-3.03e-01

Pos logit contributions
ometer+0.462
ounded+0.461
oke+0.445
rier+0.436
 understands+0.418
Neg logit contributions
aster-0.262
e-0.223
One-0.220
 rest-0.217
 spring-0.210
Token
Token ID198
Feature activation-2.92e-01

Pos logit contributions
aster+0.012
e+0.011
One+0.010
 rest+0.010
 spring+0.009
Neg logit contributions
ometer-0.022
ounded-0.022
oke-0.021
rier-0.021
keeper-0.020
Token the
Token ID262
Feature activation+9.44e-01

Pos logit contributions
aster+0.208
e+0.195
One+0.184
 rest+0.180
 recess+0.171
Neg logit contributions
ometer-0.300
rier-0.295
keeper-0.293
 puzzled-0.288
 understands-0.281
Token tree
Token ID5509
Feature activation-1.44e+00

Pos logit contributions
ometer+0.053
ounded+0.052
keeper+0.051
rier+0.051
oke+0.049
Neg logit contributions
aster-0.050
e-0.047
One-0.045
he-0.041
 recess-0.040
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
aster+0.059
e+0.055
One+0.051
 rest+0.049
 spring+0.045
Neg logit contributions
ometer-0.100
ounded-0.099
oke-0.097
keeper-0.094
rier-0.093
Token
Token ID198
Feature activation-1.31e-01

Pos logit contributions
aster+0.065
e+0.063
One+0.060
 rest+0.056
 spring+0.054
Neg logit contributions
ounded-0.089
ometer-0.087
oke-0.084
rier-0.084
 understands-0.081
Token
Token ID198
Feature activation-7.05e-02

Pos logit contributions
aster+0.006
e+0.006
 rest+0.005
 recess+0.005
 arrive+0.005
Neg logit contributions
oke-0.008
ometer-0.008
ounded-0.008
keeper-0.008
 understands-0.007
TokenBl
Token ID3629
Feature activation-1.61e+01

Pos logit contributions
aster+0.003
e+0.003
 rest+0.003
 recess+0.002
 arrive+0.002
Neg logit contributions
oke-0.005
keeper-0.004
ounded-0.004
ometer-0.004
 understands-0.004
Tokenossom
Token ID25548
Feature activation+9.79e+00

Pos logit contributions
aster+0.938
e+0.921
One+0.854
 rest+0.815
 spring+0.786
Neg logit contributions
ounded-0.827
ometer-0.820
oke-0.785
rier-0.772
keeper-0.769
Token touched
Token ID12615
Feature activation+1.54e+00

Pos logit contributions
ometer+0.595
ounded+0.592
oke+0.571
rier+0.553
keeper+0.546
Neg logit contributions
aster-0.480
e-0.473
One-0.432
 rest-0.423
 spring-0.406
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
ounded+0.117
ometer+0.117
oke+0.114
keeper+0.109
 understands+0.108
Neg logit contributions
aster-0.053
e-0.052
One-0.046
 rest-0.045
 spring-0.042
Token pet
Token ID4273
Feature activation-4.95e+00

Pos logit contributions
ounded+0.073
ometer+0.073
oke+0.072
rier+0.067
 understands+0.067
Neg logit contributions
e-0.054
aster-0.054
 rest-0.051
One-0.049
 spring-0.048
Tokenals
Token ID874
Feature activation+4.69e+00

Pos logit contributions
aster+0.360
e+0.351
One+0.333
 rest+0.328
 spring+0.317
Neg logit contributions
ounded-0.184
ometer-0.184
oke-0.175
keeper-0.163
rier-0.163
Token,
Token ID11
Feature activation-9.75e-01

Pos logit contributions
ounded+0.512
ometer+0.508
oke+0.491
keeper+0.472
 puzzled+0.472
Neg logit contributions
e-0.176
aster-0.164
One-0.153
 spring-0.148
 rest-0.147
Token "
Token ID366
Feature activation+1.06e+01

Pos logit contributions
aster+0.066
One+0.065
e+0.064
 spring+0.063
 rest+0.062
Neg logit contributions
ounded-0.038
ometer-0.038
oke-0.036
 understands-0.034
keeper-0.034
TokenA
Token ID32
Feature activation-5.00e+00

Pos logit contributions
ounded+0.557
keeper+0.533
 understands+0.513
ometer+0.508
oke+0.508
Neg logit contributions
aster-0.588
 rest-0.502
e-0.481
 recess-0.457
 fall-0.454
Tokenbr
Token ID1671
Feature activation-7.48e+00

Pos logit contributions
aster+0.264
e+0.255
One+0.237
 rest+0.231
 spring+0.218
Neg logit contributions
ometer-0.287
ounded-0.285
oke-0.277
keeper-0.267
rier-0.265
Tokenac
Token ID330
Feature activation+7.72e+00

Pos logit contributions
e+0.466
aster+0.456
One+0.442
he+0.397
 rest+0.395
Neg logit contributions
ounded-0.351
ometer-0.346
keeper-0.333
 puzzled-0.323
rier-0.323
Tokenad
Token ID324
Feature activation-1.61e+01

Pos logit contributions
oke+0.641
ounded+0.632
ometer+0.627
 understands+0.621
keeper+0.617
Neg logit contributions
aster-0.188
 rest-0.149
One-0.148
e-0.135
 arrive-0.117
Tokenab
Token ID397
Feature activation+1.64e-01

Pos logit contributions
aster+0.909
e+0.900
One+0.812
 rest+0.800
 spring+0.764
Neg logit contributions
ometer-0.871
ounded-0.867
keeper-0.811
oke-0.807
rier-0.805
Tokenra
Token ID430
Feature activation-3.76e+00

Pos logit contributions
ometer+0.009
ounded+0.009
oke+0.009
keeper+0.008
rier+0.008
Neg logit contributions
aster-0.009
e-0.008
One-0.008
 rest-0.008
 spring-0.007
Token!
Token ID0
Feature activation+1.06e+00

Pos logit contributions
aster+0.200
e+0.191
One+0.181
 rest+0.175
 spring+0.166
Neg logit contributions
ometer-0.213
ounded-0.212
oke-0.206
keeper-0.198
rier-0.197
Token I
Token ID314
Feature activation+1.91e+00

Pos logit contributions
ometer+0.046
ounded+0.045
oke+0.044
keeper+0.041
rier+0.041
Neg logit contributions
aster-0.071
e-0.069
One-0.066
 rest-0.064
 spring-0.062
Token'm
Token ID1101
Feature activation+1.23e+00

Pos logit contributions
ounded+0.118
ometer+0.115
oke+0.113
 understands+0.112
 puzzled+0.108
Neg logit contributions
aster-0.093
One-0.080
e-0.080
wered-0.075
 rest-0.073
Token snake
Token ID17522
Feature activation+1.99e+00

Pos logit contributions
ounded+0.041
ometer+0.041
oke+0.040
 understands+0.037
rier+0.036
Neg logit contributions
aster-0.043
e-0.041
One-0.040
 rest-0.040
 spring-0.037
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
ounded+0.159
ometer+0.155
oke+0.151
rier+0.147
 understands+0.145
Neg logit contributions
e-0.063
aster-0.061
One-0.054
 rest-0.052
 spring-0.048
Token
Token ID198
Feature activation-2.67e-01

Pos logit contributions
aster+0.065
e+0.061
One+0.060
 spring+0.052
 rest+0.052
Neg logit contributions
ounded-0.120
ometer-0.116
rier-0.112
oke-0.111
 understands-0.107
Token
Token ID198
Feature activation-1.97e-01

Pos logit contributions
aster+0.013
e+0.012
 rest+0.011
 arrive+0.010
One+0.010
Neg logit contributions
ometer-0.016
oke-0.016
ounded-0.016
keeper-0.016
 understands-0.015
Token"
Token ID1
Feature activation+3.07e+00

Pos logit contributions
 rest+0.007
aster+0.007
e+0.007
 arrive+0.006
 recess+0.006
Neg logit contributions
keeper-0.013
oke-0.013
ometer-0.013
ounded-0.013
 understands-0.012
TokenBl
Token ID3629
Feature activation-1.61e+01

Pos logit contributions
keeper+0.184
ounded+0.184
ometer+0.179
 puzzled+0.177
 understands+0.177
Neg logit contributions
aster-0.150
e-0.147
 rest-0.133
 recess-0.121
 arrive-0.120
Tokenub
Token ID549
Feature activation-2.67e+00

Pos logit contributions
aster+0.925
e+0.881
One+0.843
 rest+0.833
 spring+0.785
Neg logit contributions
ometer-0.861
ounded-0.844
oke-0.828
keeper-0.783
rier-0.777
Token!
Token ID0
Feature activation+1.12e+00

Pos logit contributions
aster+0.143
e+0.142
 rest+0.128
One+0.126
 spring+0.121
Neg logit contributions
ometer-0.148
ounded-0.147
keeper-0.137
oke-0.136
 puzzled-0.135
Token You
Token ID921
Feature activation+1.53e+00

Pos logit contributions
ounded+0.056
ometer+0.054
oke+0.052
rier+0.050
 puzzled+0.050
Neg logit contributions
aster-0.067
e-0.066
One-0.063
 rest-0.061
 spring-0.058
Token are
Token ID389
Feature activation-1.41e+00

Pos logit contributions
ounded+0.104
ometer+0.104
oke+0.102
 understands+0.098
keeper+0.098
Neg logit contributions
aster-0.064
e-0.059
One-0.055
 rest-0.052
 spring-0.049
Token very
Token ID845
Feature activation-3.31e+00

Pos logit contributions
aster+0.062
e+0.062
One+0.055
 rest+0.054
 spring+0.051
Neg logit contributions
ometer-0.095
ounded-0.094
oke-0.091
keeper-0.088
rier-0.087
Token them
Token ID606
Feature activation-2.27e+00

Pos logit contributions
aster+0.172
e+0.160
One+0.150
 rest+0.139
 spring+0.126
Neg logit contributions
ometer-0.333
ounded-0.329
oke-0.319
rier-0.317
keeper-0.316
Token on
Token ID319
Feature activation-4.67e+00

Pos logit contributions
aster+0.116
e+0.101
One+0.098
 rest+0.088
wered+0.086
Neg logit contributions
ounded-0.141
ometer-0.139
keeper-0.132
 puzzled-0.132
rier-0.130
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
aster+0.230
e+0.219
One+0.202
 rest+0.194
P+0.189
Neg logit contributions
ometer-0.265
keeper-0.260
rier-0.255
 puzzled-0.248
ounded-0.248
Token gave
Token ID2921
Feature activation+1.67e-01

Pos logit contributions
aster+0.075
e+0.072
 rest+0.066
One+0.064
 spring+0.063
Neg logit contributions
ometer-0.101
ounded-0.100
oke-0.100
rier-0.096
keeper-0.093
Token Grand
Token ID5675
Feature activation+9.31e-01

Pos logit contributions
ometer+0.012
ounded+0.012
keeper+0.012
oke+0.012
rier+0.012
Neg logit contributions
aster-0.006
e-0.005
One-0.005
 rest-0.004
wered-0.004
Tokenad
Token ID324
Feature activation-1.61e+01

Pos logit contributions
oke+0.059
ometer+0.058
ounded+0.058
keeper+0.056
 understands+0.055
Neg logit contributions
aster-0.043
e-0.040
One-0.038
 rest-0.037
 spring-0.035
Token a
Token ID257
Feature activation-3.04e+00

Pos logit contributions
aster+0.846
e+0.825
One+0.760
 rest+0.745
 spring+0.703
Neg logit contributions
ometer-0.923
ounded-0.922
oke-0.894
keeper-0.855
rier-0.852
Token playful
Token ID34264
Feature activation+5.33e+00

Pos logit contributions
aster+0.135
e+0.126
 rest+0.123
One+0.117
 spring+0.115
Neg logit contributions
oke-0.198
ounded-0.198
ometer-0.197
rier-0.193
keeper-0.186
Token punch
Token ID10862
Feature activation-2.27e+00

Pos logit contributions
ometer+0.331
ounded+0.327
oke+0.314
keeper+0.311
rier+0.306
Neg logit contributions
aster-0.252
e-0.241
One-0.228
 rest-0.208
wered-0.200
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
aster+0.094
e+0.089
One+0.084
 rest+0.078
wered+0.073
Neg logit contributions
ometer-0.157
ounded-0.153
oke-0.150
keeper-0.148
rier-0.146
Token He
Token ID679
Feature activation+2.22e+00

Pos logit contributions
aster+0.070
e+0.067
One+0.064
 rest+0.062
 spring+0.060
Neg logit contributions
ounded-0.065
ometer-0.065
oke-0.063
rier-0.061
keeper-0.060
Token seen
Token ID1775
Feature activation+9.24e+00

Pos logit contributions
aster+0.565
e+0.557
 rest+0.513
One+0.500
 spring+0.471
Neg logit contributions
ometer-0.702
ounded-0.656
rier-0.645
oke-0.641
keeper-0.624
Token before
Token ID878
Feature activation+5.27e-01

Pos logit contributions
ometer+0.540
ounded+0.538
oke+0.531
keeper+0.514
rier+0.514
Neg logit contributions
aster-0.458
e-0.434
One-0.412
 rest-0.392
 spring-0.373
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
ometer+0.035
ounded+0.034
oke+0.034
keeper+0.033
 puzzled+0.033
Neg logit contributions
aster-0.023
e-0.021
One-0.019
 rest-0.018
wered-0.018
Token But
Token ID887
Feature activation+1.43e+00

Pos logit contributions
aster+0.064
One+0.063
e+0.055
 play+0.055
 spring+0.054
Neg logit contributions
rier-0.095
ounded-0.088
keeper-0.084
oke-0.084
ometer-0.084
Token Grand
Token ID5675
Feature activation+9.72e-01

Pos logit contributions
ometer+0.104
ounded+0.103
oke+0.102
rier+0.101
keeper+0.098
Neg logit contributions
aster-0.057
e-0.054
One-0.049
 rest-0.047
 spring-0.046
Tokenad
Token ID324
Feature activation-1.61e+01

Pos logit contributions
oke+0.067
ounded+0.065
ometer+0.064
keeper+0.063
 understands+0.063
Neg logit contributions
aster-0.040
e-0.036
 rest-0.034
One-0.034
 spring-0.032
Token warned
Token ID7728
Feature activation+4.98e+00

Pos logit contributions
e+0.827
aster+0.789
One+0.748
 rest+0.691
 spring+0.672
Neg logit contributions
ometer-0.998
ounded-0.934
rier-0.934
oke-0.872
keeper-0.869
Token him
Token ID683
Feature activation-6.59e+00

Pos logit contributions
ometer+0.314
ounded+0.303
oke+0.301
keeper+0.298
 puzzled+0.293
Neg logit contributions
aster-0.209
e-0.197
One-0.195
P-0.182
he-0.178
Token that
Token ID326
Feature activation+2.67e+00

Pos logit contributions
aster+0.261
e+0.252
 rest+0.234
One+0.223
 spring+0.211
Neg logit contributions
ounded-0.468
oke-0.453
ometer-0.449
rier-0.432
 puzzled-0.427
Token he
Token ID339
Feature activation-2.17e-01

Pos logit contributions
ounded+0.170
ometer+0.169
oke+0.164
rier+0.159
keeper+0.158
Neg logit contributions
aster-0.126
e-0.120
One-0.112
 rest-0.110
 spring-0.105
Token must
Token ID1276
Feature activation+8.03e+00

Pos logit contributions
e+0.012
aster+0.011
 rest+0.011
One+0.011
 play+0.010
Neg logit contributions
rier-0.012
ounded-0.012
ometer-0.012
oke-0.011
keeper-0.011
Token his
Token ID465
Feature activation-4.97e-01

Pos logit contributions
aster+0.573
e+0.545
One+0.506
 rest+0.492
 spring+0.463
Neg logit contributions
ometer-0.645
ounded-0.641
oke-0.620
keeper-0.604
rier-0.599
Token way
Token ID835
Feature activation-5.04e-01

Pos logit contributions
aster+0.022
e+0.021
One+0.020
 rest+0.019
 spring+0.017
Neg logit contributions
ometer-0.033
ounded-0.032
oke-0.031
keeper-0.031
rier-0.030
Token!
Token ID0
Feature activation+1.14e+00

Pos logit contributions
aster+0.024
e+0.023
One+0.021
 rest+0.021
 spring+0.019
Neg logit contributions
ometer-0.031
ounded-0.031
oke-0.030
keeper-0.029
rier-0.029
Token<|endoftext|>
Token ID50256
Feature activation-4.77e-01

Pos logit contributions
ometer+0.083
ounded+0.083
rier+0.080
oke+0.080
keeper+0.077
Neg logit contributions
aster-0.043
e-0.041
One-0.038
 rest-0.037
 spring-0.033
TokenOnce
Token ID7454
Feature activation-9.97e+00

Pos logit contributions
aster+0.028
e+0.025
One+0.023
 rest+0.023
wered+0.023
Neg logit contributions
oke-0.025
ometer-0.024
ounded-0.024
keeper-0.023
 understands-0.023
Token upon
Token ID2402
Feature activation+1.17e+01

Pos logit contributions
aster+0.580
One+0.563
 rest+0.562
 time+0.552
e+0.542
Neg logit contributions
rier-0.495
ometer-0.490
ounded-0.478
keeper-0.449
ener-0.432
Token a
Token ID257
Feature activation-3.10e+00

Pos logit contributions
 puzzled+0.543
 understands+0.543
 ashamed+0.540
 teeth+0.535
ounded+0.526
Neg logit contributions
aster-0.588
 recess-0.561
e-0.526
P-0.492
 race-0.488
Token time
Token ID640
Feature activation-9.96e+00

Pos logit contributions
aster+0.099
 recess+0.090
e+0.086
wered+0.086
 arrive+0.078
Neg logit contributions
 understands-0.215
oke-0.212
 puzzled-0.212
ounded-0.210
ometer-0.206
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
aster+0.435
e+0.410
One+0.375
 rest+0.362
 spring+0.336
Neg logit contributions
ometer-0.664
ounded-0.663
oke-0.651
keeper-0.624
 understands-0.617
Token there
Token ID612
Feature activation+4.06e+00

Pos logit contributions
aster+0.059
e+0.056
One+0.052
 rest+0.051
 spring+0.048
Neg logit contributions
ometer-0.062
ounded-0.062
oke-0.060
keeper-0.058
 puzzled-0.057
Token was
Token ID373
Feature activation+1.51e+00

Pos logit contributions
ometer+0.196
oke+0.191
ounded+0.191
 understands+0.186
keeper+0.181
Neg logit contributions
aster-0.248
e-0.228
One-0.221
 rest-0.210
 recess-0.208
Token person
Token ID1048
Feature activation+5.65e-01

Pos logit contributions
aster+0.157
e+0.149
 rest+0.148
One+0.146
 end+0.142
Neg logit contributions
ounded-0.124
ometer-0.120
oke-0.120
keeper-0.107
 understands-0.107
Token smiled
Token ID13541
Feature activation+4.46e+00

Pos logit contributions
ometer+0.030
ounded+0.030
rier+0.029
oke+0.028
 understands+0.026
Neg logit contributions
e-0.032
aster-0.032
One-0.029
 rest-0.029
 spring-0.028
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
oke+0.376
ometer+0.370
ounded+0.364
 understands+0.337
ours+0.333
Neg logit contributions
One-0.099
 spring-0.097
e-0.096
 end-0.094
 go-0.093
Token said
Token ID531
Feature activation+6.60e+00

Pos logit contributions
aster+0.073
e+0.072
 rest+0.067
 spring+0.067
 play+0.065
Neg logit contributions
ometer-0.073
ounded-0.071
oke-0.069
rier-0.069
 understands-0.063
Token,
Token ID11
Feature activation-7.30e-01

Pos logit contributions
ounded+0.545
ometer+0.529
oke+0.515
 puzzled+0.506
 understands+0.502
Neg logit contributions
e-0.171
aster-0.163
 rest-0.136
One-0.129
 spring-0.124
Token "
Token ID366
Feature activation+1.08e+01

Pos logit contributions
aster+0.050
One+0.048
e+0.048
 spring+0.046
 rest+0.046
Neg logit contributions
ounded-0.028
ometer-0.027
oke-0.026
 understands-0.025
keeper-0.024
TokenThat
Token ID2504
Feature activation-7.04e-01

Pos logit contributions
ounded+0.630
ometer+0.616
keeper+0.615
oke+0.606
 understands+0.604
Neg logit contributions
aster-0.547
e-0.515
 rest-0.477
 recess-0.440
 arrive-0.438
Token's
Token ID338
Feature activation+5.44e+00

Pos logit contributions
e+0.035
aster+0.035
 rest+0.035
 spring+0.033
One+0.032
Neg logit contributions
ounded-0.044
oke-0.043
rier-0.040
ometer-0.040
keeper-0.039
Token great
Token ID1049
Feature activation-3.70e-01

Pos logit contributions
ometer+0.350
ounded+0.348
oke+0.335
rier+0.327
keeper+0.326
Neg logit contributions
aster-0.251
e-0.233
One-0.214
 rest-0.208
 recess-0.195
Token!
Token ID0
Feature activation+1.44e+00

Pos logit contributions
aster+0.014
e+0.013
 rest+0.012
One+0.012
 spring+0.011
Neg logit contributions
ounded-0.027
ometer-0.027
oke-0.026
keeper-0.025
 puzzled-0.025
Token I
Token ID314
Feature activation+2.17e+00

Pos logit contributions
ounded+0.073
rier+0.069
ometer+0.069
 puzzled+0.066
oke+0.065
Neg logit contributions
e-0.086
aster-0.085
One-0.080
 rest-0.076
 spring-0.074
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
ometer+0.077
ounded+0.077
oke+0.075
keeper+0.073
rier+0.073
Neg logit contributions
aster-0.039
e-0.036
One-0.033
 rest-0.032
 spring-0.029
Token
Token ID220
Feature activation+4.18e+00

Pos logit contributions
aster+0.052
One+0.049
Once+0.049
 play+0.048
't+0.047
Neg logit contributions
rier-0.096
ishes-0.090
ometer-0.090
ounded-0.088
ener-0.088
Token
Token ID198
Feature activation-1.80e-01

Pos logit contributions
ounded+0.301
ometer+0.300
oke+0.291
rier+0.287
keeper+0.285
Neg logit contributions
aster-0.157
e-0.149
One-0.138
 rest-0.131
 spring-0.120
Token
Token ID198
Feature activation-1.67e-01

Pos logit contributions
aster+0.008
e+0.008
 rest+0.007
One+0.007
 spring+0.006
Neg logit contributions
ounded-0.012
ometer-0.012
oke-0.012
keeper-0.011
 understands-0.011
TokenThe
Token ID464
Feature activation-2.55e+00

Pos logit contributions
aster+0.007
e+0.006
One+0.006
 rest+0.006
 spring+0.005
Neg logit contributions
ounded-0.012
ometer-0.012
oke-0.011
keeper-0.011
 understands-0.011
Token old
Token ID1468
Feature activation+1.00e+01

Pos logit contributions
aster+0.182
e+0.171
 rest+0.169
One+0.165
 end+0.161
Neg logit contributions
ometer-0.112
ounded-0.108
oke-0.101
rier-0.095
keeper-0.092
Token man
Token ID582
Feature activation+3.99e+00

Pos logit contributions
ounded+0.593
ometer+0.589
oke+0.541
rier+0.538
azed+0.510
Neg logit contributions
aster-0.578
e-0.506
 rest-0.503
One-0.486
 read-0.485
Token his
Token ID465
Feature activation-3.30e-01

Pos logit contributions
ometer+0.236
ounded+0.228
rier+0.221
oke+0.216
 understands+0.206
Neg logit contributions
e-0.211
aster-0.207
 rest-0.185
One-0.184
 spring-0.175
Tokensed
Token ID36622
Feature activation+8.17e+00

Pos logit contributions
aster+0.020
e+0.020
 rest+0.019
 spring+0.018
One+0.017
Neg logit contributions
ounded-0.017
ometer-0.016
oke-0.016
rier-0.016
keeper-0.015
Token,
Token ID11
Feature activation-9.10e-01

Pos logit contributions
ometer+0.550
ounded+0.542
oke+0.528
keeper+0.512
rier+0.509
Neg logit contributions
aster-0.355
e-0.353
One-0.314
 rest-0.308
 spring-0.291
Token �
Token ID6184
Feature activation+1.49e-01

Pos logit contributions
aster+0.048
e+0.046
One+0.043
 rest+0.043
 spring+0.043
Neg logit contributions
ometer-0.055
ounded-0.053
rier-0.051
oke-0.050
keeper-0.048
Token belt
Token ID10999
Feature activation-8.46e-01

Pos logit contributions
ounded+0.048
ometer+0.048
rier+0.045
oke+0.042
atted+0.041
Neg logit contributions
aster-0.074
 rest-0.071
 spring-0.068
One-0.068
 time-0.067
Token he
Token ID339
Feature activation+7.45e-03

Pos logit contributions
aster+0.032
e+0.031
 rest+0.030
One+0.028
 play+0.027
Neg logit contributions
oke-0.059
ometer-0.059
ounded-0.058
 understands-0.056
rier-0.055
Token found
Token ID1043
Feature activation+4.32e+00

Pos logit contributions
rier+0.000
ometer+0.000
oke+0.000
ounded+0.000
ours+0.000
Neg logit contributions
e-0.000
 play-0.000
 rest-0.000
 read-0.000
aster-0.000
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
ounded+0.352
ometer+0.352
oke+0.342
 puzzled+0.323
rier+0.322
Neg logit contributions
e-0.132
aster-0.125
 rest-0.110
One-0.099
 spring-0.095
Token The
Token ID383
Feature activation+4.04e+00

Pos logit contributions
aster+0.047
One+0.044
e+0.042
 rest+0.040
 play+0.040
Neg logit contributions
ounded-0.092
rier-0.092
ener-0.089
ometer-0.088
ience-0.085
Token old
Token ID1468
Feature activation+1.02e+01

Pos logit contributions
ometer+0.146
ounded+0.144
oke+0.131
rier+0.127
keeper+0.117
Neg logit contributions
aster-0.314
 rest-0.297
e-0.291
One-0.288
 end-0.287
Token man
Token ID582
Feature activation+4.23e+00

Pos logit contributions
ounded+0.543
ometer+0.528
rier+0.512
oke+0.510
azed+0.474
Neg logit contributions
aster-0.624
e-0.558
 rest-0.557
One-0.549
 spring-0.534
Token was
Token ID373
Feature activation+1.84e+00

Pos logit contributions
ometer+0.257
rier+0.250
ounded+0.247
oke+0.233
 understands+0.218
Neg logit contributions
e-0.220
aster-0.203
 rest-0.191
 read-0.187
 play-0.185
Token very
Token ID845
Feature activation-3.00e+00

Pos logit contributions
ometer+0.111
ounded+0.107
oke+0.104
rier+0.097
 understands+0.097
Neg logit contributions
e-0.098
aster-0.095
One-0.088
 rest-0.085
 spring-0.083
Token happy
Token ID3772
Feature activation+5.54e+00

Pos logit contributions
e+0.122
 play+0.117
aster+0.112
 rest+0.111
One+0.111
Neg logit contributions
ometer-0.206
ounded-0.189
ishes-0.186
oke-0.183
ener-0.181
Token and
Token ID290
Feature activation-1.65e+00

Pos logit contributions
ounded+0.412
ometer+0.409
oke+0.394
rier+0.382
 understands+0.377
Neg logit contributions
aster-0.202
e-0.199
 rest-0.176
One-0.175
 spring-0.166
Token blocks
Token ID7021
Feature activation-6.85e-01

Pos logit contributions
 play+0.017
e+0.016
aster+0.016
 rest+0.016
 spring+0.016
Neg logit contributions
ounded-0.028
oke-0.026
ometer-0.026
rier-0.025
keeper-0.025
Token '
Token ID705
Feature activation+9.89e+00

Pos logit contributions
aster+0.032
e+0.030
One+0.028
 rest+0.027
 spring+0.025
Neg logit contributions
ometer-0.043
ounded-0.043
oke-0.042
keeper-0.041
rier-0.040
TokenAdventure
Token ID48289
Feature activation-1.78e+00

Pos logit contributions
ometer+0.572
ounded+0.567
oke+0.566
keeper+0.550
run+0.498
Neg logit contributions
aster-0.458
 rest-0.430
wered-0.410
e-0.408
 end-0.406
Token'
Token ID6
Feature activation+2.85e+00

Pos logit contributions
aster+0.101
e+0.097
One+0.091
 rest+0.089
 spring+0.084
Neg logit contributions
ometer-0.095
ounded-0.093
oke-0.091
keeper-0.088
rier-0.087
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
ounded+0.185
ometer+0.184
oke+0.179
rier+0.173
keeper+0.173
Neg logit contributions
aster-0.127
e-0.123
One-0.114
 rest-0.110
 spring-0.103
Token '
Token ID705
Feature activation+9.88e+00

Pos logit contributions
aster+0.057
e+0.053
 rest+0.048
One+0.047
 play+0.045
Neg logit contributions
ometer-0.117
ounded-0.117
oke-0.115
rier-0.112
keeper-0.109
TokenAmazing
Token ID42770
Feature activation-5.24e+00

Pos logit contributions
ometer+0.587
ounded+0.583
oke+0.572
keeper+0.561
 puzzled+0.518
Neg logit contributions
aster-0.454
 rest-0.426
e-0.416
wered-0.397
 end-0.395
Token'
Token ID6
Feature activation+2.83e+00

Pos logit contributions
aster+0.294
e+0.282
One+0.265
 rest+0.258
 spring+0.246
Neg logit contributions
ounded-0.283
ometer-0.282
oke-0.270
keeper-0.261
 understands-0.261
Token instead
Token ID2427
Feature activation+5.53e+00

Pos logit contributions
ometer+0.188
ounded+0.187
oke+0.182
rier+0.176
 understands+0.175
Neg logit contributions
aster-0.122
e-0.119
One-0.109
 rest-0.106
 spring-0.099
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
ounded+0.364
ometer+0.364
oke+0.351
keeper+0.341
rier+0.340
Neg logit contributions
aster-0.244
e-0.234
One-0.215
 rest-0.208
 spring-0.194
Token That
Token ID1320
Feature activation+6.74e+00

Pos logit contributions
aster+0.078
e+0.077
One+0.074
 rest+0.070
 spring+0.067
Neg logit contributions
ounded-0.060
ometer-0.059
oke-0.056
rier-0.054
 understands-0.054
Token Mia
Token ID32189
Feature activation-2.44e+00

Pos logit contributions
aster+0.071
e+0.068
One+0.063
 rest+0.060
 arrive+0.056
Neg logit contributions
ometer-0.082
ounded-0.081
oke-0.077
 understands-0.076
keeper-0.076
Token saw
Token ID2497
Feature activation+8.24e+00

Pos logit contributions
aster+0.136
e+0.129
One+0.123
 rest+0.117
wered+0.113
Neg logit contributions
ometer-0.130
ounded-0.130
oke-0.126
 understands-0.122
keeper-0.122
Token a
Token ID257
Feature activation-3.36e+00

Pos logit contributions
ometer+0.617
ounded+0.606
 understands+0.596
oke+0.592
 puzzled+0.587
Neg logit contributions
aster-0.268
e-0.261
One-0.231
 rest-0.207
 arrive-0.193
Token ghost
Token ID10905
Feature activation-1.32e+00

Pos logit contributions
aster+0.159
 rest+0.149
 spring+0.145
One+0.143
e+0.143
Neg logit contributions
ounded-0.211
oke-0.204
ometer-0.203
rier-0.194
atted-0.194
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
aster+0.063
One+0.056
e+0.053
wered+0.051
 recess+0.049
Neg logit contributions
ometer-0.081
ounded-0.079
oke-0.078
 puzzled-0.076
keeper-0.076
Token The
Token ID383
Feature activation+3.48e+00

Pos logit contributions
aster+0.079
e+0.078
 spring+0.071
One+0.070
 play+0.068
Neg logit contributions
ounded-0.121
rier-0.115
oke-0.114
ometer-0.114
keeper-0.109
Token ghost
Token ID10905
Feature activation-1.33e+00

Pos logit contributions
ometer+0.158
ounded+0.154
 understands+0.145
keeper+0.143
rier+0.142
Neg logit contributions
aster-0.233
e-0.221
One-0.202
 rest-0.193
wered-0.186
Token was
Token ID373
Feature activation+1.32e+00

Pos logit contributions
e+0.058
aster+0.051
 rest+0.049
 spring+0.049
One+0.049
Neg logit contributions
rier-0.091
ounded-0.091
ometer-0.090
oke-0.086
keeper-0.084
Token very
Token ID845
Feature activation-3.57e+00

Pos logit contributions
ounded+0.078
ometer+0.078
oke+0.076
keeper+0.073
rier+0.072
Neg logit contributions
aster-0.068
e-0.066
One-0.061
 rest-0.060
 spring-0.058
Token friendly
Token ID8030
Feature activation+6.25e+00

Pos logit contributions
aster+0.168
e+0.167
 spring+0.162
One+0.162
 rest+0.159
Neg logit contributions
ounded-0.213
oke-0.210
ometer-0.210
rier-0.199
keeper-0.198
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
ounded+0.485
ometer+0.480
oke+0.469
rier+0.456
keeper+0.454
Neg logit contributions
aster-0.206
e-0.202
One-0.178
 rest-0.172
 spring-0.160
TokenTom
Token ID13787
Feature activation-1.00e+01

Pos logit contributions
aster+0.024
 wait+0.021
e+0.021
wered+0.020
 rest+0.020
Neg logit contributions
oke-0.014
 understands-0.013
ometer-0.013
keeper-0.011
ounded-0.011
Token is
Token ID318
Feature activation+3.75e+00

Pos logit contributions
aster+0.569
e+0.543
One+0.517
 rest+0.500
 spring+0.476
Neg logit contributions
ometer-0.525
ounded-0.518
oke-0.513
 understands-0.494
keeper-0.485
Token a
Token ID257
Feature activation-2.90e+00

Pos logit contributions
 puzzled+0.212
ounded+0.206
ometer+0.206
oke+0.202
 ashamed+0.198
Neg logit contributions
aster-0.187
e-0.181
One-0.172
 rest-0.162
 arrive-0.152
Token cat
Token ID3797
Feature activation+1.30e+00

Pos logit contributions
aster+0.179
e+0.174
One+0.164
 rest+0.160
 spring+0.152
Neg logit contributions
ometer-0.140
ounded-0.138
oke-0.133
keeper-0.128
rier-0.127
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
oke+0.076
ometer+0.076
keeper+0.075
 understands+0.070
 puzzled+0.069
Neg logit contributions
aster-0.061
wered-0.054
e-0.053
 recess-0.053
 rest-0.053
Token He
Token ID679
Feature activation+2.39e+00

Pos logit contributions
aster+0.076
e+0.075
 rest+0.073
One+0.071
 end+0.070
Neg logit contributions
ometer-0.043
oke-0.042
 understands-0.038
keeper-0.037
ounded-0.037
Token likes
Token ID7832
Feature activation+1.06e+01

Pos logit contributions
 understands+0.068
ometer+0.064
oke+0.063
ounded+0.056
 puzzled+0.056
Neg logit contributions
aster-0.190
e-0.176
One-0.175
 recess-0.169
wered-0.169
Token to
Token ID284
Feature activation+4.34e+00

Pos logit contributions
ometer+0.509
ounded+0.496
oke+0.482
keeper+0.466
rier+0.457
Neg logit contributions
aster-0.657
e-0.641
One-0.603
 rest-0.585
 spring-0.554
Token play
Token ID711
Feature activation-7.21e+00

Pos logit contributions
 teeth+0.265
 claws+0.246
ometer+0.246
 mouth+0.230
 eyes+0.228
Neg logit contributions
he-0.144
Once-0.140
One-0.132
pped-0.130
Now-0.128
Token with
Token ID351
Feature activation-3.43e+00

Pos logit contributions
aster+0.366
e+0.311
 arrive+0.299
One+0.294
 rest+0.290
Neg logit contributions
rier-0.394
ometer-0.383
ounded-0.366
keeper-0.361
 puzzled-0.360
Token his
Token ID465
Feature activation-3.68e-01

Pos logit contributions
aster+0.158
e+0.149
One+0.143
 rest+0.136
 arrive+0.129
Neg logit contributions
ometer-0.213
oke-0.203
ounded-0.202
keeper-0.198
 understands-0.198
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
ometer+0.448
ounded+0.437
oke+0.432
keeper+0.425
rier+0.421
Neg logit contributions
aster-0.243
e-0.230
One-0.209
 rest-0.199
wered-0.181
Token She
Token ID1375
Feature activation-2.51e+00

Pos logit contributions
aster+0.035
One+0.035
 spring+0.035
 play+0.032
e+0.030
Neg logit contributions
rier-0.104
oke-0.099
ometer-0.097
ounded-0.096
ener-0.093
Token quickly
Token ID2952
Feature activation-4.33e+00

Pos logit contributions
aster+0.105
e+0.103
 read+0.100
One+0.099
 rest+0.097
Neg logit contributions
rier-0.180
ometer-0.171
ounded-0.160
oke-0.159
osity-0.145
Token recognised
Token ID20915
Feature activation+6.78e+00

Pos logit contributions
aster+0.223
e+0.216
One+0.202
 rest+0.194
 spring+0.182
Neg logit contributions
ounded-0.253
ometer-0.251
oke-0.245
keeper-0.238
 understands-0.234
Token the
Token ID262
Feature activation+7.77e-01

Pos logit contributions
ometer+0.480
ounded+0.478
oke+0.466
keeper+0.453
rier+0.451
Neg logit contributions
aster-0.266
e-0.253
One-0.232
 rest-0.222
 spring-0.205
Token bottle
Token ID9294
Feature activation+2.76e+00

Pos logit contributions
ounded+0.044
oke+0.043
ometer+0.043
rier+0.040
keeper+0.040
Neg logit contributions
 rest-0.040
aster-0.039
e-0.038
 spring-0.038
 end-0.037
Token of
Token ID286
Feature activation-1.98e+00

Pos logit contributions
ometer+0.186
ounded+0.185
oke+0.181
rier+0.174
keeper+0.174
Neg logit contributions
aster-0.118
e-0.113
One-0.103
 rest-0.101
 spring-0.095
Token poison
Token ID8764
Feature activation+3.51e+00

Pos logit contributions
aster+0.104
e+0.102
 rest+0.096
 spring+0.093
One+0.089
Neg logit contributions
ounded-0.120
ometer-0.117
oke-0.113
keeper-0.110
rier-0.110
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
ounded+0.245
ometer+0.240
oke+0.235
keeper+0.225
rier+0.224
Neg logit contributions
aster-0.142
e-0.140
 rest-0.124
One-0.122
 spring-0.119
Token
Token ID198
Feature activation-3.32e-01

Pos logit contributions
aster+0.062
e+0.058
One+0.058
 spring+0.055
 rest+0.054
Neg logit contributions
rier-0.081
ounded-0.080
ometer-0.079
oke-0.078
keeper-0.075
Token
Token ID198
Feature activation-3.19e-01

Pos logit contributions
aster+0.017
e+0.016
 rest+0.014
 recess+0.014
One+0.014
Neg logit contributions
ometer-0.019
oke-0.019
ounded-0.019
 understands-0.018
keeper-0.018
Token it
Token ID340
Feature activation+3.65e+00

Pos logit contributions
ometer+0.029
ounded+0.029
oke+0.029
rier+0.028
keeper+0.028
Neg logit contributions
aster-0.012
e-0.012
One-0.010
 rest-0.010
 spring-0.009
Token to
Token ID284
Feature activation+4.33e+00

Pos logit contributions
oke+0.217
ounded+0.217
ometer+0.216
rier+0.203
 understands+0.201
Neg logit contributions
aster-0.185
e-0.180
 rest-0.166
One-0.165
 spring-0.155
Token the
Token ID262
Feature activation+1.04e+00

Pos logit contributions
ometer+0.228
ounded+0.227
keeper+0.224
rier+0.222
 understands+0.219
Neg logit contributions
aster-0.206
e-0.203
One-0.200
he-0.184
P-0.181
Token poor
Token ID3595
Feature activation+1.12e+01

Pos logit contributions
ometer+0.057
ounded+0.056
oke+0.055
keeper+0.053
rier+0.052
Neg logit contributions
aster-0.058
e-0.056
One-0.052
 rest-0.051
 spring-0.048
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
ounded+0.612
ometer+0.589
oke+0.574
rier+0.545
 understands+0.532
Neg logit contributions
aster-0.626
e-0.608
 rest-0.598
One-0.576
 spring-0.572
Token needy
Token ID37278
Feature activation+7.42e+00

Pos logit contributions
 play+0.062
aster+0.061
 rest+0.060
 go+0.058
e+0.056
Neg logit contributions
oke-0.119
ometer-0.118
rier-0.115
ounded-0.114
azed-0.107
Token people
Token ID661
Feature activation-5.86e-01

Pos logit contributions
ometer+0.497
ounded+0.494
oke+0.486
keeper+0.475
rier+0.471
Neg logit contributions
aster-0.315
e-0.293
One-0.275
 rest-0.261
 spring-0.246
Token".
Token ID1911
Feature activation+6.93e+00

Pos logit contributions
e+0.024
aster+0.023
 rest+0.021
One+0.020
 play+0.019
Neg logit contributions
ometer-0.041
ounded-0.041
oke-0.039
rier-0.039
keeper-0.038
Token
Token ID198
Feature activation-2.05e-01

Pos logit contributions
ometer+0.464
ounded+0.463
oke+0.448
rier+0.437
keeper+0.432
Neg logit contributions
aster-0.300
e-0.290
One-0.268
 rest-0.257
 spring-0.239
Token
Token ID198
Feature activation-1.90e-01

Pos logit contributions
aster+0.010
e+0.009
 rest+0.008
One+0.008
 spring+0.008
Neg logit contributions
oke-0.012
ometer-0.012
ounded-0.012
 understands-0.012
keeper-0.012
TokenAnna
Token ID31160
Feature activation-8.08e+00

Pos logit contributions
aster+0.008
e+0.007
 rest+0.007
 recess+0.007
 spring+0.007
Neg logit contributions
 understands-0.012
oke-0.012
ounded-0.012
keeper-0.012
 puzzled-0.012
Token to
Token ID284
Feature activation+4.43e+00

Pos logit contributions
ounded+0.060
oke+0.059
ometer+0.058
rier+0.057
keeper+0.053
Neg logit contributions
aster-0.034
e-0.032
 rest-0.031
One-0.030
 time-0.028
Token find
Token ID1064
Feature activation-1.52e-01

Pos logit contributions
 son+0.237
 dear+0.219
 teeth+0.217
 ashamed+0.215
 darling+0.210
Neg logit contributions
 recess-0.182
One-0.175
P-0.172
La-0.170
K-0.165
Token a
Token ID257
Feature activation-2.78e+00

Pos logit contributions
aster+0.005
e+0.004
 rest+0.004
One+0.003
 spring+0.003
Neg logit contributions
oke-0.013
ounded-0.013
ometer-0.013
rier-0.012
keeper-0.012
Token bone
Token ID9970
Feature activation+2.76e+00

Pos logit contributions
 rest+0.139
aster+0.138
One+0.133
 spring+0.128
 play+0.125
Neg logit contributions
oke-0.168
ounded-0.168
atted-0.159
rier-0.159
ometer-0.157
Token somewhere
Token ID7382
Feature activation+2.40e+00

Pos logit contributions
ometer+0.166
ounded+0.164
oke+0.159
keeper+0.158
rier+0.154
Neg logit contributions
aster-0.138
e-0.133
One-0.123
 rest-0.117
wered-0.112
Token else
Token ID2073
Feature activation+3.76e+00

Pos logit contributions
ometer+0.161
ounded+0.158
oke+0.154
rier+0.153
 understands+0.148
Neg logit contributions
aster-0.105
 rest-0.100
e-0.099
One-0.093
 play-0.090
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
ounded+0.272
oke+0.266
rier+0.262
ometer+0.261
 understands+0.253
Neg logit contributions
e-0.142
aster-0.135
 rest-0.133
One-0.124
 play-0.122
Token
Token ID198
Feature activation-4.46e-02

Pos logit contributions
One+0.036
aster+0.035
e+0.033
 rest+0.031
 spring+0.029
Neg logit contributions
ounded-0.087
rier-0.087
ometer-0.087
oke-0.084
ener-0.083
Token
Token ID198
Feature activation-7.08e-03

Pos logit contributions
aster+0.002
e+0.002
 rest+0.002
One+0.002
 arrive+0.002
Neg logit contributions
ometer-0.003
oke-0.003
ounded-0.003
keeper-0.003
 understands-0.003
TokenBob
Token ID18861
Feature activation-8.16e+00

Pos logit contributions
aster+0.000
e+0.000
 rest+0.000
 recess+0.000
 arrive+0.000
Neg logit contributions
ounded-0.001
ometer-0.001
oke-0.001
keeper-0.000
 understands-0.000
Token rushed
Token ID13999
Feature activation-5.73e+00

Pos logit contributions
e+0.411
aster+0.389
One+0.374
 rest+0.348
 play+0.342
Neg logit contributions
ometer-0.492
rier-0.492
ounded-0.487
oke-0.470
 understands-0.438
Token not
Token ID407
Feature activation-3.22e+00

Pos logit contributions
ometer+0.133
ounded+0.128
keeper+0.125
oke+0.125
 understands+0.123
Neg logit contributions
aster-0.077
e-0.077
One-0.065
 rest-0.064
he-0.061
Token much
Token ID881
Feature activation-4.17e+00

Pos logit contributions
e+0.131
aster+0.128
One+0.118
 rest+0.109
he+0.107
Neg logit contributions
ometer-0.213
ounded-0.205
rier-0.203
keeper-0.201
oke-0.198
Token space
Token ID2272
Feature activation-2.69e+00

Pos logit contributions
aster+0.204
e+0.193
One+0.182
 rest+0.178
 spring+0.168
Neg logit contributions
ometer-0.258
ounded-0.258
oke-0.249
keeper-0.241
rier-0.240
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
aster+0.115
e+0.109
One+0.102
 rest+0.096
wered+0.089
Neg logit contributions
ometer-0.183
ounded-0.179
oke-0.176
keeper-0.173
rier-0.171
Token
Token ID198
Feature activation-4.09e-02

Pos logit contributions
aster+0.045
e+0.043
One+0.043
 rest+0.039
 spring+0.036
Neg logit contributions
ounded-0.094
ometer-0.093
rier-0.090
oke-0.090
keeper-0.087
Token
Token ID198
Feature activation-1.52e-02

Pos logit contributions
aster+0.002
e+0.002
 rest+0.002
One+0.002
 recess+0.002
Neg logit contributions
ometer-0.003
oke-0.002
ounded-0.002
keeper-0.002
 understands-0.002
TokenLu
Token ID25596
Feature activation-8.60e+00

Pos logit contributions
aster+0.001
e+0.001
 recess+0.000
 rest+0.000
wered+0.000
Neg logit contributions
ometer-0.001
ounded-0.001
oke-0.001
keeper-0.001
 understands-0.001
Tokency
Token ID948
Feature activation+1.58e+00

Pos logit contributions
e+0.435
aster+0.427
One+0.383
 rest+0.377
he+0.361
Neg logit contributions
ounded-0.511
ometer-0.510
rier-0.503
oke-0.496
 puzzled-0.476
Token said
Token ID531
Feature activation+6.66e+00

Pos logit contributions
ometer+0.089
ounded+0.089
rier+0.087
oke+0.085
keeper+0.081
Neg logit contributions
e-0.086
aster-0.086
One-0.078
 rest-0.076
 spring-0.073
Token,
Token ID11
Feature activation-8.25e-01

Pos logit contributions
ounded+0.538
oke+0.517
ometer+0.513
rier+0.503
 puzzled+0.493
Neg logit contributions
aster-0.196
e-0.190
 rest-0.170
One-0.163
 spring-0.155
Token "
Token ID366
Feature activation+1.09e+01

Pos logit contributions
aster+0.052
One+0.051
e+0.051
 spring+0.051
 rest+0.050
Neg logit contributions
ounded-0.036
rier-0.034
oke-0.033
ometer-0.032
 understands-0.032
Token day
Token ID1110
Feature activation-6.38e+00

Pos logit contributions
aster+0.071
e+0.068
One+0.059
 rest+0.051
 recess+0.051
Neg logit contributions
ometer-0.183
ounded-0.179
oke-0.175
keeper-0.172
 puzzled-0.168
Token,
Token ID11
Feature activation-9.47e-01

Pos logit contributions
aster+0.307
e+0.289
One+0.272
 rest+0.255
wered+0.246
Neg logit contributions
ometer-0.396
ounded-0.385
oke-0.379
 understands-0.362
keeper-0.359
Token a
Token ID257
Feature activation-2.86e+00

Pos logit contributions
aster+0.035
e+0.034
One+0.030
 rest+0.028
 spring+0.026
Neg logit contributions
ometer-0.069
ounded-0.068
keeper-0.065
oke-0.065
rier-0.065
Token big
Token ID1263
Feature activation-3.37e-01

Pos logit contributions
aster+0.173
e+0.164
 rest+0.160
One+0.159
 spring+0.153
Neg logit contributions
ounded-0.140
ometer-0.139
oke-0.133
rier-0.130
 understands-0.129
Token storm
Token ID6388
Feature activation-2.89e+00

Pos logit contributions
aster+0.016
e+0.016
One+0.015
 rest+0.014
 spring+0.013
Neg logit contributions
ometer-0.021
ounded-0.021
oke-0.020
keeper-0.019
rier-0.019
Token came
Token ID1625
Feature activation-4.06e+00

Pos logit contributions
aster+0.151
e+0.143
One+0.139
 rest+0.130
 recess+0.123
Neg logit contributions
ounded-0.166
oke-0.164
ometer-0.163
keeper-0.154
 understands-0.151
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
aster+0.193
e+0.183
One+0.170
 rest+0.164
 spring+0.153
Neg logit contributions
ometer-0.256
ounded-0.255
oke-0.248
keeper-0.240
rier-0.238
Token broke
Token ID6265
Feature activation+2.74e+00

Pos logit contributions
aster+0.059
 rest+0.051
e+0.050
 spring+0.050
One+0.049
Neg logit contributions
oke-0.116
ometer-0.116
ounded-0.113
rier-0.111
 understands-0.109
Token the
Token ID262
Feature activation+9.47e-01

Pos logit contributions
ounded+0.185
ometer+0.184
oke+0.181
keeper+0.174
 understands+0.173
Neg logit contributions
aster-0.116
e-0.112
One-0.103
 rest-0.099
 spring-0.095
Token hall
Token ID6899
Feature activation-6.27e+00

Pos logit contributions
ometer+0.054
ounded+0.054
oke+0.052
keeper+0.050
rier+0.050
Neg logit contributions
aster-0.051
e-0.049
One-0.046
 rest-0.043
 spring-0.041
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
aster+0.231
e+0.219
One+0.198
 rest+0.190
 spring+0.175
Neg logit contributions
ometer-0.459
ounded-0.456
oke-0.446
keeper-0.433
rier-0.432
Token hook
Token ID8011
Feature activation+5.96e-01

Pos logit contributions
ometer+0.049
ounded+0.049
oke+0.048
keeper+0.046
rier+0.046
Neg logit contributions
aster-0.038
e-0.037
One-0.034
 rest-0.033
 spring-0.031
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
ometer+0.042
ounded+0.040
keeper+0.039
oke+0.039
rier+0.039
Neg logit contributions
aster-0.024
e-0.022
One-0.021
 rest-0.020
wered-0.019
Token throw
Token ID3714
Feature activation-5.51e+00

Pos logit contributions
aster+0.068
e+0.064
 rest+0.057
One+0.056
 spring+0.053
Neg logit contributions
ometer-0.131
ounded-0.131
oke-0.128
rier-0.124
keeper-0.123
Token the
Token ID262
Feature activation+7.83e-01

Pos logit contributions
aster+0.246
e+0.236
One+0.217
 rest+0.210
 spring+0.197
Neg logit contributions
ometer-0.362
ounded-0.361
oke-0.351
keeper-0.340
rier-0.338
Token line
Token ID1627
Feature activation-5.24e+00

Pos logit contributions
ounded+0.047
ometer+0.046
oke+0.045
keeper+0.044
rier+0.043
Neg logit contributions
aster-0.040
e-0.039
 rest-0.037
One-0.035
 spring-0.035
Token in
Token ID287
Feature activation-4.50e+00

Pos logit contributions
aster+0.263
e+0.247
One+0.235
 rest+0.223
wered+0.222
Neg logit contributions
ometer-0.309
ounded-0.308
keeper-0.297
rier-0.293
oke-0.292
Token the
Token ID262
Feature activation+7.26e-01

Pos logit contributions
aster+0.208
e+0.189
One+0.176
 rest+0.173
wered+0.167
Neg logit contributions
ometer-0.266
 puzzled-0.261
rier-0.260
keeper-0.259
ounded-0.255
Token water
Token ID1660
Feature activation-4.41e+00

Pos logit contributions
ounded+0.052
ometer+0.052
oke+0.051
keeper+0.049
rier+0.049
Neg logit contributions
aster-0.029
e-0.027
 rest-0.025
One-0.024
 spring-0.023
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
aster+0.169
e+0.159
One+0.148
 rest+0.139
wered+0.127
Neg logit contributions
ometer-0.315
ounded-0.308
oke-0.304
rier-0.301
keeper-0.300
Token
Token ID198
Feature activation-3.67e-01

Pos logit contributions
aster+0.054
One+0.049
 rest+0.046
 spring+0.045
 play+0.043
Neg logit contributions
ounded-0.114
ometer-0.111
oke-0.108
ener-0.107
rier-0.107
Token
Token ID198
Feature activation-3.68e-01

Pos logit contributions
aster+0.019
e+0.018
 rest+0.016
wered+0.016
 recess+0.015
Neg logit contributions
ometer-0.021
oke-0.021
ounded-0.021
keeper-0.020
 understands-0.020
Token play
Token ID711
Feature activation-7.15e+00

Pos logit contributions
aster+0.028
e+0.025
One+0.020
 rest+0.018
 spring+0.014
Neg logit contributions
ometer-0.136
ounded-0.136
oke-0.133
keeper-0.130
rier-0.130
Token with
Token ID351
Feature activation-3.43e+00

Pos logit contributions
aster+0.347
e+0.293
One+0.291
wered+0.282
 arrive+0.270
Neg logit contributions
ounded-0.419
ometer-0.402
 understands-0.400
oke-0.395
rier-0.391
Token my
Token ID616
Feature activation+8.14e+00

Pos logit contributions
aster+0.199
e+0.190
One+0.181
 rest+0.172
wered+0.167
Neg logit contributions
ometer-0.168
ounded-0.166
oke-0.160
keeper-0.160
 understands-0.159
Token ball
Token ID2613
Feature activation+1.21e+00

Pos logit contributions
oke+0.552
ounded+0.546
ometer+0.543
rier+0.520
keeper+0.514
Neg logit contributions
e-0.329
aster-0.328
 rest-0.326
One-0.313
 play-0.307
Token!"
Token ID2474
Feature activation+6.85e+00

Pos logit contributions
ometer+0.077
ounded+0.076
oke+0.073
keeper+0.073
rier+0.071
Neg logit contributions
aster-0.057
e-0.053
One-0.049
 rest-0.047
wered-0.046
Token Lily
Token ID20037
Feature activation-1.22e+00

Pos logit contributions
ounded+0.448
ometer+0.448
oke+0.435
rier+0.424
keeper+0.419
Neg logit contributions
aster-0.312
e-0.297
One-0.275
 rest-0.268
 spring-0.251
Token was
Token ID373
Feature activation+1.79e+00

Pos logit contributions
e+0.060
aster+0.055
 rest+0.054
One+0.053
 play+0.053
Neg logit contributions
rier-0.078
ometer-0.075
ounded-0.071
oke-0.071
keeper-0.066
Token happy
Token ID3772
Feature activation+5.54e+00

Pos logit contributions
ometer+0.103
ounded+0.101
oke+0.099
keeper+0.095
rier+0.094
Neg logit contributions
aster-0.095
e-0.094
One-0.087
 rest-0.085
 spring-0.082
Token and
Token ID290
Feature activation-1.53e+00

Pos logit contributions
ometer+0.339
ounded+0.337
oke+0.327
rier+0.317
keeper+0.315
Neg logit contributions
aster-0.277
e-0.266
One-0.248
 rest-0.243
 spring-0.230
Token said
Token ID531
Feature activation+6.57e+00

Pos logit contributions
 play+0.069
 rest+0.069
e+0.067
aster+0.067
 spring+0.066
Neg logit contributions
rier-0.096
ometer-0.096
oke-0.095
ounded-0.090
ener-0.088
Token,
Token ID11
Feature activation-9.27e-01

Pos logit contributions
ounded+0.487
ometer+0.484
oke+0.474
keeper+0.460
 puzzled+0.455
Neg logit contributions
aster-0.230
e-0.226
One-0.203
 rest-0.203
 spring-0.183
Token got
Token ID1392
Feature activation+1.23e-02

Pos logit contributions
aster+0.006
e+0.006
One+0.005
 rest+0.005
 spring+0.005
Neg logit contributions
ometer-0.006
ounded-0.006
oke-0.006
keeper-0.006
 understands-0.006
Token up
Token ID510
Feature activation+4.26e+00

Pos logit contributions
oke+0.001
ometer+0.001
ounded+0.001
 understands+0.001
rier+0.001
Neg logit contributions
aster-0.001
e-0.000
 rest-0.000
One-0.000
 play-0.000
Token early
Token ID1903
Feature activation-1.39e+00

Pos logit contributions
ometer+0.268
ounded+0.265
oke+0.256
keeper+0.253
rier+0.250
Neg logit contributions
aster-0.201
e-0.193
One-0.179
 rest-0.172
 spring-0.161
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
aster+0.069
e+0.067
One+0.062
 rest+0.060
 spring+0.056
Neg logit contributions
ometer-0.084
ounded-0.083
oke-0.080
keeper-0.078
rier-0.077
Token put
Token ID1234
Feature activation-4.31e+00

Pos logit contributions
aster+0.056
 play+0.050
 go+0.048
 rest+0.048
 read+0.048
Neg logit contributions
oke-0.121
rier-0.116
ometer-0.116
ounded-0.113
azed-0.109
Token on
Token ID319
Feature activation-4.47e+00

Pos logit contributions
aster+0.157
e+0.149
One+0.132
 rest+0.129
 spring+0.118
Neg logit contributions
ounded-0.321
ometer-0.321
oke-0.315
keeper-0.303
rier-0.301
Token his
Token ID465
Feature activation-2.12e-01

Pos logit contributions
aster+0.186
e+0.179
One+0.165
 rest+0.156
 spring+0.146
Neg logit contributions
ometer-0.302
ounded-0.298
oke-0.288
keeper-0.285
rier-0.282
Token mail
Token ID6920
Feature activation-3.55e+00

Pos logit contributions
aster+0.011
e+0.010
One+0.009
 rest+0.009
 spring+0.009
Neg logit contributions
ounded-0.013
ometer-0.013
oke-0.013
rier-0.012
keeper-0.012
Tokenman
Token ID805
Feature activation+1.71e+00

Pos logit contributions
aster+0.199
e+0.192
One+0.182
 rest+0.179
 spring+0.169
Neg logit contributions
ounded-0.192
oke-0.188
ometer-0.185
 understands-0.177
rier-0.175
Token hat
Token ID6877
Feature activation+1.85e+00

Pos logit contributions
ometer+0.093
ounded+0.092
oke+0.089
keeper+0.086
rier+0.085
Neg logit contributions
aster-0.096
e-0.092
One-0.087
 rest-0.084
 spring-0.081
Token and
Token ID290
Feature activation-1.52e+00

Pos logit contributions
ometer+0.123
ounded+0.121
oke+0.118
keeper+0.116
rier+0.114
Neg logit contributions
aster-0.081
e-0.077
One-0.071
 rest-0.068
 spring-0.064
Tokenmen
Token ID3653
Feature activation-8.92e-01

Pos logit contributions
aster+0.310
e+0.286
One+0.286
wered+0.264
 recess+0.264
Neg logit contributions
ometer-0.221
ounded-0.200
rier-0.199
oke-0.196
keeper-0.190
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
aster+0.047
One+0.043
wered+0.041
 recess+0.040
e+0.040
Neg logit contributions
ometer-0.052
ounded-0.048
keeper-0.048
oke-0.048
rier-0.048
Token snow
Token ID6729
Feature activation-4.70e+00

Pos logit contributions
aster+0.079
e+0.079
One+0.073
he+0.066
 rest+0.065
Neg logit contributions
ometer-0.113
ounded-0.111
oke-0.108
 understands-0.106
keeper-0.106
Tokenballs
Token ID21591
Feature activation-1.70e+00

Pos logit contributions
aster+0.318
e+0.292
One+0.291
wered+0.271
 rest+0.270
Neg logit contributions
ometer-0.223
rier-0.203
oke-0.194
ounded-0.193
keeper-0.190
Token and
Token ID290
Feature activation-1.83e+00

Pos logit contributions
aster+0.092
e+0.084
wered+0.080
One+0.080
|+0.077
Neg logit contributions
ometer-0.095
rier-0.092
oke-0.087
keeper-0.085
ounded-0.083
Token slide
Token ID10649
Feature activation-9.78e+00

Pos logit contributions
e+0.072
aster+0.072
One+0.068
he+0.060
wered+0.058
Neg logit contributions
ometer-0.122
ounded-0.120
oke-0.116
 understands-0.116
 puzzled-0.115
Token on
Token ID319
Feature activation-4.88e+00

Pos logit contributions
aster+0.452
e+0.426
One+0.386
 rest+0.372
wered+0.369
Neg logit contributions
ometer-0.647
ounded-0.613
keeper-0.594
rier-0.589
oke-0.588
Token the
Token ID262
Feature activation+8.20e-01

Pos logit contributions
aster+0.229
e+0.226
One+0.205
he+0.201
 recess+0.201
Neg logit contributions
ometer-0.266
 teeth-0.259
keeper-0.252
 understands-0.251
 eyes-0.250
Token ice
Token ID4771
Feature activation-6.62e+00

Pos logit contributions
ometer+0.049
ounded+0.047
oke+0.046
keeper+0.045
rier+0.045
Neg logit contributions
aster-0.042
e-0.041
One-0.037
 rest-0.034
 recess-0.033
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
aster+0.298
e+0.295
One+0.275
wered+0.250
 rest+0.247
Neg logit contributions
ometer-0.438
ounded-0.414
rier-0.405
keeper-0.401
oke-0.401
Token But
Token ID887
Feature activation+1.32e+00

Pos logit contributions
aster+0.073
e+0.071
One+0.064
 rest+0.063
 spring+0.059
Neg logit contributions
ometer-0.095
ounded-0.094
oke-0.092
keeper-0.089
 understands-0.088
Token saw
Token ID2497
Feature activation+8.35e+00

Pos logit contributions
aster+0.019
e+0.019
One+0.017
 rest+0.017
 spring+0.016
Neg logit contributions
ometer-0.020
ounded-0.020
oke-0.020
rier-0.019
keeper-0.019
Token a
Token ID257
Feature activation-3.21e+00

Pos logit contributions
ounded+0.706
ometer+0.705
oke+0.694
rier+0.669
keeper+0.651
Neg logit contributions
aster-0.250
e-0.217
 rest-0.198
One-0.195
 spring-0.178
Token big
Token ID1263
Feature activation-6.55e-01

Pos logit contributions
aster+0.149
 rest+0.145
 spring+0.145
One+0.137
 end+0.135
Neg logit contributions
ounded-0.201
ometer-0.195
oke-0.192
atted-0.191
rier-0.184
Token black
Token ID2042
Feature activation+2.81e+00

Pos logit contributions
aster+0.033
e+0.031
 rest+0.029
 spring+0.029
One+0.028
Neg logit contributions
ounded-0.041
ometer-0.040
oke-0.039
rier-0.037
keeper-0.037
Token beetle
Token ID45489
Feature activation+2.83e+00

Pos logit contributions
ometer+0.119
oke+0.110
keeper+0.107
ounded+0.106
rier+0.104
Neg logit contributions
aster-0.185
e-0.184
One-0.178
 rest-0.166
 recess-0.161
Token crawling
Token ID34499
Feature activation-9.46e+00

Pos logit contributions
ometer+0.166
oke+0.165
 puzzled+0.160
ounded+0.159
 understands+0.159
Neg logit contributions
aster-0.133
e-0.116
One-0.115
 recess-0.113
wered-0.111
Token on
Token ID319
Feature activation-4.88e+00

Pos logit contributions
aster+0.380
e+0.352
One+0.326
 rest+0.311
 spring+0.280
Neg logit contributions
ometer-0.667
ounded-0.662
oke-0.644
keeper-0.629
rier-0.626
Token a
Token ID257
Feature activation-3.25e+00

Pos logit contributions
e+0.215
aster+0.211
One+0.198
irl+0.193
 rest+0.189
Neg logit contributions
ometer-0.277
keeper-0.275
 teeth-0.267
rier-0.266
 puzzled-0.265
Token leaf
Token ID12835
Feature activation-1.60e+00

Pos logit contributions
aster+0.159
e+0.155
One+0.142
 rest+0.135
 recess+0.128
Neg logit contributions
ometer-0.196
ounded-0.193
oke-0.186
keeper-0.184
rier-0.182
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
aster+0.078
e+0.072
 recess+0.068
One+0.066
wered+0.063
Neg logit contributions
ometer-0.097
keeper-0.091
ounded-0.088
rier-0.087
oke-0.087
Token It
Token ID632
Feature activation+6.64e+00

Pos logit contributions
aster+0.079
 go+0.065
 play+0.065
One+0.063
e+0.062
Neg logit contributions
ounded-0.106
ometer-0.104
oke-0.102
rier-0.102
ience-0.101
Token some
Token ID617
Feature activation+2.45e+00

Pos logit contributions
aster+0.206
e+0.193
 rest+0.178
One+0.177
 spring+0.162
Neg logit contributions
ometer-0.323
ounded-0.321
oke-0.315
keeper-0.298
rier-0.298
Token food
Token ID2057
Feature activation+2.35e+00

Pos logit contributions
ometer+0.136
ounded+0.135
oke+0.133
keeper+0.129
 understands+0.128
Neg logit contributions
e-0.125
aster-0.124
One-0.116
wered-0.106
 rest-0.105
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
ometer+0.155
ounded+0.151
oke+0.148
keeper+0.148
rier+0.144
Neg logit contributions
aster-0.104
e-0.100
One-0.094
 rest-0.086
wered-0.085
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
aster+0.061
e+0.059
One+0.058
 rest+0.056
 spring+0.054
Neg logit contributions
ounded-0.066
rier-0.065
oke-0.063
keeper-0.062
ometer-0.062
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
aster+0.006
e+0.005
One+0.005
 rest+0.005
 spring+0.004
Neg logit contributions
ometer-0.008
ounded-0.008
oke-0.008
 understands-0.007
keeper-0.007
TokenCh
Token ID1925
Feature activation-1.05e+01

Pos logit contributions
aster+0.005
e+0.004
 rest+0.004
 recess+0.004
 course+0.003
Neg logit contributions
ounded-0.008
ometer-0.008
oke-0.007
 puzzled-0.007
 understands-0.007
Tokenad
Token ID324
Feature activation-1.59e+01

Pos logit contributions
aster+0.455
One+0.420
 rest+0.411
e+0.408
 spring+0.382
Neg logit contributions
oke-0.711
ometer-0.682
ounded-0.673
atted-0.639
 understands-0.633
Token finished
Token ID5201
Feature activation-6.73e+00

Pos logit contributions
e+0.798
aster+0.788
One+0.709
 rest+0.681
 read+0.645
Neg logit contributions
ometer-1.012
ounded-0.960
rier-0.948
oke-0.895
 puzzled-0.861
Token lunch
Token ID9965
Feature activation-5.88e+00

Pos logit contributions
aster+0.286
e+0.278
One+0.258
 rest+0.246
 spring+0.227
Neg logit contributions
ometer-0.445
ounded-0.439
oke-0.433
keeper-0.420
 understands-0.417
Token and
Token ID290
Feature activation-1.56e+00

Pos logit contributions
aster+0.275
e+0.263
One+0.246
 rest+0.233
wered+0.219
Neg logit contributions
ometer-0.370
ounded-0.367
oke-0.359
keeper-0.353
rier-0.346
Token he
Token ID339
Feature activation-8.12e-02

Pos logit contributions
aster+0.071
e+0.068
One+0.063
 rest+0.061
 spring+0.057
Neg logit contributions
ometer-0.101
ounded-0.100
oke-0.097
keeper-0.094
rier-0.094
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
ometer+0.044
keeper+0.043
ounded+0.042
rier+0.042
oke+0.041
Neg logit contributions
aster-0.037
e-0.033
One-0.033
wered-0.032
 recess-0.030
Token The
Token ID383
Feature activation+3.97e+00

Pos logit contributions
aster+0.047
e+0.044
One+0.043
 rest+0.043
 play+0.039
Neg logit contributions
ounded-0.077
rier-0.077
ometer-0.074
oke-0.074
 puzzled-0.071
Token end
Token ID886
Feature activation-5.16e+00

Pos logit contributions
oke+0.236
ounded+0.234
ometer+0.233
 understands+0.227
keeper+0.226
Neg logit contributions
aster-0.190
e-0.186
One-0.162
 rest-0.156
he-0.155
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
aster+0.181
e+0.171
One+0.154
 rest+0.147
 spring+0.134
Neg logit contributions
ometer-0.389
ounded-0.387
oke-0.379
keeper-0.368
rier-0.367
Token<|endoftext|>
Token ID50256
Feature activation-1.44e-01

Pos logit contributions
aster+0.046
e+0.044
One+0.040
 rest+0.039
 spring+0.036
Neg logit contributions
ometer-0.068
ounded-0.066
oke-0.066
keeper-0.064
 understands-0.064
TokenOnce
Token ID7454
Feature activation-9.75e+00

Pos logit contributions
aster+0.008
e+0.007
 wait+0.007
wered+0.007
One+0.007
Neg logit contributions
oke-0.007
ometer-0.007
 understands-0.007
ounded-0.007
keeper-0.007
Token upon
Token ID2402
Feature activation+1.19e+01

Pos logit contributions
aster+0.571
One+0.557
 rest+0.551
 time+0.544
e+0.534
Neg logit contributions
rier-0.487
ometer-0.480
ounded-0.476
keeper-0.449
ener-0.430
Token a
Token ID257
Feature activation-2.84e+00

Pos logit contributions
 ashamed+0.557
 understands+0.548
 teeth+0.537
 puzzled+0.530
ounded+0.521
Neg logit contributions
aster-0.573
 recess-0.556
e-0.530
irl-0.492
asted-0.487
Token time
Token ID640
Feature activation-9.69e+00

Pos logit contributions
aster+0.087
e+0.079
 recess+0.078
wered+0.076
 arrive+0.072
Neg logit contributions
oke-0.196
 understands-0.196
ounded-0.193
ometer-0.189
 puzzled-0.187
Token,
Token ID11
Feature activation-8.70e-01

Pos logit contributions
aster+0.457
e+0.424
One+0.388
 rest+0.375
wered+0.355
Neg logit contributions
ometer-0.616
ounded-0.614
oke-0.609
 understands-0.572
keeper-0.569
Token there
Token ID612
Feature activation+4.33e+00

Pos logit contributions
aster+0.044
e+0.042
One+0.039
 rest+0.038
 spring+0.036
Neg logit contributions
ometer-0.053
ounded-0.052
oke-0.051
keeper-0.049
rier-0.049
Token the
Token ID262
Feature activation+1.25e+00

Pos logit contributions
ometer+0.233
ounded+0.233
oke+0.228
keeper+0.224
rier+0.221
Neg logit contributions
aster-0.138
e-0.129
One-0.120
 rest-0.114
 spring-0.105
Token magical
Token ID10883
Feature activation+5.03e+00

Pos logit contributions
ometer+0.065
ounded+0.065
oke+0.063
keeper+0.061
rier+0.060
Neg logit contributions
aster-0.073
e-0.070
One-0.066
 rest-0.064
 spring-0.061
Token statue
Token ID15207
Feature activation+2.30e+00

Pos logit contributions
ometer+0.272
ounded+0.270
oke+0.256
rier+0.249
keeper+0.243
Neg logit contributions
aster-0.287
e-0.280
One-0.263
 rest-0.258
 spring-0.247
Token.
Token ID13
Feature activation-9.50e-01

Pos logit contributions
ometer+0.143
ounded+0.142
oke+0.141
keeper+0.140
rier+0.134
Neg logit contributions
aster-0.106
e-0.099
One-0.094
 rest-0.090
wered-0.086
Token<|endoftext|>
Token ID50256
Feature activation-1.74e-01

Pos logit contributions
aster+0.042
e+0.040
One+0.037
 rest+0.036
 spring+0.033
Neg logit contributions
ometer-0.063
ounded-0.063
oke-0.061
rier-0.060
keeper-0.059
TokenOnce
Token ID7454
Feature activation-9.66e+00

Pos logit contributions
aster+0.010
e+0.009
One+0.009
 rest+0.008
wered+0.008
Neg logit contributions
ometer-0.009
oke-0.009
 understands-0.008
ounded-0.008
keeper-0.008
Token upon
Token ID2402
Feature activation+1.20e+01

Pos logit contributions
aster+0.528
 time+0.526
One+0.523
 rest+0.523
he+0.519
Neg logit contributions
rier-0.510
ometer-0.499
ounded-0.477
keeper-0.454
ener-0.446
Token a
Token ID257
Feature activation-2.75e+00

Pos logit contributions
 understands+0.559
 ashamed+0.554
 puzzled+0.553
 teeth+0.541
ounded+0.537
Neg logit contributions
aster-0.604
 recess-0.574
e-0.537
P-0.503
 race-0.499
Token time
Token ID640
Feature activation-9.63e+00

Pos logit contributions
aster+0.085
 recess+0.075
wered+0.074
e+0.071
 arrive+0.068
Neg logit contributions
oke-0.195
 understands-0.194
ounded-0.193
 puzzled-0.192
ometer-0.188
Token there
Token ID612
Feature activation+4.39e+00

Pos logit contributions
aster+0.405
e+0.385
One+0.352
 rest+0.339
 spring+0.315
Neg logit contributions
ometer-0.656
ounded-0.653
oke-0.639
keeper-0.617
rier-0.612
Token was
Token ID373
Feature activation+1.91e+00

Pos logit contributions
ometer+0.228
ounded+0.224
oke+0.222
 understands+0.213
keeper+0.212
Neg logit contributions
aster-0.253
e-0.237
One-0.227
 rest-0.218
 recess-0.208