TOP ACTIVATIONS
MAX = 48.226

LL WHEN YOU FE EL LIKE IT ". Tom
bit of magic . <|endoftext|> Isa ac was preparing breakfast in
cl ove and cinnamon and cass ia and sa ff ron
love it !" Isa ac gul ped and said
thought it was the perfect souven ir from the walk .
said mom my , but don
happy to tell her I 'm
Tim my took a deep breath and opened the rocket .
sign ' happy ', ' birth day ', and ' you
that said " DO NOT PRESS ." She couldn 't resist
speed and her peace she just
my bird said , Wait until
B eth said , Let â
with a big number 1 â
no time ." Enc our aged by the eagle
m ama said , Let â
His mother said , There ,
, Jenn ie followed the proverb : " Del ay is
a rare silence that added
L ily took a deep breath and started to climb .

MIN ACTIVATIONS
MIN = -47.601

did not care about bath ro bes . She cared about
" I 'm a tight ro pe walk er . Do
walk around on a tight ro pe , he could even
brother would wear their bath ro bes together and play games
march ." Ped ro said , " Okay ,
happy ! F ro ggy 's friends asked him
singing and the flag unf ro ze . The
ucky . F ro ggy looked at the ball
it ?" F ro ggy said , " I
man balancing on the tight ro pe . <|endoftext|> Once upon
Lily had so many bath ro bes , but he loved
today ." F ro ggy smiled and said ,
ggy . F ro ggy provided D ucky with
She called it " F ro ggy " and put it
said . F ro ggie hopped out of the
." F ro ggy said , " Don
deaf . F ro ggy was so happy that
ball ?" F ro ggy replied , " Sure
play !" F ro ggy and D ucky played
ucky said , " F ro ggy , you are such

INTERVAL 24.269 - 48.226
CONTAINS 1.451%

you might get hurt . <|endoftext|> One day , a little
no luck ." The moms hear them laugh .
and daddy took a deep breath . They were so happy
s important to increase the tid iness of the toilet .
Tim my the pictures in the book and told him about

INTERVAL 0.313 - 24.269
CONTAINS 45.532%

and Dad looked at the pe b ble carefully . It
. They put on their hats , shoes and coats .
the duck and was happy that Max was by her side
off the tree . George smiled and took a bite .
Ben . He pulled hard on the root , but it

INTERVAL -23.644 - 0.313
CONTAINS 52.504%

" I saw how much you like to write , so
kissed them . Some told them stories or jokes . Lily
So they went to the store to replace the ball with
unloaded the motorcycle with ease . Tim my was so excited
took him away . The lion 's friends were sad and

INTERVAL -47.601 - -23.644
CONTAINS 0.513%

food away from Tim and Sam . " No
time there was a nos y cat named Felix . He
hat and his bag . He walks fast .
flame outside his window . He got scared and called out
middle of the huge spark ly things was a sparkling wheel
TokenLL
Token ID3069
Feature activation+2.89e+01

Pos logit contributions
 outer+0.971
 such+0.927
 unnecessary+0.924
 possible+0.867
rous+0.847
Neg logit contributions
board-1.045
 He-0.931
ops-0.878
affe-0.840
 boy-0.838
Token WHEN
Token ID42099
Feature activation+3.21e+01

Pos logit contributions
 outer+0.780
 such+0.753
 unnecessary+0.753
rous+0.698
 possible+0.696
Neg logit contributions
board-0.792
 He-0.707
ops-0.660
 was-0.630
 boy-0.629
Token YOU
Token ID7013
Feature activation+1.43e+01

Pos logit contributions
 outer+0.710
 unnecessary+0.682
 such+0.651
nam+0.628
rous+0.614
Neg logit contributions
board-1.061
 He-0.923
ops-0.914
affe-0.865
 boy-0.854
Token FE
Token ID18630
Feature activation+1.87e+01

Pos logit contributions
 outer+0.385
 unnecessary+0.378
 such+0.370
rous+0.350
 possible+0.347
Neg logit contributions
board-0.394
 He-0.345
ops-0.330
 was-0.323
s-0.318
TokenEL
Token ID3698
Feature activation+1.19e+01

Pos logit contributions
 outer+0.473
 unnecessary+0.462
 such+0.453
rous+0.427
 possible+0.419
Neg logit contributions
board-0.520
 He-0.477
ops-0.440
 boy-0.435
 was-0.434
Token LIKE
Token ID34178
Feature activation+4.82e+01

Pos logit contributions
 outer+0.323
 unnecessary+0.304
 possible+0.297
 such+0.295
nam+0.285
Neg logit contributions
board-0.343
 He-0.289
ops-0.287
s-0.279
ard-0.273
Token IT
Token ID7283
Feature activation+1.25e+01

Pos logit contributions
 unnecessary+1.199
 outer+1.178
rous+1.099
 possible+1.093
 such+1.092
Neg logit contributions
board-1.471
 He-1.322
ops-1.250
affe-1.207
 was-1.167
Token".
Token ID1911
Feature activation+1.34e+01

Pos logit contributions
 such+0.343
 outer+0.338
 unnecessary+0.330
 possible+0.312
 any+0.310
Neg logit contributions
board-0.316
 He-0.284
ops-0.255
affe-0.251
ushed-0.248
Token
Token ID198
Feature activation+6.05e-02

Pos logit contributions
 such+0.377
 outer+0.348
 unnecessary+0.333
 an+0.329
 any+0.323
Neg logit contributions
board-0.342
ops-0.289
y-0.287
les-0.282
affe-0.281
Token
Token ID198
Feature activation+1.07e-01

Pos logit contributions
 outer+0.002
 such+0.002
 unnecessary+0.002
rous+0.002
 possible+0.002
Neg logit contributions
board-0.001
 He-0.001
ops-0.001
affe-0.001
 boy-0.001
TokenTom
Token ID13787
Feature activation-2.59e+00

Pos logit contributions
 outer+0.004
 unnecessary+0.004
 such+0.004
rous+0.004
 possible+0.004
Neg logit contributions
board-0.002
 He-0.002
ops-0.001
 was-0.001
s-0.001
Token bit
Token ID1643
Feature activation+6.53e+00

Pos logit contributions
board+0.282
 He+0.252
ops+0.234
 boy+0.227
 was+0.227
Neg logit contributions
 outer-0.280
 unnecessary-0.271
 such-0.271
rous-0.251
 possible-0.249
Token of
Token ID286
Feature activation-3.56e+00

Pos logit contributions
 outer+0.158
 such+0.151
 unnecessary+0.151
rous+0.139
 possible+0.137
Neg logit contributions
board-0.195
 He-0.177
ops-0.163
 was-0.161
s-0.161
Token magic
Token ID5536
Feature activation-3.23e-01

Pos logit contributions
board+0.113
 He+0.109
 was+0.106
 boy+0.100
 wanted+0.097
Neg logit contributions
 outer-0.070
 unnecessary-0.067
rous-0.061
 such-0.061
 possible-0.057
Token.
Token ID13
Feature activation-5.21e+00

Pos logit contributions
board+0.008
 He+0.007
ops+0.006
affe+0.006
s+0.006
Neg logit contributions
 outer-0.010
 such-0.010
 unnecessary-0.009
 possible-0.009
 any-0.009
Token<|endoftext|>
Token ID50256
Feature activation+2.91e+01

Pos logit contributions
board+0.120
ops+0.097
 He+0.092
affe+0.088
les+0.087
Neg logit contributions
 such-0.167
 outer-0.162
 unnecessary-0.154
rous-0.148
 possible-0.147
TokenIsa
Token ID39443
Feature activation+4.44e+01

Pos logit contributions
 such+0.757
 outer+0.689
 an+0.627
 these+0.594
 any+0.594
Neg logit contributions
board-0.797
affe-0.726
ushed-0.694
ops-0.681
der-0.680
Tokenac
Token ID330
Feature activation+3.68e+00

Pos logit contributions
 outer+1.116
 such+1.075
 unnecessary+1.071
rous+0.991
 possible+0.984
Neg logit contributions
board-1.284
 He-1.155
ops-1.086
 boy-1.034
s-1.032
Token was
Token ID373
Feature activation-2.12e+01

Pos logit contributions
 such+0.111
 outer+0.108
 unnecessary+0.104
rous+0.102
 any+0.098
Neg logit contributions
board-0.086
 He-0.076
ops-0.069
 boy-0.065
 Maybe-0.063
Token preparing
Token ID10629
Feature activation+1.28e+01

Pos logit contributions
 was+0.515
 He+0.507
 and+0.505
 wanted+0.457
 liked+0.439
Neg logit contributions
 unnecessary-0.628
 outer-0.540
 backup-0.513
 possible-0.493
 assistance-0.485
Token breakfast
Token ID12607
Feature activation+9.03e+00

Pos logit contributions
 unnecessary+0.314
 outer+0.260
rous+0.257
 possible+0.255
shadow+0.247
Neg logit contributions
board-0.382
 He-0.374
 was-0.363
 and-0.347
s-0.324
Token in
Token ID287
Feature activation+9.14e+00

Pos logit contributions
 such+0.275
 outer+0.274
 unnecessary+0.257
 possible+0.251
 any+0.247
Neg logit contributions
board-0.206
 He-0.182
affe-0.165
ops-0.165
 boy-0.161
Token cl
Token ID537
Feature activation+8.08e+00

Pos logit contributions
board+0.518
 He+0.470
ops+0.443
 was+0.429
 boy+0.428
Neg logit contributions
 outer-0.379
 unnecessary-0.365
 such-0.364
rous-0.334
 possible-0.330
Tokenove
Token ID659
Feature activation+2.87e+00

Pos logit contributions
 outer+0.225
 such+0.224
 unnecessary+0.218
rous+0.206
 possible+0.201
Neg logit contributions
board-0.214
 He-0.186
 boy-0.172
ops-0.172
 was-0.167
Token and
Token ID290
Feature activation-1.66e+01

Pos logit contributions
 outer+0.088
 such+0.086
 unnecessary+0.085
rous+0.080
 possible+0.080
Neg logit contributions
board-0.067
 He-0.059
ops-0.054
 boy-0.051
 was-0.051
Token cinnamon
Token ID29181
Feature activation-4.55e+00

Pos logit contributions
board+0.507
 He+0.474
 was+0.448
ops+0.438
 boy+0.432
Neg logit contributions
 outer-0.367
 unnecessary-0.365
 such-0.352
rous-0.333
 possible-0.316
Token and
Token ID290
Feature activation-1.66e+01

Pos logit contributions
board+0.102
 He+0.089
ops+0.079
 boy+0.077
affe+0.077
Neg logit contributions
 outer-0.142
 such-0.141
 unnecessary-0.138
 possible-0.131
 any-0.129
Token cass
Token ID30606
Feature activation+4.44e+01

Pos logit contributions
board+0.494
 He+0.453
ops+0.424
 was+0.421
 boy+0.413
Neg logit contributions
 outer-0.390
 unnecessary-0.381
 such-0.374
rous-0.351
 possible-0.340
Tokenia
Token ID544
Feature activation-2.34e+01

Pos logit contributions
 outer+1.196
 such+1.190
 unnecessary+1.130
 possible+1.103
 any+1.059
Neg logit contributions
board-1.170
 He-1.022
ops-0.978
 boy-0.942
s-0.932
Token and
Token ID290
Feature activation-1.65e+01

Pos logit contributions
board+0.373
 He+0.287
ops+0.262
boards+0.233
 boy+0.226
Neg logit contributions
 such-0.893
 outer-0.869
 unnecessary-0.853
 possible-0.828
 your-0.822
Token sa
Token ID473
Feature activation+8.90e+00

Pos logit contributions
board+0.527
 He+0.500
 was+0.480
 boy+0.463
ops+0.458
Neg logit contributions
 outer-0.335
 unnecessary-0.330
 such-0.311
rous-0.297
 possible-0.275
Tokenff
Token ID487
Feature activation-7.90e+00

Pos logit contributions
 such+0.235
 outer+0.231
 unnecessary+0.222
 possible+0.214
rous+0.213
Neg logit contributions
board-0.240
 He-0.219
 boy-0.197
 Maybe-0.187
affe-0.185
Tokenron
Token ID1313
Feature activation-1.08e-01

Pos logit contributions
board+0.175
 He+0.145
ops+0.132
 boy+0.129
boards+0.124
Neg logit contributions
 such-0.246
 outer-0.244
 unnecessary-0.243
 possible-0.231
 any-0.228
Token love
Token ID1842
Feature activation+1.44e+00

Pos logit contributions
board+0.093
 He+0.087
 was+0.081
ops+0.079
s+0.076
Neg logit contributions
 outer-0.083
 unnecessary-0.080
 such-0.077
rous-0.076
nam-0.073
Token it
Token ID340
Feature activation+1.13e+00

Pos logit contributions
 outer+0.031
 unnecessary+0.029
rous+0.027
 such+0.027
 backup+0.026
Neg logit contributions
board-0.045
 He-0.042
 was-0.041
s-0.039
ops-0.039
Token!"
Token ID2474
Feature activation+8.37e+00

Pos logit contributions
 outer+0.030
 such+0.030
 unnecessary+0.029
 possible+0.027
rous+0.027
Neg logit contributions
board-0.031
 He-0.027
ops-0.025
 boy-0.024
affe-0.024
Token
Token ID198
Feature activation-3.71e+00

Pos logit contributions
 such+0.261
 outer+0.246
 unnecessary+0.240
 any+0.231
 our+0.227
Neg logit contributions
board-0.193
ops-0.153
affe-0.146
ushed-0.141
 He-0.140
Token
Token ID198
Feature activation-3.62e+00

Pos logit contributions
board+0.061
 He+0.057
s+0.047
 was+0.047
ops+0.046
Neg logit contributions
 outer-0.140
 unnecessary-0.138
 such-0.134
rous-0.130
 possible-0.130
TokenIsa
Token ID39443
Feature activation+4.36e+01

Pos logit contributions
board+0.101
 He+0.098
s+0.091
 was+0.088
ops+0.087
Neg logit contributions
 outer-0.095
 unnecessary-0.092
 such-0.087
rous-0.085
 possible-0.084
Tokenac
Token ID330
Feature activation+2.70e+00

Pos logit contributions
 outer+0.889
 unnecessary+0.846
 such+0.845
rous+0.781
 possible+0.771
Neg logit contributions
board-1.443
 He-1.324
ops-1.247
s-1.205
 boy-1.195
Token gul
Token ID47161
Feature activation+2.20e+01

Pos logit contributions
 such+0.072
 unnecessary+0.066
 outer+0.064
rous+0.063
 any+0.062
Neg logit contributions
board-0.071
 He-0.064
 Pole-0.058
 boy-0.058
ops-0.058
Tokenped
Token ID9124
Feature activation-1.33e+01

Pos logit contributions
 such+0.573
 outer+0.550
 unnecessary+0.548
rous+0.507
 any+0.500
Neg logit contributions
board-0.601
 He-0.528
 boy-0.503
ops-0.484
 Maybe-0.476
Token and
Token ID290
Feature activation-1.70e+01

Pos logit contributions
board+0.332
 He+0.315
 was+0.282
 boy+0.269
s+0.269
Neg logit contributions
 outer-0.385
 unnecessary-0.381
 such-0.359
rous-0.355
 possible-0.346
Token said
Token ID531
Feature activation-1.81e+01

Pos logit contributions
board+0.384
 He+0.324
ops+0.301
affe+0.288
 boy+0.280
Neg logit contributions
 outer-0.539
 such-0.532
 unnecessary-0.519
rous-0.493
 possible-0.490
Token thought
Token ID1807
Feature activation-1.06e+00

Pos logit contributions
 such+0.436
 unnecessary+0.403
 our+0.400
 outer+0.396
rous+0.385
Neg logit contributions
board-0.380
 He-0.316
les-0.294
der-0.291
y-0.289
Token it
Token ID340
Feature activation+1.11e+01

Pos logit contributions
board+0.032
 He+0.028
ops+0.027
ushed+0.027
oc+0.026
Neg logit contributions
 such-0.026
 outer-0.025
 unnecessary-0.024
 any-0.023
 possible-0.023
Token was
Token ID373
Feature activation-1.27e+01

Pos logit contributions
 such+0.411
 any+0.393
 anything+0.389
 an+0.387
 outer+0.384
Neg logit contributions
board-0.190
affe-0.157
omo-0.147
ops-0.144
oc-0.143
Token the
Token ID262
Feature activation+2.71e+01

Pos logit contributions
board+0.437
 He+0.425
 was+0.406
 boy+0.388
ops+0.376
Neg logit contributions
 outer-0.229
 unnecessary-0.214
 such-0.198
rous-0.198
 backup-0.184
Token perfect
Token ID2818
Feature activation+9.37e+00

Pos logit contributions
 outer+0.582
 such+0.562
 unnecessary+0.561
 possible+0.514
rous+0.506
Neg logit contributions
board-0.882
 He-0.804
ops-0.765
s-0.735
affe-0.730
Token souven
Token ID45043
Feature activation+4.30e+01

Pos logit contributions
 outer+0.229
 such+0.222
 unnecessary+0.211
 possible+0.211
 any+0.201
Neg logit contributions
board-0.266
 He-0.248
ops-0.237
s-0.229
affe-0.227
Tokenir
Token ID343
Feature activation-2.85e+00

Pos logit contributions
 outer+0.971
 unnecessary+0.958
 such+0.929
 possible+0.841
rous+0.837
Neg logit contributions
board-1.359
 He-1.211
ops-1.198
s-1.130
 was-1.109
Token from
Token ID422
Feature activation+1.31e+01

Pos logit contributions
board+0.066
 He+0.059
 boy+0.053
ops+0.051
 Maybe+0.050
Neg logit contributions
 such-0.085
 outer-0.085
 unnecessary-0.082
 possible-0.081
 any-0.078
Token the
Token ID262
Feature activation+2.47e+01

Pos logit contributions
 outer+0.226
 unnecessary+0.219
 such+0.212
rous+0.191
 possible+0.187
Neg logit contributions
board-0.470
 He-0.438
ops-0.415
 was-0.411
 boy-0.404
Token walk
Token ID2513
Feature activation+1.18e+01

Pos logit contributions
 outer+0.630
 such+0.611
 unnecessary+0.609
rous+0.563
 possible+0.558
Neg logit contributions
board-0.703
 He-0.626
ops-0.589
 boy-0.572
 was-0.572
Token.
Token ID13
Feature activation+3.78e-01

Pos logit contributions
 such+0.368
 outer+0.361
 unnecessary+0.341
 possible+0.334
 any+0.333
Neg logit contributions
board-0.263
 He-0.223
ops-0.217
affe-0.204
ushed-0.202
Token said
Token ID531
Feature activation-7.48e+00

Pos logit contributions
nam+0.225
 outer+0.218
 accompl+0.215
 possible+0.214
 unnecessary+0.210
Neg logit contributions
board-0.336
 was-0.326
 boy-0.320
 He-0.308
 and-0.303
Token mom
Token ID1995
Feature activation+5.35e+00

Pos logit contributions
board+0.175
 He+0.146
ops+0.143
affe+0.131
 boy+0.129
Neg logit contributions
 such-0.229
 outer-0.226
 unnecessary-0.218
 any-0.210
 possible-0.209
Tokenmy
Token ID1820
Feature activation-6.65e+00

Pos logit contributions
 such+0.180
 any+0.165
 permission+0.160
 an+0.160
 these+0.157
Neg logit contributions
board-0.102
ops-0.086
 He-0.085
oc-0.081
actus-0.079
Token,
Token ID11
Feature activation+6.20e+00

Pos logit contributions
board+0.131
 He+0.106
ops+0.101
affe+0.100
oc+0.098
Neg logit contributions
 such-0.224
 any-0.211
 outer-0.205
 able-0.204
 permission-0.204
Token
Token ID6184
Feature activation+3.19e+01

Pos logit contributions
 such+0.201
 outer+0.200
 unnecessary+0.192
 possible+0.183
 any+0.183
Neg logit contributions
board-0.132
ops-0.109
 He-0.107
affe-0.098
s-0.095
Token
Token ID95
Feature activation+4.30e+01

Pos logit contributions
 outer+0.813
 unnecessary+0.806
 such+0.798
 possible+0.734
rous+0.721
Neg logit contributions
board-0.929
 He-0.824
ops-0.778
s-0.742
 boy-0.737
Token
Token ID26391
Feature activation+1.65e+01

Pos logit contributions
 outer+1.106
 such+1.081
 unnecessary+1.031
rous+0.987
 possible+0.970
Neg logit contributions
board-1.172
 He-1.062
ops-1.010
 boy-0.980
s-0.979
Token
Token ID129
Feature activation+2.15e+01

Pos logit contributions
 such+0.453
 outer+0.445
 unnecessary+0.413
 our+0.412
rous+0.406
Neg logit contributions
board-0.394
 He-0.337
affe-0.320
der-0.318
 boy-0.313
Token
Token ID241
Feature activation+2.23e+01

Pos logit contributions
 outer+0.583
 unnecessary+0.571
 such+0.563
 possible+0.526
rous+0.518
Neg logit contributions
board-0.587
 He-0.528
ops-0.490
 was-0.467
affe-0.462
Tokenbut
Token ID4360
Feature activation+9.18e+00

Pos logit contributions
 outer+0.605
 such+0.594
 unnecessary+0.571
 possible+0.556
rous+0.534
Neg logit contributions
board-0.595
ops-0.493
 He-0.490
affe-0.476
s-0.453
Token don
Token ID836
Feature activation-5.29e+00

Pos logit contributions
 outer+0.235
 such+0.229
 unnecessary+0.227
 possible+0.210
rous+0.209
Neg logit contributions
board-0.261
 He-0.233
ops-0.220
affe-0.210
s-0.209
Token happy
Token ID3772
Feature activation+1.64e+01

Pos logit contributions
 He+0.338
 was+0.335
board+0.330
 wanted+0.305
 boy+0.304
Neg logit contributions
 outer-0.258
 unnecessary-0.254
rous-0.218
nam-0.215
 backup-0.215
Token to
Token ID284
Feature activation+2.58e+01

Pos logit contributions
 such+0.453
 outer+0.413
 possible+0.407
 any+0.399
 our+0.393
Neg logit contributions
board-0.448
ops-0.388
 He-0.375
affe-0.368
oc-0.362
Token tell
Token ID1560
Feature activation+1.82e+01

Pos logit contributions
 outer+0.605
 unnecessary+0.593
 such+0.577
rous+0.539
 possible+0.529
Neg logit contributions
board-0.790
 He-0.726
ops-0.673
 was-0.662
 boy-0.648
Token her
Token ID607
Feature activation+1.09e+01

Pos logit contributions
 outer+0.344
 unnecessary+0.333
 such+0.310
rous+0.301
 possible+0.292
Neg logit contributions
board-0.634
 He-0.590
 was-0.550
ops-0.545
 boy-0.538
Token
Token ID6184
Feature activation+3.09e+01

Pos logit contributions
 outer+0.282
 such+0.273
 unnecessary+0.272
rous+0.251
 possible+0.251
Neg logit contributions
board-0.310
 He-0.278
ops-0.260
 was-0.249
 boy-0.249
Token
Token ID95
Feature activation+4.27e+01

Pos logit contributions
 outer+0.866
 unnecessary+0.844
 such+0.841
rous+0.781
 possible+0.779
Neg logit contributions
board-0.810
 He-0.718
ops-0.668
 was-0.636
 boy-0.633
Token
Token ID26391
Feature activation+1.60e+01

Pos logit contributions
 outer+1.089
 such+1.070
 unnecessary+1.022
rous+0.965
 possible+0.961
Neg logit contributions
board-1.187
 He-1.070
ops-1.024
s-0.990
affe-0.986
Token
Token ID129
Feature activation+2.15e+01

Pos logit contributions
 such+0.446
 outer+0.424
 unnecessary+0.401
 our+0.394
 possible+0.393
Neg logit contributions
board-0.390
 He-0.321
affe-0.309
der-0.307
les-0.306
Token
Token ID241
Feature activation+2.26e+01

Pos logit contributions
 outer+0.583
 unnecessary+0.572
 such+0.559
 possible+0.525
rous+0.518
Neg logit contributions
board-0.587
 He-0.531
ops-0.493
 was-0.469
affe-0.463
TokenI
Token ID40
Feature activation+5.51e+00

Pos logit contributions
 outer+0.641
 such+0.639
 unnecessary+0.605
 possible+0.598
rous+0.568
Neg logit contributions
board-0.582
ops-0.475
 He-0.467
affe-0.452
s-0.438
Token'm
Token ID1101
Feature activation+1.85e-01

Pos logit contributions
 such+0.129
 outer+0.127
 possible+0.118
 an+0.115
 rid+0.114
Neg logit contributions
board-0.163
 He-0.137
ops-0.136
affe-0.136
boards-0.136
Token Tim
Token ID5045
Feature activation-2.59e+00

Pos logit contributions
 such+0.835
 outer+0.812
 unnecessary+0.780
 possible+0.777
 any+0.764
Neg logit contributions
board-0.680
ops-0.550
 He-0.536
ushed-0.513
affe-0.507
Tokenmy
Token ID1820
Feature activation-4.42e+00

Pos logit contributions
board+0.051
 Pole+0.048
ops+0.046
der+0.044
boards+0.043
Neg logit contributions
 such-0.076
 your-0.073
 able-0.072
 unnecessary-0.070
 an-0.070
Token took
Token ID1718
Feature activation+7.48e+00

Pos logit contributions
board+0.088
ops+0.073
 Pole+0.073
 He+0.073
boards+0.070
Neg logit contributions
 such-0.130
 able-0.128
 your-0.127
 our-0.124
 an-0.123
Token a
Token ID257
Feature activation+1.37e+01

Pos logit contributions
 outer+0.170
 possible+0.152
nam+0.152
 unnecessary+0.150
rous+0.147
Neg logit contributions
 He-0.229
board-0.220
 was-0.204
 boy-0.200
les-0.188
Token deep
Token ID2769
Feature activation+1.57e+01

Pos logit contributions
 such+0.403
 outer+0.381
 unnecessary+0.375
 possible+0.360
rous+0.356
Neg logit contributions
board-0.359
 He-0.303
 boy-0.303
s-0.290
 was-0.284
Token breath
Token ID8033
Feature activation+4.26e+01

Pos logit contributions
 outer+0.215
 such+0.207
 unnecessary+0.206
 possible+0.176
rous+0.171
Neg logit contributions
board-0.635
 He-0.597
 boy-0.556
ops-0.556
 was-0.554
Token and
Token ID290
Feature activation-8.60e+00

Pos logit contributions
 such+1.662
 outer+1.653
 unnecessary+1.584
 possible+1.568
 any+1.557
Neg logit contributions
board-0.602
 He-0.450
ops-0.419
 boy-0.387
affe-0.375
Token opened
Token ID4721
Feature activation+1.15e+01

Pos logit contributions
board+0.179
 He+0.151
ops+0.140
affe+0.131
 boy+0.129
Neg logit contributions
 outer-0.288
 such-0.280
 unnecessary-0.279
rous-0.263
 possible-0.262
Token the
Token ID262
Feature activation+2.48e+01

Pos logit contributions
 unnecessary+0.249
 outer+0.206
rous+0.197
 possible+0.196
nam+0.195
Neg logit contributions
 He-0.391
board-0.353
 was-0.347
s-0.339
 boy-0.330
Token rocket
Token ID10701
Feature activation+8.77e+00

Pos logit contributions
 outer+0.767
 such+0.744
 unnecessary+0.742
 possible+0.693
rous+0.692
Neg logit contributions
board-0.573
 He-0.502
ops-0.471
s-0.438
 was-0.438
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
 outer+0.285
 such+0.275
 possible+0.256
 unnecessary+0.252
 any+0.252
Neg logit contributions
board-0.186
 He-0.160
ops-0.158
affe-0.146
ushed-0.142
Token sign
Token ID1051
Feature activation-2.38e+00

Pos logit contributions
 outer+0.412
 such+0.395
 unnecessary+0.384
 possible+0.358
rous+0.356
Neg logit contributions
board-0.579
 He-0.524
ops-0.495
affe-0.476
 boy-0.475
Token '
Token ID705
Feature activation+2.39e+00

Pos logit contributions
board+0.072
 He+0.064
ops+0.061
affe+0.059
 boy+0.057
Neg logit contributions
 outer-0.057
 such-0.057
 unnecessary-0.054
 possible-0.051
 any-0.050
Tokenhappy
Token ID34191
Feature activation+1.53e+01

Pos logit contributions
 outer+0.059
 unnecessary+0.057
 such+0.056
 possible+0.052
rous+0.052
Neg logit contributions
board-0.070
 He-0.063
ops-0.060
s-0.058
 was-0.058
Token',
Token ID3256
Feature activation+9.44e+00

Pos logit contributions
 outer+0.404
 such+0.396
 unnecessary+0.386
 possible+0.362
rous+0.357
Neg logit contributions
board-0.416
 He-0.375
ops-0.351
affe-0.334
 boy-0.333
Token '
Token ID705
Feature activation+2.67e+00

Pos logit contributions
 such+0.314
 your+0.290
 our+0.288
 an+0.288
 any+0.285
Neg logit contributions
board-0.204
ops-0.183
oc-0.175
affe-0.173
jo-0.160
Tokenbirth
Token ID24280
Feature activation+4.25e+01

Pos logit contributions
 outer+0.069
 unnecessary+0.067
 such+0.066
 possible+0.061
rous+0.060
Neg logit contributions
board-0.076
 He-0.068
ops-0.064
 was-0.062
s-0.062
Tokenday
Token ID820
Feature activation+1.39e+01

Pos logit contributions
 unnecessary+0.972
 outer+0.950
 possible+0.872
nam+0.867
 such+0.852
Neg logit contributions
board-1.428
 He-1.219
 was-1.213
 boy-1.184
ops-1.180
Token',
Token ID3256
Feature activation+9.83e+00

Pos logit contributions
 outer+0.417
 unnecessary+0.408
 such+0.399
rous+0.380
 possible+0.373
Neg logit contributions
board-0.339
 He-0.298
ops-0.275
 was-0.268
 boy-0.264
Token and
Token ID290
Feature activation-1.26e+01

Pos logit contributions
 such+0.313
 outer+0.294
 your+0.291
 our+0.290
 these+0.285
Neg logit contributions
board-0.211
ops-0.185
affe-0.169
 He-0.164
oc-0.162
Token '
Token ID705
Feature activation+3.09e+00

Pos logit contributions
board+0.327
 He+0.290
ops+0.270
 was+0.257
 boy+0.256
Neg logit contributions
 outer-0.356
 such-0.345
 unnecessary-0.345
rous-0.321
 possible-0.320
Tokenyou
Token ID5832
Feature activation+6.23e+00

Pos logit contributions
 outer+0.081
 unnecessary+0.080
 such+0.077
 possible+0.072
rous+0.071
Neg logit contributions
board-0.086
 He-0.077
ops-0.072
s-0.070
 was-0.070
Token that
Token ID326
Feature activation+2.86e+00

Pos logit contributions
board+0.090
 He+0.085
oc+0.084
ops+0.084
affe+0.083
Neg logit contributions
 such-0.055
 outer-0.053
 any-0.049
 possible-0.048
 an-0.044
Token said
Token ID531
Feature activation-1.59e+01

Pos logit contributions
 such+0.098
 outer+0.098
 possible+0.092
 unnecessary+0.090
nam+0.090
Neg logit contributions
board-0.055
 He-0.046
affe-0.045
ops-0.043
oc-0.041
Token "
Token ID366
Feature activation-6.26e-01

Pos logit contributions
board+0.303
 He+0.257
ops+0.231
 was+0.215
 boy+0.215
Neg logit contributions
 outer-0.557
 such-0.543
 unnecessary-0.542
rous-0.512
 possible-0.511
TokenDO
Token ID18227
Feature activation+2.09e+01

Pos logit contributions
board+0.018
 He+0.016
ops+0.015
 was+0.015
 boy+0.015
Neg logit contributions
 outer-0.016
 such-0.015
 unnecessary-0.015
rous-0.014
 possible-0.014
Token NOT
Token ID5626
Feature activation+2.11e+01

Pos logit contributions
 outer+0.472
 unnecessary+0.449
 such+0.438
rous+0.420
 possible+0.407
Neg logit contributions
board-0.667
 He-0.607
ops-0.575
 was-0.552
affe-0.545
Token PRESS
Token ID44310
Feature activation+4.25e+01

Pos logit contributions
 possible+0.578
 unnecessary+0.556
 outer+0.552
 such+0.536
rous+0.515
Neg logit contributions
board-0.546
 He-0.519
s-0.450
ard-0.444
oc-0.444
Token."
Token ID526
Feature activation+6.66e+00

Pos logit contributions
 such+0.964
 outer+0.952
 possible+0.845
 unnecessary+0.841
 any+0.823
Neg logit contributions
board-1.209
 He-1.146
 boy-1.070
ops-1.055
s-1.034
Token She
Token ID1375
Feature activation-1.08e+01

Pos logit contributions
 such+0.197
 an+0.185
 any+0.181
 your+0.179
nam+0.178
Neg logit contributions
oc-0.133
affe-0.127
ops-0.123
board-0.121
oke-0.120
Token couldn
Token ID3521
Feature activation-3.40e+00

Pos logit contributions
board+0.245
 He+0.213
ops+0.204
affe+0.195
oke+0.190
Neg logit contributions
 such-0.307
 an-0.302
 outer-0.292
 able-0.287
 unnecessary-0.278
Token't
Token ID470
Feature activation-9.25e-01

Pos logit contributions
board+0.115
 He+0.104
ops+0.098
 boy+0.097
affe+0.096
Neg logit contributions
 outer-0.067
 such-0.066
 unnecessary-0.063
 possible-0.058
rous-0.057
Token resist
Token ID4180
Feature activation+6.62e+00

Pos logit contributions
board+0.034
 He+0.031
ops+0.030
s+0.029
 boy+0.029
Neg logit contributions
 outer-0.016
 such-0.015
 unnecessary-0.015
 possible-0.013
 any-0.013
Token speed
Token ID2866
Feature activation+6.85e+00

Pos logit contributions
 outer+0.322
 unnecessary+0.309
 such+0.307
rous+0.284
 possible+0.279
Neg logit contributions
board-0.410
 He-0.370
ops-0.347
 was-0.342
 boy-0.341
Token and
Token ID290
Feature activation-8.13e+00

Pos logit contributions
 such+0.213
 outer+0.210
 unnecessary+0.201
 possible+0.198
 any+0.196
Neg logit contributions
board-0.151
 He-0.131
ops-0.123
affe-0.121
oc-0.116
Token her
Token ID607
Feature activation+9.67e+00

Pos logit contributions
board+0.210
 He+0.182
ops+0.174
affe+0.166
s+0.161
Neg logit contributions
 outer-0.230
 such-0.226
 unnecessary-0.222
 possible-0.209
rous-0.205
Token peace
Token ID4167
Feature activation+6.94e+00

Pos logit contributions
 outer+0.269
 such+0.260
 unnecessary+0.259
 possible+0.244
rous+0.238
Neg logit contributions
board-0.252
 He-0.221
ops-0.208
affe-0.196
s-0.194
Token
Token ID6184
Feature activation+3.01e+01

Pos logit contributions
 such+0.223
 outer+0.216
 possible+0.212
 any+0.206
 unnecessary+0.202
Neg logit contributions
board-0.144
 He-0.127
ops-0.116
oc-0.115
affe-0.113
Token
Token ID95
Feature activation+4.23e+01

Pos logit contributions
 outer+0.802
 unnecessary+0.795
 such+0.786
rous+0.728
 possible+0.726
Neg logit contributions
board-0.827
 He-0.734
ops-0.693
 was-0.652
affe-0.646
Token
Token ID26391
Feature activation+1.53e+01

Pos logit contributions
 outer+1.047
 such+1.017
 unnecessary+0.994
rous+0.925
 possible+0.921
Neg logit contributions
board-1.231
 He-1.110
ops-1.049
s-1.012
 boy-1.011
Token
Token ID447
Feature activation+6.25e+00

Pos logit contributions
 such+0.433
 outer+0.428
 unnecessary+0.405
rous+0.394
 possible+0.393
Neg logit contributions
board-0.369
 He-0.311
 boy-0.285
affe-0.283
ops-0.275
Token
Token ID250
Feature activation+1.52e+01

Pos logit contributions
 outer+0.193
 such+0.190
 unnecessary+0.186
rous+0.180
 possible+0.173
Neg logit contributions
board-0.140
 He-0.120
ops-0.109
affe-0.107
 boy-0.104
Token she
Token ID673
Feature activation-1.04e+01

Pos logit contributions
 outer+0.417
 unnecessary+0.407
 such+0.396
rous+0.373
 possible+0.366
Neg logit contributions
board-0.397
 He-0.362
ops-0.332
 was-0.324
 boy-0.321
Token just
Token ID655
Feature activation-7.14e+00

Pos logit contributions
board+0.232
 He+0.193
ops+0.183
boards+0.171
affe+0.168
Neg logit contributions
 such-0.327
 outer-0.321
 unnecessary-0.319
 possible-0.304
rous-0.293
Tokenmy
Token ID1820
Feature activation-5.65e+00

Pos logit contributions
 such+0.307
 able+0.277
 your+0.273
 any+0.271
 an+0.268
Neg logit contributions
board-0.193
amboo-0.159
ops-0.154
 He-0.150
boards-0.148
Token bird
Token ID6512
Feature activation+4.08e+00

Pos logit contributions
board+0.118
amboo+0.095
ops+0.090
affe+0.089
boards+0.089
Neg logit contributions
 such-0.181
 able-0.169
 any-0.164
 an-0.164
 your-0.163
Token said
Token ID531
Feature activation-7.40e+00

Pos logit contributions
 such+0.135
 outer+0.128
 your+0.127
 our+0.125
 able+0.122
Neg logit contributions
board-0.086
boards-0.067
ops-0.065
 He-0.062
amboo-0.054
Token,
Token ID11
Feature activation+5.26e+00

Pos logit contributions
board+0.195
 He+0.168
ops+0.162
 boy+0.151
affe+0.151
Neg logit contributions
 outer-0.204
 such-0.204
 unnecessary-0.196
 possible-0.186
 any-0.184
Token
Token ID6184
Feature activation+3.08e+01

Pos logit contributions
 such+0.166
 outer+0.166
 unnecessary+0.158
 possible+0.153
 any+0.151
Neg logit contributions
board-0.116
ops-0.096
 He-0.090
affe-0.085
s-0.082
Token
Token ID95
Feature activation+4.23e+01

Pos logit contributions
 outer+0.857
 unnecessary+0.835
 such+0.834
 possible+0.772
rous+0.770
Neg logit contributions
board-0.814
 He-0.724
ops-0.674
 boy-0.640
 was-0.640
Token
Token ID26391
Feature activation+1.57e+01

Pos logit contributions
 outer+1.074
 such+1.049
 unnecessary+1.003
rous+0.956
 possible+0.946
Neg logit contributions
board-1.178
 He-1.057
ops-1.009
s-0.997
 boy-0.980
Token
Token ID129
Feature activation+2.09e+01

Pos logit contributions
 such+0.438
 outer+0.423
 unnecessary+0.405
 possible+0.390
rous+0.389
Neg logit contributions
board-0.379
 He-0.315
les-0.296
 boy-0.293
der-0.291
Token
Token ID241
Feature activation+2.18e+01

Pos logit contributions
 outer+0.556
 unnecessary+0.541
 such+0.537
 possible+0.498
rous+0.494
Neg logit contributions
board-0.580
 He-0.523
ops-0.487
 was-0.463
affe-0.462
TokenWait
Token ID21321
Feature activation+1.08e+01

Pos logit contributions
 outer+0.580
 such+0.569
 unnecessary+0.548
 possible+0.535
rous+0.510
Neg logit contributions
board-0.602
ops-0.498
 He-0.492
affe-0.478
s-0.461
Token until
Token ID1566
Feature activation-1.90e+00

Pos logit contributions
 outer+0.365
 unnecessary+0.360
rous+0.344
 such+0.344
nam+0.336
Neg logit contributions
board-0.218
 He-0.195
 was-0.172
ops-0.166
 boy-0.163
TokenB
Token ID33
Feature activation+1.49e+01

Pos logit contributions
 such+0.398
 our+0.384
 your+0.382
 possible+0.379
 outer+0.373
Neg logit contributions
board-0.196
ard-0.169
affe-0.166
ops-0.165
 He-0.156
Tokeneth
Token ID2788
Feature activation+9.64e+00

Pos logit contributions
 unnecessary+0.408
 such+0.398
 permission+0.380
 any+0.375
 possible+0.372
Neg logit contributions
board-0.353
boards-0.285
 says-0.267
les-0.261
 man-0.257
Token said
Token ID531
Feature activation-5.04e+00

Pos logit contributions
 such+0.290
 outer+0.271
 permission+0.266
 unnecessary+0.265
 any+0.265
Neg logit contributions
board-0.213
 He-0.181
boards-0.165
s-0.163
ops-0.161
Token,
Token ID11
Feature activation+6.43e+00

Pos logit contributions
board+0.091
 He+0.081
 was+0.069
ops+0.069
s+0.067
Neg logit contributions
 outer-0.181
 unnecessary-0.176
 such-0.174
rous-0.168
 possible-0.166
Token
Token ID6184
Feature activation+3.11e+01

Pos logit contributions
 outer+0.199
 such+0.196
 unnecessary+0.189
 possible+0.183
 our+0.181
Neg logit contributions
board-0.145
ops-0.122
 He-0.112
affe-0.106
ushed-0.104
Token
Token ID95
Feature activation+4.22e+01

Pos logit contributions
 outer+0.836
 such+0.802
 unnecessary+0.797
rous+0.744
 possible+0.737
Neg logit contributions
board-0.850
 He-0.757
ops-0.711
 was-0.686
 boy-0.682
Token
Token ID26391
Feature activation+1.59e+01

Pos logit contributions
 outer+1.148
 such+1.123
 unnecessary+1.047
rous+1.012
 possible+0.983
Neg logit contributions
board-1.063
 He-0.934
ops-0.930
s-0.920
 boy-0.899
Token
Token ID129
Feature activation+2.10e+01

Pos logit contributions
 such+0.424
 outer+0.416
 our+0.389
 unnecessary+0.389
 your+0.380
Neg logit contributions
board-0.394
der-0.320
les-0.317
 He-0.313
ly-0.308
Token
Token ID241
Feature activation+2.16e+01

Pos logit contributions
 outer+0.551
 unnecessary+0.537
 such+0.533
 possible+0.494
rous+0.491
Neg logit contributions
board-0.587
 He-0.530
ops-0.493
 was-0.470
affe-0.469
TokenLet
Token ID5756
Feature activation+9.82e+00

Pos logit contributions
 outer+0.619
 such+0.604
 unnecessary+0.580
 possible+0.574
rous+0.542
Neg logit contributions
board-0.561
ops-0.457
 He-0.437
affe-0.436
les-0.417
Tokenâ
Token ID22940
Feature activation+2.14e+01

Pos logit contributions
 such+0.329
 unnecessary+0.314
 outer+0.312
 our+0.300
 your+0.299
Neg logit contributions
board-0.202
 He-0.171
ops-0.163
ushed-0.152
affe-0.151
Token with
Token ID351
Feature activation+9.32e+00

Pos logit contributions
 such+0.459
 possible+0.428
 outer+0.407
 our+0.403
 your+0.395
Neg logit contributions
board-0.346
 He-0.324
affe-0.290
ops-0.277
 Maybe-0.272
Token a
Token ID257
Feature activation+1.21e+01

Pos logit contributions
 outer+0.166
 unnecessary+0.159
 such+0.156
rous+0.140
 possible+0.139
Neg logit contributions
board-0.336
 He-0.310
ops-0.294
 was-0.288
 boy-0.286
Token big
Token ID1263
Feature activation+6.97e+00

Pos logit contributions
 outer+0.260
 such+0.249
 unnecessary+0.248
 possible+0.226
rous+0.225
Neg logit contributions
board-0.392
 He-0.356
ops-0.339
s-0.323
 was-0.323
Token number
Token ID1271
Feature activation-6.21e-01

Pos logit contributions
 outer+0.184
 such+0.179
 unnecessary+0.168
rous+0.163
 possible+0.162
Neg logit contributions
board-0.184
 He-0.166
ops-0.163
s-0.155
affe-0.147
Token
Token ID6184
Feature activation+3.01e+01

Pos logit contributions
board+0.014
 He+0.012
ops+0.011
affe+0.010
 boy+0.010
Neg logit contributions
 such-0.019
 outer-0.019
 possible-0.018
 unnecessary-0.018
 any-0.018
Token
Token ID95
Feature activation+4.22e+01

Pos logit contributions
 unnecessary+0.802
 outer+0.790
 such+0.778
rous+0.720
 possible+0.717
Neg logit contributions
board-0.856
 He-0.744
ops-0.712
affe-0.670
 was-0.654
Token
Token ID26391
Feature activation+1.54e+01

Pos logit contributions
 outer+1.122
 such+1.093
 unnecessary+1.060
 possible+0.996
rous+0.994
Neg logit contributions
board-1.135
 He-1.019
ops-0.960
s-0.936
 boy-0.932
Token
Token ID129
Feature activation+2.14e+01

Pos logit contributions
 outer+0.431
 such+0.430
 unnecessary+0.400
 possible+0.389
rous+0.388
Neg logit contributions
board-0.367
 He-0.316
 boy-0.297
affe-0.286
der-0.284
Token
Token ID241
Feature activation+2.17e+01

Pos logit contributions
 outer+0.581
 unnecessary+0.578
 such+0.556
 possible+0.530
nam+0.525
Neg logit contributions
board-0.581
 He-0.530
ops-0.491
affe-0.459
 was-0.456
Token1
Token ID16
Feature activation+2.46e+01

Pos logit contributions
 outer+0.606
 such+0.595
 unnecessary+0.574
 possible+0.554
rous+0.536
Neg logit contributions
board-0.568
ops-0.468
 He-0.467
affe-0.443
s-0.435
Tokenâ
Token ID22940
Feature activation+2.15e+01

Pos logit contributions
 such+0.768
 outer+0.728
 unnecessary+0.707
 any+0.701
 your+0.692
Neg logit contributions
board-0.524
 He-0.447
 boy-0.416
ushed-0.385
affe-0.383
Token no
Token ID645
Feature activation+1.00e+01

Pos logit contributions
 unnecessary+0.278
 outer+0.264
 such+0.241
rous+0.230
nam+0.220
Neg logit contributions
board-0.634
 He-0.630
 was-0.592
 boy-0.571
ops-0.553
Token time
Token ID640
Feature activation+1.16e+01

Pos logit contributions
 outer+0.229
 unnecessary+0.220
 such+0.218
rous+0.205
nam+0.202
Neg logit contributions
board-0.304
 He-0.283
 was-0.261
 boy-0.260
ops-0.255
Token."
Token ID526
Feature activation+1.15e+01

Pos logit contributions
 outer+0.383
 such+0.375
 unnecessary+0.372
 possible+0.354
rous+0.349
Neg logit contributions
board-0.242
 He-0.204
ops-0.189
affe-0.177
 boy-0.177
Token
Token ID198
Feature activation+2.78e+00

Pos logit contributions
 such+0.315
 your+0.287
 outer+0.283
 an+0.283
 our+0.278
Neg logit contributions
board-0.301
ops-0.257
les-0.243
affe-0.241
actus-0.232
Token
Token ID198
Feature activation+2.42e+00

Pos logit contributions
 outer+0.087
 such+0.085
 unnecessary+0.084
rous+0.080
 possible+0.079
Neg logit contributions
board-0.063
ops-0.051
 He-0.050
affe-0.046
 boy-0.046
TokenEnc
Token ID27195
Feature activation+4.20e+01

Pos logit contributions
 outer+0.069
 such+0.066
 unnecessary+0.066
rous+0.062
 possible+0.062
Neg logit contributions
board-0.062
 He-0.054
ops-0.051
 boy-0.048
s-0.048
Tokenour
Token ID454
Feature activation-7.45e+00

Pos logit contributions
 outer+1.255
 such+1.225
 unnecessary+1.216
 possible+1.184
rous+1.166
Neg logit contributions
board-0.991
 He-0.847
 boy-0.766
ops-0.764
 was-0.762
Tokenaged
Token ID1886
Feature activation+5.79e-01

Pos logit contributions
board+0.175
 He+0.151
ops+0.138
 boy+0.134
affe+0.133
Neg logit contributions
 outer-0.225
 such-0.222
 unnecessary-0.218
 possible-0.207
rous-0.203
Token by
Token ID416
Feature activation+1.24e+01

Pos logit contributions
 outer+0.011
 such+0.011
 unnecessary+0.011
 possible+0.010
rous+0.010
Neg logit contributions
board-0.020
 He-0.018
ops-0.017
affe-0.017
 boy-0.016
Token the
Token ID262
Feature activation+2.32e+01

Pos logit contributions
 outer+0.282
 unnecessary+0.272
 such+0.270
rous+0.247
 possible+0.246
Neg logit contributions
board-0.391
 He-0.357
ops-0.334
 was-0.325
 boy-0.324
Token eagle
Token ID31176
Feature activation-1.55e+00

Pos logit contributions
 outer+0.651
 such+0.634
 unnecessary+0.630
rous+0.588
 possible+0.584
Neg logit contributions
board-0.604
 He-0.536
ops-0.495
 boy-0.480
 was-0.476
Token m
Token ID285
Feature activation+2.69e+00

Pos logit contributions
 outer+1.061
 unnecessary+1.026
 such+1.023
 possible+0.964
rous+0.955
Neg logit contributions
board-0.825
 He-0.724
ops-0.683
s-0.641
affe-0.621
Tokenama
Token ID1689
Feature activation+9.45e-01

Pos logit contributions
 such+0.072
 unnecessary+0.068
 permission+0.068
 outer+0.068
rous+0.066
Neg logit contributions
board-0.060
 He-0.054
 They-0.050
 Maybe-0.049
boards-0.049
Token said
Token ID531
Feature activation-6.87e+00

Pos logit contributions
 such+0.032
 outer+0.029
 your+0.029
 these+0.028
 able+0.028
Neg logit contributions
board-0.017
ops-0.014
boards-0.013
amboo-0.012
 He-0.012
Token,
Token ID11
Feature activation+4.87e+00

Pos logit contributions
board+0.149
 He+0.127
ops+0.118
 boy+0.110
affe+0.110
Neg logit contributions
 outer-0.221
 such-0.218
 unnecessary-0.215
 possible-0.202
rous-0.201
Token
Token ID6184
Feature activation+3.04e+01

Pos logit contributions
 outer+0.143
 such+0.142
 unnecessary+0.135
 possible+0.130
 any+0.128
Neg logit contributions
board-0.118
ops-0.099
 He-0.095
affe-0.090
s-0.087
Token
Token ID95
Feature activation+4.19e+01

Pos logit contributions
 outer+0.799
 such+0.767
 unnecessary+0.762
rous+0.712
 possible+0.705
Neg logit contributions
board-0.840
 He-0.747
ops-0.704
 was-0.688
 boy-0.684
Token
Token ID26391
Feature activation+1.53e+01

Pos logit contributions
 outer+1.120
 such+1.107
 unnecessary+1.046
rous+0.996
 possible+0.990
Neg logit contributions
board-1.068
 He-0.941
s-0.932
ops-0.923
 boy-0.906
Token
Token ID129
Feature activation+2.06e+01

Pos logit contributions
 such+0.426
 outer+0.410
 unnecessary+0.397
 possible+0.380
 our+0.376
Neg logit contributions
board-0.373
 He-0.307
 boy-0.292
les-0.292
affe-0.288
Token
Token ID241
Feature activation+2.16e+01

Pos logit contributions
 outer+0.566
 unnecessary+0.550
 such+0.546
 possible+0.508
rous+0.506
Neg logit contributions
board-0.551
 He-0.495
ops-0.460
 was-0.437
affe-0.434
TokenLet
Token ID5756
Feature activation+9.50e+00

Pos logit contributions
 outer+0.595
 such+0.589
 unnecessary+0.566
 possible+0.553
rous+0.527
Neg logit contributions
board-0.571
ops-0.469
 He-0.460
affe-0.452
s-0.431
Tokenâ
Token ID22940
Feature activation+2.12e+01

Pos logit contributions
 such+0.310
 unnecessary+0.294
 outer+0.290
 your+0.279
 our+0.279
Neg logit contributions
board-0.203
 He-0.174
ops-0.166
ushed-0.156
affe-0.155
TokenHis
Token ID6653
Feature activation+1.02e+01

Pos logit contributions
 outer+0.189
 such+0.187
 unnecessary+0.185
 permission+0.183
 any+0.178
Neg logit contributions
board-0.207
ops-0.196
 He-0.192
les-0.180
affe-0.179
Token mother
Token ID2802
Feature activation+8.50e+00

Pos logit contributions
 outer+0.266
 such+0.263
 unnecessary+0.257
 any+0.244
 possible+0.242
Neg logit contributions
board-0.285
ops-0.243
 He-0.227
boards-0.225
affe-0.215
Token said
Token ID531
Feature activation-6.09e+00

Pos logit contributions
 such+0.285
 any+0.260
 your+0.256
 outer+0.255
 able+0.255
Neg logit contributions
board-0.167
ops-0.133
amboo-0.132
 He-0.127
boards-0.122
Token,
Token ID11
Feature activation+5.63e+00

Pos logit contributions
board+0.143
 He+0.121
ops+0.117
affe+0.109
 boy+0.107
Neg logit contributions
 such-0.184
 outer-0.183
 unnecessary-0.177
 any-0.169
 possible-0.168
Token
Token ID6184
Feature activation+3.05e+01

Pos logit contributions
 such+0.168
 outer+0.166
 unnecessary+0.158
 possible+0.153
 any+0.152
Neg logit contributions
board-0.133
ops-0.113
 He-0.104
affe-0.101
les-0.097
Token
Token ID95
Feature activation+4.19e+01

Pos logit contributions
 outer+0.814
 such+0.777
 unnecessary+0.771
rous+0.725
 possible+0.719
Neg logit contributions
board-0.836
 He-0.744
ops-0.701
 was-0.680
 boy-0.675
Token
Token ID26391
Feature activation+1.55e+01

Pos logit contributions
 outer+1.049
 such+1.027
 unnecessary+0.952
rous+0.920
 possible+0.915
Neg logit contributions
board-1.131
 He-1.004
s-0.997
ops-0.996
 boy-0.974
Token
Token ID129
Feature activation+2.08e+01

Pos logit contributions
 such+0.412
 outer+0.394
 unnecessary+0.376
 possible+0.367
 our+0.367
Neg logit contributions
board-0.387
les-0.311
 He-0.309
der-0.306
affe-0.305
Token
Token ID241
Feature activation+2.16e+01

Pos logit contributions
 outer+0.554
 unnecessary+0.536
 such+0.536
rous+0.495
 possible+0.495
Neg logit contributions
board-0.570
 He-0.511
ops-0.477
 was-0.456
 boy-0.454
TokenThere
Token ID1858
Feature activation+1.83e+01

Pos logit contributions
 outer+0.572
 such+0.563
 unnecessary+0.542
 possible+0.528
rous+0.503
Neg logit contributions
board-0.595
ops-0.493
 He-0.487
affe-0.477
s-0.458
Token,
Token ID11
Feature activation+2.03e+00

Pos logit contributions
 such+0.632
 your+0.589
 our+0.583
 any+0.575
 anything+0.574
Neg logit contributions
board-0.355
affe-0.300
ushed-0.293
 He-0.289
oc-0.274
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
 such+0.259
 outer+0.250
 any+0.239
 your+0.238
 these+0.237
Neg logit contributions
board-0.189
 He-0.152
affe-0.149
ushed-0.143
ops-0.142
Token Jenn
Token ID9777
Feature activation+3.87e+00

Pos logit contributions
board+0.004
 He+0.003
ops+0.003
 was+0.003
s+0.003
Neg logit contributions
 outer-0.004
 unnecessary-0.004
 such-0.004
rous-0.004
 possible-0.003
Tokenie
Token ID494
Feature activation-5.51e+00

Pos logit contributions
 such+0.118
 outer+0.106
rous+0.105
 any+0.105
 your+0.103
Neg logit contributions
 He-0.089
board-0.088
 Maybe-0.072
 They-0.069
ops-0.068
Token followed
Token ID3940
Feature activation+4.09e+00

Pos logit contributions
board+0.124
 He+0.102
ops+0.095
affe+0.095
boards+0.092
Neg logit contributions
 such-0.174
 outer-0.161
 unnecessary-0.154
 possible-0.154
rous-0.152
Token the
Token ID262
Feature activation+2.18e+01

Pos logit contributions
 unnecessary+0.086
 outer+0.085
rous+0.082
nam+0.081
 possible+0.076
Neg logit contributions
 He-0.121
board-0.119
 and-0.113
 was-0.113
s-0.107
Token proverb
Token ID36950
Feature activation+4.19e+01

Pos logit contributions
 such+0.558
 outer+0.552
 unnecessary+0.541
rous+0.514
 possible+0.500
Neg logit contributions
board-0.636
 He-0.548
 boy-0.540
 was-0.518
ops-0.509
Token:
Token ID25
Feature activation+3.86e+00

Pos logit contributions
 such+1.196
 outer+1.112
 possible+1.086
 any+1.070
 anything+1.057
Neg logit contributions
board-0.979
oc-0.899
 He-0.842
ops-0.818
affe-0.803
Token "
Token ID366
Feature activation+1.04e+00

Pos logit contributions
 unnecessary+0.117
 outer+0.116
 such+0.113
rous+0.112
 possible+0.108
Neg logit contributions
board-0.088
 He-0.082
 was-0.072
ops-0.070
 boy-0.067
TokenDel
Token ID13856
Feature activation+2.62e+01

Pos logit contributions
 outer+0.026
 such+0.026
 unnecessary+0.025
 possible+0.024
rous+0.023
Neg logit contributions
board-0.030
 He-0.026
ops-0.025
 boy-0.024
affe-0.024
Tokenay
Token ID323
Feature activation-9.99e+00

Pos logit contributions
 such+0.654
 outer+0.645
 unnecessary+0.635
rous+0.596
 possible+0.583
Neg logit contributions
board-0.727
 He-0.684
 boy-0.635
s-0.615
 was-0.607
Token is
Token ID318
Feature activation-7.86e+00

Pos logit contributions
board+0.165
 He+0.149
 boy+0.122
affe+0.115
 Maybe+0.114
Neg logit contributions
 such-0.361
 outer-0.350
 our-0.331
 unnecessary-0.330
 these-0.328
Token
Token ID250
Feature activation+1.81e+01

Pos logit contributions
 outer+0.272
 such+0.267
 unnecessary+0.262
rous+0.256
 possible+0.244
Neg logit contributions
board-0.230
 He-0.193
ops-0.180
affe-0.180
 boy-0.177
Token a
Token ID257
Feature activation+1.29e+01

Pos logit contributions
 outer+0.480
 unnecessary+0.465
 such+0.464
rous+0.430
 possible+0.427
Neg logit contributions
board-0.494
 He-0.445
ops-0.413
 was-0.398
 boy-0.396
Token rare
Token ID4071
Feature activation+1.63e+01

Pos logit contributions
 outer+0.281
 unnecessary+0.270
 such+0.268
 possible+0.244
rous+0.243
Neg logit contributions
board-0.415
 He-0.378
ops-0.360
s-0.345
 was-0.343
Token silence
Token ID9550
Feature activation+6.03e-01

Pos logit contributions
 outer+0.443
 such+0.419
 unnecessary+0.408
 possible+0.400
 any+0.381
Neg logit contributions
board-0.424
 He-0.384
ops-0.381
s-0.351
boards-0.345
Token
Token ID6184
Feature activation+3.01e+01

Pos logit contributions
 outer+0.019
 such+0.018
 possible+0.018
 unnecessary+0.017
 our+0.017
Neg logit contributions
board-0.014
 He-0.012
ops-0.011
affe-0.010
ushed-0.010
Token
Token ID95
Feature activation+4.18e+01

Pos logit contributions
 unnecessary+0.807
 outer+0.805
 such+0.803
nam+0.748
rous+0.743
Neg logit contributions
board-0.817
 He-0.720
ops-0.676
affe-0.634
 was-0.631
Token
Token ID26391
Feature activation+1.48e+01

Pos logit contributions
 outer+1.054
 such+1.022
 unnecessary+1.002
rous+0.933
 possible+0.930
Neg logit contributions
board-1.192
 He-1.071
ops-1.016
s-0.985
 boy-0.983
Token
Token ID447
Feature activation+6.15e+00

Pos logit contributions
 outer+0.400
 such+0.392
 unnecessary+0.381
 possible+0.363
rous+0.362
Neg logit contributions
board-0.381
 He-0.331
 boy-0.303
ops-0.298
affe-0.297
Token
Token ID250
Feature activation+1.50e+01

Pos logit contributions
 outer+0.194
 such+0.189
 unnecessary+0.188
rous+0.179
 possible+0.175
Neg logit contributions
board-0.137
 He-0.117
ops-0.107
affe-0.103
 boy-0.103
Token that
Token ID326
Feature activation+7.55e+00

Pos logit contributions
 outer+0.401
 unnecessary+0.394
 such+0.387
rous+0.361
 possible+0.354
Neg logit contributions
board-0.398
 He-0.370
ops-0.332
 was-0.326
 boy-0.323
Token added
Token ID2087
Feature activation+6.02e-01

Pos logit contributions
 outer+0.227
 such+0.220
 unnecessary+0.220
 possible+0.205
rous+0.205
Neg logit contributions
board-0.182
 He-0.159
ops-0.149
affe-0.139
 boy-0.139
TokenL
Token ID43
Feature activation+1.42e+01

Pos logit contributions
 outer+0.244
 unnecessary+0.238
 such+0.237
rous+0.221
 possible+0.221
Neg logit contributions
board-0.201
 He-0.180
ops-0.163
 was-0.158
 boy-0.157
Tokenily
Token ID813
Feature activation+5.64e-01

Pos logit contributions
 such+0.371
 outer+0.362
 unnecessary+0.343
 possible+0.338
 permission+0.328
Neg logit contributions
board-0.362
 He-0.346
s-0.337
ops-0.315
 boy-0.302
Token took
Token ID1718
Feature activation+5.75e+00

Pos logit contributions
 such+0.016
 an+0.016
 your+0.015
 practice+0.015
 the+0.015
Neg logit contributions
board-0.011
 Pole-0.010
 He-0.009
amboo-0.009
ops-0.009
Token a
Token ID257
Feature activation+1.28e+01

Pos logit contributions
 outer+0.136
rous+0.120
 possible+0.119
shadow+0.117
nam+0.117
Neg logit contributions
 He-0.164
board-0.163
 was-0.152
 boy-0.151
 says-0.143
Token deep
Token ID2769
Feature activation+1.52e+01

Pos logit contributions
 such+0.335
 outer+0.318
 unnecessary+0.312
 possible+0.296
rous+0.295
Neg logit contributions
board-0.373
 He-0.330
 boy-0.329
s-0.317
 was-0.312
Token breath
Token ID8033
Feature activation+4.18e+01

Pos logit contributions
 outer+0.191
 unnecessary+0.188
 such+0.184
 possible+0.156
rous+0.148
Neg logit contributions
board-0.633
 He-0.599
 boy-0.560
 was-0.557
ops-0.553
Token and
Token ID290
Feature activation-9.72e+00

Pos logit contributions
 such+1.542
 outer+1.504
 any+1.453
 possible+1.437
 unnecessary+1.427
Neg logit contributions
board-0.666
 He-0.500
ops-0.490
 boy-0.473
affe-0.447
Token started
Token ID2067
Feature activation-1.72e+00

Pos logit contributions
board+0.197
 He+0.168
ops+0.153
affe+0.143
 boy+0.142
Neg logit contributions
 outer-0.330
 such-0.321
 unnecessary-0.320
rous-0.302
 possible-0.301
Token to
Token ID284
Feature activation+2.01e+01

Pos logit contributions
 He+0.047
 was+0.046
board+0.045
s+0.043
 and+0.042
Neg logit contributions
 unnecessary-0.043
 outer-0.042
rous-0.040
 ours-0.040
 such-0.038
Token climb
Token ID12080
Feature activation+7.71e+00

Pos logit contributions
 outer+0.496
 unnecessary+0.481
 such+0.480
rous+0.443
 possible+0.441
Neg logit contributions
board-0.590
 He-0.536
ops-0.499
 was-0.487
 boy-0.483
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
 outer+0.168
 unnecessary+0.163
 such+0.160
rous+0.149
 possible+0.145
Neg logit contributions
board-0.246
 He-0.228
ops-0.212
 was-0.209
s-0.207
Token did
Token ID750
Feature activation-8.58e+00

Pos logit contributions
board+0.326
ops+0.294
 Pole+0.260
 He+0.259
affe+0.258
Neg logit contributions
 such-0.365
 an-0.360
 rid-0.329
 outer-0.327
 any-0.324
Token not
Token ID407
Feature activation-3.77e+00

Pos logit contributions
 He+0.218
 was+0.215
board+0.204
s+0.188
 and+0.182
Neg logit contributions
 outer-0.234
rous-0.221
 unnecessary-0.217
 such-0.200
nam-0.198
Token care
Token ID1337
Feature activation+5.99e+00

Pos logit contributions
board+0.119
 He+0.104
ops+0.103
s+0.098
ushed+0.095
Neg logit contributions
 outer-0.085
 unnecessary-0.085
 such-0.084
 possible-0.079
 any-0.078
Token about
Token ID546
Feature activation+5.80e+00

Pos logit contributions
 outer+0.155
 such+0.154
 unnecessary+0.151
 possible+0.139
 any+0.138
Neg logit contributions
board-0.169
 He-0.146
ops-0.142
affe-0.135
 boy-0.135
Token bath
Token ID7837
Feature activation+3.96e+00

Pos logit contributions
 outer+0.126
 such+0.123
 unnecessary+0.120
 possible+0.110
rous+0.108
Neg logit contributions
board-0.190
 He-0.170
ops-0.163
affe-0.158
s-0.156
Tokenro
Token ID305
Feature activation-4.76e+01

Pos logit contributions
 such+0.122
 any+0.113
 unnecessary+0.109
 anything+0.109
 outer+0.106
Neg logit contributions
affe-0.084
board-0.084
 He-0.077
oc-0.076
jo-0.069
Tokenbes
Token ID12636
Feature activation-1.30e+01

Pos logit contributions
board+1.191
 He+1.062
affe+0.937
ops+0.921
 boy+0.921
Neg logit contributions
 such-1.351
 outer-1.349
 unnecessary-1.324
 possible-1.246
 any-1.233
Token.
Token ID13
Feature activation-6.42e+00

Pos logit contributions
board+0.273
 He+0.240
affe+0.216
oc+0.216
ushed+0.216
Neg logit contributions
 such-0.426
 any-0.401
 outer-0.395
 possible-0.392
 unnecessary-0.390
Token She
Token ID1375
Feature activation-1.36e+01

Pos logit contributions
board+0.128
ops+0.109
affe+0.098
les+0.096
oc+0.096
Neg logit contributions
 such-0.219
 outer-0.211
 unnecessary-0.208
 an-0.204
 your-0.204
Token cared
Token ID19951
Feature activation+7.39e-01

Pos logit contributions
board+0.318
ops+0.278
 Pole+0.274
affe+0.266
 He+0.262
Neg logit contributions
 such-0.368
 an-0.356
 any-0.327
 rid-0.326
 known-0.317
Token about
Token ID546
Feature activation+5.81e+00

Pos logit contributions
 outer+0.017
 such+0.017
 unnecessary+0.017
 possible+0.015
rous+0.015
Neg logit contributions
board-0.022
 He-0.020
ops-0.019
 boy-0.018
affe-0.018
Token "
Token ID366
Feature activation-2.99e+00

Pos logit contributions
board+0.143
ops+0.129
les+0.113
affe+0.106
y+0.105
Neg logit contributions
 such-0.195
 outer-0.193
 an-0.180
 unnecessary-0.177
 your-0.176
TokenI
Token ID40
Feature activation-1.29e+00

Pos logit contributions
board+0.080
 He+0.072
ops+0.067
 was+0.064
 boy+0.064
Neg logit contributions
 outer-0.081
 unnecessary-0.079
 such-0.079
rous-0.073
 possible-0.073
Token'm
Token ID1101
Feature activation-6.19e+00

Pos logit contributions
board+0.036
 He+0.031
ops+0.029
affe+0.028
boards+0.028
Neg logit contributions
 outer-0.034
 such-0.033
 unnecessary-0.033
 possible-0.031
rous-0.031
Token a
Token ID257
Feature activation+2.19e+00

Pos logit contributions
 He+0.168
 was+0.153
board+0.146
 said+0.146
 and+0.144
Neg logit contributions
 outer-0.146
 unnecessary-0.143
rous-0.137
nam-0.127
 backup-0.119
Token tight
Token ID5381
Feature activation+1.71e+00

Pos logit contributions
 outer+0.058
 such+0.057
 unnecessary+0.056
rous+0.052
 possible+0.052
Neg logit contributions
board-0.060
 He-0.054
ops-0.050
 boy-0.049
 was-0.048
Tokenro
Token ID305
Feature activation-4.72e+01

Pos logit contributions
 outer+0.050
 such+0.048
 unnecessary+0.048
 any+0.046
 possible+0.046
Neg logit contributions
board-0.040
 He-0.036
ops-0.034
ushed-0.032
boards-0.032
Tokenpe
Token ID431
Feature activation-3.37e+01

Pos logit contributions
board+0.899
 He+0.776
ops+0.670
 boy+0.640
s+0.637
Neg logit contributions
 outer-1.632
 such-1.604
 unnecessary-1.587
 possible-1.520
rous-1.498
Token walk
Token ID2513
Feature activation+5.32e+00

Pos logit contributions
board+0.687
 He+0.588
ops+0.531
 boy+0.498
 was+0.497
Neg logit contributions
 outer-1.135
 such-1.107
 unnecessary-1.104
 possible-1.042
rous-1.041
Tokener
Token ID263
Feature activation-2.31e+01

Pos logit contributions
 outer+0.139
 such+0.136
 unnecessary+0.133
 possible+0.125
rous+0.123
Neg logit contributions
board-0.148
 He-0.131
ops-0.125
affe-0.117
 was-0.116
Token.
Token ID13
Feature activation-6.35e+00

Pos logit contributions
board+0.544
 He+0.477
ops+0.439
s+0.411
 boy+0.410
Neg logit contributions
 outer-0.693
 such-0.687
 unnecessary-0.655
 possible-0.643
 any-0.627
Token Do
Token ID2141
Feature activation+1.02e+01

Pos logit contributions
board+0.161
ops+0.139
s+0.130
 He+0.126
y+0.125
Neg logit contributions
 outer-0.184
 such-0.181
 unnecessary-0.174
 possible-0.160
 your-0.159
Token walk
Token ID2513
Feature activation+5.63e+00

Pos logit contributions
board+0.432
 He+0.407
 was+0.374
ops+0.367
s+0.357
Neg logit contributions
 unnecessary-0.349
 outer-0.335
 such-0.321
rous-0.304
 possible-0.295
Token around
Token ID1088
Feature activation+2.75e+00

Pos logit contributions
 outer+0.131
 unnecessary+0.128
 such+0.126
rous+0.116
 possible+0.115
Neg logit contributions
board-0.173
 He-0.158
ops-0.147
 was-0.144
 boy-0.143
Token on
Token ID319
Feature activation+4.87e+00

Pos logit contributions
 unnecessary+0.056
 outer+0.056
 such+0.053
rous+0.050
 possible+0.049
Neg logit contributions
board-0.092
 He-0.086
 was-0.079
ops-0.079
 boy-0.078
Token a
Token ID257
Feature activation+2.59e+00

Pos logit contributions
 unnecessary+0.078
 outer+0.074
rous+0.069
 such+0.068
 possible+0.068
Neg logit contributions
board-0.183
 He-0.176
 was-0.164
 boy-0.160
ops-0.157
Token tight
Token ID5381
Feature activation+1.56e+00

Pos logit contributions
 such+0.064
 unnecessary+0.064
 outer+0.064
 possible+0.058
rous+0.058
Neg logit contributions
board-0.077
 He-0.068
 was-0.063
 boy-0.063
s-0.063
Tokenro
Token ID305
Feature activation-4.68e+01

Pos logit contributions
 outer+0.048
 such+0.045
 unnecessary+0.044
 any+0.042
 possible+0.042
Neg logit contributions
board-0.035
 He-0.031
ops-0.030
s-0.028
ushed-0.028
Tokenpe
Token ID431
Feature activation-3.34e+01

Pos logit contributions
board+0.912
 He+0.812
 boy+0.681
ops+0.680
s+0.670
Neg logit contributions
 outer-1.556
 such-1.529
 unnecessary-1.482
 possible-1.433
 any-1.425
Token,
Token ID11
Feature activation-3.45e+00

Pos logit contributions
board+0.735
 He+0.640
ops+0.581
 boy+0.551
 was+0.546
Neg logit contributions
 outer-1.068
 such-1.036
 unnecessary-1.032
 possible-0.973
rous-0.970
Token he
Token ID339
Feature activation-2.18e+01

Pos logit contributions
board+0.085
 He+0.078
 was+0.071
 boy+0.069
s+0.065
Neg logit contributions
 unnecessary-0.099
 outer-0.099
 such-0.095
rous-0.092
 possible-0.091
Token could
Token ID714
Feature activation-1.46e+01

Pos logit contributions
board+0.385
 He+0.320
ops+0.281
affe+0.267
 boy+0.249
Neg logit contributions
 outer-0.787
 such-0.773
 unnecessary-0.757
 any-0.727
rous-0.725
Token even
Token ID772
Feature activation-1.10e+00

Pos logit contributions
board+0.454
 He+0.426
 was+0.394
ops+0.387
 boy+0.377
Neg logit contributions
 unnecessary-0.335
 outer-0.330
 such-0.313
rous-0.296
 possible-0.285
Token brother
Token ID3956
Feature activation-7.44e+00

Pos logit contributions
 outer+0.060
 such+0.059
 unnecessary+0.059
 possible+0.055
rous+0.055
Neg logit contributions
board-0.033
 He-0.027
ops-0.025
affe-0.023
s-0.022
Token would
Token ID561
Feature activation-1.18e+01

Pos logit contributions
board+0.163
 He+0.137
ops+0.123
affe+0.121
ushed+0.119
Neg logit contributions
 such-0.241
 outer-0.226
 unnecessary-0.226
 any-0.221
rous-0.210
Token wear
Token ID5806
Feature activation+5.87e+00

Pos logit contributions
board+0.320
 He+0.301
 was+0.282
ops+0.272
s+0.266
Neg logit contributions
 unnecessary-0.310
 outer-0.299
 such-0.284
 possible-0.277
rous-0.273
Token their
Token ID511
Feature activation+5.74e+00

Pos logit contributions
 possible+0.133
 unnecessary+0.129
 outer+0.124
shadow+0.121
rous+0.117
Neg logit contributions
board-0.165
 was-0.164
s-0.160
 He-0.155
 boy-0.151
Token bath
Token ID7837
Feature activation+4.39e+00

Pos logit contributions
 outer+0.147
 unnecessary+0.145
 possible+0.140
 such+0.139
rous+0.131
Neg logit contributions
board-0.159
 He-0.141
 was-0.140
s-0.135
 boy-0.135
Tokenro
Token ID305
Feature activation-4.67e+01

Pos logit contributions
 such+0.143
 outer+0.137
 any+0.129
 rid+0.128
 unnecessary+0.127
Neg logit contributions
board-0.081
affe-0.079
 He-0.078
oc-0.068
oke-0.065
Tokenbes
Token ID12636
Feature activation-1.24e+01

Pos logit contributions
board+0.929
 He+0.761
ops+0.751
 was+0.715
s+0.691
Neg logit contributions
 outer-1.599
 unnecessary-1.568
 such-1.554
rous-1.473
 possible-1.470
Token together
Token ID1978
Feature activation-9.71e+00

Pos logit contributions
board+0.369
 He+0.331
ops+0.312
 was+0.301
s+0.300
Neg logit contributions
 outer-0.304
 unnecessary-0.294
 such-0.292
rous-0.269
 possible-0.268
Token and
Token ID290
Feature activation-1.69e+01

Pos logit contributions
board+0.216
 He+0.187
ops+0.172
 was+0.168
s+0.165
Neg logit contributions
 outer-0.309
 unnecessary-0.303
 such-0.298
rous-0.284
 possible-0.282
Token play
Token ID711
Feature activation-2.38e+00

Pos logit contributions
board+0.398
 He+0.351
ops+0.323
 was+0.316
 boy+0.310
Neg logit contributions
 outer-0.515
 unnecessary-0.502
 such-0.497
rous-0.470
 possible-0.467
Token games
Token ID1830
Feature activation-1.85e-02

Pos logit contributions
board+0.069
 was+0.062
 He+0.060
s+0.058
 boy+0.058
Neg logit contributions
 outer-0.060
 unnecessary-0.059
 possible-0.059
rous-0.054
 such-0.054
Token march
Token ID9960
Feature activation+4.78e+00

Pos logit contributions
 such+0.081
 unnecessary+0.075
 outer+0.072
 rid+0.071
rous+0.070
Neg logit contributions
board-0.074
 He-0.064
s-0.063
 was-0.062
 boy-0.062
Token."
Token ID526
Feature activation+4.12e+00

Pos logit contributions
 outer+0.140
 unnecessary+0.139
 such+0.135
rous+0.128
 possible+0.126
Neg logit contributions
board-0.120
 He-0.104
ops-0.095
 was-0.094
 boy-0.094
Token
Token ID198
Feature activation-3.21e+00

Pos logit contributions
 such+0.103
 an+0.095
 our+0.092
 unnecessary+0.089
 your+0.088
Neg logit contributions
board-0.121
ops-0.099
ushed-0.097
oke-0.096
les-0.095
Token
Token ID198
Feature activation-3.22e+00

Pos logit contributions
board+0.076
ops+0.060
 He+0.060
 boy+0.054
boards+0.054
Neg logit contributions
 outer-0.098
 such-0.097
 unnecessary-0.093
rous-0.090
 possible-0.089
TokenPed
Token ID43468
Feature activation+2.81e+01

Pos logit contributions
board+0.108
 He+0.095
ops+0.092
 boy+0.087
 was+0.086
Neg logit contributions
 outer-0.067
 such-0.065
 unnecessary-0.065
rous-0.060
 possible-0.059
Tokenro
Token ID305
Feature activation-4.66e+01

Pos logit contributions
 outer+0.822
 such+0.753
 unnecessary+0.729
rous+0.723
 backup+0.709
Neg logit contributions
board-0.595
 was-0.531
 boy-0.530
ops-0.530
s-0.530
Token said
Token ID531
Feature activation-1.74e+01

Pos logit contributions
board+1.187
 He+1.081
affe+0.923
ops+0.922
 boy+0.921
Neg logit contributions
 such-1.263
 outer-1.253
 unnecessary-1.190
 possible-1.162
rous-1.148
Token,
Token ID11
Feature activation-3.41e+00

Pos logit contributions
board+0.234
 He+0.212
 was+0.180
s+0.176
 boy+0.167
Neg logit contributions
 outer-0.720
 unnecessary-0.704
 such-0.676
rous-0.670
 possible-0.651
Token "
Token ID366
Feature activation-2.76e+00

Pos logit contributions
board+0.081
ops+0.067
 He+0.060
affe+0.057
oc+0.057
Neg logit contributions
 such-0.104
 outer-0.102
 unnecessary-0.098
 our-0.095
 possible-0.094
TokenOkay
Token ID16454
Feature activation+4.27e+00

Pos logit contributions
board+0.076
 He+0.065
ops+0.063
affe+0.057
boards+0.057
Neg logit contributions
 outer-0.074
 such-0.073
 unnecessary-0.072
rous-0.069
 possible-0.068
Token,
Token ID11
Feature activation-3.42e+00

Pos logit contributions
 such+0.142
 outer+0.140
 unnecessary+0.139
 possible+0.133
 our+0.132
Neg logit contributions
board-0.089
 He-0.073
ops-0.069
boards-0.062
s-0.062
Token happy
Token ID3772
Feature activation+3.64e+00

Pos logit contributions
 outer+0.187
 such+0.180
 unnecessary+0.171
 possible+0.164
rous+0.164
Neg logit contributions
board-0.325
ops-0.284
 He-0.278
affe-0.269
boards-0.260
Token!
Token ID0
Feature activation+1.31e+00

Pos logit contributions
 such+0.103
 outer+0.091
 any+0.089
 your+0.087
 our+0.087
Neg logit contributions
board-0.098
ops-0.084
affe-0.083
ushed-0.080
 He-0.079
Token
Token ID198
Feature activation-2.98e+00

Pos logit contributions
 such+0.049
 your+0.046
 these+0.045
 an+0.045
 outer+0.045
Neg logit contributions
board-0.018
ops-0.015
ushed-0.013
les-0.013
y-0.012
Token
Token ID198
Feature activation-2.99e+00

Pos logit contributions
board+0.067
 He+0.053
ops+0.053
affe+0.050
 boy+0.048
Neg logit contributions
 outer-0.093
 such-0.092
 unnecessary-0.089
rous-0.085
 possible-0.085
TokenF
Token ID37
Feature activation+1.07e+01

Pos logit contributions
board+0.087
 He+0.079
ops+0.073
 was+0.071
s+0.071
Neg logit contributions
 outer-0.076
 unnecessary-0.073
 such-0.073
 possible-0.067
rous-0.067
Tokenro
Token ID305
Feature activation-4.65e+01

Pos logit contributions
 outer+0.355
 unnecessary+0.353
 such+0.345
rous+0.332
 possible+0.324
Neg logit contributions
board-0.206
 He-0.169
ops-0.153
 boy-0.152
boards-0.152
Tokenggy
Token ID19970
Feature activation-4.85e+00

Pos logit contributions
board+1.377
 He+1.270
 boy+1.116
s+1.113
affe+1.108
Neg logit contributions
 such-1.071
 outer-1.052
 unnecessary-0.990
 possible-0.963
rous-0.946
Token's
Token ID338
Feature activation+1.46e+00

Pos logit contributions
board+0.093
 He+0.076
ops+0.075
iness+0.075
affe+0.073
Neg logit contributions
 such-0.151
 an-0.144
 our-0.137
 the-0.137
 your-0.137
Token friends
Token ID2460
Feature activation-4.85e+00

Pos logit contributions
 outer+0.039
 such+0.038
 unnecessary+0.038
rous+0.035
 possible+0.035
Neg logit contributions
board-0.039
 He-0.035
ops-0.033
s-0.031
affe-0.031
Token asked
Token ID1965
Feature activation-2.00e+00

Pos logit contributions
board+0.105
 He+0.089
ops+0.086
affe+0.085
oc+0.083
Neg logit contributions
 such-0.153
 outer-0.142
 any-0.139
 an-0.136
 these-0.136
Token him
Token ID683
Feature activation-1.13e+01

Pos logit contributions
board+0.062
 He+0.055
ops+0.053
affe+0.051
 boy+0.051
Neg logit contributions
 outer-0.046
 such-0.045
 unnecessary-0.045
 possible-0.040
rous-0.040
Token singing
Token ID13777
Feature activation+4.65e+00

Pos logit contributions
board+0.206
 He+0.200
 was+0.198
s+0.193
y+0.184
Neg logit contributions
 outer-0.146
 unnecessary-0.145
 such-0.132
rous-0.130
nam-0.128
Token and
Token ID290
Feature activation-1.67e+01

Pos logit contributions
 outer+0.150
 such+0.147
 unnecessary+0.146
 possible+0.137
rous+0.137
Neg logit contributions
board-0.101
 He-0.087
ops-0.080
 boy-0.075
affe-0.075
Token the
Token ID262
Feature activation+1.86e+01

Pos logit contributions
board+0.526
 He+0.497
 was+0.477
 boy+0.462
s+0.454
Neg logit contributions
 unnecessary-0.352
 outer-0.348
 such-0.332
rous-0.313
 possible-0.300
Token flag
Token ID6056
Feature activation-3.91e+00

Pos logit contributions
 outer+0.514
 unnecessary+0.496
 such+0.496
 possible+0.463
rous+0.461
Neg logit contributions
board-0.494
 He-0.442
ops-0.414
s-0.392
affe-0.386
Token unf
Token ID3684
Feature activation+9.65e+00

Pos logit contributions
board+0.091
 He+0.078
ops+0.071
affe+0.070
oc+0.068
Neg logit contributions
 such-0.118
 outer-0.116
 any-0.110
 possible-0.110
 able-0.105
Tokenro
Token ID305
Feature activation-4.65e+01

Pos logit contributions
 outer+0.275
 unnecessary+0.266
 such+0.261
 possible+0.242
 backup+0.241
Neg logit contributions
board-0.239
 He-0.208
ops-0.203
 boy-0.201
 was-0.200
Tokenze
Token ID2736
Feature activation-8.54e+00

Pos logit contributions
board+1.041
 He+0.904
ops+0.833
 was+0.789
 boy+0.784
Neg logit contributions
 outer-1.467
 unnecessary-1.426
 such-1.426
rous-1.338
 possible-1.332
Token.
Token ID13
Feature activation-6.12e+00

Pos logit contributions
board+0.196
 He+0.171
ops+0.163
affe+0.157
oc+0.154
Neg logit contributions
 such-0.258
 outer-0.250
 unnecessary-0.240
 any-0.237
 possible-0.227
Token
Token ID198
Feature activation-3.43e+00

Pos logit contributions
board+0.150
ops+0.124
affe+0.116
 He+0.116
s+0.112
Neg logit contributions
 outer-0.182
 such-0.182
 unnecessary-0.173
rous-0.163
 possible-0.160
Token
Token ID198
Feature activation-3.49e+00

Pos logit contributions
board+0.064
 He+0.056
ops+0.049
 was+0.047
s+0.046
Neg logit contributions
 outer-0.122
 unnecessary-0.119
 such-0.119
rous-0.112
 possible-0.112
TokenThe
Token ID464
Feature activation+1.98e+01

Pos logit contributions
board+0.100
 He+0.095
 was+0.088
ops+0.085
s+0.084
Neg logit contributions
 unnecessary-0.087
 outer-0.086
 such-0.084
 any-0.078
 possible-0.077
Tokenucky
Token ID5309
Feature activation-1.23e+01

Pos logit contributions
board+0.001
jo+0.001
 Pole+0.001
cases+0.001
boards+0.001
Neg logit contributions
 unnecessary-0.001
 such-0.001
 rid-0.001
 any-0.001
 your-0.001
Token.
Token ID13
Feature activation-4.54e+00

Pos logit contributions
board+0.254
 He+0.198
ops+0.198
affe+0.182
boards+0.180
Neg logit contributions
 such-0.405
 outer-0.395
 unnecessary-0.376
 any-0.375
 able-0.366
Token
Token ID198
Feature activation-2.51e+00

Pos logit contributions
board+0.127
ops+0.104
jo+0.101
affe+0.100
les+0.099
Neg logit contributions
 such-0.115
 unnecessary-0.103
 outer-0.101
 your-0.101
 an-0.100
Token
Token ID198
Feature activation-2.47e+00

Pos logit contributions
board+0.055
 He+0.045
ops+0.042
 boy+0.041
affe+0.040
Neg logit contributions
 outer-0.079
 such-0.078
 unnecessary-0.077
rous-0.073
 possible-0.072
TokenF
Token ID37
Feature activation+1.08e+01

Pos logit contributions
board+0.070
 He+0.063
ops+0.059
s+0.057
 was+0.057
Neg logit contributions
 outer-0.066
 unnecessary-0.063
 such-0.063
 possible-0.058
rous-0.058
Tokenro
Token ID305
Feature activation-4.62e+01

Pos logit contributions
 unnecessary+0.317
 outer+0.306
 such+0.298
rous+0.293
 possible+0.282
Neg logit contributions
board-0.252
 He-0.214
boards-0.203
ops-0.197
 boy-0.194
Tokenggy
Token ID19970
Feature activation-4.19e+00

Pos logit contributions
board+1.376
 He+1.261
ops+1.123
 boy+1.115
s+1.114
Neg logit contributions
 such-1.067
 outer-1.050
 unnecessary-1.010
 possible-0.956
rous-0.948
Token looked
Token ID3114
Feature activation-1.19e+01

Pos logit contributions
board+0.092
 He+0.076
ops+0.073
boards+0.069
affe+0.069
Neg logit contributions
 such-0.123
 an-0.113
 our-0.112
 outer-0.111
 these-0.111
Token at
Token ID379
Feature activation+5.60e+00

Pos logit contributions
board+0.314
 He+0.313
 was+0.288
s+0.287
 boy+0.277
Neg logit contributions
 outer-0.317
 unnecessary-0.306
 possible-0.291
 such-0.291
 backup-0.275
Token the
Token ID262
Feature activation+1.95e+01

Pos logit contributions
 outer+0.092
 such+0.091
 unnecessary+0.088
rous+0.078
 any+0.076
Neg logit contributions
board-0.214
 He-0.192
ops-0.187
affe-0.181
 boy-0.178
Token ball
Token ID2613
Feature activation-1.07e+01

Pos logit contributions
 such+0.603
 outer+0.594
 unnecessary+0.583
 possible+0.558
rous+0.547
Neg logit contributions
board-0.434
 He-0.368
ops-0.347
oc-0.326
jo-0.325
Token it
Token ID340
Feature activation+1.02e+00

Pos logit contributions
 unnecessary+0.016
 outer+0.015
 possible+0.015
rous+0.015
nam+0.014
Neg logit contributions
board-0.021
 was-0.020
 He-0.020
s-0.020
ops-0.019
Token?"
Token ID1701
Feature activation+8.06e+00

Pos logit contributions
 outer+0.030
 unnecessary+0.030
 such+0.028
rous+0.027
nam+0.027
Neg logit contributions
board-0.025
 He-0.023
 was-0.021
ops-0.020
s-0.020
Token
Token ID198
Feature activation-2.84e+00

Pos logit contributions
 such+0.196
 an+0.175
 these+0.174
 outer+0.169
 your+0.169
Neg logit contributions
board-0.232
ushed-0.195
ops-0.192
y-0.188
affe-0.188
Token
Token ID198
Feature activation-2.77e+00

Pos logit contributions
board+0.054
 He+0.046
ops+0.041
 was+0.038
s+0.038
Neg logit contributions
 outer-0.100
 unnecessary-0.098
 such-0.098
rous-0.092
 possible-0.092
TokenF
Token ID37
Feature activation+1.09e+01

Pos logit contributions
board+0.077
 He+0.071
ops+0.066
s+0.065
 was+0.064
Neg logit contributions
 outer-0.076
 unnecessary-0.073
 such-0.071
 possible-0.069
rous-0.066
Tokenro
Token ID305
Feature activation-4.62e+01

Pos logit contributions
 outer+0.362
 unnecessary+0.361
 such+0.351
rous+0.342
 possible+0.329
Neg logit contributions
board-0.205
 He-0.169
 boy-0.155
boards-0.152
ops-0.149
Tokenggy
Token ID19970
Feature activation-4.58e+00

Pos logit contributions
board+1.394
 He+1.284
s+1.143
 boy+1.141
affe+1.129
Neg logit contributions
 such-1.024
 outer-0.985
 unnecessary-0.924
 possible-0.918
rous-0.902
Token said
Token ID531
Feature activation-1.70e+01

Pos logit contributions
board+0.085
iness+0.072
ops+0.071
amboo+0.068
boards+0.066
Neg logit contributions
 an-0.140
 such-0.137
 your-0.127
 the-0.127
 our-0.126
Token,
Token ID11
Feature activation-2.51e+00

Pos logit contributions
board+0.270
 He+0.226
ops+0.190
 was+0.184
s+0.181
Neg logit contributions
 outer-0.659
 unnecessary-0.645
 such-0.636
rous-0.611
 possible-0.605
Token "
Token ID366
Feature activation-1.88e+00

Pos logit contributions
board+0.059
ops+0.050
 He+0.045
affe+0.045
oc+0.043
Neg logit contributions
 such-0.074
 outer-0.074
 unnecessary-0.068
 any-0.068
 an-0.067
TokenI
Token ID40
Feature activation-2.07e-01

Pos logit contributions
board+0.046
 He+0.039
ops+0.038
affe+0.036
 boy+0.036
Neg logit contributions
 outer-0.055
 such-0.055
 unnecessary-0.052
 possible-0.050
rous-0.050
Token man
Token ID582
Feature activation-1.98e+01

Pos logit contributions
 outer+0.560
 such+0.543
 unnecessary+0.542
rous+0.504
 possible+0.504
Neg logit contributions
board-0.503
 He-0.446
ops-0.415
s-0.394
 was-0.393
Token balancing
Token ID22486
Feature activation+1.48e+01

Pos logit contributions
board+0.492
 He+0.418
ops+0.390
affe+0.383
boards+0.367
Neg logit contributions
 such-0.583
 outer-0.581
 unnecessary-0.538
 any-0.528
 possible-0.521
Token on
Token ID319
Feature activation+6.12e+00

Pos logit contributions
 outer+0.317
 unnecessary+0.310
 such+0.300
rous+0.282
 possible+0.273
Neg logit contributions
board-0.481
 He-0.440
ops-0.415
 was-0.408
s-0.401
Token the
Token ID262
Feature activation+1.92e+01

Pos logit contributions
 outer+0.090
 unnecessary+0.087
 such+0.084
rous+0.075
 possible+0.074
Neg logit contributions
board-0.239
 He-0.223
ops-0.211
 was-0.207
 boy-0.206
Token tight
Token ID5381
Feature activation+2.39e+00

Pos logit contributions
 outer+0.589
 such+0.577
 unnecessary+0.577
rous+0.544
 possible+0.537
Neg logit contributions
board-0.457
 He-0.389
ops-0.362
 boy-0.356
 was-0.351
Tokenro
Token ID305
Feature activation-4.60e+01

Pos logit contributions
 outer+0.076
 such+0.073
 unnecessary+0.070
 any+0.068
 possible+0.068
Neg logit contributions
board-0.051
 He-0.046
ops-0.042
ushed-0.040
s-0.040
Tokenpe
Token ID431
Feature activation-3.25e+01

Pos logit contributions
board+0.932
 He+0.816
 boy+0.697
ops+0.691
affe+0.670
Neg logit contributions
 outer-1.522
 such-1.502
 unnecessary-1.466
 possible-1.410
 any-1.385
Token.
Token ID13
Feature activation-4.95e+00

Pos logit contributions
board+0.787
 He+0.691
ops+0.626
 boy+0.606
affe+0.595
Neg logit contributions
 outer-0.969
 such-0.941
 unnecessary-0.935
 possible-0.882
rous-0.870
Token<|endoftext|>
Token ID50256
Feature activation+2.86e+01

Pos logit contributions
board+0.125
ops+0.110
y+0.099
affe+0.097
ard+0.095
Neg logit contributions
 such-0.144
 outer-0.139
 your-0.133
 an-0.129
 unnecessary-0.129
TokenOnce
Token ID7454
Feature activation+2.75e+01

Pos logit contributions
 such+0.804
 outer+0.767
 an+0.684
 possible+0.665
 any+0.663
Neg logit contributions
board-0.732
affe-0.634
ops-0.611
ushed-0.592
der-0.578
Token upon
Token ID2402
Feature activation+9.16e+00

Pos logit contributions
 outer+0.626
 such+0.598
 unnecessary+0.581
 possible+0.561
rous+0.552
Neg logit contributions
board-0.860
 He-0.748
ops-0.731
affe-0.705
oc-0.678
Token Lily
Token ID20037
Feature activation-1.29e+01

Pos logit contributions
 such+0.055
 outer+0.053
 unnecessary+0.050
 possible+0.050
 any+0.049
Neg logit contributions
board-0.058
 He-0.049
ops-0.049
affe-0.048
oc-0.046
Token had
Token ID550
Feature activation-1.20e+01

Pos logit contributions
board+0.276
 Pole+0.239
boards+0.222
affe+0.211
 He+0.207
Neg logit contributions
 such-0.414
 able-0.374
 any-0.369
 your-0.360
 an-0.354
Token so
Token ID523
Feature activation+1.03e+01

Pos logit contributions
board+0.391
 He+0.366
 was+0.355
s+0.338
ops+0.337
Neg logit contributions
 outer-0.248
 unnecessary-0.240
rous-0.220
 such-0.219
 possible-0.207
Token many
Token ID867
Feature activation+8.94e+00

Pos logit contributions
 outer+0.103
 such+0.099
 unnecessary+0.096
 possible+0.086
rous+0.084
Neg logit contributions
board-0.459
ops-0.404
 He-0.398
affe-0.394
ard-0.387
Token bath
Token ID7837
Feature activation+5.13e+00

Pos logit contributions
 outer+0.198
 such+0.196
 unnecessary+0.191
rous+0.172
 any+0.170
Neg logit contributions
board-0.290
 He-0.260
ops-0.247
affe-0.242
 boy-0.234
Tokenro
Token ID305
Feature activation-4.60e+01

Pos logit contributions
 such+0.162
 any+0.144
 outer+0.144
 rid+0.142
 possible+0.140
Neg logit contributions
board-0.107
affe-0.106
 He-0.099
oc-0.097
 boy-0.086
Tokenbes
Token ID12636
Feature activation-1.18e+01

Pos logit contributions
board+1.120
 He+1.002
ops+0.877
affe+0.875
 boy+0.874
Neg logit contributions
 outer-1.349
 such-1.328
 unnecessary-1.300
 possible-1.235
rous-1.221
Token,
Token ID11
Feature activation-2.35e+00

Pos logit contributions
board+0.328
 He+0.297
affe+0.278
ops+0.277
oc+0.275
Neg logit contributions
 such-0.306
 outer-0.293
 possible-0.279
 unnecessary-0.276
 any-0.274
Token but
Token ID475
Feature activation-7.45e+00

Pos logit contributions
board+0.057
ops+0.048
 He+0.047
affe+0.045
ushed+0.043
Neg logit contributions
 such-0.072
 outer-0.069
 unnecessary-0.067
 any-0.065
 possible-0.064
Token he
Token ID339
Feature activation-2.08e+01

Pos logit contributions
board+0.188
 He+0.160
ops+0.154
affe+0.147
s+0.142
Neg logit contributions
 outer-0.217
 such-0.216
 unnecessary-0.208
 possible-0.200
rous-0.196
Token loved
Token ID6151
Feature activation+7.03e-01

Pos logit contributions
board+0.489
 He+0.394
 Pole+0.393
boards+0.391
ops+0.375
Neg logit contributions
 such-0.626
 an-0.569
 able-0.562
rous-0.562
 any-0.561
Token today
Token ID1909
Feature activation+6.61e+00

Pos logit contributions
board+0.075
 He+0.069
 was+0.065
ops+0.064
 boy+0.063
Neg logit contributions
 outer-0.052
 unnecessary-0.050
 such-0.048
rous-0.046
 possible-0.045
Token."
Token ID526
Feature activation+5.54e+00

Pos logit contributions
 outer+0.185
 such+0.184
 unnecessary+0.175
 possible+0.169
 any+0.165
Neg logit contributions
board-0.171
 He-0.150
ops-0.142
affe-0.136
 boy-0.131
Token
Token ID198
Feature activation-2.27e+00

Pos logit contributions
 such+0.135
 your+0.120
 our+0.116
 an+0.114
 outer+0.109
Neg logit contributions
board-0.166
boards-0.138
les-0.136
ops-0.135
y-0.132
Token
Token ID198
Feature activation-2.31e+00

Pos logit contributions
board+0.057
 He+0.048
ops+0.046
 boy+0.043
affe+0.043
Neg logit contributions
 outer-0.066
 such-0.065
 unnecessary-0.064
rous-0.060
 possible-0.060
TokenF
Token ID37
Feature activation+1.14e+01

Pos logit contributions
board+0.067
 He+0.061
ops+0.057
 was+0.055
s+0.055
Neg logit contributions
 outer-0.059
 unnecessary-0.057
 such-0.056
rous-0.052
 possible-0.052
Tokenro
Token ID305
Feature activation-4.59e+01

Pos logit contributions
 outer+0.297
 unnecessary+0.291
rous+0.282
 such+0.280
 possible+0.265
Neg logit contributions
board-0.302
 He-0.266
ops-0.251
boards-0.251
s-0.247
Tokenggy
Token ID19970
Feature activation-4.51e+00

Pos logit contributions
board+1.296
 He+1.177
s+1.037
ops+1.035
 boy+1.035
Neg logit contributions
 such-1.134
 outer-1.106
 unnecessary-1.060
 possible-1.023
rous-1.010
Token smiled
Token ID13541
Feature activation-1.95e+00

Pos logit contributions
board+0.084
 He+0.066
ops+0.066
boards+0.066
affe+0.061
Neg logit contributions
 such-0.145
 an-0.136
 our-0.132
 your-0.130
 these-0.129
Token and
Token ID290
Feature activation-1.59e+01

Pos logit contributions
board+0.035
ops+0.029
affe+0.023
boards+0.022
 He+0.022
Neg logit contributions
 such-0.073
 outer-0.071
 any-0.067
 unnecessary-0.067
 an-0.066
Token said
Token ID531
Feature activation-1.71e+01

Pos logit contributions
board+0.257
ops+0.180
 He+0.174
affe+0.160
boards+0.157
Neg logit contributions
 such-0.610
 outer-0.599
 unnecessary-0.577
 our-0.563
 possible-0.562
Token,
Token ID11
Feature activation-2.39e+00

Pos logit contributions
board+0.256
 He+0.222
 was+0.179
ops+0.178
s+0.175
Neg logit contributions
 outer-0.673
 unnecessary-0.659
 such-0.646
rous-0.632
 possible-0.615
Tokenggy
Token ID19970
Feature activation-4.08e+00

Pos logit contributions
 such+0.030
 unnecessary+0.027
 free+0.026
 rid+0.025
 able+0.025
Neg logit contributions
board-0.075
 He-0.066
boards-0.065
les-0.062
ops-0.062
Token.
Token ID13
Feature activation-4.38e+00

Pos logit contributions
board+0.086
 He+0.068
ops+0.065
boards+0.061
 boy+0.061
Neg logit contributions
 such-0.132
 outer-0.128
 any-0.121
 unnecessary-0.121
 able-0.120
Token
Token ID198
Feature activation-1.92e+00

Pos logit contributions
board+0.120
ops+0.101
jo+0.093
les+0.093
affe+0.091
Neg logit contributions
 such-0.116
 our-0.107
 outer-0.106
 your-0.106
 these-0.103
Token
Token ID198
Feature activation-1.83e+00

Pos logit contributions
board+0.047
ops+0.037
 He+0.036
affe+0.035
 boy+0.034
Neg logit contributions
 such-0.055
 outer-0.055
 unnecessary-0.052
rous-0.051
 our-0.050
TokenF
Token ID37
Feature activation+1.15e+01

Pos logit contributions
board+0.051
 He+0.046
ops+0.043
 was+0.041
s+0.041
Neg logit contributions
 outer-0.048
 unnecessary-0.047
 such-0.047
 possible-0.043
rous-0.043
Tokenro
Token ID305
Feature activation-4.55e+01

Pos logit contributions
 outer+0.299
 unnecessary+0.294
 such+0.284
rous+0.281
 possible+0.266
Neg logit contributions
board-0.306
 He-0.261
ops-0.253
boards-0.253
 boy-0.241
Tokenggy
Token ID19970
Feature activation-3.82e+00

Pos logit contributions
board+1.372
 He+1.256
ops+1.121
s+1.109
 boy+1.106
Neg logit contributions
 such-1.030
 outer-1.008
 unnecessary-0.963
rous-0.923
 possible-0.918
Token provided
Token ID2810
Feature activation+1.54e+00

Pos logit contributions
board+0.080
 He+0.065
ops+0.064
boards+0.061
iness+0.057
Neg logit contributions
 such-0.114
 an-0.110
 our-0.107
 able-0.105
 any-0.104
Token D
Token ID360
Feature activation+1.36e+00

Pos logit contributions
 outer+0.041
 unnecessary+0.040
 such+0.038
 possible+0.037
rous+0.036
Neg logit contributions
board-0.043
 He-0.040
 was-0.036
ops-0.036
s-0.036
Tokenucky
Token ID5309
Feature activation-1.11e+01

Pos logit contributions
 unnecessary+0.030
 such+0.028
 your+0.026
 rid+0.025
 any+0.025
Neg logit contributions
board-0.035
jo-0.034
cases-0.033
oc-0.032
 He-0.032
Token with
Token ID351
Feature activation+2.19e+00

Pos logit contributions
board+0.259
 He+0.233
 was+0.215
s+0.211
 boy+0.207
Neg logit contributions
 outer-0.345
 unnecessary-0.339
 such-0.326
 possible-0.317
rous-0.314
Token She
Token ID1375
Feature activation-1.17e+01

Pos logit contributions
board+0.075
ops+0.064
les+0.057
affe+0.055
 boy+0.055
Neg logit contributions
 such-0.140
 outer-0.136
 unnecessary-0.125
rous-0.124
 an-0.124
Token called
Token ID1444
Feature activation-1.61e+00

Pos logit contributions
board+0.285
 He+0.252
ops+0.238
affe+0.224
amboo+0.219
Neg logit contributions
 such-0.328
 outer-0.325
 able-0.302
rous-0.299
 an-0.295
Token it
Token ID340
Feature activation+2.54e+00

Pos logit contributions
board+0.050
 He+0.049
 was+0.045
 and+0.042
 boy+0.042
Neg logit contributions
 unnecessary-0.038
 outer-0.036
rous-0.034
 possible-0.032
 such-0.031
Token "
Token ID366
Feature activation-1.08e+00

Pos logit contributions
 outer+0.066
 such+0.065
 unnecessary+0.061
 possible+0.059
 any+0.058
Neg logit contributions
board-0.071
 He-0.061
ops-0.060
affe-0.057
 boy-0.056
TokenF
Token ID37
Feature activation+1.19e+01

Pos logit contributions
board+0.031
 He+0.027
ops+0.026
 was+0.025
 boy+0.025
Neg logit contributions
 outer-0.028
 such-0.027
 unnecessary-0.026
rous-0.025
 possible-0.024
Tokenro
Token ID305
Feature activation-4.54e+01

Pos logit contributions
 outer+0.322
 such+0.309
 unnecessary+0.307
rous+0.288
 possible+0.285
Neg logit contributions
board-0.322
 He-0.289
ops-0.269
 boy-0.260
s-0.260
Tokenggy
Token ID19970
Feature activation-4.04e+00

Pos logit contributions
board+1.106
 He+0.969
ops+0.911
 was+0.868
s+0.854
Neg logit contributions
 outer-1.350
 unnecessary-1.312
 such-1.302
rous-1.222
 possible-1.214
Token"
Token ID1
Feature activation+8.36e+00

Pos logit contributions
board+0.096
 He+0.085
ops+0.077
 boy+0.074
affe+0.072
Neg logit contributions
 such-0.117
 outer-0.115
 unnecessary-0.112
 our-0.106
 any-0.106
Token and
Token ID290
Feature activation-1.58e+01

Pos logit contributions
 such+0.231
 outer+0.217
 unnecessary+0.214
 possible+0.209
 any+0.203
Neg logit contributions
board-0.215
 He-0.187
ops-0.178
oc-0.173
affe-0.171
Token put
Token ID1234
Feature activation+5.06e+00

Pos logit contributions
board+0.433
 He+0.373
ops+0.366
affe+0.344
s+0.340
Neg logit contributions
 such-0.419
 outer-0.418
 unnecessary-0.396
 possible-0.376
 any-0.370
Token it
Token ID340
Feature activation+1.87e+00

Pos logit contributions
 outer+0.104
 unnecessary+0.101
 such+0.093
rous+0.089
 possible+0.088
Neg logit contributions
board-0.169
 He-0.157
 was-0.147
ops-0.147
s-0.144
Token said
Token ID531
Feature activation-1.61e+01

Pos logit contributions
board+0.274
 He+0.203
ops+0.199
affe+0.156
boards+0.155
Neg logit contributions
 such-0.763
 outer-0.754
 possible-0.712
 unnecessary-0.705
 an-0.697
Token.
Token ID13
Feature activation-4.26e+00

Pos logit contributions
board+0.274
 He+0.233
ops+0.199
 was+0.190
s+0.188
Neg logit contributions
 outer-0.599
 unnecessary-0.589
 such-0.582
rous-0.558
 possible-0.550
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
board+0.107
ops+0.091
y+0.082
les+0.081
affe+0.080
Neg logit contributions
 such-0.119
 outer-0.114
 unnecessary-0.107
 possible-0.106
 an-0.106
Token
Token ID198
Feature activation-1.52e+00

Pos logit contributions
board+0.036
 He+0.029
ops+0.028
 boy+0.026
affe+0.026
Neg logit contributions
 outer-0.050
 such-0.049
 unnecessary-0.047
rous-0.045
 possible-0.045
TokenF
Token ID37
Feature activation+1.18e+01

Pos logit contributions
board+0.044
 He+0.039
ops+0.037
 was+0.035
s+0.035
Neg logit contributions
 outer-0.039
 unnecessary-0.038
 such-0.038
rous-0.035
 possible-0.035
Tokenro
Token ID305
Feature activation-4.51e+01

Pos logit contributions
 outer+0.382
 unnecessary+0.368
 such+0.360
rous+0.351
 possible+0.344
Neg logit contributions
board-0.239
 He-0.207
 boy-0.188
s-0.182
boards-0.182
Tokenggie
Token ID23571
Feature activation+2.90e+00

Pos logit contributions
board+1.295
 He+1.176
s+1.055
 boy+1.052
ops+1.052
Neg logit contributions
 outer-1.083
 such-1.080
 unnecessary-1.016
 possible-0.984
rous-0.964
Token hopped
Token ID45230
Feature activation-8.95e-01

Pos logit contributions
 such+0.088
 an+0.083
 outer+0.082
 able+0.079
 these+0.079
Neg logit contributions
board-0.054
 He-0.046
affe-0.044
amboo-0.042
ops-0.042
Token out
Token ID503
Feature activation+1.56e+00

Pos logit contributions
board+0.029
 He+0.028
 was+0.025
s+0.025
 boy+0.024
Neg logit contributions
 unnecessary-0.021
 outer-0.020
 such-0.018
rous-0.018
nam-0.018
Token of
Token ID286
Feature activation-2.19e+00

Pos logit contributions
 unnecessary+0.052
 outer+0.050
rous+0.047
 such+0.047
nam+0.046
Neg logit contributions
board-0.034
 He-0.033
 was-0.028
s-0.027
 boy-0.026
Token the
Token ID262
Feature activation+2.03e+01

Pos logit contributions
 He+0.063
board+0.061
 was+0.060
 boy+0.058
 bear+0.055
Neg logit contributions
 unnecessary-0.058
 possible-0.046
 outer-0.044
rous-0.043
ault-0.043
Token."
Token ID526
Feature activation+6.35e+00

Pos logit contributions
 such+0.127
 outer+0.123
 unnecessary+0.122
 possible+0.116
 any+0.116
Neg logit contributions
board-0.124
 He-0.107
ops-0.101
affe-0.098
 boy-0.095
Token
Token ID220
Feature activation+4.32e+00

Pos logit contributions
 such+0.158
 your+0.139
 an+0.139
 our+0.137
 unnecessary+0.136
Neg logit contributions
board-0.185
affe-0.150
boards-0.146
jo-0.144
ops-0.143
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
 such+0.120
 outer+0.113
 unnecessary+0.109
 possible+0.107
 our+0.106
Neg logit contributions
board-0.116
ops-0.092
affe-0.088
 boy-0.085
ard-0.085
Token
Token ID198
Feature activation-1.03e+00

Pos logit contributions
board+0.026
 He+0.020
ops+0.020
affe+0.019
 boy+0.019
Neg logit contributions
 outer-0.034
 such-0.034
 unnecessary-0.033
rous-0.032
 possible-0.031
TokenF
Token ID37
Feature activation+1.27e+01

Pos logit contributions
board+0.031
 He+0.027
ops+0.026
affe+0.025
 boy+0.025
Neg logit contributions
 outer-0.024
 such-0.024
 unnecessary-0.024
rous-0.022
 possible-0.021
Tokenro
Token ID305
Feature activation-4.44e+01

Pos logit contributions
 outer+0.339
 unnecessary+0.339
rous+0.323
 such+0.321
 possible+0.306
Neg logit contributions
board-0.327
 He-0.277
boards-0.269
ops-0.265
 boy-0.253
Tokenggy
Token ID19970
Feature activation-2.59e+00

Pos logit contributions
board+1.331
 He+1.217
ops+1.082
s+1.076
 boy+1.065
Neg logit contributions
 such-0.999
 outer-0.974
 unnecessary-0.939
 possible-0.899
rous-0.889
Token said
Token ID531
Feature activation-1.52e+01

Pos logit contributions
board+0.056
boards+0.044
 He+0.043
ops+0.043
pit+0.043
Neg logit contributions
 an-0.073
 such-0.073
 our-0.071
 the-0.069
 any-0.068
Token,
Token ID11
Feature activation-2.58e-01

Pos logit contributions
board+0.222
 He+0.193
 was+0.158
s+0.155
ops+0.152
Neg logit contributions
 outer-0.610
 unnecessary-0.588
 such-0.581
rous-0.567
 possible-0.558
Token "
Token ID366
Feature activation+2.00e-01

Pos logit contributions
board+0.006
ops+0.005
 He+0.005
affe+0.004
oc+0.004
Neg logit contributions
 such-0.008
 outer-0.008
 unnecessary-0.007
 any-0.007
 our-0.007
TokenDon
Token ID3987
Feature activation+1.22e+01

Pos logit contributions
 unnecessary+0.005
 such+0.005
 outer+0.005
rous+0.005
 possible+0.005
Neg logit contributions
board-0.006
 He-0.005
ops-0.005
affe-0.005
ushed-0.004
Token deaf
Token ID21815
Feature activation+3.81e+00

Pos logit contributions
 was+0.560
 He+0.537
 wanted+0.532
 and+0.502
 had+0.498
Neg logit contributions
 unnecessary-0.445
 outer-0.430
rous-0.408
 backup-0.377
gary-0.358
Token.
Token ID13
Feature activation-3.04e+00

Pos logit contributions
 such+0.123
 outer+0.116
 unnecessary+0.114
 any+0.113
 possible+0.111
Neg logit contributions
board-0.087
ops-0.074
 He-0.070
affe-0.069
oc-0.066
Token
Token ID198
Feature activation-6.15e-01

Pos logit contributions
board+0.064
ops+0.052
les+0.049
y+0.048
ushed+0.048
Neg logit contributions
 such-0.098
 unnecessary-0.090
 outer-0.090
 these-0.087
 an-0.086
Token
Token ID198
Feature activation-5.05e-01

Pos logit contributions
board+0.013
 He+0.011
ops+0.010
 boy+0.009
affe+0.009
Neg logit contributions
 outer-0.021
 such-0.020
 unnecessary-0.020
rous-0.019
 possible-0.019
TokenF
Token ID37
Feature activation+1.33e+01

Pos logit contributions
board+0.015
 He+0.013
ops+0.013
s+0.012
 was+0.012
Neg logit contributions
 outer-0.013
 unnecessary-0.013
 such-0.012
 possible-0.012
nam-0.012
Tokenro
Token ID305
Feature activation-4.38e+01

Pos logit contributions
 unnecessary+0.460
 outer+0.457
 such+0.447
rous+0.429
 possible+0.420
Neg logit contributions
board-0.240
 He-0.198
ops-0.174
 boy-0.172
boards-0.169
Tokenggy
Token ID19970
Feature activation-2.17e+00

Pos logit contributions
board+1.331
 He+1.236
s+1.094
 boy+1.090
ops+1.083
Neg logit contributions
 such-0.975
 outer-0.951
 unnecessary-0.914
 possible-0.871
rous-0.855
Token was
Token ID373
Feature activation-1.89e+01

Pos logit contributions
board+0.039
 He+0.032
ops+0.030
amboo+0.029
boards+0.029
Neg logit contributions
 such-0.072
 an-0.066
 outer-0.065
 any-0.064
 able-0.064
Token so
Token ID523
Feature activation+1.23e+01

Pos logit contributions
 was+0.498
 He+0.496
 wanted+0.444
board+0.426
 and+0.423
Neg logit contributions
 unnecessary-0.508
 outer-0.494
rous-0.444
nam-0.437
 backup-0.436
Token happy
Token ID3772
Feature activation+6.75e+00

Pos logit contributions
 outer+0.298
 unnecessary+0.293
 such+0.286
rous+0.257
 any+0.254
Neg logit contributions
board-0.358
 He-0.353
 was-0.322
s-0.313
 boy-0.311
Token that
Token ID326
Feature activation+4.31e+00

Pos logit contributions
 such+0.166
 outer+0.150
 any+0.141
 possible+0.141
 your+0.136
Neg logit contributions
board-0.205
ops-0.178
affe-0.173
 He-0.171
ushed-0.168
Token ball
Token ID2613
Feature activation-7.54e+00

Pos logit contributions
 outer+0.624
 such+0.607
 unnecessary+0.598
rous+0.572
nam+0.533
Neg logit contributions
board-0.641
 He-0.556
 boy-0.542
 was-0.521
ops-0.516
Token?"
Token ID1701
Feature activation+1.27e+01

Pos logit contributions
board+0.203
 He+0.181
ops+0.167
 was+0.161
 boy+0.161
Neg logit contributions
 outer-0.205
 unnecessary-0.199
 such-0.198
rous-0.184
 possible-0.182
Token
Token ID198
Feature activation+7.93e-01

Pos logit contributions
 such+0.331
 your+0.298
 these+0.294
 outer+0.292
 unnecessary+0.292
Neg logit contributions
board-0.350
ops-0.292
boards-0.283
ushed-0.275
y-0.263
Token
Token ID198
Feature activation+6.78e-01

Pos logit contributions
 outer+0.026
 such+0.025
 unnecessary+0.025
rous+0.024
 possible+0.024
Neg logit contributions
board-0.017
 He-0.014
ops-0.013
 boy-0.012
affe-0.012
TokenF
Token ID37
Feature activation+1.38e+01

Pos logit contributions
 outer+0.019
 such+0.018
 unnecessary+0.018
rous+0.017
 possible+0.017
Neg logit contributions
board-0.018
 He-0.016
ops-0.015
 was-0.014
 boy-0.014
Tokenro
Token ID305
Feature activation-4.37e+01

Pos logit contributions
 outer+0.403
 unnecessary+0.392
rous+0.382
 such+0.382
 possible+0.364
Neg logit contributions
board-0.301
 He-0.262
 boy-0.248
boards-0.247
s-0.244
Tokenggy
Token ID19970
Feature activation-2.05e+00

Pos logit contributions
board+1.113
 He+0.959
 boy+0.869
ops+0.859
s+0.850
Neg logit contributions
 such-1.213
 outer-1.192
 unnecessary-1.143
 possible-1.118
rous-1.109
Token replied
Token ID8712
Feature activation-6.90e+00

Pos logit contributions
board+0.041
ops+0.035
iness+0.031
affe+0.031
amboo+0.031
Neg logit contributions
 such-0.061
 an-0.061
 your-0.059
 our-0.058
 these-0.058
Token,
Token ID11
Feature activation+5.09e-01

Pos logit contributions
board+0.104
 He+0.089
 was+0.072
ops+0.072
s+0.070
Neg logit contributions
 outer-0.272
 unnecessary-0.266
 such-0.260
rous-0.252
 possible-0.249
Token "
Token ID366
Feature activation+2.28e-01

Pos logit contributions
 such+0.016
 outer+0.016
 unnecessary+0.015
 possible+0.014
 any+0.014
Neg logit contributions
board-0.012
ops-0.009
 He-0.009
affe-0.009
 boy-0.008
TokenSure
Token ID19457
Feature activation+1.09e+01

Pos logit contributions
 outer+0.006
 unnecessary+0.006
 such+0.006
rous+0.005
 possible+0.005
Neg logit contributions
board-0.006
 He-0.006
ops-0.005
 was-0.005
 boy-0.005
Token play
Token ID711
Feature activation+1.86e+00

Pos logit contributions
 outer+0.385
 such+0.366
 unnecessary+0.337
 any+0.324
 our+0.316
Neg logit contributions
board-0.644
 He-0.565
ops-0.556
 boy-0.539
boards-0.538
Token!"
Token ID2474
Feature activation+1.24e+01

Pos logit contributions
 outer+0.053
 unnecessary+0.053
 possible+0.049
 such+0.049
rous+0.048
Neg logit contributions
board-0.049
 He-0.043
 was-0.040
ops-0.039
 boy-0.039
Token
Token ID198
Feature activation-1.29e-01

Pos logit contributions
 such+0.316
 your+0.289
 an+0.281
 our+0.279
 these+0.270
Neg logit contributions
board-0.348
ops-0.294
boards-0.290
pit-0.268
les-0.265
Token
Token ID198
Feature activation-1.91e-01

Pos logit contributions
board+0.003
 He+0.003
ops+0.003
 boy+0.003
affe+0.003
Neg logit contributions
 outer-0.004
 such-0.003
 unnecessary-0.003
rous-0.003
 possible-0.003
TokenF
Token ID37
Feature activation+1.35e+01

Pos logit contributions
board+0.005
 He+0.004
ops+0.004
 boy+0.004
affe+0.004
Neg logit contributions
 outer-0.005
 such-0.005
 unnecessary-0.005
rous-0.005
 possible-0.005
Tokenro
Token ID305
Feature activation-4.37e+01

Pos logit contributions
 outer+0.329
 unnecessary+0.307
rous+0.304
 such+0.304
 possible+0.289
Neg logit contributions
board-0.388
 He-0.343
ops-0.329
boards-0.325
s-0.322
Tokenggy
Token ID19970
Feature activation-2.17e+00

Pos logit contributions
board+1.157
 He+1.042
ops+0.916
 boy+0.913
s+0.907
Neg logit contributions
 such-1.150
 outer-1.143
 unnecessary-1.087
 possible-1.051
rous-1.046
Token and
Token ID290
Feature activation-1.37e+01

Pos logit contributions
board+0.045
 He+0.039
ops+0.035
boards+0.035
affe+0.034
Neg logit contributions
 such-0.063
 an-0.059
 these-0.058
 our-0.057
 outer-0.057
Token D
Token ID360
Feature activation+2.42e+00

Pos logit contributions
board+0.377
 He+0.339
ops+0.316
 was+0.304
 boy+0.304
Neg logit contributions
 outer-0.364
 unnecessary-0.352
 such-0.351
rous-0.326
 possible-0.325
Tokenucky
Token ID5309
Feature activation-1.04e+01

Pos logit contributions
 unnecessary+0.063
 such+0.060
rous+0.053
 your+0.052
 rid+0.051
Neg logit contributions
board-0.059
oc-0.053
man-0.053
 Pole-0.052
cases-0.051
Token played
Token ID2826
Feature activation+8.02e+00

Pos logit contributions
board+0.250
 He+0.224
 Pole+0.211
affe+0.206
oke+0.202
Neg logit contributions
 such-0.275
 an-0.260
 any-0.255
 these-0.250
 the-0.247
Tokenucky
Token ID5309
Feature activation-8.06e+00

Pos logit contributions
 unnecessary+0.128
 such+0.119
rous+0.116
 outer+0.114
 possible+0.112
Neg logit contributions
board-0.124
 He-0.113
boards-0.107
oc-0.104
jo-0.098
Token said
Token ID531
Feature activation-1.32e+01

Pos logit contributions
board+0.169
affe+0.146
 He+0.145
ops+0.145
oke+0.129
Neg logit contributions
 such-0.235
 an-0.227
 your-0.224
 able-0.222
 our-0.221
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
board+0.344
 He+0.328
 was+0.299
s+0.288
 boy+0.282
Neg logit contributions
 outer-0.375
 unnecessary-0.363
 such-0.348
rous-0.348
 possible-0.323
Token "
Token ID366
Feature activation+1.49e+00

Pos logit contributions
 outer+0.032
 such+0.032
 unnecessary+0.030
 possible+0.030
 our+0.029
Neg logit contributions
board-0.025
ops-0.021
 He-0.019
affe-0.018
oc-0.018
TokenF
Token ID37
Feature activation+1.38e+01

Pos logit contributions
 such+0.037
 outer+0.036
 unnecessary+0.034
rous+0.033
 possible+0.033
Neg logit contributions
board-0.044
 He-0.037
ops-0.037
s-0.035
oke-0.035
Tokenro
Token ID305
Feature activation-4.33e+01

Pos logit contributions
 outer+0.396
 such+0.383
 unnecessary+0.383
rous+0.359
 possible+0.355
Neg logit contributions
board-0.353
 He-0.314
ops-0.291
 boy-0.278
 was-0.278
Tokenggy
Token ID19970
Feature activation-2.07e+00

Pos logit contributions
board+1.085
 He+0.972
ops+0.864
s+0.848
 boy+0.846
Neg logit contributions
 outer-1.222
 such-1.218
 unnecessary-1.190
 possible-1.119
rous-1.118
Token,
Token ID11
Feature activation+3.08e-01

Pos logit contributions
board+0.048
 He+0.041
der+0.037
ops+0.037
affe+0.037
Neg logit contributions
 such-0.061
 our-0.058
 your-0.058
 unnecessary-0.057
 outer-0.057
Token you
Token ID345
Feature activation-3.83e+00

Pos logit contributions
 outer+0.008
 such+0.008
 unnecessary+0.008
rous+0.007
 possible+0.007
Neg logit contributions
board-0.009
 He-0.008
ops-0.007
 was-0.007
 boy-0.007
Token are
Token ID389
Feature activation-3.82e+00

Pos logit contributions
board+0.098
 He+0.086
ops+0.080
s+0.076
 boy+0.076
Neg logit contributions
 outer-0.109
 such-0.107
 unnecessary-0.106
 possible-0.099
rous-0.099
Token such
Token ID884
Feature activation+2.94e+01

Pos logit contributions
 He+0.088
 was+0.085
 had+0.077
 boy+0.077
 and+0.076
Neg logit contributions
 outer-0.113
rous-0.105
 unnecessary-0.103
 err-0.092
 assistance-0.090
Token you
Token ID345
Feature activation-3.09e+00

Pos logit contributions
board+0.294
 He+0.274
 was+0.256
 boy+0.253
ops+0.243
Neg logit contributions
 outer-0.234
 unnecessary-0.223
 such-0.218
rous-0.209
 possible-0.197
Token might
Token ID1244
Feature activation-6.60e+00

Pos logit contributions
board+0.065
 He+0.054
ops+0.050
affe+0.048
 boy+0.047
Neg logit contributions
 such-0.100
 outer-0.098
 unnecessary-0.097
 possible-0.093
 any-0.093
Token get
Token ID651
Feature activation+5.65e+00

Pos logit contributions
board+0.188
 He+0.177
 was+0.166
ops+0.162
 boy+0.157
Neg logit contributions
 outer-0.162
 unnecessary-0.158
 such-0.152
rous-0.148
nam-0.142
Token hurt
Token ID5938
Feature activation-1.17e-01

Pos logit contributions
 outer+0.142
rous+0.121
 unnecessary+0.117
wered+0.115
 such+0.115
Neg logit contributions
 He-0.149
 was-0.148
s-0.144
board-0.144
 boy-0.139
Token.
Token ID13
Feature activation-3.57e+00

Pos logit contributions
board+0.003
ops+0.002
 He+0.002
affe+0.002
ushed+0.002
Neg logit contributions
 such-0.004
 outer-0.003
 unnecessary-0.003
 any-0.003
 possible-0.003
Token<|endoftext|>
Token ID50256
Feature activation+3.08e+01

Pos logit contributions
board+0.090
ops+0.078
s+0.071
les+0.070
y+0.068
Neg logit contributions
 such-0.105
 outer-0.103
 unnecessary-0.098
 your-0.095
 our-0.094
TokenOne
Token ID3198
Feature activation+1.78e+01

Pos logit contributions
 such+0.898
 outer+0.871
 unnecessary+0.794
rous+0.776
 possible+0.768
Neg logit contributions
board-0.755
affe-0.627
ops-0.623
 He-0.604
ushed-0.596
Token day
Token ID1110
Feature activation-2.96e-02

Pos logit contributions
 outer+0.493
 unnecessary+0.485
 such+0.476
rous+0.440
 possible+0.436
Neg logit contributions
board-0.479
 He-0.432
ops-0.395
 was-0.393
 boy-0.389
Token,
Token ID11
Feature activation-9.99e-01

Pos logit contributions
board+0.001
 He+0.000
ops+0.000
 boy+0.000
 was+0.000
Neg logit contributions
 outer-0.001
 such-0.001
 unnecessary-0.001
rous-0.001
 possible-0.001
Token a
Token ID257
Feature activation+5.22e+00

Pos logit contributions
 He+0.036
board+0.036
 was+0.035
 boy+0.034
y+0.033
Neg logit contributions
 unnecessary-0.017
 outer-0.016
 such-0.015
rous-0.014
 possible-0.014
Token little
Token ID1310
Feature activation-8.82e+00

Pos logit contributions
 outer+0.131
 unnecessary+0.124
 such+0.117
rous+0.115
 our+0.113
Neg logit contributions
board-0.147
ops-0.130
 He-0.129
boards-0.118
s-0.117
Token no
Token ID645
Feature activation+6.54e+00

Pos logit contributions
 outer+0.055
rous+0.045
 unnecessary+0.044
ivated+0.042
wered+0.041
Neg logit contributions
 was-0.062
board-0.061
 He-0.060
 wanted-0.059
 boy-0.057
Token luck
Token ID8458
Feature activation+9.19e+00

Pos logit contributions
 outer+0.179
 such+0.171
 unnecessary+0.170
rous+0.160
 possible+0.158
Neg logit contributions
board-0.174
 He-0.155
 boy-0.145
ops-0.143
 was-0.142
Token."
Token ID526
Feature activation+8.88e+00

Pos logit contributions
 such+0.267
 outer+0.266
 unnecessary+0.259
 possible+0.254
 any+0.247
Neg logit contributions
board-0.218
 He-0.194
ops-0.178
affe-0.174
 boy-0.166
Token
Token ID198
Feature activation+2.09e+00

Pos logit contributions
 such+0.215
 an+0.202
 our+0.193
 your+0.192
 these+0.184
Neg logit contributions
board-0.264
ops-0.233
affe-0.231
les-0.226
y-0.219
Token
Token ID198
Feature activation+2.20e+00

Pos logit contributions
 outer+0.066
 such+0.065
 unnecessary+0.063
rous+0.060
 possible+0.059
Neg logit contributions
board-0.048
ops-0.038
 He-0.037
affe-0.035
y-0.034
TokenThe
Token ID464
Feature activation+2.53e+01

Pos logit contributions
 outer+0.059
 such+0.057
 unnecessary+0.057
rous+0.053
 possible+0.053
Neg logit contributions
board-0.060
 He-0.053
ops-0.050
affe-0.048
 boy-0.047
Token moms
Token ID34015
Feature activation+8.73e+00

Pos logit contributions
 outer+1.009
 such+0.988
 unnecessary+0.982
 backup+0.957
rous+0.949
Neg logit contributions
board-0.288
 He-0.237
ops-0.218
s-0.195
y-0.188
Token hear
Token ID3285
Feature activation+1.22e+01

Pos logit contributions
 such+0.228
 these+0.204
 backup+0.204
 any+0.203
 outer+0.200
Neg logit contributions
board-0.220
ops-0.203
affe-0.201
 He-0.199
oc-0.194
Token them
Token ID606
Feature activation-4.79e+00

Pos logit contributions
 outer+0.255
 unnecessary+0.247
 such+0.232
 possible+0.227
rous+0.222
Neg logit contributions
board-0.406
 He-0.379
ops-0.349
 was-0.346
 boy-0.346
Token laugh
Token ID6487
Feature activation+1.22e+01

Pos logit contributions
board+0.090
affe+0.074
ops+0.072
 He+0.071
oc+0.065
Neg logit contributions
 such-0.164
 unnecessary-0.155
 outer-0.154
 any-0.154
 our-0.148
Token.
Token ID13
Feature activation+3.55e-01

Pos logit contributions
 such+0.383
 any+0.364
 unnecessary+0.348
 outer+0.341
 our+0.340
Neg logit contributions
board-0.277
ops-0.249
affe-0.224
 Pole-0.217
ushed-0.216
Token and
Token ID290
Feature activation-1.51e+01

Pos logit contributions
board+0.354
 He+0.323
ops+0.312
affe+0.294
amboo+0.284
Neg logit contributions
 such-0.498
 able-0.482
 our-0.453
 outer-0.453
 these-0.452
Token daddy
Token ID35822
Feature activation+6.60e+00

Pos logit contributions
board+0.401
 He+0.366
 boy+0.341
 was+0.333
ops+0.323
Neg logit contributions
 outer-0.417
 unnecessary-0.409
 such-0.395
rous-0.379
 possible-0.377
Token took
Token ID1718
Feature activation-2.84e+00

Pos logit contributions
 such+0.187
 able+0.177
 outer+0.171
 permission+0.167
 any+0.166
Neg logit contributions
board-0.147
 He-0.137
affe-0.125
 Maybe-0.124
ops-0.118
Token a
Token ID257
Feature activation+5.17e+00

Pos logit contributions
board+0.083
 He+0.082
 was+0.080
 boy+0.078
s+0.075
Neg logit contributions
 outer-0.064
 possible-0.060
 unnecessary-0.058
nam-0.058
rous-0.057
Token deep
Token ID2769
Feature activation+8.28e+00

Pos logit contributions
 such+0.138
 unnecessary+0.132
 possible+0.123
rous+0.121
 outer+0.121
Neg logit contributions
board-0.153
s-0.133
 was-0.129
 boy-0.127
 He-0.123
Token breath
Token ID8033
Feature activation+3.49e+01

Pos logit contributions
 unnecessary+0.145
 such+0.130
 possible+0.126
nam+0.125
 outer+0.124
Neg logit contributions
board-0.316
 He-0.289
 was-0.276
 boy-0.272
les-0.269
Token.
Token ID13
Feature activation-4.82e+00

Pos logit contributions
 such+1.310
 outer+1.304
 possible+1.233
 unnecessary+1.229
 any+1.225
Neg logit contributions
board-0.529
 He-0.431
ops-0.382
 boy-0.366
affe-0.363
Token They
Token ID1119
Feature activation-1.32e+01

Pos logit contributions
board+0.087
ops+0.077
affe+0.071
les+0.063
y+0.063
Neg logit contributions
 outer-0.170
 such-0.169
 our-0.154
 these-0.153
 unnecessary-0.153
Token were
Token ID547
Feature activation-1.42e+01

Pos logit contributions
board+0.297
 He+0.256
ops+0.235
affe+0.229
 boy+0.213
Neg logit contributions
 outer-0.416
 such-0.408
 unnecessary-0.376
rous-0.374
nam-0.372
Token so
Token ID523
Feature activation+1.02e+01

Pos logit contributions
 was+0.381
 He+0.374
board+0.344
 said+0.343
 and+0.343
Neg logit contributions
 outer-0.363
 unnecessary-0.363
rous-0.326
 backup-0.302
 possible-0.294
Token happy
Token ID3772
Feature activation+4.89e+00

Pos logit contributions
 outer+0.249
 unnecessary+0.245
 such+0.235
 possible+0.212
 any+0.211
Neg logit contributions
board-0.302
 He-0.285
 was-0.274
s-0.273
 boy-0.262
Tokens
Token ID82
Feature activation-6.76e+00

Pos logit contributions
 such+0.100
 possible+0.092
 unnecessary+0.092
 outer+0.092
 any+0.091
Neg logit contributions
board-0.046
 He-0.037
 boy-0.035
affe-0.035
ard-0.032
Token important
Token ID1593
Feature activation+1.04e+01

Pos logit contributions
board+0.209
 He+0.189
ops+0.179
 boy+0.171
s+0.171
Neg logit contributions
 outer-0.156
 such-0.151
 unnecessary-0.151
 possible-0.138
rous-0.137
Token to
Token ID284
Feature activation+2.06e+01

Pos logit contributions
 outer+0.210
 such+0.206
 unnecessary+0.206
 possible+0.187
 any+0.180
Neg logit contributions
board-0.350
 He-0.316
ops-0.306
affe-0.296
 boy-0.289
Token increase
Token ID2620
Feature activation+4.91e+00

Pos logit contributions
 outer+0.531
 unnecessary+0.519
 such+0.500
rous+0.482
 possible+0.464
Neg logit contributions
board-0.567
 He-0.525
 was-0.483
ops-0.475
 boy-0.468
Token the
Token ID262
Feature activation+2.31e+01

Pos logit contributions
 outer+0.135
 unnecessary+0.131
 such+0.130
rous+0.122
 possible+0.121
Neg logit contributions
board-0.126
 He-0.115
 was-0.105
ops-0.104
s-0.103
Token tid
Token ID29770
Feature activation+3.25e+01

Pos logit contributions
 outer+0.546
 such+0.529
 unnecessary+0.519
rous+0.484
 possible+0.473
Neg logit contributions
board-0.705
 He-0.626
ops-0.587
 boy-0.585
 was-0.583
Tokeniness
Token ID1272
Feature activation-2.85e+01

Pos logit contributions
 outer+0.897
 such+0.893
 possible+0.856
 unnecessary+0.852
rous+0.823
Neg logit contributions
board-0.794
 He-0.733
affe-0.685
 boy-0.656
ops-0.656
Token of
Token ID286
Feature activation+1.91e-01

Pos logit contributions
board+0.750
 He+0.654
ops+0.626
affe+0.601
 boy+0.586
Neg logit contributions
 outer-0.787
 such-0.773
 unnecessary-0.761
 possible-0.731
 any-0.704
Token the
Token ID262
Feature activation+2.19e+01

Pos logit contributions
 outer+0.003
 unnecessary+0.003
 such+0.003
rous+0.003
 possible+0.003
Neg logit contributions
board-0.007
 He-0.006
 was-0.006
 boy-0.006
ops-0.006
Token toilet
Token ID16146
Feature activation-1.93e+00

Pos logit contributions
 outer+0.567
 such+0.549
 unnecessary+0.548
rous+0.507
 possible+0.505
Neg logit contributions
board-0.619
 He-0.553
ops-0.518
 was-0.499
 boy-0.498
Token.
Token ID13
Feature activation-2.26e+00

Pos logit contributions
board+0.039
 He+0.037
affe+0.035
ops+0.033
ushed+0.032
Neg logit contributions
 such-0.059
 outer-0.059
 possible-0.056
 unnecessary-0.055
 any-0.055
Token Tim
Token ID5045
Feature activation-1.03e+01

Pos logit contributions
 outer+0.171
 unnecessary+0.166
 such+0.162
 possible+0.155
rous+0.154
Neg logit contributions
board-0.151
 He-0.137
ops-0.123
 was-0.121
 boy-0.121
Tokenmy
Token ID1820
Feature activation-1.24e+01

Pos logit contributions
board+0.153
 He+0.122
ops+0.112
affe+0.098
s+0.093
Neg logit contributions
 such-0.400
 outer-0.397
 unnecessary-0.387
 any-0.371
rous-0.371
Token the
Token ID262
Feature activation+2.42e+01

Pos logit contributions
board+0.377
 was+0.363
 He+0.362
s+0.352
 and+0.338
Neg logit contributions
 unnecessary-0.320
 outer-0.290
 possible-0.276
rous-0.248
 miles-0.244
Token pictures
Token ID5986
Feature activation+5.25e+00

Pos logit contributions
 outer+0.773
 unnecessary+0.763
 such+0.746
rous+0.724
 possible+0.709
Neg logit contributions
board-0.547
 He-0.476
ops-0.421
 was-0.419
 boy-0.418
Token in
Token ID287
Feature activation+1.39e+01

Pos logit contributions
 such+0.160
 outer+0.150
 possible+0.146
 any+0.145
 these+0.139
Neg logit contributions
board-0.119
oc-0.105
ops-0.104
affe-0.103
 He-0.100
Token the
Token ID262
Feature activation+2.44e+01

Pos logit contributions
 unnecessary+0.206
 outer+0.200
 such+0.184
rous+0.175
 possible+0.171
Neg logit contributions
board-0.541
 He-0.513
 was-0.482
 boy-0.476
ops-0.476
Token book
Token ID1492
Feature activation+4.70e+00

Pos logit contributions
 outer+0.717
 such+0.696
 unnecessary+0.693
rous+0.646
 possible+0.641
Neg logit contributions
board-0.605
 He-0.537
ops-0.500
s-0.474
affe-0.472
Token and
Token ID290
Feature activation-1.07e+01

Pos logit contributions
 such+0.148
 outer+0.136
 any+0.131
 an+0.130
 possible+0.129
Neg logit contributions
affe-0.083
ushed-0.082
amboo-0.082
board-0.080
oc-0.079
Token told
Token ID1297
Feature activation-2.79e-01

Pos logit contributions
board+0.242
ops+0.215
affe+0.209
 He+0.196
oc+0.185
Neg logit contributions
 such-0.343
 outer-0.319
nam-0.318
 these-0.310
 any-0.306
Token him
Token ID683
Feature activation-5.34e+00

Pos logit contributions
board+0.009
 He+0.008
ops+0.007
s+0.007
affe+0.007
Neg logit contributions
 outer-0.006
 such-0.006
 unnecessary-0.006
rous-0.006
 possible-0.006
Token about
Token ID546
Feature activation+1.25e+01

Pos logit contributions
board+0.173
 He+0.159
ops+0.148
 was+0.145
s+0.144
Neg logit contributions
 outer-0.117
 unnecessary-0.112
 such-0.110
rous-0.103
 possible-0.100
Token and
Token ID290
Feature activation-1.69e+01

Pos logit contributions
board+0.171
ops+0.156
 Pole+0.151
amboo+0.149
boards+0.147
Neg logit contributions
 such-0.241
 the-0.233
 an-0.232
 these-0.225
 able-0.224
Token Dad
Token ID17415
Feature activation-6.17e+00

Pos logit contributions
board+0.492
 He+0.437
ops+0.418
affe+0.399
s+0.394
Neg logit contributions
 outer-0.417
 such-0.410
 unnecessary-0.400
rous-0.374
 possible-0.371
Token looked
Token ID3114
Feature activation-1.27e+01

Pos logit contributions
board+0.128
 He+0.115
amboo+0.107
affe+0.106
ops+0.105
Neg logit contributions
 such-0.183
 able-0.175
 an-0.172
 outer-0.172
 permission-0.169
Token at
Token ID379
Feature activation+4.66e+00

Pos logit contributions
board+0.347
 He+0.317
 was+0.289
s+0.288
 boy+0.286
Neg logit contributions
 outer-0.341
 unnecessary-0.329
 such-0.325
 possible-0.309
rous-0.298
Token the
Token ID262
Feature activation+1.80e+01

Pos logit contributions
 outer+0.062
 unnecessary+0.058
 such+0.058
rous+0.049
 possible+0.048
Neg logit contributions
board-0.192
 He-0.178
ops-0.171
 was-0.166
 boy-0.166
Token pe
Token ID613
Feature activation+5.09e+00

Pos logit contributions
 outer+0.403
 such+0.395
 unnecessary+0.379
 possible+0.363
rous+0.355
Neg logit contributions
board-0.559
 He-0.505
ops-0.486
affe-0.468
s-0.458
Tokenb
Token ID65
Feature activation-7.19e+00

Pos logit contributions
 outer+0.130
 such+0.126
 unnecessary+0.126
rous+0.116
 possible+0.115
Neg logit contributions
board-0.145
 He-0.130
ops-0.122
 was-0.118
s-0.117
Tokenble
Token ID903
Feature activation-1.78e+01

Pos logit contributions
board+0.228
 He+0.202
 boy+0.191
ops+0.190
affe+0.189
Neg logit contributions
 such-0.152
 outer-0.147
 possible-0.142
 unnecessary-0.135
 any-0.135
Token carefully
Token ID7773
Feature activation-1.67e+01

Pos logit contributions
board+0.325
 He+0.270
ops+0.257
ushed+0.249
affe+0.245
Neg logit contributions
 such-0.610
 outer-0.598
 any-0.577
 possible-0.576
 these-0.553
Token.
Token ID13
Feature activation-5.80e+00

Pos logit contributions
board+0.294
 He+0.243
ops+0.224
affe+0.209
 boy+0.203
Neg logit contributions
 outer-0.603
 such-0.597
 unnecessary-0.581
 possible-0.558
 any-0.554
Token It
Token ID632
Feature activation-9.81e+00

Pos logit contributions
board+0.129
ops+0.116
affe+0.114
oc+0.107
jo+0.100
Neg logit contributions
 such-0.179
 outer-0.166
 an-0.161
 our-0.160
rous-0.159
Token.
Token ID13
Feature activation-1.64e+00

Pos logit contributions
 unnecessary+0.152
 outer+0.151
 such+0.145
rous+0.143
nam+0.141
Neg logit contributions
board-0.047
 He-0.039
s-0.031
 was-0.031
ops-0.029
Token They
Token ID1119
Feature activation-1.13e+01

Pos logit contributions
board+0.035
ops+0.028
affe+0.026
 He+0.026
oc+0.024
Neg logit contributions
 outer-0.053
 such-0.053
 unnecessary-0.051
rous-0.049
 our-0.049
Token put
Token ID1234
Feature activation+7.84e+00

Pos logit contributions
board+0.346
 He+0.306
ops+0.297
affe+0.286
s+0.281
Neg logit contributions
 outer-0.262
 such-0.254
 unnecessary-0.241
rous-0.230
 any-0.225
Token on
Token ID319
Feature activation+9.24e+00

Pos logit contributions
 outer+0.180
 unnecessary+0.172
 possible+0.162
 such+0.160
rous+0.153
Neg logit contributions
board-0.244
 He-0.228
 was-0.216
 boy-0.214
ops-0.213
Token their
Token ID511
Feature activation+1.03e+01

Pos logit contributions
 possible+0.241
 unnecessary+0.233
shadow+0.216
 outer+0.211
 able+0.208
Neg logit contributions
board-0.244
 was-0.237
s-0.228
 He-0.226
 boy-0.219
Token hats
Token ID23910
Feature activation+3.74e+00

Pos logit contributions
 outer+0.291
 such+0.285
 unnecessary+0.283
 possible+0.271
rous+0.264
Neg logit contributions
board-0.267
 He-0.233
 was-0.216
ops-0.214
 boy-0.213
Token,
Token ID11
Feature activation+9.27e-01

Pos logit contributions
 such+0.133
 outer+0.125
 any+0.122
 our+0.120
 an+0.119
Neg logit contributions
board-0.061
 He-0.052
affe-0.048
oc-0.047
 boy-0.046
Token shoes
Token ID10012
Feature activation-1.41e-01

Pos logit contributions
 outer+0.025
 unnecessary+0.024
 such+0.024
 possible+0.023
rous+0.022
Neg logit contributions
board-0.025
 He-0.023
ops-0.021
 was-0.020
 boy-0.020
Token and
Token ID290
Feature activation-1.29e+01

Pos logit contributions
board+0.003
 He+0.002
affe+0.002
ops+0.002
 boy+0.002
Neg logit contributions
 such-0.005
 outer-0.005
 any-0.004
 unnecessary-0.004
 our-0.004
Token coats
Token ID30720
Feature activation+1.18e+01

Pos logit contributions
board+0.359
 He+0.321
ops+0.301
 was+0.287
 boy+0.287
Neg logit contributions
 outer-0.338
 such-0.327
 unnecessary-0.327
rous-0.302
 possible-0.301
Token.
Token ID13
Feature activation-8.56e-01

Pos logit contributions
 such+0.377
 outer+0.362
 any+0.348
 our+0.333
 an+0.331
Neg logit contributions
board-0.249
affe-0.210
 He-0.207
oc-0.202
ops-0.191
Token the
Token ID262
Feature activation+1.93e+01

Pos logit contributions
 outer+0.337
 such+0.335
 unnecessary+0.328
 possible+0.289
 any+0.286
Neg logit contributions
board-0.625
 He-0.565
ops-0.545
affe-0.523
s-0.522
Token duck
Token ID22045
Feature activation-4.58e+00

Pos logit contributions
 unnecessary+0.463
 outer+0.461
 such+0.457
rous+0.421
 possible+0.419
Neg logit contributions
board-0.574
 He-0.515
ops-0.495
s-0.469
affe-0.463
Token and
Token ID290
Feature activation-1.58e+01

Pos logit contributions
board+0.096
 He+0.078
ops+0.075
affe+0.069
 boy+0.067
Neg logit contributions
 such-0.153
 outer-0.149
 unnecessary-0.149
 any-0.143
 possible-0.141
Token was
Token ID373
Feature activation-2.08e+01

Pos logit contributions
board+0.520
 He+0.472
ops+0.448
s+0.430
 boy+0.430
Neg logit contributions
 outer-0.339
 such-0.327
 unnecessary-0.324
rous-0.294
 possible-0.294
Token happy
Token ID3772
Feature activation+4.37e+00

Pos logit contributions
 was+0.544
 He+0.514
 wanted+0.478
 had+0.467
 said+0.458
Neg logit contributions
 unnecessary-0.593
 outer-0.566
rous-0.485
nam-0.485
 err-0.478
Token that
Token ID326
Feature activation+2.16e+00

Pos logit contributions
 such+0.078
 outer+0.072
 unnecessary+0.068
 possible+0.065
 any+0.064
Neg logit contributions
board-0.164
ops-0.145
 He-0.144
affe-0.141
oc-0.136
Token Max
Token ID5436
Feature activation-1.80e+01

Pos logit contributions
 outer+0.065
 such+0.064
 unnecessary+0.063
 possible+0.059
rous+0.059
Neg logit contributions
board-0.052
 He-0.045
ops-0.042
affe-0.040
s-0.040
Token was
Token ID373
Feature activation-2.06e+01

Pos logit contributions
board+0.329
ops+0.244
omo+0.227
 Pole+0.225
boards+0.223
Neg logit contributions
 such-0.691
 able-0.600
 any-0.596
 an-0.590
 unnecessary-0.583
Token by
Token ID416
Feature activation+8.51e+00

Pos logit contributions
 was+0.591
 He+0.561
 and+0.545
 wanted+0.533
 had+0.527
Neg logit contributions
 unnecessary-0.478
 outer-0.475
nam-0.419
rous-0.416
 err-0.394
Token her
Token ID607
Feature activation+2.82e+00

Pos logit contributions
rous+0.168
 unnecessary+0.166
 outer+0.162
gary+0.140
nam+0.133
Neg logit contributions
 He-0.264
 was-0.243
board-0.231
 boy-0.224
 They-0.223
Token side
Token ID1735
Feature activation-6.67e+00

Pos logit contributions
 outer+0.075
 such+0.073
 unnecessary+0.073
rous+0.067
 possible+0.067
Neg logit contributions
board-0.078
 He-0.070
ops-0.065
 was-0.062
 boy-0.062
Token off
Token ID572
Feature activation+1.26e+01

Pos logit contributions
board+0.265
ops+0.236
 He+0.236
affe+0.228
 boy+0.223
Neg logit contributions
 outer-0.122
 such-0.121
 unnecessary-0.114
 possible-0.103
 any-0.103
Token the
Token ID262
Feature activation+2.26e+01

Pos logit contributions
 outer+0.296
 such+0.289
 unnecessary+0.274
 possible+0.257
 any+0.253
Neg logit contributions
board-0.385
 He-0.341
ops-0.333
affe-0.318
s-0.314
Token tree
Token ID5509
Feature activation-8.61e+00

Pos logit contributions
 outer+0.744
 such+0.727
 unnecessary+0.725
 possible+0.685
rous+0.676
Neg logit contributions
board-0.469
 He-0.406
ops-0.373
s-0.351
 was-0.344
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
board+0.176
ops+0.149
 He+0.149
affe+0.141
 boy+0.140
Neg logit contributions
 such-0.275
 outer-0.271
 any-0.251
 possible-0.247
 able-0.247
Token George
Token ID4502
Feature activation-2.99e+00

Pos logit contributions
board+0.041
ops+0.035
affe+0.032
y+0.032
s+0.032
Neg logit contributions
 such-0.037
 outer-0.037
 unnecessary-0.034
 any-0.032
 our-0.032
Token smiled
Token ID13541
Feature activation+2.74e+00

Pos logit contributions
board+0.053
ops+0.041
 He+0.041
amboo+0.041
affe+0.038
Neg logit contributions
 such-0.103
 able-0.097
 an-0.095
 outer-0.095
 any-0.094
Token and
Token ID290
Feature activation-1.20e+01

Pos logit contributions
 such+0.099
 outer+0.097
 any+0.092
 an+0.090
 unnecessary+0.089
Neg logit contributions
board-0.053
ops-0.046
affe-0.039
oke-0.036
boards-0.035
Token took
Token ID1718
Feature activation+8.54e-01

Pos logit contributions
board+0.247
ops+0.198
 He+0.191
affe+0.184
s+0.168
Neg logit contributions
 such-0.410
 outer-0.399
 unnecessary-0.382
rous-0.369
 possible-0.369
Token a
Token ID257
Feature activation+8.06e+00

Pos logit contributions
nam+0.020
 outer+0.019
 unnecessary+0.019
rous+0.018
 possible+0.016
Neg logit contributions
 He-0.027
 was-0.026
board-0.024
 boy-0.023
 and-0.022
Token bite
Token ID13197
Feature activation+1.12e+01

Pos logit contributions
 such+0.206
 outer+0.199
 unnecessary+0.194
rous+0.179
 possible+0.178
Neg logit contributions
board-0.242
s-0.218
 boy-0.211
 was-0.210
 He-0.209
Token.
Token ID13
Feature activation-7.67e-01

Pos logit contributions
 such+0.365
 any+0.327
 outer+0.324
 possible+0.315
 unnecessary+0.312
Neg logit contributions
board-0.243
ops-0.210
 He-0.202
affe-0.193
ushed-0.193
Token Ben
Token ID3932
Feature activation-7.34e+00

Pos logit contributions
board+0.257
 He+0.230
ops+0.216
 boy+0.207
 was+0.207
Neg logit contributions
 outer-0.230
 such-0.223
 unnecessary-0.221
rous-0.204
 possible-0.204
Token.
Token ID13
Feature activation-2.57e-01

Pos logit contributions
board+0.168
ops+0.139
boards+0.133
 Pole+0.129
affe+0.129
Neg logit contributions
 such-0.225
 any-0.205
 an-0.200
 able-0.198
 your-0.196
Token He
Token ID679
Feature activation-1.79e+01

Pos logit contributions
board+0.007
ops+0.006
les+0.006
actus+0.005
ard+0.005
Neg logit contributions
 such-0.007
 outer-0.007
 your-0.006
 our-0.006
 an-0.006
Token pulled
Token ID5954
Feature activation+7.93e+00

Pos logit contributions
board+0.375
ops+0.333
amboo+0.308
 He+0.294
ard+0.291
Neg logit contributions
 such-0.530
 an-0.528
 outer-0.481
 able-0.480
 the-0.471
Token hard
Token ID1327
Feature activation+8.07e+00

Pos logit contributions
 outer+0.153
 such+0.146
 unnecessary+0.145
rous+0.130
 possible+0.130
Neg logit contributions
board-0.278
 He-0.255
ops-0.242
 was-0.234
 boy-0.234
Token on
Token ID319
Feature activation+1.07e+01

Pos logit contributions
 outer+0.286
 such+0.281
 unnecessary+0.280
 possible+0.264
rous+0.263
Neg logit contributions
board-0.149
 He-0.125
ops-0.113
affe-0.105
 boy-0.105
Token the
Token ID262
Feature activation+2.34e+01

Pos logit contributions
 outer+0.163
 such+0.155
 unnecessary+0.153
rous+0.133
 possible+0.132
Neg logit contributions
board-0.417
 He-0.385
ops-0.368
 was-0.357
 boy-0.357
Token root
Token ID6808
Feature activation-9.51e+00

Pos logit contributions
 outer+0.695
 unnecessary+0.672
 such+0.672
 possible+0.622
 any+0.615
Neg logit contributions
board-0.549
 He-0.494
ops-0.465
s-0.437
 was-0.428
Token,
Token ID11
Feature activation+1.34e+00

Pos logit contributions
board+0.202
ara+0.186
ushed+0.184
ops+0.178
actus+0.176
Neg logit contributions
 such-0.276
 any-0.272
 possible-0.254
 outer-0.254
 able-0.244
Token but
Token ID475
Feature activation-3.75e+00

Pos logit contributions
 such+0.048
 outer+0.048
 unnecessary+0.047
 any+0.045
 possible+0.044
Neg logit contributions
board-0.025
 He-0.019
ops-0.019
affe-0.017
 boy-0.016
Token it
Token ID340
Feature activation+4.26e+00

Pos logit contributions
board+0.102
 He+0.089
ops+0.085
affe+0.081
s+0.079
Neg logit contributions
 outer-0.100
 such-0.100
 unnecessary-0.097
 possible-0.092
rous-0.090
Token "
Token ID366
Feature activation+3.44e+00

Pos logit contributions
 outer+0.108
 such+0.107
 unnecessary+0.102
 possible+0.098
 any+0.098
Neg logit contributions
board-0.079
ops-0.067
 He-0.062
affe-0.060
s-0.057
TokenI
Token ID40
Feature activation+4.38e+00

Pos logit contributions
 outer+0.083
 such+0.082
 unnecessary+0.077
rous+0.075
 possible+0.075
Neg logit contributions
board-0.101
 He-0.086
ops-0.085
s-0.081
affe-0.081
Token saw
Token ID2497
Feature activation+2.55e+00

Pos logit contributions
 such+0.105
 outer+0.101
 unnecessary+0.099
 possible+0.096
 rid+0.095
Neg logit contributions
board-0.131
 He-0.111
ops-0.110
affe-0.107
boards-0.107
Token how
Token ID703
Feature activation+1.95e+01

Pos logit contributions
 outer+0.046
 unnecessary+0.043
 such+0.040
rous+0.039
 possible+0.039
Neg logit contributions
board-0.090
 He-0.086
 was-0.081
 boy-0.080
ops-0.078
Token much
Token ID881
Feature activation+1.10e+01

Pos logit contributions
 outer+0.421
 unnecessary+0.405
 such+0.399
rous+0.368
 possible+0.361
Neg logit contributions
board-0.611
 He-0.608
s-0.558
 was-0.554
 boy-0.548
Token you
Token ID345
Feature activation-1.87e+00

Pos logit contributions
 outer+0.256
 such+0.250
 unnecessary+0.243
 possible+0.229
rous+0.223
Neg logit contributions
board-0.343
 He-0.303
ops-0.294
affe-0.280
s-0.277
Token like
Token ID588
Feature activation+8.24e+00

Pos logit contributions
board+0.046
 He+0.041
affe+0.037
 boy+0.036
ops+0.036
Neg logit contributions
 such-0.054
 outer-0.053
 unnecessary-0.049
 any-0.049
 rid-0.048
Token to
Token ID284
Feature activation+1.92e+01

Pos logit contributions
 unnecessary+0.198
 outer+0.197
 such+0.183
rous+0.181
 possible+0.176
Neg logit contributions
board-0.243
 He-0.225
 was-0.210
ops-0.205
s-0.202
Token write
Token ID3551
Feature activation+3.79e+00

Pos logit contributions
 outer+0.432
 such+0.415
 unnecessary+0.407
 possible+0.375
rous+0.374
Neg logit contributions
board-0.609
 He-0.549
ops-0.521
s-0.503
 boy-0.503
Token,
Token ID11
Feature activation+7.75e-01

Pos logit contributions
 outer+0.107
 such+0.105
 unnecessary+0.103
 possible+0.096
rous+0.095
Neg logit contributions
board-0.098
 He-0.086
ops-0.081
 boy-0.077
affe-0.077
Token so
Token ID523
Feature activation+1.30e+01

Pos logit contributions
 such+0.023
 outer+0.022
 unnecessary+0.021
 possible+0.020
 our+0.020
Neg logit contributions
board-0.019
ops-0.017
 He-0.016
s-0.015
ushed-0.015
Token kissed
Token ID27946
Feature activation+5.44e-01

Pos logit contributions
board+0.277
 He+0.229
ops+0.208
affe+0.191
s+0.185
Neg logit contributions
 outer-0.564
 such-0.557
 unnecessary-0.547
rous-0.521
 possible-0.520
Token them
Token ID606
Feature activation-9.67e+00

Pos logit contributions
 such+0.007
 any+0.006
 outer+0.006
 unnecessary+0.006
 anything+0.005
Neg logit contributions
board-0.023
ops-0.021
 He-0.020
affe-0.020
ushed-0.020
Token.
Token ID13
Feature activation-4.39e+00

Pos logit contributions
board+0.207
 He+0.178
ops+0.163
 boy+0.153
 was+0.152
Neg logit contributions
 outer-0.315
 such-0.308
 unnecessary-0.306
rous-0.288
 possible-0.287
Token Some
Token ID2773
Feature activation+6.48e+00

Pos logit contributions
board+0.091
ops+0.074
affe+0.067
y+0.063
ard+0.061
Neg logit contributions
 outer-0.153
 such-0.149
 unnecessary-0.140
 our-0.139
 your-0.137
Token told
Token ID1297
Feature activation-4.80e+00

Pos logit contributions
 such+0.167
 unnecessary+0.157
 outer+0.152
 any+0.151
 these+0.150
Neg logit contributions
board-0.176
 He-0.165
ops-0.144
affe-0.142
oc-0.142
Token them
Token ID606
Feature activation-9.83e+00

Pos logit contributions
board+0.172
 He+0.158
ops+0.150
 was+0.147
 boy+0.147
Neg logit contributions
 outer-0.088
 unnecessary-0.083
 such-0.082
rous-0.075
 possible-0.074
Token stories
Token ID3923
Feature activation+6.53e+00

Pos logit contributions
board+0.318
 He+0.285
ops+0.274
affe+0.264
 boy+0.261
Neg logit contributions
 such-0.211
 outer-0.209
 unnecessary-0.201
 any-0.185
 possible-0.183
Token or
Token ID393
Feature activation-1.21e+01

Pos logit contributions
 such+0.225
 outer+0.207
 possible+0.205
 any+0.203
 anything+0.197
Neg logit contributions
board-0.118
 He-0.104
oc-0.101
affe-0.097
ops-0.095
Token jokes
Token ID14532
Feature activation-9.55e+00

Pos logit contributions
board+0.339
 He+0.314
 was+0.300
 boy+0.297
s+0.281
Neg logit contributions
 outer-0.297
 unnecessary-0.279
rous-0.274
 such-0.270
 possible-0.263
Token.
Token ID13
Feature activation-4.72e+00

Pos logit contributions
board+0.204
 He+0.175
affe+0.162
oc+0.161
ops+0.158
Neg logit contributions
 such-0.307
 outer-0.283
 any-0.283
 possible-0.275
 anything-0.272
Token Lily
Token ID20037
Feature activation-1.30e+01

Pos logit contributions
board+0.094
ops+0.074
affe+0.068
y+0.065
ushed+0.062
Neg logit contributions
 outer-0.165
 such-0.164
 unnecessary-0.155
 our-0.150
 any-0.149
Token So
Token ID1406
Feature activation+5.91e+00

Pos logit contributions
board+0.052
ops+0.051
affe+0.046
oc+0.045
les+0.045
Neg logit contributions
 such-0.054
 an-0.050
 any-0.049
 outer-0.048
 unnecessary-0.048
Token they
Token ID484
Feature activation-8.18e+00

Pos logit contributions
 outer+0.210
 such+0.208
 unnecessary+0.204
 possible+0.194
rous+0.193
Neg logit contributions
board-0.113
 He-0.091
ops-0.085
affe-0.078
s-0.076
Token went
Token ID1816
Feature activation-2.65e+00

Pos logit contributions
board+0.177
 He+0.154
ops+0.146
affe+0.137
s+0.131
Neg logit contributions
 such-0.257
 outer-0.254
 unnecessary-0.246
 any-0.239
nam-0.236
Token to
Token ID284
Feature activation+1.86e+01

Pos logit contributions
board+0.078
 He+0.075
s+0.069
 was+0.069
 and+0.068
Neg logit contributions
 unnecessary-0.067
 outer-0.065
 possible-0.060
 such-0.060
nam-0.059
Token the
Token ID262
Feature activation+2.23e+01

Pos logit contributions
 unnecessary+0.538
 possible+0.492
 outer+0.473
rous+0.464
gary+0.460
Neg logit contributions
 He-0.492
 was-0.472
 boy-0.450
 They-0.445
board-0.431
Token store
Token ID3650
Feature activation-8.88e+00

Pos logit contributions
 outer+0.719
 unnecessary+0.699
 such+0.698
rous+0.656
 possible+0.654
Neg logit contributions
board-0.496
 He-0.430
ops-0.391
 was-0.373
 boy-0.373
Token to
Token ID284
Feature activation+1.88e+01

Pos logit contributions
board+0.249
 He+0.218
ops+0.217
affe+0.213
les+0.196
Neg logit contributions
 such-0.225
 outer-0.217
 unnecessary-0.203
 any-0.200
rous-0.199
Token replace
Token ID6330
Feature activation+1.10e+01

Pos logit contributions
 unnecessary+0.563
 possible+0.505
 outer+0.502
rous+0.470
 such+0.451
Neg logit contributions
board-0.488
 He-0.477
 was-0.460
 boy-0.437
 They-0.401
Token the
Token ID262
Feature activation+2.28e+01

Pos logit contributions
 outer+0.157
 unnecessary+0.151
 such+0.130
 possible+0.128
rous+0.123
Neg logit contributions
board-0.427
 He-0.410
 was-0.397
 boy-0.391
s-0.390
Token ball
Token ID2613
Feature activation-7.66e+00

Pos logit contributions
 such+0.676
 outer+0.669
 unnecessary+0.662
rous+0.625
 possible+0.625
Neg logit contributions
board-0.518
 He-0.444
ops-0.433
jo-0.414
affe-0.404
Token with
Token ID351
Feature activation+4.62e+00

Pos logit contributions
board+0.185
 He+0.159
ops+0.157
affe+0.153
 boy+0.142
Neg logit contributions
 such-0.224
 outer-0.219
 any-0.208
 possible-0.207
 unnecessary-0.203
Token unloaded
Token ID49975
Feature activation+1.81e+01

Pos logit contributions
board+0.029
ops+0.024
 He+0.024
affe+0.023
der+0.023
Neg logit contributions
 such-0.041
 able-0.039
 outer-0.038
 these-0.037
 our-0.037
Token the
Token ID262
Feature activation+2.26e+01

Pos logit contributions
 outer+0.266
 unnecessary+0.255
 such+0.243
rous+0.217
 possible+0.216
Neg logit contributions
board-0.707
 He-0.662
ops-0.624
 was-0.618
 boy-0.615
Token motorcycle
Token ID18757
Feature activation-2.05e+00

Pos logit contributions
 outer+0.686
 such+0.666
 unnecessary+0.665
rous+0.622
 possible+0.618
Neg logit contributions
board-0.540
 He-0.475
ops-0.444
 was-0.416
affe-0.414
Token with
Token ID351
Feature activation+4.99e+00

Pos logit contributions
board+0.039
omo+0.033
affe+0.033
 He+0.033
ops+0.033
Neg logit contributions
 outer-0.065
 such-0.064
 possible-0.060
 any-0.060
 an-0.057
Token ease
Token ID10152
Feature activation-3.41e+00

Pos logit contributions
 unnecessary+0.085
 outer+0.084
 such+0.074
 possible+0.073
rous+0.073
Neg logit contributions
board-0.178
 He-0.173
 was-0.161
 boy-0.158
ops-0.154
Token.
Token ID13
Feature activation-6.72e-01

Pos logit contributions
board+0.067
 He+0.054
ops+0.052
affe+0.051
 Maybe+0.048
Neg logit contributions
 such-0.117
 outer-0.114
 any-0.108
 unnecessary-0.107
 possible-0.106
Token Tim
Token ID5045
Feature activation-1.07e+01

Pos logit contributions
board+0.016
ops+0.016
y+0.015
affe+0.015
les+0.014
Neg logit contributions
 such-0.019
 outer-0.018
 an-0.017
 these-0.017
 our-0.016
Tokenmy
Token ID1820
Feature activation-1.36e+01

Pos logit contributions
board+0.279
der+0.268
ger+0.257
boards+0.249
ops+0.248
Neg logit contributions
 such-0.252
 your-0.223
 our-0.212
 my-0.210
rous-0.208
Token was
Token ID373
Feature activation-1.70e+01

Pos logit contributions
board+0.223
 He+0.175
affe+0.160
ops+0.160
amboo+0.159
Neg logit contributions
 such-0.480
 outer-0.452
 an-0.442
 able-0.441
 our-0.441
Token so
Token ID523
Feature activation+1.45e+01

Pos logit contributions
 was+0.423
 He+0.421
board+0.366
 and+0.366
 wanted+0.365
Neg logit contributions
 unnecessary-0.514
 outer-0.486
rous-0.441
 backup-0.426
 possible-0.406
Token excited
Token ID6568
Feature activation+6.57e+00

Pos logit contributions
 unnecessary+0.393
 outer+0.392
 such+0.356
 possible+0.337
 any+0.334
Neg logit contributions
board-0.391
 He-0.388
 was-0.362
 boy-0.352
s-0.348
Token took
Token ID1718
Feature activation-4.77e+00

Pos logit contributions
board+0.388
 He+0.326
ops+0.316
affe+0.291
s+0.284
Neg logit contributions
 outer-0.533
 such-0.530
 unnecessary-0.517
 possible-0.485
 any-0.482
Token him
Token ID683
Feature activation-1.22e+01

Pos logit contributions
 He+0.147
board+0.139
 was+0.135
 boy+0.128
oc+0.121
Neg logit contributions
 outer-0.109
nam-0.101
rous-0.099
 possible-0.097
 unnecessary-0.096
Token away
Token ID1497
Feature activation+1.11e+00

Pos logit contributions
board+0.386
 He+0.371
s+0.362
 was+0.359
 and+0.341
Neg logit contributions
 outer-0.264
rous-0.235
 unnecessary-0.229
nam-0.226
 such-0.220
Token.
Token ID13
Feature activation-5.80e+00

Pos logit contributions
 outer+0.040
 unnecessary+0.039
 such+0.039
rous+0.037
 possible+0.037
Neg logit contributions
board-0.020
 He-0.017
ops-0.015
 was-0.014
s-0.014
Token The
Token ID383
Feature activation+1.27e+01

Pos logit contributions
board+0.129
ops+0.112
y+0.105
les+0.098
affe+0.098
Neg logit contributions
 such-0.184
 outer-0.174
 unnecessary-0.169
 an-0.162
 backup-0.161
Token lion
Token ID18744
Feature activation-1.46e+01

Pos logit contributions
 outer+0.432
 unnecessary+0.422
 such+0.419
 possible+0.395
rous+0.390
Neg logit contributions
board-0.262
 He-0.221
ops-0.208
s-0.193
 was-0.185
Token's
Token ID338
Feature activation+7.30e-01

Pos logit contributions
board+0.259
 He+0.217
ops+0.194
amboo+0.179
boards+0.177
Neg logit contributions
 such-0.506
 able-0.471
 any-0.468
 unnecessary-0.463
 miles-0.459
Token friends
Token ID2460
Feature activation-5.68e+00

Pos logit contributions
 outer+0.020
 unnecessary+0.019
 such+0.019
 possible+0.018
rous+0.018
Neg logit contributions
board-0.020
 He-0.018
ops-0.017
s-0.016
affe-0.015
Token were
Token ID547
Feature activation-1.50e+01

Pos logit contributions
board+0.110
 He+0.089
ops+0.086
affe+0.081
amboo+0.079
Neg logit contributions
 such-0.198
 any-0.180
 able-0.179
 outer-0.178
 unnecessary-0.176
Token sad
Token ID6507
Feature activation+1.69e+00

Pos logit contributions
 He+0.365
 was+0.355
 boy+0.319
 driver+0.314
 and+0.313
Neg logit contributions
 outer-0.465
 unnecessary-0.407
rous-0.391
nam-0.377
 possible-0.362
Token and
Token ID290
Feature activation-1.66e+01

Pos logit contributions
 any+0.051
 such+0.051
 unnecessary+0.047
 anything+0.045
 possible+0.045
Neg logit contributions
board-0.044
ops-0.037
affe-0.035
ushed-0.034
oc-0.034
Token food
Token ID2057
Feature activation-6.46e+00

Pos logit contributions
 outer+0.425
 unnecessary+0.419
 such+0.414
 possible+0.379
rous+0.373
Neg logit contributions
board-0.579
 He-0.523
ops-0.499
s-0.474
 was-0.473
Token away
Token ID1497
Feature activation+1.39e+00

Pos logit contributions
board+0.190
 He+0.168
ops+0.163
affe+0.154
 boy+0.152
Neg logit contributions
 such-0.155
 outer-0.153
 unnecessary-0.149
 any-0.141
 possible-0.141
Token from
Token ID422
Feature activation+6.26e+00

Pos logit contributions
 such+0.038
 any+0.035
 outer+0.035
 possible+0.034
 our+0.034
Neg logit contributions
board-0.039
ops-0.034
affe-0.031
 He-0.031
ushed-0.030
Token Tim
Token ID5045
Feature activation-1.60e+01

Pos logit contributions
 outer+0.170
 unnecessary+0.165
 such+0.163
rous+0.153
 possible+0.152
Neg logit contributions
board-0.165
 He-0.153
ops-0.138
 was-0.137
 boy-0.136
Token and
Token ID290
Feature activation-1.72e+01

Pos logit contributions
board+0.333
ushed+0.268
 He+0.265
boards+0.252
ops+0.243
Neg logit contributions
 such-0.534
 any-0.495
 our-0.480
 your-0.479
 unnecessary-0.473
Token Sam
Token ID3409
Feature activation-2.57e+01

Pos logit contributions
board+0.410
 He+0.360
ops+0.331
 was+0.316
 boy+0.315
Neg logit contributions
 outer-0.522
 unnecessary-0.507
 such-0.507
rous-0.474
 possible-0.473
Token.
Token ID13
Feature activation-5.75e+00

Pos logit contributions
board+0.487
 He+0.395
ops+0.361
affe+0.336
 boy+0.332
Neg logit contributions
 such-0.897
 outer-0.888
 unnecessary-0.883
 any-0.837
 possible-0.834
Token
Token ID198
Feature activation-3.63e+00

Pos logit contributions
board+0.141
ops+0.120
les+0.116
y+0.112
affe+0.109
Neg logit contributions
 such-0.168
 our-0.154
 an-0.151
 these-0.150
 your-0.148
Token
Token ID198
Feature activation-3.48e+00

Pos logit contributions
board+0.086
ops+0.069
 He+0.068
affe+0.065
 boy+0.064
Neg logit contributions
 such-0.106
 outer-0.105
 unnecessary-0.101
rous-0.098
 possible-0.097
Token"
Token ID1
Feature activation+6.64e+00

Pos logit contributions
board+0.090
 He+0.078
ops+0.074
affe+0.070
 boy+0.070
Neg logit contributions
 outer-0.097
 such-0.095
 unnecessary-0.094
rous-0.089
 possible-0.088
TokenNo
Token ID2949
Feature activation+1.51e+01

Pos logit contributions
 outer+0.174
 such+0.169
 unnecessary+0.169
rous+0.156
 possible+0.155
Neg logit contributions
board-0.188
 He-0.168
ops-0.157
 was-0.150
s-0.150
Token time
Token ID640
Feature activation+5.05e+00

Pos logit contributions
 outer+0.083
 unnecessary+0.080
 such+0.079
rous+0.072
 possible+0.071
Neg logit contributions
board-0.143
 He-0.130
ops-0.124
affe-0.118
s-0.118
Token there
Token ID612
Feature activation+6.02e+00

Pos logit contributions
 outer+0.165
 such+0.162
 unnecessary+0.153
 possible+0.152
 any+0.152
Neg logit contributions
board-0.109
 He-0.087
affe-0.084
ops-0.084
les-0.077
Token was
Token ID373
Feature activation-2.06e+01

Pos logit contributions
 such+0.201
 the+0.187
 an+0.186
 outer+0.175
 a+0.166
Neg logit contributions
affe-0.131
board-0.129
der-0.116
ener-0.108
ly-0.104
Token a
Token ID257
Feature activation+4.42e+00

Pos logit contributions
 He+0.686
board+0.677
 was+0.656
 boy+0.624
 They+0.606
Neg logit contributions
 unnecessary-0.402
 outer-0.352
 backup-0.334
 possible-0.330
 rid-0.319
Token nos
Token ID43630
Feature activation+3.60e+00

Pos logit contributions
 outer+0.115
 unnecessary+0.108
 such+0.105
rous+0.100
 possible+0.099
Neg logit contributions
board-0.124
ops-0.109
 He-0.109
s-0.099
boards-0.098
Tokeny
Token ID88
Feature activation-2.49e+01

Pos logit contributions
 outer+0.119
 such+0.119
 any+0.113
rous+0.108
 possible+0.107
Neg logit contributions
board-0.056
ops-0.053
boards-0.044
ushed-0.044
 He-0.040
Token cat
Token ID3797
Feature activation-1.49e+01

Pos logit contributions
ops+0.643
board+0.616
boards+0.588
s+0.579
ushed+0.579
Neg logit contributions
 outer-0.593
 your-0.534
 our-0.521
 an-0.509
 these-0.505
Token named
Token ID3706
Feature activation+5.26e+00

Pos logit contributions
board+0.415
ops+0.380
boards+0.357
 He+0.353
ushed+0.352
Neg logit contributions
 such-0.364
 outer-0.349
 your-0.321
 an-0.320
 any-0.314
Token Felix
Token ID29721
Feature activation-8.42e+00

Pos logit contributions
 outer+0.154
 unnecessary+0.150
 such+0.146
rous+0.142
 possible+0.141
Neg logit contributions
board-0.130
 He-0.117
ops-0.107
 was-0.104
 boy-0.102
Token.
Token ID13
Feature activation-3.83e+00

Pos logit contributions
board+0.210
ops+0.180
ushed+0.162
boards+0.162
pit+0.156
Neg logit contributions
 such-0.248
 outer-0.238
 any-0.225
 an-0.224
 unnecessary-0.218
Token He
Token ID679
Feature activation-2.19e+01

Pos logit contributions
board+0.077
ops+0.061
ushed+0.060
affe+0.057
oke+0.053
Neg logit contributions
 outer-0.128
 such-0.127
 unnecessary-0.124
 an-0.116
rous-0.114
Token hat
Token ID6877
Feature activation-1.28e+01

Pos logit contributions
board+0.092
 He+0.080
 boy+0.073
ops+0.072
 was+0.070
Neg logit contributions
 outer-0.126
 such-0.122
 unnecessary-0.121
 possible-0.116
rous-0.115
Token and
Token ID290
Feature activation-1.68e+01

Pos logit contributions
board+0.302
 He+0.260
ops+0.246
 boy+0.232
affe+0.231
Neg logit contributions
 outer-0.382
 such-0.375
 unnecessary-0.367
 possible-0.347
 any-0.346
Token his
Token ID465
Feature activation-3.86e+00

Pos logit contributions
board+0.432
 He+0.360
ops+0.360
affe+0.341
y+0.323
Neg logit contributions
 outer-0.485
 such-0.472
 unnecessary-0.466
 any-0.435
rous-0.433
Token bag
Token ID6131
Feature activation-1.82e+00

Pos logit contributions
board+0.087
 He+0.075
ops+0.069
 was+0.065
s+0.065
Neg logit contributions
 outer-0.122
 unnecessary-0.119
 such-0.119
rous-0.111
 possible-0.111
Token.
Token ID13
Feature activation-5.44e+00

Pos logit contributions
board+0.047
ops+0.040
 He+0.039
affe+0.038
oc+0.036
Neg logit contributions
 such-0.050
 outer-0.049
 unnecessary-0.047
 any-0.046
 possible-0.046
Token He
Token ID679
Feature activation-2.38e+01

Pos logit contributions
board+0.079
ops+0.062
y+0.051
affe+0.051
ushed+0.048
Neg logit contributions
 such-0.212
 outer-0.203
 unnecessary-0.200
 any-0.196
 an-0.195
Token walks
Token ID11114
Feature activation+1.11e+01

Pos logit contributions
board+0.555
ops+0.451
 He+0.445
affe+0.425
y+0.412
Neg logit contributions
 outer-0.723
 such-0.720
 unnecessary-0.704
 any-0.670
 our-0.666
Token fast
Token ID3049
Feature activation-3.73e+00

Pos logit contributions
 outer+0.217
 unnecessary+0.210
 such+0.204
rous+0.187
 possible+0.183
Neg logit contributions
board-0.387
 He-0.360
 was-0.333
ops-0.332
s-0.331
Token.
Token ID13
Feature activation-5.27e+00

Pos logit contributions
board+0.062
 He+0.051
ops+0.045
 was+0.041
affe+0.041
Neg logit contributions
 outer-0.140
 such-0.137
 unnecessary-0.137
rous-0.129
 possible-0.129
Token
Token ID198
Feature activation-2.70e+00

Pos logit contributions
board+0.104
ops+0.084
affe+0.077
y+0.074
ushed+0.072
Neg logit contributions
 such-0.180
 outer-0.178
 unnecessary-0.172
 any-0.164
 backup-0.163
Token
Token ID198
Feature activation-2.58e+00

Pos logit contributions
board+0.055
 He+0.045
ops+0.043
affe+0.040
 boy+0.039
Neg logit contributions
 outer-0.091
 such-0.089
 unnecessary-0.088
rous-0.083
 possible-0.083
Token flame
Token ID16506
Feature activation+3.85e+00

Pos logit contributions
board+0.162
 He+0.152
ops+0.152
s+0.142
 Maybe+0.136
Neg logit contributions
 outer-0.179
 such-0.173
 unnecessary-0.158
 our-0.156
 any-0.155
Token outside
Token ID2354
Feature activation+7.84e-01

Pos logit contributions
 outer+0.082
 such+0.080
 possible+0.071
 unnecessary+0.070
 any+0.070
Neg logit contributions
board-0.123
 He-0.114
ops-0.110
affe-0.106
 boy-0.102
Token his
Token ID465
Feature activation-4.71e+00

Pos logit contributions
 such+0.022
 outer+0.021
 any+0.019
 possible+0.018
 unnecessary+0.018
Neg logit contributions
board-0.021
ops-0.019
affe-0.018
 He-0.017
ushed-0.017
Token window
Token ID4324
Feature activation-3.72e+00

Pos logit contributions
board+0.122
 He+0.109
ops+0.102
affe+0.097
 was+0.095
Neg logit contributions
 outer-0.133
 such-0.129
 unnecessary-0.127
rous-0.118
 possible-0.117
Token.
Token ID13
Feature activation-6.55e+00

Pos logit contributions
board+0.077
affe+0.076
ops+0.074
amboo+0.071
oke+0.069
Neg logit contributions
 such-0.114
 outer-0.107
 any-0.099
 an-0.097
 possible-0.096
Token He
Token ID679
Feature activation-2.49e+01

Pos logit contributions
board+0.143
ops+0.133
les+0.133
y+0.131
ies+0.120
Neg logit contributions
 such-0.200
 an-0.187
 outer-0.181
 your-0.175
 these-0.173
Token got
Token ID1392
Feature activation-6.56e+00

Pos logit contributions
board+0.577
ops+0.455
 He+0.445
boards+0.414
affe+0.401
Neg logit contributions
 outer-0.767
 such-0.750
 an-0.712
 unnecessary-0.700
 our-0.698
Token scared
Token ID12008
Feature activation-1.42e+00

Pos logit contributions
 He+0.175
 was+0.166
s+0.152
 wanted+0.149
 boy+0.148
Neg logit contributions
 outer-0.191
 unnecessary-0.182
 possible-0.167
rous-0.161
gary-0.156
Token and
Token ID290
Feature activation-1.75e+01

Pos logit contributions
board+0.027
ops+0.024
ushed+0.022
affe+0.021
oc+0.020
Neg logit contributions
 such-0.052
 any-0.050
 unnecessary-0.047
 outer-0.046
 your-0.046
Token called
Token ID1444
Feature activation-3.77e+00

Pos logit contributions
board+0.350
ops+0.278
 He+0.276
affe+0.262
s+0.241
Neg logit contributions
 outer-0.596
 such-0.595
 unnecessary-0.585
 any-0.554
nam-0.548
Token out
Token ID503
Feature activation-7.14e-01

Pos logit contributions
 He+0.117
board+0.116
 was+0.103
 boy+0.099
 They+0.094
Neg logit contributions
 unnecessary-0.089
rous-0.087
 outer-0.085
 possible-0.077
 such-0.077
Token middle
Token ID3504
Feature activation-1.28e+00

Pos logit contributions
 outer+0.503
 unnecessary+0.492
 such+0.488
 possible+0.455
rous+0.455
Neg logit contributions
board-0.498
 He-0.443
ops-0.404
 was-0.399
 boy-0.398
Token of
Token ID286
Feature activation-4.47e+00

Pos logit contributions
board+0.037
 He+0.034
ops+0.033
affe+0.033
ushed+0.031
Neg logit contributions
 such-0.031
 outer-0.029
 these-0.027
rous-0.027
 any-0.026
Token the
Token ID262
Feature activation+1.84e+01

Pos logit contributions
 was+0.145
 He+0.141
board+0.138
 and+0.132
 boy+0.132
Neg logit contributions
 unnecessary-0.098
 outer-0.075
 possible-0.074
rous-0.072
shadow-0.070
Token huge
Token ID3236
Feature activation+3.48e+00

Pos logit contributions
 outer+0.545
 unnecessary+0.537
 such+0.535
rous+0.499
 possible+0.497
Neg logit contributions
board-0.457
 He-0.398
ops-0.365
 was-0.356
 boy-0.356
Token spark
Token ID9009
Feature activation-6.28e+00

Pos logit contributions
 outer+0.106
 such+0.103
 unnecessary+0.101
rous+0.095
 any+0.094
Neg logit contributions
board-0.081
 He-0.072
ops-0.068
s-0.065
affe-0.064
Tokenly
Token ID306
Feature activation-3.17e+01

Pos logit contributions
 He+0.085
board+0.083
 Maybe+0.071
ushed+0.063
omo+0.061
Neg logit contributions
 outer-0.241
 such-0.240
 unnecessary-0.229
 any-0.228
 these-0.226
Token things
Token ID1243
Feature activation+1.42e+00

Pos logit contributions
board+0.842
 He+0.788
ops+0.742
affe+0.707
s+0.686
Neg logit contributions
 outer-0.870
 such-0.815
 unnecessary-0.786
 possible-0.728
rous-0.728
Token was
Token ID373
Feature activation-2.22e+01

Pos logit contributions
 outer+0.042
 such+0.042
 possible+0.039
 unnecessary+0.039
 any+0.038
Neg logit contributions
board-0.035
 He-0.030
affe-0.029
ops-0.029
oc-0.028
Token a
Token ID257
Feature activation+2.91e+00

Pos logit contributions
board+0.639
 was+0.615
 He+0.598
 wasn+0.536
 wanted+0.535
Neg logit contributions
 unnecessary-0.533
 outer-0.508
nam-0.469
rous-0.467
 such-0.457
Token sparkling
Token ID39072
Feature activation+5.02e+00

Pos logit contributions
 outer+0.054
 such+0.053
 unnecessary+0.053
rous+0.047
 possible+0.046
Neg logit contributions
board-0.102
 He-0.093
 was-0.088
 boy-0.088
ops-0.087
Token wheel
Token ID7825
Feature activation-4.56e+00

Pos logit contributions
 outer+0.134
 such+0.127
 unnecessary+0.123
 possible+0.117
rous+0.116
Neg logit contributions
board-0.136
 He-0.125
ops-0.120
s-0.113
affe-0.112