TOP ACTIVATIONS
MAX = 3.244

clean . The brother was so happy when
searched until she finally found the video . She was so
was so excited to watch the new video . But when
so excited to watch the new video . But when she
took good care of his new tablet so it wouldn 't
Spot happily took the bone and ate it up
smile . After the tablet was fixed , Tim
he had chosen to watch the fun and easy video .
end ! After the video was over , Lu
going off to watch her beloved k itty video . <|endoftext|>
my took good care of his new tablet so it wouldn
they arrived , Tom put the video in the player and
started to g iggle . The water was cold , but
watched . When the video was over , they
She sat down and watched the cartoon . After a while
. He said he loved his mom too . He said
. After the cartoon was over , Lily
car and likes it . The cat jumps on the car
scare her a bit ". The other cat agreed . The
better . When the cartoon finished , her mom

MIN ACTIVATIONS
MIN = -3.325

a little mouse named S que aky . One day ,
his pet mouse , S que aky . One
One sunny day , S que aky came up to Fl
She blinked her eyes and imm ag ined she could see
G iz mo and S que eker , were both small
play ?" S que aky replied , " Yes
, Fl uffy asked S que aky , " Do you
y puppet called P ips que ak . One day P
not touch things that were not his . Tim decided to
. P ips que ak started to regret it
. One day P ips que ak was playing with its
Lily . " It 's an underground tunnel ," said her
Look , mom , isn 't it incredible ?" Mom agreed
Fl uffy and S que aky played for hours until
wins !" S que eker was up for the
in the bathroom and didn 't know what it was .
one more time , and Sil v ie repeated it .
!" " S que ak !" Lily says .
that day forward , Daisy stri ved to be the best
by himself when he heard an enormous noise . He looked

INTERVAL 1.602 - 3.244
CONTAINS 0.402%

alone !" But the dog does not listen . He thinks
saw something moving . She went closer and saw it was
her cabin . One day , while she was playing ,
the bench . Lily 's mom has a bag . She
Oh no !" The bear was sorry . He

INTERVAL -0.040 - 1.602
CONTAINS 52.226%

pumpkin . When he got home , he called for his
They were friends . The moral of the story
tower now ?" Tom looks at the stick .
without asking and to always listen to his mom my .
But then , something strange happened . The stack started to

INTERVAL -1.683 - -0.040
CONTAINS 47.092%

would bow every day for fun . <|endoftext|> Once upon a
picture on his shelf , next to his cars and trucks
, especially in the mud . One day , Tim my
in nature . P ray ing made the little
ily didn 't care about that . She said , "

INTERVAL -3.325 - -1.683
CONTAINS 0.279%

" Tim my , you broke the computer !"
The ice was not strong . It cracked under
to build a nest in a tree . The rob in
. The bird lived in an ordinary tree with its family
overnight . Everyone was so disappointed ! They
Token clean
Token ID3424
Feature activation+8.77e-02

Pos logit contributions
 last+0.782
 Ms+0.694
 teach+0.641
 front+0.634
�+0.626
Neg logit contributions
ust-1.093
nes-1.051
vel-1.048
onel-0.962
oll-0.951
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
 last+0.098
 Ms+0.084
 teach+0.075
 front+0.074
 hugs+0.072
Neg logit contributions
ust-0.183
nes-0.179
vel-0.178
onel-0.164
aces-0.163
Token
Token ID220
Feature activation-2.08e-01

Pos logit contributions
ust+0.590
nes+0.572
vel+0.570
onel+0.530
aces+0.524
Neg logit contributions
 last-0.298
 Ms-0.246
 teach-0.220
 front-0.217
 hugs-0.211
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
ust+0.466
nes+0.454
vel+0.451
onel+0.420
aces+0.416
Neg logit contributions
 last-0.207
 Ms-0.169
 teach-0.148
 front-0.147
 hugs-0.143
Token
Token ID198
Feature activation-4.81e-01

Pos logit contributions
ust+0.274
nes+0.269
vel+0.267
onel+0.247
oll+0.243
Neg logit contributions
 last-0.126
 Ms-0.102
 teach-0.091
 front-0.090
 hugs-0.088
TokenThe
Token ID464
Feature activation+3.24e+00

Pos logit contributions
ust+1.012
nes+0.985
vel+0.978
onel+0.907
aces+0.896
Neg logit contributions
 last-0.555
 Ms-0.470
 front-0.415
 teach-0.415
 hugs-0.408
Token brother
Token ID3956
Feature activation+1.76e-01

Pos logit contributions
ted+5.321
�+4.983
't+4.852
 Ms+4.723
 last+4.714
Neg logit contributions
ust-4.659
vel-4.609
nes-4.233
aky-4.212
oll-4.152
Token was
Token ID373
Feature activation-5.57e-01

Pos logit contributions
 last+0.195
 Ms+0.164
 front+0.147
 teach+0.145
't+0.140
Neg logit contributions
ust-0.369
nes-0.363
vel-0.359
onel-0.331
aces-0.329
Token so
Token ID523
Feature activation+2.03e-01

Pos logit contributions
ust+0.916
nes+0.883
vel+0.875
onel+0.794
aces+0.783
Neg logit contributions
 last-0.896
 Ms-0.799
 front-0.736
 teach-0.735
 hugs-0.726
Token happy
Token ID3772
Feature activation+5.57e-02

Pos logit contributions
 last+0.308
 Ms+0.274
 front+0.252
 teach+0.251
 hugs+0.244
Neg logit contributions
ust-0.356
nes-0.341
vel-0.339
onel-0.312
oll-0.303
Token when
Token ID618
Feature activation+8.99e-01

Pos logit contributions
 last+0.061
 Ms+0.052
 front+0.046
 teach+0.045
 hugs+0.044
Neg logit contributions
ust-0.118
vel-0.115
nes-0.115
onel-0.106
aces-0.105
Token searched
Token ID16499
Feature activation+9.00e-01

Pos logit contributions
ust+1.784
nes+1.708
vel+1.703
aces+1.551
onel+1.549
Neg logit contributions
 last-1.659
 Ms-1.453
 front-1.343
 teach-1.326
 hugs-1.315
Token until
Token ID1566
Feature activation+1.49e-01

Pos logit contributions
 last+1.095
 Ms+1.049
 teach+0.952
�+0.927
't+0.898
Neg logit contributions
ust-1.685
vel-1.637
nes-1.602
onel-1.492
aces-1.492
Token she
Token ID673
Feature activation-4.08e-01

Pos logit contributions
 last+0.152
 Ms+0.131
 teach+0.118
 hugs+0.115
 praise+0.112
Neg logit contributions
ust-0.321
nes-0.318
vel-0.312
onel-0.293
aces-0.285
Token finally
Token ID3443
Feature activation-1.18e-01

Pos logit contributions
ust+0.717
nes+0.694
vel+0.687
onel+0.629
aces+0.618
Neg logit contributions
 last-0.607
 Ms-0.537
 teach-0.491
 front-0.490
 hugs-0.484
Token found
Token ID1043
Feature activation+8.19e-02

Pos logit contributions
ust+0.184
nes+0.177
vel+0.175
onel+0.158
aces+0.154
Neg logit contributions
 last-0.199
 Ms-0.182
 teach-0.167
 front-0.166
 hugs-0.165
Token the
Token ID262
Feature activation+3.24e+00

Pos logit contributions
 last+0.083
 Ms+0.071
 teach+0.062
 hugs+0.060
 front+0.060
Neg logit contributions
ust-0.177
vel-0.174
nes-0.174
onel-0.162
aces-0.161
Token video
Token ID2008
Feature activation-7.71e-01

Pos logit contributions
�+4.660
 Ms+4.530
 last+4.498
ted+4.493
ited+4.247
Neg logit contributions
vel-4.868
ust-4.844
nes-4.599
onel-4.281
aces-4.053
Token.
Token ID13
Feature activation+4.54e-01

Pos logit contributions
ust+1.705
nes+1.653
vel+1.642
onel+1.530
ode+1.513
Neg logit contributions
 last-0.815
 Ms-0.674
 front-0.590
 teach-0.587
 hugs-0.574
Token She
Token ID1375
Feature activation-1.91e-01

Pos logit contributions
 last+0.652
 Ms+0.536
 teach+0.512
 front+0.498
 praise+0.495
Neg logit contributions
ust-0.816
nes-0.792
vel-0.784
aces-0.722
onel-0.717
Token was
Token ID373
Feature activation-7.19e-01

Pos logit contributions
ust+0.399
nes+0.391
vel+0.387
onel+0.358
aces+0.354
Neg logit contributions
 last-0.217
 Ms-0.184
 front-0.162
 teach-0.162
 hugs-0.156
Token so
Token ID523
Feature activation-3.59e-01

Pos logit contributions
ust+1.179
nes+1.135
vel+1.123
onel+1.016
aces+1.007
Neg logit contributions
 last-1.172
 Ms-1.045
 front-0.969
 teach-0.961
 hugs-0.951
Token was
Token ID373
Feature activation-3.34e-01

Pos logit contributions
ust+0.428
nes+0.417
vel+0.413
onel+0.380
aces+0.379
Neg logit contributions
 last-0.248
 Ms-0.212
 front-0.189
 teach-0.187
 hugs-0.182
Token so
Token ID523
Feature activation-2.41e-01

Pos logit contributions
ust+0.584
nes+0.565
vel+0.561
onel+0.511
aces+0.505
Neg logit contributions
 last-0.503
 Ms-0.443
 teach-0.405
 front-0.404
 hugs-0.400
Token excited
Token ID6568
Feature activation+6.11e-01

Pos logit contributions
ust+0.461
nes+0.444
vel+0.442
onel+0.407
oll+0.399
Neg logit contributions
 last-0.325
 Ms-0.284
 front-0.258
 teach-0.256
 hugs-0.251
Token to
Token ID284
Feature activation+3.56e-01

Pos logit contributions
 last+0.711
 Ms+0.624
 front+0.568
 teach+0.559
�+0.557
Neg logit contributions
ust-1.247
nes-1.197
vel-1.196
aces-1.093
oll-1.089
Token watch
Token ID2342
Feature activation+1.38e-01

Pos logit contributions
 last+0.374
 Ms+0.335
�+0.269
 hugs+0.268
 Mrs+0.267
Neg logit contributions
ust-0.746
nes-0.745
vel-0.734
onel-0.672
ode-0.670
Token the
Token ID262
Feature activation+3.18e+00

Pos logit contributions
 last+0.130
 Ms+0.103
�+0.088
 front+0.088
 hugs+0.088
Neg logit contributions
ust-0.309
vel-0.308
nes-0.305
aces-0.282
onel-0.282
Token new
Token ID649
Feature activation+3.12e+00

Pos logit contributions
 last+4.828
�+4.554
 Ms+4.229
 praise+4.032
ited+3.998
Neg logit contributions
vel-5.040
ust-4.608
nes-4.575
aund-4.226
onel-4.036
Token video
Token ID2008
Feature activation-3.45e-01

Pos logit contributions
 last+4.904
�+4.835
 Ms+4.198
ited+4.095
 praise+3.937
Neg logit contributions
vel-4.977
nes-4.624
ust-4.354
aund-4.097
od-3.997
Token.
Token ID13
Feature activation+5.16e-01

Pos logit contributions
ust+0.741
nes+0.723
vel+0.719
onel+0.667
aces+0.661
Neg logit contributions
 last-0.374
 Ms-0.315
 front-0.275
 teach-0.274
 hugs-0.270
Token But
Token ID887
Feature activation-7.52e-02

Pos logit contributions
 last+0.735
 Ms+0.611
 teach+0.573
 praise+0.560
 front+0.559
Neg logit contributions
ust-0.930
nes-0.899
vel-0.882
aces-0.833
onel-0.805
Token when
Token ID618
Feature activation+7.36e-01

Pos logit contributions
ust+0.176
nes+0.172
vel+0.171
onel+0.159
aces+0.158
Neg logit contributions
 last-0.066
 Ms-0.054
 teach-0.046
 hugs-0.045
 front-0.045
Token so
Token ID523
Feature activation-2.41e-01

Pos logit contributions
ust+0.584
nes+0.565
vel+0.561
onel+0.511
aces+0.505
Neg logit contributions
 last-0.503
 Ms-0.443
 teach-0.405
 front-0.404
 hugs-0.400
Token excited
Token ID6568
Feature activation+6.11e-01

Pos logit contributions
ust+0.461
nes+0.444
vel+0.442
onel+0.407
oll+0.399
Neg logit contributions
 last-0.325
 Ms-0.284
 front-0.258
 teach-0.256
 hugs-0.251
Token to
Token ID284
Feature activation+3.56e-01

Pos logit contributions
 last+0.711
 Ms+0.624
 front+0.568
 teach+0.559
�+0.557
Neg logit contributions
ust-1.247
nes-1.197
vel-1.196
aces-1.093
oll-1.089
Token watch
Token ID2342
Feature activation+1.38e-01

Pos logit contributions
 last+0.374
 Ms+0.335
�+0.269
 hugs+0.268
 Mrs+0.267
Neg logit contributions
ust-0.746
nes-0.745
vel-0.734
onel-0.672
ode-0.670
Token the
Token ID262
Feature activation+3.18e+00

Pos logit contributions
 last+0.130
 Ms+0.103
�+0.088
 front+0.088
 hugs+0.088
Neg logit contributions
ust-0.309
vel-0.308
nes-0.305
aces-0.282
onel-0.282
Token new
Token ID649
Feature activation+3.12e+00

Pos logit contributions
 last+4.828
�+4.554
 Ms+4.229
 praise+4.032
ited+3.998
Neg logit contributions
vel-5.040
ust-4.608
nes-4.575
aund-4.226
onel-4.036
Token video
Token ID2008
Feature activation-3.45e-01

Pos logit contributions
 last+4.904
�+4.835
 Ms+4.198
ited+4.095
 praise+3.937
Neg logit contributions
vel-4.977
nes-4.624
ust-4.354
aund-4.097
od-3.997
Token.
Token ID13
Feature activation+5.16e-01

Pos logit contributions
ust+0.741
nes+0.723
vel+0.719
onel+0.667
aces+0.661
Neg logit contributions
 last-0.374
 Ms-0.315
 front-0.275
 teach-0.274
 hugs-0.270
Token But
Token ID887
Feature activation-7.52e-02

Pos logit contributions
 last+0.735
 Ms+0.611
 teach+0.573
 praise+0.560
 front+0.559
Neg logit contributions
ust-0.930
nes-0.899
vel-0.882
aces-0.833
onel-0.805
Token when
Token ID618
Feature activation+7.36e-01

Pos logit contributions
ust+0.176
nes+0.172
vel+0.171
onel+0.159
aces+0.158
Neg logit contributions
 last-0.066
 Ms-0.054
 teach-0.046
 hugs-0.045
 front-0.045
Token she
Token ID673
Feature activation-1.70e-01

Pos logit contributions
 last+0.858
 Ms+0.725
 teach+0.677
 praise+0.675
 hugs+0.664
Neg logit contributions
ust-1.432
nes-1.424
vel-1.392
onel-1.275
aund-1.262
Token took
Token ID1718
Feature activation-7.00e-01

Pos logit contributions
ust+0.749
nes+0.727
vel+0.722
onel+0.665
aces+0.658
Neg logit contributions
 last-0.511
 Ms-0.446
 front-0.401
 teach-0.399
 hugs-0.392
Token good
Token ID922
Feature activation+2.37e-01

Pos logit contributions
ust+1.510
nes+1.464
vel+1.447
onel+1.351
ode+1.335
Neg logit contributions
 last-0.789
 Ms-0.661
 front-0.586
 teach-0.585
 hugs-0.580
Token care
Token ID1337
Feature activation+2.34e-01

Pos logit contributions
 last+0.242
 Ms+0.213
�+0.179
 front+0.176
't+0.174
Neg logit contributions
ust-0.511
vel-0.507
nes-0.497
onel-0.463
aces-0.463
Token of
Token ID286
Feature activation+1.74e-02

Pos logit contributions
 last+0.253
 Ms+0.229
 teach+0.194
 front+0.193
�+0.190
Neg logit contributions
ust-0.495
vel-0.485
nes-0.482
onel-0.451
aces-0.446
Token his
Token ID465
Feature activation+2.98e+00

Pos logit contributions
 last+0.025
 Ms+0.022
 front+0.020
�+0.019
 hugs+0.019
Neg logit contributions
ust-0.031
vel-0.030
nes-0.030
onel-0.028
aces-0.027
Token new
Token ID649
Feature activation+3.09e+00

Pos logit contributions
 last+4.580
�+4.294
 Ms+4.276
't+4.145
glers+3.937
Neg logit contributions
vel-4.480
ust-4.402
nes-4.012
aces-3.932
onel-3.780
Token tablet
Token ID14147
Feature activation-1.94e-01

Pos logit contributions
 last+5.580
�+5.355
 Ms+5.120
't+4.888
ited+4.819
Neg logit contributions
vel-4.153
ust-3.853
nes-3.632
aces-3.418
ying-3.133
Token so
Token ID523
Feature activation-4.36e-02

Pos logit contributions
ust+0.408
nes+0.397
vel+0.395
onel+0.368
aces+0.365
Neg logit contributions
 last-0.214
 Ms-0.184
 front-0.161
 teach-0.160
 hugs-0.157
Token it
Token ID340
Feature activation-5.21e-01

Pos logit contributions
ust+0.084
nes+0.081
vel+0.081
onel+0.075
oll+0.073
Neg logit contributions
 last-0.057
 Ms-0.051
 front-0.046
 teach-0.045
 hugs-0.044
Token wouldn
Token ID3636
Feature activation+1.33e+00

Pos logit contributions
ust+0.921
nes+0.891
vel+0.885
onel+0.807
aces+0.796
Neg logit contributions
 last-0.771
 Ms-0.680
 teach-0.621
 front-0.620
 hugs-0.612
Token't
Token ID470
Feature activation-2.53e-01

Pos logit contributions
 last+0.746
 Ms+0.634
 hugs+0.502
 praise+0.449
 front+0.388
Neg logit contributions
nes-3.509
ust-3.460
vel-3.391
onel-3.245
aces-3.116
Token
Token ID198
Feature activation-4.12e-01

Pos logit contributions
ust+0.843
nes+0.822
vel+0.816
onel+0.760
aces+0.754
Neg logit contributions
 last-0.387
 Ms-0.317
 front-0.277
 teach-0.276
 hugs-0.268
Token
Token ID198
Feature activation-6.08e-01

Pos logit contributions
ust+0.880
nes+0.858
vel+0.851
onel+0.790
aces+0.781
Neg logit contributions
 last-0.457
 Ms-0.384
 front-0.338
 teach-0.338
 hugs-0.330
TokenSpot
Token ID32565
Feature activation+3.72e-01

Pos logit contributions
ust+1.078
nes+1.042
vel+1.035
onel+0.949
aces+0.937
Neg logit contributions
 last-0.900
 Ms-0.798
 teach-0.727
 front-0.726
 hugs-0.715
Token happily
Token ID18177
Feature activation-9.51e-02

Pos logit contributions
 last+0.440
 Ms+0.370
 front+0.338
 teach+0.331
't+0.328
Neg logit contributions
ust-0.745
nes-0.734
vel-0.724
aces-0.668
onel-0.665
Token took
Token ID1718
Feature activation+2.19e-01

Pos logit contributions
ust+0.186
nes+0.182
vel+0.180
onel+0.166
oll+0.164
Neg logit contributions
 last-0.120
 Ms-0.105
 front-0.094
 teach-0.093
't-0.092
Token the
Token ID262
Feature activation+3.08e+00

Pos logit contributions
 last+0.220
 Ms+0.184
 teach+0.162
 front+0.160
't+0.156
Neg logit contributions
ust-0.474
nes-0.466
vel-0.465
onel-0.432
oll-0.431
Token bone
Token ID9970
Feature activation-2.05e-01

Pos logit contributions
 Ms+4.406
 last+4.221
glers+3.947
 praise+3.854
 front+3.725
Neg logit contributions
vel-4.820
oll-4.455
ode-4.201
ust-4.174
ery-4.101
Token and
Token ID290
Feature activation+3.87e-01

Pos logit contributions
ust+0.414
nes+0.402
vel+0.400
onel+0.370
aces+0.365
Neg logit contributions
 last-0.244
 Ms-0.213
 teach-0.189
 front-0.187
 hugs-0.185
Token ate
Token ID15063
Feature activation+1.37e-01

Pos logit contributions
 last+0.491
 Ms+0.422
 front+0.390
't+0.385
 teach+0.384
Neg logit contributions
ust-0.722
nes-0.721
vel-0.706
onel-0.654
oll-0.653
Token it
Token ID340
Feature activation-1.17e-01

Pos logit contributions
 last+0.203
 Ms+0.180
 teach+0.168
 front+0.166
't+0.161
Neg logit contributions
ust-0.232
vel-0.228
nes-0.224
onel-0.204
oll-0.202
Token up
Token ID510
Feature activation+1.88e-01

Pos logit contributions
ust+0.216
nes+0.210
vel+0.209
onel+0.191
aces+0.190
Neg logit contributions
 last-0.159
 Ms-0.142
 teach-0.129
 front-0.128
 hugs-0.125
Token smile
Token ID8212
Feature activation+5.31e-01

Pos logit contributions
 last+0.338
 Ms+0.314
 front+0.278
�+0.277
't+0.274
Neg logit contributions
ust-0.389
vel-0.376
nes-0.373
onel-0.341
aces-0.336
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
 last+0.675
 Ms+0.610
 front+0.536
�+0.533
 praise+0.530
Neg logit contributions
ust-1.024
vel-1.002
nes-0.972
onel-0.910
aces-0.900
Token
Token ID198
Feature activation+5.50e-02

Pos logit contributions
 last+0.316
 Ms+0.260
 teach+0.242
 front+0.239
 praise+0.225
Neg logit contributions
ust-0.543
nes-0.523
vel-0.518
onel-0.479
oll-0.477
Token
Token ID198
Feature activation+5.88e-01

Pos logit contributions
 last+0.079
 Ms+0.069
 teach+0.064
 front+0.063
 hugs+0.062
Neg logit contributions
ust-0.098
nes-0.096
vel-0.095
onel-0.086
oll-0.085
TokenAfter
Token ID3260
Feature activation-5.34e-02

Pos logit contributions
 last+0.734
 Ms+0.638
 hugs+0.579
 teach+0.568
 front+0.567
Neg logit contributions
ust-1.202
vel-1.148
nes-1.140
onel-1.026
ode-1.014
Token the
Token ID262
Feature activation+3.05e+00

Pos logit contributions
ust+0.109
vel+0.108
nes+0.107
onel+0.099
oll+0.098
Neg logit contributions
 last-0.061
 Ms-0.052
 front-0.046
 teach-0.046
 hugs-0.045
Token tablet
Token ID14147
Feature activation-5.38e-01

Pos logit contributions
�+4.627
 Ms+4.613
 last+4.525
etchup+4.128
ted+4.092
Neg logit contributions
ust-4.464
vel-4.408
nes-4.222
aund-3.987
oll-3.965
Token was
Token ID373
Feature activation-4.12e-01

Pos logit contributions
ust+1.167
nes+1.135
vel+1.128
onel+1.049
aces+1.038
Neg logit contributions
 last-0.578
 Ms-0.484
 teach-0.423
 front-0.423
 hugs-0.414
Token fixed
Token ID5969
Feature activation+4.29e-02

Pos logit contributions
ust+0.737
nes+0.713
vel+0.708
onel+0.647
aces+0.638
Neg logit contributions
 last-0.601
 Ms-0.530
 teach-0.483
 front-0.483
 hugs-0.476
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
 last+0.040
 Ms+0.034
 front+0.029
 teach+0.029
 hugs+0.028
Neg logit contributions
ust-0.098
vel-0.095
nes-0.095
onel-0.089
aces-0.088
Token Tim
Token ID5045
Feature activation+8.81e-02

Pos logit contributions
 last+0.312
 Ms+0.272
 hugs+0.249
 teach+0.247
 front+0.245
Neg logit contributions
ust-0.495
nes-0.470
vel-0.468
onel-0.432
ode-0.426
Token he
Token ID339
Feature activation+4.51e-01

Pos logit contributions
ust+0.376
nes+0.365
vel+0.363
onel+0.338
aces+0.334
Neg logit contributions
 last-0.184
 Ms-0.154
 front-0.136
 teach-0.134
 hugs-0.134
Token had
Token ID550
Feature activation-4.23e-01

Pos logit contributions
 last+0.658
 Ms+0.589
 front+0.545
 teach+0.526
 praise+0.522
Neg logit contributions
nes-0.792
ust-0.792
vel-0.760
onel-0.699
aces-0.688
Token chosen
Token ID7147
Feature activation+2.60e-01

Pos logit contributions
ust+0.898
nes+0.875
vel+0.868
onel+0.805
aces+0.796
Neg logit contributions
 last-0.479
 Ms-0.406
 front-0.357
 teach-0.356
 hugs-0.350
Token to
Token ID284
Feature activation+2.99e-01

Pos logit contributions
 last+0.262
 Ms+0.220
�+0.192
 front+0.189
 teach+0.186
Neg logit contributions
ust-0.567
nes-0.553
vel-0.549
onel-0.509
aces-0.507
Token watch
Token ID2342
Feature activation+3.80e-01

Pos logit contributions
 last+0.360
 Ms+0.326
 hugs+0.279
 front+0.276
�+0.273
Neg logit contributions
ust-0.588
nes-0.582
vel-0.571
onel-0.520
ode-0.515
Token the
Token ID262
Feature activation+3.00e+00

Pos logit contributions
 last+0.401
 Ms+0.335
�+0.296
 hugs+0.293
 front+0.293
Neg logit contributions
ust-0.802
vel-0.793
nes-0.782
onel-0.724
aces-0.722
Token fun
Token ID1257
Feature activation+5.48e-01

Pos logit contributions
 last+4.400
 Ms+4.038
�+3.770
 hugs+3.738
 front+3.674
Neg logit contributions
vel-4.708
ust-4.666
nes-4.399
onel-4.100
aund-4.041
Token and
Token ID290
Feature activation+4.27e-01

Pos logit contributions
 last+0.685
 Ms+0.590
 front+0.543
 hugs+0.524
�+0.523
Neg logit contributions
ust-1.057
nes-1.039
vel-1.033
onel-0.935
aces-0.923
Token easy
Token ID2562
Feature activation+1.31e+00

Pos logit contributions
 last+0.494
 Ms+0.439
 front+0.389
�+0.386
't+0.367
Neg logit contributions
ust-0.863
nes-0.848
vel-0.827
onel-0.770
aces-0.759
Token video
Token ID2008
Feature activation-2.51e-01

Pos logit contributions
 last+1.775
 Ms+1.569
 front+1.458
�+1.421
 hugs+1.385
Neg logit contributions
vel-2.342
ust-2.302
nes-2.285
onel-2.048
aces-2.036
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
ust+0.531
nes+0.518
vel+0.514
onel+0.477
aces+0.473
Neg logit contributions
 last-0.279
 Ms-0.238
 front-0.209
 teach-0.207
 hugs-0.204
Token end
Token ID886
Feature activation-2.56e-02

Pos logit contributions
 last+0.435
 Ms+0.396
 hugs+0.358
 front+0.342
�+0.342
Neg logit contributions
ust-0.647
nes-0.638
vel-0.622
onel-0.566
ode-0.557
Token!
Token ID0
Feature activation-6.68e-01

Pos logit contributions
ust+0.055
nes+0.053
vel+0.053
onel+0.049
aces+0.049
Neg logit contributions
 last-0.028
 Ms-0.023
 teach-0.020
 front-0.020
 hugs-0.020
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
ust+1.552
nes+1.511
vel+1.506
onel+1.407
ode+1.393
Neg logit contributions
 last-0.627
 Ms-0.514
 front-0.436
 teach-0.431
 hugs-0.428
Token
Token ID198
Feature activation-6.71e-01

Pos logit contributions
ust+0.232
nes+0.227
vel+0.225
onel+0.206
oll+0.203
Neg logit contributions
 last-0.148
 Ms-0.128
 teach-0.115
 front-0.114
 hugs-0.111
TokenAfter
Token ID3260
Feature activation+1.61e-01

Pos logit contributions
ust+1.377
nes+1.337
vel+1.329
onel+1.235
aces+1.226
Neg logit contributions
 last-0.807
 Ms-0.689
 front-0.615
 teach-0.611
 hugs-0.600
Token the
Token ID262
Feature activation+2.99e+00

Pos logit contributions
 last+0.174
 Ms+0.146
 teach+0.127
 front+0.127
 hugs+0.124
Neg logit contributions
ust-0.334
vel-0.333
nes-0.331
onel-0.307
oll-0.304
Token video
Token ID2008
Feature activation-2.81e-01

Pos logit contributions
 last+4.327
 Ms+4.216
�+3.984
ted+3.896
 Mrs+3.764
Neg logit contributions
vel-4.580
nes-4.396
ust-4.294
aund-4.069
onel-3.944
Token was
Token ID373
Feature activation-2.57e-01

Pos logit contributions
ust+0.619
nes+0.606
vel+0.600
onel+0.561
aces+0.554
Neg logit contributions
 last-0.283
 Ms-0.236
 front-0.206
 teach-0.204
 hugs-0.200
Token over
Token ID625
Feature activation+1.56e+00

Pos logit contributions
ust+0.443
nes+0.428
vel+0.424
onel+0.386
aces+0.380
Neg logit contributions
 last-0.389
 Ms-0.347
 teach-0.319
 front-0.317
 hugs-0.314
Token,
Token ID11
Feature activation+1.06e-01

Pos logit contributions
 last+1.683
 Ms+1.538
 teach+1.435
 praise+1.316
 hugs+1.301
Neg logit contributions
ust-3.055
vel-3.034
nes-2.949
onel-2.809
adow-2.803
Token Lu
Token ID6026
Feature activation+2.78e-01

Pos logit contributions
 last+0.151
 Ms+0.132
 teach+0.124
 front+0.121
 hugs+0.120
Neg logit contributions
ust-0.190
nes-0.185
vel-0.181
onel-0.166
aces-0.163
Token going
Token ID1016
Feature activation-3.83e-01

Pos logit contributions
 last+1.065
 Ms+0.897
 teach+0.825
 hugs+0.803
 praise+0.790
Neg logit contributions
vel-1.709
ust-1.704
nes-1.663
onel-1.520
aces-1.507
Token off
Token ID572
Feature activation+5.35e-01

Pos logit contributions
ust+0.712
nes+0.692
vel+0.686
onel+0.630
aces+0.622
Neg logit contributions
 last-0.529
 Ms-0.467
 teach-0.419
 front-0.418
 hugs-0.412
Token to
Token ID284
Feature activation+1.52e-01

Pos logit contributions
 last+0.632
 Ms+0.549
 teach+0.487
 front+0.474
 praise+0.468
Neg logit contributions
vel-1.053
nes-1.053
ust-1.052
onel-0.965
aces-0.958
Token watch
Token ID2342
Feature activation+2.98e-01

Pos logit contributions
 last+0.158
 Ms+0.135
 front+0.113
�+0.112
 hugs+0.112
Neg logit contributions
ust-0.323
nes-0.323
vel-0.318
onel-0.296
aces-0.288
Token her
Token ID607
Feature activation+2.39e+00

Pos logit contributions
 last+0.366
 Ms+0.304
 front+0.275
 hugs+0.273
 teach+0.272
Neg logit contributions
ust-0.579
vel-0.576
nes-0.572
onel-0.520
aces-0.520
Token beloved
Token ID14142
Feature activation+2.98e+00

Pos logit contributions
 last+3.271
 Ms+2.899
�+2.796
't+2.723
 front+2.684
Neg logit contributions
ust-4.201
nes-4.037
vel-3.985
onel-3.685
oll-3.549
Token k
Token ID479
Feature activation+1.84e+00

Pos logit contributions
 last+4.632
 Ms+3.949
 praise+3.746
Grab+3.739
ited+3.737
Neg logit contributions
vel-4.487
nes-4.290
ust-4.215
onel-3.975
aund-3.775
Tokenitty
Token ID9760
Feature activation+1.08e+00

Pos logit contributions
 last+2.958
 Ms+2.561
 praise+2.355
 teach+2.299
 hugs+2.299
Neg logit contributions
nes-3.227
ust-3.224
vel-3.222
onel-2.850
oll-2.771
Token video
Token ID2008
Feature activation-3.32e-01

Pos logit contributions
 last+1.359
 Ms+1.206
 front+1.079
't+1.054
�+1.054
Neg logit contributions
vel-2.018
ust-2.008
nes-1.962
onel-1.843
aces-1.803
Token.
Token ID13
Feature activation+7.83e-01

Pos logit contributions
ust+0.715
nes+0.698
vel+0.693
onel+0.644
aces+0.638
Neg logit contributions
 last-0.356
 Ms-0.299
 front-0.262
 teach-0.260
 hugs-0.256
Token<|endoftext|>
Token ID50256
Feature activation+5.79e-02

Pos logit contributions
 last+0.994
 Ms+0.773
 teach+0.760
 front+0.745
 first+0.716
Neg logit contributions
ust-1.530
nes-1.492
vel-1.462
oll-1.339
onel-1.338
Tokenmy
Token ID1820
Feature activation-3.90e-01

Pos logit contributions
ust+1.665
nes+1.632
vel+1.619
onel+1.522
aces+1.504
Neg logit contributions
 last-0.532
 Ms-0.408
 teach-0.332
 front-0.331
 hugs-0.324
Token took
Token ID1718
Feature activation-7.00e-01

Pos logit contributions
ust+0.749
nes+0.727
vel+0.722
onel+0.665
aces+0.658
Neg logit contributions
 last-0.511
 Ms-0.446
 front-0.401
 teach-0.399
 hugs-0.392
Token good
Token ID922
Feature activation+2.37e-01

Pos logit contributions
ust+1.510
nes+1.464
vel+1.447
onel+1.351
ode+1.335
Neg logit contributions
 last-0.789
 Ms-0.661
 front-0.586
 teach-0.585
 hugs-0.580
Token care
Token ID1337
Feature activation+2.34e-01

Pos logit contributions
 last+0.242
 Ms+0.213
�+0.179
 front+0.176
't+0.174
Neg logit contributions
ust-0.511
vel-0.507
nes-0.497
onel-0.463
aces-0.463
Token of
Token ID286
Feature activation+1.74e-02

Pos logit contributions
 last+0.253
 Ms+0.229
 teach+0.194
 front+0.193
�+0.190
Neg logit contributions
ust-0.495
vel-0.485
nes-0.482
onel-0.451
aces-0.446
Token his
Token ID465
Feature activation+2.98e+00

Pos logit contributions
 last+0.025
 Ms+0.022
 front+0.020
�+0.019
 hugs+0.019
Neg logit contributions
ust-0.031
vel-0.030
nes-0.030
onel-0.028
aces-0.027
Token new
Token ID649
Feature activation+3.09e+00

Pos logit contributions
 last+4.580
�+4.294
 Ms+4.276
't+4.145
glers+3.937
Neg logit contributions
vel-4.480
ust-4.402
nes-4.012
aces-3.932
onel-3.780
Token tablet
Token ID14147
Feature activation-1.94e-01

Pos logit contributions
 last+5.580
�+5.355
 Ms+5.120
't+4.888
ited+4.819
Neg logit contributions
vel-4.153
ust-3.853
nes-3.632
aces-3.418
ying-3.133
Token so
Token ID523
Feature activation-4.36e-02

Pos logit contributions
ust+0.408
nes+0.397
vel+0.395
onel+0.368
aces+0.365
Neg logit contributions
 last-0.214
 Ms-0.184
 front-0.161
 teach-0.160
 hugs-0.157
Token it
Token ID340
Feature activation-5.21e-01

Pos logit contributions
ust+0.084
nes+0.081
vel+0.081
onel+0.075
oll+0.073
Neg logit contributions
 last-0.057
 Ms-0.051
 front-0.046
 teach-0.045
 hugs-0.044
Token wouldn
Token ID3636
Feature activation+1.33e+00

Pos logit contributions
ust+0.921
nes+0.891
vel+0.885
onel+0.807
aces+0.796
Neg logit contributions
 last-0.771
 Ms-0.680
 teach-0.621
 front-0.620
 hugs-0.612
Token they
Token ID484
Feature activation-8.65e-01

Pos logit contributions
 last+0.206
 Ms+0.177
 teach+0.159
 hugs+0.156
 front+0.154
Neg logit contributions
ust-0.365
nes-0.362
vel-0.353
onel-0.323
oll-0.319
Token arrived
Token ID5284
Feature activation-1.21e+00

Pos logit contributions
ust+1.766
vel+1.715
nes+1.705
aces+1.581
onel+1.578
Neg logit contributions
 last-1.082
 Ms-0.922
 front-0.819
 teach-0.818
 hugs-0.807
Token,
Token ID11
Feature activation+4.58e-01

Pos logit contributions
ust+2.868
nes+2.785
vel+2.774
ode+2.608
aces+2.597
Neg logit contributions
 last-1.148
 Ms-0.908
 front-0.799
 hugs-0.780
 teach-0.757
Token Tom
Token ID4186
Feature activation+3.50e-01

Pos logit contributions
 last+0.552
 Ms+0.466
 teach+0.426
 hugs+0.424
 front+0.423
Neg logit contributions
ust-0.914
nes-0.892
vel-0.868
onel-0.793
aund-0.789
Token put
Token ID1234
Feature activation-2.72e-01

Pos logit contributions
 last+0.437
 Ms+0.372
 front+0.339
 teach+0.335
�+0.329
Neg logit contributions
ust-0.679
nes-0.673
vel-0.659
onel-0.606
aces-0.602
Token the
Token ID262
Feature activation+2.97e+00

Pos logit contributions
ust+0.588
nes+0.574
vel+0.572
onel+0.532
aces+0.529
Neg logit contributions
 last-0.290
 Ms-0.245
 teach-0.215
 front-0.211
 hugs-0.207
Token video
Token ID2008
Feature activation-1.05e+00

Pos logit contributions
 Ms+4.311
 last+4.310
 Mrs+3.823
�+3.800
 Together+3.746
Neg logit contributions
vel-4.560
nes-4.362
ust-4.177
onel-4.005
aund-3.945
Token in
Token ID287
Feature activation+1.00e+00

Pos logit contributions
ust+1.889
nes+1.811
vel+1.806
onel+1.642
ode+1.627
Neg logit contributions
 last-1.563
 Ms-1.330
 front-1.247
 hugs-1.228
 teach-1.226
Token the
Token ID262
Feature activation+7.48e-01

Pos logit contributions
 last+1.114
 Ms+0.985
 teach+0.882
 hugs+0.863
't+0.856
Neg logit contributions
ust-1.967
vel-1.962
nes-1.944
aces-1.767
onel-1.733
Token player
Token ID2137
Feature activation-6.34e-01

Pos logit contributions
 last+1.145
 Ms+1.052
't+0.967
 teach+0.936
 praise+0.916
Neg logit contributions
ust-1.162
nes-1.154
vel-1.139
onel-1.048
ode-1.031
Token and
Token ID290
Feature activation-5.29e-03

Pos logit contributions
ust+1.209
nes+1.168
vel+1.159
onel+1.068
aces+1.051
Neg logit contributions
 last-0.851
 Ms-0.738
 front-0.670
 teach-0.669
 hugs-0.658
Token started
Token ID2067
Feature activation-2.01e-02

Pos logit contributions
ust+0.369
nes+0.361
vel+0.358
onel+0.330
aces+0.326
Neg logit contributions
 last-0.224
 Ms-0.192
 front-0.172
 teach-0.172
 hugs-0.166
Token to
Token ID284
Feature activation+5.51e-01

Pos logit contributions
ust+0.042
nes+0.042
vel+0.042
onel+0.038
aces+0.037
Neg logit contributions
 last-0.022
 Ms-0.019
 teach-0.016
 front-0.016
 hugs-0.016
Token g
Token ID308
Feature activation-1.27e+00

Pos logit contributions
 last+0.725
 Ms+0.645
 front+0.559
�+0.557
't+0.556
Neg logit contributions
ust-1.020
nes-1.009
vel-1.005
ode-0.907
onel-0.896
Tokeniggle
Token ID24082
Feature activation-5.50e-01

Pos logit contributions
ust+1.771
nes+1.738
vel+1.690
ode+1.549
aces+1.527
Neg logit contributions
 last-2.290
 Ms-2.051
 teach-1.939
 hugs-1.933
 front-1.931
Token.
Token ID13
Feature activation+8.29e-02

Pos logit contributions
ust+1.207
nes+1.175
vel+1.166
onel+1.087
aces+1.075
Neg logit contributions
 last-0.578
 Ms-0.482
 front-0.420
 teach-0.420
 hugs-0.411
Token The
Token ID383
Feature activation+2.95e+00

Pos logit contributions
 last+0.104
 Ms+0.085
 teach+0.080
 front+0.078
 hugs+0.075
Neg logit contributions
ust-0.165
nes-0.160
vel-0.159
onel-0.147
oll-0.145
Token water
Token ID1660
Feature activation+6.09e-01

Pos logit contributions
�+4.417
ted+4.392
 last+4.267
't+4.249
 Ms+4.234
Neg logit contributions
ust-4.349
vel-4.148
nes-4.145
onel-3.830
aky-3.820
Token was
Token ID373
Feature activation+3.74e-01

Pos logit contributions
 last+0.747
 Ms+0.669
 teach+0.598
 front+0.586
�+0.572
Neg logit contributions
ust-1.175
nes-1.167
vel-1.132
onel-1.067
aces-1.042
Token cold
Token ID4692
Feature activation-6.87e-05

Pos logit contributions
 last+0.532
 Ms+0.481
 teach+0.443
 hugs+0.435
�+0.430
Neg logit contributions
ust-0.667
vel-0.639
nes-0.638
onel-0.580
oll-0.565
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
ust+0.000
vel+0.000
nes+0.000
onel+0.000
aces+0.000
Neg logit contributions
 last-0.000
 Ms-0.000
 teach-0.000
 front-0.000
 hugs-0.000
Token but
Token ID475
Feature activation+2.44e-01

Pos logit contributions
ust+0.025
nes+0.025
vel+0.025
onel+0.023
aces+0.022
Neg logit contributions
 last-0.018
 Ms-0.016
 teach-0.014
 front-0.014
 hugs-0.014
Token watched
Token ID7342
Feature activation+3.04e-02

Pos logit contributions
ust+0.135
nes+0.132
vel+0.130
onel+0.120
aces+0.119
Neg logit contributions
 last-0.084
 Ms-0.073
 front-0.065
 teach-0.065
't-0.064
Token.
Token ID13
Feature activation+5.35e-01

Pos logit contributions
 last+0.031
 Ms+0.025
 front+0.022
 teach+0.021
 hugs+0.021
Neg logit contributions
ust-0.067
vel-0.066
nes-0.065
onel-0.061
aces-0.060
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
 last+0.664
 Ms+0.544
 teach+0.518
 front+0.500
 praise+0.480
Neg logit contributions
ust-1.060
vel-1.023
nes-1.022
onel-0.932
aces-0.930
Token
Token ID198
Feature activation+3.92e-01

Pos logit contributions
 last+0.245
 Ms+0.205
 teach+0.188
 front+0.183
 hugs+0.180
Neg logit contributions
ust-0.439
nes-0.430
vel-0.426
onel-0.388
oll-0.384
TokenWhen
Token ID2215
Feature activation+3.99e-01

Pos logit contributions
 last+0.485
 Ms+0.408
 hugs+0.376
 teach+0.369
 front+0.362
Neg logit contributions
ust-0.810
vel-0.782
nes-0.781
onel-0.695
ode-0.683
Token the
Token ID262
Feature activation+2.95e+00

Pos logit contributions
 last+0.474
 Ms+0.417
 teach+0.376
 front+0.371
 hugs+0.369
Neg logit contributions
ust-0.781
nes-0.763
vel-0.758
onel-0.689
aund-0.680
Token video
Token ID2008
Feature activation-1.70e-01

Pos logit contributions
 Ms+4.493
 last+4.436
 Mrs+3.963
�+3.774
 praise+3.761
Neg logit contributions
vel-4.032
nes-4.029
ust-3.963
aund-3.779
oll-3.755
Token was
Token ID373
Feature activation-3.34e-01

Pos logit contributions
ust+0.354
nes+0.347
vel+0.342
onel+0.319
aces+0.314
Neg logit contributions
 last-0.190
 Ms-0.161
 front-0.143
 teach-0.142
 hugs-0.140
Token over
Token ID625
Feature activation+1.19e+00

Pos logit contributions
ust+0.612
nes+0.592
vel+0.588
onel+0.537
aces+0.530
Neg logit contributions
 last-0.469
 Ms-0.414
 teach-0.377
 front-0.374
 hugs-0.371
Token,
Token ID11
Feature activation+8.91e-01

Pos logit contributions
 last+1.243
 Ms+1.167
 teach+1.039
 hugs+0.974
 praise+0.944
Neg logit contributions
ust-2.432
vel-2.369
nes-2.331
aund-2.170
onel-2.161
Token they
Token ID484
Feature activation-8.15e-01

Pos logit contributions
 last+1.059
 Ms+0.901
 hugs+0.835
 teach+0.830
 front+0.828
Neg logit contributions
ust-1.760
nes-1.699
vel-1.685
onel-1.544
aund-1.530
Token She
Token ID1375
Feature activation-8.06e-01

Pos logit contributions
ust+2.087
nes+2.037
vel+2.021
onel+1.882
ode+1.861
Neg logit contributions
 last-1.133
 Ms-0.993
 front-0.872
 teach-0.855
 hugs-0.837
Token sat
Token ID3332
Feature activation-2.55e-01

Pos logit contributions
ust+1.659
nes+1.613
vel+1.609
onel+1.487
aces+1.472
Neg logit contributions
 last-0.974
 Ms-0.831
 front-0.736
 teach-0.734
 hugs-0.732
Token down
Token ID866
Feature activation+1.21e+00

Pos logit contributions
ust+0.427
nes+0.413
vel+0.411
onel+0.374
aces+0.368
Neg logit contributions
 last-0.400
 Ms-0.356
 teach-0.329
 front-0.325
 hugs-0.322
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
 last+1.645
 Ms+1.553
 teach+1.427
 hugs+1.386
 praise+1.385
Neg logit contributions
vel-2.017
nes-2.013
ust-2.007
onel-1.815
aces-1.805
Token watched
Token ID7342
Feature activation+7.29e-02

Pos logit contributions
ust+0.421
nes+0.411
vel+0.407
onel+0.375
aces+0.368
Neg logit contributions
 last-0.276
 Ms-0.236
 front-0.214
 teach-0.213
 hugs-0.207
Token the
Token ID262
Feature activation+2.94e+00

Pos logit contributions
 last+0.092
 Ms+0.079
 front+0.071
 teach+0.070
 hugs+0.070
Neg logit contributions
ust-0.141
vel-0.139
nes-0.138
onel-0.126
aces-0.125
Token cartoon
Token ID16251
Feature activation-3.28e-01

Pos logit contributions
 last+4.114
�+3.856
't+3.469
 Ms+3.426
 praise+3.249
Neg logit contributions
vel-4.793
ust-4.741
nes-4.376
oll-4.162
onel-4.097
Token.
Token ID13
Feature activation-2.22e-01

Pos logit contributions
ust+0.670
nes+0.653
vel+0.650
onel+0.600
aces+0.594
Neg logit contributions
 last-0.388
 Ms-0.332
 front-0.294
 teach-0.293
 hugs-0.289
Token After
Token ID2293
Feature activation+5.96e-01

Pos logit contributions
ust+0.451
nes+0.438
vel+0.437
onel+0.403
aces+0.401
Neg logit contributions
 last-0.267
 Ms-0.225
 teach-0.202
 front-0.200
 hugs-0.198
Token a
Token ID257
Feature activation+8.25e-01

Pos logit contributions
 last+0.618
 Ms+0.522
 teach+0.458
 front+0.444
 hugs+0.441
Neg logit contributions
vel-1.248
ust-1.237
nes-1.227
onel-1.140
oll-1.137
Token while
Token ID981
Feature activation+8.69e-01

Pos logit contributions
 last+0.739
 Ms+0.621
 hugs+0.577
 front+0.558
 praise+0.529
Neg logit contributions
ust-1.831
nes-1.808
vel-1.791
ode-1.600
onel-1.594
Token.
Token ID13
Feature activation-2.39e-01

Pos logit contributions
ust+0.038
vel+0.037
nes+0.037
onel+0.034
aces+0.034
Neg logit contributions
 last-0.021
 Ms-0.018
 front-0.016
 teach-0.016
 hugs-0.016
Token He
Token ID679
Feature activation-4.68e-03

Pos logit contributions
ust+0.499
nes+0.484
vel+0.480
onel+0.445
aces+0.442
Neg logit contributions
 last-0.279
 Ms-0.232
 teach-0.209
 front-0.207
 hugs-0.203
Token said
Token ID531
Feature activation-7.59e-01

Pos logit contributions
ust+0.009
nes+0.009
vel+0.009
onel+0.008
oll+0.008
Neg logit contributions
 last-0.006
 Ms-0.005
 front-0.004
 teach-0.004
't-0.004
Token he
Token ID339
Feature activation+3.67e-01

Pos logit contributions
ust+1.756
nes+1.707
vel+1.693
onel+1.581
ode+1.567
Neg logit contributions
 last-0.728
 Ms-0.599
 teach-0.513
 hugs-0.505
 front-0.504
Token loved
Token ID6151
Feature activation+3.90e-01

Pos logit contributions
 last+0.510
 Ms+0.462
 front+0.427
�+0.404
 teach+0.402
Neg logit contributions
ust-0.678
nes-0.649
vel-0.641
onel-0.591
aces-0.588
Token his
Token ID465
Feature activation+2.92e+00

Pos logit contributions
 last+0.518
 Ms+0.444
�+0.414
 front+0.407
 teach+0.403
Neg logit contributions
ust-0.722
nes-0.705
vel-0.703
aces-0.647
onel-0.637
Token mom
Token ID1995
Feature activation-2.79e-01

Pos logit contributions
 last+4.499
�+4.381
 Ms+4.294
ted+4.266
ited+4.252
Neg logit contributions
ust-4.111
nes-4.043
vel-3.901
aces-3.608
oll-3.429
Token too
Token ID1165
Feature activation+4.58e-01

Pos logit contributions
ust+0.586
nes+0.568
vel+0.565
onel+0.523
aces+0.521
Neg logit contributions
 last-0.315
 Ms-0.270
 front-0.238
 teach-0.237
 hugs-0.231
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
 last+0.532
 Ms+0.477
 teach+0.420
 hugs+0.410
 praise+0.408
Neg logit contributions
ust-0.932
vel-0.900
nes-0.885
aces-0.822
oll-0.817
Token He
Token ID679
Feature activation+7.80e-02

Pos logit contributions
ust+0.213
nes+0.205
vel+0.203
onel+0.188
aces+0.188
Neg logit contributions
 last-0.121
 Ms-0.100
 teach-0.091
 front-0.090
 hugs-0.088
Token said
Token ID531
Feature activation-6.30e-01

Pos logit contributions
 last+0.090
 Ms+0.078
 front+0.071
't+0.068
 teach+0.068
Neg logit contributions
ust-0.161
nes-0.156
vel-0.153
onel-0.142
oll-0.142
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
 last+0.664
 Ms+0.611
 teach+0.527
 praise+0.524
�+0.518
Neg logit contributions
ust-1.110
vel-1.083
nes-1.075
aces-0.990
oll-0.980
Token
Token ID220
Feature activation-2.00e-01

Pos logit contributions
 last+0.379
 Ms+0.302
 teach+0.291
 front+0.285
 praise+0.272
Neg logit contributions
ust-0.608
nes-0.592
vel-0.587
onel-0.538
oll-0.536
Token
Token ID198
Feature activation+5.26e-02

Pos logit contributions
ust+0.449
nes+0.438
vel+0.434
onel+0.405
aces+0.402
Neg logit contributions
 last-0.200
 Ms-0.163
 teach-0.143
 front-0.142
 hugs-0.138
Token
Token ID198
Feature activation+8.29e-02

Pos logit contributions
 last+0.056
 Ms+0.045
 teach+0.041
 front+0.040
 hugs+0.039
Neg logit contributions
ust-0.114
nes-0.113
vel-0.111
onel-0.102
aces-0.101
TokenAfter
Token ID3260
Feature activation+2.32e-01

Pos logit contributions
 last+0.094
 Ms+0.078
 teach+0.070
 hugs+0.070
 front+0.069
Neg logit contributions
ust-0.178
nes-0.174
vel-0.172
onel-0.156
aces-0.153
Token the
Token ID262
Feature activation+2.91e+00

Pos logit contributions
 last+0.245
 Ms+0.206
 teach+0.181
 front+0.179
 hugs+0.174
Neg logit contributions
vel-0.484
ust-0.483
nes-0.482
onel-0.444
oll-0.443
Token cartoon
Token ID16251
Feature activation+1.32e-01

Pos logit contributions
 last+4.074
 Ms+4.066
ted+3.788
�+3.779
glers+3.733
Neg logit contributions
vel-4.321
nes-4.142
ust-4.113
oll-4.086
aund-3.796
Token was
Token ID373
Feature activation-2.97e-01

Pos logit contributions
 last+0.127
 Ms+0.105
 front+0.093
 teach+0.090
 hugs+0.088
Neg logit contributions
ust-0.290
nes-0.289
vel-0.285
onel-0.266
oll-0.263
Token over
Token ID625
Feature activation+1.39e+00

Pos logit contributions
ust+0.547
nes+0.530
vel+0.526
onel+0.481
aces+0.475
Neg logit contributions
 last-0.413
 Ms-0.363
 teach-0.331
 front-0.329
 hugs-0.324
Token,
Token ID11
Feature activation+3.77e-01

Pos logit contributions
 last+1.467
 Ms+1.332
 teach+1.236
 front+1.122
 hugs+1.117
Neg logit contributions
ust-2.744
vel-2.717
nes-2.710
onel-2.528
 penn-2.515
Token Lily
Token ID20037
Feature activation-8.75e-02

Pos logit contributions
 last+0.479
 Ms+0.400
 front+0.378
 teach+0.374
 hugs+0.369
Neg logit contributions
ust-0.718
nes-0.709
vel-0.696
onel-0.637
oll-0.626
Token car
Token ID1097
Feature activation+3.96e-01

Pos logit contributions
 last+1.724
 Ms+1.648
 praise+1.422
 hugs+1.416
 Mrs+1.398
Neg logit contributions
vel-1.797
ust-1.792
aces-1.748
nes-1.737
ode-1.689
Token and
Token ID290
Feature activation+2.15e+00

Pos logit contributions
 last+0.463
 Ms+0.429
 teach+0.359
 hugs+0.351
 praise+0.351
Neg logit contributions
ust-0.793
vel-0.774
nes-0.760
aces-0.713
onel-0.712
Token likes
Token ID7832
Feature activation+5.90e-02

Pos logit contributions
 last+3.546
�+3.043
 Ms+2.928
 praise+2.766
 teach+2.736
Neg logit contributions
vel-3.178
ust-3.157
aces-2.997
nes-2.851
onel-2.746
Token it
Token ID340
Feature activation+3.32e-01

Pos logit contributions
 last+0.076
 Ms+0.066
 front+0.059
 teach+0.059
�+0.059
Neg logit contributions
ust-0.112
nes-0.108
vel-0.108
aces-0.100
onel-0.100
Token.
Token ID13
Feature activation+6.52e-01

Pos logit contributions
 last+0.377
 Ms+0.349
�+0.297
 teach+0.295
 front+0.293
Neg logit contributions
ust-0.689
vel-0.661
nes-0.652
aces-0.613
onel-0.611
Token The
Token ID383
Feature activation+2.91e+00

Pos logit contributions
 last+1.020
 Ms+0.850
 teach+0.801
 front+0.797
 praise+0.785
Neg logit contributions
ust-1.073
vel-1.030
nes-1.002
aces-0.959
onel-0.943
Token cat
Token ID3797
Feature activation+4.57e-01

Pos logit contributions
 last+5.111
 Ms+4.931
�+4.831
ted+4.786
ited+4.683
Neg logit contributions
aces-3.306
vel-3.167
 wing-3.119
ust-3.103
aky-2.886
Token jumps
Token ID18045
Feature activation+2.33e-01

Pos logit contributions
 last+0.614
 Ms+0.529
 front+0.482
�+0.480
 teach+0.479
Neg logit contributions
ust-0.853
vel-0.820
nes-0.806
aces-0.759
onel-0.756
Token on
Token ID319
Feature activation+7.17e-01

Pos logit contributions
 last+0.354
 Ms+0.317
 teach+0.295
 praise+0.287
�+0.285
Neg logit contributions
ust-0.396
vel-0.386
nes-0.382
onel-0.348
aces-0.344
Token the
Token ID262
Feature activation+1.91e-01

Pos logit contributions
 last+0.835
 Ms+0.721
 teach+0.645
 praise+0.643
 hugs+0.634
Neg logit contributions
vel-1.389
ust-1.386
nes-1.343
aces-1.251
onel-1.243
Token car
Token ID1097
Feature activation-5.99e-02

Pos logit contributions
 last+0.290
 Ms+0.264
 teach+0.238
�+0.234
 praise+0.233
Neg logit contributions
ust-0.314
nes-0.309
vel-0.305
aces-0.285
ode-0.277
Token scare
Token ID19437
Feature activation+4.57e-01

Pos logit contributions
 last+0.436
 Ms+0.385
 front+0.328
 hugs+0.322
�+0.314
Neg logit contributions
ust-0.813
nes-0.800
vel-0.778
onel-0.718
ode-0.717
Token her
Token ID607
Feature activation-5.87e-01

Pos logit contributions
 last+0.655
 Ms+0.579
�+0.538
 front+0.522
 teach+0.521
Neg logit contributions
ust-0.796
nes-0.789
vel-0.777
onel-0.708
aces-0.685
Token a
Token ID257
Feature activation-4.20e-01

Pos logit contributions
ust+1.099
nes+1.065
vel+1.057
onel+0.971
aces+0.959
Neg logit contributions
 last-0.808
 Ms-0.704
 front-0.639
 teach-0.638
 hugs-0.628
Token bit
Token ID1643
Feature activation+5.52e-01

Pos logit contributions
ust+0.832
nes+0.807
vel+0.802
onel+0.740
aces+0.729
Neg logit contributions
 last-0.529
 Ms-0.457
 front-0.408
 teach-0.408
 hugs-0.402
Token".
Token ID1911
Feature activation+3.14e-01

Pos logit contributions
 last+0.810
 Ms+0.752
�+0.692
 front+0.678
 teach+0.677
Neg logit contributions
ust-0.970
nes-0.921
vel-0.905
onel-0.829
ode-0.793
Token The
Token ID383
Feature activation+2.90e+00

Pos logit contributions
 last+0.344
 Ms+0.273
 front+0.251
 teach+0.250
 hugs+0.239
Neg logit contributions
ust-0.659
nes-0.648
vel-0.637
onel-0.592
aces-0.588
Token other
Token ID584
Feature activation+2.20e+00

Pos logit contributions
 last+4.780
�+4.452
ted+4.228
ited+4.137
 Ms+4.069
Neg logit contributions
ust-4.109
nes-3.913
vel-3.794
aces-3.506
onel-3.468
Token cat
Token ID3797
Feature activation-6.33e-02

Pos logit contributions
 last+3.608
�+3.091
 Ms+2.995
 praise+2.976
 front+2.958
Neg logit contributions
ust-3.562
vel-3.252
nes-3.239
aces-3.082
onel-2.907
Token agreed
Token ID4987
Feature activation-9.35e-02

Pos logit contributions
ust+0.129
nes+0.125
vel+0.124
onel+0.115
aces+0.114
Neg logit contributions
 last-0.076
 Ms-0.065
 front-0.059
 teach-0.057
 hugs-0.056
Token.
Token ID13
Feature activation+5.75e-01

Pos logit contributions
ust+0.200
nes+0.194
vel+0.194
onel+0.177
aces+0.177
Neg logit contributions
 last-0.102
 Ms-0.087
 front-0.076
 teach-0.075
 hugs-0.074
Token The
Token ID383
Feature activation+2.57e+00

Pos logit contributions
 last+0.754
 Ms+0.608
 teach+0.584
 front+0.578
 praise+0.550
Neg logit contributions
ust-1.107
nes-1.062
vel-1.053
aces-0.974
onel-0.970
Token better
Token ID1365
Feature activation-5.25e-01

Pos logit contributions
 last+0.778
 Ms+0.692
�+0.638
 teach+0.637
 front+0.633
Neg logit contributions
ust-0.945
vel-0.919
nes-0.911
onel-0.859
oll-0.831
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
ust+1.158
nes+1.127
vel+1.119
onel+1.043
aces+1.031
Neg logit contributions
 last-0.552
 Ms-0.460
 front-0.401
 teach-0.400
 hugs-0.391
Token
Token ID198
Feature activation-6.02e-01

Pos logit contributions
ust+0.955
nes+0.928
vel+0.922
onel+0.854
aces+0.844
Neg logit contributions
 last-0.554
 Ms-0.473
 teach-0.420
 front-0.420
 hugs-0.412
Token
Token ID198
Feature activation-7.66e-01

Pos logit contributions
ust+1.392
nes+1.355
vel+1.346
onel+1.261
aces+1.248
Neg logit contributions
 last-0.565
 Ms-0.461
 front-0.393
 teach-0.391
 hugs-0.382
TokenWhen
Token ID2215
Feature activation+6.75e-02

Pos logit contributions
ust+1.567
nes+1.520
vel+1.503
onel+1.408
aces+1.393
Neg logit contributions
 last-0.928
 Ms-0.799
 front-0.719
 teach-0.713
 hugs-0.691
Token the
Token ID262
Feature activation+2.90e+00

Pos logit contributions
 last+0.070
 Ms+0.058
 hugs+0.051
 teach+0.051
 front+0.050
Neg logit contributions
ust-0.144
nes-0.142
vel-0.141
onel-0.130
oll-0.128
Token cartoon
Token ID16251
Feature activation-1.66e-01

Pos logit contributions
 last+3.712
�+3.472
 Ms+3.399
't+3.131
ted+3.083
Neg logit contributions
ust-4.686
oll-4.627
vel-4.595
nes-4.548
onel-4.154
Token finished
Token ID5201
Feature activation+8.11e-02

Pos logit contributions
ust+0.354
nes+0.348
vel+0.344
onel+0.320
aces+0.316
Neg logit contributions
 last-0.178
 Ms-0.149
 front-0.131
 teach-0.130
 hugs-0.128
Token,
Token ID11
Feature activation+3.39e-01

Pos logit contributions
 last+0.077
 Ms+0.062
 front+0.053
 teach+0.053
 hugs+0.052
Neg logit contributions
ust-0.183
vel-0.180
nes-0.180
onel-0.166
aces-0.165
Token her
Token ID607
Feature activation+6.07e-01

Pos logit contributions
 last+0.406
 Ms+0.336
 hugs+0.314
 teach+0.305
 front+0.305
Neg logit contributions
ust-0.674
vel-0.659
nes-0.657
onel-0.601
aces-0.591
Token mom
Token ID1995
Feature activation+4.87e-01

Pos logit contributions
 last+0.846
 Ms+0.772
't+0.687
 teach+0.686
�+0.674
Neg logit contributions
ust-1.059
nes-1.036
vel-1.005
oll-0.955
onel-0.931
Token a
Token ID257
Feature activation+1.31e-01

Pos logit contributions
ust+0.086
nes+0.083
vel+0.082
onel+0.076
ode+0.076
Neg logit contributions
 last-0.043
 Ms-0.036
 front-0.032
 hugs-0.031
 teach-0.031
Token little
Token ID1310
Feature activation-1.93e-01

Pos logit contributions
 last+0.164
 Ms+0.140
 front+0.127
 hugs+0.126
 teach+0.125
Neg logit contributions
ust-0.256
nes-0.248
vel-0.247
onel-0.226
ode-0.224
Token mouse
Token ID10211
Feature activation-3.06e-02

Pos logit contributions
ust+0.347
nes+0.335
vel+0.333
onel+0.306
ode+0.301
Neg logit contributions
 last-0.275
 Ms-0.243
 front-0.221
 teach-0.219
't-0.217
Token named
Token ID3706
Feature activation+4.41e-01

Pos logit contributions
ust+0.058
nes+0.057
vel+0.056
onel+0.052
aces+0.051
Neg logit contributions
 last-0.040
 Ms-0.035
 front-0.032
 teach-0.031
 hugs-0.031
Token S
Token ID311
Feature activation-6.39e-01

Pos logit contributions
 last+0.499
 Ms+0.396
 front+0.375
 praise+0.365
 teach+0.360
Neg logit contributions
ust-0.925
nes-0.906
vel-0.889
onel-0.825
aces-0.820
Tokenque
Token ID4188
Feature activation-3.33e+00

Pos logit contributions
ust+1.040
nes+1.007
vel+0.997
onel+0.902
aces+0.886
Neg logit contributions
 last-1.039
 Ms-0.923
 teach-0.851
 front-0.848
 hugs-0.839
Tokenaky
Token ID15492
Feature activation+1.00e+00

Pos logit contributions
nes+3.088
ust+2.991
vel+2.557
aund+2.418
onel+2.347
Neg logit contributions
 last-8.075
 first-7.127
 Ms-7.106
 front-6.964
't-6.811
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
 last+1.204
 Ms+1.098
�+0.977
't+0.971
 praise+0.948
Neg logit contributions
ust-1.954
vel-1.871
nes-1.857
aces-1.778
ode-1.726
Token One
Token ID1881
Feature activation+5.58e-02

Pos logit contributions
 last+0.299
 Ms+0.238
 teach+0.220
 hugs+0.219
 front+0.219
Neg logit contributions
ust-0.546
nes-0.530
vel-0.525
aces-0.490
oll-0.486
Token day
Token ID1110
Feature activation+1.19e+00

Pos logit contributions
 last+0.074
 Ms+0.067
 teach+0.060
 front+0.060
 hugs+0.060
Neg logit contributions
ust-0.105
nes-0.101
vel-0.099
onel-0.092
oll-0.091
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 last+1.338
 Ms+1.109
 praise+1.047
 teach+1.042
 front+1.028
Neg logit contributions
vel-2.413
ust-2.402
nes-2.358
oll-2.232
aund-2.226
Token his
Token ID465
Feature activation+8.26e-02

Pos logit contributions
ust+0.648
nes+0.628
vel+0.625
onel+0.574
aces+0.569
Neg logit contributions
 last-0.471
 Ms-0.410
 teach-0.372
 front-0.370
 hugs-0.363
Token pet
Token ID4273
Feature activation+3.53e-02

Pos logit contributions
 last+0.135
 Ms+0.125
 teach+0.113
 front+0.112
't+0.111
Neg logit contributions
ust-0.130
nes-0.124
vel-0.123
onel-0.112
aces-0.112
Token mouse
Token ID10211
Feature activation-2.05e-01

Pos logit contributions
 last+0.047
 Ms+0.041
 teach+0.037
 front+0.037
't+0.036
Neg logit contributions
ust-0.066
nes-0.064
vel-0.063
onel-0.059
aces-0.058
Token,
Token ID11
Feature activation-4.48e-01

Pos logit contributions
ust+0.430
nes+0.417
vel+0.416
onel+0.386
aces+0.384
Neg logit contributions
 last-0.232
 Ms-0.198
 front-0.174
 teach-0.173
 hugs-0.168
Token S
Token ID311
Feature activation-4.87e-01

Pos logit contributions
ust+0.945
nes+0.919
vel+0.912
onel+0.847
aces+0.837
Neg logit contributions
 last-0.511
 Ms-0.432
 front-0.382
 teach-0.381
 hugs-0.374
Tokenque
Token ID4188
Feature activation-3.31e+00

Pos logit contributions
ust+0.781
nes+0.753
vel+0.746
onel+0.674
aces+0.664
Neg logit contributions
 last-0.803
 Ms-0.719
 front-0.663
 teach-0.662
 hugs-0.655
Tokenaky
Token ID15492
Feature activation+5.93e-01

Pos logit contributions
ust+3.017
nes+2.972
aund+2.519
vel+2.375
onel+2.314
Neg logit contributions
 last-7.976
 Ms-7.128
 first-7.089
 front-7.039
 while-6.904
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
 last+0.689
 Ms+0.602
 front+0.527
�+0.520
 teach+0.518
Neg logit contributions
ust-1.201
nes-1.191
vel-1.170
aces-1.106
onel-1.083
Token
Token ID220
Feature activation-7.45e-01

Pos logit contributions
ust+0.233
nes+0.226
vel+0.225
aces+0.209
onel+0.208
Neg logit contributions
 last-0.104
 Ms-0.083
 front-0.074
 teach-0.074
 hugs-0.072
Token
Token ID198
Feature activation-3.57e-02

Pos logit contributions
ust+1.738
nes+1.699
vel+1.691
onel+1.583
ode+1.570
Neg logit contributions
 last-0.671
 Ms-0.544
 front-0.456
 teach-0.450
 hugs-0.441
TokenOne
Token ID3198
Feature activation+3.81e-01

Pos logit contributions
ust+0.078
nes+0.077
vel+0.076
onel+0.070
aces+0.069
Neg logit contributions
 last-0.036
 Ms-0.030
 teach-0.027
 hugs-0.026
 front-0.026
TokenOne
Token ID3198
Feature activation+3.81e-01

Pos logit contributions
ust+0.078
nes+0.077
vel+0.076
onel+0.070
aces+0.069
Neg logit contributions
 last-0.036
 Ms-0.030
 teach-0.027
 hugs-0.026
 front-0.026
Token sunny
Token ID27737
Feature activation-2.11e-02

Pos logit contributions
 last+0.520
 Ms+0.483
 teach+0.442
�+0.442
 hugs+0.438
Neg logit contributions
ust-0.691
nes-0.649
vel-0.644
oll-0.596
adow-0.595
Token day
Token ID1110
Feature activation+9.72e-01

Pos logit contributions
ust+0.039
nes+0.038
vel+0.038
onel+0.035
aces+0.034
Neg logit contributions
 last-0.028
 Ms-0.025
 teach-0.023
 front-0.022
 hugs-0.022
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
 last+1.049
 Ms+0.885
 teach+0.822
 praise+0.816
 front+0.806
Neg logit contributions
ust-2.003
vel-1.990
nes-1.947
oll-1.840
aund-1.823
Token S
Token ID311
Feature activation-3.06e-01

Pos logit contributions
 last+1.753
 praise+1.509
 teach+1.474
 Ms+1.465
 front+1.437
Neg logit contributions
ust-2.284
nes-2.227
vel-2.164
oll-2.059
aces-2.012
Tokenque
Token ID4188
Feature activation-3.11e+00

Pos logit contributions
ust+0.478
nes+0.457
vel+0.454
onel+0.410
aces+0.406
Neg logit contributions
 last-0.517
 Ms-0.467
 front-0.433
 teach-0.431
 hugs-0.426
Tokenaky
Token ID15492
Feature activation+2.59e-01

Pos logit contributions
ust+2.622
nes+2.613
vel+2.191
aund+2.068
ode+1.894
Neg logit contributions
 last-7.628
 Ms-6.854
 first-6.789
't-6.663
 while-6.571
Token came
Token ID1625
Feature activation+1.15e+00

Pos logit contributions
 last+0.357
 Ms+0.310
 front+0.285
 teach+0.284
�+0.280
Neg logit contributions
ust-0.476
nes-0.469
vel-0.457
onel-0.419
aces-0.415
Token up
Token ID510
Feature activation+2.08e+00

Pos logit contributions
 last+1.558
 Ms+1.484
�+1.442
 teach+1.352
 praise+1.331
Neg logit contributions
ust-2.125
vel-1.976
nes-1.962
onel-1.819
adow-1.789
Token to
Token ID284
Feature activation+1.09e+00

Pos logit contributions
 last+2.933
 Ms+2.700
 hugs+2.643
 praise+2.607
 teach+2.560
Neg logit contributions
ust-3.579
vel-3.530
nes-3.330
adow-3.005
onel-2.975
Token Fl
Token ID1610
Feature activation-1.77e-01

Pos logit contributions
 last+1.219
 Ms+0.993
�+0.940
 praise+0.905
 hugs+0.898
Neg logit contributions
nes-2.151
vel-2.086
ust-2.084
ode-1.884
aund-1.851
Token She
Token ID1375
Feature activation-3.36e-01

Pos logit contributions
ust+0.387
nes+0.377
vel+0.373
onel+0.344
aces+0.343
Neg logit contributions
 last-0.227
 Ms-0.189
 teach-0.171
 front-0.170
 hugs-0.166
Token blinked
Token ID40360
Feature activation-2.38e-02

Pos logit contributions
ust+0.701
nes+0.681
vel+0.675
onel+0.625
aces+0.618
Neg logit contributions
 last-0.389
 Ms-0.331
 teach-0.294
 front-0.293
 hugs-0.286
Token her
Token ID607
Feature activation-5.34e-01

Pos logit contributions
ust+0.043
vel+0.042
nes+0.042
onel+0.038
aces+0.038
Neg logit contributions
 last-0.034
 Ms-0.030
 teach-0.027
 front-0.027
�-0.027
Token eyes
Token ID2951
Feature activation+1.25e+00

Pos logit contributions
ust+0.809
nes+0.778
vel+0.770
onel+0.691
aces+0.679
Neg logit contributions
 last-0.938
 Ms-0.843
 front-0.784
 teach-0.782
 hugs-0.774
Token and
Token ID290
Feature activation+1.53e-01

Pos logit contributions
 last+1.509
 Ms+1.464
 praise+1.266
 teach+1.258
 hugs+1.230
Neg logit contributions
ust-2.421
vel-2.406
nes-2.259
aces-2.162
onel-2.124
Token imm
Token ID2296
Feature activation-3.11e+00

Pos logit contributions
 last+0.201
 Ms+0.175
�+0.159
 front+0.158
 teach+0.157
Neg logit contributions
ust-0.289
nes-0.280
vel-0.279
onel-0.256
aces-0.251
Tokenag
Token ID363
Feature activation-1.34e+00

Pos logit contributions
vel+5.013
nes+4.832
ode+4.630
ust+4.558
onel+4.477
Neg logit contributions
 last-5.273
 Ms-4.405
 front-4.291
 first-4.240
 teach-4.118
Tokenined
Token ID1389
Feature activation-5.18e-01

Pos logit contributions
ust+2.056
nes+2.033
vel+1.967
ode+1.806
onel+1.763
Neg logit contributions
 last-2.302
 Ms-2.016
 front-1.924
 hugs-1.880
't-1.878
Token she
Token ID673
Feature activation+5.87e-01

Pos logit contributions
ust+1.049
nes+1.020
vel+1.011
onel+0.935
aces+0.924
Neg logit contributions
 last-0.639
 Ms-0.549
 front-0.490
 teach-0.489
 hugs-0.481
Token could
Token ID714
Feature activation+4.12e-01

Pos logit contributions
 last+0.784
 Ms+0.720
�+0.642
 front+0.630
 teach+0.630
Neg logit contributions
ust-1.103
nes-1.062
vel-1.056
onel-0.981
aces-0.959
Token see
Token ID766
Feature activation-3.76e-01

Pos logit contributions
 last+0.498
 Ms+0.468
 hugs+0.409
�+0.395
 Mrs+0.387
Neg logit contributions
ust-0.807
nes-0.797
vel-0.773
onel-0.721
aces-0.702
Token G
Token ID402
Feature activation-7.50e-01

Pos logit contributions
ust+2.236
vel+2.182
nes+2.163
onel+2.037
aces+1.991
Neg logit contributions
 last-1.122
 Ms-0.958
 teach-0.830
 front-0.800
 hugs-0.796
Tokeniz
Token ID528
Feature activation-2.16e-01

Pos logit contributions
ust+1.126
nes+1.087
vel+1.077
onel+0.967
aces+0.943
Neg logit contributions
 last-1.308
 Ms-1.165
 front-1.087
 teach-1.083
 hugs-1.070
Tokenmo
Token ID5908
Feature activation+3.28e-02

Pos logit contributions
ust+0.427
nes+0.416
vel+0.412
onel+0.381
aces+0.378
Neg logit contributions
 last-0.269
 Ms-0.236
 front-0.210
 teach-0.209
 hugs-0.205
Token and
Token ID290
Feature activation-6.01e-01

Pos logit contributions
 last+0.039
 Ms+0.035
 front+0.031
 teach+0.030
 hugs+0.030
Neg logit contributions
ust-0.066
nes-0.064
vel-0.063
aces-0.059
onel-0.059
Token S
Token ID311
Feature activation-8.58e-01

Pos logit contributions
ust+1.303
nes+1.266
vel+1.261
onel+1.170
aces+1.156
Neg logit contributions
 last-0.663
 Ms-0.560
 teach-0.490
 front-0.488
 hugs-0.479
Tokenque
Token ID4188
Feature activation-3.10e+00

Pos logit contributions
ust+1.341
nes+1.306
vel+1.292
onel+1.156
aces+1.127
Neg logit contributions
 last-1.460
 Ms-1.285
 teach-1.204
 front-1.186
 hugs-1.180
Tokeneker
Token ID28233
Feature activation+4.93e-01

Pos logit contributions
ust+2.767
nes+2.703
vel+2.334
aund+2.232
oll+2.092
Neg logit contributions
 last-7.556
 first-6.709
 Ms-6.615
 front-6.399
't-6.357
Token,
Token ID11
Feature activation-2.54e-01

Pos logit contributions
 last+0.513
 Ms+0.465
 front+0.389
 hugs+0.379
 teach+0.376
Neg logit contributions
ust-1.049
nes-1.018
vel-1.012
aces-0.964
onel-0.950
Token were
Token ID547
Feature activation+5.92e-01

Pos logit contributions
ust+0.481
nes+0.467
vel+0.462
onel+0.426
aces+0.421
Neg logit contributions
 last-0.342
 Ms-0.297
 front-0.271
 teach-0.269
 hugs-0.265
Token both
Token ID1111
Feature activation+3.68e-01

Pos logit contributions
 last+0.837
 Ms+0.722
 front+0.671
 hugs+0.670
�+0.670
Neg logit contributions
ust-1.064
nes-1.052
vel-1.029
onel-0.943
aces-0.939
Token small
Token ID1402
Feature activation+8.70e-02

Pos logit contributions
 last+0.541
 Ms+0.484
 hugs+0.439
 front+0.435
 teach+0.435
Neg logit contributions
ust-0.642
nes-0.630
vel-0.618
onel-0.570
aces-0.566
Token play
Token ID711
Feature activation-3.86e-01

Pos logit contributions
 last+0.300
 Ms+0.250
 front+0.219
�+0.215
 hugs+0.211
Neg logit contributions
ust-0.630
nes-0.628
vel-0.605
ode-0.568
onel-0.567
Token?"
Token ID1701
Feature activation-3.51e-01

Pos logit contributions
ust+0.842
nes+0.822
vel+0.818
onel+0.758
aces+0.750
Neg logit contributions
 last-0.408
 Ms-0.341
 front-0.297
 teach-0.294
 hugs-0.290
Token
Token ID198
Feature activation+4.56e-01

Pos logit contributions
ust+0.780
nes+0.762
vel+0.756
onel+0.704
aces+0.698
Neg logit contributions
 last-0.358
 Ms-0.293
 front-0.255
 teach-0.254
 hugs-0.249
Token
Token ID198
Feature activation+3.14e-01

Pos logit contributions
 last+0.514
 Ms+0.418
 teach+0.378
 front+0.377
 hugs+0.377
Neg logit contributions
ust-0.950
nes-0.942
vel-0.923
oll-0.844
onel-0.838
TokenS
Token ID50
Feature activation+3.02e-01

Pos logit contributions
 last+0.473
 Ms+0.405
 hugs+0.376
 praise+0.371
 front+0.369
Neg logit contributions
ust-0.550
nes-0.543
vel-0.533
ode-0.477
aces-0.471
Tokenque
Token ID4188
Feature activation-3.08e+00

Pos logit contributions
 last+0.469
 Ms+0.421
 front+0.392
 teach+0.390
 hugs+0.381
Neg logit contributions
ust-0.514
vel-0.492
nes-0.487
ode-0.442
oll-0.441
Tokenaky
Token ID15492
Feature activation-3.82e-01

Pos logit contributions
nes+2.679
ust+2.548
vel+2.219
ode+2.022
onel+1.974
Neg logit contributions
 last-7.462
 Ms-6.757
 first-6.680
't-6.572
 front-6.407
Token replied
Token ID8712
Feature activation+3.94e-01

Pos logit contributions
ust+0.828
nes+0.807
vel+0.800
onel+0.743
aces+0.736
Neg logit contributions
 last-0.408
 Ms-0.342
 front-0.300
 teach-0.298
 hugs-0.292
Token,
Token ID11
Feature activation+5.59e-01

Pos logit contributions
 last+0.391
 Ms+0.316
 front+0.283
 teach+0.271
 praise+0.271
Neg logit contributions
ust-0.871
vel-0.867
nes-0.854
onel-0.802
aces-0.793
Token "
Token ID366
Feature activation+1.06e+00

Pos logit contributions
 last+0.764
 Ms+0.627
 praise+0.596
 front+0.587
 hugs+0.582
Neg logit contributions
ust-1.019
vel-1.015
nes-0.991
aces-0.919
onel-0.909
TokenYes
Token ID5297
Feature activation+7.20e-01

Pos logit contributions
 last+1.078
 Ms+0.873
 front+0.790
 hugs+0.785
 praise+0.764
Neg logit contributions
ust-2.436
nes-2.381
vel-2.259
ode-2.126
onel-2.093
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 last+1.338
 Ms+1.109
 praise+1.047
 teach+1.042
 front+1.028
Neg logit contributions
vel-2.413
ust-2.402
nes-2.358
oll-2.232
aund-2.226
Token Fl
Token ID1610
Feature activation-4.25e-01

Pos logit contributions
 last+1.833
 praise+1.572
 hugs+1.508
 teach+1.505
 front+1.477
Neg logit contributions
ust-2.424
nes-2.380
vel-2.306
oll-2.167
aund-2.109
Tokenuffy
Token ID15352
Feature activation+3.38e-01

Pos logit contributions
ust+0.714
nes+0.688
vel+0.683
onel+0.621
aces+0.611
Neg logit contributions
 last-0.671
 Ms-0.598
 front-0.549
 teach-0.549
 hugs-0.542
Token asked
Token ID1965
Feature activation-3.69e-01

Pos logit contributions
 last+0.471
 Ms+0.410
 front+0.375
 teach+0.373
 praise+0.368
Neg logit contributions
ust-0.618
nes-0.604
vel-0.593
onel-0.544
ode-0.538
Token S
Token ID311
Feature activation-2.20e-01

Pos logit contributions
ust+0.758
nes+0.736
vel+0.733
onel+0.678
aces+0.671
Neg logit contributions
 last-0.441
 Ms-0.375
 front-0.336
 teach-0.335
 hugs-0.328
Tokenque
Token ID4188
Feature activation-3.08e+00

Pos logit contributions
ust+0.350
nes+0.334
vel+0.332
onel+0.300
aces+0.296
Neg logit contributions
 last-0.364
 Ms-0.328
 front-0.306
 teach-0.302
 hugs-0.300
Tokenaky
Token ID15492
Feature activation+7.26e-01

Pos logit contributions
nes+2.914
ust+2.710
vel+2.277
aund+2.260
onel+2.195
Neg logit contributions
 last-7.370
 first-6.569
 Ms-6.505
 front-6.432
't-6.358
Token,
Token ID11
Feature activation+1.96e-01

Pos logit contributions
 last+0.820
 Ms+0.705
�+0.679
 front+0.650
 praise+0.610
Neg logit contributions
ust-1.490
vel-1.437
nes-1.408
onel-1.329
aces-1.321
Token "
Token ID366
Feature activation+7.32e-01

Pos logit contributions
 last+0.246
 Ms+0.207
 front+0.190
 hugs+0.189
 praise+0.188
Neg logit contributions
ust-0.380
vel-0.374
nes-0.364
onel-0.341
aces-0.336
TokenDo
Token ID5211
Feature activation+7.45e-02

Pos logit contributions
 last+0.929
 Ms+0.790
 hugs+0.732
 praise+0.731
 teach+0.724
Neg logit contributions
ust-1.438
vel-1.402
nes-1.390
ode-1.317
onel-1.265
Token you
Token ID345
Feature activation+6.84e-01

Pos logit contributions
 last+0.096
 Ms+0.084
 front+0.076
 teach+0.074
 hugs+0.073
Neg logit contributions
ust-0.146
nes-0.141
vel-0.139
aces-0.127
onel-0.127
Tokeny
Token ID88
Feature activation-3.20e-01

Pos logit contributions
nes+2.398
ust+2.390
vel+2.366
onel+2.144
ode+2.112
Neg logit contributions
 last-2.146
 Ms-1.883
 front-1.697
 hugs-1.686
 teach-1.684
Token puppet
Token ID30095
Feature activation+3.94e-01

Pos logit contributions
ust+0.628
nes+0.610
vel+0.606
onel+0.558
aces+0.550
Neg logit contributions
 last-0.404
 Ms-0.349
 front-0.313
 teach-0.312
 hugs-0.309
Token called
Token ID1444
Feature activation+1.78e-01

Pos logit contributions
 last+0.562
 Ms+0.496
 front+0.452
 hugs+0.446
 praise+0.441
Neg logit contributions
ust-0.693
nes-0.686
vel-0.676
onel-0.616
oll-0.604
Token P
Token ID350
Feature activation-2.91e-01

Pos logit contributions
 last+0.213
 Ms+0.175
 front+0.160
 teach+0.160
 hugs+0.159
Neg logit contributions
ust-0.361
nes-0.357
vel-0.352
aces-0.323
onel-0.321
Tokenips
Token ID2419
Feature activation-1.36e-01

Pos logit contributions
ust+0.511
nes+0.491
vel+0.489
onel+0.444
aces+0.438
Neg logit contributions
 last-0.437
 Ms-0.389
 front-0.355
 teach-0.355
 hugs-0.350
Tokenque
Token ID4188
Feature activation-3.06e+00

Pos logit contributions
ust+0.270
nes+0.263
vel+0.259
onel+0.240
aces+0.239
Neg logit contributions
 last-0.170
 Ms-0.149
 teach-0.132
 front-0.132
 hugs-0.130
Tokenak
Token ID461
Feature activation+7.89e-01

Pos logit contributions
ust+2.714
nes+2.586
vel+2.265
aund+2.148
onel+2.019
Neg logit contributions
 last-7.424
 first-6.687
 Ms-6.599
 front-6.470
't-6.355
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
 last+0.953
 Ms+0.854
 praise+0.748
 teach+0.734
 hugs+0.720
Neg logit contributions
ust-1.562
vel-1.513
nes-1.507
aces-1.424
onel-1.387
Token One
Token ID1881
Feature activation-1.48e-01

Pos logit contributions
 last+0.511
 Ms+0.439
 hugs+0.398
 front+0.398
 teach+0.396
Neg logit contributions
ust-0.735
vel-0.705
nes-0.703
aces-0.657
onel-0.652
Token day
Token ID1110
Feature activation+1.13e+00

Pos logit contributions
ust+0.286
nes+0.276
vel+0.273
onel+0.252
oll+0.249
Neg logit contributions
 last-0.193
 Ms-0.172
 front-0.154
 teach-0.154
 hugs-0.152
Token P
Token ID350
Feature activation-4.43e-01

Pos logit contributions
 last+1.281
 Ms+1.098
 praise+0.965
 front+0.962
 teach+0.933
Neg logit contributions
vel-2.331
ust-2.290
nes-2.214
aund-2.135
aces-2.119
Token not
Token ID407
Feature activation-9.37e-01

Pos logit contributions
ust+0.971
nes+0.943
vel+0.936
onel+0.864
aces+0.854
Neg logit contributions
 last-0.619
 Ms-0.535
 front-0.479
 teach-0.478
 hugs-0.470
Token touch
Token ID3638
Feature activation-1.19e+00

Pos logit contributions
ust+2.087
nes+2.059
vel+2.035
onel+1.914
aces+1.881
Neg logit contributions
 last-0.980
 Ms-0.770
 teach-0.708
 front-0.699
't-0.674
Token things
Token ID1243
Feature activation-1.17e+00

Pos logit contributions
ust+2.486
nes+2.450
vel+2.382
onel+2.223
oll+2.182
Neg logit contributions
 last-1.473
 Ms-1.197
 front-1.151
 hugs-1.094
 teach-1.083
Token that
Token ID326
Feature activation-1.12e+00

Pos logit contributions
ust+2.628
nes+2.558
vel+2.541
onel+2.383
ode+2.367
Neg logit contributions
 last-1.251
 Ms-0.979
 teach-0.877
 front-0.865
 first-0.863
Token were
Token ID547
Feature activation-1.50e+00

Pos logit contributions
ust+2.210
vel+2.174
nes+2.148
onel+1.961
ode+1.939
Neg logit contributions
 last-1.489
 Ms-1.257
 teach-1.151
 hugs-1.119
 front-1.113
Token not
Token ID407
Feature activation-3.05e+00

Pos logit contributions
nes+3.087
ust+3.076
vel+3.064
onel+2.829
ode+2.768
Neg logit contributions
 last-1.913
 Ms-1.500
 first-1.416
 front-1.399
 teach-1.358
Token his
Token ID465
Feature activation-1.26e+00

Pos logit contributions
nes+7.057
ust+6.863
vel+6.831
onel+6.568
ode+6.447
Neg logit contributions
 last-3.836
 first-2.679
 while-2.623
 teach-2.577
 front-2.498
Token.
Token ID13
Feature activation+2.21e-02

Pos logit contributions
ust+2.580
nes+2.551
vel+2.526
onel+2.316
oll+2.278
Neg logit contributions
 last-1.620
 Ms-1.267
 front-1.239
 teach-1.214
 first-1.189
Token Tim
Token ID5045
Feature activation-8.23e-01

Pos logit contributions
 last+0.035
 Ms+0.030
 front+0.028
 teach+0.028
 hugs+0.028
Neg logit contributions
ust-0.037
nes-0.036
vel-0.035
aces-0.032
onel-0.032
Token decided
Token ID3066
Feature activation-2.57e-01

Pos logit contributions
ust+1.639
nes+1.596
vel+1.591
onel+1.468
aces+1.441
Neg logit contributions
 last-1.055
 Ms-0.895
 teach-0.810
 front-0.808
 hugs-0.805
Token to
Token ID284
Feature activation+2.44e-01

Pos logit contributions
ust+0.569
nes+0.556
vel+0.549
onel+0.510
aces+0.506
Neg logit contributions
 last-0.264
 Ms-0.222
 front-0.191
 teach-0.190
 hugs-0.185
Token.
Token ID13
Feature activation-6.90e-02

Pos logit contributions
ust+0.427
nes+0.412
vel+0.408
onel+0.374
aces+0.369
Neg logit contributions
 last-0.357
 Ms-0.323
 teach-0.293
 front-0.291
 hugs-0.288
Token
Token ID198
Feature activation+9.78e-02

Pos logit contributions
ust+0.147
nes+0.143
vel+0.142
onel+0.132
aces+0.131
Neg logit contributions
 last-0.076
 Ms-0.063
 front-0.056
 teach-0.056
 hugs-0.054
Token
Token ID198
Feature activation+4.62e-01

Pos logit contributions
 last+0.104
 Ms+0.086
 teach+0.077
 front+0.076
 hugs+0.075
Neg logit contributions
ust-0.212
nes-0.208
vel-0.205
onel-0.189
oll-0.186
TokenP
Token ID47
Feature activation-8.03e-01

Pos logit contributions
 last+0.712
 Ms+0.620
 hugs+0.579
 praise+0.562
 teach+0.560
Neg logit contributions
ust-0.818
vel-0.788
nes-0.782
onel-0.695
ode-0.684
Tokenips
Token ID2419
Feature activation-1.41e+00

Pos logit contributions
ust+1.185
nes+1.153
vel+1.132
onel+1.014
aces+0.999
Neg logit contributions
 last-1.414
 Ms-1.265
 teach-1.178
 front-1.173
 hugs-1.171
Tokenque
Token ID4188
Feature activation-3.05e+00

Pos logit contributions
ust+2.841
vel+2.754
nes+2.717
ode+2.536
onel+2.487
Neg logit contributions
 last-1.822
 Ms-1.506
 hugs-1.370
 front-1.351
 first-1.321
Tokenak
Token ID461
Feature activation+5.85e-01

Pos logit contributions
ust+2.659
nes+2.565
aund+2.129
vel+2.100
oll+2.052
Neg logit contributions
 last-7.323
 first-6.580
 Ms-6.497
't-6.493
 front-6.307
Token started
Token ID2067
Feature activation+4.32e-01

Pos logit contributions
 last+0.751
 Ms+0.678
 teach+0.597
 front+0.593
�+0.591
Neg logit contributions
ust-1.110
nes-1.095
vel-1.082
onel-1.007
aces-1.003
Token to
Token ID284
Feature activation+3.40e-02

Pos logit contributions
 last+0.490
 Ms+0.418
 teach+0.362
 front+0.359
�+0.351
Neg logit contributions
ust-0.879
vel-0.864
nes-0.855
onel-0.793
aces-0.781
Token regret
Token ID13721
Feature activation-7.95e-01

Pos logit contributions
 last+0.045
 Ms+0.040
 front+0.036
 hugs+0.035
 teach+0.035
Neg logit contributions
ust-0.064
nes-0.062
vel-0.062
onel-0.057
ode-0.056
Token it
Token ID340
Feature activation+3.19e-01

Pos logit contributions
ust+1.620
nes+1.577
vel+1.550
onel+1.440
ode+1.428
Neg logit contributions
 last-0.971
 Ms-0.832
 front-0.741
 teach-0.740
 hugs-0.724
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
 last+0.953
 Ms+0.854
 praise+0.748
 teach+0.734
 hugs+0.720
Neg logit contributions
ust-1.562
vel-1.513
nes-1.507
aces-1.424
onel-1.387
Token One
Token ID1881
Feature activation-1.48e-01

Pos logit contributions
 last+0.511
 Ms+0.439
 hugs+0.398
 front+0.398
 teach+0.396
Neg logit contributions
ust-0.735
vel-0.705
nes-0.703
aces-0.657
onel-0.652
Token day
Token ID1110
Feature activation+1.13e+00

Pos logit contributions
ust+0.286
nes+0.276
vel+0.273
onel+0.252
oll+0.249
Neg logit contributions
 last-0.193
 Ms-0.172
 front-0.154
 teach-0.154
 hugs-0.152
Token P
Token ID350
Feature activation-4.43e-01

Pos logit contributions
 last+1.281
 Ms+1.098
 praise+0.965
 front+0.962
 teach+0.933
Neg logit contributions
vel-2.331
ust-2.290
nes-2.214
aund-2.135
aces-2.119
Tokenips
Token ID2419
Feature activation-8.44e-01

Pos logit contributions
ust+0.742
nes+0.715
vel+0.709
onel+0.644
aces+0.635
Neg logit contributions
 last-0.701
 Ms-0.625
 front-0.574
 teach-0.573
 hugs-0.566
Tokenque
Token ID4188
Feature activation-3.04e+00

Pos logit contributions
ust+1.717
vel+1.662
nes+1.662
onel+1.527
ode+1.516
Neg logit contributions
 last-1.035
 Ms-0.872
 front-0.784
 teach-0.776
 hugs-0.773
Tokenak
Token ID461
Feature activation+5.82e-01

Pos logit contributions
ust+2.996
nes+2.777
vel+2.501
aund+2.402
oll+2.281
Neg logit contributions
 last-7.053
 first-6.300
 Ms-6.268
't-6.190
 hugs-6.058
Token was
Token ID373
Feature activation+1.50e-01

Pos logit contributions
 last+0.790
 Ms+0.708
 teach+0.632
 front+0.627
 praise+0.625
Neg logit contributions
ust-1.075
nes-1.048
vel-1.031
aces-0.956
onel-0.954
Token playing
Token ID2712
Feature activation-2.67e-01

Pos logit contributions
 last+0.201
 Ms+0.174
 teach+0.158
 front+0.157
 praise+0.156
Neg logit contributions
ust-0.282
vel-0.276
nes-0.275
onel-0.253
aces-0.250
Token with
Token ID351
Feature activation+1.32e-01

Pos logit contributions
ust+0.533
nes+0.520
vel+0.520
onel+0.479
aces+0.474
Neg logit contributions
 last-0.331
 Ms-0.285
 front-0.253
 teach-0.251
 hugs-0.251
Token its
Token ID663
Feature activation+9.28e-01

Pos logit contributions
 last+0.189
 Ms+0.167
 teach+0.151
 front+0.148
�+0.147
Neg logit contributions
ust-0.233
nes-0.227
vel-0.227
aces-0.208
onel-0.208
Token Lily
Token ID20037
Feature activation-2.47e-01

Pos logit contributions
 last+0.499
 Ms+0.420
 front+0.393
 teach+0.386
�+0.385
Neg logit contributions
ust-0.713
nes-0.708
vel-0.690
aces-0.643
onel-0.633
Token.
Token ID13
Feature activation+6.41e-02

Pos logit contributions
ust+0.534
vel+0.518
nes+0.518
onel+0.479
aces+0.474
Neg logit contributions
 last-0.267
 Ms-0.224
 front-0.195
 teach-0.193
 hugs-0.191
Token "
Token ID366
Feature activation-1.46e+00

Pos logit contributions
 last+0.077
 Ms+0.063
 teach+0.057
 hugs+0.057
 front+0.056
Neg logit contributions
ust-0.130
nes-0.128
vel-0.125
aces-0.116
onel-0.115
TokenIt
Token ID1026
Feature activation-3.09e-01

Pos logit contributions
ust+2.906
nes+2.823
vel+2.820
onel+2.651
aces+2.643
Neg logit contributions
 last-1.811
 Ms-1.579
't-1.478
 front-1.466
 teach-1.399
Token's
Token ID338
Feature activation-1.19e+00

Pos logit contributions
ust+0.527
nes+0.508
vel+0.506
onel+0.460
aces+0.453
Neg logit contributions
 last-0.472
 Ms-0.421
 front-0.385
 teach-0.384
 hugs-0.378
Token an
Token ID281
Feature activation-3.00e+00

Pos logit contributions
ust+2.753
nes+2.724
vel+2.677
onel+2.512
aces+2.472
Neg logit contributions
 last-1.227
 Ms-0.965
 front-0.886
 teach-0.814
 first-0.809
Token underground
Token ID11447
Feature activation-1.11e+00

Pos logit contributions
nes+6.496
ust+6.291
vel+6.174
ode+5.615
onel+5.538
Neg logit contributions
 last-4.196
 front-3.192
 first-3.147
 teach-2.893
 Ms-2.847
Token tunnel
Token ID13275
Feature activation+3.34e-01

Pos logit contributions
ust+1.781
nes+1.725
vel+1.716
onel+1.541
oll+1.525
Neg logit contributions
 last-1.900
 Ms-1.635
 front-1.563
 teach-1.562
 hugs-1.526
Token,"
Token ID553
Feature activation-5.00e-01

Pos logit contributions
 last+0.400
 Ms+0.368
 front+0.317
 teach+0.316
 hugs+0.311
Neg logit contributions
ust-0.664
nes-0.654
vel-0.642
aces-0.595
onel-0.592
Token said
Token ID531
Feature activation-3.22e-01

Pos logit contributions
ust+1.134
nes+1.106
vel+1.098
onel+1.025
aces+1.014
Neg logit contributions
 last-0.489
 Ms-0.402
 front-0.345
 teach-0.345
 hugs-0.337
Token her
Token ID607
Feature activation-9.34e-01

Pos logit contributions
ust+0.658
nes+0.644
vel+0.637
onel+0.591
aces+0.584
Neg logit contributions
 last-0.381
 Ms-0.325
 front-0.289
 teach-0.287
 hugs-0.283
TokenLook
Token ID8567
Feature activation-1.38e+00

Pos logit contributions
ust+0.046
nes+0.045
vel+0.045
onel+0.041
ode+0.040
Neg logit contributions
 last-0.032
 Ms-0.028
 teach-0.025
 front-0.025
 hugs-0.025
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
ust+3.408
nes+3.350
vel+3.252
onel+3.120
aces+3.115
Neg logit contributions
 last-1.077
 Ms-0.862
 hugs-0.704
 teach-0.683
 front-0.670
Token mom
Token ID1995
Feature activation+7.46e-01

Pos logit contributions
ust+2.967
nes+2.888
vel+2.872
onel+2.686
aces+2.670
Neg logit contributions
 last-1.396
 Ms-1.184
 teach-1.034
 front-1.001
 hugs-1.000
Token,
Token ID11
Feature activation-1.76e+00

Pos logit contributions
 last+0.730
 Ms+0.622
�+0.549
 teach+0.540
 front+0.538
Neg logit contributions
ust-1.669
vel-1.625
nes-1.593
ode-1.502
adow-1.485
Token isn
Token ID2125
Feature activation+2.48e-01

Pos logit contributions
ust+4.248
nes+4.151
vel+4.117
onel+3.884
aces+3.870
Neg logit contributions
 last-1.698
 Ms-1.350
 teach-1.231
 hugs-1.154
 front-1.135
Token't
Token ID470
Feature activation-2.99e+00

Pos logit contributions
 last+0.113
 Ms+0.067
 hugs+0.043
 teach+0.042
 praise+0.039
Neg logit contributions
ust-0.690
nes-0.679
vel-0.673
onel-0.633
oll-0.624
Token it
Token ID340
Feature activation+2.17e-01

Pos logit contributions
ust+6.855
nes+6.787
aces+6.619
ode+6.553
vel+6.462
Neg logit contributions
 last-3.339
 Ms-2.701
 Mrs-2.562
 first-2.546
't-2.473
Token incredible
Token ID8082
Feature activation-1.36e-01

Pos logit contributions
 last+0.265
 Ms+0.234
 hugs+0.210
 front+0.210
 praise+0.208
Neg logit contributions
ust-0.438
nes-0.423
vel-0.421
onel-0.385
oll-0.379
Token?"
Token ID1701
Feature activation-7.99e-01

Pos logit contributions
ust+0.286
nes+0.280
vel+0.278
onel+0.257
aces+0.256
Neg logit contributions
 last-0.154
 Ms-0.132
 front-0.116
 teach-0.114
 hugs-0.112
Token Mom
Token ID11254
Feature activation+7.03e-01

Pos logit contributions
ust+1.778
nes+1.720
vel+1.719
onel+1.601
ode+1.580
Neg logit contributions
 last-0.835
 Ms-0.719
 teach-0.613
 front-0.611
 hugs-0.605
Token agreed
Token ID4987
Feature activation+3.40e-01

Pos logit contributions
 last+0.734
 Ms+0.597
 front+0.560
�+0.549
 teach+0.526
Neg logit contributions
ust-1.523
vel-1.474
nes-1.467
ode-1.356
oll-1.351
Token
Token ID198
Feature activation-2.09e-01

Pos logit contributions
 last+0.055
 Ms+0.045
 teach+0.040
 front+0.040
 hugs+0.039
Neg logit contributions
ust-0.113
nes-0.111
vel-0.109
onel-0.101
oll-0.100
TokenFl
Token ID7414
Feature activation-3.10e-01

Pos logit contributions
ust+0.388
nes+0.377
vel+0.372
onel+0.338
ode+0.335
Neg logit contributions
 last-0.294
 Ms-0.255
 teach-0.232
 front-0.232
 hugs-0.232
Tokenuffy
Token ID15352
Feature activation-1.31e-01

Pos logit contributions
ust+0.514
nes+0.495
vel+0.492
onel+0.446
aces+0.440
Neg logit contributions
 last-0.496
 Ms-0.443
 front-0.408
 teach-0.407
 hugs-0.402
Token and
Token ID290
Feature activation-3.93e-01

Pos logit contributions
ust+0.262
nes+0.257
vel+0.252
onel+0.232
aces+0.231
Neg logit contributions
 last-0.159
 Ms-0.136
 front-0.122
 teach-0.121
't-0.117
Token S
Token ID311
Feature activation-3.85e-01

Pos logit contributions
ust+0.881
nes+0.859
vel+0.854
onel+0.796
aces+0.789
Neg logit contributions
 last-0.395
 Ms-0.325
 front-0.281
 teach-0.280
 hugs-0.274
Tokenque
Token ID4188
Feature activation-2.96e+00

Pos logit contributions
ust+0.622
nes+0.597
vel+0.592
onel+0.537
aces+0.530
Neg logit contributions
 last-0.630
 Ms-0.565
 front-0.522
 teach-0.520
 hugs-0.514
Tokenaky
Token ID15492
Feature activation-3.77e-01

Pos logit contributions
nes+2.551
ust+2.434
vel+2.128
aund+1.944
onel+1.891
Neg logit contributions
 last-7.316
 Ms-6.624
 first-6.476
't-6.320
 while-6.282
Token played
Token ID2826
Feature activation-7.47e-01

Pos logit contributions
ust+0.733
nes+0.713
vel+0.706
onel+0.651
aces+0.644
Neg logit contributions
 last-0.488
 Ms-0.425
 front-0.380
 teach-0.380
 hugs-0.373
Token for
Token ID329
Feature activation+8.70e-02

Pos logit contributions
ust+1.466
nes+1.412
vel+1.398
onel+1.295
ode+1.281
Neg logit contributions
 last-0.984
 Ms-0.851
 teach-0.772
 front-0.766
 hugs-0.756
Token hours
Token ID2250
Feature activation-1.34e-01

Pos logit contributions
 last+0.084
 Ms+0.071
 teach+0.060
 front+0.059
 hugs+0.058
Neg logit contributions
ust-0.191
vel-0.189
nes-0.189
aces-0.174
onel-0.174
Token until
Token ID1566
Feature activation-6.62e-02

Pos logit contributions
ust+0.290
nes+0.285
vel+0.283
aces+0.262
onel+0.262
Neg logit contributions
 last-0.141
 Ms-0.121
 teach-0.104
 hugs-0.103
 front-0.103
Token wins
Token ID7864
Feature activation+4.29e-01

Pos logit contributions
ust+0.121
nes+0.118
vel+0.117
aces+0.108
onel+0.107
Neg logit contributions
 last-0.062
 Ms-0.053
 front-0.047
 teach-0.046
 hugs-0.045
Token!"
Token ID2474
Feature activation-9.45e-01

Pos logit contributions
 last+0.482
 Ms+0.426
 teach+0.373
 hugs+0.372
 praise+0.365
Neg logit contributions
ust-0.886
nes-0.876
vel-0.859
aces-0.801
onel-0.788
Token
Token ID198
Feature activation-4.17e-01

Pos logit contributions
ust+2.182
nes+2.115
vel+2.105
onel+1.967
aces+1.927
Neg logit contributions
 last-0.923
 Ms-0.772
 teach-0.667
 front-0.658
't-0.642
Token
Token ID198
Feature activation-5.23e-01

Pos logit contributions
ust+0.953
nes+0.930
vel+0.924
onel+0.862
aces+0.853
Neg logit contributions
 last-0.402
 Ms-0.330
 front-0.282
 teach-0.281
 hugs-0.275
TokenS
Token ID50
Feature activation-9.53e-01

Pos logit contributions
ust+0.902
nes+0.872
vel+0.865
onel+0.788
aces+0.777
Neg logit contributions
 last-0.802
 Ms-0.711
 front-0.651
 teach-0.651
 hugs-0.641
Tokenque
Token ID4188
Feature activation-2.96e+00

Pos logit contributions
ust+1.321
nes+1.299
vel+1.267
onel+1.127
aces+1.115
Neg logit contributions
 last-1.779
 Ms-1.580
 front-1.489
 teach-1.486
 hugs-1.468
Tokeneker
Token ID28233
Feature activation+2.59e-01

Pos logit contributions
nes+2.559
ust+2.540
vel+2.146
oll+1.914
aund+1.896
Neg logit contributions
 last-7.095
 first-6.451
 Ms-6.365
't-6.187
 front-6.110
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 last+0.318
 Ms+0.281
 front+0.249
 teach+0.242
 hugs+0.241
Neg logit contributions
ust-0.512
nes-0.503
vel-0.495
aces-0.456
onel-0.455
Token up
Token ID510
Feature activation+9.79e-01

Pos logit contributions
 last+0.072
 Ms+0.063
 teach+0.058
 hugs+0.058
 front+0.058
Neg logit contributions
ust-0.080
nes-0.077
vel-0.077
onel-0.069
aces-0.069
Token for
Token ID329
Feature activation+5.65e-01

Pos logit contributions
 last+1.389
 Ms+1.272
 hugs+1.212
 teach+1.159
 praise+1.136
Neg logit contributions
ust-1.728
vel-1.643
nes-1.621
onel-1.459
aces-1.431
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
 last+0.598
 Ms+0.512
 hugs+0.435
 teach+0.431
 front+0.429
Neg logit contributions
ust-1.179
vel-1.171
nes-1.154
aces-1.072
onel-1.067
Token in
Token ID287
Feature activation+6.86e-02

Pos logit contributions
ust+1.267
nes+1.226
vel+1.201
onel+1.096
aces+1.074
Neg logit contributions
 last-1.205
 Ms-1.056
 front-0.987
 teach-0.981
 hugs-0.962
Token the
Token ID262
Feature activation-1.80e+00

Pos logit contributions
 last+0.073
 Ms+0.061
 teach+0.055
 hugs+0.053
�+0.053
Neg logit contributions
ust-0.145
nes-0.142
vel-0.141
aces-0.129
onel-0.129
Token bathroom
Token ID12436
Feature activation+5.61e-02

Pos logit contributions
ust+3.672
nes+3.525
vel+3.516
onel+3.250
aces+3.194
Neg logit contributions
 last-2.560
 front-2.161
 Ms-2.152
 teach-1.929
 hugs-1.929
Token and
Token ID290
Feature activation-3.80e-01

Pos logit contributions
 last+0.062
 Ms+0.054
 teach+0.047
 hugs+0.047
 praise+0.045
Neg logit contributions
ust-0.117
nes-0.114
vel-0.114
aces-0.105
onel-0.105
Token didn
Token ID1422
Feature activation+4.94e-01

Pos logit contributions
ust+0.743
nes+0.722
vel+0.716
onel+0.660
aces+0.652
Neg logit contributions
 last-0.492
 Ms-0.425
 front-0.382
 teach-0.381
 hugs-0.373
Token't
Token ID470
Feature activation-2.93e+00

Pos logit contributions
 last+0.230
 Ms+0.166
 hugs+0.116
 praise+0.102
 teach+0.095
Neg logit contributions
ust-1.345
nes-1.331
vel-1.320
onel-1.237
oll-1.233
Token know
Token ID760
Feature activation-1.11e+00

Pos logit contributions
nes+6.408
ust+6.377
vel+6.171
onel+6.051
aces+5.940
Neg logit contributions
 last-3.731
 teach-2.976
 while-2.771
 front-2.726
 hug-2.697
Token what
Token ID644
Feature activation-1.50e+00

Pos logit contributions
ust+2.018
nes+2.007
vel+1.919
onel+1.785
ode+1.752
Neg logit contributions
 last-1.610
 Ms-1.408
 teach-1.282
 hugs-1.278
 front-1.278
Token it
Token ID340
Feature activation+2.67e-01

Pos logit contributions
ust+3.281
nes+3.160
vel+3.103
onel+2.967
ode+2.874
Neg logit contributions
 last-1.778
 Ms-1.513
 teach-1.352
 hugs-1.343
 front-1.326
Token was
Token ID373
Feature activation-1.10e+00

Pos logit contributions
 last+0.311
 Ms+0.268
 front+0.247
 hugs+0.238
 teach+0.238
Neg logit contributions
ust-0.547
nes-0.524
vel-0.520
onel-0.485
aces-0.483
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
ust+2.517
nes+2.466
vel+2.428
onel+2.287
ode+2.249
Neg logit contributions
 last-1.113
 Ms-0.860
 teach-0.782
 front-0.764
 first-0.747
Token one
Token ID530
Feature activation+5.94e-01

Pos logit contributions
 last+0.078
 Ms+0.068
 teach+0.062
�+0.062
't+0.061
Neg logit contributions
ust-0.109
nes-0.106
vel-0.106
onel-0.098
aces-0.096
Token more
Token ID517
Feature activation-5.27e-01

Pos logit contributions
 last+0.637
 Ms+0.613
't+0.550
�+0.535
 teach+0.513
Neg logit contributions
ust-1.203
vel-1.167
nes-1.150
onel-1.081
ode-1.079
Token time
Token ID640
Feature activation+1.52e-01

Pos logit contributions
ust+0.868
nes+0.838
vel+0.831
onel+0.753
aces+0.741
Neg logit contributions
 last-0.854
 Ms-0.761
 front-0.701
 teach-0.701
 hugs-0.692
Token,
Token ID11
Feature activation+6.56e-02

Pos logit contributions
 last+0.171
 Ms+0.149
 front+0.130
 teach+0.129
 hugs+0.127
Neg logit contributions
ust-0.313
vel-0.309
nes-0.306
onel-0.284
aces-0.283
Token and
Token ID290
Feature activation-5.76e-01

Pos logit contributions
 last+0.079
 Ms+0.069
 front+0.062
 teach+0.061
 hugs+0.060
Neg logit contributions
ust-0.130
vel-0.127
nes-0.126
onel-0.117
aces-0.116
Token Sil
Token ID4243
Feature activation-2.93e+00

Pos logit contributions
ust+1.323
nes+1.288
vel+1.281
onel+1.196
aces+1.184
Neg logit contributions
 last-0.553
 Ms-0.452
 teach-0.385
 front-0.385
 hugs-0.376
Tokenv
Token ID85
Feature activation-1.02e+00

Pos logit contributions
vel+3.264
ust+3.177
oll+3.173
nes+3.163
ode+3.056
Neg logit contributions
 last-5.838
 Ms-5.097
 first-4.885
't-4.839
 teach-4.821
Tokenie
Token ID494
Feature activation+9.03e-02

Pos logit contributions
ust+1.904
nes+1.847
vel+1.843
onel+1.673
ode+1.663
Neg logit contributions
 last-1.436
 Ms-1.222
 front-1.124
 hugs-1.117
 teach-1.113
Token repeated
Token ID5100
Feature activation-7.41e-01

Pos logit contributions
 last+0.114
 Ms+0.101
 front+0.089
 teach+0.088
�+0.086
Neg logit contributions
ust-0.175
nes-0.173
vel-0.170
onel-0.157
aces-0.156
Token it
Token ID340
Feature activation+1.99e-01

Pos logit contributions
ust+1.629
nes+1.585
vel+1.565
onel+1.464
ode+1.446
Neg logit contributions
 last-0.788
 Ms-0.661
 teach-0.578
 front-0.576
 hugs-0.569
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
 last+0.238
 Ms+0.212
 front+0.185
�+0.183
 teach+0.181
Neg logit contributions
ust-0.400
vel-0.390
nes-0.386
onel-0.357
aces-0.356
Token!"
Token ID2474
Feature activation+9.77e-02

Pos logit contributions
 last+0.464
 Ms+0.399
 front+0.357
�+0.349
 teach+0.347
Neg logit contributions
ust-0.807
nes-0.795
vel-0.790
onel-0.728
aces-0.727
Token
Token ID198
Feature activation+1.42e-01

Pos logit contributions
 last+0.109
 Ms+0.087
 teach+0.080
 front+0.079
 hugs+0.076
Neg logit contributions
ust-0.205
nes-0.203
vel-0.199
aces-0.185
onel-0.184
Token
Token ID198
Feature activation-7.46e-02

Pos logit contributions
 last+0.153
 Ms+0.124
 teach+0.113
 front+0.109
 hugs+0.108
Neg logit contributions
ust-0.308
nes-0.303
vel-0.298
onel-0.274
aces-0.272
Token"
Token ID1
Feature activation+4.75e-01

Pos logit contributions
ust+0.146
nes+0.142
vel+0.140
onel+0.127
ode+0.126
Neg logit contributions
 last-0.099
 Ms-0.084
 teach-0.076
 hugs-0.076
 front-0.075
TokenS
Token ID50
Feature activation+2.87e-01

Pos logit contributions
 last+0.632
 Ms+0.536
 hugs+0.492
 teach+0.483
 front+0.482
Neg logit contributions
ust-0.912
nes-0.898
vel-0.872
onel-0.801
aces-0.796
Tokenque
Token ID4188
Feature activation-2.93e+00

Pos logit contributions
 last+0.404
 Ms+0.362
 front+0.330
 teach+0.326
't+0.322
Neg logit contributions
ust-0.527
vel-0.509
nes-0.495
ode-0.465
oll-0.457
Tokenak
Token ID461
Feature activation+8.67e-01

Pos logit contributions
ust+3.154
nes+3.010
vel+2.488
onel+2.412
aund+2.336
Neg logit contributions
 last-6.751
 Ms-5.932
 first-5.883
 front-5.806
't-5.781
Token!"
Token ID2474
Feature activation-5.42e-02

Pos logit contributions
 last+0.972
 Ms+0.890
�+0.767
 front+0.751
 teach+0.743
Neg logit contributions
ust-1.761
vel-1.725
nes-1.721
aces-1.619
onel-1.608
Token Lily
Token ID20037
Feature activation+8.01e-01

Pos logit contributions
ust+0.107
nes+0.106
vel+0.104
onel+0.096
aces+0.096
Neg logit contributions
 last-0.067
 Ms-0.056
 teach-0.051
 front-0.051
 hugs-0.050
Token says
Token ID1139
Feature activation+2.93e-01

Pos logit contributions
 last+1.044
 Ms+0.868
�+0.820
 front+0.805
't+0.780
Neg logit contributions
ust-1.512
nes-1.509
vel-1.482
aces-1.411
onel-1.350
Token.
Token ID13
Feature activation-6.99e-02

Pos logit contributions
 last+0.338
 Ms+0.291
 front+0.255
 teach+0.253
 praise+0.244
Neg logit contributions
ust-0.602
vel-0.595
nes-0.592
onel-0.546
aces-0.546
Token that
Token ID326
Feature activation-3.10e-01

Pos logit contributions
 last+0.860
 Ms+0.795
 teach+0.701
�+0.699
 front+0.692
Neg logit contributions
ust-1.445
nes-1.399
vel-1.357
onel-1.257
aces-1.256
Token day
Token ID1110
Feature activation-8.01e-01

Pos logit contributions
ust+0.606
nes+0.587
vel+0.585
onel+0.539
aces+0.530
Neg logit contributions
 last-0.398
 Ms-0.348
 front-0.312
 teach-0.311
 hugs-0.309
Token forward
Token ID2651
Feature activation-3.20e-01

Pos logit contributions
ust+1.404
nes+1.352
vel+1.327
onel+1.214
ode+1.207
Neg logit contributions
 last-1.223
 Ms-1.075
 front-0.995
 teach-0.987
 hugs-0.976
Token,
Token ID11
Feature activation+9.42e-02

Pos logit contributions
ust+0.759
nes+0.742
vel+0.736
onel+0.691
aces+0.684
Neg logit contributions
 last-0.277
 Ms-0.224
 front-0.187
 teach-0.185
 hugs-0.178
Token Daisy
Token ID40355
Feature activation+5.26e-01

Pos logit contributions
 last+0.104
 Ms+0.086
 front+0.079
 teach+0.077
 hugs+0.076
Neg logit contributions
ust-0.199
nes-0.194
vel-0.192
onel-0.178
aces-0.177
Token stri
Token ID2338
Feature activation-2.92e+00

Pos logit contributions
 last+0.700
 Ms+0.620
 front+0.563
�+0.550
 teach+0.549
Neg logit contributions
ust-0.980
nes-0.965
vel-0.948
aces-0.880
onel-0.879
Tokenved
Token ID1079
Feature activation+6.10e-01

Pos logit contributions
ust+3.719
vel+3.632
nes+3.531
oll+3.050
ode+3.037
Neg logit contributions
 last-6.034
 first-4.992
 teach-4.889
 Ms-4.872
 front-4.853
Token to
Token ID284
Feature activation-1.23e-01

Pos logit contributions
 last+0.769
 Ms+0.683
�+0.641
 front+0.604
 teach+0.601
Neg logit contributions
ust-1.185
vel-1.138
nes-1.130
onel-1.047
aces-1.041
Token be
Token ID307
Feature activation+9.72e-02

Pos logit contributions
ust+0.247
nes+0.241
vel+0.239
onel+0.221
aces+0.219
Neg logit contributions
 last-0.147
 Ms-0.128
 front-0.112
 hugs-0.111
 teach-0.111
Token the
Token ID262
Feature activation+4.22e-01

Pos logit contributions
 last+0.130
 Ms+0.115
 teach+0.102
�+0.102
 front+0.101
Neg logit contributions
ust-0.183
vel-0.178
nes-0.178
onel-0.163
aces-0.161
Token best
Token ID1266
Feature activation+3.83e-02

Pos logit contributions
 last+0.635
 Ms+0.593
 hugs+0.540
't+0.537
 front+0.528
Neg logit contributions
ust-0.705
vel-0.664
nes-0.663
onel-0.612
ode-0.604
Token by
Token ID416
Feature activation-6.45e-01

Pos logit contributions
 last+0.833
 Ms+0.746
 hugs+0.673
�+0.660
 teach+0.660
Neg logit contributions
ust-1.395
vel-1.395
nes-1.375
onel-1.264
adow-1.245
Token himself
Token ID2241
Feature activation+4.94e-01

Pos logit contributions
ust+1.524
nes+1.488
vel+1.475
onel+1.384
aces+1.366
Neg logit contributions
 last-0.585
 Ms-0.468
 front-0.402
 teach-0.394
 hugs-0.384
Token when
Token ID618
Feature activation+7.55e-01

Pos logit contributions
 last+0.549
 Ms+0.510
 hugs+0.450
 praise+0.444
 teach+0.437
Neg logit contributions
ust-1.028
vel-0.996
nes-0.976
adow-0.909
onel-0.905
Token he
Token ID339
Feature activation+1.10e+00

Pos logit contributions
 last+0.849
 Ms+0.752
 teach+0.691
 hugs+0.674
 praise+0.674
Neg logit contributions
ust-1.500
nes-1.478
vel-1.458
onel-1.359
aces-1.334
Token heard
Token ID2982
Feature activation-5.64e-01

Pos logit contributions
 last+1.392
 Ms+1.247
 praise+1.169
 teach+1.157
 hugs+1.128
Neg logit contributions
ust-2.109
nes-2.051
vel-2.003
onel-1.880
adow-1.867
Token an
Token ID281
Feature activation-2.91e+00

Pos logit contributions
ust+1.308
nes+1.276
vel+1.267
onel+1.185
aces+1.172
Neg logit contributions
 last-0.526
 Ms-0.428
 front-0.364
 teach-0.363
 hugs-0.355
Token enormous
Token ID9812
Feature activation-9.48e-01

Pos logit contributions
nes+6.426
ust+6.325
vel+6.318
aces+5.747
ode+5.733
Neg logit contributions
 last-3.607
 Ms-2.900
 teach-2.835
 front-2.828
 first-2.689
Token noise
Token ID7838
Feature activation+2.35e-02

Pos logit contributions
ust+1.808
nes+1.760
vel+1.713
onel+1.589
aces+1.579
Neg logit contributions
 last-1.318
 Ms-1.146
 teach-1.059
 front-1.055
 hugs-1.028
Token.
Token ID13
Feature activation-7.21e-01

Pos logit contributions
 last+0.027
 Ms+0.023
 hugs+0.020
 front+0.020
 teach+0.020
Neg logit contributions
ust-0.049
vel-0.047
nes-0.047
onel-0.043
aces-0.043
Token He
Token ID679
Feature activation-1.23e+00

Pos logit contributions
ust+1.371
nes+1.329
vel+1.319
onel+1.219
ode+1.190
Neg logit contributions
 last-0.983
 Ms-0.870
 front-0.784
 teach-0.777
 hugs-0.767
Token looked
Token ID3114
Feature activation+5.94e-02

Pos logit contributions
ust+2.566
nes+2.534
vel+2.527
onel+2.352
aces+2.340
Neg logit contributions
 last-1.435
 Ms-1.207
 front-1.080
 hugs-1.073
 teach-1.066
Token alone
Token ID3436
Feature activation+4.91e-01

Pos logit contributions
 last+1.014
 Ms+0.983
 teach+0.866
 hugs+0.849
 praise+0.848
Neg logit contributions
ust-1.177
nes-1.141
vel-1.127
onel-1.039
aces-1.024
Token!"
Token ID2474
Feature activation+3.44e-01

Pos logit contributions
 last+0.559
 Ms+0.512
 teach+0.441
 hugs+0.429
 praise+0.427
Neg logit contributions
ust-1.009
vel-0.989
nes-0.986
aces-0.907
onel-0.907
Token But
Token ID887
Feature activation+1.20e-01

Pos logit contributions
 last+0.383
 Ms+0.313
 teach+0.285
 front+0.277
 hugs+0.274
Neg logit contributions
ust-0.713
nes-0.702
vel-0.689
aces-0.645
onel-0.635
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
 last+0.125
 Ms+0.103
 hugs+0.092
 teach+0.092
 front+0.090
Neg logit contributions
ust-0.260
nes-0.255
vel-0.252
onel-0.235
aces-0.233
Token dog
Token ID3290
Feature activation+5.92e-01

Pos logit contributions
 last+2.276
 Ms+2.210
�+2.032
't+1.956
 praise+1.948
Neg logit contributions
ust-2.161
vel-2.044
nes-2.029
aces-1.912
oll-1.888
Token does
Token ID857
Feature activation+1.70e+00

Pos logit contributions
 last+0.893
 Ms+0.810
 front+0.735
�+0.734
 teach+0.718
Neg logit contributions
ust-0.999
vel-0.947
nes-0.920
onel-0.868
aces-0.864
Token not
Token ID407
Feature activation-8.35e-01

Pos logit contributions
 last+2.646
 Ms+2.539
 hugs+2.299
ted+2.273
�+2.241
Neg logit contributions
ust-2.768
vel-2.692
nes-2.631
onel-2.276
aund-2.203
Token listen
Token ID6004
Feature activation+6.20e-01

Pos logit contributions
ust+1.768
nes+1.725
vel+1.714
onel+1.596
aces+1.572
Neg logit contributions
 last-0.974
 Ms-0.793
 teach-0.733
 front-0.712
 hugs-0.702
Token.
Token ID13
Feature activation+4.70e-01

Pos logit contributions
 last+0.743
 Ms+0.673
�+0.599
 teach+0.587
 praise+0.574
Neg logit contributions
ust-1.248
vel-1.204
nes-1.189
aces-1.082
onel-1.069
Token He
Token ID679
Feature activation+1.15e+00

Pos logit contributions
 last+0.637
 Ms+0.537
 teach+0.504
 hugs+0.486
 front+0.485
Neg logit contributions
ust-0.876
vel-0.835
nes-0.825
aces-0.780
onel-0.776
Token thinks
Token ID6834
Feature activation+1.05e-01

Pos logit contributions
 last+1.628
 Ms+1.489
�+1.350
 front+1.323
't+1.291
Neg logit contributions
ust-2.024
vel-1.904
nes-1.881
aces-1.821
oll-1.788
Token saw
Token ID2497
Feature activation-6.71e-02

Pos logit contributions
 last+0.785
 Ms+0.704
 teach+0.643
 praise+0.623
 hugs+0.610
Neg logit contributions
ust-1.221
nes-1.204
vel-1.192
onel-1.105
ode-1.087
Token something
Token ID1223
Feature activation-8.66e-01

Pos logit contributions
ust+0.149
nes+0.145
vel+0.145
onel+0.136
aces+0.134
Neg logit contributions
 last-0.066
 Ms-0.055
 teach-0.048
 front-0.046
 hugs-0.045
Token moving
Token ID3867
Feature activation-8.23e-01

Pos logit contributions
ust+1.725
nes+1.694
vel+1.662
onel+1.541
aces+1.527
Neg logit contributions
 last-1.109
 Ms-0.946
 teach-0.852
 front-0.851
 hugs-0.836
Token.
Token ID13
Feature activation-5.05e-01

Pos logit contributions
ust+1.440
nes+1.399
vel+1.382
onel+1.262
ode+1.244
Neg logit contributions
 last-1.251
 Ms-1.086
 front-1.016
 teach-0.999
 hugs-0.987
Token She
Token ID1375
Feature activation-6.00e-01

Pos logit contributions
ust+0.861
nes+0.832
vel+0.825
onel+0.751
aces+0.740
Neg logit contributions
 last-0.779
 Ms-0.692
 front-0.634
 teach-0.634
 hugs-0.626
Token went
Token ID1816
Feature activation+2.06e+00

Pos logit contributions
ust+1.247
nes+1.214
vel+1.205
onel+1.119
aces+1.105
Neg logit contributions
 last-0.704
 Ms-0.600
 front-0.532
 teach-0.529
 hugs-0.523
Token closer
Token ID5699
Feature activation-2.92e-01

Pos logit contributions
 last+2.847
 Ms+2.768
�+2.689
 praise+2.499
 teach+2.417
Neg logit contributions
ust-3.280
nes-3.233
vel-3.107
ode-2.919
ying-2.883
Token and
Token ID290
Feature activation+4.69e-01

Pos logit contributions
ust+0.628
nes+0.612
vel+0.609
onel+0.563
aces+0.560
Neg logit contributions
 last-0.311
 Ms-0.265
 teach-0.231
 front-0.229
 hugs-0.224
Token saw
Token ID2497
Feature activation-3.98e-01

Pos logit contributions
 last+0.544
 Ms+0.455
 teach+0.421
�+0.415
 front+0.414
Neg logit contributions
ust-0.949
nes-0.939
vel-0.921
onel-0.849
ode-0.834
Token it
Token ID340
Feature activation+8.37e-01

Pos logit contributions
ust+0.888
nes+0.864
vel+0.860
onel+0.803
aces+0.794
Neg logit contributions
 last-0.404
 Ms-0.336
 teach-0.290
 front-0.289
 hugs-0.282
Token was
Token ID373
Feature activation-1.00e-01

Pos logit contributions
 last+0.948
 Ms+0.872
 praise+0.806
 teach+0.772
 hugs+0.771
Neg logit contributions
ust-1.734
vel-1.681
nes-1.606
onel-1.524
aces-1.497
Token her
Token ID607
Feature activation-1.15e+00

Pos logit contributions
ust+2.033
nes+1.989
vel+1.964
onel+1.840
oll+1.803
Neg logit contributions
 last-1.122
 Ms-0.960
 front-0.871
 teach-0.843
 hugs-0.828
Token cabin
Token ID9351
Feature activation+7.56e-01

Pos logit contributions
ust+1.764
vel+1.719
nes+1.709
aces+1.528
onel+1.523
Neg logit contributions
 last-2.062
 Ms-1.819
 front-1.744
 teach-1.690
 hugs-1.673
Token.
Token ID13
Feature activation+7.30e-02

Pos logit contributions
 last+0.891
 Ms+0.846
 hugs+0.765
 praise+0.733
 teach+0.729
Neg logit contributions
ust-1.461
nes-1.423
vel-1.381
adow-1.323
aces-1.304
Token One
Token ID1881
Feature activation+1.16e-01

Pos logit contributions
 last+0.071
 Ms+0.057
 front+0.050
 teach+0.050
 hugs+0.049
Neg logit contributions
ust-0.164
nes-0.159
vel-0.158
aces-0.147
oll-0.146
Token day
Token ID1110
Feature activation+9.49e-01

Pos logit contributions
 last+0.154
 Ms+0.140
 teach+0.128
 hugs+0.127
 front+0.125
Neg logit contributions
ust-0.217
nes-0.206
vel-0.204
oll-0.187
onel-0.187
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
 last+1.055
 Ms+0.883
 teach+0.807
 front+0.804
 praise+0.795
Neg logit contributions
vel-1.957
ust-1.933
nes-1.884
oll-1.787
aund-1.781
Token while
Token ID981
Feature activation+4.55e-01

Pos logit contributions
 last+2.369
 praise+2.003
 Ms+1.975
 hugs+1.939
 front+1.934
Neg logit contributions
nes-2.800
ust-2.794
vel-2.783
aund-2.553
oll-2.495
Token she
Token ID673
Feature activation-3.92e-01

Pos logit contributions
 last+0.529
 Ms+0.444
 teach+0.405
 praise+0.396
 hugs+0.392
Neg logit contributions
vel-0.905
ust-0.901
nes-0.895
onel-0.825
oll-0.816
Token was
Token ID373
Feature activation-7.70e-01

Pos logit contributions
ust+0.884
nes+0.862
vel+0.855
onel+0.798
aces+0.790
Neg logit contributions
 last-0.390
 Ms-0.325
 teach-0.279
 front-0.278
 hugs-0.272
Token playing
Token ID2712
Feature activation-7.91e-01

Pos logit contributions
ust+1.599
nes+1.548
vel+1.532
onel+1.419
aces+1.406
Neg logit contributions
 last-0.929
 Ms-0.786
 front-0.707
 teach-0.702
 hugs-0.689
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
ust+1.795
nes+1.741
vel+1.715
onel+1.603
aces+1.587
Neg logit contributions
 last-0.805
 Ms-0.661
 teach-0.581
 front-0.576
 hugs-0.561
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
 last+0.226
 Ms+0.185
 hugs+0.153
 teach+0.152
't+0.149
Neg logit contributions
ust-0.620
nes-0.614
vel-0.601
onel-0.558
ode-0.553
Token bench
Token ID7624
Feature activation+6.28e-01

Pos logit contributions
 last+0.166
 Ms+0.152
 teach+0.139
 hugs+0.136
 praise+0.134
Neg logit contributions
ust-0.181
nes-0.175
vel-0.171
onel-0.157
ode-0.156
Token.
Token ID13
Feature activation+9.05e-02

Pos logit contributions
 last+0.721
 Ms+0.688
 teach+0.596
 hugs+0.586
't+0.576
Neg logit contributions
ust-1.255
nes-1.211
vel-1.180
adow-1.112
aces-1.095
Token Lily
Token ID20037
Feature activation+9.02e-01

Pos logit contributions
 last+0.130
 Ms+0.110
 front+0.102
 teach+0.102
 hugs+0.100
Neg logit contributions
ust-0.162
nes-0.160
vel-0.155
aces-0.143
onel-0.141
Token's
Token ID338
Feature activation+6.75e-01

Pos logit contributions
 last+1.173
 Ms+1.000
�+0.940
't+0.925
 teach+0.924
Neg logit contributions
ust-1.637
nes-1.615
vel-1.590
aces-1.516
oll-1.511
Token mom
Token ID1995
Feature activation+1.95e+00

Pos logit contributions
 last+0.991
 Ms+0.953
't+0.840
 teach+0.819
�+0.811
Neg logit contributions
ust-1.101
nes-1.056
vel-1.053
ode-0.986
aces-0.984
Token has
Token ID468
Feature activation-1.01e-01

Pos logit contributions
 last+2.579
�+2.345
 Ms+2.307
't+2.178
 while+2.149
Neg logit contributions
vel-3.254
ust-3.184
nes-3.141
oll-3.057
ode-3.038
Token a
Token ID257
Feature activation+2.90e-01

Pos logit contributions
ust+0.209
nes+0.205
vel+0.203
onel+0.187
aces+0.187
Neg logit contributions
 last-0.116
 Ms-0.099
 teach-0.087
 front-0.085
 hugs-0.084
Token bag
Token ID6131
Feature activation+5.43e-01

Pos logit contributions
 last+0.406
 Ms+0.368
't+0.332
 front+0.332
 teach+0.327
Neg logit contributions
ust-0.507
vel-0.490
nes-0.484
oll-0.444
ode-0.440
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
 last+0.603
 Ms+0.565
 teach+0.479
 hugs+0.472
 praise+0.466
Neg logit contributions
ust-1.115
nes-1.097
vel-1.081
oll-1.011
aces-1.008
Token She
Token ID1375
Feature activation+1.44e+00

Pos logit contributions
 last+0.263
 Ms+0.216
 front+0.202
 teach+0.200
 hugs+0.196
Neg logit contributions
ust-0.352
nes-0.345
vel-0.340
aces-0.313
onel-0.312
TokenOh
Token ID5812
Feature activation+8.72e-02

Pos logit contributions
ust+0.097
nes+0.094
vel+0.094
onel+0.085
ode+0.084
Neg logit contributions
 last-0.071
 Ms-0.063
 teach-0.057
 hugs-0.056
 front-0.056
Token no
Token ID645
Feature activation+1.44e+00

Pos logit contributions
 last+0.125
 Ms+0.112
 teach+0.102
 front+0.100
 hugs+0.098
Neg logit contributions
ust-0.157
nes-0.150
vel-0.150
onel-0.138
oll-0.134
Token!"
Token ID2474
Feature activation+2.33e-01

Pos logit contributions
 last+1.573
 Ms+1.490
 front+1.226
 teach+1.223
�+1.195
Neg logit contributions
ust-2.986
vel-2.979
nes-2.801
oll-2.610
adow-2.603
Token
Token ID198
Feature activation+7.30e-01

Pos logit contributions
 last+0.257
 Ms+0.214
 teach+0.193
 front+0.186
 hugs+0.185
Neg logit contributions
ust-0.489
nes-0.475
vel-0.468
aces-0.434
onel-0.433
Token
Token ID198
Feature activation+7.13e-01

Pos logit contributions
 last+0.860
 Ms+0.723
 teach+0.674
 hugs+0.638
 praise+0.625
Neg logit contributions
ust-1.476
nes-1.449
vel-1.417
onel-1.288
oll-1.285
TokenThe
Token ID464
Feature activation+2.48e+00

Pos logit contributions
 last+1.223
 Ms+1.101
 teach+1.038
 hugs+1.031
 praise+1.002
Neg logit contributions
ust-1.086
nes-1.041
vel-1.025
ode-0.887
onel-0.881
Token bear
Token ID6842
Feature activation-1.68e-01

Pos logit contributions
�+3.671
 last+3.523
 Ms+3.514
ted+3.280
 Together+3.110
Neg logit contributions
ust-3.682
vel-3.590
nes-3.508
onel-3.452
aces-3.437
Token was
Token ID373
Feature activation-3.89e-01

Pos logit contributions
ust+0.313
nes+0.302
vel+0.300
onel+0.277
aces+0.273
Neg logit contributions
 last-0.229
 Ms-0.202
 front-0.182
 teach-0.181
 hugs-0.176
Token sorry
Token ID7926
Feature activation-6.55e-01

Pos logit contributions
ust+0.731
nes+0.708
vel+0.703
onel+0.647
aces+0.638
Neg logit contributions
 last-0.534
 Ms-0.468
 teach-0.423
 front-0.421
 hugs-0.415
Token.
Token ID13
Feature activation-6.96e-01

Pos logit contributions
ust+1.452
nes+1.425
vel+1.407
onel+1.313
aces+1.299
Neg logit contributions
 last-0.696
 Ms-0.573
 teach-0.501
 front-0.500
 hugs-0.494
Token He
Token ID679
Feature activation+2.02e-02

Pos logit contributions
ust+1.380
nes+1.343
vel+1.333
onel+1.227
ode+1.210
Neg logit contributions
 last-0.899
 Ms-0.785
 front-0.706
 teach-0.700
 hugs-0.689
Token pumpkin
Token ID30089
Feature activation-6.44e-01

Pos logit contributions
ust+1.096
nes+1.058
vel+1.047
onel+0.959
aces+0.942
Neg logit contributions
 last-0.915
 Ms-0.802
 front-0.739
 teach-0.736
 hugs-0.725
Token.
Token ID13
Feature activation+2.44e-02

Pos logit contributions
ust+1.422
nes+1.383
vel+1.372
onel+1.278
aces+1.257
Neg logit contributions
 last-0.684
 Ms-0.564
 front-0.498
 teach-0.495
 hugs-0.485
Token When
Token ID1649
Feature activation+6.13e-01

Pos logit contributions
 last+0.029
 Ms+0.023
 teach+0.021
 front+0.021
 hugs+0.021
Neg logit contributions
ust-0.050
nes-0.049
vel-0.048
aces-0.044
onel-0.044
Token he
Token ID339
Feature activation+2.70e-01

Pos logit contributions
 last+0.688
 Ms+0.587
 teach+0.542
 hugs+0.542
 praise+0.530
Neg logit contributions
ust-1.223
nes-1.219
vel-1.197
oll-1.085
onel-1.085
Token got
Token ID1392
Feature activation-1.40e-01

Pos logit contributions
 last+0.353
 Ms+0.313
 teach+0.281
 front+0.278
 hugs+0.277
Neg logit contributions
ust-0.504
nes-0.496
vel-0.489
onel-0.445
oll-0.442
Token home
Token ID1363
Feature activation+1.02e+00

Pos logit contributions
ust+0.229
nes+0.223
vel+0.220
onel+0.199
ode+0.196
Neg logit contributions
 last-0.222
 Ms-0.202
 teach-0.186
 front-0.182
 hugs-0.181
Token,
Token ID11
Feature activation+3.25e-01

Pos logit contributions
 last+1.019
 Ms+0.902
 teach+0.803
 hugs+0.793
 praise+0.786
Neg logit contributions
ust-2.131
nes-2.114
vel-2.079
onel-1.951
ying-1.939
Token he
Token ID339
Feature activation+4.73e-01

Pos logit contributions
 last+0.390
 Ms+0.327
 hugs+0.302
 teach+0.298
 front+0.294
Neg logit contributions
ust-0.644
nes-0.623
vel-0.620
onel-0.570
oll-0.563
Token called
Token ID1444
Feature activation-4.13e-01

Pos logit contributions
 last+0.593
 Ms+0.505
 front+0.468
 teach+0.457
 praise+0.450
Neg logit contributions
ust-0.916
nes-0.906
vel-0.884
onel-0.809
oll-0.805
Token for
Token ID329
Feature activation+4.08e-01

Pos logit contributions
ust+0.781
nes+0.759
vel+0.754
onel+0.692
aces+0.683
Neg logit contributions
 last-0.560
 Ms-0.488
 front-0.441
 teach-0.440
 hugs-0.433
Token his
Token ID465
Feature activation+3.32e-01

Pos logit contributions
 last+0.490
 Ms+0.428
�+0.391
't+0.377
 teach+0.371
Neg logit contributions
ust-0.793
nes-0.783
vel-0.782
onel-0.709
oll-0.707
Token They
Token ID1119
Feature activation-2.74e-01

Pos logit contributions
ust+0.192
nes+0.186
vel+0.183
onel+0.169
aces+0.169
Neg logit contributions
 last-0.125
 Ms-0.105
 teach-0.097
 front-0.096
 hugs-0.094
Token were
Token ID547
Feature activation-4.71e-01

Pos logit contributions
ust+0.480
nes+0.465
vel+0.457
onel+0.420
aces+0.415
Neg logit contributions
 last-0.405
 Ms-0.361
 front-0.329
 teach-0.327
 hugs-0.323
Token friends
Token ID2460
Feature activation-3.22e-01

Pos logit contributions
ust+0.891
nes+0.863
vel+0.857
onel+0.788
aces+0.778
Neg logit contributions
 last-0.641
 Ms-0.559
 front-0.505
 teach-0.505
 hugs-0.497
Token.
Token ID13
Feature activation+9.39e-03

Pos logit contributions
ust+0.666
nes+0.648
vel+0.643
onel+0.597
aces+0.589
Neg logit contributions
 last-0.378
 Ms-0.324
 front-0.287
 teach-0.285
 hugs-0.280
Token
Token ID198
Feature activation+2.11e-01

Pos logit contributions
 last+0.011
 Ms+0.009
 teach+0.008
 front+0.008
 hugs+0.008
Neg logit contributions
ust-0.020
nes-0.019
vel-0.019
onel-0.017
aces-0.017
Token
Token ID198
Feature activation+4.62e-01

Pos logit contributions
 last+0.310
 Ms+0.272
 teach+0.254
 front+0.248
 hugs+0.245
Neg logit contributions
ust-0.366
nes-0.361
vel-0.353
onel-0.319
oll-0.316
TokenThe
Token ID464
Feature activation+6.58e-01

Pos logit contributions
 last+0.663
 Ms+0.572
 teach+0.535
 hugs+0.534
�+0.524
Neg logit contributions
ust-0.831
nes-0.829
vel-0.808
onel-0.708
ode-0.705
Token moral
Token ID6573
Feature activation+4.05e-01

Pos logit contributions
 last+1.048
 Ms+0.999
�+0.931
't+0.898
 hugs+0.886
Neg logit contributions
ust-0.988
nes-0.955
vel-0.954
onel-0.878
ode-0.859
Token of
Token ID286
Feature activation+1.02e-01

Pos logit contributions
 last+0.445
 Ms+0.409
�+0.359
 teach+0.346
 front+0.345
Neg logit contributions
ust-0.836
nes-0.813
vel-0.787
aces-0.745
oll-0.739
Token the
Token ID262
Feature activation+3.84e-01

Pos logit contributions
 last+0.080
 Ms+0.064
 teach+0.054
 front+0.053
�+0.053
Neg logit contributions
ust-0.239
nes-0.237
vel-0.234
onel-0.218
oll-0.216
Token story
Token ID1621
Feature activation-3.67e-01

Pos logit contributions
 last+0.770
 Ms+0.717
 front+0.673
�+0.671
 teach+0.657
Neg logit contributions
ust-0.444
vel-0.424
nes-0.424
ode-0.360
oll-0.360
Token tower
Token ID10580
Feature activation-2.10e-01

Pos logit contributions
ust+0.334
nes+0.319
vel+0.316
onel+0.285
ode+0.281
Neg logit contributions
 last-0.382
 Ms-0.350
 teach-0.321
 front-0.320
 hugs-0.318
Token now
Token ID783
Feature activation-1.70e-01

Pos logit contributions
ust+0.428
nes+0.415
vel+0.412
onel+0.381
aces+0.378
Neg logit contributions
 last-0.248
 Ms-0.218
 front-0.191
 teach-0.191
 hugs-0.190
Token?"
Token ID1701
Feature activation-1.10e-01

Pos logit contributions
ust+0.344
nes+0.333
vel+0.332
onel+0.305
aces+0.303
Neg logit contributions
 last-0.206
 Ms-0.181
 front-0.159
 teach-0.159
 hugs-0.158
Token
Token ID198
Feature activation-9.39e-02

Pos logit contributions
ust+0.233
nes+0.229
vel+0.226
onel+0.210
aces+0.209
Neg logit contributions
 last-0.120
 Ms-0.097
 front-0.087
 teach-0.087
 hugs-0.085
Token
Token ID198
Feature activation-4.42e-01

Pos logit contributions
ust+0.207
nes+0.203
vel+0.201
onel+0.186
aces+0.184
Neg logit contributions
 last-0.097
 Ms-0.079
 teach-0.070
 front-0.069
 hugs-0.068
TokenTom
Token ID13787
Feature activation+3.75e-01

Pos logit contributions
ust+0.875
nes+0.851
vel+0.844
onel+0.777
aces+0.768
Neg logit contributions
 last-0.562
 Ms-0.484
 teach-0.434
 front-0.433
 hugs-0.427
Token looks
Token ID3073
Feature activation-3.11e-01

Pos logit contributions
 last+0.475
 Ms+0.414
�+0.367
 teach+0.367
 front+0.365
Neg logit contributions
ust-0.734
vel-0.711
nes-0.702
aces-0.660
ode-0.650
Token at
Token ID379
Feature activation+2.60e-01

Pos logit contributions
ust+0.489
nes+0.468
vel+0.467
onel+0.422
aces+0.415
Neg logit contributions
 last-0.520
 Ms-0.466
 front-0.431
 teach-0.430
't-0.422
Token the
Token ID262
Feature activation+3.48e-01

Pos logit contributions
 last+0.254
 Ms+0.221
 teach+0.194
 hugs+0.186
�+0.183
Neg logit contributions
ust-0.568
nes-0.554
vel-0.542
aces-0.510
onel-0.509
Token stick
Token ID4859
Feature activation-7.11e-01

Pos logit contributions
 last+0.532
 Ms+0.501
 teach+0.451
 hugs+0.447
�+0.440
Neg logit contributions
ust-0.565
vel-0.542
nes-0.532
onel-0.495
aces-0.494
Token.
Token ID13
Feature activation-8.72e-01

Pos logit contributions
ust+1.564
nes+1.530
vel+1.512
onel+1.411
aces+1.388
Neg logit contributions
 last-0.753
 Ms-0.614
 front-0.546
 teach-0.542
 hugs-0.525
Token without
Token ID1231
Feature activation-1.49e+00

Pos logit contributions
ust+0.612
nes+0.594
vel+0.590
onel+0.546
aces+0.538
Neg logit contributions
 last-0.370
 Ms-0.323
 front-0.287
 teach-0.285
 hugs-0.280
Token asking
Token ID4737
Feature activation+2.02e-01

Pos logit contributions
ust+3.233
nes+3.210
vel+3.072
onel+2.929
oll+2.898
Neg logit contributions
 last-1.714
 Ms-1.415
 teach-1.285
 hugs-1.274
 front-1.263
Token and
Token ID290
Feature activation-4.18e-01

Pos logit contributions
 last+0.244
 Ms+0.213
�+0.192
 front+0.191
 teach+0.186
Neg logit contributions
ust-0.406
vel-0.393
nes-0.389
aces-0.360
onel-0.360
Token to
Token ID284
Feature activation-9.78e-01

Pos logit contributions
ust+0.859
nes+0.834
vel+0.828
onel+0.768
aces+0.757
Neg logit contributions
 last-0.497
 Ms-0.427
 front-0.379
 teach-0.376
 hugs-0.370
Token always
Token ID1464
Feature activation-6.56e-01

Pos logit contributions
ust+1.950
nes+1.915
vel+1.905
onel+1.766
aces+1.749
Neg logit contributions
 last-1.262
 Ms-1.049
 teach-0.984
 front-0.951
 hugs-0.945
Token listen
Token ID6004
Feature activation+1.02e+00

Pos logit contributions
ust+1.365
nes+1.331
vel+1.319
onel+1.227
aces+1.214
Neg logit contributions
 last-0.774
 Ms-0.650
 teach-0.588
 front-0.581
 hugs-0.572
Token to
Token ID284
Feature activation-7.42e-01

Pos logit contributions
 last+1.219
 Ms+1.135
�+1.068
 front+0.990
 teach+0.953
Neg logit contributions
ust-2.071
vel-1.987
nes-1.970
onel-1.785
aces-1.765
Token his
Token ID465
Feature activation+1.65e-01

Pos logit contributions
ust+1.394
nes+1.358
vel+1.339
onel+1.235
aces+1.220
Neg logit contributions
 last-1.040
 Ms-0.907
 teach-0.828
 front-0.821
 hugs-0.812
Token mom
Token ID1995
Feature activation+1.89e-01

Pos logit contributions
 last+0.221
 Ms+0.203
 front+0.175
't+0.174
�+0.174
Neg logit contributions
ust-0.313
nes-0.299
vel-0.296
onel-0.273
ode-0.271
Tokenmy
Token ID1820
Feature activation+1.22e-01

Pos logit contributions
 last+0.171
 Ms+0.147
 front+0.125
�+0.120
 teach+0.119
Neg logit contributions
ust-0.437
vel-0.422
nes-0.419
ode-0.391
aces-0.390
Token.
Token ID13
Feature activation-8.22e-02

Pos logit contributions
 last+0.140
 Ms+0.122
 front+0.108
 teach+0.105
 hugs+0.104
Neg logit contributions
ust-0.253
vel-0.246
nes-0.244
onel-0.225
aces-0.225
TokenBut
Token ID1537
Feature activation-4.11e-01

Pos logit contributions
ust+0.551
nes+0.538
vel+0.534
onel+0.493
aces+0.484
Neg logit contributions
 last-0.295
 Ms-0.247
 front-0.218
 teach-0.218
 hugs-0.216
Token then
Token ID788
Feature activation-2.38e-01

Pos logit contributions
ust+0.959
nes+0.936
vel+0.930
onel+0.869
aces+0.859
Neg logit contributions
 last-0.374
 Ms-0.303
 front-0.257
 teach-0.256
 hugs-0.251
Token,
Token ID11
Feature activation-4.20e-02

Pos logit contributions
ust+0.547
nes+0.535
vel+0.533
onel+0.497
aces+0.491
Neg logit contributions
 last-0.222
 Ms-0.179
 front-0.154
 teach-0.153
 hugs-0.148
Token something
Token ID1223
Feature activation-7.82e-01

Pos logit contributions
ust+0.093
nes+0.091
vel+0.091
onel+0.084
aces+0.083
Neg logit contributions
 last-0.042
 Ms-0.034
 front-0.030
 teach-0.030
 hugs-0.030
Token strange
Token ID6283
Feature activation-7.88e-02

Pos logit contributions
ust+1.316
nes+1.275
vel+1.270
onel+1.147
ode+1.142
Neg logit contributions
 last-1.235
 Ms-1.091
 teach-1.002
 front-0.993
 hugs-0.990
Token happened
Token ID3022
Feature activation+5.54e-01

Pos logit contributions
ust+0.117
nes+0.112
vel+0.111
onel+0.100
aces+0.099
Neg logit contributions
 last-0.136
 Ms-0.125
 front-0.116
 teach-0.114
 hugs-0.113
Token.
Token ID13
Feature activation-7.15e-01

Pos logit contributions
 last+0.645
 Ms+0.568
 teach+0.491
 praise+0.489
 hugs+0.484
Neg logit contributions
ust-1.138
vel-1.122
nes-1.096
onel-1.018
aces-1.009
Token The
Token ID383
Feature activation+1.49e+00

Pos logit contributions
ust+1.661
nes+1.618
vel+1.610
onel+1.506
ode+1.492
Neg logit contributions
 last-0.661
 Ms-0.550
 front-0.462
 teach-0.456
 hugs-0.447
Token stack
Token ID8931
Feature activation-3.19e-01

Pos logit contributions
 last+1.959
 Ms+1.896
't+1.725
ted+1.683
�+1.636
Neg logit contributions
ust-2.596
vel-2.559
nes-2.431
onel-2.365
oll-2.353
Token started
Token ID2067
Feature activation+8.61e-01

Pos logit contributions
ust+0.648
nes+0.631
vel+0.625
onel+0.580
aces+0.574
Neg logit contributions
 last-0.381
 Ms-0.331
 teach-0.292
 front-0.292
 hugs-0.287
Token to
Token ID284
Feature activation+4.44e-01

Pos logit contributions
 last+0.989
 Ms+0.873
 teach+0.754
 hugs+0.745
�+0.740
Neg logit contributions
ust-1.737
vel-1.687
nes-1.667
onel-1.553
aces-1.527
Token would
Token ID561
Feature activation-2.14e-01

Pos logit contributions
ust+0.736
nes+0.717
vel+0.709
onel+0.655
aces+0.647
Neg logit contributions
 last-0.494
 Ms-0.430
 front-0.386
 teach-0.385
 hugs-0.378
Token bow
Token ID9563
Feature activation-1.42e-01

Pos logit contributions
ust+0.384
nes+0.374
vel+0.370
onel+0.337
aces+0.333
Neg logit contributions
 last-0.306
 Ms-0.272
 front-0.245
 teach-0.243
 hugs-0.241
Token every
Token ID790
Feature activation-2.44e-01

Pos logit contributions
ust+0.261
nes+0.254
vel+0.252
onel+0.230
aces+0.229
Neg logit contributions
 last-0.197
 Ms-0.174
 teach-0.157
 front-0.157
 hugs-0.154
Token day
Token ID1110
Feature activation-2.92e-01

Pos logit contributions
ust+0.470
nes+0.458
vel+0.454
onel+0.417
aces+0.411
Neg logit contributions
 last-0.314
 Ms-0.279
 teach-0.248
 front-0.245
 hugs-0.244
Token for
Token ID329
Feature activation-8.04e-01

Pos logit contributions
ust+0.626
nes+0.612
vel+0.608
onel+0.563
aces+0.559
Neg logit contributions
 last-0.317
 Ms-0.268
 teach-0.235
 front-0.234
 hugs-0.230
Token fun
Token ID1257
Feature activation-9.68e-01

Pos logit contributions
ust+1.743
nes+1.684
vel+1.664
onel+1.560
ode+1.545
Neg logit contributions
 last-0.907
 Ms-0.760
 front-0.675
 teach-0.670
 hugs-0.653
Token.
Token ID13
Feature activation-5.27e-01

Pos logit contributions
ust+2.185
nes+2.117
vel+2.095
onel+1.972
ode+1.947
Neg logit contributions
 last-0.977
 Ms-0.796
 front-0.701
 teach-0.700
 hugs-0.690
Token<|endoftext|>
Token ID50256
Feature activation-2.63e-01

Pos logit contributions
ust+1.140
nes+1.109
vel+1.102
onel+1.025
aces+1.013
Neg logit contributions
 last-0.574
 Ms-0.483
 front-0.422
 teach-0.421
 hugs-0.414
TokenOnce
Token ID7454
Feature activation-4.47e-01

Pos logit contributions
ust+0.592
nes+0.580
vel+0.575
onel+0.535
aces+0.533
Neg logit contributions
 last-0.255
 Ms-0.209
 teach-0.182
 front-0.178
 hugs-0.175
Token upon
Token ID2402
Feature activation-9.10e-03

Pos logit contributions
ust+1.052
nes+1.025
vel+1.018
onel+0.954
aces+0.943
Neg logit contributions
 last-0.397
 Ms-0.320
 teach-0.270
 front-0.269
 hugs-0.262
Token a
Token ID257
Feature activation-2.47e-02

Pos logit contributions
ust+0.021
nes+0.020
vel+0.020
aces+0.019
onel+0.019
Neg logit contributions
 last-0.008
 Ms-0.007
 teach-0.006
 front-0.006
 hugs-0.006
Token picture
Token ID4286
Feature activation-3.55e-01

Pos logit contributions
ust+0.458
nes+0.448
vel+0.443
onel+0.408
aces+0.406
Neg logit contributions
 last-0.306
 Ms-0.267
 teach-0.238
 front-0.238
't-0.233
Token on
Token ID319
Feature activation-3.25e-02

Pos logit contributions
ust+0.624
nes+0.605
vel+0.600
onel+0.549
aces+0.545
Neg logit contributions
 last-0.523
 Ms-0.465
 teach-0.424
 front-0.424
 hugs-0.415
Token his
Token ID465
Feature activation-9.40e-01

Pos logit contributions
ust+0.066
nes+0.065
vel+0.065
aces+0.060
onel+0.059
Neg logit contributions
 last-0.038
 Ms-0.032
 teach-0.029
 hugs-0.028
 front-0.028
Token shelf
Token ID18316
Feature activation+1.45e-01

Pos logit contributions
ust+1.585
vel+1.506
nes+1.502
onel+1.368
oll+1.330
Neg logit contributions
 last-1.536
 Ms-1.337
 front-1.274
 teach-1.262
 hugs-1.248
Token,
Token ID11
Feature activation+5.77e-02

Pos logit contributions
 last+0.158
 Ms+0.141
 teach+0.122
 hugs+0.119
 front+0.119
Neg logit contributions
ust-0.302
nes-0.296
vel-0.293
aces-0.275
onel-0.272
Token next
Token ID1306
Feature activation-6.99e-02

Pos logit contributions
 last+0.071
 Ms+0.061
 front+0.055
 teach+0.054
't+0.053
Neg logit contributions
ust-0.113
nes-0.110
vel-0.110
onel-0.101
aces-0.101
Token to
Token ID284
Feature activation-4.99e-01

Pos logit contributions
ust+0.130
nes+0.126
vel+0.125
onel+0.113
aces+0.113
Neg logit contributions
 last-0.094
 Ms-0.086
 teach-0.077
 front-0.075
 hugs-0.074
Token his
Token ID465
Feature activation+2.97e-01

Pos logit contributions
ust+0.992
nes+0.963
vel+0.956
onel+0.883
aces+0.872
Neg logit contributions
 last-0.632
 Ms-0.545
 front-0.488
 teach-0.487
 hugs-0.479
Token cars
Token ID5006
Feature activation-7.03e-01

Pos logit contributions
 last+0.441
 Ms+0.402
't+0.363
�+0.354
 teach+0.352
Neg logit contributions
ust-0.500
nes-0.490
vel-0.484
onel-0.444
ode-0.434
Token and
Token ID290
Feature activation-5.64e-01

Pos logit contributions
ust+1.564
nes+1.516
vel+1.510
onel+1.406
aces+1.384
Neg logit contributions
 last-0.732
 Ms-0.598
 front-0.525
 teach-0.525
 hugs-0.511
Token trucks
Token ID13960
Feature activation-1.35e-01

Pos logit contributions
ust+1.049
nes+1.015
vel+1.008
onel+0.925
aces+0.911
Neg logit contributions
 last-0.785
 Ms-0.686
 teach-0.623
 front-0.623
 hugs-0.615
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
 last+1.324
 Ms+1.250
 praise+1.149
 hugs+1.139
 teach+1.136
Neg logit contributions
ust-1.092
vel-1.063
nes-1.047
aces-0.955
onel-0.930
Token especially
Token ID2592
Feature activation-4.08e-01

Pos logit contributions
 last+0.152
 Ms+0.135
 front+0.122
 hugs+0.121
 teach+0.121
Neg logit contributions
ust-0.203
nes-0.196
vel-0.194
ode-0.177
aces-0.176
Token in
Token ID287
Feature activation+4.63e-01

Pos logit contributions
ust+0.847
nes+0.822
vel+0.817
onel+0.756
aces+0.750
Neg logit contributions
 last-0.482
 Ms-0.412
 teach-0.365
 front-0.365
 hugs-0.358
Token the
Token ID262
Feature activation-9.37e-01

Pos logit contributions
 last+0.437
 Ms+0.368
 teach+0.326
�+0.314
 hugs+0.313
Neg logit contributions
ust-1.008
vel-0.987
nes-0.981
onel-0.910
ode-0.908
Token mud
Token ID17492
Feature activation-4.28e-01

Pos logit contributions
ust+1.699
nes+1.662
vel+1.638
onel+1.498
aces+1.485
Neg logit contributions
 last-1.388
 Ms-1.207
 front-1.133
 hugs-1.099
 teach-1.091
Token.
Token ID13
Feature activation-5.37e-02

Pos logit contributions
ust+0.929
nes+0.904
vel+0.898
onel+0.836
aces+0.829
Neg logit contributions
 last-0.458
 Ms-0.387
 teach-0.336
 front-0.335
 hugs-0.329
Token One
Token ID1881
Feature activation+3.61e-02

Pos logit contributions
ust+0.116
nes+0.112
vel+0.111
aces+0.103
onel+0.103
Neg logit contributions
 last-0.058
 Ms-0.048
 teach-0.042
 front-0.042
 hugs-0.042
Token day
Token ID1110
Feature activation+6.99e-01

Pos logit contributions
 last+0.048
 Ms+0.044
 teach+0.039
 hugs+0.039
 front+0.039
Neg logit contributions
ust-0.068
nes-0.065
vel-0.064
onel-0.059
oll-0.058
Token,
Token ID11
Feature activation+1.58e+00

Pos logit contributions
 last+0.731
 Ms+0.637
 teach+0.559
 front+0.549
 praise+0.541
Neg logit contributions
ust-1.485
vel-1.467
nes-1.418
oll-1.347
aund-1.344
Token Tim
Token ID5045
Feature activation-7.00e-01

Pos logit contributions
 last+2.237
 Ms+2.033
 hugs+2.004
 praise+1.965
 teach+1.892
Neg logit contributions
ust-2.626
vel-2.436
nes-2.433
aund-2.263
oll-2.187
Tokenmy
Token ID1820
Feature activation-1.41e-01

Pos logit contributions
ust+1.737
nes+1.703
vel+1.693
onel+1.591
aces+1.573
Neg logit contributions
 last-0.548
 Ms-0.415
 front-0.341
 teach-0.335
 hugs-0.329
Token in
Token ID287
Feature activation+2.39e-01

Pos logit contributions
ust+0.244
nes+0.238
vel+0.235
onel+0.218
aces+0.216
Neg logit contributions
 last-0.148
 Ms-0.130
 front-0.117
 teach-0.114
 hugs-0.112
Token nature
Token ID3450
Feature activation-1.89e-01

Pos logit contributions
 last+0.253
 Ms+0.214
 teach+0.189
 hugs+0.189
�+0.184
Neg logit contributions
ust-0.507
nes-0.493
vel-0.491
onel-0.452
aces-0.448
Token.
Token ID13
Feature activation-4.49e-01

Pos logit contributions
ust+0.409
nes+0.399
vel+0.396
onel+0.367
aces+0.365
Neg logit contributions
 last-0.199
 Ms-0.168
 front-0.147
 teach-0.146
 hugs-0.142
Token
Token ID198
Feature activation-4.26e-01

Pos logit contributions
ust+0.947
nes+0.921
vel+0.914
onel+0.848
aces+0.839
Neg logit contributions
 last-0.512
 Ms-0.432
 teach-0.382
 front-0.382
 hugs-0.374
Token
Token ID198
Feature activation-6.45e-01

Pos logit contributions
ust+0.900
nes+0.877
vel+0.870
onel+0.807
aces+0.797
Neg logit contributions
 last-0.483
 Ms-0.408
 teach-0.361
 front-0.360
 hugs-0.353
TokenP
Token ID47
Feature activation-1.47e+00

Pos logit contributions
ust+1.408
nes+1.371
vel+1.362
onel+1.273
aces+1.264
Neg logit contributions
 last-0.687
 Ms-0.576
 front-0.503
 teach-0.500
 hugs-0.485
Tokenray
Token ID2433
Feature activation+3.95e-01

Pos logit contributions
ust+2.492
nes+2.417
vel+2.414
ode+2.196
onel+2.191
Neg logit contributions
 last-2.264
 Ms-1.965
 hugs-1.816
 teach-1.802
 praise-1.780
Tokening
Token ID278
Feature activation+1.15e-01

Pos logit contributions
 last+0.458
 Ms+0.399
 front+0.358
 teach+0.348
�+0.346
Neg logit contributions
ust-0.796
nes-0.785
vel-0.782
onel-0.724
aces-0.710
Token made
Token ID925
Feature activation+2.51e-01

Pos logit contributions
 last+0.144
 Ms+0.125
 front+0.113
 teach+0.113
�+0.109
Neg logit contributions
ust-0.223
nes-0.219
vel-0.216
onel-0.201
aces-0.199
Token the
Token ID262
Feature activation+7.70e-01

Pos logit contributions
 last+0.325
 Ms+0.281
�+0.259
 front+0.259
 teach+0.257
Neg logit contributions
ust-0.478
nes-0.465
vel-0.463
onel-0.427
aces-0.417
Token little
Token ID1310
Feature activation+6.69e-01

Pos logit contributions
 last+1.221
 Ms+1.128
't+1.065
�+1.040
 hugs+1.030
Neg logit contributions
ust-1.189
vel-1.144
nes-1.140
onel-1.043
ode-1.021
Tokenily
Token ID813
Feature activation+4.22e-01

Pos logit contributions
ust+1.129
nes+1.085
vel+1.048
ode+0.886
onel+0.876
Neg logit contributions
 last-2.980
 Ms-2.726
 hugs-2.587
 teach-2.586
 front-2.558
Token didn
Token ID1422
Feature activation+8.07e-01

Pos logit contributions
 last+0.553
 Ms+0.484
 front+0.439
 teach+0.429
't+0.426
Neg logit contributions
ust-0.800
nes-0.784
vel-0.770
aces-0.702
onel-0.701
Token't
Token ID470
Feature activation-1.14e+00

Pos logit contributions
 last+0.409
 Ms+0.299
 hugs+0.231
 praise+0.199
 teach+0.180
Neg logit contributions
ust-2.165
nes-2.150
vel-2.127
oll-1.983
onel-1.970
Token care
Token ID1337
Feature activation-1.19e+00

Pos logit contributions
ust+2.147
nes+2.087
vel+2.078
onel+1.938
ode+1.907
Neg logit contributions
 last-1.617
 Ms-1.364
 teach-1.305
 hugs-1.226
 front-1.223
Token about
Token ID546
Feature activation+2.57e-02

Pos logit contributions
ust+2.367
nes+2.338
vel+2.252
ode+2.144
onel+2.110
Neg logit contributions
 last-1.501
 Ms-1.272
 teach-1.153
't-1.143
 hugs-1.123
Token that
Token ID326
Feature activation-6.40e-01

Pos logit contributions
 last+0.026
 Ms+0.021
 front+0.018
 teach+0.018
 hugs+0.018
Neg logit contributions
ust-0.057
vel-0.055
nes-0.055
onel-0.051
aces-0.051
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
ust+1.386
nes+1.351
vel+1.337
onel+1.245
aces+1.229
Neg logit contributions
 last-0.709
 Ms-0.597
 teach-0.524
 front-0.522
 hugs-0.510
Token She
Token ID1375
Feature activation-3.68e-01

Pos logit contributions
ust+0.498
nes+0.481
vel+0.478
onel+0.442
aces+0.440
Neg logit contributions
 last-0.319
 Ms-0.272
 front-0.246
 teach-0.246
 hugs-0.240
Token said
Token ID531
Feature activation-9.07e-01

Pos logit contributions
ust+0.714
nes+0.692
vel+0.686
onel+0.631
aces+0.624
Neg logit contributions
 last-0.481
 Ms-0.418
 front-0.377
 teach-0.374
 hugs-0.368
Token,
Token ID11
Feature activation-6.73e-01

Pos logit contributions
ust+2.251
nes+2.199
vel+2.165
onel+2.037
ode+2.033
Neg logit contributions
 last-0.718
 Ms-0.567
 teach-0.467
 hugs-0.463
 front-0.448
Token "
Token ID366
Feature activation+8.21e-02

Pos logit contributions
ust+1.446
nes+1.406
vel+1.390
onel+1.297
aces+1.279
Neg logit contributions
 last-0.747
 Ms-0.637
 teach-0.558
 front-0.555
 hugs-0.544
Token
Token ID198
Feature activation-1.74e-02

Pos logit contributions
ust+0.282
nes+0.274
vel+0.271
onel+0.252
aces+0.250
Neg logit contributions
 last-0.158
 Ms-0.130
 teach-0.119
 front-0.118
 hugs-0.115
Token
Token ID198
Feature activation-4.21e-01

Pos logit contributions
ust+0.038
nes+0.038
vel+0.037
onel+0.034
aces+0.034
Neg logit contributions
 last-0.018
 Ms-0.015
 teach-0.013
 front-0.013
 hugs-0.013
Token"
Token ID1
Feature activation-4.85e-01

Pos logit contributions
ust+0.841
nes+0.817
vel+0.811
onel+0.747
aces+0.738
Neg logit contributions
 last-0.529
 Ms-0.454
 teach-0.407
 front-0.406
 hugs-0.400
TokenTim
Token ID14967
Feature activation+8.38e-01

Pos logit contributions
ust+0.938
nes+0.910
vel+0.903
onel+0.831
aces+0.821
Neg logit contributions
 last-0.648
 Ms-0.563
 front-0.507
 teach-0.507
 hugs-0.499
Tokenmy
Token ID1820
Feature activation+1.18e+00

Pos logit contributions
 last+0.811
 Ms+0.735
 teach+0.627
�+0.620
 front+0.616
Neg logit contributions
ust-1.844
vel-1.823
nes-1.763
ode-1.684
aces-1.672
Token,
Token ID11
Feature activation-1.86e+00

Pos logit contributions
 last+1.243
 Ms+1.124
 front+1.003
�+0.936
 praise+0.918
Neg logit contributions
vel-2.489
ust-2.460
nes-2.380
onel-2.267
ode-2.257
Token you
Token ID345
Feature activation-2.17e-01

Pos logit contributions
ust+4.112
nes+3.984
vel+3.942
aces+3.749
onel+3.636
Neg logit contributions
 last-2.231
 Ms-1.790
 teach-1.658
't-1.625
 hugs-1.608
Token broke
Token ID6265
Feature activation-3.48e-01

Pos logit contributions
ust+0.392
nes+0.378
vel+0.376
onel+0.345
aces+0.338
Neg logit contributions
 last-0.313
 Ms-0.280
 front-0.253
 teach-0.250
 hugs-0.247
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
ust+0.800
nes+0.781
vel+0.777
onel+0.723
aces+0.717
Neg logit contributions
 last-0.327
 Ms-0.267
 teach-0.230
 front-0.224
 hugs-0.221
Token computer
Token ID3644
Feature activation-1.38e-01

Pos logit contributions
 last+2.213
 Ms+2.134
�+1.936
't+1.927
 hugs+1.920
Neg logit contributions
vel-1.394
ust-1.382
nes-1.316
aces-1.195
onel-1.181
Token!"
Token ID2474
Feature activation-2.88e-01

Pos logit contributions
ust+0.298
nes+0.291
vel+0.290
onel+0.268
aces+0.266
Neg logit contributions
 last-0.146
 Ms-0.126
 teach-0.109
 hugs-0.108
 front-0.107
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
 last+0.089
 Ms+0.074
 teach+0.068
 front+0.067
 hugs+0.066
Neg logit contributions
ust-0.142
nes-0.139
vel-0.138
aces-0.128
onel-0.127
Token
Token ID198
Feature activation-8.41e-02

Pos logit contributions
ust+0.032
nes+0.031
vel+0.031
onel+0.029
aces+0.029
Neg logit contributions
 last-0.015
 Ms-0.013
 teach-0.011
 front-0.011
 hugs-0.011
TokenThe
Token ID464
Feature activation+8.59e-01

Pos logit contributions
ust+0.147
nes+0.144
vel+0.142
onel+0.128
aces+0.126
Neg logit contributions
 last-0.128
 Ms-0.112
 teach-0.103
 hugs-0.102
 front-0.102
Token ice
Token ID4771
Feature activation-3.19e-01

Pos logit contributions
 last+1.241
 Ms+1.215
 Mrs+1.044
't+1.034
�+1.034
Neg logit contributions
ust-1.390
nes-1.359
vel-1.338
ode-1.242
onel-1.226
Token was
Token ID373
Feature activation+2.40e-02

Pos logit contributions
ust+0.580
nes+0.564
vel+0.555
onel+0.511
aces+0.506
Neg logit contributions
 last-0.452
 Ms-0.403
 teach-0.365
 front-0.364
 hugs-0.359
Token not
Token ID407
Feature activation-1.69e+00

Pos logit contributions
 last+0.032
 Ms+0.028
 teach+0.026
 hugs+0.025
 front+0.025
Neg logit contributions
ust-0.046
vel-0.044
nes-0.044
onel-0.040
aces-0.040
Token strong
Token ID1913
Feature activation+1.40e-01

Pos logit contributions
ust+3.708
nes+3.676
vel+3.596
onel+3.407
ode+3.357
Neg logit contributions
 last-1.985
 Ms-1.565
 first-1.415
 front-1.405
 while-1.397
Token.
Token ID13
Feature activation-8.33e-01

Pos logit contributions
 last+0.191
 Ms+0.173
 front+0.155
 teach+0.153
 hugs+0.152
Neg logit contributions
ust-0.258
vel-0.250
nes-0.247
aces-0.228
onel-0.227
Token It
Token ID632
Feature activation+5.81e-01

Pos logit contributions
ust+1.509
nes+1.461
vel+1.443
onel+1.317
ode+1.310
Neg logit contributions
 last-1.215
 Ms-1.085
 teach-0.984
 front-0.981
 hugs-0.965
Token cracked
Token ID21368
Feature activation+5.70e-02

Pos logit contributions
 last+0.672
 Ms+0.612
 teach+0.537
 front+0.532
 praise+0.532
Neg logit contributions
ust-1.181
nes-1.133
vel-1.099
onel-1.044
aces-1.026
Token under
Token ID739
Feature activation+2.21e-01

Pos logit contributions
 last+0.065
 Ms+0.057
 teach+0.052
 hugs+0.050
 praise+0.049
Neg logit contributions
ust-0.118
vel-0.114
nes-0.113
aces-0.105
onel-0.105
Token to
Token ID284
Feature activation-9.40e-02

Pos logit contributions
ust+0.963
nes+0.938
vel+0.932
onel+0.869
aces+0.860
Neg logit contributions
 last-0.425
 Ms-0.352
 front-0.303
 teach-0.303
 hugs-0.295
Token build
Token ID1382
Feature activation-1.18e+00

Pos logit contributions
ust+0.184
nes+0.181
vel+0.177
onel+0.164
ode+0.162
Neg logit contributions
 last-0.117
 Ms-0.104
 hugs-0.090
 front-0.090
 teach-0.088
Token a
Token ID257
Feature activation-1.46e+00

Pos logit contributions
ust+2.840
nes+2.801
vel+2.700
onel+2.557
oll+2.542
Neg logit contributions
 last-1.076
 Ms-0.825
 front-0.748
 teach-0.730
 hugs-0.711
Token nest
Token ID16343
Feature activation-4.09e-01

Pos logit contributions
ust+2.639
nes+2.587
vel+2.563
aces+2.432
onel+2.341
Neg logit contributions
 last-2.185
 Ms-1.861
 teach-1.747
 front-1.741
 first-1.678
Token in
Token ID287
Feature activation-1.16e-01

Pos logit contributions
ust+0.825
nes+0.802
vel+0.796
onel+0.736
aces+0.728
Neg logit contributions
 last-0.498
 Ms-0.431
 front-0.383
 teach-0.382
 hugs-0.378
Token a
Token ID257
Feature activation-2.08e+00

Pos logit contributions
ust+0.278
vel+0.271
nes+0.271
onel+0.253
aces+0.250
Neg logit contributions
 last-0.094
 Ms-0.077
 teach-0.063
 hugs-0.062
 front-0.059
Token tree
Token ID5509
Feature activation-1.75e-02

Pos logit contributions
ust+4.118
nes+3.952
vel+3.924
aces+3.869
oll+3.717
Neg logit contributions
 last-3.013
 Ms-2.489
 front-2.480
 teach-2.262
 first-2.254
Token.
Token ID13
Feature activation-8.51e-02

Pos logit contributions
ust+0.037
nes+0.036
vel+0.036
onel+0.033
aces+0.033
Neg logit contributions
 last-0.019
 Ms-0.017
 hugs-0.015
 teach-0.015
 front-0.015
Token The
Token ID383
Feature activation-1.13e+00

Pos logit contributions
ust+0.182
nes+0.177
vel+0.175
aces+0.163
onel+0.162
Neg logit contributions
 last-0.094
 Ms-0.078
 teach-0.069
 hugs-0.069
 front-0.068
Token rob
Token ID3857
Feature activation-1.06e+00

Pos logit contributions
ust+2.099
nes+2.058
vel+2.045
aces+1.841
onel+1.840
Neg logit contributions
 last-1.686
 Ms-1.397
 front-1.334
 teach-1.302
 first-1.284
Tokenin
Token ID259
Feature activation-7.33e-01

Pos logit contributions
ust+1.333
nes+1.308
vel+1.260
onel+1.091
aces+1.089
Neg logit contributions
 last-2.122
 Ms-1.909
 teach-1.792
 hugs-1.791
 front-1.789
Token.
Token ID13
Feature activation+2.89e-01

Pos logit contributions
 last+0.371
 Ms+0.338
 front+0.299
 teach+0.298
 hugs+0.297
Neg logit contributions
ust-0.472
nes-0.452
vel-0.452
onel-0.412
ode-0.407
Token The
Token ID383
Feature activation-1.20e+00

Pos logit contributions
 last+0.247
 Ms+0.191
 hugs+0.164
 teach+0.161
 front+0.156
Neg logit contributions
ust-0.678
vel-0.658
nes-0.656
aces-0.625
onel-0.616
Token bird
Token ID6512
Feature activation-4.13e-01

Pos logit contributions
ust+2.295
nes+2.267
vel+2.262
aces+2.027
onel+2.018
Neg logit contributions
 last-1.733
 Ms-1.398
 front-1.357
 teach-1.310
 first-1.298
Token lived
Token ID5615
Feature activation-3.16e-01

Pos logit contributions
ust+0.777
nes+0.752
vel+0.747
onel+0.687
aces+0.678
Neg logit contributions
 last-0.561
 Ms-0.492
 front-0.444
 teach-0.444
 hugs-0.435
Token in
Token ID287
Feature activation-2.12e-01

Pos logit contributions
ust+0.462
nes+0.442
vel+0.440
onel+0.395
aces+0.387
Neg logit contributions
 last-0.560
 Ms-0.510
 teach-0.472
 front-0.470
 hugs-0.466
Token an
Token ID281
Feature activation-2.64e+00

Pos logit contributions
ust+0.494
nes+0.481
vel+0.481
onel+0.450
aces+0.445
Neg logit contributions
 last-0.187
 Ms-0.154
 teach-0.129
 hugs-0.125
 front-0.124
Token ordinary
Token ID8850
Feature activation-1.83e+00

Pos logit contributions
nes+5.828
ust+5.812
vel+5.738
aces+5.120
onel+5.120
Neg logit contributions
 last-3.364
 front-2.718
 Ms-2.512
 first-2.395
 hugs-2.284
Token tree
Token ID5509
Feature activation+3.89e-01

Pos logit contributions
ust+3.039
nes+2.984
vel+2.978
aces+2.789
oll+2.764
Neg logit contributions
 last-3.110
 front-2.606
 Ms-2.589
 first-2.464
 teach-2.442
Token with
Token ID351
Feature activation-3.31e-01

Pos logit contributions
 last+0.485
 Ms+0.455
 teach+0.393
 hugs+0.389
 praise+0.387
Neg logit contributions
ust-0.755
vel-0.728
nes-0.726
adow-0.671
aces-0.670
Token its
Token ID663
Feature activation-1.22e-01

Pos logit contributions
ust+0.616
nes+0.594
vel+0.592
onel+0.546
aces+0.540
Neg logit contributions
 last-0.456
 Ms-0.401
 teach-0.362
 front-0.359
 hugs-0.353
Token family
Token ID1641
Feature activation-4.00e-01

Pos logit contributions
ust+0.169
nes+0.160
vel+0.158
onel+0.142
aces+0.140
Neg logit contributions
 last-0.223
 Ms-0.207
 teach-0.190
 hugs-0.188
 front-0.188
Token overnight
Token ID13417
Feature activation+3.34e-01

Pos logit contributions
 last+0.212
 Ms+0.186
 teach+0.161
 front+0.160
 hugs+0.160
Neg logit contributions
ust-0.405
nes-0.397
vel-0.392
onel-0.362
aces-0.361
Token.
Token ID13
Feature activation-5.37e-01

Pos logit contributions
 last+0.347
 Ms+0.301
 teach+0.264
 hugs+0.260
 praise+0.256
Neg logit contributions
ust-0.716
vel-0.701
nes-0.698
aces-0.643
onel-0.639
Token
Token ID220
Feature activation-7.01e-01

Pos logit contributions
ust+1.137
nes+1.105
vel+1.099
onel+1.020
aces+1.008
Neg logit contributions
 last-0.611
 Ms-0.519
 front-0.457
 teach-0.456
 hugs-0.447
Token
Token ID198
Feature activation+1.20e-01

Pos logit contributions
ust+1.635
nes+1.592
vel+1.587
onel+1.486
ode+1.472
Neg logit contributions
 last-0.636
 Ms-0.518
 front-0.435
 teach-0.431
 hugs-0.422
Token
Token ID198
Feature activation+6.58e-02

Pos logit contributions
 last+0.129
 Ms+0.105
 teach+0.095
 hugs+0.093
 front+0.092
Neg logit contributions
ust-0.259
nes-0.257
vel-0.251
onel-0.229
aces-0.228
TokenEveryone
Token ID16190
Feature activation-1.70e+00

Pos logit contributions
 last+0.079
 Ms+0.066
 hugs+0.060
 teach+0.059
 front+0.058
Neg logit contributions
ust-0.137
nes-0.135
vel-0.132
onel-0.120
aces-0.118
Token was
Token ID373
Feature activation-7.36e-01

Pos logit contributions
ust+3.888
nes+3.809
vel+3.774
aces+3.593
ode+3.562
Neg logit contributions
 last-1.731
 Ms-1.416
 hugs-1.255
 teach-1.233
 front-1.195
Token so
Token ID523
Feature activation+7.21e-02

Pos logit contributions
ust+1.369
nes+1.322
vel+1.313
onel+1.204
aces+1.194
Neg logit contributions
 last-1.034
 Ms-0.908
 front-0.826
 teach-0.824
 hugs-0.807
Token disappointed
Token ID11679
Feature activation-9.11e-02

Pos logit contributions
 last+0.101
 Ms+0.089
 teach+0.081
 front+0.081
 hugs+0.079
Neg logit contributions
ust-0.134
nes-0.129
vel-0.128
onel-0.118
aces-0.114
Token!
Token ID0
Feature activation-5.59e-01

Pos logit contributions
ust+0.196
nes+0.190
vel+0.188
onel+0.174
aces+0.172
Neg logit contributions
 last-0.099
 Ms-0.085
 front-0.075
 teach-0.074
 hugs-0.073
Token They
Token ID1119
Feature activation-1.04e+00

Pos logit contributions
ust+1.116
nes+1.084
vel+1.078
onel+0.995
aces+0.982
Neg logit contributions
 last-0.698
 Ms-0.603
 front-0.538
 teach-0.537
 hugs-0.529