TOP ACTIVATIONS
MAX = 3.244

clean . The brother was so happy when
searched until she finally found the video . She was so
was so excited to watch the new video . But when
so excited to watch the new video . But when she
took good care of his new tablet so it wouldn 't
Spot happily took the bone and ate it up
smile . After the tablet was fixed , Tim
he had chosen to watch the fun and easy video .
end ! After the video was over , Lu
going off to watch her beloved k itty video . <|endoftext|>
my took good care of his new tablet so it wouldn
they arrived , Tom put the video in the player and
started to g iggle . The water was cold , but
watched . When the video was over , they
She sat down and watched the cartoon . After a while
. He said he loved his mom too . He said
. After the cartoon was over , Lily
car and likes it . The cat jumps on the car
scare her a bit ". The other cat agreed . The
better . When the cartoon finished , her mom

MIN ACTIVATIONS
MIN = -3.325

a little mouse named S que aky . One day ,
his pet mouse , S que aky . One
One sunny day , S que aky came up to Fl
She blinked her eyes and imm ag ined she could see
G iz mo and S que eker , were both small
play ?" S que aky replied , " Yes
, Fl uffy asked S que aky , " Do you
y puppet called P ips que ak . One day P
not touch things that were not his . Tim decided to
. P ips que ak started to regret it
. One day P ips que ak was playing with its
Lily . " It 's an underground tunnel ," said her
Look , mom , isn 't it incredible ?" Mom agreed
Fl uffy and S que aky played for hours until
wins !" S que eker was up for the
in the bathroom and didn 't know what it was .
one more time , and Sil v ie repeated it .
!" " S que ak !" Lily says .
that day forward , Daisy stri ved to be the best
by himself when he heard an enormous noise . He looked

INTERVAL 1.602 - 3.244
CONTAINS 0.402%

, Lily 's mom took the cookies out of the oven
. One day , Jack wanted to go
help people . One day , Jack got a call that
my said no and ran to buy candy . When Tim
to explore . One day , she saw a big jar

INTERVAL -0.040 - 1.602
CONTAINS 52.226%

and said , " Yes please !" L ily
got up and looked under the sofa . She saw the
will watch over it ." Emily was so happy .
was very curious and she wanted to know what was inside
shouted . Mom smiled and carried a small bundle

INTERVAL -1.683 - -0.040
CONTAINS 47.092%

thanked him for coming to help them . The
to share its space with anyone else . One
I understand . You were just excited to meet me ."
have money . Money can buy candy and ice cream and
she always made sure to keep her key close . <|endoftext|>

INTERVAL -3.325 - -1.683
CONTAINS 0.279%

" Look , L ila , I see a wide land
Let 's go take a closer look ," her mom my
, the man went on an unknown adventure . He didn
called Sally . She loved playing outside and jumping in p
mom put the cream on his knee and soon it felt
Token clean
Token ID3424
Feature activation+8.77e-02

Pos logit contributions
 last+0.782
 Ms+0.694
 teach+0.641
 front+0.634
+0.626
Neg logit contributions
ust-1.093
nes-1.051
vel-1.048
onel-0.962
oll-0.951
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
 last+0.098
 Ms+0.084
 teach+0.075
 front+0.074
 hugs+0.072
Neg logit contributions
ust-0.183
nes-0.179
vel-0.178
onel-0.164
aces-0.163
Token
Token ID220
Feature activation-2.08e-01

Pos logit contributions
ust+0.590
nes+0.572
vel+0.570
onel+0.530
aces+0.524
Neg logit contributions
 last-0.298
 Ms-0.246
 teach-0.220
 front-0.217
 hugs-0.211
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
ust+0.466
nes+0.454
vel+0.451
onel+0.420
aces+0.416
Neg logit contributions
 last-0.207
 Ms-0.169
 teach-0.148
 front-0.147
 hugs-0.143
Token
Token ID198
Feature activation-4.81e-01

Pos logit contributions
ust+0.274
nes+0.269
vel+0.267
onel+0.247
oll+0.243
Neg logit contributions
 last-0.126
 Ms-0.102
 teach-0.091
 front-0.090
 hugs-0.088
TokenThe
Token ID464
Feature activation+3.24e+00

Pos logit contributions
ust+1.012
nes+0.985
vel+0.978
onel+0.907
aces+0.896
Neg logit contributions
 last-0.555
 Ms-0.470
 front-0.415
 teach-0.415
 hugs-0.408
Token brother
Token ID3956
Feature activation+1.76e-01

Pos logit contributions
ted+5.321
+4.983
't+4.852
 Ms+4.723
 last+4.714
Neg logit contributions
ust-4.659
vel-4.609
nes-4.233
aky-4.212
oll-4.152
Token was
Token ID373
Feature activation-5.57e-01

Pos logit contributions
 last+0.195
 Ms+0.164
 front+0.147
 teach+0.145
't+0.140
Neg logit contributions
ust-0.369
nes-0.363
vel-0.359
onel-0.331
aces-0.329
Token so
Token ID523
Feature activation+2.03e-01

Pos logit contributions
ust+0.916
nes+0.883
vel+0.875
onel+0.794
aces+0.783
Neg logit contributions
 last-0.896
 Ms-0.799
 front-0.736
 teach-0.735
 hugs-0.726
Token happy
Token ID3772
Feature activation+5.57e-02

Pos logit contributions
 last+0.308
 Ms+0.274
 front+0.252
 teach+0.251
 hugs+0.244
Neg logit contributions
ust-0.356
nes-0.341
vel-0.339
onel-0.312
oll-0.303
Token when
Token ID618
Feature activation+8.99e-01

Pos logit contributions
 last+0.061
 Ms+0.052
 front+0.046
 teach+0.045
 hugs+0.044
Neg logit contributions
ust-0.118
vel-0.115
nes-0.115
onel-0.106
aces-0.105
Token searched
Token ID16499
Feature activation+9.00e-01

Pos logit contributions
ust+1.784
nes+1.708
vel+1.703
aces+1.551
onel+1.549
Neg logit contributions
 last-1.659
 Ms-1.453
 front-1.343
 teach-1.326
 hugs-1.315
Token until
Token ID1566
Feature activation+1.49e-01

Pos logit contributions
 last+1.095
 Ms+1.049
 teach+0.952
+0.927
't+0.898
Neg logit contributions
ust-1.685
vel-1.637
nes-1.602
onel-1.492
aces-1.492
Token she
Token ID673
Feature activation-4.08e-01

Pos logit contributions
 last+0.152
 Ms+0.131
 teach+0.118
 hugs+0.115
 praise+0.112
Neg logit contributions
ust-0.321
nes-0.318
vel-0.312
onel-0.293
aces-0.285
Token finally
Token ID3443
Feature activation-1.18e-01

Pos logit contributions
ust+0.717
nes+0.694
vel+0.687
onel+0.629
aces+0.618
Neg logit contributions
 last-0.607
 Ms-0.537
 teach-0.491
 front-0.490
 hugs-0.484
Token found
Token ID1043
Feature activation+8.19e-02

Pos logit contributions
ust+0.184
nes+0.177
vel+0.175
onel+0.158
aces+0.154
Neg logit contributions
 last-0.199
 Ms-0.182
 teach-0.167
 front-0.166
 hugs-0.165
Token the
Token ID262
Feature activation+3.24e+00

Pos logit contributions
 last+0.083
 Ms+0.071
 teach+0.062
 hugs+0.060
 front+0.060
Neg logit contributions
ust-0.177
vel-0.174
nes-0.174
onel-0.162
aces-0.161
Token video
Token ID2008
Feature activation-7.71e-01

Pos logit contributions
+4.660
 Ms+4.530
 last+4.498
ted+4.493
ited+4.247
Neg logit contributions
vel-4.868
ust-4.844
nes-4.599
onel-4.281
aces-4.053
Token.
Token ID13
Feature activation+4.54e-01

Pos logit contributions
ust+1.705
nes+1.653
vel+1.642
onel+1.530
ode+1.513
Neg logit contributions
 last-0.815
 Ms-0.674
 front-0.590
 teach-0.587
 hugs-0.574
Token She
Token ID1375
Feature activation-1.91e-01

Pos logit contributions
 last+0.652
 Ms+0.536
 teach+0.512
 front+0.498
 praise+0.495
Neg logit contributions
ust-0.816
nes-0.792
vel-0.784
aces-0.722
onel-0.717
Token was
Token ID373
Feature activation-7.19e-01

Pos logit contributions
ust+0.399
nes+0.391
vel+0.387
onel+0.358
aces+0.354
Neg logit contributions
 last-0.217
 Ms-0.184
 front-0.162
 teach-0.162
 hugs-0.156
Token so
Token ID523
Feature activation-3.59e-01

Pos logit contributions
ust+1.179
nes+1.135
vel+1.123
onel+1.016
aces+1.007
Neg logit contributions
 last-1.172
 Ms-1.045
 front-0.969
 teach-0.961
 hugs-0.951
Token was
Token ID373
Feature activation-3.34e-01

Pos logit contributions
ust+0.428
nes+0.417
vel+0.413
onel+0.380
aces+0.379
Neg logit contributions
 last-0.248
 Ms-0.212
 front-0.189
 teach-0.187
 hugs-0.182
Token so
Token ID523
Feature activation-2.41e-01

Pos logit contributions
ust+0.584
nes+0.565
vel+0.561
onel+0.511
aces+0.505
Neg logit contributions
 last-0.503
 Ms-0.443
 teach-0.405
 front-0.404
 hugs-0.400
Token excited
Token ID6568
Feature activation+6.11e-01

Pos logit contributions
ust+0.461
nes+0.444
vel+0.442
onel+0.407
oll+0.399
Neg logit contributions
 last-0.325
 Ms-0.284
 front-0.258
 teach-0.256
 hugs-0.251
Token to
Token ID284
Feature activation+3.56e-01

Pos logit contributions
 last+0.711
 Ms+0.624
 front+0.568
 teach+0.559
+0.557
Neg logit contributions
ust-1.247
nes-1.197
vel-1.196
aces-1.093
oll-1.089
Token watch
Token ID2342
Feature activation+1.38e-01

Pos logit contributions
 last+0.374
 Ms+0.335
+0.269
 hugs+0.268
 Mrs+0.267
Neg logit contributions
ust-0.746
nes-0.745
vel-0.734
onel-0.672
ode-0.670
Token the
Token ID262
Feature activation+3.18e+00

Pos logit contributions
 last+0.130
 Ms+0.103
+0.088
 front+0.088
 hugs+0.088
Neg logit contributions
ust-0.309
vel-0.308
nes-0.305
aces-0.282
onel-0.282
Token new
Token ID649
Feature activation+3.12e+00

Pos logit contributions
 last+4.828
+4.554
 Ms+4.229
 praise+4.032
ited+3.998
Neg logit contributions
vel-5.040
ust-4.608
nes-4.575
aund-4.226
onel-4.036
Token video
Token ID2008
Feature activation-3.45e-01

Pos logit contributions
 last+4.904
+4.835
 Ms+4.198
ited+4.095
 praise+3.937
Neg logit contributions
vel-4.977
nes-4.624
ust-4.354
aund-4.097
od-3.997
Token.
Token ID13
Feature activation+5.16e-01

Pos logit contributions
ust+0.741
nes+0.723
vel+0.719
onel+0.667
aces+0.661
Neg logit contributions
 last-0.374
 Ms-0.315
 front-0.275
 teach-0.274
 hugs-0.270
Token But
Token ID887
Feature activation-7.52e-02

Pos logit contributions
 last+0.735
 Ms+0.611
 teach+0.573
 praise+0.560
 front+0.559
Neg logit contributions
ust-0.930
nes-0.899
vel-0.882
aces-0.833
onel-0.805
Token when
Token ID618
Feature activation+7.36e-01

Pos logit contributions
ust+0.176
nes+0.172
vel+0.171
onel+0.159
aces+0.158
Neg logit contributions
 last-0.066
 Ms-0.054
 teach-0.046
 hugs-0.045
 front-0.045
Token so
Token ID523
Feature activation-2.41e-01

Pos logit contributions
ust+0.584
nes+0.565
vel+0.561
onel+0.511
aces+0.505
Neg logit contributions
 last-0.503
 Ms-0.443
 teach-0.405
 front-0.404
 hugs-0.400
Token excited
Token ID6568
Feature activation+6.11e-01

Pos logit contributions
ust+0.461
nes+0.444
vel+0.442
onel+0.407
oll+0.399
Neg logit contributions
 last-0.325
 Ms-0.284
 front-0.258
 teach-0.256
 hugs-0.251
Token to
Token ID284
Feature activation+3.56e-01

Pos logit contributions
 last+0.711
 Ms+0.624
 front+0.568
 teach+0.559
+0.557
Neg logit contributions
ust-1.247
nes-1.197
vel-1.196
aces-1.093
oll-1.089
Token watch
Token ID2342
Feature activation+1.38e-01

Pos logit contributions
 last+0.374
 Ms+0.335
+0.269
 hugs+0.268
 Mrs+0.267
Neg logit contributions
ust-0.746
nes-0.745
vel-0.734
onel-0.672
ode-0.670
Token the
Token ID262
Feature activation+3.18e+00

Pos logit contributions
 last+0.130
 Ms+0.103
+0.088
 front+0.088
 hugs+0.088
Neg logit contributions
ust-0.309
vel-0.308
nes-0.305
aces-0.282
onel-0.282
Token new
Token ID649
Feature activation+3.12e+00

Pos logit contributions
 last+4.828
+4.554
 Ms+4.229
 praise+4.032
ited+3.998
Neg logit contributions
vel-5.040
ust-4.608
nes-4.575
aund-4.226
onel-4.036
Token video
Token ID2008
Feature activation-3.45e-01

Pos logit contributions
 last+4.904
+4.835
 Ms+4.198
ited+4.095
 praise+3.937
Neg logit contributions
vel-4.977
nes-4.624
ust-4.354
aund-4.097
od-3.997
Token.
Token ID13
Feature activation+5.16e-01

Pos logit contributions
ust+0.741
nes+0.723
vel+0.719
onel+0.667
aces+0.661
Neg logit contributions
 last-0.374
 Ms-0.315
 front-0.275
 teach-0.274
 hugs-0.270
Token But
Token ID887
Feature activation-7.52e-02

Pos logit contributions
 last+0.735
 Ms+0.611
 teach+0.573
 praise+0.560
 front+0.559
Neg logit contributions
ust-0.930
nes-0.899
vel-0.882
aces-0.833
onel-0.805
Token when
Token ID618
Feature activation+7.36e-01

Pos logit contributions
ust+0.176
nes+0.172
vel+0.171
onel+0.159
aces+0.158
Neg logit contributions
 last-0.066
 Ms-0.054
 teach-0.046
 hugs-0.045
 front-0.045
Token she
Token ID673
Feature activation-1.70e-01

Pos logit contributions
 last+0.858
 Ms+0.725
 teach+0.677
 praise+0.675
 hugs+0.664
Neg logit contributions
ust-1.432
nes-1.424
vel-1.392
onel-1.275
aund-1.262
Token took
Token ID1718
Feature activation-7.00e-01

Pos logit contributions
ust+0.749
nes+0.727
vel+0.722
onel+0.665
aces+0.658
Neg logit contributions
 last-0.511
 Ms-0.446
 front-0.401
 teach-0.399
 hugs-0.392
Token good
Token ID922
Feature activation+2.37e-01

Pos logit contributions
ust+1.510
nes+1.464
vel+1.447
onel+1.351
ode+1.335
Neg logit contributions
 last-0.789
 Ms-0.661
 front-0.586
 teach-0.585
 hugs-0.580
Token care
Token ID1337
Feature activation+2.34e-01

Pos logit contributions
 last+0.242
 Ms+0.213
+0.179
 front+0.176
't+0.174
Neg logit contributions
ust-0.511
vel-0.507
nes-0.497
onel-0.463
aces-0.463
Token of
Token ID286
Feature activation+1.74e-02

Pos logit contributions
 last+0.253
 Ms+0.229
 teach+0.194
 front+0.193
+0.190
Neg logit contributions
ust-0.495
vel-0.485
nes-0.482
onel-0.451
aces-0.446
Token his
Token ID465
Feature activation+2.98e+00

Pos logit contributions
 last+0.025
 Ms+0.022
 front+0.020
+0.019
 hugs+0.019
Neg logit contributions
ust-0.031
vel-0.030
nes-0.030
onel-0.028
aces-0.027
Token new
Token ID649
Feature activation+3.09e+00

Pos logit contributions
 last+4.580
+4.294
 Ms+4.276
't+4.145
glers+3.937
Neg logit contributions
vel-4.480
ust-4.402
nes-4.012
aces-3.932
onel-3.780
Token tablet
Token ID14147
Feature activation-1.94e-01

Pos logit contributions
 last+5.580
+5.355
 Ms+5.120
't+4.888
ited+4.819
Neg logit contributions
vel-4.153
ust-3.853
nes-3.632
aces-3.418
ying-3.133
Token so
Token ID523
Feature activation-4.36e-02

Pos logit contributions
ust+0.408
nes+0.397
vel+0.395
onel+0.368
aces+0.365
Neg logit contributions
 last-0.214
 Ms-0.184
 front-0.161
 teach-0.160
 hugs-0.157
Token it
Token ID340
Feature activation-5.21e-01

Pos logit contributions
ust+0.084
nes+0.081
vel+0.081
onel+0.075
oll+0.073
Neg logit contributions
 last-0.057
 Ms-0.051
 front-0.046
 teach-0.045
 hugs-0.044
Token wouldn
Token ID3636
Feature activation+1.33e+00

Pos logit contributions
ust+0.921
nes+0.891
vel+0.885
onel+0.807
aces+0.796
Neg logit contributions
 last-0.771
 Ms-0.680
 teach-0.621
 front-0.620
 hugs-0.612
Token't
Token ID470
Feature activation-2.53e-01

Pos logit contributions
 last+0.746
 Ms+0.634
 hugs+0.502
 praise+0.449
 front+0.388
Neg logit contributions
nes-3.509
ust-3.460
vel-3.391
onel-3.245
aces-3.116
Token
Token ID198
Feature activation-4.12e-01

Pos logit contributions
ust+0.843
nes+0.822
vel+0.816
onel+0.760
aces+0.754
Neg logit contributions
 last-0.387
 Ms-0.317
 front-0.277
 teach-0.276
 hugs-0.268
Token
Token ID198
Feature activation-6.08e-01

Pos logit contributions
ust+0.880
nes+0.858
vel+0.851
onel+0.790
aces+0.781
Neg logit contributions
 last-0.457
 Ms-0.384
 front-0.338
 teach-0.338
 hugs-0.330
TokenSpot
Token ID32565
Feature activation+3.72e-01

Pos logit contributions
ust+1.078
nes+1.042
vel+1.035
onel+0.949
aces+0.937
Neg logit contributions
 last-0.900
 Ms-0.798
 teach-0.727
 front-0.726
 hugs-0.715
Token happily
Token ID18177
Feature activation-9.51e-02

Pos logit contributions
 last+0.440
 Ms+0.370
 front+0.338
 teach+0.331
't+0.328
Neg logit contributions
ust-0.745
nes-0.734
vel-0.724
aces-0.668
onel-0.665
Token took
Token ID1718
Feature activation+2.19e-01

Pos logit contributions
ust+0.186
nes+0.182
vel+0.180
onel+0.166
oll+0.164
Neg logit contributions
 last-0.120
 Ms-0.105
 front-0.094
 teach-0.093
't-0.092
Token the
Token ID262
Feature activation+3.08e+00

Pos logit contributions
 last+0.220
 Ms+0.184
 teach+0.162
 front+0.160
't+0.156
Neg logit contributions
ust-0.474
nes-0.466
vel-0.465
onel-0.432
oll-0.431
Token bone
Token ID9970
Feature activation-2.05e-01

Pos logit contributions
 Ms+4.406
 last+4.221
glers+3.947
 praise+3.854
 front+3.725
Neg logit contributions
vel-4.820
oll-4.455
ode-4.201
ust-4.174
ery-4.101
Token and
Token ID290
Feature activation+3.87e-01

Pos logit contributions
ust+0.414
nes+0.402
vel+0.400
onel+0.370
aces+0.365
Neg logit contributions
 last-0.244
 Ms-0.213
 teach-0.189
 front-0.187
 hugs-0.185
Token ate
Token ID15063
Feature activation+1.37e-01

Pos logit contributions
 last+0.491
 Ms+0.422
 front+0.390
't+0.385
 teach+0.384
Neg logit contributions
ust-0.722
nes-0.721
vel-0.706
onel-0.654
oll-0.653
Token it
Token ID340
Feature activation-1.17e-01

Pos logit contributions
 last+0.203
 Ms+0.180
 teach+0.168
 front+0.166
't+0.161
Neg logit contributions
ust-0.232
vel-0.228
nes-0.224
onel-0.204
oll-0.202
Token up
Token ID510
Feature activation+1.88e-01

Pos logit contributions
ust+0.216
nes+0.210
vel+0.209
onel+0.191
aces+0.190
Neg logit contributions
 last-0.159
 Ms-0.142
 teach-0.129
 front-0.128
 hugs-0.125
Token smile
Token ID8212
Feature activation+5.31e-01

Pos logit contributions
 last+0.338
 Ms+0.314
 front+0.278
+0.277
't+0.274
Neg logit contributions
ust-0.389
vel-0.376
nes-0.373
onel-0.341
aces-0.336
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
 last+0.675
 Ms+0.610
 front+0.536
+0.533
 praise+0.530
Neg logit contributions
ust-1.024
vel-1.002
nes-0.972
onel-0.910
aces-0.900
Token
Token ID198
Feature activation+5.50e-02

Pos logit contributions
 last+0.316
 Ms+0.260
 teach+0.242
 front+0.239
 praise+0.225
Neg logit contributions
ust-0.543
nes-0.523
vel-0.518
onel-0.479
oll-0.477
Token
Token ID198
Feature activation+5.88e-01

Pos logit contributions
 last+0.079
 Ms+0.069
 teach+0.064
 front+0.063
 hugs+0.062
Neg logit contributions
ust-0.098
nes-0.096
vel-0.095
onel-0.086
oll-0.085
TokenAfter
Token ID3260
Feature activation-5.34e-02

Pos logit contributions
 last+0.734
 Ms+0.638
 hugs+0.579
 teach+0.568
 front+0.567
Neg logit contributions
ust-1.202
vel-1.148
nes-1.140
onel-1.026
ode-1.014
Token the
Token ID262
Feature activation+3.05e+00

Pos logit contributions
ust+0.109
vel+0.108
nes+0.107
onel+0.099
oll+0.098
Neg logit contributions
 last-0.061
 Ms-0.052
 front-0.046
 teach-0.046
 hugs-0.045
Token tablet
Token ID14147
Feature activation-5.38e-01

Pos logit contributions
+4.627
 Ms+4.613
 last+4.525
etchup+4.128
ted+4.092
Neg logit contributions
ust-4.464
vel-4.408
nes-4.222
aund-3.987
oll-3.965
Token was
Token ID373
Feature activation-4.12e-01

Pos logit contributions
ust+1.167
nes+1.135
vel+1.128
onel+1.049
aces+1.038
Neg logit contributions
 last-0.578
 Ms-0.484
 teach-0.423
 front-0.423
 hugs-0.414
Token fixed
Token ID5969
Feature activation+4.29e-02

Pos logit contributions
ust+0.737
nes+0.713
vel+0.708
onel+0.647
aces+0.638
Neg logit contributions
 last-0.601
 Ms-0.530
 teach-0.483
 front-0.483
 hugs-0.476
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
 last+0.040
 Ms+0.034
 front+0.029
 teach+0.029
 hugs+0.028
Neg logit contributions
ust-0.098
vel-0.095
nes-0.095
onel-0.089
aces-0.088
Token Tim
Token ID5045
Feature activation+8.81e-02

Pos logit contributions
 last+0.312
 Ms+0.272
 hugs+0.249
 teach+0.247
 front+0.245
Neg logit contributions
ust-0.495
nes-0.470
vel-0.468
onel-0.432
ode-0.426
Token he
Token ID339
Feature activation+4.51e-01

Pos logit contributions
ust+0.376
nes+0.365
vel+0.363
onel+0.338
aces+0.334
Neg logit contributions
 last-0.184
 Ms-0.154
 front-0.136
 teach-0.134
 hugs-0.134
Token had
Token ID550
Feature activation-4.23e-01

Pos logit contributions
 last+0.658
 Ms+0.589
 front+0.545
 teach+0.526
 praise+0.522
Neg logit contributions
nes-0.792
ust-0.792
vel-0.760
onel-0.699
aces-0.688
Token chosen
Token ID7147
Feature activation+2.60e-01

Pos logit contributions
ust+0.898
nes+0.875
vel+0.868
onel+0.805
aces+0.796
Neg logit contributions
 last-0.479
 Ms-0.406
 front-0.357
 teach-0.356
 hugs-0.350
Token to
Token ID284
Feature activation+2.99e-01

Pos logit contributions
 last+0.262
 Ms+0.220
+0.192
 front+0.189
 teach+0.186
Neg logit contributions
ust-0.567
nes-0.553
vel-0.549
onel-0.509
aces-0.507
Token watch
Token ID2342
Feature activation+3.80e-01

Pos logit contributions
 last+0.360
 Ms+0.326
 hugs+0.279
 front+0.276
+0.273
Neg logit contributions
ust-0.588
nes-0.582
vel-0.571
onel-0.520
ode-0.515
Token the
Token ID262
Feature activation+3.00e+00

Pos logit contributions
 last+0.401
 Ms+0.335
+0.296
 hugs+0.293
 front+0.293
Neg logit contributions
ust-0.802
vel-0.793
nes-0.782
onel-0.724
aces-0.722
Token fun
Token ID1257
Feature activation+5.48e-01

Pos logit contributions
 last+4.400
 Ms+4.038
+3.770
 hugs+3.738
 front+3.674
Neg logit contributions
vel-4.708
ust-4.666
nes-4.399
onel-4.100
aund-4.041
Token and
Token ID290
Feature activation+4.27e-01

Pos logit contributions
 last+0.685
 Ms+0.590
 front+0.543
 hugs+0.524
+0.523
Neg logit contributions
ust-1.057
nes-1.039
vel-1.033
onel-0.935
aces-0.923
Token easy
Token ID2562
Feature activation+1.31e+00

Pos logit contributions
 last+0.494
 Ms+0.439
 front+0.389
+0.386
't+0.367
Neg logit contributions
ust-0.863
nes-0.848
vel-0.827
onel-0.770
aces-0.759
Token video
Token ID2008
Feature activation-2.51e-01

Pos logit contributions
 last+1.775
 Ms+1.569
 front+1.458
+1.421
 hugs+1.385
Neg logit contributions
vel-2.342
ust-2.302
nes-2.285
onel-2.048
aces-2.036
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
ust+0.531
nes+0.518
vel+0.514
onel+0.477
aces+0.473
Neg logit contributions
 last-0.279
 Ms-0.238
 front-0.209
 teach-0.207
 hugs-0.204
Token end
Token ID886
Feature activation-2.56e-02

Pos logit contributions
 last+0.435
 Ms+0.396
 hugs+0.358
 front+0.342
+0.342
Neg logit contributions
ust-0.647
nes-0.638
vel-0.622
onel-0.566
ode-0.557
Token!
Token ID0
Feature activation-6.68e-01

Pos logit contributions
ust+0.055
nes+0.053
vel+0.053
onel+0.049
aces+0.049
Neg logit contributions
 last-0.028
 Ms-0.023
 teach-0.020
 front-0.020
 hugs-0.020
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
ust+1.552
nes+1.511
vel+1.506
onel+1.407
ode+1.393
Neg logit contributions
 last-0.627
 Ms-0.514
 front-0.436
 teach-0.431
 hugs-0.428
Token
Token ID198
Feature activation-6.71e-01

Pos logit contributions
ust+0.232
nes+0.227
vel+0.225
onel+0.206
oll+0.203
Neg logit contributions
 last-0.148
 Ms-0.128
 teach-0.115
 front-0.114
 hugs-0.111
TokenAfter
Token ID3260
Feature activation+1.61e-01

Pos logit contributions
ust+1.377
nes+1.337
vel+1.329
onel+1.235
aces+1.226
Neg logit contributions
 last-0.807
 Ms-0.689
 front-0.615
 teach-0.611
 hugs-0.600
Token the
Token ID262
Feature activation+2.99e+00

Pos logit contributions
 last+0.174
 Ms+0.146
 teach+0.127
 front+0.127
 hugs+0.124
Neg logit contributions
ust-0.334
vel-0.333
nes-0.331
onel-0.307
oll-0.304
Token video
Token ID2008
Feature activation-2.81e-01

Pos logit contributions
 last+4.327
 Ms+4.216
+3.984
ted+3.896
 Mrs+3.764
Neg logit contributions
vel-4.580
nes-4.396
ust-4.294
aund-4.069
onel-3.944
Token was
Token ID373
Feature activation-2.57e-01

Pos logit contributions
ust+0.619
nes+0.606
vel+0.600
onel+0.561
aces+0.554
Neg logit contributions
 last-0.283
 Ms-0.236
 front-0.206
 teach-0.204
 hugs-0.200
Token over
Token ID625
Feature activation+1.56e+00

Pos logit contributions
ust+0.443
nes+0.428
vel+0.424
onel+0.386
aces+0.380
Neg logit contributions
 last-0.389
 Ms-0.347
 teach-0.319
 front-0.317
 hugs-0.314
Token,
Token ID11
Feature activation+1.06e-01

Pos logit contributions
 last+1.683
 Ms+1.538
 teach+1.435
 praise+1.316
 hugs+1.301
Neg logit contributions
ust-3.055
vel-3.034
nes-2.949
onel-2.809
adow-2.803
Token Lu
Token ID6026
Feature activation+2.78e-01

Pos logit contributions
 last+0.151
 Ms+0.132
 teach+0.124
 front+0.121
 hugs+0.120
Neg logit contributions
ust-0.190
nes-0.185
vel-0.181
onel-0.166
aces-0.163
Token going
Token ID1016
Feature activation-3.83e-01

Pos logit contributions
 last+1.065
 Ms+0.897
 teach+0.825
 hugs+0.803
 praise+0.790
Neg logit contributions
vel-1.709
ust-1.704
nes-1.663
onel-1.520
aces-1.507
Token off
Token ID572
Feature activation+5.35e-01

Pos logit contributions
ust+0.712
nes+0.692
vel+0.686
onel+0.630
aces+0.622
Neg logit contributions
 last-0.529
 Ms-0.467
 teach-0.419
 front-0.418
 hugs-0.412
Token to
Token ID284
Feature activation+1.52e-01

Pos logit contributions
 last+0.632
 Ms+0.549
 teach+0.487
 front+0.474
 praise+0.468
Neg logit contributions
vel-1.053
nes-1.053
ust-1.052
onel-0.965
aces-0.958
Token watch
Token ID2342
Feature activation+2.98e-01

Pos logit contributions
 last+0.158
 Ms+0.135
 front+0.113
+0.112
 hugs+0.112
Neg logit contributions
ust-0.323
nes-0.323
vel-0.318
onel-0.296
aces-0.288
Token her
Token ID607
Feature activation+2.39e+00

Pos logit contributions
 last+0.366
 Ms+0.304
 front+0.275
 hugs+0.273
 teach+0.272
Neg logit contributions
ust-0.579
vel-0.576
nes-0.572
onel-0.520
aces-0.520
Token beloved
Token ID14142
Feature activation+2.98e+00

Pos logit contributions
 last+3.271
 Ms+2.899
+2.796
't+2.723
 front+2.684
Neg logit contributions
ust-4.201
nes-4.037
vel-3.985
onel-3.685
oll-3.549
Token k
Token ID479
Feature activation+1.84e+00

Pos logit contributions
 last+4.632
 Ms+3.949
 praise+3.746
Grab+3.739
ited+3.737
Neg logit contributions
vel-4.487
nes-4.290
ust-4.215
onel-3.975
aund-3.775
Tokenitty
Token ID9760
Feature activation+1.08e+00

Pos logit contributions
 last+2.958
 Ms+2.561
 praise+2.355
 teach+2.299
 hugs+2.299
Neg logit contributions
nes-3.227
ust-3.224
vel-3.222
onel-2.850
oll-2.771
Token video
Token ID2008
Feature activation-3.32e-01

Pos logit contributions
 last+1.359
 Ms+1.206
 front+1.079
't+1.054
+1.054
Neg logit contributions
vel-2.018
ust-2.008
nes-1.962
onel-1.843
aces-1.803
Token.
Token ID13
Feature activation+7.83e-01

Pos logit contributions
ust+0.715
nes+0.698
vel+0.693
onel+0.644
aces+0.638
Neg logit contributions
 last-0.356
 Ms-0.299
 front-0.262
 teach-0.260
 hugs-0.256
Token<|endoftext|>
Token ID50256
Feature activation+5.79e-02

Pos logit contributions
 last+0.994
 Ms+0.773
 teach+0.760
 front+0.745
 first+0.716
Neg logit contributions
ust-1.530
nes-1.492
vel-1.462
oll-1.339
onel-1.338
Tokenmy
Token ID1820
Feature activation-3.90e-01

Pos logit contributions
ust+1.665
nes+1.632
vel+1.619
onel+1.522
aces+1.504
Neg logit contributions
 last-0.532
 Ms-0.408
 teach-0.332
 front-0.331
 hugs-0.324
Token took
Token ID1718
Feature activation-7.00e-01

Pos logit contributions
ust+0.749
nes+0.727
vel+0.722
onel+0.665
aces+0.658
Neg logit contributions
 last-0.511
 Ms-0.446
 front-0.401
 teach-0.399
 hugs-0.392
Token good
Token ID922
Feature activation+2.37e-01

Pos logit contributions
ust+1.510
nes+1.464
vel+1.447
onel+1.351
ode+1.335
Neg logit contributions
 last-0.789
 Ms-0.661
 front-0.586
 teach-0.585
 hugs-0.580
Token care
Token ID1337
Feature activation+2.34e-01

Pos logit contributions
 last+0.242
 Ms+0.213
+0.179
 front+0.176
't+0.174
Neg logit contributions
ust-0.511
vel-0.507
nes-0.497
onel-0.463
aces-0.463
Token of
Token ID286
Feature activation+1.74e-02

Pos logit contributions
 last+0.253
 Ms+0.229
 teach+0.194
 front+0.193
+0.190
Neg logit contributions
ust-0.495
vel-0.485
nes-0.482
onel-0.451
aces-0.446
Token his
Token ID465
Feature activation+2.98e+00

Pos logit contributions
 last+0.025
 Ms+0.022
 front+0.020
+0.019
 hugs+0.019
Neg logit contributions
ust-0.031
vel-0.030
nes-0.030
onel-0.028
aces-0.027
Token new
Token ID649
Feature activation+3.09e+00

Pos logit contributions
 last+4.580
+4.294
 Ms+4.276
't+4.145
glers+3.937
Neg logit contributions
vel-4.480
ust-4.402
nes-4.012
aces-3.932
onel-3.780
Token tablet
Token ID14147
Feature activation-1.94e-01

Pos logit contributions
 last+5.580
+5.355
 Ms+5.120
't+4.888
ited+4.819
Neg logit contributions
vel-4.153
ust-3.853
nes-3.632
aces-3.418
ying-3.133
Token so
Token ID523
Feature activation-4.36e-02

Pos logit contributions
ust+0.408
nes+0.397
vel+0.395
onel+0.368
aces+0.365
Neg logit contributions
 last-0.214
 Ms-0.184
 front-0.161
 teach-0.160
 hugs-0.157
Token it
Token ID340
Feature activation-5.21e-01

Pos logit contributions
ust+0.084
nes+0.081
vel+0.081
onel+0.075
oll+0.073
Neg logit contributions
 last-0.057
 Ms-0.051
 front-0.046
 teach-0.045
 hugs-0.044
Token wouldn
Token ID3636
Feature activation+1.33e+00

Pos logit contributions
ust+0.921
nes+0.891
vel+0.885
onel+0.807
aces+0.796
Neg logit contributions
 last-0.771
 Ms-0.680
 teach-0.621
 front-0.620
 hugs-0.612
Token they
Token ID484
Feature activation-8.65e-01

Pos logit contributions
 last+0.206
 Ms+0.177
 teach+0.159
 hugs+0.156
 front+0.154
Neg logit contributions
ust-0.365
nes-0.362
vel-0.353
onel-0.323
oll-0.319
Token arrived
Token ID5284
Feature activation-1.21e+00

Pos logit contributions
ust+1.766
vel+1.715
nes+1.705
aces+1.581
onel+1.578
Neg logit contributions
 last-1.082
 Ms-0.922
 front-0.819
 teach-0.818
 hugs-0.807
Token,
Token ID11
Feature activation+4.58e-01

Pos logit contributions
ust+2.868
nes+2.785
vel+2.774
ode+2.608
aces+2.597
Neg logit contributions
 last-1.148
 Ms-0.908
 front-0.799
 hugs-0.780
 teach-0.757
Token Tom
Token ID4186
Feature activation+3.50e-01

Pos logit contributions
 last+0.552
 Ms+0.466
 teach+0.426
 hugs+0.424
 front+0.423
Neg logit contributions
ust-0.914
nes-0.892
vel-0.868
onel-0.793
aund-0.789
Token put
Token ID1234
Feature activation-2.72e-01

Pos logit contributions
 last+0.437
 Ms+0.372
 front+0.339
 teach+0.335
+0.329
Neg logit contributions
ust-0.679
nes-0.673
vel-0.659
onel-0.606
aces-0.602
Token the
Token ID262
Feature activation+2.97e+00

Pos logit contributions
ust+0.588
nes+0.574
vel+0.572
onel+0.532
aces+0.529
Neg logit contributions
 last-0.290
 Ms-0.245
 teach-0.215
 front-0.211
 hugs-0.207
Token video
Token ID2008
Feature activation-1.05e+00

Pos logit contributions
 Ms+4.311
 last+4.310
 Mrs+3.823
+3.800
 Together+3.746
Neg logit contributions
vel-4.560
nes-4.362
ust-4.177
onel-4.005
aund-3.945
Token in
Token ID287
Feature activation+1.00e+00

Pos logit contributions
ust+1.889
nes+1.811
vel+1.806
onel+1.642
ode+1.627
Neg logit contributions
 last-1.563
 Ms-1.330
 front-1.247
 hugs-1.228
 teach-1.226
Token the
Token ID262
Feature activation+7.48e-01

Pos logit contributions
 last+1.114
 Ms+0.985
 teach+0.882
 hugs+0.863
't+0.856
Neg logit contributions
ust-1.967
vel-1.962
nes-1.944
aces-1.767
onel-1.733
Token player
Token ID2137
Feature activation-6.34e-01

Pos logit contributions
 last+1.145
 Ms+1.052
't+0.967
 teach+0.936
 praise+0.916
Neg logit contributions
ust-1.162
nes-1.154
vel-1.139
onel-1.048
ode-1.031
Token and
Token ID290
Feature activation-5.29e-03

Pos logit contributions
ust+1.209
nes+1.168
vel+1.159
onel+1.068
aces+1.051
Neg logit contributions
 last-0.851
 Ms-0.738
 front-0.670
 teach-0.669
 hugs-0.658
Token started
Token ID2067
Feature activation-2.01e-02

Pos logit contributions
ust+0.369
nes+0.361
vel+0.358
onel+0.330
aces+0.326
Neg logit contributions
 last-0.224
 Ms-0.192
 front-0.172
 teach-0.172
 hugs-0.166
Token to
Token ID284
Feature activation+5.51e-01

Pos logit contributions
ust+0.042
nes+0.042
vel+0.042
onel+0.038
aces+0.037
Neg logit contributions
 last-0.022
 Ms-0.019
 teach-0.016
 front-0.016
 hugs-0.016
Token g
Token ID308
Feature activation-1.27e+00

Pos logit contributions
 last+0.725
 Ms+0.645
 front+0.559
+0.557
't+0.556
Neg logit contributions
ust-1.020
nes-1.009
vel-1.005
ode-0.907
onel-0.896
Tokeniggle
Token ID24082
Feature activation-5.50e-01

Pos logit contributions
ust+1.771
nes+1.738
vel+1.690
ode+1.549
aces+1.527
Neg logit contributions
 last-2.290
 Ms-2.051
 teach-1.939
 hugs-1.933
 front-1.931
Token.
Token ID13
Feature activation+8.29e-02

Pos logit contributions
ust+1.207
nes+1.175
vel+1.166
onel+1.087
aces+1.075
Neg logit contributions
 last-0.578
 Ms-0.482
 front-0.420
 teach-0.420
 hugs-0.411
Token The
Token ID383
Feature activation+2.95e+00

Pos logit contributions
 last+0.104
 Ms+0.085
 teach+0.080
 front+0.078
 hugs+0.075
Neg logit contributions
ust-0.165
nes-0.160
vel-0.159
onel-0.147
oll-0.145
Token water
Token ID1660
Feature activation+6.09e-01

Pos logit contributions
+4.417
ted+4.392
 last+4.267
't+4.249
 Ms+4.234
Neg logit contributions
ust-4.349
vel-4.148
nes-4.145
onel-3.830
aky-3.820
Token was
Token ID373
Feature activation+3.74e-01

Pos logit contributions
 last+0.747
 Ms+0.669
 teach+0.598
 front+0.586
+0.572
Neg logit contributions
ust-1.175
nes-1.167
vel-1.132
onel-1.067
aces-1.042
Token cold
Token ID4692
Feature activation-6.87e-05

Pos logit contributions
 last+0.532
 Ms+0.481
 teach+0.443
 hugs+0.435
+0.430
Neg logit contributions
ust-0.667
vel-0.639
nes-0.638
onel-0.580
oll-0.565
Token,
Token ID11
Feature activation-1.36e-02

Pos logit contributions
ust+0.000
vel+0.000
nes+0.000
onel+0.000
aces+0.000
Neg logit contributions
 last-0.000
 Ms-0.000
 teach-0.000
 front-0.000
 hugs-0.000
Token but
Token ID475
Feature activation+2.44e-01

Pos logit contributions
ust+0.025
nes+0.025
vel+0.025
onel+0.023
aces+0.022
Neg logit contributions
 last-0.018
 Ms-0.016
 teach-0.014
 front-0.014
 hugs-0.014
Token watched
Token ID7342
Feature activation+3.04e-02

Pos logit contributions
ust+0.135
nes+0.132
vel+0.130
onel+0.120
aces+0.119
Neg logit contributions
 last-0.084
 Ms-0.073
 front-0.065
 teach-0.065
't-0.064
Token.
Token ID13
Feature activation+5.35e-01

Pos logit contributions
 last+0.031
 Ms+0.025
 front+0.022
 teach+0.021
 hugs+0.021
Neg logit contributions
ust-0.067
vel-0.066
nes-0.065
onel-0.061
aces-0.060
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
 last+0.664
 Ms+0.544
 teach+0.518
 front+0.500
 praise+0.480
Neg logit contributions
ust-1.060
vel-1.023
nes-1.022
onel-0.932
aces-0.930
Token
Token ID198
Feature activation+3.92e-01

Pos logit contributions
 last+0.245
 Ms+0.205
 teach+0.188
 front+0.183
 hugs+0.180
Neg logit contributions
ust-0.439
nes-0.430
vel-0.426
onel-0.388
oll-0.384
TokenWhen
Token ID2215
Feature activation+3.99e-01

Pos logit contributions
 last+0.485
 Ms+0.408
 hugs+0.376
 teach+0.369
 front+0.362
Neg logit contributions
ust-0.810
vel-0.782
nes-0.781
onel-0.695
ode-0.683
Token the
Token ID262
Feature activation+2.95e+00

Pos logit contributions
 last+0.474
 Ms+0.417
 teach+0.376
 front+0.371
 hugs+0.369
Neg logit contributions
ust-0.781
nes-0.763
vel-0.758
onel-0.689
aund-0.680
Token video
Token ID2008
Feature activation-1.70e-01

Pos logit contributions
 Ms+4.493
 last+4.436
 Mrs+3.963
+3.774
 praise+3.761
Neg logit contributions
vel-4.032
nes-4.029
ust-3.963
aund-3.779
oll-3.755
Token was
Token ID373
Feature activation-3.34e-01

Pos logit contributions
ust+0.354
nes+0.347
vel+0.342
onel+0.319
aces+0.314
Neg logit contributions
 last-0.190
 Ms-0.161
 front-0.143
 teach-0.142
 hugs-0.140
Token over
Token ID625
Feature activation+1.19e+00

Pos logit contributions
ust+0.612
nes+0.592
vel+0.588
onel+0.537
aces+0.530
Neg logit contributions
 last-0.469
 Ms-0.414
 teach-0.377
 front-0.374
 hugs-0.371
Token,
Token ID11
Feature activation+8.91e-01

Pos logit contributions
 last+1.243
 Ms+1.167
 teach+1.039
 hugs+0.974
 praise+0.944
Neg logit contributions
ust-2.432
vel-2.369
nes-2.331
aund-2.170
onel-2.161
Token they
Token ID484
Feature activation-8.15e-01

Pos logit contributions
 last+1.059
 Ms+0.901
 hugs+0.835
 teach+0.830
 front+0.828
Neg logit contributions
ust-1.760
nes-1.699
vel-1.685
onel-1.544
aund-1.530
Token She
Token ID1375
Feature activation-8.06e-01

Pos logit contributions
ust+2.087
nes+2.037
vel+2.021
onel+1.882
ode+1.861
Neg logit contributions
 last-1.133
 Ms-0.993
 front-0.872
 teach-0.855
 hugs-0.837
Token sat
Token ID3332
Feature activation-2.55e-01

Pos logit contributions
ust+1.659
nes+1.613
vel+1.609
onel+1.487
aces+1.472
Neg logit contributions
 last-0.974
 Ms-0.831
 front-0.736
 teach-0.734
 hugs-0.732
Token down
Token ID866
Feature activation+1.21e+00

Pos logit contributions
ust+0.427
nes+0.413
vel+0.411
onel+0.374
aces+0.368
Neg logit contributions
 last-0.400
 Ms-0.356
 teach-0.329
 front-0.325
 hugs-0.322
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
 last+1.645
 Ms+1.553
 teach+1.427
 hugs+1.386
 praise+1.385
Neg logit contributions
vel-2.017
nes-2.013
ust-2.007
onel-1.815
aces-1.805
Token watched
Token ID7342
Feature activation+7.29e-02

Pos logit contributions
ust+0.421
nes+0.411
vel+0.407
onel+0.375
aces+0.368
Neg logit contributions
 last-0.276
 Ms-0.236
 front-0.214
 teach-0.213
 hugs-0.207
Token the
Token ID262
Feature activation+2.94e+00

Pos logit contributions
 last+0.092
 Ms+0.079
 front+0.071
 teach+0.070
 hugs+0.070
Neg logit contributions
ust-0.141
vel-0.139
nes-0.138
onel-0.126
aces-0.125
Token cartoon
Token ID16251
Feature activation-3.28e-01

Pos logit contributions
 last+4.114
+3.856
't+3.469
 Ms+3.426
 praise+3.249
Neg logit contributions
vel-4.793
ust-4.741
nes-4.376
oll-4.162
onel-4.097
Token.
Token ID13
Feature activation-2.22e-01

Pos logit contributions
ust+0.670
nes+0.653
vel+0.650
onel+0.600
aces+0.594
Neg logit contributions
 last-0.388
 Ms-0.332
 front-0.294
 teach-0.293
 hugs-0.289
Token After
Token ID2293
Feature activation+5.96e-01

Pos logit contributions
ust+0.451
nes+0.438
vel+0.437
onel+0.403
aces+0.401
Neg logit contributions
 last-0.267
 Ms-0.225
 teach-0.202
 front-0.200
 hugs-0.198
Token a
Token ID257
Feature activation+8.25e-01

Pos logit contributions
 last+0.618
 Ms+0.522
 teach+0.458
 front+0.444
 hugs+0.441
Neg logit contributions
vel-1.248
ust-1.237
nes-1.227
onel-1.140
oll-1.137
Token while
Token ID981
Feature activation+8.69e-01

Pos logit contributions
 last+0.739
 Ms+0.621
 hugs+0.577
 front+0.558
 praise+0.529
Neg logit contributions
ust-1.831
nes-1.808
vel-1.791
ode-1.600
onel-1.594
Token.
Token ID13
Feature activation-2.39e-01

Pos logit contributions
ust+0.038
vel+0.037
nes+0.037
onel+0.034
aces+0.034
Neg logit contributions
 last-0.021
 Ms-0.018
 front-0.016
 teach-0.016
 hugs-0.016
Token He
Token ID679
Feature activation-4.68e-03

Pos logit contributions
ust+0.499
nes+0.484
vel+0.480
onel+0.445
aces+0.442
Neg logit contributions
 last-0.279
 Ms-0.232
 teach-0.209
 front-0.207
 hugs-0.203
Token said
Token ID531
Feature activation-7.59e-01

Pos logit contributions
ust+0.009
nes+0.009
vel+0.009
onel+0.008
oll+0.008
Neg logit contributions
 last-0.006
 Ms-0.005
 front-0.004
 teach-0.004
't-0.004
Token he
Token ID339
Feature activation+3.67e-01

Pos logit contributions
ust+1.756
nes+1.707
vel+1.693
onel+1.581
ode+1.567
Neg logit contributions
 last-0.728
 Ms-0.599
 teach-0.513
 hugs-0.505
 front-0.504
Token loved
Token ID6151
Feature activation+3.90e-01

Pos logit contributions
 last+0.510
 Ms+0.462
 front+0.427
+0.404
 teach+0.402
Neg logit contributions
ust-0.678
nes-0.649
vel-0.641
onel-0.591
aces-0.588
Token his
Token ID465
Feature activation+2.92e+00

Pos logit contributions
 last+0.518
 Ms+0.444
+0.414
 front+0.407
 teach+0.403
Neg logit contributions
ust-0.722
nes-0.705
vel-0.703
aces-0.647
onel-0.637
Token mom
Token ID1995
Feature activation-2.79e-01

Pos logit contributions
 last+4.499
+4.381
 Ms+4.294
ted+4.266
ited+4.252
Neg logit contributions
ust-4.111
nes-4.043
vel-3.901
aces-3.608
oll-3.429
Token too
Token ID1165
Feature activation+4.58e-01

Pos logit contributions
ust+0.586
nes+0.568
vel+0.565
onel+0.523
aces+0.521
Neg logit contributions
 last-0.315
 Ms-0.270
 front-0.238
 teach-0.237
 hugs-0.231
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
 last+0.532
 Ms+0.477
 teach+0.420
 hugs+0.410
 praise+0.408
Neg logit contributions
ust-0.932
vel-0.900
nes-0.885
aces-0.822
oll-0.817
Token He
Token ID679
Feature activation+7.80e-02

Pos logit contributions
ust+0.213
nes+0.205
vel+0.203
onel+0.188
aces+0.188
Neg logit contributions
 last-0.121
 Ms-0.100
 teach-0.091
 front-0.090
 hugs-0.088
Token said
Token ID531
Feature activation-6.30e-01

Pos logit contributions
 last+0.090
 Ms+0.078
 front+0.071
't+0.068
 teach+0.068
Neg logit contributions
ust-0.161
nes-0.156
vel-0.153
onel-0.142
oll-0.142
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
 last+0.664
 Ms+0.611
 teach+0.527
 praise+0.524
+0.518
Neg logit contributions
ust-1.110
vel-1.083
nes-1.075
aces-0.990
oll-0.980
Token
Token ID220
Feature activation-2.00e-01

Pos logit contributions
 last+0.379
 Ms+0.302
 teach+0.291
 front+0.285
 praise+0.272
Neg logit contributions
ust-0.608
nes-0.592
vel-0.587
onel-0.538
oll-0.536
Token
Token ID198
Feature activation+5.26e-02

Pos logit contributions
ust+0.449
nes+0.438
vel+0.434
onel+0.405
aces+0.402
Neg logit contributions
 last-0.200
 Ms-0.163
 teach-0.143
 front-0.142
 hugs-0.138
Token
Token ID198
Feature activation+8.29e-02

Pos logit contributions
 last+0.056
 Ms+0.045
 teach+0.041
 front+0.040
 hugs+0.039
Neg logit contributions
ust-0.114
nes-0.113
vel-0.111
onel-0.102
aces-0.101
TokenAfter
Token ID3260
Feature activation+2.32e-01

Pos logit contributions
 last+0.094
 Ms+0.078
 teach+0.070
 hugs+0.070
 front+0.069
Neg logit contributions
ust-0.178
nes-0.174
vel-0.172
onel-0.156
aces-0.153
Token the
Token ID262
Feature activation+2.91e+00

Pos logit contributions
 last+0.245
 Ms+0.206
 teach+0.181
 front+0.179
 hugs+0.174
Neg logit contributions
vel-0.484
ust-0.483
nes-0.482
onel-0.444
oll-0.443
Token cartoon
Token ID16251
Feature activation+1.32e-01

Pos logit contributions
 last+4.074
 Ms+4.066
ted+3.788
+3.779
glers+3.733
Neg logit contributions
vel-4.321
nes-4.142
ust-4.113
oll-4.086
aund-3.796
Token was
Token ID373
Feature activation-2.97e-01

Pos logit contributions
 last+0.127
 Ms+0.105
 front+0.093
 teach+0.090
 hugs+0.088
Neg logit contributions
ust-0.290
nes-0.289
vel-0.285
onel-0.266
oll-0.263
Token over
Token ID625
Feature activation+1.39e+00

Pos logit contributions
ust+0.547
nes+0.530
vel+0.526
onel+0.481
aces+0.475
Neg logit contributions
 last-0.413
 Ms-0.363
 teach-0.331
 front-0.329
 hugs-0.324
Token,
Token ID11
Feature activation+3.77e-01

Pos logit contributions
 last+1.467
 Ms+1.332
 teach+1.236
 front+1.122
 hugs+1.117
Neg logit contributions
ust-2.744
vel-2.717
nes-2.710
onel-2.528
 penn-2.515
Token Lily
Token ID20037
Feature activation-8.75e-02

Pos logit contributions
 last+0.479
 Ms+0.400
 front+0.378
 teach+0.374
 hugs+0.369
Neg logit contributions
ust-0.718
nes-0.709
vel-0.696
onel-0.637
oll-0.626
Token car
Token ID1097
Feature activation+3.96e-01

Pos logit contributions
 last+1.724
 Ms+1.648
 praise+1.422
 hugs+1.416
 Mrs+1.398
Neg logit contributions
vel-1.797
ust-1.792
aces-1.748
nes-1.737
ode-1.689
Token and
Token ID290
Feature activation+2.15e+00

Pos logit contributions
 last+0.463
 Ms+0.429
 teach+0.359
 hugs+0.351
 praise+0.351
Neg logit contributions
ust-0.793
vel-0.774
nes-0.760
aces-0.713
onel-0.712
Token likes
Token ID7832
Feature activation+5.90e-02

Pos logit contributions
 last+3.546
+3.043
 Ms+2.928
 praise+2.766
 teach+2.736
Neg logit contributions
vel-3.178
ust-3.157
aces-2.997
nes-2.851
onel-2.746
Token it
Token ID340
Feature activation+3.32e-01

Pos logit contributions
 last+0.076
 Ms+0.066
 front+0.059
 teach+0.059
+0.059
Neg logit contributions
ust-0.112
nes-0.108
vel-0.108
aces-0.100
onel-0.100
Token.
Token ID13
Feature activation+6.52e-01

Pos logit contributions
 last+0.377
 Ms+0.349
+0.297
 teach+0.295
 front+0.293
Neg logit contributions
ust-0.689
vel-0.661
nes-0.652
aces-0.613
onel-0.611
Token The
Token ID383
Feature activation+2.91e+00

Pos logit contributions
 last+1.020
 Ms+0.850
 teach+0.801
 front+0.797
 praise+0.785
Neg logit contributions
ust-1.073
vel-1.030
nes-1.002
aces-0.959
onel-0.943
Token cat
Token ID3797
Feature activation+4.57e-01

Pos logit contributions
 last+5.111
 Ms+4.931
+4.831
ted+4.786
ited+4.683
Neg logit contributions
aces-3.306
vel-3.167
 wing-3.119
ust-3.103
aky-2.886
Token jumps
Token ID18045
Feature activation+2.33e-01

Pos logit contributions
 last+0.614
 Ms+0.529
 front+0.482
+0.480
 teach+0.479
Neg logit contributions
ust-0.853
vel-0.820
nes-0.806
aces-0.759
onel-0.756
Token on
Token ID319
Feature activation+7.17e-01

Pos logit contributions
 last+0.354
 Ms+0.317
 teach+0.295
 praise+0.287
+0.285
Neg logit contributions
ust-0.396
vel-0.386
nes-0.382
onel-0.348
aces-0.344
Token the
Token ID262
Feature activation+1.91e-01

Pos logit contributions
 last+0.835
 Ms+0.721
 teach+0.645
 praise+0.643
 hugs+0.634
Neg logit contributions
vel-1.389
ust-1.386
nes-1.343
aces-1.251
onel-1.243
Token car
Token ID1097
Feature activation-5.99e-02

Pos logit contributions
 last+0.290
 Ms+0.264
 teach+0.238
+0.234
 praise+0.233
Neg logit contributions
ust-0.314
nes-0.309
vel-0.305
aces-0.285
ode-0.277
Token scare
Token ID19437
Feature activation+4.57e-01

Pos logit contributions
 last+0.436
 Ms+0.385
 front+0.328
 hugs+0.322
+0.314
Neg logit contributions
ust-0.813
nes-0.800
vel-0.778
onel-0.718
ode-0.717
Token her
Token ID607
Feature activation-5.87e-01

Pos logit contributions
 last+0.655
 Ms+0.579
+0.538
 front+0.522
 teach+0.521
Neg logit contributions
ust-0.796
nes-0.789
vel-0.777
onel-0.708
aces-0.685
Token a
Token ID257
Feature activation-4.20e-01

Pos logit contributions
ust+1.099
nes+1.065
vel+1.057
onel+0.971
aces+0.959
Neg logit contributions
 last-0.808
 Ms-0.704
 front-0.639
 teach-0.638
 hugs-0.628
Token bit
Token ID1643
Feature activation+5.52e-01

Pos logit contributions
ust+0.832
nes+0.807
vel+0.802
onel+0.740
aces+0.729
Neg logit contributions
 last-0.529
 Ms-0.457
 front-0.408
 teach-0.408
 hugs-0.402
Token".
Token ID1911
Feature activation+3.14e-01

Pos logit contributions
 last+0.810
 Ms+0.752
+0.692
 front+0.678
 teach+0.677
Neg logit contributions
ust-0.970
nes-0.921
vel-0.905
onel-0.829
ode-0.793
Token The
Token ID383
Feature activation+2.90e+00

Pos logit contributions
 last+0.344
 Ms+0.273
 front+0.251
 teach+0.250
 hugs+0.239
Neg logit contributions
ust-0.659
nes-0.648
vel-0.637
onel-0.592
aces-0.588
Token other
Token ID584
Feature activation+2.20e+00

Pos logit contributions
 last+4.780
+4.452
ted+4.228
ited+4.137
 Ms+4.069
Neg logit contributions
ust-4.109
nes-3.913
vel-3.794
aces-3.506
onel-3.468
Token cat
Token ID3797
Feature activation-6.33e-02

Pos logit contributions
 last+3.608
+3.091
 Ms+2.995
 praise+2.976
 front+2.958
Neg logit contributions
ust-3.562
vel-3.252
nes-3.239
aces-3.082
onel-2.907
Token agreed
Token ID4987
Feature activation-9.35e-02

Pos logit contributions
ust+0.129
nes+0.125
vel+0.124
onel+0.115
aces+0.114
Neg logit contributions
 last-0.076
 Ms-0.065
 front-0.059
 teach-0.057
 hugs-0.056
Token.
Token ID13
Feature activation+5.75e-01

Pos logit contributions
ust+0.200
nes+0.194
vel+0.194
onel+0.177
aces+0.177
Neg logit contributions
 last-0.102
 Ms-0.087
 front-0.076
 teach-0.075
 hugs-0.074
Token The
Token ID383
Feature activation+2.57e+00

Pos logit contributions
 last+0.754
 Ms+0.608
 teach+0.584
 front+0.578
 praise+0.550
Neg logit contributions
ust-1.107
nes-1.062
vel-1.053
aces-0.974
onel-0.970
Token better
Token ID1365
Feature activation-5.25e-01

Pos logit contributions
 last+0.778
 Ms+0.692
+0.638
 teach+0.637
 front+0.633
Neg logit contributions
ust-0.945
vel-0.919
nes-0.911
onel-0.859
oll-0.831
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
ust+1.158
nes+1.127
vel+1.119
onel+1.043
aces+1.031
Neg logit contributions
 last-0.552
 Ms-0.460
 front-0.401
 teach-0.400
 hugs-0.391
Token
Token ID198
Feature activation-6.02e-01

Pos logit contributions
ust+0.955
nes+0.928
vel+0.922
onel+0.854
aces+0.844
Neg logit contributions
 last-0.554
 Ms-0.473
 teach-0.420
 front-0.420
 hugs-0.412
Token
Token ID198
Feature activation-7.66e-01

Pos logit contributions
ust+1.392
nes+1.355
vel+1.346
onel+1.261
aces+1.248
Neg logit contributions
 last-0.565
 Ms-0.461
 front-0.393
 teach-0.391
 hugs-0.382
TokenWhen
Token ID2215
Feature activation+6.75e-02

Pos logit contributions
ust+1.567
nes+1.520
vel+1.503
onel+1.408
aces+1.393
Neg logit contributions
 last-0.928
 Ms-0.799
 front-0.719
 teach-0.713
 hugs-0.691
Token the
Token ID262
Feature activation+2.90e+00

Pos logit contributions
 last+0.070
 Ms+0.058
 hugs+0.051
 teach+0.051
 front+0.050
Neg logit contributions
ust-0.144
nes-0.142
vel-0.141
onel-0.130
oll-0.128
Token cartoon
Token ID16251
Feature activation-1.66e-01

Pos logit contributions
 last+3.712
+3.472
 Ms+3.399
't+3.131
ted+3.083
Neg logit contributions
ust-4.686
oll-4.627
vel-4.595
nes-4.548
onel-4.154
Token finished
Token ID5201
Feature activation+8.11e-02

Pos logit contributions
ust+0.354
nes+0.348
vel+0.344
onel+0.320
aces+0.316
Neg logit contributions
 last-0.178
 Ms-0.149
 front-0.131
 teach-0.130
 hugs-0.128
Token,
Token ID11
Feature activation+3.39e-01

Pos logit contributions
 last+0.077
 Ms+0.062
 front+0.053
 teach+0.053
 hugs+0.052
Neg logit contributions
ust-0.183
vel-0.180
nes-0.180
onel-0.166
aces-0.165
Token her
Token ID607
Feature activation+6.07e-01

Pos logit contributions
 last+0.406
 Ms+0.336
 hugs+0.314
 teach+0.305
 front+0.305
Neg logit contributions
ust-0.674
vel-0.659
nes-0.657
onel-0.601
aces-0.591
Token mom
Token ID1995
Feature activation+4.87e-01

Pos logit contributions
 last+0.846
 Ms+0.772
't+0.687
 teach+0.686
+0.674
Neg logit contributions
ust-1.059
nes-1.036
vel-1.005
oll-0.955
onel-0.931
Token a
Token ID257
Feature activation+1.31e-01

Pos logit contributions
ust+0.086
nes+0.083
vel+0.082
onel+0.076
ode+0.076
Neg logit contributions
 last-0.043
 Ms-0.036
 front-0.032
 hugs-0.031
 teach-0.031
Token little
Token ID1310
Feature activation-1.93e-01

Pos logit contributions
 last+0.164
 Ms+0.140
 front+0.127
 hugs+0.126
 teach+0.125
Neg logit contributions
ust-0.256
nes-0.248
vel-0.247
onel-0.226
ode-0.224
Token mouse
Token ID10211
Feature activation-3.06e-02

Pos logit contributions
ust+0.347
nes+0.335
vel+0.333
onel+0.306
ode+0.301
Neg logit contributions
 last-0.275
 Ms-0.243
 front-0.221
 teach-0.219
't-0.217
Token named
Token ID3706
Feature activation+4.41e-01

Pos logit contributions
ust+0.058
nes+0.057
vel+0.056
onel+0.052
aces+0.051
Neg logit contributions
 last-0.040
 Ms-0.035
 front-0.032
 teach-0.031
 hugs-0.031
Token S
Token ID311
Feature activation-6.39e-01

Pos logit contributions
 last+0.499
 Ms+0.396
 front+0.375
 praise+0.365
 teach+0.360
Neg logit contributions
ust-0.925
nes-0.906
vel-0.889
onel-0.825
aces-0.820
Tokenque
Token ID4188
Feature activation-3.33e+00

Pos logit contributions
ust+1.040
nes+1.007
vel+0.997
onel+0.902
aces+0.886
Neg logit contributions
 last-1.039
 Ms-0.923
 teach-0.851
 front-0.848
 hugs-0.839
Tokenaky
Token ID15492
Feature activation+1.00e+00

Pos logit contributions
nes+3.088
ust+2.991
vel+2.557
aund+2.418
onel+2.347
Neg logit contributions
 last-8.075
 first-7.127
 Ms-7.106
 front-6.964
't-6.811
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
 last+1.204
 Ms+1.098
+0.977
't+0.971
 praise+0.948
Neg logit contributions
ust-1.954
vel-1.871
nes-1.857
aces-1.778
ode-1.726
Token One
Token ID1881
Feature activation+5.58e-02

Pos logit contributions
 last+0.299
 Ms+0.238
 teach+0.220
 hugs+0.219
 front+0.219
Neg logit contributions
ust-0.546
nes-0.530
vel-0.525
aces-0.490
oll-0.486
Token day
Token ID1110
Feature activation+1.19e+00

Pos logit contributions
 last+0.074
 Ms+0.067
 teach+0.060
 front+0.060
 hugs+0.060
Neg logit contributions
ust-0.105
nes-0.101
vel-0.099
onel-0.092
oll-0.091
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 last+1.338
 Ms+1.109
 praise+1.047
 teach+1.042
 front+1.028
Neg logit contributions
vel-2.413
ust-2.402
nes-2.358
oll-2.232
aund-2.226
Token his
Token ID465
Feature activation+8.26e-02

Pos logit contributions
ust+0.648
nes+0.628
vel+0.625
onel+0.574
aces+0.569
Neg logit contributions
 last-0.471
 Ms-0.410
 teach-0.372
 front-0.370
 hugs-0.363
Token pet
Token ID4273
Feature activation+3.53e-02

Pos logit contributions
 last+0.135
 Ms+0.125
 teach+0.113
 front+0.112
't+0.111
Neg logit contributions
ust-0.130
nes-0.124
vel-0.123
onel-0.112
aces-0.112
Token mouse
Token ID10211
Feature activation-2.05e-01

Pos logit contributions
 last+0.047
 Ms+0.041
 teach+0.037
 front+0.037
't+0.036
Neg logit contributions
ust-0.066
nes-0.064
vel-0.063
onel-0.059
aces-0.058
Token,
Token ID11
Feature activation-4.48e-01

Pos logit contributions
ust+0.430
nes+0.417
vel+0.416
onel+0.386
aces+0.384
Neg logit contributions
 last-0.232
 Ms-0.198
 front-0.174
 teach-0.173
 hugs-0.168
Token S
Token ID311
Feature activation-4.87e-01

Pos logit contributions
ust+0.945
nes+0.919
vel+0.912
onel+0.847
aces+0.837
Neg logit contributions
 last-0.511
 Ms-0.432
 front-0.382
 teach-0.381
 hugs-0.374
Tokenque
Token ID4188
Feature activation-3.31e+00

Pos logit contributions
ust+0.781
nes+0.753
vel+0.746
onel+0.674
aces+0.664
Neg logit contributions
 last-0.803
 Ms-0.719
 front-0.663
 teach-0.662
 hugs-0.655
Tokenaky
Token ID15492
Feature activation+5.93e-01

Pos logit contributions
ust+3.017
nes+2.972
aund+2.519
vel+2.375
onel+2.314
Neg logit contributions
 last-7.976
 Ms-7.128
 first-7.089
 front-7.039
 while-6.904
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
 last+0.689
 Ms+0.602
 front+0.527
+0.520
 teach+0.518
Neg logit contributions
ust-1.201
nes-1.191
vel-1.170
aces-1.106
onel-1.083
Token
Token ID220
Feature activation-7.45e-01

Pos logit contributions
ust+0.233
nes+0.226
vel+0.225
aces+0.209
onel+0.208
Neg logit contributions
 last-0.104
 Ms-0.083
 front-0.074
 teach-0.074
 hugs-0.072
Token
Token ID198
Feature activation-3.57e-02

Pos logit contributions
ust+1.738
nes+1.699
vel+1.691
onel+1.583
ode+1.570
Neg logit contributions
 last-0.671
 Ms-0.544
 front-0.456
 teach-0.450
 hugs-0.441
TokenOne
Token ID3198
Feature activation+3.81e-01

Pos logit contributions
ust+0.078
nes+0.077
vel+0.076
onel+0.070
aces+0.069
Neg logit contributions
 last-0.036
 Ms-0.030
 teach-0.027
 hugs-0.026
 front-0.026
TokenOne
Token ID3198
Feature activation+3.81e-01

Pos logit contributions
ust+0.078
nes+0.077
vel+0.076
onel+0.070
aces+0.069
Neg logit contributions
 last-0.036
 Ms-0.030
 teach-0.027
 hugs-0.026
 front-0.026
Token sunny
Token ID27737
Feature activation-2.11e-02

Pos logit contributions
 last+0.520
 Ms+0.483
 teach+0.442
+0.442
 hugs+0.438
Neg logit contributions
ust-0.691
nes-0.649
vel-0.644
oll-0.596
adow-0.595
Token day
Token ID1110
Feature activation+9.72e-01

Pos logit contributions
ust+0.039
nes+0.038
vel+0.038
onel+0.035
aces+0.034
Neg logit contributions
 last-0.028
 Ms-0.025
 teach-0.023
 front-0.022
 hugs-0.022
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
 last+1.049
 Ms+0.885
 teach+0.822
 praise+0.816
 front+0.806
Neg logit contributions
ust-2.003
vel-1.990
nes-1.947
oll-1.840
aund-1.823
Token S
Token ID311
Feature activation-3.06e-01

Pos logit contributions
 last+1.753
 praise+1.509
 teach+1.474
 Ms+1.465
 front+1.437
Neg logit contributions
ust-2.284
nes-2.227
vel-2.164
oll-2.059
aces-2.012
Tokenque
Token ID4188
Feature activation-3.11e+00

Pos logit contributions
ust+0.478
nes+0.457
vel+0.454
onel+0.410
aces+0.406
Neg logit contributions
 last-0.517
 Ms-0.467
 front-0.433
 teach-0.431
 hugs-0.426
Tokenaky
Token ID15492
Feature activation+2.59e-01

Pos logit contributions
ust+2.622
nes+2.613
vel+2.191
aund+2.068
ode+1.894
Neg logit contributions
 last-7.628
 Ms-6.854
 first-6.789
't-6.663
 while-6.571
Token came
Token ID1625
Feature activation+1.15e+00

Pos logit contributions
 last+0.357
 Ms+0.310
 front+0.285
 teach+0.284
+0.280
Neg logit contributions
ust-0.476
nes-0.469
vel-0.457
onel-0.419
aces-0.415
Token up
Token ID510
Feature activation+2.08e+00

Pos logit contributions
 last+1.558
 Ms+1.484
+1.442
 teach+1.352
 praise+1.331
Neg logit contributions
ust-2.125
vel-1.976
nes-1.962
onel-1.819
adow-1.789
Token to
Token ID284
Feature activation+1.09e+00

Pos logit contributions
 last+2.933
 Ms+2.700
 hugs+2.643
 praise+2.607
 teach+2.560
Neg logit contributions
ust-3.579
vel-3.530
nes-3.330
adow-3.005
onel-2.975
Token Fl
Token ID1610
Feature activation-1.77e-01

Pos logit contributions
 last+1.219
 Ms+0.993
+0.940
 praise+0.905
 hugs+0.898
Neg logit contributions
nes-2.151
vel-2.086
ust-2.084
ode-1.884
aund-1.851
Token She
Token ID1375
Feature activation-3.36e-01

Pos logit contributions
ust+0.387
nes+0.377
vel+0.373
onel+0.344
aces+0.343
Neg logit contributions
 last-0.227
 Ms-0.189
 teach-0.171
 front-0.170
 hugs-0.166
Token blinked
Token ID40360
Feature activation-2.38e-02

Pos logit contributions
ust+0.701
nes+0.681
vel+0.675
onel+0.625
aces+0.618
Neg logit contributions
 last-0.389
 Ms-0.331
 teach-0.294
 front-0.293
 hugs-0.286
Token her
Token ID607
Feature activation-5.34e-01

Pos logit contributions
ust+0.043
vel+0.042
nes+0.042
onel+0.038
aces+0.038
Neg logit contributions
 last-0.034
 Ms-0.030
 teach-0.027
 front-0.027
-0.027
Token eyes
Token ID2951
Feature activation+1.25e+00

Pos logit contributions
ust+0.809
nes+0.778
vel+0.770
onel+0.691
aces+0.679
Neg logit contributions
 last-0.938
 Ms-0.843
 front-0.784
 teach-0.782
 hugs-0.774
Token and
Token ID290
Feature activation+1.53e-01

Pos logit contributions
 last+1.509
 Ms+1.464
 praise+1.266
 teach+1.258
 hugs+1.230
Neg logit contributions
ust-2.421
vel-2.406
nes-2.259
aces-2.162
onel-2.124
Token imm
Token ID2296
Feature activation-3.11e+00

Pos logit contributions
 last+0.201
 Ms+0.175
+0.159
 front+0.158
 teach+0.157
Neg logit contributions
ust-0.289
nes-0.280
vel-0.279
onel-0.256
aces-0.251
Tokenag
Token ID363
Feature activation-1.34e+00

Pos logit contributions
vel+5.013
nes+4.832
ode+4.630
ust+4.558
onel+4.477
Neg logit contributions
 last-5.273
 Ms-4.405
 front-4.291
 first-4.240
 teach-4.118
Tokenined
Token ID1389
Feature activation-5.18e-01

Pos logit contributions
ust+2.056
nes+2.033
vel+1.967
ode+1.806
onel+1.763
Neg logit contributions
 last-2.302
 Ms-2.016
 front-1.924
 hugs-1.880
't-1.878
Token she
Token ID673
Feature activation+5.87e-01

Pos logit contributions
ust+1.049
nes+1.020
vel+1.011
onel+0.935
aces+0.924
Neg logit contributions
 last-0.639
 Ms-0.549
 front-0.490
 teach-0.489
 hugs-0.481
Token could
Token ID714
Feature activation+4.12e-01

Pos logit contributions
 last+0.784
 Ms+0.720
+0.642
 front+0.630
 teach+0.630
Neg logit contributions
ust-1.103
nes-1.062
vel-1.056
onel-0.981
aces-0.959
Token see
Token ID766
Feature activation-3.76e-01

Pos logit contributions
 last+0.498
 Ms+0.468
 hugs+0.409
+0.395
 Mrs+0.387
Neg logit contributions
ust-0.807
nes-0.797
vel-0.773
onel-0.721
aces-0.702
Token G
Token ID402
Feature activation-7.50e-01

Pos logit contributions
ust+2.236
vel+2.182
nes+2.163
onel+2.037
aces+1.991
Neg logit contributions
 last-1.122
 Ms-0.958
 teach-0.830
 front-0.800
 hugs-0.796
Tokeniz
Token ID528
Feature activation-2.16e-01

Pos logit contributions
ust+1.126
nes+1.087
vel+1.077
onel+0.967
aces+0.943
Neg logit contributions
 last-1.308
 Ms-1.165
 front-1.087
 teach-1.083
 hugs-1.070
Tokenmo
Token ID5908
Feature activation+3.28e-02

Pos logit contributions
ust+0.427
nes+0.416
vel+0.412
onel+0.381
aces+0.378
Neg logit contributions
 last-0.269
 Ms-0.236
 front-0.210
 teach-0.209
 hugs-0.205
Token and
Token ID290
Feature activation-6.01e-01

Pos logit contributions
 last+0.039
 Ms+0.035
 front+0.031
 teach+0.030
 hugs+0.030
Neg logit contributions
ust-0.066
nes-0.064
vel-0.063
aces-0.059
onel-0.059
Token S
Token ID311
Feature activation-8.58e-01

Pos logit contributions
ust+1.303
nes+1.266
vel+1.261
onel+1.170
aces+1.156
Neg logit contributions
 last-0.663
 Ms-0.560
 teach-0.490
 front-0.488
 hugs-0.479
Tokenque
Token ID4188
Feature activation-3.10e+00

Pos logit contributions
ust+1.341
nes+1.306
vel+1.292
onel+1.156
aces+1.127
Neg logit contributions
 last-1.460
 Ms-1.285
 teach-1.204
 front-1.186
 hugs-1.180
Tokeneker
Token ID28233
Feature activation+4.93e-01

Pos logit contributions
ust+2.767
nes+2.703
vel+2.334
aund+2.232
oll+2.092
Neg logit contributions
 last-7.556
 first-6.709
 Ms-6.615
 front-6.399
't-6.357
Token,
Token ID11
Feature activation-2.54e-01

Pos logit contributions
 last+0.513
 Ms+0.465
 front+0.389
 hugs+0.379
 teach+0.376
Neg logit contributions
ust-1.049
nes-1.018
vel-1.012
aces-0.964
onel-0.950
Token were
Token ID547
Feature activation+5.92e-01

Pos logit contributions
ust+0.481
nes+0.467
vel+0.462
onel+0.426
aces+0.421
Neg logit contributions
 last-0.342
 Ms-0.297
 front-0.271
 teach-0.269
 hugs-0.265
Token both
Token ID1111
Feature activation+3.68e-01

Pos logit contributions
 last+0.837
 Ms+0.722
 front+0.671
 hugs+0.670
+0.670
Neg logit contributions
ust-1.064
nes-1.052
vel-1.029
onel-0.943
aces-0.939
Token small
Token ID1402
Feature activation+8.70e-02

Pos logit contributions
 last+0.541
 Ms+0.484
 hugs+0.439
 front+0.435
 teach+0.435
Neg logit contributions
ust-0.642
nes-0.630
vel-0.618
onel-0.570
aces-0.566
Token play
Token ID711
Feature activation-3.86e-01

Pos logit contributions
 last+0.300
 Ms+0.250
 front+0.219
+0.215
 hugs+0.211
Neg logit contributions
ust-0.630
nes-0.628
vel-0.605
ode-0.568
onel-0.567
Token?"
Token ID1701
Feature activation-3.51e-01

Pos logit contributions
ust+0.842
nes+0.822
vel+0.818
onel+0.758
aces+0.750
Neg logit contributions
 last-0.408
 Ms-0.341
 front-0.297
 teach-0.294
 hugs-0.290
Token
Token ID198
Feature activation+4.56e-01

Pos logit contributions
ust+0.780
nes+0.762
vel+0.756
onel+0.704
aces+0.698
Neg logit contributions
 last-0.358
 Ms-0.293
 front-0.255
 teach-0.254
 hugs-0.249
Token
Token ID198
Feature activation+3.14e-01

Pos logit contributions
 last+0.514
 Ms+0.418
 teach+0.378
 front+0.377
 hugs+0.377
Neg logit contributions
ust-0.950
nes-0.942
vel-0.923
oll-0.844
onel-0.838
TokenS
Token ID50
Feature activation+3.02e-01

Pos logit contributions
 last+0.473
 Ms+0.405
 hugs+0.376
 praise+0.371
 front+0.369
Neg logit contributions
ust-0.550
nes-0.543
vel-0.533
ode-0.477
aces-0.471
Tokenque
Token ID4188
Feature activation-3.08e+00

Pos logit contributions
 last+0.469
 Ms+0.421
 front+0.392
 teach+0.390
 hugs+0.381
Neg logit contributions
ust-0.514
vel-0.492
nes-0.487
ode-0.442
oll-0.441
Tokenaky
Token ID15492
Feature activation-3.82e-01

Pos logit contributions
nes+2.679
ust+2.548
vel+2.219
ode+2.022
onel+1.974
Neg logit contributions
 last-7.462
 Ms-6.757
 first-6.680
't-6.572
 front-6.407
Token replied
Token ID8712
Feature activation+3.94e-01

Pos logit contributions
ust+0.828
nes+0.807
vel+0.800
onel+0.743
aces+0.736
Neg logit contributions
 last-0.408
 Ms-0.342
 front-0.300
 teach-0.298
 hugs-0.292
Token,
Token ID11
Feature activation+5.59e-01

Pos logit contributions
 last+0.391
 Ms+0.316
 front+0.283
 teach+0.271
 praise+0.271
Neg logit contributions
ust-0.871
vel-0.867
nes-0.854
onel-0.802
aces-0.793
Token "
Token ID366
Feature activation+1.06e+00

Pos logit contributions
 last+0.764
 Ms+0.627
 praise+0.596
 front+0.587
 hugs+0.582
Neg logit contributions
ust-1.019
vel-1.015
nes-0.991
aces-0.919
onel-0.909
TokenYes
Token ID5297
Feature activation+7.20e-01

Pos logit contributions
 last+1.078
 Ms+0.873
 front+0.790
 hugs+0.785
 praise+0.764
Neg logit contributions
ust-2.436
nes-2.381
vel-2.259
ode-2.126
onel-2.093
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 last+1.338
 Ms+1.109
 praise+1.047
 teach+1.042
 front+1.028
Neg logit contributions
vel-2.413
ust-2.402
nes-2.358
oll-2.232
aund-2.226
Token Fl
Token ID1610
Feature activation-4.25e-01

Pos logit contributions
 last+1.833
 praise+1.572
 hugs+1.508
 teach+1.505
 front+1.477
Neg logit contributions
ust-2.424
nes-2.380
vel-2.306
oll-2.167
aund-2.109
Tokenuffy
Token ID15352
Feature activation+3.38e-01

Pos logit contributions
ust+0.714
nes+0.688
vel+0.683
onel+0.621
aces+0.611
Neg logit contributions
 last-0.671
 Ms-0.598
 front-0.549
 teach-0.549
 hugs-0.542
Token asked
Token ID1965
Feature activation-3.69e-01

Pos logit contributions
 last+0.471
 Ms+0.410
 front+0.375
 teach+0.373
 praise+0.368
Neg logit contributions
ust-0.618
nes-0.604
vel-0.593
onel-0.544
ode-0.538
Token S
Token ID311
Feature activation-2.20e-01

Pos logit contributions
ust+0.758
nes+0.736
vel+0.733
onel+0.678
aces+0.671
Neg logit contributions
 last-0.441
 Ms-0.375
 front-0.336
 teach-0.335
 hugs-0.328
Tokenque
Token ID4188
Feature activation-3.08e+00

Pos logit contributions
ust+0.350
nes+0.334
vel+0.332
onel+0.300
aces+0.296
Neg logit contributions
 last-0.364
 Ms-0.328
 front-0.306
 teach-0.302
 hugs-0.300
Tokenaky
Token ID15492
Feature activation+7.26e-01

Pos logit contributions
nes+2.914
ust+2.710
vel+2.277
aund+2.260
onel+2.195
Neg logit contributions
 last-7.370
 first-6.569
 Ms-6.505
 front-6.432
't-6.358
Token,
Token ID11
Feature activation+1.96e-01

Pos logit contributions
 last+0.820
 Ms+0.705
+0.679
 front+0.650
 praise+0.610
Neg logit contributions
ust-1.490
vel-1.437
nes-1.408
onel-1.329
aces-1.321
Token "
Token ID366
Feature activation+7.32e-01

Pos logit contributions
 last+0.246
 Ms+0.207
 front+0.190
 hugs+0.189
 praise+0.188
Neg logit contributions
ust-0.380
vel-0.374
nes-0.364
onel-0.341
aces-0.336
TokenDo
Token ID5211
Feature activation+7.45e-02

Pos logit contributions
 last+0.929
 Ms+0.790
 hugs+0.732
 praise+0.731
 teach+0.724
Neg logit contributions
ust-1.438
vel-1.402
nes-1.390
ode-1.317
onel-1.265
Token you
Token ID345
Feature activation+6.84e-01

Pos logit contributions
 last+0.096
 Ms+0.084
 front+0.076
 teach+0.074
 hugs+0.073
Neg logit contributions
ust-0.146
nes-0.141
vel-0.139
aces-0.127
onel-0.127
Tokeny
Token ID88
Feature activation-3.20e-01

Pos logit contributions
nes+2.398
ust+2.390
vel+2.366
onel+2.144
ode+2.112
Neg logit contributions
 last-2.146
 Ms-1.883
 front-1.697
 hugs-1.686
 teach-1.684
Token puppet
Token ID30095
Feature activation+3.94e-01

Pos logit contributions
ust+0.628
nes+0.610
vel+0.606
onel+0.558
aces+0.550
Neg logit contributions
 last-0.404
 Ms-0.349
 front-0.313
 teach-0.312
 hugs-0.309
Token called
Token ID1444
Feature activation+1.78e-01

Pos logit contributions
 last+0.562
 Ms+0.496
 front+0.452
 hugs+0.446
 praise+0.441
Neg logit contributions
ust-0.693
nes-0.686
vel-0.676
onel-0.616
oll-0.604
Token P
Token ID350
Feature activation-2.91e-01

Pos logit contributions
 last+0.213
 Ms+0.175
 front+0.160
 teach+0.160
 hugs+0.159
Neg logit contributions
ust-0.361
nes-0.357
vel-0.352
aces-0.323
onel-0.321
Tokenips
Token ID2419
Feature activation-1.36e-01

Pos logit contributions
ust+0.511
nes+0.491
vel+0.489
onel+0.444
aces+0.438
Neg logit contributions
 last-0.437
 Ms-0.389
 front-0.355
 teach-0.355
 hugs-0.350
Tokenque
Token ID4188
Feature activation-3.06e+00

Pos logit contributions
ust+0.270
nes+0.263
vel+0.259
onel+0.240
aces+0.239
Neg logit contributions
 last-0.170
 Ms-0.149
 teach-0.132
 front-0.132
 hugs-0.130
Tokenak
Token ID461
Feature activation+7.89e-01

Pos logit contributions
ust+2.714
nes+2.586
vel+2.265
aund+2.148
onel+2.019
Neg logit contributions
 last-7.424
 first-6.687
 Ms-6.599
 front-6.470
't-6.355
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
 last+0.953
 Ms+0.854
 praise+0.748
 teach+0.734
 hugs+0.720
Neg logit contributions
ust-1.562
vel-1.513
nes-1.507
aces-1.424
onel-1.387
Token One
Token ID1881
Feature activation-1.48e-01

Pos logit contributions
 last+0.511
 Ms+0.439
 hugs+0.398
 front+0.398
 teach+0.396
Neg logit contributions
ust-0.735
vel-0.705
nes-0.703
aces-0.657
onel-0.652
Token day
Token ID1110
Feature activation+1.13e+00

Pos logit contributions
ust+0.286
nes+0.276
vel+0.273
onel+0.252
oll+0.249
Neg logit contributions
 last-0.193
 Ms-0.172
 front-0.154
 teach-0.154
 hugs-0.152
Token P
Token ID350
Feature activation-4.43e-01

Pos logit contributions
 last+1.281
 Ms+1.098
 praise+0.965
 front+0.962
 teach+0.933
Neg logit contributions
vel-2.331
ust-2.290
nes-2.214
aund-2.135
aces-2.119
Token not
Token ID407
Feature activation-9.37e-01

Pos logit contributions
ust+0.971
nes+0.943
vel+0.936
onel+0.864
aces+0.854
Neg logit contributions
 last-0.619
 Ms-0.535
 front-0.479
 teach-0.478
 hugs-0.470
Token touch
Token ID3638
Feature activation-1.19e+00

Pos logit contributions
ust+2.087
nes+2.059
vel+2.035
onel+1.914
aces+1.881
Neg logit contributions
 last-0.980
 Ms-0.770
 teach-0.708
 front-0.699
't-0.674
Token things
Token ID1243
Feature activation-1.17e+00

Pos logit contributions
ust+2.486
nes+2.450
vel+2.382
onel+2.223
oll+2.182
Neg logit contributions
 last-1.473
 Ms-1.197
 front-1.151
 hugs-1.094
 teach-1.083
Token that
Token ID326
Feature activation-1.12e+00

Pos logit contributions
ust+2.628
nes+2.558
vel+2.541
onel+2.383
ode+2.367
Neg logit contributions
 last-1.251
 Ms-0.979
 teach-0.877
 front-0.865
 first-0.863
Token were
Token ID547
Feature activation-1.50e+00

Pos logit contributions
ust+2.210
vel+2.174
nes+2.148
onel+1.961
ode+1.939
Neg logit contributions
 last-1.489
 Ms-1.257
 teach-1.151
 hugs-1.119
 front-1.113
Token not
Token ID407
Feature activation-3.05e+00

Pos logit contributions
nes+3.087
ust+3.076
vel+3.064
onel+2.829
ode+2.768
Neg logit contributions
 last-1.913
 Ms-1.500
 first-1.416
 front-1.399
 teach-1.358
Token his
Token ID465
Feature activation-1.26e+00

Pos logit contributions
nes+7.057
ust+6.863
vel+6.831
onel+6.568
ode+6.447
Neg logit contributions
 last-3.836
 first-2.679
 while-2.623
 teach-2.577
 front-2.498
Token.
Token ID13
Feature activation+2.21e-02

Pos logit contributions
ust+2.580
nes+2.551
vel+2.526
onel+2.316
oll+2.278
Neg logit contributions
 last-1.620
 Ms-1.267
 front-1.239
 teach-1.214
 first-1.189
Token Tim
Token ID5045
Feature activation-8.23e-01

Pos logit contributions
 last+0.035
 Ms+0.030
 front+0.028
 teach+0.028
 hugs+0.028
Neg logit contributions
ust-0.037
nes-0.036
vel-0.035
aces-0.032
onel-0.032
Token decided
Token ID3066
Feature activation-2.57e-01

Pos logit contributions
ust+1.639
nes+1.596
vel+1.591
onel+1.468
aces+1.441
Neg logit contributions
 last-1.055
 Ms-0.895
 teach-0.810
 front-0.808
 hugs-0.805
Token to
Token ID284
Feature activation+2.44e-01

Pos logit contributions
ust+0.569
nes+0.556
vel+0.549
onel+0.510
aces+0.506
Neg logit contributions
 last-0.264
 Ms-0.222
 front-0.191
 teach-0.190
 hugs-0.185
Token.
Token ID13
Feature activation-6.90e-02

Pos logit contributions
ust+0.427
nes+0.412
vel+0.408
onel+0.374
aces+0.369
Neg logit contributions
 last-0.357
 Ms-0.323
 teach-0.293
 front-0.291
 hugs-0.288
Token
Token ID198
Feature activation+9.78e-02

Pos logit contributions
ust+0.147
nes+0.143
vel+0.142
onel+0.132
aces+0.131
Neg logit contributions
 last-0.076
 Ms-0.063
 front-0.056
 teach-0.056
 hugs-0.054
Token
Token ID198
Feature activation+4.62e-01

Pos logit contributions
 last+0.104
 Ms+0.086
 teach+0.077
 front+0.076
 hugs+0.075
Neg logit contributions
ust-0.212
nes-0.208
vel-0.205
onel-0.189
oll-0.186
TokenP
Token ID47
Feature activation-8.03e-01

Pos logit contributions
 last+0.712
 Ms+0.620
 hugs+0.579
 praise+0.562
 teach+0.560
Neg logit contributions
ust-0.818
vel-0.788
nes-0.782
onel-0.695
ode-0.684
Tokenips
Token ID2419
Feature activation-1.41e+00

Pos logit contributions
ust+1.185
nes+1.153
vel+1.132
onel+1.014
aces+0.999
Neg logit contributions
 last-1.414
 Ms-1.265
 teach-1.178
 front-1.173
 hugs-1.171
Tokenque
Token ID4188
Feature activation-3.05e+00

Pos logit contributions
ust+2.841
vel+2.754
nes+2.717
ode+2.536
onel+2.487
Neg logit contributions
 last-1.822
 Ms-1.506
 hugs-1.370
 front-1.351
 first-1.321
Tokenak
Token ID461
Feature activation+5.85e-01

Pos logit contributions
ust+2.659
nes+2.565
aund+2.129
vel+2.100
oll+2.052
Neg logit contributions
 last-7.323
 first-6.580
 Ms-6.497
't-6.493
 front-6.307
Token started
Token ID2067
Feature activation+4.32e-01

Pos logit contributions
 last+0.751
 Ms+0.678
 teach+0.597
 front+0.593
+0.591
Neg logit contributions
ust-1.110
nes-1.095
vel-1.082
onel-1.007
aces-1.003
Token to
Token ID284
Feature activation+3.40e-02

Pos logit contributions
 last+0.490
 Ms+0.418
 teach+0.362
 front+0.359
+0.351
Neg logit contributions
ust-0.879
vel-0.864
nes-0.855
onel-0.793
aces-0.781
Token regret
Token ID13721
Feature activation-7.95e-01

Pos logit contributions
 last+0.045
 Ms+0.040
 front+0.036
 hugs+0.035
 teach+0.035
Neg logit contributions
ust-0.064
nes-0.062
vel-0.062
onel-0.057
ode-0.056
Token it
Token ID340
Feature activation+3.19e-01

Pos logit contributions
ust+1.620
nes+1.577
vel+1.550
onel+1.440
ode+1.428
Neg logit contributions
 last-0.971
 Ms-0.832
 front-0.741
 teach-0.740
 hugs-0.724
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
 last+0.953
 Ms+0.854
 praise+0.748
 teach+0.734
 hugs+0.720
Neg logit contributions
ust-1.562
vel-1.513
nes-1.507
aces-1.424
onel-1.387
Token One
Token ID1881
Feature activation-1.48e-01

Pos logit contributions
 last+0.511
 Ms+0.439
 hugs+0.398
 front+0.398
 teach+0.396
Neg logit contributions
ust-0.735
vel-0.705
nes-0.703
aces-0.657
onel-0.652
Token day
Token ID1110
Feature activation+1.13e+00

Pos logit contributions
ust+0.286
nes+0.276
vel+0.273
onel+0.252
oll+0.249
Neg logit contributions
 last-0.193
 Ms-0.172
 front-0.154
 teach-0.154
 hugs-0.152
Token P
Token ID350
Feature activation-4.43e-01

Pos logit contributions
 last+1.281
 Ms+1.098
 praise+0.965
 front+0.962
 teach+0.933
Neg logit contributions
vel-2.331
ust-2.290
nes-2.214
aund-2.135
aces-2.119
Tokenips
Token ID2419
Feature activation-8.44e-01

Pos logit contributions
ust+0.742
nes+0.715
vel+0.709
onel+0.644
aces+0.635
Neg logit contributions
 last-0.701
 Ms-0.625
 front-0.574
 teach-0.573
 hugs-0.566
Tokenque
Token ID4188
Feature activation-3.04e+00

Pos logit contributions
ust+1.717
vel+1.662
nes+1.662
onel+1.527
ode+1.516
Neg logit contributions
 last-1.035
 Ms-0.872
 front-0.784
 teach-0.776
 hugs-0.773
Tokenak
Token ID461
Feature activation+5.82e-01

Pos logit contributions
ust+2.996
nes+2.777
vel+2.501
aund+2.402
oll+2.281
Neg logit contributions
 last-7.053
 first-6.300
 Ms-6.268
't-6.190
 hugs-6.058
Token was
Token ID373
Feature activation+1.50e-01

Pos logit contributions
 last+0.790
 Ms+0.708
 teach+0.632
 front+0.627
 praise+0.625
Neg logit contributions
ust-1.075
nes-1.048
vel-1.031
aces-0.956
onel-0.954
Token playing
Token ID2712
Feature activation-2.67e-01

Pos logit contributions
 last+0.201
 Ms+0.174
 teach+0.158
 front+0.157
 praise+0.156
Neg logit contributions
ust-0.282
vel-0.276
nes-0.275
onel-0.253
aces-0.250
Token with
Token ID351
Feature activation+1.32e-01

Pos logit contributions
ust+0.533
nes+0.520
vel+0.520
onel+0.479
aces+0.474
Neg logit contributions
 last-0.331
 Ms-0.285
 front-0.253
 teach-0.251
 hugs-0.251
Token its
Token ID663
Feature activation+9.28e-01

Pos logit contributions
 last+0.189
 Ms+0.167
 teach+0.151
 front+0.148
+0.147
Neg logit contributions
ust-0.233
nes-0.227
vel-0.227
aces-0.208
onel-0.208
Token Lily
Token ID20037
Feature activation-2.47e-01

Pos logit contributions
 last+0.499
 Ms+0.420
 front+0.393
 teach+0.386
+0.385
Neg logit contributions
ust-0.713
nes-0.708
vel-0.690
aces-0.643
onel-0.633
Token.
Token ID13
Feature activation+6.41e-02

Pos logit contributions
ust+0.534
vel+0.518
nes+0.518
onel+0.479
aces+0.474
Neg logit contributions
 last-0.267
 Ms-0.224
 front-0.195
 teach-0.193
 hugs-0.191
Token "
Token ID366
Feature activation-1.46e+00

Pos logit contributions
 last+0.077
 Ms+0.063
 teach+0.057
 hugs+0.057
 front+0.056
Neg logit contributions
ust-0.130
nes-0.128
vel-0.125
aces-0.116
onel-0.115
TokenIt
Token ID1026
Feature activation-3.09e-01

Pos logit contributions
ust+2.906
nes+2.823
vel+2.820
onel+2.651
aces+2.643
Neg logit contributions
 last-1.811
 Ms-1.579
't-1.478
 front-1.466
 teach-1.399
Token's
Token ID338
Feature activation-1.19e+00

Pos logit contributions
ust+0.527
nes+0.508
vel+0.506
onel+0.460
aces+0.453
Neg logit contributions
 last-0.472
 Ms-0.421
 front-0.385
 teach-0.384
 hugs-0.378
Token an
Token ID281
Feature activation-3.00e+00

Pos logit contributions
ust+2.753
nes+2.724
vel+2.677
onel+2.512
aces+2.472
Neg logit contributions
 last-1.227
 Ms-0.965
 front-0.886
 teach-0.814
 first-0.809
Token underground
Token ID11447
Feature activation-1.11e+00

Pos logit contributions
nes+6.496
ust+6.291
vel+6.174
ode+5.615
onel+5.538
Neg logit contributions
 last-4.196
 front-3.192
 first-3.147
 teach-2.893
 Ms-2.847
Token tunnel
Token ID13275
Feature activation+3.34e-01

Pos logit contributions
ust+1.781
nes+1.725
vel+1.716
onel+1.541
oll+1.525
Neg logit contributions
 last-1.900
 Ms-1.635
 front-1.563
 teach-1.562
 hugs-1.526
Token,"
Token ID553
Feature activation-5.00e-01

Pos logit contributions
 last+0.400
 Ms+0.368
 front+0.317
 teach+0.316
 hugs+0.311
Neg logit contributions
ust-0.664
nes-0.654
vel-0.642
aces-0.595
onel-0.592
Token said
Token ID531
Feature activation-3.22e-01

Pos logit contributions
ust+1.134
nes+1.106
vel+1.098
onel+1.025
aces+1.014
Neg logit contributions
 last-0.489
 Ms-0.402
 front-0.345
 teach-0.345
 hugs-0.337
Token her
Token ID607
Feature activation-9.34e-01

Pos logit contributions
ust+0.658
nes+0.644
vel+0.637
onel+0.591
aces+0.584
Neg logit contributions
 last-0.381
 Ms-0.325
 front-0.289
 teach-0.287
 hugs-0.283
TokenLook
Token ID8567
Feature activation-1.38e+00

Pos logit contributions
ust+0.046
nes+0.045
vel+0.045
onel+0.041
ode+0.040
Neg logit contributions
 last-0.032
 Ms-0.028
 teach-0.025
 front-0.025
 hugs-0.025
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
ust+3.408
nes+3.350
vel+3.252
onel+3.120
aces+3.115
Neg logit contributions
 last-1.077
 Ms-0.862
 hugs-0.704
 teach-0.683
 front-0.670
Token mom
Token ID1995
Feature activation+7.46e-01

Pos logit contributions
ust+2.967
nes+2.888
vel+2.872
onel+2.686
aces+2.670
Neg logit contributions
 last-1.396
 Ms-1.184
 teach-1.034
 front-1.001
 hugs-1.000
Token,
Token ID11
Feature activation-1.76e+00

Pos logit contributions
 last+0.730
 Ms+0.622
+0.549
 teach+0.540
 front+0.538
Neg logit contributions
ust-1.669
vel-1.625
nes-1.593
ode-1.502
adow-1.485
Token isn
Token ID2125
Feature activation+2.48e-01

Pos logit contributions
ust+4.248
nes+4.151
vel+4.117
onel+3.884
aces+3.870
Neg logit contributions
 last-1.698
 Ms-1.350
 teach-1.231
 hugs-1.154
 front-1.135
Token't
Token ID470
Feature activation-2.99e+00

Pos logit contributions
 last+0.113
 Ms+0.067
 hugs+0.043
 teach+0.042
 praise+0.039
Neg logit contributions
ust-0.690
nes-0.679
vel-0.673
onel-0.633
oll-0.624
Token it
Token ID340
Feature activation+2.17e-01

Pos logit contributions
ust+6.855
nes+6.787
aces+6.619
ode+6.553
vel+6.462
Neg logit contributions
 last-3.339
 Ms-2.701
 Mrs-2.562
 first-2.546
't-2.473
Token incredible
Token ID8082
Feature activation-1.36e-01

Pos logit contributions
 last+0.265
 Ms+0.234
 hugs+0.210
 front+0.210
 praise+0.208
Neg logit contributions
ust-0.438
nes-0.423
vel-0.421
onel-0.385
oll-0.379
Token?"
Token ID1701
Feature activation-7.99e-01

Pos logit contributions
ust+0.286
nes+0.280
vel+0.278
onel+0.257
aces+0.256
Neg logit contributions
 last-0.154
 Ms-0.132
 front-0.116
 teach-0.114
 hugs-0.112
Token Mom
Token ID11254
Feature activation+7.03e-01

Pos logit contributions
ust+1.778
nes+1.720
vel+1.719
onel+1.601
ode+1.580
Neg logit contributions
 last-0.835
 Ms-0.719
 teach-0.613
 front-0.611
 hugs-0.605
Token agreed
Token ID4987
Feature activation+3.40e-01

Pos logit contributions
 last+0.734
 Ms+0.597
 front+0.560
+0.549
 teach+0.526
Neg logit contributions
ust-1.523
vel-1.474
nes-1.467
ode-1.356
oll-1.351
Token
Token ID198
Feature activation-2.09e-01

Pos logit contributions
 last+0.055
 Ms+0.045
 teach+0.040
 front+0.040
 hugs+0.039
Neg logit contributions
ust-0.113
nes-0.111
vel-0.109
onel-0.101
oll-0.100
TokenFl
Token ID7414
Feature activation-3.10e-01

Pos logit contributions
ust+0.388
nes+0.377
vel+0.372
onel+0.338
ode+0.335
Neg logit contributions
 last-0.294
 Ms-0.255
 teach-0.232
 front-0.232
 hugs-0.232
Tokenuffy
Token ID15352
Feature activation-1.31e-01

Pos logit contributions
ust+0.514
nes+0.495
vel+0.492
onel+0.446
aces+0.440
Neg logit contributions
 last-0.496
 Ms-0.443
 front-0.408
 teach-0.407
 hugs-0.402
Token and
Token ID290
Feature activation-3.93e-01

Pos logit contributions
ust+0.262
nes+0.257
vel+0.252
onel+0.232
aces+0.231
Neg logit contributions
 last-0.159
 Ms-0.136
 front-0.122
 teach-0.121
't-0.117
Token S
Token ID311
Feature activation-3.85e-01

Pos logit contributions
ust+0.881
nes+0.859
vel+0.854
onel+0.796
aces+0.789
Neg logit contributions
 last-0.395
 Ms-0.325
 front-0.281
 teach-0.280
 hugs-0.274
Tokenque
Token ID4188
Feature activation-2.96e+00

Pos logit contributions
ust+0.622
nes+0.597
vel+0.592
onel+0.537
aces+0.530
Neg logit contributions
 last-0.630
 Ms-0.565
 front-0.522
 teach-0.520
 hugs-0.514
Tokenaky
Token ID15492
Feature activation-3.77e-01

Pos logit contributions
nes+2.551
ust+2.434
vel+2.128
aund+1.944
onel+1.891
Neg logit contributions
 last-7.316
 Ms-6.624
 first-6.476
't-6.320
 while-6.282
Token played
Token ID2826
Feature activation-7.47e-01

Pos logit contributions
ust+0.733
nes+0.713
vel+0.706
onel+0.651
aces+0.644
Neg logit contributions
 last-0.488
 Ms-0.425
 front-0.380
 teach-0.380
 hugs-0.373
Token for
Token ID329
Feature activation+8.70e-02

Pos logit contributions
ust+1.466
nes+1.412
vel+1.398
onel+1.295
ode+1.281
Neg logit contributions
 last-0.984
 Ms-0.851
 teach-0.772
 front-0.766
 hugs-0.756
Token hours
Token ID2250
Feature activation-1.34e-01

Pos logit contributions
 last+0.084
 Ms+0.071
 teach+0.060
 front+0.059
 hugs+0.058
Neg logit contributions
ust-0.191
vel-0.189
nes-0.189
aces-0.174
onel-0.174
Token until
Token ID1566
Feature activation-6.62e-02

Pos logit contributions
ust+0.290
nes+0.285
vel+0.283
aces+0.262
onel+0.262
Neg logit contributions
 last-0.141
 Ms-0.121
 teach-0.104
 hugs-0.103
 front-0.103
Token wins
Token ID7864
Feature activation+4.29e-01

Pos logit contributions
ust+0.121
nes+0.118
vel+0.117
aces+0.108
onel+0.107
Neg logit contributions
 last-0.062
 Ms-0.053
 front-0.047
 teach-0.046
 hugs-0.045
Token!"
Token ID2474
Feature activation-9.45e-01

Pos logit contributions
 last+0.482
 Ms+0.426
 teach+0.373
 hugs+0.372
 praise+0.365
Neg logit contributions
ust-0.886
nes-0.876
vel-0.859
aces-0.801
onel-0.788
Token
Token ID198
Feature activation-4.17e-01

Pos logit contributions
ust+2.182
nes+2.115
vel+2.105
onel+1.967
aces+1.927
Neg logit contributions
 last-0.923
 Ms-0.772
 teach-0.667
 front-0.658
't-0.642
Token
Token ID198
Feature activation-5.23e-01

Pos logit contributions
ust+0.953
nes+0.930
vel+0.924
onel+0.862
aces+0.853
Neg logit contributions
 last-0.402
 Ms-0.330
 front-0.282
 teach-0.281
 hugs-0.275
TokenS
Token ID50
Feature activation-9.53e-01

Pos logit contributions
ust+0.902
nes+0.872
vel+0.865
onel+0.788
aces+0.777
Neg logit contributions
 last-0.802
 Ms-0.711
 front-0.651
 teach-0.651
 hugs-0.641
Tokenque
Token ID4188
Feature activation-2.96e+00

Pos logit contributions
ust+1.321
nes+1.299
vel+1.267
onel+1.127
aces+1.115
Neg logit contributions
 last-1.779
 Ms-1.580
 front-1.489
 teach-1.486
 hugs-1.468
Tokeneker
Token ID28233
Feature activation+2.59e-01

Pos logit contributions
nes+2.559
ust+2.540
vel+2.146
oll+1.914
aund+1.896
Neg logit contributions
 last-7.095
 first-6.451
 Ms-6.365
't-6.187
 front-6.110
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 last+0.318
 Ms+0.281
 front+0.249
 teach+0.242
 hugs+0.241
Neg logit contributions
ust-0.512
nes-0.503
vel-0.495
aces-0.456
onel-0.455
Token up
Token ID510
Feature activation+9.79e-01

Pos logit contributions
 last+0.072
 Ms+0.063
 teach+0.058
 hugs+0.058
 front+0.058
Neg logit contributions
ust-0.080
nes-0.077
vel-0.077
onel-0.069
aces-0.069
Token for
Token ID329
Feature activation+5.65e-01

Pos logit contributions
 last+1.389
 Ms+1.272
 hugs+1.212
 teach+1.159
 praise+1.136
Neg logit contributions
ust-1.728
vel-1.643
nes-1.621
onel-1.459
aces-1.431
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
 last+0.598
 Ms+0.512
 hugs+0.435
 teach+0.431
 front+0.429
Neg logit contributions
ust-1.179
vel-1.171
nes-1.154
aces-1.072
onel-1.067
Token in
Token ID287
Feature activation+6.86e-02

Pos logit contributions
ust+1.267
nes+1.226
vel+1.201
onel+1.096
aces+1.074
Neg logit contributions
 last-1.205
 Ms-1.056
 front-0.987
 teach-0.981
 hugs-0.962
Token the
Token ID262
Feature activation-1.80e+00

Pos logit contributions
 last+0.073
 Ms+0.061
 teach+0.055
 hugs+0.053
+0.053
Neg logit contributions
ust-0.145
nes-0.142
vel-0.141
aces-0.129
onel-0.129
Token bathroom
Token ID12436
Feature activation+5.61e-02

Pos logit contributions
ust+3.672
nes+3.525
vel+3.516
onel+3.250
aces+3.194
Neg logit contributions
 last-2.560
 front-2.161
 Ms-2.152
 teach-1.929
 hugs-1.929
Token and
Token ID290
Feature activation-3.80e-01

Pos logit contributions
 last+0.062
 Ms+0.054
 teach+0.047
 hugs+0.047
 praise+0.045
Neg logit contributions
ust-0.117
nes-0.114
vel-0.114
aces-0.105
onel-0.105
Token didn
Token ID1422
Feature activation+4.94e-01

Pos logit contributions
ust+0.743
nes+0.722
vel+0.716
onel+0.660
aces+0.652
Neg logit contributions
 last-0.492
 Ms-0.425
 front-0.382
 teach-0.381
 hugs-0.373
Token't
Token ID470
Feature activation-2.93e+00

Pos logit contributions
 last+0.230
 Ms+0.166
 hugs+0.116
 praise+0.102
 teach+0.095
Neg logit contributions
ust-1.345
nes-1.331
vel-1.320
onel-1.237
oll-1.233
Token know
Token ID760
Feature activation-1.11e+00

Pos logit contributions
nes+6.408
ust+6.377
vel+6.171
onel+6.051
aces+5.940
Neg logit contributions
 last-3.731
 teach-2.976
 while-2.771
 front-2.726
 hug-2.697
Token what
Token ID644
Feature activation-1.50e+00

Pos logit contributions
ust+2.018
nes+2.007
vel+1.919
onel+1.785
ode+1.752
Neg logit contributions
 last-1.610
 Ms-1.408
 teach-1.282
 hugs-1.278
 front-1.278
Token it
Token ID340
Feature activation+2.67e-01

Pos logit contributions
ust+3.281
nes+3.160
vel+3.103
onel+2.967
ode+2.874
Neg logit contributions
 last-1.778
 Ms-1.513
 teach-1.352
 hugs-1.343
 front-1.326
Token was
Token ID373
Feature activation-1.10e+00

Pos logit contributions
 last+0.311
 Ms+0.268
 front+0.247
 hugs+0.238
 teach+0.238
Neg logit contributions
ust-0.547
nes-0.524
vel-0.520
onel-0.485
aces-0.483
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
ust+2.517
nes+2.466
vel+2.428
onel+2.287
ode+2.249
Neg logit contributions
 last-1.113
 Ms-0.860
 teach-0.782
 front-0.764
 first-0.747
Token one
Token ID530
Feature activation+5.94e-01

Pos logit contributions
 last+0.078
 Ms+0.068
 teach+0.062
+0.062
't+0.061
Neg logit contributions
ust-0.109
nes-0.106
vel-0.106
onel-0.098
aces-0.096
Token more
Token ID517
Feature activation-5.27e-01

Pos logit contributions
 last+0.637
 Ms+0.613
't+0.550
+0.535
 teach+0.513
Neg logit contributions
ust-1.203
vel-1.167
nes-1.150
onel-1.081
ode-1.079
Token time
Token ID640
Feature activation+1.52e-01

Pos logit contributions
ust+0.868
nes+0.838
vel+0.831
onel+0.753
aces+0.741
Neg logit contributions
 last-0.854
 Ms-0.761
 front-0.701
 teach-0.701
 hugs-0.692
Token,
Token ID11
Feature activation+6.56e-02

Pos logit contributions
 last+0.171
 Ms+0.149
 front+0.130
 teach+0.129
 hugs+0.127
Neg logit contributions
ust-0.313
vel-0.309
nes-0.306
onel-0.284
aces-0.283
Token and
Token ID290
Feature activation-5.76e-01

Pos logit contributions
 last+0.079
 Ms+0.069
 front+0.062
 teach+0.061
 hugs+0.060
Neg logit contributions
ust-0.130
vel-0.127
nes-0.126
onel-0.117
aces-0.116
Token Sil
Token ID4243
Feature activation-2.93e+00

Pos logit contributions
ust+1.323
nes+1.288
vel+1.281
onel+1.196
aces+1.184
Neg logit contributions
 last-0.553
 Ms-0.452
 teach-0.385
 front-0.385
 hugs-0.376
Tokenv
Token ID85
Feature activation-1.02e+00

Pos logit contributions
vel+3.264
ust+3.177
oll+3.173
nes+3.163
ode+3.056
Neg logit contributions
 last-5.838
 Ms-5.097
 first-4.885
't-4.839
 teach-4.821
Tokenie
Token ID494
Feature activation+9.03e-02

Pos logit contributions
ust+1.904
nes+1.847
vel+1.843
onel+1.673
ode+1.663
Neg logit contributions
 last-1.436
 Ms-1.222
 front-1.124
 hugs-1.117
 teach-1.113
Token repeated
Token ID5100
Feature activation-7.41e-01

Pos logit contributions
 last+0.114
 Ms+0.101
 front+0.089
 teach+0.088
+0.086
Neg logit contributions
ust-0.175
nes-0.173
vel-0.170
onel-0.157
aces-0.156
Token it
Token ID340
Feature activation+1.99e-01

Pos logit contributions
ust+1.629
nes+1.585
vel+1.565
onel+1.464
ode+1.446
Neg logit contributions
 last-0.788
 Ms-0.661
 teach-0.578
 front-0.576
 hugs-0.569
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
 last+0.238
 Ms+0.212
 front+0.185
+0.183
 teach+0.181
Neg logit contributions
ust-0.400
vel-0.390
nes-0.386
onel-0.357
aces-0.356
Token!"
Token ID2474
Feature activation+9.77e-02

Pos logit contributions
 last+0.464
 Ms+0.399
 front+0.357
+0.349
 teach+0.347
Neg logit contributions
ust-0.807
nes-0.795
vel-0.790
onel-0.728
aces-0.727
Token
Token ID198
Feature activation+1.42e-01

Pos logit contributions
 last+0.109
 Ms+0.087
 teach+0.080
 front+0.079
 hugs+0.076
Neg logit contributions
ust-0.205
nes-0.203
vel-0.199
aces-0.185
onel-0.184
Token
Token ID198
Feature activation-7.46e-02

Pos logit contributions
 last+0.153
 Ms+0.124
 teach+0.113
 front+0.109
 hugs+0.108
Neg logit contributions
ust-0.308
nes-0.303
vel-0.298
onel-0.274
aces-0.272
Token"
Token ID1
Feature activation+4.75e-01

Pos logit contributions
ust+0.146
nes+0.142
vel+0.140
onel+0.127
ode+0.126
Neg logit contributions
 last-0.099
 Ms-0.084
 teach-0.076
 hugs-0.076
 front-0.075
TokenS
Token ID50
Feature activation+2.87e-01

Pos logit contributions
 last+0.632
 Ms+0.536
 hugs+0.492
 teach+0.483
 front+0.482
Neg logit contributions
ust-0.912
nes-0.898
vel-0.872
onel-0.801
aces-0.796
Tokenque
Token ID4188
Feature activation-2.93e+00

Pos logit contributions
 last+0.404
 Ms+0.362
 front+0.330
 teach+0.326
't+0.322
Neg logit contributions
ust-0.527
vel-0.509
nes-0.495
ode-0.465
oll-0.457
Tokenak
Token ID461
Feature activation+8.67e-01

Pos logit contributions
ust+3.154
nes+3.010
vel+2.488
onel+2.412
aund+2.336
Neg logit contributions
 last-6.751
 Ms-5.932
 first-5.883
 front-5.806
't-5.781
Token!"
Token ID2474
Feature activation-5.42e-02

Pos logit contributions
 last+0.972
 Ms+0.890
+0.767
 front+0.751
 teach+0.743
Neg logit contributions
ust-1.761
vel-1.725
nes-1.721
aces-1.619
onel-1.608
Token Lily
Token ID20037
Feature activation+8.01e-01

Pos logit contributions
ust+0.107
nes+0.106
vel+0.104
onel+0.096
aces+0.096
Neg logit contributions
 last-0.067
 Ms-0.056
 teach-0.051
 front-0.051
 hugs-0.050
Token says
Token ID1139
Feature activation+2.93e-01

Pos logit contributions
 last+1.044
 Ms+0.868
+0.820
 front+0.805
't+0.780
Neg logit contributions
ust-1.512
nes-1.509
vel-1.482
aces-1.411
onel-1.350
Token.
Token ID13
Feature activation-6.99e-02

Pos logit contributions
 last+0.338
 Ms+0.291
 front+0.255
 teach+0.253
 praise+0.244
Neg logit contributions
ust-0.602
vel-0.595
nes-0.592
onel-0.546
aces-0.546
Token that
Token ID326
Feature activation-3.10e-01

Pos logit contributions
 last+0.860
 Ms+0.795
 teach+0.701
+0.699
 front+0.692
Neg logit contributions
ust-1.445
nes-1.399
vel-1.357
onel-1.257
aces-1.256
Token day
Token ID1110
Feature activation-8.01e-01

Pos logit contributions
ust+0.606
nes+0.587
vel+0.585
onel+0.539
aces+0.530
Neg logit contributions
 last-0.398
 Ms-0.348
 front-0.312
 teach-0.311
 hugs-0.309
Token forward
Token ID2651
Feature activation-3.20e-01

Pos logit contributions
ust+1.404
nes+1.352
vel+1.327
onel+1.214
ode+1.207
Neg logit contributions
 last-1.223
 Ms-1.075
 front-0.995
 teach-0.987
 hugs-0.976
Token,
Token ID11
Feature activation+9.42e-02

Pos logit contributions
ust+0.759
nes+0.742
vel+0.736
onel+0.691
aces+0.684
Neg logit contributions
 last-0.277
 Ms-0.224
 front-0.187
 teach-0.185
 hugs-0.178
Token Daisy
Token ID40355
Feature activation+5.26e-01

Pos logit contributions
 last+0.104
 Ms+0.086
 front+0.079
 teach+0.077
 hugs+0.076
Neg logit contributions
ust-0.199
nes-0.194
vel-0.192
onel-0.178
aces-0.177
Token stri
Token ID2338
Feature activation-2.92e+00

Pos logit contributions
 last+0.700
 Ms+0.620
 front+0.563
+0.550
 teach+0.549
Neg logit contributions
ust-0.980
nes-0.965
vel-0.948
aces-0.880
onel-0.879
Tokenved
Token ID1079
Feature activation+6.10e-01

Pos logit contributions
ust+3.719
vel+3.632
nes+3.531
oll+3.050
ode+3.037
Neg logit contributions
 last-6.034
 first-4.992
 teach-4.889
 Ms-4.872
 front-4.853
Token to
Token ID284
Feature activation-1.23e-01

Pos logit contributions
 last+0.769
 Ms+0.683
+0.641
 front+0.604
 teach+0.601
Neg logit contributions
ust-1.185
vel-1.138
nes-1.130
onel-1.047
aces-1.041
Token be
Token ID307
Feature activation+9.72e-02

Pos logit contributions
ust+0.247
nes+0.241
vel+0.239
onel+0.221
aces+0.219
Neg logit contributions
 last-0.147
 Ms-0.128
 front-0.112
 hugs-0.111
 teach-0.111
Token the
Token ID262
Feature activation+4.22e-01

Pos logit contributions
 last+0.130
 Ms+0.115
 teach+0.102
+0.102
 front+0.101
Neg logit contributions
ust-0.183
vel-0.178
nes-0.178
onel-0.163
aces-0.161
Token best
Token ID1266
Feature activation+3.83e-02

Pos logit contributions
 last+0.635
 Ms+0.593
 hugs+0.540
't+0.537
 front+0.528
Neg logit contributions
ust-0.705
vel-0.664
nes-0.663
onel-0.612
ode-0.604
Token by
Token ID416
Feature activation-6.45e-01

Pos logit contributions
 last+0.833
 Ms+0.746
 hugs+0.673
+0.660
 teach+0.660
Neg logit contributions
ust-1.395
vel-1.395
nes-1.375
onel-1.264
adow-1.245
Token himself
Token ID2241
Feature activation+4.94e-01

Pos logit contributions
ust+1.524
nes+1.488
vel+1.475
onel+1.384
aces+1.366
Neg logit contributions
 last-0.585
 Ms-0.468
 front-0.402
 teach-0.394
 hugs-0.384
Token when
Token ID618
Feature activation+7.55e-01

Pos logit contributions
 last+0.549
 Ms+0.510
 hugs+0.450
 praise+0.444
 teach+0.437
Neg logit contributions
ust-1.028
vel-0.996
nes-0.976
adow-0.909
onel-0.905
Token he
Token ID339
Feature activation+1.10e+00

Pos logit contributions
 last+0.849
 Ms+0.752
 teach+0.691
 hugs+0.674
 praise+0.674
Neg logit contributions
ust-1.500
nes-1.478
vel-1.458
onel-1.359
aces-1.334
Token heard
Token ID2982
Feature activation-5.64e-01

Pos logit contributions
 last+1.392
 Ms+1.247
 praise+1.169
 teach+1.157
 hugs+1.128
Neg logit contributions
ust-2.109
nes-2.051
vel-2.003
onel-1.880
adow-1.867
Token an
Token ID281
Feature activation-2.91e+00

Pos logit contributions
ust+1.308
nes+1.276
vel+1.267
onel+1.185
aces+1.172
Neg logit contributions
 last-0.526
 Ms-0.428
 front-0.364
 teach-0.363
 hugs-0.355
Token enormous
Token ID9812
Feature activation-9.48e-01

Pos logit contributions
nes+6.426
ust+6.325
vel+6.318
aces+5.747
ode+5.733
Neg logit contributions
 last-3.607
 Ms-2.900
 teach-2.835
 front-2.828
 first-2.689
Token noise
Token ID7838
Feature activation+2.35e-02

Pos logit contributions
ust+1.808
nes+1.760
vel+1.713
onel+1.589
aces+1.579
Neg logit contributions
 last-1.318
 Ms-1.146
 teach-1.059
 front-1.055
 hugs-1.028
Token.
Token ID13
Feature activation-7.21e-01

Pos logit contributions
 last+0.027
 Ms+0.023
 hugs+0.020
 front+0.020
 teach+0.020
Neg logit contributions
ust-0.049
vel-0.047
nes-0.047
onel-0.043
aces-0.043
Token He
Token ID679
Feature activation-1.23e+00

Pos logit contributions
ust+1.371
nes+1.329
vel+1.319
onel+1.219
ode+1.190
Neg logit contributions
 last-0.983
 Ms-0.870
 front-0.784
 teach-0.777
 hugs-0.767
Token looked
Token ID3114
Feature activation+5.94e-02

Pos logit contributions
ust+2.566
nes+2.534
vel+2.527
onel+2.352
aces+2.340
Neg logit contributions
 last-1.435
 Ms-1.207
 front-1.080
 hugs-1.073
 teach-1.066
Token,
Token ID11
Feature activation+5.90e-01

Pos logit contributions
 last+0.230
 Ms+0.193
 teach+0.168
 front+0.165
 hugs+0.162
Neg logit contributions
ust-0.549
nes-0.545
vel-0.542
onel-0.504
oll-0.502
Token Lily
Token ID20037
Feature activation+2.58e-01

Pos logit contributions
 last+0.753
 Ms+0.626
 hugs+0.599
 teach+0.597
 front+0.596
Neg logit contributions
ust-1.100
nes-1.094
vel-1.071
onel-0.979
oll-0.965
Token's
Token ID338
Feature activation+8.84e-01

Pos logit contributions
 last+0.413
 Ms+0.366
 front+0.336
 teach+0.335
 praise+0.329
Neg logit contributions
ust-0.415
nes-0.406
vel-0.404
onel-0.360
ode-0.358
Token mom
Token ID1995
Feature activation+1.05e+00

Pos logit contributions
 last+1.243
 Ms+1.184
't+1.054
 teach+1.027
 front+1.017
Neg logit contributions
ust-1.523
vel-1.457
nes-1.429
oll-1.360
onel-1.344
Token took
Token ID1718
Feature activation+2.05e-02

Pos logit contributions
 last+1.367
 Ms+1.132
 front+1.088
+1.066
 teach+1.042
Neg logit contributions
ust-2.000
vel-1.961
nes-1.948
oll-1.767
onel-1.760
Token the
Token ID262
Feature activation+1.83e+00

Pos logit contributions
 last+0.026
 Ms+0.023
 teach+0.021
 front+0.020
+0.020
Neg logit contributions
ust-0.039
nes-0.038
vel-0.038
onel-0.035
oll-0.035
Token cookies
Token ID14746
Feature activation-6.53e-01

Pos logit contributions
 last+2.838
 Ms+2.788
 praise+2.525
 hugs+2.509
 teach+2.469
Neg logit contributions
vel-2.684
ust-2.487
nes-2.474
aund-2.320
oll-2.289
Token out
Token ID503
Feature activation+9.19e-01

Pos logit contributions
ust+1.305
nes+1.263
vel+1.256
onel+1.163
aces+1.148
Neg logit contributions
 last-0.828
 Ms-0.707
 front-0.638
 teach-0.633
 hugs-0.620
Token of
Token ID286
Feature activation+1.20e+00

Pos logit contributions
 last+1.043
 Ms+0.970
+0.865
 teach+0.860
't+0.829
Neg logit contributions
ust-1.823
vel-1.811
nes-1.741
onel-1.649
oll-1.638
Token the
Token ID262
Feature activation+2.93e-01

Pos logit contributions
 last+1.107
 teach+0.884
 Ms+0.882
 hugs+0.834
't+0.831
Neg logit contributions
vel-2.471
nes-2.467
ust-2.454
 penn-2.255
aund-2.253
Token oven
Token ID14361
Feature activation+2.33e-02

Pos logit contributions
 last+0.486
 Ms+0.442
 teach+0.407
 hugs+0.399
't+0.397
Neg logit contributions
ust-0.438
vel-0.434
nes-0.426
onel-0.383
ode-0.379
Token.
Token ID13
Feature activation+5.33e-01

Pos logit contributions
 last+0.168
 Ms+0.143
 teach+0.128
 front+0.126
't+0.126
Neg logit contributions
ust-0.296
vel-0.287
nes-0.286
ode-0.264
aces-0.264
Token
Token ID220
Feature activation-3.83e-01

Pos logit contributions
 last+0.577
 Ms+0.462
 teach+0.426
 hugs+0.421
 front+0.412
Neg logit contributions
ust-1.134
nes-1.087
vel-1.071
aces-0.998
ode-0.979
Token
Token ID198
Feature activation+4.55e-01

Pos logit contributions
ust+0.871
nes+0.849
vel+0.842
onel+0.786
aces+0.779
Neg logit contributions
 last-0.371
 Ms-0.303
 teach-0.261
 front-0.260
 hugs-0.255
Token
Token ID198
Feature activation+8.40e-01

Pos logit contributions
 last+0.509
 Ms+0.418
 teach+0.391
 hugs+0.389
 praise+0.372
Neg logit contributions
ust-0.957
nes-0.945
vel-0.918
onel-0.821
aces-0.821
TokenOne
Token ID3198
Feature activation+6.31e-01

Pos logit contributions
 last+0.993
 Ms+0.820
 hugs+0.776
 teach+0.765
 praise+0.747
Neg logit contributions
ust-1.785
nes-1.691
vel-1.689
aund-1.491
ode-1.482
Token day
Token ID1110
Feature activation+1.70e+00

Pos logit contributions
 last+0.874
 Ms+0.812
+0.748
 hugs+0.746
 teach+0.744
Neg logit contributions
ust-1.129
nes-1.055
vel-1.038
adow-0.971
oll-0.971
Token,
Token ID11
Feature activation+2.22e+00

Pos logit contributions
 last+2.072
 Ms+1.805
 front+1.629
 praise+1.626
 teach+1.607
Neg logit contributions
vel-3.275
ust-3.203
nes-3.050
aund-3.011
oll-2.877
Token Jack
Token ID3619
Feature activation+4.54e-02

Pos logit contributions
 last+2.994
 Ms+2.614
 hugs+2.579
 praise+2.488
 front+2.437
Neg logit contributions
ust-3.892
vel-3.717
nes-3.555
aund-3.404
ying-3.240
Token wanted
Token ID2227
Feature activation-3.00e-01

Pos logit contributions
 last+0.065
 Ms+0.057
 teach+0.052
 front+0.052
 praise+0.051
Neg logit contributions
ust-0.081
nes-0.079
vel-0.078
onel-0.071
ode-0.071
Token to
Token ID284
Feature activation-1.91e-01

Pos logit contributions
ust+0.667
nes+0.651
vel+0.645
onel+0.600
aces+0.595
Neg logit contributions
 last-0.304
 Ms-0.254
 teach-0.218
 front-0.218
 hugs-0.213
Token go
Token ID467
Feature activation+4.65e-01

Pos logit contributions
ust+0.384
nes+0.377
vel+0.370
onel+0.342
ode+0.339
Neg logit contributions
 last-0.229
 Ms-0.199
 hugs-0.173
 front-0.173
 teach-0.170
Token help
Token ID1037
Feature activation-1.11e+00

Pos logit contributions
ust+0.234
nes+0.231
vel+0.228
onel+0.212
ode+0.211
Neg logit contributions
 last-0.108
 Ms-0.094
 front-0.078
 hugs-0.077
 teach-0.075
Token people
Token ID661
Feature activation-8.73e-01

Pos logit contributions
ust+2.134
nes+2.077
vel+2.048
onel+1.905
aces+1.869
Neg logit contributions
 last-1.556
 Ms-1.364
 teach-1.252
 front-1.231
 hugs-1.192
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
ust+1.646
nes+1.590
vel+1.574
onel+1.458
aces+1.437
Neg logit contributions
 last-1.215
 Ms-1.047
 teach-0.976
 front-0.953
 hugs-0.939
Token One
Token ID1881
Feature activation+2.14e-01

Pos logit contributions
ust+0.031
nes+0.031
vel+0.030
aces+0.028
onel+0.028
Neg logit contributions
 last-0.015
 Ms-0.012
 teach-0.011
 front-0.011
 hugs-0.011
Token day
Token ID1110
Feature activation+1.36e+00

Pos logit contributions
 last+0.286
 Ms+0.265
 hugs+0.239
 teach+0.236
+0.236
Neg logit contributions
ust-0.393
nes-0.375
vel-0.370
onel-0.339
oll-0.338
Token,
Token ID11
Feature activation+2.09e+00

Pos logit contributions
 last+1.610
 Ms+1.425
 front+1.284
 praise+1.249
 teach+1.245
Neg logit contributions
vel-2.697
ust-2.648
nes-2.535
aund-2.472
oll-2.415
Token Jack
Token ID3619
Feature activation+2.52e-01

Pos logit contributions
 last+2.791
 hugs+2.466
 Ms+2.456
 front+2.335
 praise+2.335
Neg logit contributions
ust-3.613
vel-3.505
nes-3.382
aund-3.208
aces-3.061
Token got
Token ID1392
Feature activation+6.61e-01

Pos logit contributions
 last+0.367
 Ms+0.325
 teach+0.296
 front+0.295
+0.292
Neg logit contributions
ust-0.443
nes-0.434
vel-0.426
onel-0.388
ode-0.387
Token a
Token ID257
Feature activation-3.10e-01

Pos logit contributions
 last+0.814
 Ms+0.738
+0.668
 teach+0.645
 front+0.639
Neg logit contributions
ust-1.270
nes-1.232
vel-1.227
onel-1.147
oll-1.127
Token call
Token ID869
Feature activation+7.28e-02

Pos logit contributions
ust+0.568
nes+0.549
vel+0.546
onel+0.500
ode+0.492
Neg logit contributions
 last-0.432
 Ms-0.382
 front-0.345
 teach-0.342
 hugs-0.340
Token that
Token ID326
Feature activation+3.63e-01

Pos logit contributions
 last+0.103
 Ms+0.091
 hugs+0.082
 front+0.082
 teach+0.082
Neg logit contributions
ust-0.129
vel-0.126
nes-0.126
onel-0.115
aces-0.113
Tokenmy
Token ID1820
Feature activation+1.73e-01

Pos logit contributions
 last+0.399
 Ms+0.337
 front+0.295
 teach+0.292
+0.288
Neg logit contributions
ust-0.926
vel-0.876
nes-0.876
ode-0.828
aces-0.812
Token said
Token ID531
Feature activation+1.60e-01

Pos logit contributions
 last+0.200
 Ms+0.164
 front+0.153
 teach+0.146
+0.144
Neg logit contributions
ust-0.360
nes-0.349
vel-0.343
onel-0.314
aces-0.313
Token no
Token ID645
Feature activation+2.65e-01

Pos logit contributions
 last+0.205
 Ms+0.178
 front+0.164
 teach+0.157
 praise+0.155
Neg logit contributions
ust-0.313
vel-0.304
nes-0.303
onel-0.279
aces-0.276
Token and
Token ID290
Feature activation+5.53e-02

Pos logit contributions
 last+0.278
 Ms+0.244
 front+0.215
 teach+0.207
 hugs+0.206
Neg logit contributions
ust-0.577
vel-0.548
nes-0.547
aces-0.505
onel-0.502
Token ran
Token ID4966
Feature activation+2.12e-01

Pos logit contributions
 last+0.066
 Ms+0.056
 front+0.052
 teach+0.050
 hugs+0.050
Neg logit contributions
ust-0.112
nes-0.109
vel-0.107
onel-0.099
aces-0.098
Token to
Token ID284
Feature activation+1.71e+00

Pos logit contributions
 last+0.347
 Ms+0.315
 teach+0.291
 praise+0.285
 hugs+0.284
Neg logit contributions
ust-0.332
nes-0.330
vel-0.317
aces-0.282
onel-0.282
Token buy
Token ID2822
Feature activation+1.06e+00

Pos logit contributions
 last+2.209
 Ms+2.034
+1.850
 hugs+1.812
ted+1.786
Neg logit contributions
nes-2.884
ust-2.863
vel-2.765
onel-2.455
ode-2.444
Token candy
Token ID18550
Feature activation-2.11e-01

Pos logit contributions
 last+1.417
 Ms+1.282
+1.161
't+1.137
 front+1.129
Neg logit contributions
ust-1.833
vel-1.799
nes-1.745
aces-1.663
oll-1.639
Token.
Token ID13
Feature activation+9.76e-01

Pos logit contributions
ust+0.456
nes+0.444
vel+0.441
aces+0.407
onel+0.407
Neg logit contributions
 last-0.226
 Ms-0.192
 front-0.168
 teach-0.168
 hugs-0.165
Token When
Token ID1649
Feature activation+5.59e-01

Pos logit contributions
 last+1.336
 Ms+1.096
 front+1.017
 hugs+1.013
 praise+1.005
Neg logit contributions
ust-1.799
nes-1.729
vel-1.677
aces-1.568
ode-1.542
Token Tim
Token ID5045
Feature activation+4.00e-01

Pos logit contributions
 last+0.666
 Ms+0.593
 hugs+0.535
 teach+0.533
 praise+0.523
Neg logit contributions
ust-1.083
nes-1.056
vel-1.042
onel-0.945
aund-0.942
Token to
Token ID284
Feature activation-3.45e-01

Pos logit contributions
ust+0.904
nes+0.881
vel+0.876
onel+0.816
aces+0.808
Neg logit contributions
 last-0.406
 Ms-0.335
 front-0.290
 teach-0.289
 hugs-0.282
Token explore
Token ID7301
Feature activation+1.17e-01

Pos logit contributions
ust+0.720
nes+0.702
vel+0.697
onel+0.645
aces+0.640
Neg logit contributions
 last-0.395
 Ms-0.338
 front-0.295
 teach-0.291
 hugs-0.290
Token.
Token ID13
Feature activation+3.21e-02

Pos logit contributions
 last+0.129
 Ms+0.108
't+0.096
 teach+0.096
 front+0.095
Neg logit contributions
ust-0.246
vel-0.240
nes-0.236
onel-0.219
aces-0.218
Token One
Token ID1881
Feature activation+2.44e-01

Pos logit contributions
 last+0.034
 Ms+0.028
 front+0.025
 teach+0.025
 hugs+0.024
Neg logit contributions
ust-0.070
nes-0.068
vel-0.067
aces-0.062
onel-0.062
Token day
Token ID1110
Feature activation+8.83e-01

Pos logit contributions
 last+0.329
 Ms+0.304
 teach+0.273
 hugs+0.273
+0.269
Neg logit contributions
ust-0.453
nes-0.428
vel-0.423
adow-0.393
onel-0.389
Token,
Token ID11
Feature activation+1.80e+00

Pos logit contributions
 last+0.975
 Ms+0.783
 front+0.722
 teach+0.716
 praise+0.704
Neg logit contributions
vel-1.834
ust-1.825
nes-1.810
aund-1.679
oll-1.659
Token she
Token ID673
Feature activation-4.03e-01

Pos logit contributions
 last+2.511
 praise+2.082
 teach+2.052
 hugs+2.037
 Ms+2.021
Neg logit contributions
nes-3.142
ust-3.033
vel-2.988
 penn-2.759
aund-2.740
Token saw
Token ID2497
Feature activation+1.06e-01

Pos logit contributions
ust+0.722
nes+0.700
vel+0.692
onel+0.633
aces+0.624
Neg logit contributions
 last-0.586
 Ms-0.517
 teach-0.470
 front-0.469
 hugs-0.463
Token a
Token ID257
Feature activation-1.87e-01

Pos logit contributions
 last+0.109
 Ms+0.091
 teach+0.080
 front+0.078
+0.078
Neg logit contributions
ust-0.230
vel-0.224
nes-0.224
onel-0.210
aces-0.208
Token big
Token ID1263
Feature activation-3.17e-01

Pos logit contributions
ust+0.372
vel+0.359
nes+0.359
onel+0.330
ode+0.327
Neg logit contributions
 last-0.228
 Ms-0.196
 front-0.174
 teach-0.174
 hugs-0.173
Token jar
Token ID17379
Feature activation+3.33e-01

Pos logit contributions
ust+0.578
nes+0.559
vel+0.559
onel+0.510
ode+0.504
Neg logit contributions
 last-0.440
 Ms-0.386
 teach-0.349
 front-0.348
 hugs-0.345
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 last+0.144
 Ms+0.126
 front+0.111
 teach+0.110
+0.110
Neg logit contributions
ust-0.262
vel-0.252
nes-0.252
aces-0.230
oll-0.229
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
 last+1.379
 front+1.114
 Ms+1.101
+1.071
 praise+1.030
Neg logit contributions
ust-1.858
nes-1.852
vel-1.775
onel-1.609
aces-1.595
Token,
Token ID11
Feature activation-2.39e-01

Pos logit contributions
ust+0.257
vel+0.251
nes+0.250
onel+0.231
aces+0.229
Neg logit contributions
 last-0.140
 Ms-0.118
 front-0.106
 teach-0.103
 hugs-0.101
Token "
Token ID366
Feature activation+5.18e-01

Pos logit contributions
ust+0.486
nes+0.474
vel+0.473
onel+0.434
aces+0.431
Neg logit contributions
 last-0.285
 Ms-0.239
 front-0.215
 teach-0.214
 hugs-0.212
TokenYes
Token ID5297
Feature activation+5.45e-01

Pos logit contributions
 last+0.573
 Ms+0.462
 hugs+0.418
 teach+0.409
 front+0.408
Neg logit contributions
ust-1.116
nes-1.096
vel-1.078
ode-0.978
onel-0.966
Token please
Token ID3387
Feature activation+1.14e+00

Pos logit contributions
 last+0.579
 Ms+0.493
 front+0.437
 praise+0.427
 teach+0.425
Neg logit contributions
ust-1.170
nes-1.141
vel-1.140
oll-1.034
onel-1.033
Token!"
Token ID2474
Feature activation+3.43e-01

Pos logit contributions
 last+1.357
 Ms+1.232
 praise+1.056
 front+1.053
 hugs+1.041
Neg logit contributions
vel-2.257
ust-2.227
nes-2.209
ode-1.985
aces-1.960
Token
Token ID198
Feature activation+2.95e-01

Pos logit contributions
 last+0.376
 Ms+0.305
 teach+0.275
 front+0.268
 hugs+0.266
Neg logit contributions
ust-0.719
nes-0.710
vel-0.698
aces-0.647
oll-0.634
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
 last+0.320
 Ms+0.263
 teach+0.243
 front+0.236
 hugs+0.235
Neg logit contributions
ust-0.627
nes-0.617
vel-0.602
onel-0.551
oll-0.550
TokenL
Token ID43
Feature activation-1.18e+00

Pos logit contributions
 last+0.191
 Ms+0.160
 hugs+0.146
 teach+0.146
 front+0.143
Neg logit contributions
ust-0.313
nes-0.305
vel-0.299
aces-0.268
onel-0.267
Tokenily
Token ID813
Feature activation+4.40e-01

Pos logit contributions
ust+1.049
nes+0.992
vel+0.984
ode+0.841
onel+0.814
Neg logit contributions
 last-2.789
 Ms-2.557
 hugs-2.428
 teach-2.424
 front-2.404
Token got
Token ID1392
Feature activation+5.09e-02

Pos logit contributions
ust+0.855
nes+0.830
vel+0.824
onel+0.760
aces+0.751
Neg logit contributions
 last-0.569
 Ms-0.493
 front-0.442
 teach-0.442
 hugs-0.434
Token up
Token ID510
Feature activation+9.22e-01

Pos logit contributions
 last+0.068
 Ms+0.060
 teach+0.054
+0.053
 front+0.053
Neg logit contributions
ust-0.095
nes-0.092
vel-0.092
onel-0.086
oll-0.084
Token and
Token ID290
Feature activation+4.71e-01

Pos logit contributions
 last+1.151
 Ms+1.042
 teach+0.943
 hugs+0.934
 praise+0.904
Neg logit contributions
ust-1.676
vel-1.647
nes-1.639
oll-1.496
aces-1.484
Token looked
Token ID3114
Feature activation+2.93e-02

Pos logit contributions
 last+0.650
 Ms+0.556
+0.515
 teach+0.514
't+0.508
Neg logit contributions
ust-0.837
vel-0.820
nes-0.816
ode-0.751
onel-0.750
Token under
Token ID739
Feature activation+3.68e-01

Pos logit contributions
 last+0.054
 Ms+0.050
 teach+0.046
't+0.046
 front+0.046
Neg logit contributions
ust-0.040
vel-0.038
nes-0.038
onel-0.034
aces-0.033
Token the
Token ID262
Feature activation+6.76e-01

Pos logit contributions
 last+0.327
 Ms+0.281
 teach+0.245
't+0.221
+0.220
Neg logit contributions
ust-0.810
nes-0.799
vel-0.798
aces-0.752
onel-0.742
Token sofa
Token ID34902
Feature activation-7.37e-01

Pos logit contributions
 last+1.052
 Ms+0.986
 teach+0.886
+0.875
't+0.869
Neg logit contributions
ust-1.013
nes-1.004
vel-1.002
ode-0.921
aces-0.915
Token.
Token ID13
Feature activation-7.48e-01

Pos logit contributions
ust+1.667
nes+1.619
vel+1.606
onel+1.501
ode+1.477
Neg logit contributions
 last-0.742
 Ms-0.604
 front-0.536
 teach-0.523
 hugs-0.510
Token She
Token ID1375
Feature activation-2.56e-01

Pos logit contributions
ust+1.528
nes+1.488
vel+1.475
onel+1.359
ode+1.346
Neg logit contributions
 last-0.913
 Ms-0.791
 front-0.700
 teach-0.697
 hugs-0.688
Token saw
Token ID2497
Feature activation+7.16e-01

Pos logit contributions
ust+0.538
nes+0.525
vel+0.520
onel+0.484
aces+0.480
Neg logit contributions
 last-0.287
 Ms-0.244
 teach-0.216
 front-0.214
 hugs-0.208
Token the
Token ID262
Feature activation+9.35e-01

Pos logit contributions
 last+0.745
 Ms+0.651
+0.586
 teach+0.582
't+0.558
Neg logit contributions
ust-1.475
vel-1.444
nes-1.426
onel-1.392
aces-1.377
Token will
Token ID481
Feature activation+6.68e-01

Pos logit contributions
 last+0.532
 Ms+0.499
 front+0.450
 hugs+0.429
 Mrs+0.428
Neg logit contributions
ust-0.672
nes-0.653
vel-0.640
onel-0.588
oll-0.570
Token watch
Token ID2342
Feature activation+5.33e-02

Pos logit contributions
 last+0.822
 Ms+0.774
 hugs+0.644
+0.642
 front+0.634
Neg logit contributions
ust-1.282
nes-1.276
vel-1.251
ode-1.145
onel-1.139
Token over
Token ID625
Feature activation+5.26e-01

Pos logit contributions
 last+0.071
 Ms+0.062
 front+0.056
 teach+0.055
+0.055
Neg logit contributions
ust-0.100
vel-0.098
nes-0.097
onel-0.089
aces-0.088
Token it
Token ID340
Feature activation-2.68e-01

Pos logit contributions
 last+0.651
 Ms+0.573
 teach+0.511
+0.503
 praise+0.496
Neg logit contributions
ust-1.026
vel-1.008
nes-0.991
onel-0.911
aces-0.910
Token."
Token ID526
Feature activation+8.17e-01

Pos logit contributions
ust+0.556
nes+0.540
vel+0.538
onel+0.497
aces+0.492
Neg logit contributions
 last-0.309
 Ms-0.265
 front-0.234
 teach-0.233
 hugs-0.231
Token Emily
Token ID17608
Feature activation+4.23e-01

Pos logit contributions
 last+0.853
 Ms+0.656
 front+0.612
 teach+0.576
 first+0.567
Neg logit contributions
ust-1.740
vel-1.737
nes-1.728
aces-1.607
onel-1.578
Token was
Token ID373
Feature activation-1.39e-01

Pos logit contributions
 last+0.492
 Ms+0.424
 front+0.375
 teach+0.368
't+0.364
Neg logit contributions
nes-0.855
ust-0.854
vel-0.831
aces-0.773
onel-0.765
Token so
Token ID523
Feature activation-4.20e-01

Pos logit contributions
ust+0.222
nes+0.215
vel+0.212
onel+0.192
aces+0.189
Neg logit contributions
 last-0.230
 Ms-0.205
 teach-0.190
 front-0.188
 hugs-0.187
Token happy
Token ID3772
Feature activation+4.16e-01

Pos logit contributions
ust+0.743
nes+0.718
vel+0.712
onel+0.651
aces+0.640
Neg logit contributions
 last-0.628
 Ms-0.555
 front-0.507
 teach-0.507
 hugs-0.499
Token.
Token ID13
Feature activation+9.19e-01

Pos logit contributions
 last+0.484
 Ms+0.416
 front+0.375
 teach+0.366
 praise+0.355
Neg logit contributions
ust-0.846
vel-0.827
nes-0.820
onel-0.750
aces-0.749
Token
Token ID198
Feature activation+3.75e-01

Pos logit contributions
 last+1.290
 Ms+1.037
 front+1.015
 teach+1.009
 praise+0.953
Neg logit contributions
ust-1.671
nes-1.640
vel-1.591
aces-1.481
onel-1.473
Token was
Token ID373
Feature activation-2.58e-01

Pos logit contributions
ust+0.356
nes+0.347
vel+0.344
onel+0.316
aces+0.313
Neg logit contributions
 last-0.219
 Ms-0.188
 front-0.168
 teach-0.167
 hugs-0.164
Token very
Token ID845
Feature activation+6.24e-01

Pos logit contributions
ust+0.459
nes+0.446
vel+0.443
onel+0.404
aces+0.398
Neg logit contributions
 last-0.378
 Ms-0.331
 teach-0.303
 front-0.302
 hugs-0.300
Token curious
Token ID11040
Feature activation-3.46e-01

Pos logit contributions
 last+0.926
 Ms+0.811
 front+0.755
+0.747
 hugs+0.736
Neg logit contributions
ust-1.121
nes-1.073
vel-1.054
onel-0.972
oll-0.939
Token and
Token ID290
Feature activation+6.84e-01

Pos logit contributions
ust+0.696
nes+0.672
vel+0.671
onel+0.618
aces+0.614
Neg logit contributions
 last-0.427
 Ms-0.369
 front-0.329
 teach-0.329
 hugs-0.321
Token she
Token ID673
Feature activation-2.00e-01

Pos logit contributions
 last+0.962
 Ms+0.826
+0.796
 front+0.783
 hugs+0.760
Neg logit contributions
ust-1.238
nes-1.201
vel-1.160
onel-1.069
aces-1.050
Token wanted
Token ID2227
Feature activation+7.90e-02

Pos logit contributions
ust+0.391
nes+0.379
vel+0.375
onel+0.344
aces+0.340
Neg logit contributions
 last-0.258
 Ms-0.222
 front-0.201
 teach-0.199
 hugs-0.196
Token to
Token ID284
Feature activation+4.30e-01

Pos logit contributions
 last+0.084
 Ms+0.071
+0.062
 teach+0.061
 front+0.061
Neg logit contributions
ust-0.168
nes-0.164
vel-0.163
aces-0.150
onel-0.150
Token know
Token ID760
Feature activation-3.30e-01

Pos logit contributions
 last+0.537
 Ms+0.479
+0.418
 hugs+0.416
 front+0.404
Neg logit contributions
ust-0.829
nes-0.814
vel-0.795
ode-0.724
onel-0.720
Token what
Token ID644
Feature activation-1.35e+00

Pos logit contributions
ust+0.559
nes+0.537
vel+0.536
onel+0.486
aces+0.480
Neg logit contributions
 last-0.506
 Ms-0.450
 front-0.413
 teach-0.410
 hugs-0.405
Token was
Token ID373
Feature activation-1.51e+00

Pos logit contributions
ust+3.211
nes+3.121
vel+3.113
onel+2.952
ode+2.884
Neg logit contributions
 last-1.287
 Ms-1.063
 teach-0.912
 hugs-0.878
 front-0.866
Token inside
Token ID2641
Feature activation+2.11e-01

Pos logit contributions
ust+2.442
nes+2.286
vel+2.251
ode+2.082
onel+2.082
Neg logit contributions
 last-2.623
 Ms-2.310
 front-2.167
 teach-2.160
 first-2.158
Token shouted
Token ID17293
Feature activation+1.36e-01

Pos logit contributions
ust+1.072
nes+1.043
vel+1.036
onel+0.962
aces+0.951
Neg logit contributions
 last-0.575
 Ms-0.487
 front-0.429
 teach-0.429
 hugs-0.420
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
 last+0.158
 Ms+0.133
 front+0.118
 teach+0.116
 hugs+0.113
Neg logit contributions
ust-0.280
vel-0.275
nes-0.272
onel-0.252
aces-0.248
Token
Token ID198
Feature activation+2.75e-01

Pos logit contributions
 last+0.140
 Ms+0.111
 teach+0.102
 hugs+0.100
 front+0.100
Neg logit contributions
ust-0.255
nes-0.251
vel-0.246
aces-0.227
onel-0.226
Token
Token ID198
Feature activation+3.12e-02

Pos logit contributions
 last+0.301
 Ms+0.242
 teach+0.221
 hugs+0.215
 front+0.210
Neg logit contributions
ust-0.587
nes-0.583
vel-0.568
onel-0.516
aces-0.512
TokenMom
Token ID29252
Feature activation+7.00e-01

Pos logit contributions
 last+0.043
 Ms+0.037
 teach+0.033
 hugs+0.033
 front+0.033
Neg logit contributions
ust-0.059
nes-0.058
vel-0.057
onel-0.051
aces-0.050
Token smiled
Token ID13541
Feature activation+1.58e-01

Pos logit contributions
 last+0.853
 Ms+0.742
 teach+0.670
+0.665
 front+0.662
Neg logit contributions
ust-1.368
vel-1.339
nes-1.316
oll-1.219
ode-1.218
Token and
Token ID290
Feature activation+2.90e-01

Pos logit contributions
 last+0.190
 Ms+0.164
 teach+0.145
't+0.144
+0.144
Neg logit contributions
ust-0.323
vel-0.311
nes-0.304
onel-0.284
aces-0.281
Token carried
Token ID5281
Feature activation+3.49e-01

Pos logit contributions
 last+0.323
 Ms+0.262
 front+0.246
't+0.237
 teach+0.237
Neg logit contributions
ust-0.603
nes-0.591
vel-0.585
onel-0.545
oll-0.540
Token a
Token ID257
Feature activation-3.07e-02

Pos logit contributions
 last+0.501
 Ms+0.434
 teach+0.408
+0.400
 front+0.397
Neg logit contributions
ust-0.602
nes-0.601
vel-0.592
onel-0.532
aces-0.529
Token small
Token ID1402
Feature activation+4.65e-01

Pos logit contributions
ust+0.055
nes+0.054
vel+0.053
onel+0.048
ode+0.048
Neg logit contributions
 last-0.043
 Ms-0.038
 teach-0.034
 front-0.034
't-0.034
Token bundle
Token ID18537
Feature activation+8.10e-02

Pos logit contributions
 last+0.705
 Ms+0.646
't+0.592
 teach+0.579
 front+0.567
Neg logit contributions
ust-0.749
vel-0.739
nes-0.732
ode-0.656
onel-0.654
Token thanked
Token ID26280
Feature activation-1.28e-01

Pos logit contributions
ust+3.398
vel+3.279
nes+3.191
aces+2.957
ode+2.956
Neg logit contributions
 last-1.915
 Ms-1.514
 hugs-1.446
 front-1.396
 teach-1.382
Token him
Token ID683
Feature activation+1.59e-01

Pos logit contributions
ust+0.286
nes+0.280
vel+0.278
onel+0.258
aces+0.255
Neg logit contributions
 last-0.124
 Ms-0.102
 front-0.088
 teach-0.087
 hugs-0.087
Token for
Token ID329
Feature activation-1.16e+00

Pos logit contributions
 last+0.192
 Ms+0.169
 front+0.151
+0.149
 teach+0.148
Neg logit contributions
ust-0.316
vel-0.309
nes-0.304
onel-0.281
aces-0.278
Token coming
Token ID2406
Feature activation+3.31e-01

Pos logit contributions
ust+2.669
nes+2.573
vel+2.527
aces+2.372
onel+2.356
Neg logit contributions
 last-1.222
 Ms-0.998
 teach-0.896
 front-0.880
 hugs-0.862
Token to
Token ID284
Feature activation-8.38e-02

Pos logit contributions
 last+0.501
 Ms+0.469
 teach+0.416
+0.413
 hugs+0.407
Neg logit contributions
ust-0.548
nes-0.539
vel-0.529
onel-0.481
ode-0.470
Token help
Token ID1037
Feature activation-5.16e-01

Pos logit contributions
ust+0.195
nes+0.191
vel+0.190
onel+0.177
ode+0.175
Neg logit contributions
 last-0.073
 Ms-0.061
 hugs-0.049
 front-0.049
 teach-0.048
Token them
Token ID606
Feature activation-4.71e-01

Pos logit contributions
ust+1.102
nes+1.072
vel+1.064
onel+0.989
aces+0.978
Neg logit contributions
 last-0.579
 Ms-0.489
 front-0.430
 teach-0.430
 hugs-0.421
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
ust+1.015
nes+0.988
vel+0.981
onel+0.912
aces+0.902
Neg logit contributions
 last-0.515
 Ms-0.434
 front-0.380
 teach-0.379
 hugs-0.372
Token
Token ID198
Feature activation-1.31e-01

Pos logit contributions
ust+0.760
nes+0.742
vel+0.735
onel+0.684
aces+0.679
Neg logit contributions
 last-0.371
 Ms-0.308
 teach-0.271
 front-0.270
 hugs-0.262
Token
Token ID198
Feature activation+2.53e-01

Pos logit contributions
ust+0.256
nes+0.251
vel+0.248
onel+0.227
aces+0.225
Neg logit contributions
 last-0.167
 Ms-0.144
 teach-0.130
 front-0.129
 hugs-0.127
TokenThe
Token ID464
Feature activation+8.34e-02

Pos logit contributions
 last+0.302
 Ms+0.253
 hugs+0.232
 front+0.225
 teach+0.225
Neg logit contributions
ust-0.527
nes-0.516
vel-0.509
onel-0.461
ode-0.454
Token to
Token ID284
Feature activation+4.69e-02

Pos logit contributions
 last+0.033
 Ms+0.028
 teach+0.025
 front+0.025
+0.025
Neg logit contributions
ust-0.069
nes-0.067
vel-0.066
onel-0.062
aces-0.062
Token share
Token ID2648
Feature activation-1.39e+00

Pos logit contributions
 last+0.049
 Ms+0.043
 hugs+0.037
 front+0.037
 teach+0.036
Neg logit contributions
ust-0.101
nes-0.100
vel-0.097
onel-0.090
ode-0.089
Token its
Token ID663
Feature activation+8.80e-01

Pos logit contributions
ust+3.168
nes+3.104
vel+3.036
onel+2.911
ode+2.859
Neg logit contributions
 last-1.429
 Ms-1.173
 front-1.039
 hugs-1.030
 teach-1.023
Token space
Token ID2272
Feature activation-8.50e-02

Pos logit contributions
 last+1.361
 Ms+1.303
+1.177
 Mrs+1.155
 teach+1.142
Neg logit contributions
ust-1.439
nes-1.374
vel-1.341
aces-1.243
onel-1.208
Token with
Token ID351
Feature activation-5.53e-01

Pos logit contributions
ust+0.180
nes+0.177
vel+0.174
aces+0.161
onel+0.161
Neg logit contributions
 last-0.094
 Ms-0.081
 front-0.070
 hugs-0.069
 teach-0.069
Token anyone
Token ID2687
Feature activation-1.23e+00

Pos logit contributions
ust+0.995
nes+0.964
vel+0.956
onel+0.874
aces+0.861
Neg logit contributions
 last-0.806
 Ms-0.709
 front-0.646
 teach-0.646
 hugs-0.638
Token else
Token ID2073
Feature activation-4.83e-01

Pos logit contributions
ust+2.598
nes+2.549
vel+2.514
onel+2.345
ode+2.322
Neg logit contributions
 last-1.426
 Ms-1.180
 hugs-1.069
 teach-1.066
 front-1.040
Token.
Token ID13
Feature activation-9.01e-01

Pos logit contributions
ust+1.060
nes+1.032
vel+1.026
onel+0.955
aces+0.945
Neg logit contributions
 last-0.507
 Ms-0.423
 front-0.368
 teach-0.368
 hugs-0.360
Token
Token ID198
Feature activation-2.72e-01

Pos logit contributions
ust+2.151
nes+2.097
vel+2.093
onel+1.963
ode+1.932
Neg logit contributions
 last-0.796
 Ms-0.656
 front-0.541
 teach-0.535
 hugs-0.528
Token
Token ID198
Feature activation-4.38e-01

Pos logit contributions
ust+0.611
nes+0.596
vel+0.591
onel+0.549
aces+0.543
Neg logit contributions
 last-0.268
 Ms-0.221
 teach-0.192
 front-0.191
 hugs-0.188
TokenOne
Token ID3198
Feature activation+9.87e-02

Pos logit contributions
ust+1.022
nes+0.996
vel+0.990
onel+0.923
aces+0.915
Neg logit contributions
 last-0.410
 Ms-0.333
 teach-0.283
 front-0.282
 hugs-0.277
Token I
Token ID314
Feature activation+5.11e-01

Pos logit contributions
 last+1.811
 Ms+1.494
 front+1.485
glers+1.361
 praise+1.349
Neg logit contributions
ust-2.366
nes-2.321
vel-2.227
onel-2.076
aces-2.053
Token understand
Token ID1833
Feature activation+7.69e-01

Pos logit contributions
 last+0.729
 Ms+0.669
 front+0.625
+0.577
 teach+0.573
Neg logit contributions
ust-0.916
nes-0.887
vel-0.857
onel-0.792
adow-0.775
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
 last+1.047
 Ms+0.921
+0.896
 front+0.872
 praise+0.826
Neg logit contributions
ust-1.404
vel-1.369
nes-1.336
onel-1.210
aces-1.201
Token You
Token ID921
Feature activation-1.23e-01

Pos logit contributions
 last+1.997
 Ms+1.632
 front+1.623
glers+1.538
 first+1.485
Neg logit contributions
ust-2.443
nes-2.382
vel-2.331
aces-2.133
onel-2.131
Token were
Token ID547
Feature activation-5.25e-01

Pos logit contributions
ust+0.226
nes+0.219
vel+0.215
onel+0.199
aces+0.195
Neg logit contributions
 last-0.172
 Ms-0.155
 front-0.141
 teach-0.136
 hugs-0.135
Token just
Token ID655
Feature activation-4.27e-01

Pos logit contributions
ust+0.990
nes+0.960
vel+0.952
onel+0.875
aces+0.864
Neg logit contributions
 last-0.724
 Ms-0.633
 front-0.572
 teach-0.572
 hugs-0.563
Token excited
Token ID6568
Feature activation+6.82e-01

Pos logit contributions
ust+0.846
nes+0.821
vel+0.816
onel+0.754
aces+0.744
Neg logit contributions
 last-0.541
 Ms-0.467
 front-0.419
 teach-0.418
 hugs-0.411
Token to
Token ID284
Feature activation-9.80e-02

Pos logit contributions
 last+0.820
 Ms+0.753
+0.701
 front+0.690
 teach+0.651
Neg logit contributions
ust-1.370
nes-1.315
vel-1.308
aces-1.209
onel-1.186
Token meet
Token ID1826
Feature activation+1.53e-01

Pos logit contributions
ust+0.199
nes+0.195
vel+0.192
onel+0.177
ode+0.176
Neg logit contributions
 last-0.116
 Ms-0.102
 front-0.088
 hugs-0.087
 teach-0.086
Token me
Token ID502
Feature activation+5.71e-01

Pos logit contributions
 last+0.199
 Ms+0.171
 front+0.158
 teach+0.155
+0.154
Neg logit contributions
ust-0.295
nes-0.286
vel-0.285
onel-0.261
aces-0.257
Token."
Token ID526
Feature activation+2.89e-01

Pos logit contributions
 last+0.704
 Ms+0.612
+0.551
 front+0.550
 praise+0.541
Neg logit contributions
ust-1.127
nes-1.106
vel-1.103
aces-1.006
onel-1.001
Token have
Token ID423
Feature activation-7.61e-01

Pos logit contributions
 last+0.465
 Ms+0.420
 hugs+0.354
 front+0.339
+0.337
Neg logit contributions
nes-0.918
ust-0.912
vel-0.879
aces-0.810
ode-0.810
Token money
Token ID1637
Feature activation+1.41e-01

Pos logit contributions
ust+1.549
nes+1.496
vel+1.492
onel+1.391
ode+1.360
Neg logit contributions
 last-0.945
 Ms-0.815
 front-0.728
 teach-0.725
 hugs-0.721
Token.
Token ID13
Feature activation-6.01e-01

Pos logit contributions
 last+0.161
 Ms+0.142
 front+0.125
 teach+0.124
 hugs+0.121
Neg logit contributions
ust-0.290
nes-0.282
vel-0.278
aces-0.260
oll-0.256
Token Money
Token ID12911
Feature activation-3.83e-01

Pos logit contributions
ust+1.410
nes+1.377
vel+1.369
onel+1.284
aces+1.268
Neg logit contributions
 last-0.544
 Ms-0.444
 front-0.373
 teach-0.370
 hugs-0.361
Token can
Token ID460
Feature activation+4.59e-01

Pos logit contributions
ust+0.759
nes+0.737
vel+0.730
onel+0.675
aces+0.669
Neg logit contributions
 last-0.480
 Ms-0.417
 front-0.372
 teach-0.372
 hugs-0.364
Token buy
Token ID2822
Feature activation-3.73e-01

Pos logit contributions
 last+0.500
 Ms+0.451
 hugs+0.397
 front+0.381
 praise+0.377
Neg logit contributions
ust-0.953
nes-0.939
vel-0.908
ode-0.838
aces-0.835
Token candy
Token ID18550
Feature activation-5.53e-01

Pos logit contributions
ust+0.685
nes+0.663
vel+0.658
onel+0.602
aces+0.597
Neg logit contributions
 last-0.522
 Ms-0.458
 front-0.417
 teach-0.416
 hugs-0.408
Token and
Token ID290
Feature activation-3.67e-01

Pos logit contributions
ust+1.195
nes+1.162
vel+1.155
onel+1.076
aces+1.061
Neg logit contributions
 last-0.600
 Ms-0.503
 teach-0.439
 front-0.439
 hugs-0.429
Token ice
Token ID4771
Feature activation-1.18e+00

Pos logit contributions
ust+0.685
nes+0.664
vel+0.658
onel+0.603
aces+0.598
Neg logit contributions
 last-0.501
 Ms-0.439
 front-0.397
 teach-0.395
 hugs-0.389
Token cream
Token ID8566
Feature activation-5.03e-01

Pos logit contributions
ust+2.335
vel+2.261
nes+2.246
onel+2.120
ode+2.046
Neg logit contributions
 last-1.562
 Ms-1.299
 hugs-1.174
 first-1.168
 front-1.167
Token and
Token ID290
Feature activation-4.52e-01

Pos logit contributions
ust+1.088
nes+1.060
vel+1.053
onel+0.979
aces+0.968
Neg logit contributions
 last-0.542
 Ms-0.455
 front-0.398
 teach-0.397
 hugs-0.389
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
 last+0.132
 Ms+0.108
 teach+0.094
 front+0.092
 praise+0.089
Neg logit contributions
ust-0.328
nes-0.321
vel-0.321
onel-0.299
aces-0.298
Token always
Token ID1464
Feature activation-3.62e-01

Pos logit contributions
 last+0.358
 Ms+0.324
 front+0.294
 teach+0.284
 praise+0.278
Neg logit contributions
ust-0.459
nes-0.450
vel-0.442
onel-0.411
aces-0.401
Token made
Token ID925
Feature activation-6.72e-01

Pos logit contributions
ust+0.678
nes+0.658
vel+0.653
onel+0.599
aces+0.591
Neg logit contributions
 last-0.497
 Ms-0.437
 front-0.394
 teach-0.391
 hugs-0.385
Token sure
Token ID1654
Feature activation-5.68e-01

Pos logit contributions
ust+0.851
nes+0.812
vel+0.794
onel+0.700
aces+0.687
Neg logit contributions
 last-1.343
 Ms-1.228
 teach-1.154
 front-1.151
 hugs-1.144
Token to
Token ID284
Feature activation-5.17e-01

Pos logit contributions
ust+1.288
nes+1.255
vel+1.250
onel+1.165
aces+1.153
Neg logit contributions
 last-0.559
 Ms-0.459
 teach-0.394
 front-0.394
 hugs-0.386
Token keep
Token ID1394
Feature activation-7.25e-01

Pos logit contributions
ust+1.018
nes+0.988
vel+0.981
onel+0.905
aces+0.894
Neg logit contributions
 last-0.662
 Ms-0.571
 teach-0.513
 front-0.513
 hugs-0.504
Token her
Token ID607
Feature activation-1.41e+00

Pos logit contributions
ust+1.510
nes+1.468
vel+1.447
onel+1.347
aces+1.329
Neg logit contributions
 last-0.873
 Ms-0.743
 front-0.669
 teach-0.665
 hugs-0.656
Token key
Token ID1994
Feature activation+4.37e-03

Pos logit contributions
ust+2.262
nes+2.218
vel+2.166
onel+1.927
oll+1.924
Neg logit contributions
 last-2.504
 front-2.106
 Ms-2.082
 teach-2.046
 hugs-2.041
Token close
Token ID1969
Feature activation+1.13e+00

Pos logit contributions
 last+0.006
 Ms+0.006
 teach+0.005
 hugs+0.005
 front+0.005
Neg logit contributions
ust-0.008
nes-0.007
vel-0.007
onel-0.007
aces-0.007
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
 last+1.303
 Ms+1.278
 teach+1.139
+1.132
't+1.076
Neg logit contributions
ust-2.270
nes-2.142
vel-2.120
aces-2.007
adow-1.990
Token<|endoftext|>
Token ID50256
Feature activation-5.08e-02

Pos logit contributions
ust+0.642
nes+0.626
vel+0.622
onel+0.581
aces+0.575
Neg logit contributions
 last-0.283
 Ms-0.231
 teach-0.201
 front-0.201
 hugs-0.195
Token"
Token ID1
Feature activation-9.47e-01

Pos logit contributions
 last+0.562
 Ms+0.474
 hugs+0.443
 teach+0.437
 praise+0.429
Neg logit contributions
ust-0.937
nes-0.918
vel-0.905
aund-0.801
ode-0.796
TokenLook
Token ID8567
Feature activation-1.50e+00

Pos logit contributions
ust+1.923
vel+1.856
nes+1.855
onel+1.735
aces+1.700
Neg logit contributions
 last-1.172
 Ms-1.015
 front-0.915
 teach-0.892
't-0.869
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
ust+3.697
nes+3.629
vel+3.519
onel+3.381
aces+3.367
Neg logit contributions
 last-1.237
 Ms-0.964
 hugs-0.816
 first-0.787
 teach-0.783
Token L
Token ID406
Feature activation-6.31e-01

Pos logit contributions
ust+2.681
nes+2.596
vel+2.585
onel+2.420
aces+2.388
Neg logit contributions
 last-1.366
 Ms-1.166
 teach-1.026
 hugs-1.011
 front-1.008
Tokenila
Token ID10102
Feature activation-1.04e-02

Pos logit contributions
ust+0.802
nes+0.766
vel+0.758
onel+0.666
aces+0.651
Neg logit contributions
 last-1.255
 Ms-1.140
 front-1.070
 teach-1.070
 hugs-1.058
Token,
Token ID11
Feature activation-1.70e+00

Pos logit contributions
ust+0.024
nes+0.023
vel+0.023
aces+0.022
onel+0.021
Neg logit contributions
 last-0.010
 Ms-0.008
 front-0.007
 teach-0.007
 hugs-0.007
Token I
Token ID314
Feature activation-4.61e-01

Pos logit contributions
ust+4.210
nes+4.062
vel+4.006
onel+3.819
aces+3.744
Neg logit contributions
 last-1.540
 Ms-1.192
 teach-1.108
 front-1.037
 hugs-1.030
Token see
Token ID766
Feature activation-8.41e-01

Pos logit contributions
ust+0.919
nes+0.893
vel+0.887
onel+0.819
aces+0.809
Neg logit contributions
 last-0.579
 Ms-0.500
 front-0.447
 teach-0.446
 hugs-0.439
Token a
Token ID257
Feature activation-8.03e-01

Pos logit contributions
ust+2.046
nes+1.998
vel+1.970
onel+1.855
aces+1.830
Neg logit contributions
 last-0.713
 Ms-0.553
 front-0.479
 teach-0.463
 hugs-0.457
Token wide
Token ID3094
Feature activation-6.02e-01

Pos logit contributions
ust+1.406
nes+1.355
vel+1.342
onel+1.232
aces+1.224
Neg logit contributions
 last-1.221
 Ms-1.073
 front-0.992
 teach-0.988
 hugs-0.961
Token land
Token ID1956
Feature activation-2.21e-01

Pos logit contributions
ust+0.903
nes+0.867
vel+0.857
onel+0.773
aces+0.758
Neg logit contributions
 last-1.055
 Ms-0.947
 front-0.884
 teach-0.880
 hugs-0.868
Token Let
Token ID3914
Feature activation-2.26e-01

Pos logit contributions
 last+0.178
 Ms+0.153
 front+0.138
 teach+0.137
 hugs+0.136
Neg logit contributions
ust-0.247
nes-0.242
vel-0.238
onel-0.220
ode-0.217
Token's
Token ID338
Feature activation-5.03e-01

Pos logit contributions
ust+0.349
nes+0.337
vel+0.333
onel+0.298
aces+0.292
Neg logit contributions
 last-0.383
 Ms-0.344
 teach-0.318
 front-0.317
 hugs-0.313
Token go
Token ID467
Feature activation+7.31e-01

Pos logit contributions
ust+0.952
nes+0.923
vel+0.916
onel+0.842
aces+0.831
Neg logit contributions
 last-0.684
 Ms-0.596
 front-0.539
 teach-0.539
 hugs-0.530
Token take
Token ID1011
Feature activation-2.50e-01

Pos logit contributions
 last+0.953
 Ms+0.922
+0.780
 teach+0.769
 Mrs+0.768
Neg logit contributions
nes-1.355
ust-1.316
vel-1.288
adow-1.179
ode-1.168
Token a
Token ID257
Feature activation-8.07e-01

Pos logit contributions
ust+0.517
nes+0.507
vel+0.504
onel+0.464
aces+0.461
Neg logit contributions
 last-0.291
 Ms-0.250
 teach-0.220
 front-0.217
 hugs-0.210
Token closer
Token ID5699
Feature activation-2.01e+00

Pos logit contributions
ust+1.757
nes+1.707
vel+1.694
onel+1.592
aces+1.577
Neg logit contributions
 last-0.891
 Ms-0.730
 front-0.650
 teach-0.647
 hugs-0.635
Token look
Token ID804
Feature activation-1.55e-03

Pos logit contributions
ust+4.558
vel+4.344
nes+4.329
onel+4.264
oll+3.979
Neg logit contributions
 last-2.420
 front-1.764
 Ms-1.740
 first-1.695
 teach-1.690
Token,"
Token ID553
Feature activation-7.24e-01

Pos logit contributions
ust+0.003
nes+0.003
vel+0.003
onel+0.003
aces+0.003
Neg logit contributions
 last-0.002
 Ms-0.002
 teach-0.002
 front-0.002
-0.001
Token her
Token ID607
Feature activation-3.55e-01

Pos logit contributions
ust+1.536
nes+1.489
vel+1.481
onel+1.375
aces+1.357
Neg logit contributions
 last-0.845
 Ms-0.725
 teach-0.640
 front-0.636
 hugs-0.630
Token mom
Token ID1995
Feature activation+8.03e-01

Pos logit contributions
ust+0.709
nes+0.688
vel+0.681
onel+0.630
aces+0.623
Neg logit contributions
 last-0.448
 Ms-0.387
 teach-0.345
 front-0.343
 hugs-0.339
Tokenmy
Token ID1820
Feature activation+5.74e-01

Pos logit contributions
 last+0.908
 Ms+0.776
+0.712
 front+0.711
't+0.675
Neg logit contributions
ust-1.691
vel-1.636
nes-1.621
oll-1.520
aces-1.496
Token,
Token ID11
Feature activation+1.60e+00

Pos logit contributions
 last+0.880
 Ms+0.727
 front+0.657
 praise+0.648
 teach+0.646
Neg logit contributions
vel-1.670
ust-1.643
nes-1.615
aund-1.524
aces-1.522
Token the
Token ID262
Feature activation+2.62e-01

Pos logit contributions
 last+1.891
 hugs+1.631
 Ms+1.615
 praise+1.537
+1.480
Neg logit contributions
ust-2.837
nes-2.831
vel-2.822
aces-2.612
onel-2.606
Token man
Token ID582
Feature activation+1.49e-01

Pos logit contributions
 last+0.355
 Ms+0.325
+0.284
 front+0.282
 hugs+0.281
Neg logit contributions
ust-0.476
vel-0.460
nes-0.455
ode-0.425
aces-0.423
Token went
Token ID1816
Feature activation+5.18e-01

Pos logit contributions
 last+0.201
 Ms+0.178
 front+0.162
+0.157
 teach+0.157
Neg logit contributions
ust-0.278
nes-0.269
vel-0.266
onel-0.246
ode-0.243
Token on
Token ID319
Feature activation+3.94e-01

Pos logit contributions
 last+0.643
 Ms+0.567
+0.520
 praise+0.493
 teach+0.492
Neg logit contributions
ust-0.978
nes-0.964
vel-0.963
onel-0.868
aces-0.865
Token an
Token ID281
Feature activation-1.81e+00

Pos logit contributions
 last+0.413
 Ms+0.352
 teach+0.308
 hugs+0.307
 praise+0.299
Neg logit contributions
ust-0.821
vel-0.810
nes-0.803
aces-0.737
oll-0.726
Token unknown
Token ID6439
Feature activation-1.01e+00

Pos logit contributions
ust+3.313
vel+3.243
nes+3.239
onel+2.964
aces+2.882
Neg logit contributions
 last-2.800
 Ms-2.262
 teach-2.232
 front-2.231
 first-2.189
Token adventure
Token ID8855
Feature activation+1.04e-01

Pos logit contributions
ust+1.825
vel+1.762
nes+1.751
onel+1.621
aces+1.603
Neg logit contributions
 last-1.523
 Ms-1.293
 teach-1.224
 front-1.223
 hugs-1.173
Token.
Token ID13
Feature activation-2.15e-01

Pos logit contributions
 last+0.121
 Ms+0.108
 hugs+0.095
 teach+0.094
 front+0.093
Neg logit contributions
ust-0.212
nes-0.207
vel-0.205
aces-0.189
onel-0.188
Token He
Token ID679
Feature activation-8.35e-02

Pos logit contributions
ust+0.418
nes+0.406
vel+0.400
onel+0.369
aces+0.368
Neg logit contributions
 last-0.280
 Ms-0.240
 teach-0.218
 front-0.216
 hugs-0.215
Token didn
Token ID1422
Feature activation+5.54e-01

Pos logit contributions
ust+0.170
nes+0.166
vel+0.163
onel+0.151
ode+0.149
Neg logit contributions
 last-0.100
 Ms-0.086
 front-0.077
 teach-0.076
 hugs-0.075
Token called
Token ID1444
Feature activation+2.29e-01

Pos logit contributions
 last+0.116
 Ms+0.102
 teach+0.091
 front+0.091
 hugs+0.090
Neg logit contributions
ust-0.160
nes-0.157
vel-0.156
onel-0.141
aces-0.141
Token Sally
Token ID25737
Feature activation-9.35e-01

Pos logit contributions
 last+0.297
 Ms+0.250
 teach+0.231
 front+0.229
 hugs+0.226
Neg logit contributions
ust-0.440
nes-0.440
vel-0.428
aces-0.391
onel-0.389
Token.
Token ID13
Feature activation-2.18e-01

Pos logit contributions
ust+2.147
nes+2.103
vel+2.092
onel+1.965
ode+1.922
Neg logit contributions
 last-0.922
 Ms-0.743
 front-0.638
 hugs-0.626
 teach-0.625
Token She
Token ID1375
Feature activation-7.02e-01

Pos logit contributions
ust+0.430
nes+0.413
vel+0.411
onel+0.380
aces+0.378
Neg logit contributions
 last-0.280
 Ms-0.241
 teach-0.217
 front-0.217
 hugs-0.213
Token loved
Token ID6151
Feature activation-8.84e-01

Pos logit contributions
ust+1.350
nes+1.300
vel+1.298
onel+1.196
ode+1.176
Neg logit contributions
 last-0.949
 Ms-0.824
 teach-0.744
 front-0.742
 hugs-0.737
Token playing
Token ID2712
Feature activation-1.88e+00

Pos logit contributions
ust+2.057
nes+2.005
vel+1.986
onel+1.867
aces+1.834
Neg logit contributions
 last-0.850
 Ms-0.701
 teach-0.602
 front-0.598
 hugs-0.597
Token outside
Token ID2354
Feature activation+3.75e-01

Pos logit contributions
ust+4.392
nes+4.121
vel+3.986
onel+3.871
oll+3.767
Neg logit contributions
 last-1.993
 Ms-1.717
 teach-1.637
 while-1.558
 front-1.538
Token and
Token ID290
Feature activation+3.05e-01

Pos logit contributions
 last+0.620
 Ms+0.568
 teach+0.525
 praise+0.519
 hugs+0.519
Neg logit contributions
ust-0.569
vel-0.555
nes-0.549
onel-0.494
aces-0.494
Token jumping
Token ID14284
Feature activation+7.69e-02

Pos logit contributions
 last+0.399
 Ms+0.349
 front+0.314
 teach+0.311
+0.309
Neg logit contributions
ust-0.572
vel-0.561
nes-0.557
ode-0.507
onel-0.506
Token in
Token ID287
Feature activation-2.40e-01

Pos logit contributions
 last+0.120
 Ms+0.108
 teach+0.099
 front+0.097
 praise+0.096
Neg logit contributions
ust-0.126
nes-0.124
vel-0.123
aces-0.113
onel-0.110
Token p
Token ID279
Feature activation-4.28e-01

Pos logit contributions
ust+0.589
nes+0.576
vel+0.575
onel+0.536
aces+0.533
Neg logit contributions
 last-0.184
 Ms-0.144
 teach-0.117
 front-0.111
 hugs-0.111
Token mom
Token ID1995
Feature activation+1.22e+00

Pos logit contributions
 last+0.524
 Ms+0.496
 teach+0.424
+0.424
't+0.422
Neg logit contributions
ust-0.699
vel-0.666
nes-0.664
oll-0.619
onel-0.617
Token put
Token ID1234
Feature activation+6.83e-02

Pos logit contributions
 last+1.523
 Ms+1.372
 front+1.262
+1.253
't+1.221
Neg logit contributions
ust-2.305
vel-2.294
nes-2.206
ode-2.092
onel-2.069
Token the
Token ID262
Feature activation+2.26e-02

Pos logit contributions
 last+0.073
 Ms+0.063
 teach+0.056
 front+0.054
't+0.053
Neg logit contributions
ust-0.144
vel-0.141
nes-0.139
onel-0.131
aces-0.130
Token cream
Token ID8566
Feature activation-6.73e-01

Pos logit contributions
 last+0.034
 Ms+0.032
 teach+0.028
 front+0.028
't+0.028
Neg logit contributions
ust-0.038
nes-0.036
vel-0.036
onel-0.033
aces-0.033
Token on
Token ID319
Feature activation+1.43e-01

Pos logit contributions
ust+1.139
nes+1.098
vel+1.089
onel+0.988
aces+0.971
Neg logit contributions
 last-1.061
 Ms-0.930
 front-0.864
 teach-0.858
 hugs-0.849
Token his
Token ID465
Feature activation-1.70e+00

Pos logit contributions
 last+0.168
 Ms+0.146
 teach+0.132
 hugs+0.129
 praise+0.126
Neg logit contributions
ust-0.283
nes-0.276
vel-0.274
aces-0.251
onel-0.251
Token knee
Token ID10329
Feature activation-1.50e-01

Pos logit contributions
ust+2.721
vel+2.569
nes+2.538
onel+2.336
aund+2.234
Neg logit contributions
 last-3.102
 front-2.684
 Ms-2.640
 hugs-2.573
 teach-2.565
Token and
Token ID290
Feature activation+4.44e-01

Pos logit contributions
ust+0.314
nes+0.308
vel+0.305
aces+0.284
onel+0.283
Neg logit contributions
 last-0.167
 Ms-0.144
 front-0.125
 teach-0.124
 hugs-0.122
Token soon
Token ID2582
Feature activation-1.24e+00

Pos logit contributions
 last+0.530
 Ms+0.445
 front+0.418
+0.405
't+0.404
Neg logit contributions
ust-0.869
nes-0.861
vel-0.851
oll-0.780
onel-0.779
Token it
Token ID340
Feature activation+8.29e-01

Pos logit contributions
ust+2.888
nes+2.805
vel+2.781
onel+2.591
ode+2.586
Neg logit contributions
 last-1.224
 Ms-1.004
 hugs-0.877
 front-0.836
 teach-0.830
Token felt
Token ID2936
Feature activation+8.61e-01

Pos logit contributions
 last+1.015
 Ms+0.921
 teach+0.832
 praise+0.816
 front+0.801
Neg logit contributions
ust-1.612
nes-1.570
vel-1.570
onel-1.468
oll-1.445