TOP ACTIVATIONS
MAX = 3.969

played basketball together until the sun went down . <|endoftext|> Once
in the park until the sun went down . <|endoftext|> Once
continued to play until the sun went down . It was
in the park until the sun came out and the fog
slide all day until the sun went down . They had
nice !" The lady smiled and said , " I 'm
that they played until the sun went down . From that
they all played until the sun went down . They were
laughing and playing until the sun set and it was time
you okay , my d arl ings ?" she asked .
. The little girl smiled and went on her way .
around the garden until the sun went down . They became
flew by and soon the sun went down and it was
the k ite until the sun went down . The end
." The sw an smiled and sw am away .
L ily smiled and hugged them . She said
ball all day until the sun went down . Max was
y flew away , happy and clean . <|endoftext|> Once upon
and free . Lily smiled and laughed . She loved to
they played together until the sun came out and the beach

MIN ACTIVATIONS
MIN = -3.944

surprise for you . Do you want to know what it
has many things inside . They see a candle , a
stories with them . Do you want to try ?"
try something different . He asked his farm friends to help
blocks in the basement . They are very happy . They
many shiny things inside . They are curious and want to
many modern toys inside . They are shiny and make noises
a lot of boxes . They want to see what is
fix your bike . Do you want me to try ?"
asks Tom , " Do you want ice cream ?"
box in the garage . They wonder what is inside .
made a plan . He asked his mom for help ,
with something else . Do you want to draw a picture
in the play room . They are very happy and start
toys in the park . They are very happy .
box in the yard . They want to see what is
real car on it . They are very excited . They
a new one . Do you want to help me ?"
see a big box . They want to know what is
box in the garage . They open the box and see

INTERVAL 1.991 - 3.969
CONTAINS 0.799%

amed them with sugar and cream . M
at the park . The sun was shining , and the
up and said : " Let 's take it inside and
enjoyed their meal . After lunch , they went back to
reached , but she still couldn 't touch the rainbow .

INTERVAL 0.012 - 1.991
CONTAINS 48.264%

then , a voice called out behind her . It was
a great book . It was colourful and it even had
It is just dark . Don 't be scared ."
his mom decided it was too late . His mom said
she had experienced . Little did she know

INTERVAL -1.966 - 0.012
CONTAINS 49.941%

duck saw that the river was icy . The duck didn
and went back inside . Suddenly , the tank appeared again
. Peter punched Sam right back and Sam gave up .
park . John promised , and he has
Anna , come here . I have something for you ."

INTERVAL -3.944 - -1.966
CONTAINS 0.996%

" I don 't know . Maybe it is a toy
toys with his friend . They started to fight over the
the bus . The moral of the story is
<|endoftext|> Once upon a time , there was a little frog
<|endoftext|> Once upon a time , there was a big party
Token played
Token ID2826
Feature activation-1.10e-01

Pos logit contributions
 questions+0.499
years+0.486
ruits+0.469
year+0.438
adies+0.433
Neg logit contributions
ww-0.722
burn-0.678
lor-0.658
 subs-0.652
 tou-0.623
Token basketball
Token ID9669
Feature activation+1.13e+00

Pos logit contributions
years+0.175
 questions+0.172
ruits+0.166
year+0.159
nam+0.158
Neg logit contributions
ww-0.193
lor-0.180
burn-0.179
 subs-0.171
actus-0.164
Token together
Token ID1978
Feature activation+1.03e-01

Pos logit contributions
ww+2.236
lor+2.127
burn+2.113
 subs+2.003
 tou+1.953
Neg logit contributions
 questions-1.597
years-1.583
ruits-1.491
year-1.441
nam-1.386
Token until
Token ID1566
Feature activation+1.36e+00

Pos logit contributions
ww+0.211
lor+0.199
burn+0.199
 subs+0.191
actus+0.183
Neg logit contributions
years-0.136
 questions-0.135
ruits-0.127
year-0.122
nam-0.121
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
lor+2.370
ww+2.352
burn+2.195
actus+2.094
 subs+2.068
Neg logit contributions
years-2.177
 questions-2.163
nam-2.053
ruits-2.026
year-2.007
Token sun
Token ID4252
Feature activation+3.97e+00

Pos logit contributions
ww+1.916
lor+1.902
burn+1.775
 subs+1.638
actus+1.569
Neg logit contributions
 questions-2.613
years-2.519
ruits-2.420
 three-2.379
nam-2.367
Token went
Token ID1816
Feature activation+1.99e+00

Pos logit contributions
ww+10.204
lor+10.194
burn+8.729
rob+8.656
 hawk+8.462
Neg logit contributions
 three-4.025
 questions-3.845
 want-3.763
 liked-3.761
 anx-3.580
Token down
Token ID866
Feature activation+9.25e-01

Pos logit contributions
lor+4.130
ww+4.042
burn+3.654
 subs+3.589
actus+3.583
Neg logit contributions
 questions-2.721
years-2.643
nam-2.500
year-2.457
 three-2.420
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
ww+1.875
lor+1.776
burn+1.751
 subs+1.653
actus+1.624
Neg logit contributions
 questions-1.273
years-1.259
ruits-1.171
year-1.168
nam-1.107
Token<|endoftext|>
Token ID50256
Feature activation+1.45e+00

Pos logit contributions
ww+2.028
lor+1.938
burn+1.912
 subs+1.812
actus+1.760
Neg logit contributions
years-1.526
 questions-1.518
ruits-1.457
year-1.418
nam-1.369
TokenOnce
Token ID7454
Feature activation-3.66e-01

Pos logit contributions
ww+2.724
burn+2.519
lor+2.443
 subs+2.371
 tou+2.273
Neg logit contributions
 questions-2.236
years-2.170
ruits-2.105
year-2.054
 three-2.001
Token in
Token ID287
Feature activation+2.43e-01

Pos logit contributions
ww+0.640
lor+0.604
burn+0.603
 subs+0.578
actus+0.557
Neg logit contributions
years-0.394
 questions-0.393
ruits-0.368
year-0.352
nam-0.347
Token the
Token ID262
Feature activation+2.10e+00

Pos logit contributions
ww+0.431
burn+0.403
lor+0.402
 subs+0.383
 tou+0.366
Neg logit contributions
years-0.389
 questions-0.387
ruits-0.369
year-0.356
nam-0.350
Token park
Token ID3952
Feature activation+7.00e-02

Pos logit contributions
lor+3.711
ww+3.646
burn+3.542
 subs+3.282
ic+3.197
Neg logit contributions
 questions-3.544
years-3.273
 three-3.180
ruits-3.062
 ten-3.033
Token until
Token ID1566
Feature activation+1.22e+00

Pos logit contributions
ww+0.131
burn+0.123
lor+0.123
 subs+0.118
 tou+0.113
Neg logit contributions
years-0.104
 questions-0.103
ruits-0.098
year-0.094
nam-0.093
Token the
Token ID262
Feature activation+1.29e+00

Pos logit contributions
ww+2.117
lor+2.092
burn+1.961
actus+1.857
 subs+1.849
Neg logit contributions
years-1.974
 questions-1.965
ruits-1.857
nam-1.834
year-1.805
Token sun
Token ID4252
Feature activation+3.95e+00

Pos logit contributions
ww+1.721
lor+1.698
burn+1.578
 subs+1.460
actus+1.355
Neg logit contributions
 questions-2.649
years-2.537
ruits-2.477
 three-2.399
year-2.364
Token went
Token ID1816
Feature activation+1.80e+00

Pos logit contributions
ww+10.472
lor+10.029
rob+8.711
burn+8.576
www+8.499
Neg logit contributions
 three-3.820
 liked-3.733
 questions-3.661
 counted-3.529
 want-3.524
Token down
Token ID866
Feature activation+8.27e-01

Pos logit contributions
lor+3.579
ww+3.562
burn+3.211
actus+3.142
 subs+3.105
Neg logit contributions
 questions-2.585
years-2.510
nam-2.390
year-2.318
ruits-2.303
Token.
Token ID13
Feature activation+6.07e-01

Pos logit contributions
ww+1.632
lor+1.538
burn+1.520
 subs+1.432
actus+1.407
Neg logit contributions
 questions-1.181
years-1.172
ruits-1.093
year-1.083
nam-1.039
Token<|endoftext|>
Token ID50256
Feature activation+1.09e+00

Pos logit contributions
ww+1.070
lor+1.005
burn+1.003
 subs+0.950
actus+0.910
Neg logit contributions
years-0.969
 questions-0.969
ruits-0.928
year-0.894
nam-0.874
TokenOnce
Token ID7454
Feature activation-2.23e-01

Pos logit contributions
ww+1.932
burn+1.779
lor+1.751
 subs+1.688
 tou+1.601
Neg logit contributions
 questions-1.766
years-1.727
ruits-1.672
year-1.622
 three-1.562
Token continued
Token ID3767
Feature activation+1.53e+00

Pos logit contributions
 questions+0.213
years+0.210
ruits+0.202
year+0.190
nam+0.187
Neg logit contributions
ww-0.315
burn-0.297
lor-0.292
 subs-0.286
 tou-0.274
Token to
Token ID284
Feature activation-7.51e-02

Pos logit contributions
ww+3.181
lor+3.070
burn+2.861
actus+2.804
 subs+2.742
Neg logit contributions
 questions-2.139
years-2.078
ruits-1.870
 three-1.844
year-1.837
Token play
Token ID711
Feature activation+1.33e-01

Pos logit contributions
years+0.101
 questions+0.100
ruits+0.094
year+0.092
nam+0.088
Neg logit contributions
ww-0.153
burn-0.146
lor-0.143
 subs-0.139
 tou-0.134
Token until
Token ID1566
Feature activation+2.40e+00

Pos logit contributions
ww+0.253
lor+0.237
burn+0.236
 subs+0.226
actus+0.217
Neg logit contributions
years-0.197
 questions-0.195
ruits-0.186
year-0.178
nam-0.177
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
lor+4.656
ww+4.533
ic+4.165
burn+4.135
actus+3.957
Neg logit contributions
years-3.610
 questions-3.464
nam-3.421
 three-3.414
ruits-3.265
Token sun
Token ID4252
Feature activation+3.88e+00

Pos logit contributions
ww+1.598
lor+1.546
burn+1.495
 subs+1.376
 tou+1.306
Neg logit contributions
 questions-2.168
years-2.110
ruits-2.020
year-1.978
 three-1.969
Token went
Token ID1816
Feature activation+2.06e+00

Pos logit contributions
ww+9.902
lor+9.432
rob+8.425
www+8.073
burn+8.059
Neg logit contributions
 three-4.239
 questions-3.857
 curiously-3.784
 anx-3.759
 if-3.622
Token down
Token ID866
Feature activation+1.73e+00

Pos logit contributions
lor+4.300
ww+4.249
burn+3.873
actus+3.769
 subs+3.748
Neg logit contributions
 questions-2.810
years-2.672
 three-2.513
year-2.474
nam-2.470
Token.
Token ID13
Feature activation+2.18e+00

Pos logit contributions
ww+3.911
lor+3.737
burn+3.648
actus+3.398
 subs+3.383
Neg logit contributions
 questions-2.120
years-2.042
year-1.933
 three-1.883
ruits-1.852
Token It
Token ID632
Feature activation+2.78e+00

Pos logit contributions
ww+4.828
burn+4.610
lor+4.586
 subs+4.379
rob+4.362
Neg logit contributions
years-2.545
 questions-2.484
ruits-2.403
 One-2.356
 three-2.329
Token was
Token ID373
Feature activation+9.32e-01

Pos logit contributions
ww+5.800
lor+5.590
burn+5.460
actus+4.991
rob+4.949
Neg logit contributions
 three-3.632
years-3.601
 questions-3.590
 curiously-3.447
 especially-3.384
Token in
Token ID287
Feature activation-4.22e-02

Pos logit contributions
years+0.392
 questions+0.375
ruits+0.365
year+0.351
nam+0.347
Neg logit contributions
ww-0.509
burn-0.474
lor-0.473
 subs-0.453
 tou-0.441
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
years+0.070
 questions+0.069
ruits+0.067
year+0.065
nam+0.063
Neg logit contributions
ww-0.071
burn-0.067
lor-0.066
 subs-0.063
 tou-0.061
Token park
Token ID3952
Feature activation-5.14e-01

Pos logit contributions
ww+1.899
lor+1.842
burn+1.781
 subs+1.688
 chip+1.589
Neg logit contributions
 questions-2.228
years-2.146
ruits-2.046
year-1.998
nam-1.974
Token until
Token ID1566
Feature activation+1.45e+00

Pos logit contributions
years+0.707
 questions+0.686
ruits+0.659
year+0.640
nam+0.637
Neg logit contributions
ww-1.001
burn-0.958
lor-0.945
 subs-0.915
 chip-0.885
Token the
Token ID262
Feature activation+7.65e-01

Pos logit contributions
ww+2.505
lor+2.471
burn+2.307
 subs+2.184
actus+2.181
Neg logit contributions
years-2.418
 questions-2.407
ruits-2.255
nam-2.254
 three-2.223
Token sun
Token ID4252
Feature activation+3.85e+00

Pos logit contributions
ww+0.973
lor+0.914
burn+0.882
 subs+0.815
actus+0.761
Neg logit contributions
 questions-1.617
years-1.587
ruits-1.536
year-1.490
nam-1.478
Token came
Token ID1625
Feature activation+5.69e-01

Pos logit contributions
ww+9.731
lor+9.264
rob+8.202
burn+7.862
www+7.731
Neg logit contributions
 three-4.326
 questions-4.102
 liked-4.055
 curiously-3.837
 anx-3.790
Token out
Token ID503
Feature activation+4.21e-01

Pos logit contributions
ww+1.016
lor+0.952
burn+0.935
 subs+0.892
actus+0.864
Neg logit contributions
 questions-0.900
years-0.897
ruits-0.848
year-0.821
nam-0.813
Token and
Token ID290
Feature activation+1.66e+00

Pos logit contributions
ww+0.808
lor+0.759
burn+0.755
 subs+0.720
actus+0.695
Neg logit contributions
years-0.612
 questions-0.612
ruits-0.576
year-0.556
nam-0.546
Token the
Token ID262
Feature activation+7.82e-01

Pos logit contributions
ww+3.088
lor+3.054
burn+2.737
actus+2.660
 subs+2.641
Neg logit contributions
 questions-2.626
years-2.579
nam-2.538
ruits-2.415
 three-2.403
Token fog
Token ID19558
Feature activation+2.65e+00

Pos logit contributions
ww+1.077
lor+1.007
burn+0.977
 subs+0.908
actus+0.851
Neg logit contributions
 questions-1.573
years-1.545
ruits-1.488
year-1.441
nam-1.439
Token slide
Token ID10649
Feature activation+7.72e-01

Pos logit contributions
years+0.135
 questions+0.134
ruits+0.127
year+0.124
nam+0.120
Neg logit contributions
ww-0.135
burn-0.126
lor-0.122
 subs-0.118
 tou-0.114
Token all
Token ID477
Feature activation+1.37e+00

Pos logit contributions
ww+1.578
lor+1.511
burn+1.467
 subs+1.413
 tou+1.370
Neg logit contributions
 questions-1.039
years-1.024
ruits-0.968
year-0.926
nam-0.915
Token day
Token ID1110
Feature activation+1.03e+00

Pos logit contributions
lor+2.358
ww+2.343
burn+2.178
 subs+2.071
actus+2.029
Neg logit contributions
 questions-2.272
years-2.258
ruits-2.156
year-2.134
 three-2.047
Token until
Token ID1566
Feature activation+1.67e+00

Pos logit contributions
ww+2.060
lor+1.963
burn+1.945
 subs+1.837
 tou+1.799
Neg logit contributions
 questions-1.439
years-1.425
ruits-1.362
nam-1.292
year-1.284
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
lor+2.911
ww+2.875
burn+2.665
actus+2.574
 subs+2.542
Neg logit contributions
 questions-2.701
years-2.667
nam-2.541
 three-2.524
ruits-2.522
Token sun
Token ID4252
Feature activation+3.81e+00

Pos logit contributions
lor+1.815
ww+1.813
burn+1.674
 subs+1.543
actus+1.482
Neg logit contributions
 questions-2.803
years-2.639
ruits-2.632
 three-2.564
year-2.498
Token went
Token ID1816
Feature activation+1.90e+00

Pos logit contributions
ww+9.278
lor+9.256
rob+7.924
actus+7.760
burn+7.735
Neg logit contributions
 three-4.440
 questions-3.904
 liked-3.879
 counted-3.615
adies-3.606
Token down
Token ID866
Feature activation+1.15e+00

Pos logit contributions
lor+3.904
ww+3.833
burn+3.476
 subs+3.414
actus+3.359
Neg logit contributions
 questions-2.656
years-2.561
nam-2.410
year-2.365
ruits-2.351
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
ww+2.370
lor+2.251
burn+2.208
 subs+2.084
actus+2.045
Neg logit contributions
 questions-1.546
years-1.516
year-1.421
ruits-1.409
 three-1.342
Token They
Token ID1119
Feature activation-2.04e-01

Pos logit contributions
ww+2.769
lor+2.644
burn+2.593
 subs+2.489
actus+2.429
Neg logit contributions
 questions-1.557
years-1.504
ruits-1.454
nam-1.365
year-1.359
Token had
Token ID550
Feature activation+7.32e-01

Pos logit contributions
 questions+0.307
years+0.304
ruits+0.291
year+0.278
nam+0.270
Neg logit contributions
ww-0.381
burn-0.359
lor-0.347
 subs-0.342
 tou-0.329
Token nice
Token ID3621
Feature activation+4.15e-01

Pos logit contributions
ruits+1.611
 questions+1.611
years+1.506
adies+1.429
 whether+1.400
Neg logit contributions
ww-2.112
 subs-1.832
burn-1.809
 tou-1.797
lor-1.794
Token!"
Token ID2474
Feature activation+1.53e+00

Pos logit contributions
ww+0.754
lor+0.705
burn+0.703
 subs+0.670
actus+0.643
Neg logit contributions
 questions-0.640
years-0.640
ruits-0.606
year-0.584
nam-0.575
Token The
Token ID383
Feature activation+5.44e-01

Pos logit contributions
ww+3.539
lor+3.335
burn+3.309
 subs+3.202
 tou+3.073
Neg logit contributions
years-1.725
 questions-1.707
ruits-1.572
nam-1.512
year-1.490
Token lady
Token ID10846
Feature activation+5.01e-01

Pos logit contributions
ww+0.578
lor+0.516
burn+0.513
 subs+0.465
actus+0.429
Neg logit contributions
 questions-1.269
years-1.263
ruits-1.221
year-1.193
nam-1.181
Token smiled
Token ID13541
Feature activation+5.67e-01

Pos logit contributions
ww+1.006
lor+0.947
burn+0.943
 subs+0.900
actus+0.870
Neg logit contributions
years-0.685
 questions-0.681
ruits-0.638
year-0.618
nam-0.603
Token and
Token ID290
Feature activation+3.78e+00

Pos logit contributions
ww+1.220
lor+1.151
burn+1.143
 subs+1.096
actus+1.062
Neg logit contributions
 questions-0.699
years-0.698
ruits-0.643
year-0.624
nam-0.607
Token said
Token ID531
Feature activation+1.74e-01

Pos logit contributions
lor+9.008
ww+8.371
burn+8.232
ic+7.761
actus+7.733
Neg logit contributions
years-4.708
 three-4.667
 if-4.544
 questions-4.486
nam-4.454
Token,
Token ID11
Feature activation+6.05e-01

Pos logit contributions
ww+0.364
lor+0.342
burn+0.340
 subs+0.328
 tou+0.315
Neg logit contributions
years-0.225
 questions-0.224
ruits-0.211
year-0.200
nam-0.198
Token "
Token ID366
Feature activation+1.95e+00

Pos logit contributions
ww+1.225
lor+1.151
burn+1.147
 subs+1.097
actus+1.063
Neg logit contributions
years-0.814
 questions-0.813
ruits-0.761
year-0.734
nam-0.724
TokenI
Token ID40
Feature activation+2.59e-02

Pos logit contributions
ww+3.014
burn+2.917
lor+2.896
 subs+2.761
actus+2.707
Neg logit contributions
years-3.644
year-3.389
 questions-3.367
ruits-3.261
nam-3.249
Token'm
Token ID1101
Feature activation+1.03e+00

Pos logit contributions
ww+0.044
lor+0.041
burn+0.041
 subs+0.039
actus+0.037
Neg logit contributions
years-0.044
 questions-0.043
ruits-0.041
nam-0.040
year-0.040
Token that
Token ID326
Feature activation+1.10e+00

Pos logit contributions
ww+1.827
lor+1.751
burn+1.740
 subs+1.651
 tou+1.614
Neg logit contributions
 questions-1.279
years-1.256
ruits-1.190
year-1.130
nam-1.106
Token they
Token ID484
Feature activation+6.27e-01

Pos logit contributions
ww+1.797
lor+1.743
burn+1.638
 subs+1.563
actus+1.551
Neg logit contributions
 questions-1.933
years-1.913
ruits-1.842
nam-1.814
 three-1.747
Token played
Token ID2826
Feature activation+1.76e-01

Pos logit contributions
ww+1.194
lor+1.132
burn+1.119
 subs+1.063
actus+1.028
Neg logit contributions
years-0.924
 questions-0.915
ruits-0.866
year-0.834
nam-0.833
Token until
Token ID1566
Feature activation+1.80e+00

Pos logit contributions
ww+0.329
lor+0.307
burn+0.307
 subs+0.292
actus+0.281
Neg logit contributions
years-0.265
 questions-0.263
ruits-0.250
year-0.241
nam-0.237
Token the
Token ID262
Feature activation+1.78e+00

Pos logit contributions
lor+3.288
ww+3.223
burn+2.973
actus+2.905
 subs+2.867
Neg logit contributions
 questions-2.814
years-2.785
nam-2.712
 three-2.680
ruits-2.622
Token sun
Token ID4252
Feature activation+3.78e+00

Pos logit contributions
lor+2.567
ww+2.485
burn+2.366
 subs+2.147
rob+2.055
Neg logit contributions
 questions-3.562
years-3.362
ruits-3.282
 three-3.265
nam-3.159
Token went
Token ID1816
Feature activation+2.14e+00

Pos logit contributions
ww+9.226
lor+9.205
rob+7.938
actus+7.835
burn+7.690
Neg logit contributions
 three-4.194
 questions-4.056
 want-3.901
 liked-3.897
ruits-3.711
Token down
Token ID866
Feature activation+1.18e+00

Pos logit contributions
lor+4.587
ww+4.460
burn+4.006
actus+3.975
 subs+3.961
Neg logit contributions
 questions-2.888
years-2.690
nam-2.602
 three-2.567
year-2.501
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
ww+2.479
lor+2.360
burn+2.319
 subs+2.191
actus+2.159
Neg logit contributions
 questions-1.553
years-1.510
year-1.407
ruits-1.396
 three-1.345
Token From
Token ID3574
Feature activation+1.25e+00

Pos logit contributions
ww+2.180
lor+2.044
burn+2.040
 subs+1.944
actus+1.896
Neg logit contributions
 questions-1.558
years-1.555
ruits-1.488
year-1.434
nam-1.424
Token that
Token ID326
Feature activation+5.24e-01

Pos logit contributions
ww+2.591
lor+2.547
burn+2.491
 subs+2.426
 tou+2.285
Neg logit contributions
 questions-1.589
years-1.575
ruits-1.443
nam-1.405
year-1.394
Token they
Token ID484
Feature activation+4.95e-01

Pos logit contributions
ww+3.107
lor+3.017
burn+2.860
 subs+2.761
actus+2.755
Neg logit contributions
 questions-2.291
years-2.225
ruits-2.097
nam-2.084
year-2.009
Token all
Token ID477
Feature activation+3.71e-01

Pos logit contributions
ww+1.062
lor+1.006
burn+0.998
 subs+0.959
actus+0.929
Neg logit contributions
years-0.609
 questions-0.604
ruits-0.563
year-0.541
nam-0.532
Token played
Token ID2826
Feature activation-6.73e-04

Pos logit contributions
ww+0.716
lor+0.672
burn+0.670
 subs+0.640
actus+0.616
Neg logit contributions
years-0.533
 questions-0.532
ruits-0.502
year-0.483
nam-0.476
Token until
Token ID1566
Feature activation+1.88e+00

Pos logit contributions
years+0.001
 questions+0.001
ruits+0.001
year+0.001
nam+0.001
Neg logit contributions
ww-0.001
lor-0.001
burn-0.001
 subs-0.001
actus-0.001
Token the
Token ID262
Feature activation+1.48e+00

Pos logit contributions
lor+3.378
ww+3.340
burn+3.048
actus+3.020
 subs+2.949
Neg logit contributions
years-2.983
 questions-2.962
nam-2.885
 three-2.813
year-2.753
Token sun
Token ID4252
Feature activation+3.75e+00

Pos logit contributions
ww+1.971
lor+1.958
burn+1.831
 subs+1.689
actus+1.573
Neg logit contributions
 questions-3.054
years-2.920
ruits-2.833
 three-2.789
year-2.729
Token went
Token ID1816
Feature activation+2.29e+00

Pos logit contributions
ww+9.136
lor+8.763
 hawk+7.690
rob+7.677
actus+7.540
Neg logit contributions
 three-4.086
 questions-3.932
 liked-3.753
 want-3.689
 anx-3.657
Token down
Token ID866
Feature activation+1.64e+00

Pos logit contributions
lor+4.891
ww+4.791
rob+4.243
actus+4.240
burn+4.233
Neg logit contributions
 questions-3.045
years-2.861
nam-2.728
 three-2.719
year-2.686
Token.
Token ID13
Feature activation+9.54e-01

Pos logit contributions
ww+3.550
lor+3.380
burn+3.287
 subs+3.117
actus+3.098
Neg logit contributions
 questions-2.090
years-2.010
year-1.908
 three-1.842
ruits-1.842
Token They
Token ID1119
Feature activation+5.06e-01

Pos logit contributions
ww+1.940
burn+1.837
lor+1.834
 subs+1.751
actus+1.698
Neg logit contributions
years-1.270
 questions-1.265
ruits-1.214
year-1.169
nam-1.131
Token were
Token ID547
Feature activation+1.20e+00

Pos logit contributions
ww+1.060
lor+1.004
burn+0.996
 subs+0.954
actus+0.923
Neg logit contributions
years-0.650
 questions-0.645
ruits-0.604
year-0.582
nam-0.573
Token laughing
Token ID14376
Feature activation+1.93e-01

Pos logit contributions
ww+0.774
lor+0.727
burn+0.723
 subs+0.691
actus+0.665
Neg logit contributions
years-0.581
 questions-0.579
ruits-0.547
year-0.526
nam-0.519
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
ww+0.387
lor+0.363
burn+0.363
 subs+0.348
actus+0.335
Neg logit contributions
years-0.263
 questions-0.261
ruits-0.247
year-0.236
nam-0.233
Token playing
Token ID2712
Feature activation-2.81e-02

Pos logit contributions
ww+2.810
lor+2.766
burn+2.603
 subs+2.499
actus+2.494
Neg logit contributions
years-1.769
 questions-1.730
nam-1.591
ruits-1.574
year-1.531
Token until
Token ID1566
Feature activation+1.39e+00

Pos logit contributions
years+0.042
 questions+0.042
ruits+0.040
year+0.038
nam+0.038
Neg logit contributions
ww-0.052
lor-0.049
burn-0.048
 subs-0.046
actus-0.045
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
ww+2.445
lor+2.406
burn+2.300
actus+2.167
 subs+2.135
Neg logit contributions
 questions-2.253
years-2.229
ruits-2.094
 three-2.074
year-2.055
Token sun
Token ID4252
Feature activation+3.74e+00

Pos logit contributions
ww+1.338
lor+1.259
burn+1.217
 subs+1.119
actus+1.047
Neg logit contributions
 questions-2.079
years-2.017
ruits-1.955
year-1.893
nam-1.880
Token set
Token ID900
Feature activation+2.15e+00

Pos logit contributions
ww+8.911
lor+8.447
rob+7.440
burn+7.416
 hawk+7.277
Neg logit contributions
 three-4.624
 questions-4.444
 anx-4.062
 curiously-4.052
year-3.993
Token and
Token ID290
Feature activation+1.67e+00

Pos logit contributions
ww+4.704
lor+4.509
burn+4.289
actus+4.235
rob+4.136
Neg logit contributions
 questions-2.739
years-2.510
ruits-2.467
 three-2.450
year-2.429
Token it
Token ID340
Feature activation+2.70e+00

Pos logit contributions
ww+3.261
lor+3.209
burn+2.992
 subs+2.887
actus+2.879
Neg logit contributions
 questions-2.430
years-2.377
nam-2.259
 three-2.238
ruits-2.228
Token was
Token ID373
Feature activation+9.09e-01

Pos logit contributions
ww+5.485
lor+5.131
burn+5.023
 tou+4.612
actus+4.593
Neg logit contributions
 questions-3.914
 three-3.685
years-3.638
year-3.484
 anx-3.439
Token time
Token ID640
Feature activation-9.14e-02

Pos logit contributions
ww+1.691
lor+1.611
burn+1.553
 subs+1.484
actus+1.474
Neg logit contributions
 questions-1.368
years-1.353
ruits-1.288
year-1.248
nam-1.230
Token you
Token ID345
Feature activation-1.66e-01

Pos logit contributions
 questions+0.520
years+0.518
ruits+0.505
year+0.495
nam+0.477
Neg logit contributions
ww-0.313
burn-0.274
lor-0.273
 subs-0.255
 tou-0.241
Token okay
Token ID8788
Feature activation-4.96e-01

Pos logit contributions
 questions+0.171
years+0.171
ruits+0.161
year+0.150
nam+0.147
Neg logit contributions
ww-0.390
burn-0.376
lor-0.366
 subs-0.357
 tou-0.343
Token,
Token ID11
Feature activation+3.84e-01

Pos logit contributions
years+0.729
 questions+0.695
ruits+0.666
year+0.666
nam+0.643
Neg logit contributions
ww-0.981
burn-0.937
lor-0.890
 subs-0.872
 chip-0.845
Token my
Token ID616
Feature activation+4.57e-02

Pos logit contributions
ww+0.711
lor+0.667
burn+0.664
 subs+0.633
actus+0.608
Neg logit contributions
years-0.586
 questions-0.586
ruits-0.554
year-0.533
nam-0.527
Token d
Token ID288
Feature activation+5.26e-01

Pos logit contributions
ww+0.086
burn+0.080
lor+0.080
 subs+0.076
actus+0.074
Neg logit contributions
years-0.070
 questions-0.068
ruits-0.065
year-0.063
nam-0.063
Tokenarl
Token ID7063
Feature activation+3.73e+00

Pos logit contributions
ww+0.729
lor+0.667
burn+0.662
 subs+0.624
 tou+0.588
Neg logit contributions
years-1.048
 questions-1.045
ruits-1.007
year-0.978
nam-0.957
Tokenings
Token ID654
Feature activation+1.23e+00

Pos logit contributions
ww+5.214
lor+4.877
burn+4.534
 subs+4.474
 tou+4.347
Neg logit contributions
ruits-7.751
 questions-7.676
 ago-7.133
 if-6.944
 want-6.923
Token?"
Token ID1701
Feature activation-1.34e-01

Pos logit contributions
ww+2.704
lor+2.635
burn+2.589
 subs+2.521
 tou+2.426
Neg logit contributions
 questions-1.450
years-1.370
ruits-1.321
year-1.254
adies-1.184
Token she
Token ID673
Feature activation+2.16e-01

Pos logit contributions
years+0.260
 questions+0.256
ruits+0.249
year+0.242
nam+0.236
Neg logit contributions
ww-0.191
burn-0.174
lor-0.171
 subs-0.165
 tou-0.159
Token asked
Token ID1965
Feature activation-1.00e+00

Pos logit contributions
ww+0.436
burn+0.410
lor+0.410
 subs+0.393
 tou+0.378
Neg logit contributions
years-0.290
 questions-0.288
ruits-0.273
year-0.261
nam-0.255
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
years+1.778
 questions+1.717
ruits+1.682
year+1.646
nam+1.581
Neg logit contributions
burn-1.491
ww-1.472
lor-1.399
 subs-1.377
 chip-1.348
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
ww+1.079
lor+1.007
burn+1.005
 subs+0.959
actus+0.928
Neg logit contributions
 questions-0.794
years-0.792
ruits-0.744
year-0.719
nam-0.703
Token The
Token ID383
Feature activation-9.56e-01

Pos logit contributions
ww+5.295
burn+5.124
lor+4.735
 subs+4.697
actus+4.662
Neg logit contributions
years-3.320
 questions-3.305
year-3.144
ruits-3.124
 One-3.006
Token little
Token ID1310
Feature activation-7.69e-02

Pos logit contributions
years+1.517
 questions+1.405
ruits+1.383
nam+1.345
year+1.331
Neg logit contributions
ww-1.788
burn-1.624
lor-1.610
actus-1.581
 chip-1.579
Token girl
Token ID2576
Feature activation+1.69e+00

Pos logit contributions
years+0.146
 questions+0.144
ruits+0.138
year+0.134
nam+0.133
Neg logit contributions
ww-0.116
burn-0.106
lor-0.104
 subs-0.099
 tou-0.096
Token smiled
Token ID13541
Feature activation+5.00e-01

Pos logit contributions
ww+3.628
burn+3.461
lor+3.398
 subs+3.236
actus+3.143
Neg logit contributions
years-2.084
 questions-2.076
 three-1.860
ruits-1.849
year-1.831
Token and
Token ID290
Feature activation+3.72e+00

Pos logit contributions
ww+1.034
lor+0.974
burn+0.970
 subs+0.929
actus+0.900
Neg logit contributions
years-0.653
 questions-0.653
ruits-0.607
year-0.587
nam-0.574
Token went
Token ID1816
Feature activation+2.46e+00

Pos logit contributions
lor+8.526
burn+8.184
ww+8.092
actus+7.779
ic+7.729
Neg logit contributions
years-4.737
 three-4.627
year-4.421
 questions-4.413
 curious-4.273
Token on
Token ID319
Feature activation+6.89e-01

Pos logit contributions
ww+4.418
lor+4.019
burn+3.926
 subs+3.841
actus+3.750
Neg logit contributions
 questions-4.353
years-4.084
year-3.867
 three-3.838
ruits-3.764
Token her
Token ID607
Feature activation+7.91e-01

Pos logit contributions
ww+1.387
lor+1.302
burn+1.300
 subs+1.236
actus+1.212
Neg logit contributions
 questions-0.962
years-0.956
ruits-0.886
year-0.860
nam-0.847
Token way
Token ID835
Feature activation+7.85e-01

Pos logit contributions
ww+1.038
lor+0.948
burn+0.941
 subs+0.868
actus+0.826
Neg logit contributions
 questions-1.650
years-1.619
ruits-1.554
nam-1.517
year-1.496
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
ww+1.597
lor+1.490
burn+1.482
 subs+1.420
actus+1.369
Neg logit contributions
 questions-1.068
years-1.062
ruits-1.000
year-0.950
nam-0.945
Token around
Token ID1088
Feature activation-5.05e-01

Pos logit contributions
ww+3.130
lor+2.976
burn+2.926
 subs+2.815
actus+2.710
Neg logit contributions
 questions-1.845
years-1.742
ruits-1.645
 three-1.600
year-1.592
Token the
Token ID262
Feature activation+9.94e-01

Pos logit contributions
years+0.803
 questions+0.791
ruits+0.781
year+0.737
nam+0.730
Neg logit contributions
ww-0.869
lor-0.813
burn-0.811
 subs-0.784
 chip-0.752
Token garden
Token ID11376
Feature activation+5.90e-01

Pos logit contributions
ww+1.541
burn+1.433
lor+1.433
 subs+1.363
 tou+1.240
Neg logit contributions
 questions-1.839
years-1.809
ruits-1.740
nam-1.671
year-1.663
Token until
Token ID1566
Feature activation+1.62e+00

Pos logit contributions
ww+1.184
lor+1.107
burn+1.104
 subs+1.055
actus+1.020
Neg logit contributions
 questions-0.810
years-0.808
ruits-0.760
year-0.727
nam-0.716
Token the
Token ID262
Feature activation+8.17e-01

Pos logit contributions
ww+2.812
lor+2.758
burn+2.611
 subs+2.461
actus+2.441
Neg logit contributions
 questions-2.665
years-2.663
ruits-2.535
nam-2.530
 three-2.509
Token sun
Token ID4252
Feature activation+3.71e+00

Pos logit contributions
ww+1.068
lor+1.001
burn+0.975
 subs+0.907
 tou+0.838
Neg logit contributions
 questions-1.699
years-1.672
ruits-1.613
nam-1.566
year-1.558
Token went
Token ID1816
Feature activation+2.08e+00

Pos logit contributions
ww+9.179
lor+8.662
rob+7.786
burn+7.630
 masse+7.339
Neg logit contributions
 three-4.301
 liked-4.048
 questions-3.967
ruits-3.850
 curiously-3.842
Token down
Token ID866
Feature activation+9.44e-01

Pos logit contributions
ww+4.340
lor+4.313
burn+3.907
 subs+3.820
rob+3.794
Neg logit contributions
 questions-2.850
years-2.718
nam-2.575
year-2.529
ruits-2.521
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
ww+1.898
lor+1.779
burn+1.768
 subs+1.670
actus+1.627
Neg logit contributions
 questions-1.317
years-1.305
ruits-1.220
year-1.205
nam-1.151
Token They
Token ID1119
Feature activation+7.37e-01

Pos logit contributions
ww+2.624
burn+2.498
lor+2.457
 subs+2.343
actus+2.264
Neg logit contributions
 questions-1.993
years-1.989
ruits-1.930
year-1.841
nam-1.782
Token became
Token ID2627
Feature activation+1.23e+00

Pos logit contributions
ww+1.538
lor+1.461
burn+1.446
 subs+1.380
actus+1.335
Neg logit contributions
years-0.958
 questions-0.942
ruits-0.883
year-0.853
nam-0.846
Token flew
Token ID13112
Feature activation+1.36e+00

Pos logit contributions
ww+4.430
lor+4.303
burn+4.182
actus+3.904
 tou+3.754
Neg logit contributions
 questions-3.409
years-3.251
year-3.148
ruits-2.968
 three-2.957
Token by
Token ID416
Feature activation-5.14e-01

Pos logit contributions
ww+2.717
lor+2.619
burn+2.565
 subs+2.433
actus+2.390
Neg logit contributions
 questions-1.936
years-1.858
ruits-1.721
year-1.704
 three-1.680
Token and
Token ID290
Feature activation+1.10e+00

Pos logit contributions
years+0.751
 questions+0.732
ruits+0.714
nam+0.684
year+0.681
Neg logit contributions
ww-0.984
burn-0.920
lor-0.910
 subs-0.884
 chip-0.841
Token soon
Token ID2582
Feature activation+2.75e+00

Pos logit contributions
ww+2.092
lor+2.006
burn+1.964
 subs+1.847
actus+1.833
Neg logit contributions
years-1.636
 questions-1.630
ruits-1.506
nam-1.479
year-1.472
Token the
Token ID262
Feature activation+6.35e-01

Pos logit contributions
ww+6.161
burn+6.029
lor+5.989
actus+5.511
rob+5.459
Neg logit contributions
 questions-3.361
years-3.051
 if-3.020
 three-3.017
 ten-2.988
Token sun
Token ID4252
Feature activation+3.71e+00

Pos logit contributions
ww+0.862
lor+0.804
burn+0.785
 subs+0.734
 tou+0.686
Neg logit contributions
 questions-1.293
years-1.283
ruits-1.227
year-1.191
nam-1.186
Token went
Token ID1816
Feature activation+2.30e+00

Pos logit contributions
ww+8.848
lor+8.238
rob+7.376
burn+7.275
actus+7.132
Neg logit contributions
 three-4.542
 questions-4.359
 liked-4.203
year-4.165
 curiously-4.123
Token down
Token ID866
Feature activation+1.16e+00

Pos logit contributions
lor+4.850
ww+4.836
burn+4.420
actus+4.243
 subs+4.227
Neg logit contributions
 questions-3.085
years-2.948
nam-2.736
year-2.705
 three-2.679
Token and
Token ID290
Feature activation+1.90e+00

Pos logit contributions
ww+2.389
lor+2.244
burn+2.232
 subs+2.085
actus+2.048
Neg logit contributions
 questions-1.590
years-1.564
year-1.449
ruits-1.440
adies-1.367
Token it
Token ID340
Feature activation+3.18e+00

Pos logit contributions
ww+3.864
lor+3.838
burn+3.533
actus+3.429
 subs+3.385
Neg logit contributions
 questions-2.634
years-2.563
nam-2.444
 three-2.379
ruits-2.289
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
ww+6.912
burn+6.454
lor+6.410
 chip+5.736
 Cathy+5.735
Neg logit contributions
 questions-4.158
 three-3.998
years-3.986
 liked-3.848
 especially-3.799
Token the
Token ID262
Feature activation+9.02e-01

Pos logit contributions
ww+0.267
burn+0.250
lor+0.248
 subs+0.237
 tou+0.227
Neg logit contributions
years-0.239
 questions-0.239
ruits-0.228
year-0.219
nam-0.215
Token k
Token ID479
Feature activation+1.63e+00

Pos logit contributions
ww+0.940
lor+0.869
burn+0.810
 subs+0.747
actus+0.699
Neg logit contributions
 questions-2.101
years-2.077
ruits-2.016
nam-1.979
year-1.954
Tokenite
Token ID578
Feature activation+8.72e-01

Pos logit contributions
ww+2.100
lor+1.976
burn+1.840
 subs+1.826
 tou+1.653
Neg logit contributions
years-3.483
 questions-3.279
ruits-3.242
nam-3.226
year-3.212
Token until
Token ID1566
Feature activation+1.62e+00

Pos logit contributions
ww+1.729
lor+1.631
burn+1.594
 subs+1.534
actus+1.490
Neg logit contributions
 questions-1.219
years-1.207
ruits-1.134
nam-1.082
year-1.082
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
lor+2.919
ww+2.881
burn+2.656
actus+2.566
 subs+2.534
Neg logit contributions
years-2.538
 questions-2.513
nam-2.409
ruits-2.375
 three-2.348
Token sun
Token ID4252
Feature activation+3.71e+00

Pos logit contributions
ww+2.036
lor+2.002
burn+1.831
 subs+1.741
actus+1.691
Neg logit contributions
 questions-2.660
years-2.570
ruits-2.501
 three-2.481
nam-2.433
Token went
Token ID1816
Feature activation+2.11e+00

Pos logit contributions
ww+9.156
lor+8.719
rob+7.745
actus+7.593
 masse+7.468
Neg logit contributions
 three-4.084
 liked-3.735
 questions-3.693
 want-3.667
 if-3.620
Token down
Token ID866
Feature activation+1.36e+00

Pos logit contributions
lor+4.428
ww+4.355
burn+3.926
actus+3.885
 subs+3.854
Neg logit contributions
 questions-2.846
years-2.745
nam-2.606
year-2.540
 three-2.537
Token.
Token ID13
Feature activation+9.79e-01

Pos logit contributions
ww+2.884
lor+2.741
burn+2.687
 subs+2.514
actus+2.510
Neg logit contributions
 questions-1.792
years-1.771
year-1.657
ruits-1.618
 three-1.582
Token The
Token ID383
Feature activation+5.46e-01

Pos logit contributions
ww+1.893
lor+1.795
burn+1.786
 subs+1.698
actus+1.634
Neg logit contributions
years-1.404
 questions-1.387
ruits-1.334
year-1.289
nam-1.253
Token end
Token ID886
Feature activation+2.25e+00

Pos logit contributions
ww+1.197
lor+1.140
burn+1.127
 subs+1.085
actus+1.050
Neg logit contributions
 questions-0.645
years-0.642
ruits-0.598
year-0.568
nam-0.564
Token."
Token ID526
Feature activation+9.41e-01

Pos logit contributions
ww+1.246
lor+1.171
burn+1.162
 subs+1.118
 tou+1.072
Neg logit contributions
 questions-1.038
years-1.026
ruits-0.977
year-0.940
nam-0.913
Token The
Token ID383
Feature activation+1.10e+00

Pos logit contributions
ww+1.909
lor+1.790
burn+1.782
 subs+1.714
 tou+1.644
Neg logit contributions
years-1.289
 questions-1.286
ruits-1.215
year-1.154
nam-1.147
Token sw
Token ID1509
Feature activation+1.34e+00

Pos logit contributions
ww+1.653
lor+1.582
burn+1.550
 subs+1.450
actus+1.333
Neg logit contributions
 questions-2.103
years-2.062
ruits-1.959
year-1.911
 three-1.894
Tokenan
Token ID272
Feature activation+6.92e-01

Pos logit contributions
ww+1.808
burn+1.661
lor+1.630
 subs+1.530
 chip+1.429
Neg logit contributions
 questions-2.734
years-2.728
ruits-2.615
year-2.534
adies-2.488
Token smiled
Token ID13541
Feature activation+3.41e-01

Pos logit contributions
ww+1.379
lor+1.297
burn+1.292
 subs+1.225
actus+1.182
Neg logit contributions
 questions-0.956
years-0.950
ruits-0.886
year-0.856
nam-0.840
Token and
Token ID290
Feature activation+3.71e+00

Pos logit contributions
ww+0.719
lor+0.678
burn+0.677
 subs+0.649
actus+0.627
Neg logit contributions
years-0.430
 questions-0.429
ruits-0.401
year-0.383
nam-0.377
Token sw
Token ID1509
Feature activation+9.42e-01

Pos logit contributions
lor+8.797
burn+8.050
ww+8.048
ic+7.633
actus+7.381
Neg logit contributions
 three-4.792
years-4.561
 questions-4.511
 liked-4.489
 curious-4.397
Tokenam
Token ID321
Feature activation+1.14e+00

Pos logit contributions
ww+1.991
burn+1.879
lor+1.874
 subs+1.795
 tou+1.736
Neg logit contributions
years-1.196
 questions-1.178
ruits-1.113
year-1.066
adies-1.013
Token away
Token ID1497
Feature activation+1.11e+00

Pos logit contributions
ww+1.862
lor+1.752
burn+1.742
 subs+1.617
actus+1.573
Neg logit contributions
 questions-2.058
years-2.039
year-1.921
ruits-1.903
 three-1.857
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
ww+2.303
lor+2.171
burn+2.139
 subs+1.995
actus+1.986
Neg logit contributions
 questions-1.510
years-1.496
ruits-1.368
year-1.363
nam-1.333
Token
Token ID220
Feature activation+9.88e-01

Pos logit contributions
ww+4.574
burn+4.437
lor+4.315
 subs+4.125
actus+3.962
Neg logit contributions
 questions-3.022
years-2.994
ruits-2.826
year-2.821
 Some-2.750
Token
Token ID198
Feature activation+7.90e-01

Pos logit contributions
ww+1.939
lor+1.825
burn+1.814
 subs+1.724
actus+1.626
Neg logit contributions
 questions-1.812
years-1.766
ruits-1.706
year-1.622
nam-1.611
Token
Token ID198
Feature activation+6.20e-01

Pos logit contributions
ww+1.332
burn+1.246
lor+1.234
 subs+1.183
 tou+1.112
Neg logit contributions
 questions-1.347
years-1.334
ruits-1.272
year-1.238
nam-1.202
TokenL
Token ID43
Feature activation-4.69e-01

Pos logit contributions
ww+1.006
burn+0.941
lor+0.939
 subs+0.892
actus+0.846
Neg logit contributions
years-1.091
 questions-1.088
ruits-1.036
year-1.011
nam-0.990
Tokenily
Token ID813
Feature activation+4.15e-01

Pos logit contributions
years+0.491
 questions+0.474
nam+0.446
ruits+0.421
year+0.405
Neg logit contributions
ww-1.097
lor-1.044
burn-1.029
 subs-0.989
actus-0.973
Token smiled
Token ID13541
Feature activation+5.33e-01

Pos logit contributions
ww+0.871
lor+0.821
burn+0.819
 subs+0.785
actus+0.757
Neg logit contributions
years-0.528
 questions-0.528
ruits-0.493
year-0.471
nam-0.463
Token and
Token ID290
Feature activation+3.69e+00

Pos logit contributions
ww+1.119
lor+1.055
burn+1.047
 subs+1.005
actus+0.968
Neg logit contributions
 questions-0.689
years-0.683
ruits-0.637
year-0.613
nam-0.601
Token hugged
Token ID41130
Feature activation+1.16e+00

Pos logit contributions
lor+9.102
ww+8.788
burn+8.577
ic+8.189
 subs+7.819
Neg logit contributions
 three-4.146
 questions-4.138
years-4.134
 curious-4.123
 want-4.053
Token them
Token ID606
Feature activation+5.40e-02

Pos logit contributions
ww+2.431
lor+2.324
burn+2.204
 subs+2.178
actus+2.082
Neg logit contributions
 questions-1.607
years-1.535
ruits-1.456
nam-1.378
year-1.353
Token.
Token ID13
Feature activation+9.54e-02

Pos logit contributions
ww+0.088
burn+0.083
lor+0.081
 subs+0.077
actus+0.074
Neg logit contributions
years-0.093
 questions-0.092
ruits-0.089
year-0.086
nam-0.084
Token She
Token ID1375
Feature activation+1.34e+00

Pos logit contributions
ww+0.175
burn+0.163
lor+0.163
 subs+0.155
actus+0.149
Neg logit contributions
years-0.147
 questions-0.145
ruits-0.138
year-0.134
nam-0.132
Token said
Token ID531
Feature activation-2.63e-02

Pos logit contributions
ww+2.515
lor+2.351
burn+2.338
 subs+2.190
 tou+2.079
Neg logit contributions
 questions-2.073
years-2.041
ruits-1.918
year-1.871
 three-1.833
Token ball
Token ID2613
Feature activation-1.40e-01

Pos logit contributions
ww+0.427
lor+0.385
burn+0.383
 subs+0.355
actus+0.332
Neg logit contributions
years-0.766
 questions-0.765
ruits-0.737
year-0.718
nam-0.712
Token all
Token ID477
Feature activation+3.95e-01

Pos logit contributions
years+0.203
 questions+0.201
ruits+0.191
year+0.184
nam+0.183
Neg logit contributions
ww-0.266
burn-0.250
lor-0.248
 subs-0.240
actus-0.229
Token day
Token ID1110
Feature activation+1.25e+00

Pos logit contributions
ww+0.594
lor+0.549
burn+0.546
 subs+0.514
actus+0.488
Neg logit contributions
years-0.740
 questions-0.740
ruits-0.708
year-0.687
nam-0.679
Token until
Token ID1566
Feature activation+1.86e+00

Pos logit contributions
ww+2.572
lor+2.448
burn+2.430
 subs+2.325
 tou+2.263
Neg logit contributions
 questions-1.686
years-1.664
ruits-1.584
nam-1.510
year-1.474
Token the
Token ID262
Feature activation+7.09e-01

Pos logit contributions
lor+3.388
ww+3.356
burn+3.127
actus+3.015
 subs+2.992
Neg logit contributions
 questions-2.896
years-2.873
nam-2.751
 three-2.715
ruits-2.657
Token sun
Token ID4252
Feature activation+3.69e+00

Pos logit contributions
ww+0.887
lor+0.831
burn+0.800
 subs+0.745
actus+0.699
Neg logit contributions
 questions-1.503
years-1.485
ruits-1.436
nam-1.390
year-1.388
Token went
Token ID1816
Feature activation+1.94e+00

Pos logit contributions
ww+9.015
lor+8.856
rob+7.892
burn+7.547
 Cathy+7.350
Neg logit contributions
 three-4.177
 questions-3.803
 liked-3.774
 want-3.657
ruits-3.624
Token down
Token ID866
Feature activation+1.39e+00

Pos logit contributions
ww+3.922
lor+3.915
burn+3.512
 subs+3.462
actus+3.413
Neg logit contributions
 questions-2.726
years-2.631
nam-2.484
year-2.441
 three-2.415
Token.
Token ID13
Feature activation+7.68e-01

Pos logit contributions
ww+2.983
lor+2.832
burn+2.775
 subs+2.610
actus+2.586
Neg logit contributions
 questions-1.821
years-1.773
year-1.670
ruits-1.623
 three-1.590
Token Max
Token ID5436
Feature activation+1.42e+00

Pos logit contributions
ww+1.529
lor+1.442
burn+1.434
 subs+1.368
actus+1.324
Neg logit contributions
 questions-1.074
years-1.064
ruits-1.009
nam-0.963
year-0.962
Token was
Token ID373
Feature activation+3.11e-01

Pos logit contributions
ww+2.759
lor+2.652
burn+2.609
 subs+2.457
 tou+2.371
Neg logit contributions
 questions-1.992
years-1.988
ruits-1.836
 three-1.788
nam-1.737
Tokeny
Token ID88
Feature activation+1.18e+00

Pos logit contributions
ww+2.196
lor+2.087
burn+2.032
 subs+1.966
 tou+1.869
Neg logit contributions
years-1.988
 questions-1.980
ruits-1.874
year-1.835
 three-1.778
Token flew
Token ID13112
Feature activation+1.34e+00

Pos logit contributions
ww+2.376
lor+2.221
burn+2.216
 subs+2.109
 tou+2.040
Neg logit contributions
 questions-1.615
years-1.577
ruits-1.478
year-1.424
 three-1.419
Token away
Token ID1497
Feature activation+1.14e+00

Pos logit contributions
ww+2.301
lor+2.185
burn+2.077
 subs+1.960
actus+1.944
Neg logit contributions
 questions-2.334
years-2.303
year-2.155
ruits-2.119
 three-2.096
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
ww+2.522
lor+2.391
burn+2.343
 subs+2.197
actus+2.169
Neg logit contributions
years-1.394
 questions-1.392
year-1.245
ruits-1.243
nam-1.214
Token happy
Token ID3772
Feature activation+1.98e+00

Pos logit contributions
ww+2.831
lor+2.676
burn+2.565
 subs+2.472
actus+2.424
Neg logit contributions
years-1.684
 questions-1.664
nam-1.558
ruits-1.540
year-1.495
Token and
Token ID290
Feature activation+3.69e+00

Pos logit contributions
ww+4.614
lor+4.368
burn+4.293
 subs+4.044
rob+4.023
Neg logit contributions
 questions-2.352
years-2.198
 three-2.052
ruits-1.978
 if-1.973
Token clean
Token ID3424
Feature activation+1.93e+00

Pos logit contributions
lor+9.676
ww+9.142
burn+9.138
ic+8.554
rob+8.524
Neg logit contributions
years-3.968
 curious-3.649
nam-3.646
year-3.573
 want-3.493
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
ww+4.266
lor+3.942
burn+3.847
 subs+3.761
 tou+3.710
Neg logit contributions
 questions-2.531
years-2.353
ruits-2.164
year-2.137
 three-2.135
Token<|endoftext|>
Token ID50256
Feature activation+1.28e+00

Pos logit contributions
ww+4.390
burn+4.299
lor+4.098
 subs+4.043
 pel+3.834
Neg logit contributions
years-3.032
 questions-3.004
year-2.883
ruits-2.879
 three-2.745
TokenOnce
Token ID7454
Feature activation+8.68e-03

Pos logit contributions
ww+2.349
burn+2.175
lor+2.109
 subs+2.053
 tou+1.947
Neg logit contributions
 questions-2.036
years-1.986
ruits-1.913
year-1.862
 three-1.810
Token upon
Token ID2402
Feature activation-5.43e-01

Pos logit contributions
ww+0.024
lor+0.023
burn+0.023
 subs+0.023
actus+0.022
Neg logit contributions
years-0.005
 questions-0.005
ruits-0.004
year-0.004
nam-0.003
Token and
Token ID290
Feature activation+2.81e+00

Pos logit contributions
ww+2.470
lor+2.353
burn+2.328
 subs+2.220
actus+2.152
Neg logit contributions
 questions-1.240
years-1.181
ruits-1.101
year-1.039
 three-1.028
Token free
Token ID1479
Feature activation+1.08e+00

Pos logit contributions
lor+6.548
ww+6.439
burn+6.149
actus+5.957
ic+5.723
Neg logit contributions
years-3.471
nam-3.394
 questions-3.379
 want-3.095
year-3.068
Token.
Token ID13
Feature activation+9.09e-01

Pos logit contributions
ww+2.202
lor+2.057
burn+2.027
 subs+1.922
actus+1.900
Neg logit contributions
 questions-1.519
years-1.490
ruits-1.389
year-1.356
 three-1.317
Token Lily
Token ID20037
Feature activation+1.44e+00

Pos logit contributions
ww+1.737
burn+1.631
lor+1.616
 subs+1.537
actus+1.485
Neg logit contributions
 questions-1.355
years-1.345
ruits-1.287
year-1.236
nam-1.212
Token smiled
Token ID13541
Feature activation+4.87e-01

Pos logit contributions
ww+3.005
burn+2.783
lor+2.778
 subs+2.637
 tou+2.529
Neg logit contributions
years-1.915
 questions-1.904
ruits-1.733
 three-1.665
year-1.659
Token and
Token ID290
Feature activation+3.69e+00

Pos logit contributions
ww+1.042
lor+0.983
burn+0.978
 subs+0.940
actus+0.908
Neg logit contributions
 questions-0.605
years-0.603
ruits-0.560
year-0.538
nam-0.528
Token laughed
Token ID13818
Feature activation-3.88e-01

Pos logit contributions
lor+8.969
ww+8.699
ic+8.453
burn+8.249
actus+8.039
Neg logit contributions
years-4.485
nam-4.405
 three-4.262
 curious-4.176
 liked-4.076
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
years+0.617
 questions+0.600
ruits+0.594
nam+0.556
year+0.550
Neg logit contributions
ww-0.682
burn-0.648
lor-0.638
 subs-0.614
 tou-0.586
Token She
Token ID1375
Feature activation+1.82e+00

Pos logit contributions
ww+0.975
lor+0.914
burn+0.912
 subs+0.869
actus+0.835
Neg logit contributions
 questions-0.746
years-0.745
ruits-0.706
year-0.676
nam-0.668
Token loved
Token ID6151
Feature activation+1.43e+00

Pos logit contributions
ww+4.103
burn+3.923
lor+3.882
 subs+3.642
ic+3.570
Neg logit contributions
years-2.112
 questions-2.062
 three-1.843
ruits-1.831
year-1.825
Token to
Token ID284
Feature activation+7.86e-01

Pos logit contributions
ww+2.888
lor+2.794
burn+2.638
 subs+2.556
actus+2.515
Neg logit contributions
 questions-2.139
years-2.054
ruits-1.872
nam-1.851
 three-1.810
Token they
Token ID484
Feature activation-1.04e-01

Pos logit contributions
ww+2.489
lor+2.345
burn+2.333
 subs+2.199
actus+2.187
Neg logit contributions
years-2.097
 questions-2.093
ruits-1.947
nam-1.920
year-1.917
Token played
Token ID2826
Feature activation-3.61e-01

Pos logit contributions
 questions+0.131
years+0.129
ruits+0.123
year+0.114
nam+0.114
Neg logit contributions
ww-0.219
burn-0.206
lor-0.205
 subs-0.199
 tou-0.192
Token together
Token ID1978
Feature activation+1.48e-01

Pos logit contributions
years+0.572
 questions+0.550
ruits+0.538
nam+0.513
year+0.511
Neg logit contributions
ww-0.643
lor-0.608
burn-0.598
 subs-0.573
actus-0.554
Token until
Token ID1566
Feature activation+2.26e+00

Pos logit contributions
ww+0.305
burn+0.288
lor+0.288
 subs+0.277
actus+0.266
Neg logit contributions
years-0.193
 questions-0.190
ruits-0.180
year-0.172
nam-0.170
Token the
Token ID262
Feature activation+9.99e-01

Pos logit contributions
lor+4.518
ww+4.411
burn+4.146
actus+4.038
ic+3.969
Neg logit contributions
years-3.193
 questions-3.170
 three-3.132
nam-3.061
year-2.965
Token sun
Token ID4252
Feature activation+3.69e+00

Pos logit contributions
ww+1.366
lor+1.305
burn+1.291
 subs+1.170
 tou+1.095
Neg logit contributions
 questions-2.005
years-1.965
ruits-1.891
year-1.847
nam-1.816
Token came
Token ID1625
Feature activation+5.79e-01

Pos logit contributions
ww+9.112
lor+8.517
burn+7.857
 Cathy+7.648
rob+7.633
Neg logit contributions
 three-4.020
 questions-3.791
 anx-3.721
 want-3.582
adies-3.578
Token out
Token ID503
Feature activation+9.27e-01

Pos logit contributions
ww+1.034
lor+0.968
burn+0.953
 subs+0.908
actus+0.878
Neg logit contributions
 questions-0.914
years-0.909
ruits-0.862
year-0.833
nam-0.824
Token and
Token ID290
Feature activation+1.90e+00

Pos logit contributions
ww+1.836
lor+1.735
burn+1.702
 subs+1.613
actus+1.590
Neg logit contributions
 questions-1.307
years-1.288
ruits-1.204
year-1.185
nam-1.139
Token the
Token ID262
Feature activation+7.88e-01

Pos logit contributions
ww+3.701
lor+3.682
burn+3.337
actus+3.250
 subs+3.203
Neg logit contributions
 questions-2.787
nam-2.784
years-2.743
 three-2.605
ruits-2.568
Token beach
Token ID10481
Feature activation+2.19e+00

Pos logit contributions
ww+1.157
lor+1.083
burn+1.078
 subs+0.996
 tou+0.931
Neg logit contributions
 questions-1.502
years-1.488
ruits-1.425
nam-1.379
year-1.379
Token surprise
Token ID5975
Feature activation-8.81e-01

Pos logit contributions
ww+0.035
burn+0.032
lor+0.032
 subs+0.030
actus+0.029
Neg logit contributions
years-0.043
 questions-0.042
ruits-0.040
year-0.040
nam-0.039
Token for
Token ID329
Feature activation-4.67e-01

Pos logit contributions
years+1.363
 questions+1.265
ruits+1.230
nam+1.219
year+1.206
Neg logit contributions
ww-1.654
burn-1.545
lor-1.504
 subs-1.426
 chip-1.401
Token you
Token ID345
Feature activation-3.00e+00

Pos logit contributions
years+0.951
 questions+0.941
ruits+0.912
year+0.904
nam+0.866
Neg logit contributions
ww-0.622
burn-0.554
lor-0.550
 subs-0.503
 tou-0.492
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
years+6.540
year+6.046
nam+5.787
ruits+5.568
 questions+5.504
Neg logit contributions
ww-4.062
lor-3.715
burn-3.648
 chip-3.394
actus-3.261
Token Do
Token ID2141
Feature activation-2.92e-01

Pos logit contributions
years+2.475
 questions+2.428
nam+2.228
ruits+2.224
year+2.156
Neg logit contributions
ww-2.420
lor-2.292
burn-2.176
 tou-2.074
 chip-2.037
Token you
Token ID345
Feature activation-3.89e+00

Pos logit contributions
years+0.627
 questions+0.616
year+0.597
ruits+0.594
nam+0.567
Neg logit contributions
ww-0.372
lor-0.330
burn-0.322
 subs-0.296
 tou-0.286
Token want
Token ID765
Feature activation-2.18e-01

Pos logit contributions
years+5.965
nam+5.704
 curiously+5.008
 questions+4.848
year+4.834
Neg logit contributions
ww-8.219
burn-7.620
 tou-7.225
 subs-6.856
lor-6.766
Token to
Token ID284
Feature activation+1.59e-01

Pos logit contributions
years+0.375
 questions+0.366
ruits+0.355
year+0.345
nam+0.342
Neg logit contributions
ww-0.364
burn-0.336
lor-0.333
 subs-0.313
 chip-0.302
Token know
Token ID760
Feature activation-1.67e+00

Pos logit contributions
ww+0.322
burn+0.305
lor+0.303
 subs+0.290
 tou+0.281
Neg logit contributions
years-0.216
 questions-0.213
ruits-0.201
year-0.195
nam-0.190
Token what
Token ID644
Feature activation+3.24e-01

Pos logit contributions
years+2.659
year+2.396
 questions+2.380
ruits+2.319
nam+2.181
Neg logit contributions
ww-3.104
burn-2.968
lor-2.886
 chip-2.717
 tou-2.638
Token it
Token ID340
Feature activation+8.23e-02

Pos logit contributions
ww+0.566
lor+0.527
burn+0.526
 subs+0.500
actus+0.479
Neg logit contributions
years-0.525
 questions-0.523
ruits-0.497
year-0.480
nam-0.474
Token has
Token ID468
Feature activation-7.06e-01

Pos logit contributions
ww+0.346
lor+0.318
burn+0.316
 subs+0.298
actus+0.282
Neg logit contributions
years-0.464
 questions-0.462
ruits-0.443
year-0.431
nam-0.427
Token many
Token ID867
Feature activation-2.00e+00

Pos logit contributions
years+0.899
 questions+0.869
ruits+0.843
year+0.794
nam+0.767
Neg logit contributions
ww-1.489
lor-1.402
burn-1.378
 subs-1.342
 tou-1.337
Token things
Token ID1243
Feature activation+8.14e-01

Pos logit contributions
years+3.534
ruits+3.177
year+3.170
 questions+3.099
nam+3.044
Neg logit contributions
ww-3.466
actus-3.011
 chip-2.991
 tou-2.986
 subs-2.940
Token inside
Token ID2641
Feature activation-7.69e-01

Pos logit contributions
ww+1.555
lor+1.469
burn+1.455
 subs+1.384
actus+1.339
Neg logit contributions
 questions-1.203
years-1.177
ruits-1.131
year-1.074
nam-1.069
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
years+1.316
 questions+1.242
ruits+1.187
year+1.186
nam+1.160
Neg logit contributions
ww-1.273
lor-1.183
burn-1.167
 subs-1.098
 tou-1.080
Token They
Token ID1119
Feature activation-3.83e+00

Pos logit contributions
years+0.284
 questions+0.280
ruits+0.264
year+0.261
nam+0.257
Neg logit contributions
ww-0.263
lor-0.245
burn-0.242
 subs-0.230
actus-0.221
Token see
Token ID766
Feature activation-1.61e+00

Pos logit contributions
years+6.620
year+6.484
ruits+6.326
adies+6.113
 questions+5.858
Neg logit contributions
 tou-6.481
ww-6.064
 po-6.023
 subs-5.875
 bottle-5.835
Token a
Token ID257
Feature activation+7.74e-01

Pos logit contributions
years+2.576
ruits+2.441
year+2.399
 questions+2.372
adies+2.236
Neg logit contributions
ww-2.874
burn-2.697
lor-2.680
 chip-2.634
 tou-2.625
Token candle
Token ID26839
Feature activation+1.71e+00

Pos logit contributions
ww+1.221
lor+1.155
burn+1.143
 subs+1.079
actus+1.011
Neg logit contributions
 questions-1.368
years-1.344
ruits-1.295
nam-1.255
year-1.248
Token,
Token ID11
Feature activation-2.32e+00

Pos logit contributions
ww+3.740
lor+3.544
burn+3.439
 subs+3.363
actus+3.224
Neg logit contributions
 questions-2.063
ruits-1.925
years-1.923
 three-1.823
year-1.772
Token a
Token ID257
Feature activation+6.41e-01

Pos logit contributions
years+3.842
 questions+3.449
ruits+3.449
year+3.318
adies+3.140
Neg logit contributions
burn-4.081
ww-4.080
 chip-3.868
 tou-3.834
 pel-3.738
Token stories
Token ID3923
Feature activation-4.89e-01

Pos logit contributions
years+2.303
year+2.100
 questions+2.011
ruits+2.004
adies+1.920
Neg logit contributions
ww-2.596
burn-2.492
lor-2.360
 tou-2.286
actus-2.281
Token with
Token ID351
Feature activation-8.32e-01

Pos logit contributions
years+0.774
 questions+0.725
ruits+0.697
year+0.689
nam+0.670
Neg logit contributions
ww-0.899
burn-0.843
lor-0.842
 subs-0.786
 chip-0.766
Token them
Token ID606
Feature activation-1.42e+00

Pos logit contributions
years+1.474
 questions+1.413
year+1.397
ruits+1.383
adies+1.322
Neg logit contributions
ww-1.332
burn-1.210
lor-1.200
 tou-1.110
 subs-1.106
Token.
Token ID13
Feature activation-8.71e-01

Pos logit contributions
years+2.608
year+2.397
 questions+2.349
ruits+2.328
adies+2.255
Neg logit contributions
ww-2.333
burn-2.159
lor-2.097
 chip-1.975
actus-1.927
Token Do
Token ID2141
Feature activation-3.07e-01

Pos logit contributions
years+1.366
 questions+1.322
ruits+1.245
year+1.230
nam+1.225
Neg logit contributions
ww-1.589
lor-1.484
burn-1.463
 subs-1.359
 chip-1.357
Token you
Token ID345
Feature activation-3.82e+00

Pos logit contributions
years+0.653
 questions+0.633
year+0.621
ruits+0.613
nam+0.588
Neg logit contributions
ww-0.402
lor-0.357
burn-0.350
 subs-0.320
 tou-0.314
Token want
Token ID765
Feature activation-2.16e-01

Pos logit contributions
years+6.246
nam+5.283
year+5.096
 chambers+4.517
 questions+4.254
Neg logit contributions
ww-8.092
burn-7.463
 tou-7.095
lor-6.761
 subs-6.744
Token to
Token ID284
Feature activation-9.09e-02

Pos logit contributions
years+0.373
 questions+0.361
ruits+0.354
year+0.345
nam+0.338
Neg logit contributions
ww-0.356
burn-0.328
lor-0.326
 subs-0.306
actus-0.295
Token try
Token ID1949
Feature activation-8.81e-01

Pos logit contributions
years+0.119
 questions+0.115
ruits+0.110
year+0.109
nam+0.103
Neg logit contributions
ww-0.188
burn-0.179
lor-0.177
 subs-0.170
 tou-0.166
Token?"
Token ID1701
Feature activation-1.12e+00

Pos logit contributions
years+1.297
year+1.216
 questions+1.210
ruits+1.190
nam+1.097
Neg logit contributions
ww-1.679
burn-1.650
lor-1.592
actus-1.491
 chip-1.486
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
years+1.981
 questions+1.869
ruits+1.864
year+1.827
nam+1.719
Neg logit contributions
ww-1.843
burn-1.684
 tou-1.613
 subs-1.609
 chip-1.604
Token try
Token ID1949
Feature activation+2.29e-01

Pos logit contributions
ww+0.709
lor+0.668
burn+0.666
 subs+0.638
actus+0.615
Neg logit contributions
years-0.467
 questions-0.466
ruits-0.438
year-0.420
nam-0.413
Token something
Token ID1223
Feature activation+1.24e+00

Pos logit contributions
ww+0.423
lor+0.397
burn+0.397
 subs+0.377
actus+0.363
Neg logit contributions
years-0.349
 questions-0.344
ruits-0.328
year-0.317
nam-0.313
Token different
Token ID1180
Feature activation-3.04e-01

Pos logit contributions
ww+2.294
lor+2.189
burn+2.173
 subs+2.015
actus+1.976
Neg logit contributions
 questions-2.009
years-1.855
ruits-1.801
 whether-1.733
year-1.728
Token.
Token ID13
Feature activation-1.82e+00

Pos logit contributions
years+0.501
 questions+0.475
ruits+0.463
year+0.452
nam+0.446
Neg logit contributions
ww-0.524
burn-0.489
lor-0.489
 subs-0.472
 tou-0.446
Token He
Token ID679
Feature activation-1.35e-01

Pos logit contributions
years+2.820
nam+2.820
ruits+2.767
 questions+2.665
adies+2.576
Neg logit contributions
ww-3.234
lor-3.121
 subs-3.033
 chip-2.836
burn-2.834
Token asked
Token ID1965
Feature activation-3.80e+00

Pos logit contributions
years+0.207
 questions+0.207
ruits+0.198
nam+0.190
year+0.188
Neg logit contributions
ww-0.245
burn-0.229
lor-0.228
 subs-0.220
 tou-0.210
Token his
Token ID465
Feature activation-1.04e+00

Pos logit contributions
nam+7.574
ruits+7.304
years+6.699
racted+6.545
 ago+6.319
Neg logit contributions
ww-5.360
 subs-5.245
lor-5.050
ic-4.881
 cho-4.870
Token farm
Token ID5318
Feature activation-8.17e-01

Pos logit contributions
years+1.817
year+1.688
ruits+1.685
nam+1.661
adies+1.613
Neg logit contributions
ww-1.735
burn-1.605
lor-1.568
 subs-1.529
 chip-1.526
Token friends
Token ID2460
Feature activation-1.68e+00

Pos logit contributions
years+1.460
nam+1.326
ruits+1.323
year+1.323
 questions+1.311
Neg logit contributions
ww-1.293
lor-1.227
burn-1.208
 chip-1.137
 subs-1.135
Token to
Token ID284
Feature activation+4.08e-01

Pos logit contributions
years+2.875
nam+2.620
year+2.592
ruits+2.567
adies+2.496
Neg logit contributions
ww-2.696
lor-2.612
burn-2.607
 subs-2.433
 chip-2.405
Token help
Token ID1037
Feature activation-1.58e+00

Pos logit contributions
ww+0.844
lor+0.796
burn+0.793
 subs+0.760
actus+0.735
Neg logit contributions
 questions-0.532
years-0.531
ruits-0.498
year-0.475
nam-0.469
Token blocks
Token ID7021
Feature activation+3.05e-02

Pos logit contributions
years+4.021
ruits+3.885
year+3.680
adies+3.569
 questions+3.445
Neg logit contributions
ww-3.630
 bottle-3.214
 tou-3.189
 chip-3.179
rob-3.114
Token in
Token ID287
Feature activation-3.05e-01

Pos logit contributions
ww+0.056
burn+0.053
lor+0.053
 subs+0.050
actus+0.048
Neg logit contributions
years-0.046
 questions-0.046
ruits-0.043
year-0.042
nam-0.041
Token the
Token ID262
Feature activation+6.55e-01

Pos logit contributions
years+0.518
 questions+0.507
ruits+0.487
year+0.473
adies+0.456
Neg logit contributions
ww-0.509
lor-0.477
burn-0.476
 subs-0.448
 chip-0.434
Token basement
Token ID17012
Feature activation-3.98e-01

Pos logit contributions
ww+1.253
burn+1.174
lor+1.173
 subs+1.119
actus+1.067
Neg logit contributions
 questions-0.969
years-0.965
ruits-0.911
nam-0.873
year-0.872
Token.
Token ID13
Feature activation-6.26e-01

Pos logit contributions
years+0.677
 questions+0.652
ruits+0.621
year+0.610
nam+0.603
Neg logit contributions
ww-0.663
lor-0.629
burn-0.621
 subs-0.594
 tou-0.568
Token They
Token ID1119
Feature activation-3.78e+00

Pos logit contributions
years+1.025
 questions+0.981
nam+0.934
year+0.928
ruits+0.925
Neg logit contributions
ww-1.076
lor-1.004
burn-0.983
 subs-0.968
 chip-0.945
Token are
Token ID389
Feature activation-1.88e+00

Pos logit contributions
years+6.941
year+6.845
ruits+6.513
 questions+6.465
adies+6.216
Neg logit contributions
 tou-5.885
ww-5.759
 po-5.497
 bottle-5.450
 subs-5.437
Token very
Token ID845
Feature activation-1.35e+00

Pos logit contributions
years+3.064
ruits+3.041
 questions+2.966
year+2.909
adies+2.889
Neg logit contributions
ww-2.957
 tou-2.849
 subs-2.834
burn-2.781
 chip-2.781
Token happy
Token ID3772
Feature activation-7.81e-02

Pos logit contributions
years+1.809
 questions+1.770
ruits+1.687
adies+1.623
year+1.584
Neg logit contributions
ww-2.743
 tou-2.420
 subs-2.420
 pel-2.394
 chip-2.376
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
years+0.123
 questions+0.120
ruits+0.115
year+0.112
nam+0.110
Neg logit contributions
ww-0.141
burn-0.132
lor-0.131
 subs-0.126
actus-0.121
Token They
Token ID1119
Feature activation-2.61e+00

Pos logit contributions
years+0.667
 questions+0.643
year+0.613
ruits+0.607
nam+0.605
Neg logit contributions
ww-0.728
lor-0.676
burn-0.662
 subs-0.644
 chip-0.625
Token many
Token ID867
Feature activation-2.03e+00

Pos logit contributions
years+1.161
 questions+1.134
ruits+1.116
year+1.036
nam+0.986
Neg logit contributions
ww-1.905
lor-1.797
burn-1.760
 subs-1.736
 tou-1.719
Token shiny
Token ID22441
Feature activation-1.23e+00

Pos logit contributions
years+3.382
ruits+3.242
year+3.152
 questions+3.094
adies+3.069
Neg logit contributions
ww-3.537
 subs-3.109
actus-3.098
 chip-3.082
lor-3.073
Token things
Token ID1243
Feature activation+9.71e-01

Pos logit contributions
years+2.091
 questions+1.941
ruits+1.904
year+1.901
nam+1.785
Neg logit contributions
ww-2.180
burn-1.964
lor-1.944
actus-1.919
 subs-1.880
Token inside
Token ID2641
Feature activation-4.34e-01

Pos logit contributions
ww+1.924
lor+1.821
burn+1.799
 subs+1.712
actus+1.666
Neg logit contributions
 questions-1.372
years-1.334
ruits-1.283
year-1.217
nam-1.211
Token.
Token ID13
Feature activation+1.75e-01

Pos logit contributions
years+0.726
 questions+0.699
ruits+0.664
year+0.655
nam+0.644
Neg logit contributions
ww-0.733
lor-0.684
burn-0.677
 subs-0.640
 tou-0.622
Token They
Token ID1119
Feature activation-3.78e+00

Pos logit contributions
ww+0.293
lor+0.273
burn+0.270
 subs+0.257
actus+0.247
Neg logit contributions
years-0.297
 questions-0.295
ruits-0.280
year-0.272
nam-0.269
Token are
Token ID389
Feature activation-1.86e+00

Pos logit contributions
years+6.831
year+6.677
ruits+6.468
adies+6.213
 questions+6.053
Neg logit contributions
 tou-6.103
ww-5.787
 po-5.702
 subs-5.660
 bottle-5.603
Token curious
Token ID11040
Feature activation-1.60e+00

Pos logit contributions
years+2.986
ruits+2.984
adies+2.885
 questions+2.867
year+2.812
Neg logit contributions
ww-2.939
 tou-2.893
 subs-2.838
 pel-2.816
 chip-2.780
Token and
Token ID290
Feature activation-2.44e+00

Pos logit contributions
years+2.674
ruits+2.470
year+2.435
nam+2.402
 questions+2.357
Neg logit contributions
ww-2.752
burn-2.596
 subs-2.521
lor-2.496
 chip-2.413
Token want
Token ID765
Feature activation-6.07e-01

Pos logit contributions
years+2.878
ruits+2.696
adies+2.408
year+2.404
 questions+2.393
Neg logit contributions
ww-5.219
 tou-4.970
 pel-4.804
 subs-4.763
 po-4.554
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
years+1.178
ruits+1.129
 questions+1.126
year+1.101
nam+1.077
Neg logit contributions
ww-0.840
burn-0.788
lor-0.774
 subs-0.747
 chip-0.704
Token many
Token ID867
Feature activation-2.30e+00

Pos logit contributions
years+1.248
 questions+1.208
ruits+1.185
year+1.114
nam+1.043
Neg logit contributions
ww-2.020
lor-1.902
burn-1.862
 tou-1.831
 subs-1.828
Token modern
Token ID3660
Feature activation-2.23e+00

Pos logit contributions
years+4.116
year+3.815
ruits+3.766
adies+3.645
 questions+3.613
Neg logit contributions
ww-3.850
actus-3.434
 tou-3.344
 chip-3.308
 subs-3.264
Token toys
Token ID14958
Feature activation+3.69e-01

Pos logit contributions
years+4.165
year+3.820
ruits+3.653
 questions+3.583
nam+3.448
Neg logit contributions
ww-3.782
lor-3.241
actus-3.235
burn-3.224
 chip-3.195
Token inside
Token ID2641
Feature activation-7.93e-01

Pos logit contributions
ww+0.711
lor+0.667
burn+0.666
 subs+0.636
actus+0.611
Neg logit contributions
years-0.532
 questions-0.531
ruits-0.502
year-0.482
nam-0.475
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
years+1.403
 questions+1.329
ruits+1.263
year+1.254
nam+1.226
Neg logit contributions
ww-1.267
lor-1.176
burn-1.171
 subs-1.086
 tou-1.073
Token They
Token ID1119
Feature activation-3.77e+00

Pos logit contributions
ww+0.272
lor+0.253
burn+0.251
 subs+0.238
actus+0.229
Neg logit contributions
years-0.279
 questions-0.277
ruits-0.262
year-0.256
nam-0.252
Token are
Token ID389
Feature activation-1.84e+00

Pos logit contributions
years+6.606
year+6.473
ruits+6.070
adies+5.905
 questions+5.743
Neg logit contributions
 tou-6.356
ww-6.052
 po-5.926
 bottle-5.765
 subs-5.673
Token shiny
Token ID22441
Feature activation-1.42e+00

Pos logit contributions
years+2.893
ruits+2.835
adies+2.757
 questions+2.755
year+2.715
Neg logit contributions
ww-3.023
 tou-2.977
 chip-2.888
 subs-2.868
 pel-2.846
Token and
Token ID290
Feature activation-1.66e+00

Pos logit contributions
years+2.387
ruits+2.227
year+2.217
 questions+2.194
adies+2.010
Neg logit contributions
ww-2.526
burn-2.347
actus-2.330
lor-2.270
 chip-2.198
Token make
Token ID787
Feature activation-1.11e+00

Pos logit contributions
years+3.114
 questions+3.002
ruits+2.986
adies+2.928
year+2.896
Neg logit contributions
ww-2.429
 tou-2.155
 chip-2.082
burn-2.020
 subs-2.018
Token noises
Token ID26782
Feature activation-4.89e-01

Pos logit contributions
years+1.886
ruits+1.796
 questions+1.784
year+1.730
nam+1.673
Neg logit contributions
ww-1.805
burn-1.696
lor-1.695
 subs-1.638
 chip-1.593
Token a
Token ID257
Feature activation-1.88e-01

Pos logit contributions
ruits+2.839
years+2.813
 questions+2.721
adies+2.568
year+2.499
Neg logit contributions
ww-3.455
burn-3.252
 chip-3.203
 pel-3.165
 subs-3.163
Token lot
Token ID1256
Feature activation-3.79e-01

Pos logit contributions
years+0.348
 questions+0.337
ruits+0.327
year+0.320
nam+0.309
Neg logit contributions
ww-0.295
burn-0.270
lor-0.266
 subs-0.255
 tou-0.245
Token of
Token ID286
Feature activation-2.22e+00

Pos logit contributions
years+0.636
 questions+0.610
ruits+0.583
nam+0.579
year+0.558
Neg logit contributions
ww-0.652
lor-0.601
burn-0.594
 subs-0.571
 tou-0.554
Token boxes
Token ID10559
Feature activation-1.82e-01

Pos logit contributions
years+4.215
ruits+3.825
nam+3.652
year+3.627
adies+3.606
Neg logit contributions
ww-3.528
 chip-3.113
burn-3.105
 pel-3.087
 tou-3.034
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
years+0.296
 questions+0.292
ruits+0.276
year+0.267
nam+0.264
Neg logit contributions
ww-0.314
burn-0.298
lor-0.295
 subs-0.282
 tou-0.270
Token They
Token ID1119
Feature activation-3.75e+00

Pos logit contributions
years+2.458
 questions+2.234
nam+2.185
year+2.170
ruits+2.132
Neg logit contributions
ww-2.161
 chip-1.993
 subs-1.935
lor-1.927
burn-1.927
Token want
Token ID765
Feature activation-1.01e+00

Pos logit contributions
years+7.327
year+6.841
ruits+6.500
adies+6.119
 questions+5.953
Neg logit contributions
 tou-6.060
ww-6.053
 pel-5.686
 subs-5.634
 po-5.459
Token to
Token ID284
Feature activation-9.49e-01

Pos logit contributions
years+2.074
ruits+1.924
 questions+1.904
year+1.900
nam+1.867
Neg logit contributions
ww-1.327
burn-1.272
lor-1.180
 subs-1.178
 chip-1.161
Token see
Token ID766
Feature activation-1.64e+00

Pos logit contributions
years+1.427
 questions+1.333
ruits+1.282
year+1.262
nam+1.209
Neg logit contributions
ww-1.824
burn-1.761
 tou-1.688
 subs-1.668
lor-1.656
Token what
Token ID644
Feature activation+1.28e+00

Pos logit contributions
years+2.715
ruits+2.464
year+2.380
 questions+2.370
adies+2.278
Neg logit contributions
ww-2.891
burn-2.861
 chip-2.669
lor-2.663
 tou-2.655
Token is
Token ID318
Feature activation-3.65e-01

Pos logit contributions
ww+2.061
lor+1.977
burn+1.948
 subs+1.830
actus+1.780
Neg logit contributions
 questions-2.278
years-2.154
ruits-2.129
nam-2.043
year-2.032
Token fix
Token ID4259
Feature activation+3.79e-01

Pos logit contributions
years+0.977
 questions+0.941
ruits+0.910
year+0.905
nam+0.845
Neg logit contributions
ww-1.242
burn-1.202
lor-1.179
 subs-1.126
 tou-1.104
Token your
Token ID534
Feature activation-8.27e-01

Pos logit contributions
ww+0.599
lor+0.554
burn+0.552
 subs+0.522
actus+0.497
Neg logit contributions
years-0.676
 questions-0.676
ruits-0.644
year-0.624
nam-0.618
Token bike
Token ID7161
Feature activation-9.21e-01

Pos logit contributions
years+1.547
year+1.466
 questions+1.437
ruits+1.430
adies+1.356
Neg logit contributions
ww-1.301
burn-1.223
lor-1.176
 subs-1.135
 tou-1.124
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
years+1.577
 questions+1.505
ruits+1.459
year+1.451
adies+1.377
Neg logit contributions
ww-1.580
burn-1.498
lor-1.480
 chip-1.365
 subs-1.356
Token Do
Token ID2141
Feature activation-4.29e-01

Pos logit contributions
years+2.561
 questions+2.530
ruits+2.434
year+2.346
nam+2.328
Neg logit contributions
ww-2.110
lor-1.910
burn-1.874
 subs-1.742
 chip-1.730
Token you
Token ID345
Feature activation-3.75e+00

Pos logit contributions
years+0.931
 questions+0.912
year+0.899
ruits+0.891
nam+0.840
Neg logit contributions
ww-0.536
lor-0.474
burn-0.456
 subs-0.423
 tou-0.410
Token want
Token ID765
Feature activation+1.51e-02

Pos logit contributions
years+6.078
year+5.347
nam+5.308
ruits+4.907
 questions+4.760
Neg logit contributions
ww-7.678
burn-7.318
 subs-6.985
 tou-6.889
br-6.556
Token me
Token ID502
Feature activation-6.02e-01

Pos logit contributions
ww+0.024
burn+0.023
lor+0.022
 subs+0.021
 tou+0.020
Neg logit contributions
years-0.027
 questions-0.026
ruits-0.025
year-0.025
nam-0.024
Token to
Token ID284
Feature activation-3.09e-01

Pos logit contributions
years+1.134
 questions+1.101
ruits+1.084
year+1.056
nam+1.029
Neg logit contributions
ww-0.907
burn-0.878
lor-0.827
 subs-0.774
 tou-0.752
Token try
Token ID1949
Feature activation-7.60e-01

Pos logit contributions
years+0.451
 questions+0.438
ruits+0.424
year+0.419
nam+0.391
Neg logit contributions
ww-0.598
burn-0.573
lor-0.557
 subs-0.538
 tou-0.526
Token?"
Token ID1701
Feature activation-1.01e+00

Pos logit contributions
years+1.213
 questions+1.150
year+1.140
ruits+1.133
adies+1.052
Neg logit contributions
ww-1.370
burn-1.332
lor-1.279
 chip-1.189
actus-1.183
Token asks
Token ID7893
Feature activation-1.34e+00

Pos logit contributions
years+1.969
year+1.838
 questions+1.826
ruits+1.818
nam+1.789
Neg logit contributions
ww-1.767
burn-1.539
 subs-1.527
lor-1.518
actus-1.485
Token Tom
Token ID4186
Feature activation-1.38e+00

Pos logit contributions
years+2.268
year+2.030
 questions+2.006
nam+1.993
ruits+1.967
Neg logit contributions
ww-2.287
burn-2.170
lor-2.076
 chip-2.011
 tou-1.998
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
years+2.252
year+2.067
nam+1.976
 questions+1.922
ruits+1.911
Neg logit contributions
ww-2.472
burn-2.393
lor-2.336
 chip-2.266
 subs-2.183
Token "
Token ID366
Feature activation-1.45e+00

Pos logit contributions
years+2.835
 questions+2.671
ruits+2.648
year+2.591
adies+2.563
Neg logit contributions
ww-2.357
burn-2.196
lor-2.178
 chip-2.176
 subs-2.095
TokenDo
Token ID5211
Feature activation+4.97e-01

Pos logit contributions
 questions+2.616
years+2.580
ruits+2.444
year+2.373
nam+2.361
Neg logit contributions
ww-2.388
burn-2.274
lor-2.171
 chip-2.047
 subs-2.003
Token you
Token ID345
Feature activation-3.75e+00

Pos logit contributions
ww+0.721
burn+0.669
lor+0.669
 subs+0.626
actus+0.594
Neg logit contributions
 questions-0.950
years-0.948
ruits-0.910
year-0.877
nam-0.875
Token want
Token ID765
Feature activation+2.47e-01

Pos logit contributions
years+5.479
nam+4.864
year+4.643
 chambers+4.558
 curiously+4.330
Neg logit contributions
ww-8.553
burn-7.420
 tou-7.198
 subs-7.088
 chip-6.951
Token ice
Token ID4771
Feature activation+6.62e-01

Pos logit contributions
ww+0.435
burn+0.404
lor+0.404
 subs+0.383
actus+0.367
Neg logit contributions
years-0.399
 questions-0.396
ruits-0.377
year-0.365
nam-0.360
Token cream
Token ID8566
Feature activation+7.12e-01

Pos logit contributions
ww+0.805
lor+0.740
burn+0.723
 subs+0.687
 tou+0.637
Neg logit contributions
 questions-1.413
years-1.405
ruits-1.364
year-1.318
nam-1.309
Token?"
Token ID1701
Feature activation+3.52e-01

Pos logit contributions
ww+1.522
lor+1.452
burn+1.431
 subs+1.396
 tou+1.339
Neg logit contributions
 questions-0.860
years-0.850
ruits-0.806
year-0.754
nam-0.752
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
ww+0.744
lor+0.702
burn+0.700
 subs+0.672
actus+0.649
Neg logit contributions
years-0.446
 questions-0.445
ruits-0.416
year-0.398
nam-0.391
Token box
Token ID3091
Feature activation-1.89e-01

Pos logit contributions
years+0.076
 questions+0.075
ruits+0.072
year+0.071
nam+0.069
Neg logit contributions
ww-0.059
burn-0.054
lor-0.053
 subs-0.051
 tou-0.049
Token in
Token ID287
Feature activation-2.53e-01

Pos logit contributions
years+0.270
 questions+0.269
ruits+0.250
nam+0.239
year+0.237
Neg logit contributions
ww-0.364
burn-0.345
lor-0.342
 subs-0.329
 tou-0.317
Token the
Token ID262
Feature activation+8.83e-01

Pos logit contributions
years+0.431
 questions+0.423
ruits+0.407
year+0.393
adies+0.380
Neg logit contributions
ww-0.425
burn-0.396
lor-0.394
 subs-0.372
 tou-0.364
Token garage
Token ID15591
Feature activation-5.91e-01

Pos logit contributions
ww+1.950
lor+1.842
burn+1.837
 subs+1.772
actus+1.695
Neg logit contributions
 questions-1.050
years-1.037
ruits-0.960
nam-0.933
year-0.907
Token.
Token ID13
Feature activation-3.01e-01

Pos logit contributions
years+1.021
 questions+0.990
ruits+0.928
year+0.918
nam+0.907
Neg logit contributions
ww-0.963
lor-0.931
burn-0.915
 subs-0.870
 chip-0.835
Token They
Token ID1119
Feature activation-3.74e+00

Pos logit contributions
years+0.449
 questions+0.440
ruits+0.406
year+0.403
nam+0.394
Neg logit contributions
ww-0.570
lor-0.531
burn-0.529
 subs-0.509
 chip-0.494
Token wonder
Token ID4240
Feature activation-2.76e+00

Pos logit contributions
years+6.764
year+6.552
ruits+6.392
 questions+6.264
adies+6.104
Neg logit contributions
 tou-6.108
ww-5.960
 subs-5.547
 po-5.489
 bottle-5.393
Token what
Token ID644
Feature activation+1.33e+00

Pos logit contributions
years+3.733
year+3.055
ruits+2.824
adies+2.671
 Many+2.634
Neg logit contributions
ww-5.730
burn-5.621
 pel-5.391
 po-5.356
 tou-5.353
Token is
Token ID318
Feature activation+1.99e-01

Pos logit contributions
ww+2.211
burn+2.124
lor+2.081
 subs+1.964
actus+1.883
Neg logit contributions
 questions-2.394
years-2.231
ruits-2.219
nam-2.122
 three-2.091
Token inside
Token ID2641
Feature activation-6.41e-01

Pos logit contributions
ww+0.375
lor+0.351
burn+0.349
 subs+0.334
 tou+0.322
Neg logit contributions
years-0.295
 questions-0.292
ruits-0.277
year-0.267
nam-0.263
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
years+1.117
 questions+1.071
ruits+1.024
year+1.017
nam+0.990
Neg logit contributions
ww-1.035
lor-0.965
burn-0.965
 subs-0.897
 tou-0.880
Token made
Token ID925
Feature activation-4.16e-01

Pos logit contributions
years+0.253
 questions+0.253
ruits+0.243
year+0.229
nam+0.228
Neg logit contributions
ww-0.338
burn-0.314
lor-0.313
 subs-0.304
 tou-0.291
Token a
Token ID257
Feature activation+1.25e+00

Pos logit contributions
years+0.634
ruits+0.612
 questions+0.612
year+0.576
nam+0.575
Neg logit contributions
ww-0.753
lor-0.712
burn-0.708
 subs-0.684
 chip-0.647
Token plan
Token ID1410
Feature activation+7.85e-02

Pos logit contributions
ww+2.413
lor+2.339
burn+2.308
 subs+2.163
rob+2.109
Neg logit contributions
 questions-1.811
years-1.716
ruits-1.627
nam-1.618
 three-1.585
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
ww+0.136
lor+0.127
burn+0.127
 subs+0.121
 tou+0.115
Neg logit contributions
years-0.127
 questions-0.126
ruits-0.121
year-0.116
nam-0.116
Token He
Token ID679
Feature activation-1.06e+00

Pos logit contributions
years+2.459
 questions+2.339
ruits+2.246
adies+2.150
nam+2.138
Neg logit contributions
ww-2.526
lor-2.371
 subs-2.298
 chip-2.267
burn-2.238
Token asked
Token ID1965
Feature activation-3.74e+00

Pos logit contributions
 questions+1.742
years+1.727
ruits+1.694
nam+1.623
adies+1.586
Neg logit contributions
ww-1.823
burn-1.687
 subs-1.654
lor-1.651
 tou-1.538
Token his
Token ID465
Feature activation-8.41e-01

Pos logit contributions
years+7.858
ruits+7.415
nam+6.986
 ago+6.810
adies+6.775
Neg logit contributions
ww-4.916
 chip-4.586
lor-4.366
 Cathy-4.337
 subs-4.327
Token mom
Token ID1995
Feature activation-2.63e+00

Pos logit contributions
years+1.422
ruits+1.351
year+1.336
 questions+1.308
adies+1.303
Neg logit contributions
ww-1.412
burn-1.294
lor-1.289
 subs-1.269
 chip-1.248
Token for
Token ID329
Feature activation-1.50e+00

Pos logit contributions
years+4.828
nam+4.457
adies+4.420
year+4.386
ruits+4.318
Neg logit contributions
ww-3.879
burn-3.846
lor-3.784
 chip-3.748
 bottle-3.416
Token help
Token ID1037
Feature activation-1.47e+00

Pos logit contributions
ruits+2.255
years+2.237
 questions+2.135
year+2.080
adies+2.074
Neg logit contributions
ww-2.654
lor-2.495
burn-2.410
 subs-2.372
 chip-2.282
Token,
Token ID11
Feature activation-2.12e+00

Pos logit contributions
years+2.597
 questions+2.448
ruits+2.440
year+2.408
adies+2.402
Neg logit contributions
ww-2.235
burn-2.195
lor-2.122
 chip-2.071
 subs-2.019
Token with
Token ID351
Feature activation-7.53e-01

Pos logit contributions
years+2.349
ruits+2.247
 questions+2.220
year+2.137
nam+2.033
Neg logit contributions
ww-2.055
lor-1.961
burn-1.880
 tou-1.734
 chip-1.727
Token something
Token ID1223
Feature activation+5.20e-01

Pos logit contributions
years+1.536
 questions+1.498
ruits+1.474
year+1.474
adies+1.382
Neg logit contributions
ww-1.012
burn-0.894
lor-0.893
actus-0.810
 subs-0.798
Token else
Token ID2073
Feature activation-1.17e+00

Pos logit contributions
ww+1.074
lor+1.017
burn+1.016
 subs+0.971
actus+0.937
Neg logit contributions
 questions-0.677
years-0.670
ruits-0.631
year-0.602
nam-0.595
Token.
Token ID13
Feature activation-7.05e-01

Pos logit contributions
years+2.093
 questions+1.934
year+1.927
ruits+1.876
adies+1.815
Neg logit contributions
ww-1.956
burn-1.765
lor-1.761
 chip-1.624
actus-1.615
Token Do
Token ID2141
Feature activation-1.63e-01

Pos logit contributions
years+1.072
 questions+1.042
ruits+0.977
year+0.960
nam+0.944
Neg logit contributions
ww-1.326
lor-1.255
burn-1.236
 subs-1.160
 tou-1.143
Token you
Token ID345
Feature activation-3.74e+00

Pos logit contributions
years+0.344
 questions+0.337
ruits+0.327
year+0.327
nam+0.312
Neg logit contributions
ww-0.213
lor-0.191
burn-0.187
 subs-0.172
 tou-0.166
Token want
Token ID765
Feature activation-2.83e-02

Pos logit contributions
years+5.962
nam+5.074
year+5.025
 chambers+4.316
 curiously+4.178
Neg logit contributions
ww-7.802
burn-7.533
 tou-7.085
 subs-6.897
lor-6.896
Token to
Token ID284
Feature activation+4.86e-01

Pos logit contributions
years+0.050
 questions+0.049
ruits+0.047
year+0.046
nam+0.045
Neg logit contributions
ww-0.046
burn-0.042
lor-0.042
 subs-0.039
actus-0.038
Token draw
Token ID3197
Feature activation-7.68e-01

Pos logit contributions
ww+1.020
lor+0.962
burn+0.956
 subs+0.920
actus+0.889
Neg logit contributions
 questions-0.619
years-0.615
ruits-0.577
year-0.546
nam-0.544
Token a
Token ID257
Feature activation+7.42e-01

Pos logit contributions
years+1.177
ruits+1.123
 questions+1.121
year+1.075
nam+1.030
Neg logit contributions
ww-1.427
lor-1.344
burn-1.331
actus-1.233
 subs-1.229
Token picture
Token ID4286
Feature activation-5.17e-03

Pos logit contributions
ww+1.080
lor+1.008
burn+0.997
 subs+0.945
actus+0.881
Neg logit contributions
 questions-1.411
years-1.382
ruits-1.336
nam-1.294
year-1.290
Token in
Token ID287
Feature activation+7.01e-02

Pos logit contributions
ww+0.087
burn+0.081
lor+0.081
 subs+0.077
 tou+0.074
Neg logit contributions
years-0.072
 questions-0.071
ruits-0.067
year-0.065
nam-0.064
Token the
Token ID262
Feature activation+7.18e-01

Pos logit contributions
ww+0.122
lor+0.114
burn+0.114
 subs+0.108
 tou+0.104
Neg logit contributions
years-0.115
 questions-0.113
ruits-0.108
year-0.104
nam-0.102
Token play
Token ID711
Feature activation-1.01e+00

Pos logit contributions
ww+1.437
burn+1.350
lor+1.348
 subs+1.291
actus+1.230
Neg logit contributions
 questions-1.000
years-0.994
ruits-0.938
nam-0.898
year-0.892
Tokenroom
Token ID3823
Feature activation+5.49e-05

Pos logit contributions
years+1.639
 questions+1.572
nam+1.477
ruits+1.469
year+1.408
Neg logit contributions
ww-1.801
lor-1.766
burn-1.678
 subs-1.622
 tou-1.587
Token.
Token ID13
Feature activation-6.19e-01

Pos logit contributions
ww+0.000
lor+0.000
burn+0.000
 subs+0.000
actus+0.000
Neg logit contributions
years-0.000
 questions-0.000
ruits-0.000
year-0.000
nam-0.000
Token They
Token ID1119
Feature activation-3.73e+00

Pos logit contributions
years+0.982
 questions+0.945
nam+0.892
year+0.890
ruits+0.878
Neg logit contributions
ww-1.107
lor-1.036
burn-1.006
 subs-0.984
 chip-0.964
Token are
Token ID389
Feature activation-2.08e+00

Pos logit contributions
years+6.784
year+6.614
 questions+6.285
ruits+6.283
adies+6.187
Neg logit contributions
ww-6.008
 tou-5.903
 subs-5.553
 po-5.461
 pel-5.375
Token very
Token ID845
Feature activation-1.45e+00

Pos logit contributions
years+3.538
ruits+3.523
 questions+3.410
adies+3.355
year+3.323
Neg logit contributions
ww-3.152
 tou-3.103
 subs-3.050
 chip-3.020
 pel-3.013
Token happy
Token ID3772
Feature activation-2.20e-01

Pos logit contributions
years+1.801
 questions+1.772
ruits+1.657
adies+1.603
year+1.547
Neg logit contributions
ww-3.117
 pel-2.766
 tou-2.764
 subs-2.757
 chip-2.716
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
years+0.336
 questions+0.327
ruits+0.314
year+0.305
nam+0.299
Neg logit contributions
ww-0.409
burn-0.385
lor-0.380
 subs-0.367
actus-0.352
Token start
Token ID923
Feature activation-9.13e-01

Pos logit contributions
years+0.123
 questions+0.121
ruits+0.114
year+0.107
nam+0.102
Neg logit contributions
ww-0.252
burn-0.236
lor-0.234
 subs-0.230
 tou-0.224
Token toys
Token ID14958
Feature activation-3.86e-02

Pos logit contributions
years+4.280
ruits+4.139
year+3.954
adies+3.896
 questions+3.713
Neg logit contributions
ww-3.605
 chip-3.194
 bottle-3.171
 tou-3.168
rob-3.048
Token in
Token ID287
Feature activation+7.57e-02

Pos logit contributions
years+0.058
 questions+0.058
ruits+0.054
year+0.053
nam+0.052
Neg logit contributions
ww-0.071
burn-0.067
lor-0.067
 subs-0.064
 tou-0.062
Token the
Token ID262
Feature activation+9.45e-01

Pos logit contributions
ww+0.132
burn+0.123
lor+0.123
 subs+0.116
 tou+0.112
Neg logit contributions
years-0.124
 questions-0.122
ruits-0.116
year-0.113
nam-0.110
Token park
Token ID3952
Feature activation-7.08e-01

Pos logit contributions
ww+1.892
lor+1.774
burn+1.770
 subs+1.701
actus+1.615
Neg logit contributions
 questions-1.323
years-1.303
ruits-1.231
nam-1.193
year-1.162
Token.
Token ID13
Feature activation+9.01e-02

Pos logit contributions
years+1.255
 questions+1.210
year+1.126
ruits+1.124
nam+1.109
Neg logit contributions
ww-1.129
lor-1.079
burn-1.059
 subs-1.000
 tou-0.987
Token They
Token ID1119
Feature activation-3.72e+00

Pos logit contributions
ww+0.168
lor+0.157
burn+0.157
 subs+0.150
 tou+0.145
Neg logit contributions
years-0.136
 questions-0.135
ruits-0.126
year-0.123
nam-0.121
Token are
Token ID389
Feature activation-1.64e+00

Pos logit contributions
years+6.761
year+6.610
ruits+6.196
 questions+6.090
adies+6.055
Neg logit contributions
 tou-6.024
ww-5.812
 po-5.509
 bottle-5.336
 subs-5.263
Token very
Token ID845
Feature activation-1.15e+00

Pos logit contributions
years+2.632
ruits+2.604
 questions+2.554
adies+2.494
year+2.486
Neg logit contributions
ww-2.642
 tou-2.545
 subs-2.480
burn-2.480
 chip-2.467
Token happy
Token ID3772
Feature activation-2.07e-01

Pos logit contributions
years+1.507
 questions+1.487
ruits+1.402
adies+1.352
year+1.324
Neg logit contributions
ww-2.374
 tou-2.104
 subs-2.095
 pel-2.065
burn-2.062
Token.
Token ID13
Feature activation+5.68e-01

Pos logit contributions
years+0.314
 questions+0.306
ruits+0.293
year+0.285
nam+0.279
Neg logit contributions
ww-0.387
burn-0.365
lor-0.360
 subs-0.347
actus-0.334
Token
Token ID198
Feature activation+4.00e-01

Pos logit contributions
ww+0.953
lor+0.888
burn+0.885
 subs+0.838
actus+0.800
Neg logit contributions
 questions-0.970
years-0.966
ruits-0.929
year-0.890
nam-0.884
Token box
Token ID3091
Feature activation+4.76e-02

Pos logit contributions
ww+0.312
burn+0.286
lor+0.284
 subs+0.268
actus+0.255
Neg logit contributions
years-0.393
 questions-0.390
ruits-0.374
year-0.365
nam-0.358
Token in
Token ID287
Feature activation+2.59e-02

Pos logit contributions
ww+0.094
burn+0.089
lor+0.089
 subs+0.085
 tou+0.082
Neg logit contributions
years-0.066
 questions-0.066
ruits-0.061
year-0.058
nam-0.058
Token the
Token ID262
Feature activation+6.50e-01

Pos logit contributions
ww+0.045
burn+0.042
lor+0.042
 subs+0.040
 tou+0.038
Neg logit contributions
years-0.043
 questions-0.042
ruits-0.040
year-0.039
nam-0.038
Token yard
Token ID12699
Feature activation-4.89e-01

Pos logit contributions
ww+1.349
burn+1.271
lor+1.270
 subs+1.220
actus+1.166
Neg logit contributions
 questions-0.852
years-0.849
ruits-0.794
nam-0.761
year-0.759
Token.
Token ID13
Feature activation-4.60e-01

Pos logit contributions
years+0.828
 questions+0.810
ruits+0.754
nam+0.742
year+0.739
Neg logit contributions
ww-0.813
lor-0.780
burn-0.771
 subs-0.732
 chip-0.702
Token They
Token ID1119
Feature activation-3.71e+00

Pos logit contributions
years+0.726
 questions+0.717
ruits+0.662
year+0.654
nam+0.639
Neg logit contributions
ww-0.826
lor-0.774
burn-0.758
 subs-0.732
 chip-0.719
Token want
Token ID765
Feature activation-5.99e-01

Pos logit contributions
years+6.879
year+6.723
ruits+6.397
 questions+6.374
adies+6.167
Neg logit contributions
 tou-5.940
ww-5.575
 po-5.478
 bottle-5.371
 subs-5.197
Token to
Token ID284
Feature activation-5.58e-01

Pos logit contributions
years+1.193
 questions+1.132
ruits+1.116
year+1.104
nam+1.065
Neg logit contributions
ww-0.829
burn-0.781
lor-0.746
 subs-0.728
 tou-0.702
Token see
Token ID766
Feature activation-1.40e+00

Pos logit contributions
years+0.757
 questions+0.725
ruits+0.678
year+0.675
nam+0.629
Neg logit contributions
ww-1.144
burn-1.086
lor-1.064
 subs-1.041
 tou-1.035
Token what
Token ID644
Feature activation+1.74e+00

Pos logit contributions
years+1.936
ruits+1.722
year+1.695
 questions+1.688
adies+1.570
Neg logit contributions
ww-2.794
burn-2.749
lor-2.649
 tou-2.599
 chip-2.560
Token is
Token ID318
Feature activation-3.05e-02

Pos logit contributions
ww+2.943
lor+2.840
burn+2.834
 subs+2.627
actus+2.532
Neg logit contributions
 questions-3.044
ruits-2.838
years-2.791
 three-2.707
nam-2.693
Token real
Token ID1103
Feature activation+8.26e-02

Pos logit contributions
ww+0.471
lor+0.435
burn+0.434
 subs+0.411
actus+0.392
Neg logit contributions
years-0.518
 questions-0.515
ruits-0.493
year-0.477
nam-0.471
Token car
Token ID1097
Feature activation+3.17e-01

Pos logit contributions
ww+0.132
burn+0.122
lor+0.121
 subs+0.115
actus+0.111
Neg logit contributions
years-0.147
 questions-0.146
ruits-0.140
year-0.136
nam-0.134
Token on
Token ID319
Feature activation-3.57e-02

Pos logit contributions
ww+0.582
lor+0.544
burn+0.543
 subs+0.517
actus+0.497
Neg logit contributions
years-0.483
 questions-0.481
ruits-0.456
year-0.440
nam-0.434
Token it
Token ID340
Feature activation-2.77e-01

Pos logit contributions
years+0.065
 questions+0.064
ruits+0.061
year+0.060
nam+0.058
Neg logit contributions
ww-0.056
lor-0.052
burn-0.052
 subs-0.049
 tou-0.047
Token.
Token ID13
Feature activation-6.04e-02

Pos logit contributions
years+0.461
 questions+0.445
ruits+0.427
year+0.419
nam+0.414
Neg logit contributions
ww-0.474
lor-0.446
burn-0.439
 subs-0.421
actus-0.402
Token They
Token ID1119
Feature activation-3.70e+00

Pos logit contributions
years+0.098
 questions+0.096
ruits+0.091
year+0.089
nam+0.089
Neg logit contributions
ww-0.106
lor-0.099
burn-0.098
 subs-0.094
actus-0.091
Token are
Token ID389
Feature activation-1.73e+00

Pos logit contributions
years+7.175
year+6.882
ruits+6.607
adies+6.098
 questions+6.060
Neg logit contributions
 tou-5.779
ww-5.487
 subs-5.448
 po-5.318
burn-5.299
Token very
Token ID845
Feature activation-1.33e+00

Pos logit contributions
years+2.636
ruits+2.620
 questions+2.502
adies+2.444
year+2.444
Neg logit contributions
ww-2.953
 tou-2.880
burn-2.877
 subs-2.864
 pel-2.827
Token excited
Token ID6568
Feature activation-5.02e-02

Pos logit contributions
years+1.588
 questions+1.534
ruits+1.465
adies+1.369
year+1.349
Neg logit contributions
ww-2.943
 subs-2.651
 tou-2.643
 pel-2.642
burn-2.588
Token.
Token ID13
Feature activation+3.36e-01

Pos logit contributions
years+0.079
 questions+0.076
ruits+0.073
year+0.072
nam+0.071
Neg logit contributions
ww-0.091
burn-0.086
lor-0.085
 subs-0.081
actus-0.078
Token They
Token ID1119
Feature activation-2.75e+00

Pos logit contributions
ww+0.637
lor+0.597
burn+0.596
 subs+0.568
actus+0.547
Neg logit contributions
years-0.499
 questions-0.498
ruits-0.471
year-0.453
nam-0.447
Token a
Token ID257
Feature activation+1.50e+00

Pos logit contributions
ww+0.333
lor+0.310
burn+0.310
 subs+0.294
actus+0.282
Neg logit contributions
years-0.304
 questions-0.301
ruits-0.289
year-0.278
nam-0.273
Token new
Token ID649
Feature activation-1.66e-01

Pos logit contributions
ww+2.562
lor+2.481
burn+2.401
 subs+2.278
actus+2.149
Neg logit contributions
 questions-2.545
years-2.366
ruits-2.271
nam-2.269
 three-2.260
Token one
Token ID530
Feature activation+2.46e-01

Pos logit contributions
years+0.311
 questions+0.303
ruits+0.295
year+0.287
nam+0.279
Neg logit contributions
ww-0.255
burn-0.235
lor-0.234
 subs-0.220
actus-0.212
Token.
Token ID13
Feature activation-5.86e-01

Pos logit contributions
ww+0.452
lor+0.421
burn+0.421
 subs+0.401
actus+0.385
Neg logit contributions
years-0.377
 questions-0.374
ruits-0.355
year-0.342
nam-0.337
Token Do
Token ID2141
Feature activation-1.74e-01

Pos logit contributions
years+0.887
 questions+0.865
ruits+0.818
year+0.790
nam+0.789
Neg logit contributions
ww-1.101
lor-1.038
burn-1.026
 subs-0.975
 tou-0.949
Token you
Token ID345
Feature activation-3.69e+00

Pos logit contributions
years+0.369
 questions+0.363
ruits+0.352
year+0.352
nam+0.334
Neg logit contributions
ww-0.223
lor-0.199
burn-0.197
 subs-0.182
 tou-0.175
Token want
Token ID765
Feature activation-1.90e-01

Pos logit contributions
years+5.648
nam+5.079
year+4.735
 questions+4.438
 curiously+4.332
Neg logit contributions
ww-7.596
burn-7.197
 subs-6.797
 tou-6.629
 bottle-6.502
Token to
Token ID284
Feature activation-4.53e-01

Pos logit contributions
years+0.333
 questions+0.326
ruits+0.318
year+0.308
nam+0.302
Neg logit contributions
ww-0.309
burn-0.285
lor-0.283
 subs-0.268
 tou-0.256
Token help
Token ID1037
Feature activation-1.33e-01

Pos logit contributions
years+0.666
 questions+0.638
ruits+0.617
year+0.615
nam+0.577
Neg logit contributions
ww-0.871
burn-0.837
lor-0.809
 subs-0.785
 tou-0.767
Token me
Token ID502
Feature activation-8.97e-01

Pos logit contributions
years+0.199
 questions+0.194
ruits+0.186
year+0.182
nam+0.176
Neg logit contributions
ww-0.250
burn-0.236
lor-0.232
 subs-0.219
actus-0.215
Token?"
Token ID1701
Feature activation+8.75e-02

Pos logit contributions
years+1.350
 questions+1.263
ruits+1.238
year+1.225
adies+1.158
Neg logit contributions
ww-1.681
burn-1.670
lor-1.598
 subs-1.501
 chip-1.500
Token see
Token ID766
Feature activation-1.43e+00

Pos logit contributions
 questions+3.183
years+3.177
year+2.981
ruits+2.951
nam+2.769
Neg logit contributions
ww-3.979
 pel-3.730
burn-3.727
 tou-3.616
 subs-3.578
Token a
Token ID257
Feature activation+1.54e+00

Pos logit contributions
years+2.092
ruits+1.978
 questions+1.932
year+1.881
adies+1.761
Neg logit contributions
ww-2.791
burn-2.676
 chip-2.524
lor-2.521
 pel-2.480
Token big
Token ID1263
Feature activation+4.17e-01

Pos logit contributions
lor+2.595
ww+2.546
burn+2.468
 subs+2.350
ic+2.170
Neg logit contributions
 questions-2.554
years-2.434
nam-2.349
ruits-2.343
 three-2.330
Token box
Token ID3091
Feature activation+2.94e-01

Pos logit contributions
ww+0.653
lor+0.607
burn+0.603
 subs+0.570
actus+0.541
Neg logit contributions
 questions-0.748
years-0.748
ruits-0.714
year-0.691
nam-0.685
Token.
Token ID13
Feature activation-4.68e-01

Pos logit contributions
ww+0.558
lor+0.523
burn+0.522
 subs+0.498
actus+0.479
Neg logit contributions
years-0.432
 questions-0.431
ruits-0.407
year-0.391
nam-0.386
Token They
Token ID1119
Feature activation-3.69e+00

Pos logit contributions
years+0.755
 questions+0.737
ruits+0.684
year+0.683
nam+0.667
Neg logit contributions
ww-0.818
lor-0.766
burn-0.764
 subs-0.731
 chip-0.716
Token want
Token ID765
Feature activation-9.61e-01

Pos logit contributions
years+6.492
year+6.097
ruits+6.058
 questions+5.988
adies+5.621
Neg logit contributions
ww-6.211
 tou-5.988
 subs-5.653
 po-5.595
 pel-5.586
Token to
Token ID284
Feature activation-6.01e-01

Pos logit contributions
years+1.990
 questions+1.855
ruits+1.855
year+1.835
nam+1.798
Neg logit contributions
ww-1.250
burn-1.151
lor-1.131
 subs-1.090
 chip-1.053
Token know
Token ID760
Feature activation-2.24e+00

Pos logit contributions
years+0.764
 questions+0.734
ruits+0.683
year+0.672
nam+0.627
Neg logit contributions
ww-1.277
burn-1.207
lor-1.187
 subs-1.170
 tou-1.164
Token what
Token ID644
Feature activation+1.77e+00

Pos logit contributions
years+3.176
year+2.602
ruits+2.598
 questions+2.507
adies+2.437
Neg logit contributions
ww-4.511
burn-4.353
lor-4.338
 chip-4.266
 tou-4.224
Token is
Token ID318
Feature activation+1.08e-01

Pos logit contributions
ww+3.064
burn+3.009
lor+2.937
 subs+2.752
actus+2.689
Neg logit contributions
 questions-3.058
ruits-2.796
years-2.765
 three-2.697
year-2.657
Token box
Token ID3091
Feature activation-4.13e-02

Pos logit contributions
ww+0.455
lor+0.418
burn+0.418
 subs+0.393
actus+0.374
Neg logit contributions
years-0.560
 questions-0.558
ruits-0.534
year-0.519
nam-0.512
Token in
Token ID287
Feature activation+5.20e-02

Pos logit contributions
years+0.059
 questions+0.059
ruits+0.054
nam+0.052
year+0.052
Neg logit contributions
ww-0.080
burn-0.076
lor-0.075
 subs-0.072
 tou-0.070
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
ww+0.090
burn+0.084
lor+0.084
 subs+0.079
 tou+0.077
Neg logit contributions
years-0.086
 questions-0.084
ruits-0.081
year-0.078
nam-0.076
Token garage
Token ID15591
Feature activation-2.91e-01

Pos logit contributions
ww+2.134
lor+2.009
burn+2.000
 subs+1.930
actus+1.828
Neg logit contributions
 questions-1.331
years-1.310
ruits-1.223
nam-1.200
year-1.159
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
years+0.484
 questions+0.473
ruits+0.445
year+0.437
nam+0.432
Neg logit contributions
ww-0.492
lor-0.472
burn-0.464
 subs-0.443
actus-0.424
Token They
Token ID1119
Feature activation-3.67e+00

Pos logit contributions
ww+0.424
lor+0.398
burn+0.397
 subs+0.379
actus+0.365
Neg logit contributions
years-0.312
 questions-0.311
ruits-0.292
year-0.282
nam-0.277
Token open
Token ID1280
Feature activation-3.55e-01

Pos logit contributions
years+6.435
year+6.234
 questions+6.037
ruits+6.017
adies+5.820
Neg logit contributions
 tou-6.110
ww-5.937
 subs-5.509
 po-5.478
 bottle-5.427
Token the
Token ID262
Feature activation-3.63e-01

Pos logit contributions
years+0.677
 questions+0.663
ruits+0.641
year+0.628
nam+0.607
Neg logit contributions
ww-0.510
burn-0.474
lor-0.471
 subs-0.444
 tou-0.427
Token box
Token ID3091
Feature activation-6.45e-01

Pos logit contributions
years+0.782
 questions+0.756
ruits+0.738
year+0.726
nam+0.706
Neg logit contributions
ww-0.448
lor-0.417
burn-0.395
 subs-0.379
actus-0.365
Token and
Token ID290
Feature activation-2.77e+00

Pos logit contributions
years+0.963
 questions+0.929
ruits+0.868
nam+0.853
year+0.841
Neg logit contributions
ww-1.203
lor-1.154
burn-1.135
 subs-1.097
 chip-1.054
Token see
Token ID766
Feature activation-1.61e+00

Pos logit contributions
years+4.914
year+4.560
ruits+4.534
 questions+4.366
adies+3.945
Neg logit contributions
ww-4.382
 tou-4.302
 subs-4.105
 po-4.092
 disappears-4.050
Tokenamed
Token ID2434
Feature activation+2.18e-01

Pos logit contributions
ww+0.687
lor+0.636
burn+0.635
 subs+0.601
actus+0.572
Neg logit contributions
years-0.782
 questions-0.777
ruits-0.743
year-0.724
nam-0.711
Token them
Token ID606
Feature activation+4.41e-01

Pos logit contributions
ww+0.395
lor+0.369
burn+0.368
 subs+0.351
actus+0.336
Neg logit contributions
years-0.337
 questions-0.336
ruits-0.320
year-0.306
nam-0.304
Token with
Token ID351
Feature activation+8.96e-01

Pos logit contributions
ww+0.873
lor+0.821
burn+0.816
 subs+0.781
actus+0.755
Neg logit contributions
years-0.611
 questions-0.610
ruits-0.574
year-0.552
nam-0.544
Token sugar
Token ID7543
Feature activation+1.78e+00

Pos logit contributions
ww+1.837
lor+1.752
burn+1.735
 subs+1.654
actus+1.623
Neg logit contributions
 questions-1.206
years-1.200
ruits-1.103
year-1.072
nam-1.064
Token and
Token ID290
Feature activation+7.04e-01

Pos logit contributions
ww+3.799
lor+3.541
burn+3.377
 subs+3.357
 tou+3.297
Neg logit contributions
 questions-2.339
years-2.231
ruits-2.201
 three-2.021
year-2.012
Token cream
Token ID8566
Feature activation+2.07e+00

Pos logit contributions
ww+1.169
lor+1.103
burn+1.083
 subs+1.027
actus+0.991
Neg logit contributions
years-1.200
 questions-1.199
ruits-1.137
year-1.103
nam-1.099
Token.
Token ID13
Feature activation+4.98e-01

Pos logit contributions
ww+4.395
lor+4.170
burn+3.986
 subs+3.801
 tou+3.800
Neg logit contributions
 questions-2.775
years-2.604
ruits-2.596
 three-2.439
year-2.413
Token
Token ID220
Feature activation+6.67e-01

Pos logit contributions
ww+0.914
burn+0.854
lor+0.853
 subs+0.813
actus+0.780
Neg logit contributions
years-0.764
 questions-0.762
ruits-0.722
year-0.696
nam-0.685
Token
Token ID198
Feature activation+5.61e-01

Pos logit contributions
ww+1.094
burn+1.009
lor+1.000
 subs+0.957
 tou+0.911
Neg logit contributions
 questions-1.157
years-1.151
ruits-1.112
year-1.066
nam-1.048
Token
Token ID198
Feature activation-2.23e-01

Pos logit contributions
ww+0.904
burn+0.842
lor+0.835
 subs+0.795
actus+0.757
Neg logit contributions
years-0.987
 questions-0.982
ruits-0.938
year-0.913
nam-0.893
TokenM
Token ID44
Feature activation-3.17e-01

Pos logit contributions
 questions+0.395
years+0.388
ruits+0.375
nam+0.361
year+0.355
Neg logit contributions
ww-0.367
lor-0.337
burn-0.329
 subs-0.313
 tou-0.297
Token at
Token ID379
Feature activation+1.57e+00

Pos logit contributions
ww+0.114
burn+0.107
lor+0.107
 subs+0.102
actus+0.097
Neg logit contributions
years-0.091
 questions-0.091
ruits-0.086
year-0.083
nam-0.082
Token the
Token ID262
Feature activation+8.05e-01

Pos logit contributions
ww+3.314
lor+3.137
burn+3.030
 subs+2.946
actus+2.850
Neg logit contributions
 questions-2.121
years-2.058
 three-1.929
nam-1.893
ruits-1.834
Token park
Token ID3952
Feature activation+1.86e-01

Pos logit contributions
ww+1.279
lor+1.188
burn+1.184
 subs+1.128
 tou+1.048
Neg logit contributions
 questions-1.429
years-1.415
ruits-1.348
nam-1.301
year-1.299
Token.
Token ID13
Feature activation+6.12e-01

Pos logit contributions
ww+0.338
lor+0.317
burn+0.317
 subs+0.301
actus+0.289
Neg logit contributions
years-0.287
 questions-0.285
ruits-0.270
year-0.261
nam-0.258
Token The
Token ID383
Feature activation-1.47e-01

Pos logit contributions
ww+1.267
lor+1.191
burn+1.185
 subs+1.139
actus+1.096
Neg logit contributions
 questions-0.800
years-0.795
ruits-0.753
year-0.713
nam-0.708
Token sun
Token ID4252
Feature activation+2.42e+00

Pos logit contributions
years+0.307
 questions+0.303
ruits+0.292
year+0.286
nam+0.282
Neg logit contributions
ww-0.193
burn-0.173
lor-0.170
 subs-0.160
actus-0.156
Token was
Token ID373
Feature activation+7.73e-01

Pos logit contributions
ww+4.616
lor+4.485
burn+3.968
 tou+3.867
 subs+3.850
Neg logit contributions
 questions-3.555
 three-3.477
years-3.463
ruits-3.359
year-3.287
Token shining
Token ID22751
Feature activation+2.73e-01

Pos logit contributions
ww+1.259
lor+1.199
burn+1.146
 subs+1.099
actus+1.071
Neg logit contributions
years-1.338
 questions-1.331
ruits-1.264
year-1.238
nam-1.224
Token,
Token ID11
Feature activation+8.38e-01

Pos logit contributions
ww+0.566
burn+0.533
lor+0.532
 subs+0.510
actus+0.493
Neg logit contributions
years-0.353
 questions-0.351
ruits-0.330
year-0.315
nam-0.310
Token and
Token ID290
Feature activation+1.22e+00

Pos logit contributions
ww+1.779
lor+1.705
burn+1.662
 subs+1.617
actus+1.558
Neg logit contributions
 questions-1.034
years-1.031
ruits-0.963
year-0.923
nam-0.914
Token the
Token ID262
Feature activation+9.92e-02

Pos logit contributions
ww+2.233
lor+2.166
burn+2.067
 subs+2.022
actus+1.948
Neg logit contributions
years-1.852
 questions-1.849
nam-1.738
ruits-1.715
 three-1.682
Token up
Token ID510
Feature activation-4.31e-01

Pos logit contributions
years+0.021
 questions+0.021
ruits+0.020
year+0.019
nam+0.019
Neg logit contributions
ww-0.019
burn-0.017
lor-0.017
 subs-0.016
actus-0.015
Token and
Token ID290
Feature activation+8.19e-01

Pos logit contributions
years+0.609
 questions+0.596
ruits+0.574
year+0.543
nam+0.532
Neg logit contributions
ww-0.841
burn-0.797
lor-0.783
 subs-0.762
 chip-0.734
Token said
Token ID531
Feature activation-5.76e-01

Pos logit contributions
ww+1.534
lor+1.454
burn+1.418
 subs+1.347
actus+1.313
Neg logit contributions
years-1.246
 questions-1.240
ruits-1.165
nam-1.138
year-1.129
Token:
Token ID25
Feature activation-2.61e-01

Pos logit contributions
years+0.850
 questions+0.823
ruits+0.791
nam+0.768
year+0.757
Neg logit contributions
ww-1.094
lor-1.028
burn-0.988
 subs-0.969
 tou-0.919
Token "
Token ID366
Feature activation+2.21e-01

Pos logit contributions
years+0.403
 questions+0.393
ruits+0.375
year+0.364
nam+0.359
Neg logit contributions
ww-0.475
lor-0.446
burn-0.442
 subs-0.422
 chip-0.407
TokenLet
Token ID5756
Feature activation+2.38e+00

Pos logit contributions
ww+0.314
lor+0.286
burn+0.286
 subs+0.267
 tou+0.252
Neg logit contributions
 questions-0.436
years-0.435
ruits-0.417
year-0.404
nam-0.401
Token's
Token ID338
Feature activation-2.10e-01

Pos logit contributions
lor+4.832
ww+4.826
burn+4.679
 subs+4.541
rob+4.514
Neg logit contributions
 questions-3.252
 want-2.908
 three-2.866
ruits-2.782
 if-2.764
Token take
Token ID1011
Feature activation+9.25e-01

Pos logit contributions
years+0.302
 questions+0.295
ruits+0.280
year+0.276
nam+0.261
Neg logit contributions
ww-0.409
burn-0.388
lor-0.380
 subs-0.367
 tou-0.357
Token it
Token ID340
Feature activation+2.04e-01

Pos logit contributions
ww+1.638
lor+1.572
burn+1.564
 subs+1.494
actus+1.452
Neg logit contributions
 questions-1.476
years-1.460
ruits-1.357
year-1.325
nam-1.306
Token inside
Token ID2641
Feature activation-8.81e-01

Pos logit contributions
ww+0.329
burn+0.304
lor+0.304
 subs+0.285
actus+0.273
Neg logit contributions
years-0.361
 questions-0.358
ruits-0.342
year-0.332
nam-0.328
Token and
Token ID290
Feature activation-1.13e-01

Pos logit contributions
years+1.639
 questions+1.582
ruits+1.539
year+1.501
nam+1.493
Neg logit contributions
ww-1.344
lor-1.230
burn-1.228
 chip-1.105
 subs-1.105
Token enjoyed
Token ID8359
Feature activation+8.14e-01

Pos logit contributions
 questions+0.597
years+0.590
ruits+0.567
year+0.531
nam+0.530
Neg logit contributions
ww-0.728
burn-0.678
lor-0.665
 subs-0.651
 tou-0.623
Token their
Token ID511
Feature activation-6.29e-01

Pos logit contributions
ww+1.546
lor+1.454
burn+1.435
 subs+1.377
actus+1.333
Neg logit contributions
 questions-1.233
years-1.225
ruits-1.145
year-1.113
nam-1.110
Token meal
Token ID9799
Feature activation+1.20e+00

Pos logit contributions
years+1.280
 questions+1.226
ruits+1.204
year+1.197
nam+1.192
Neg logit contributions
ww-0.877
burn-0.765
lor-0.752
 subs-0.732
actus-0.700
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
ww+2.362
lor+2.220
burn+2.200
 subs+2.107
 tou+2.055
Neg logit contributions
 questions-1.709
years-1.678
ruits-1.605
year-1.549
nam-1.517
Token After
Token ID2293
Feature activation+2.97e-01

Pos logit contributions
ww+3.396
burn+3.251
lor+3.202
 subs+3.077
 tou+2.970
Neg logit contributions
 questions-2.132
years-2.090
ruits-2.035
year-1.934
 three-1.872
Token lunch
Token ID9965
Feature activation+2.18e+00

Pos logit contributions
ww+0.491
lor+0.455
burn+0.454
 subs+0.430
actus+0.410
Neg logit contributions
years-0.511
 questions-0.510
ruits-0.487
year-0.470
nam-0.465
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
burn+5.245
ww+5.229
lor+5.143
 tou+4.858
actus+4.828
Neg logit contributions
 questions-2.173
ruits-1.938
years-1.933
 three-1.885
 ten-1.780
Token they
Token ID484
Feature activation+5.63e-02

Pos logit contributions
ww+2.173
lor+2.088
burn+1.991
 subs+1.899
actus+1.872
Neg logit contributions
 questions-1.961
years-1.943
nam-1.838
ruits-1.837
 three-1.787
Token went
Token ID1816
Feature activation+1.80e+00

Pos logit contributions
ww+0.101
burn+0.094
lor+0.094
 subs+0.090
 tou+0.086
Neg logit contributions
 questions-0.088
years-0.088
ruits-0.084
year-0.080
nam-0.080
Token back
Token ID736
Feature activation+8.18e-01

Pos logit contributions
ww+3.184
lor+2.909
burn+2.801
 subs+2.790
 tou+2.638
Neg logit contributions
 questions-3.121
years-2.980
ruits-2.813
 three-2.777
year-2.776
Token to
Token ID284
Feature activation+2.32e-01

Pos logit contributions
ww+1.439
lor+1.340
burn+1.314
 subs+1.265
 tou+1.206
Neg logit contributions
 questions-1.346
years-1.335
ruits-1.262
year-1.223
nam-1.197
Token reached
Token ID4251
Feature activation+1.35e-02

Pos logit contributions
years+0.283
 questions+0.282
ruits+0.269
year+0.259
nam+0.249
Neg logit contributions
ww-0.334
burn-0.313
lor-0.311
 subs-0.301
 tou-0.286
Token,
Token ID11
Feature activation-4.33e-01

Pos logit contributions
ww+0.024
burn+0.022
lor+0.022
 subs+0.021
 tou+0.020
Neg logit contributions
years-0.021
 questions-0.021
ruits-0.020
year-0.020
nam-0.019
Token but
Token ID475
Feature activation+2.64e-01

Pos logit contributions
years+0.516
 questions+0.501
ruits+0.473
year+0.463
nam+0.436
Neg logit contributions
ww-0.930
burn-0.887
lor-0.876
 subs-0.857
 chip-0.828
Token she
Token ID673
Feature activation-5.04e-02

Pos logit contributions
ww+0.420
lor+0.388
burn+0.387
 subs+0.367
 tou+0.349
Neg logit contributions
years-0.468
 questions-0.467
ruits-0.445
year-0.432
nam-0.426
Token still
Token ID991
Feature activation-4.56e-01

Pos logit contributions
years+0.077
 questions+0.077
ruits+0.073
year+0.070
nam+0.070
Neg logit contributions
ww-0.092
lor-0.086
burn-0.085
 subs-0.083
actus-0.079
Token couldn
Token ID3521
Feature activation+2.00e+00

Pos logit contributions
years+0.701
 questions+0.697
ruits+0.661
year+0.645
nam+0.632
Neg logit contributions
ww-0.821
burn-0.766
lor-0.761
 subs-0.745
actus-0.710
Token't
Token ID470
Feature activation+8.57e-02

Pos logit contributions
ww+3.507
burn+3.239
lor+3.238
actus+3.050
 subs+3.031
Neg logit contributions
 questions-3.239
years-3.148
ruits-3.055
 three-2.997
 One-2.954
Token touch
Token ID3638
Feature activation+6.47e-01

Pos logit contributions
ww+0.174
burn+0.165
lor+0.165
 subs+0.158
 tou+0.153
Neg logit contributions
years-0.114
 questions-0.112
ruits-0.106
year-0.102
nam-0.100
Token the
Token ID262
Feature activation+3.84e-01

Pos logit contributions
ww+1.114
lor+1.037
burn+1.025
 subs+0.978
actus+0.936
Neg logit contributions
 questions-1.075
years-1.068
ruits-1.013
nam-0.976
year-0.973
Token rainbow
Token ID27223
Feature activation+1.53e+00

Pos logit contributions
ww+0.578
lor+0.533
burn+0.531
 subs+0.500
actus+0.474
Neg logit contributions
years-0.711
 questions-0.710
ruits-0.679
year-0.658
nam-0.652
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
ww+3.107
lor+2.959
burn+2.829
 subs+2.754
actus+2.699
Neg logit contributions
 questions-2.083
years-2.014
ruits-1.947
nam-1.836
year-1.830
Token then
Token ID788
Feature activation-4.81e-01

Pos logit contributions
years+1.376
 questions+1.277
year+1.228
ruits+1.224
nam+1.136
Neg logit contributions
ww-2.626
burn-2.434
lor-2.404
 subs-2.350
 tou-2.265
Token,
Token ID11
Feature activation-4.69e-01

Pos logit contributions
years+0.688
 questions+0.660
ruits+0.650
year+0.626
nam+0.603
Neg logit contributions
ww-0.952
lor-0.883
burn-0.882
 subs-0.853
 tou-0.814
Token a
Token ID257
Feature activation+4.13e-01

Pos logit contributions
years+0.767
 questions+0.743
ruits+0.725
year+0.699
nam+0.663
Neg logit contributions
ww-0.828
burn-0.773
lor-0.760
 subs-0.730
 tou-0.705
Token voice
Token ID3809
Feature activation+4.93e-01

Pos logit contributions
ww+0.779
lor+0.733
burn+0.730
 subs+0.696
actus+0.669
Neg logit contributions
years-0.607
 questions-0.607
ruits-0.572
year-0.551
nam-0.544
Token called
Token ID1444
Feature activation+1.68e-01

Pos logit contributions
ww+0.832
lor+0.774
burn+0.773
 subs+0.728
actus+0.701
Neg logit contributions
 questions-0.829
years-0.826
ruits-0.783
year-0.763
nam-0.748
Token out
Token ID503
Feature activation+1.20e-01

Pos logit contributions
ww+0.311
lor+0.292
burn+0.290
 subs+0.278
 tou+0.267
Neg logit contributions
years-0.254
 questions-0.252
ruits-0.240
year-0.231
nam-0.227
Token behind
Token ID2157
Feature activation+2.31e-01

Pos logit contributions
ww+0.237
lor+0.223
burn+0.222
 subs+0.215
 tou+0.206
Neg logit contributions
years-0.166
 questions-0.163
ruits-0.156
nam-0.148
year-0.148
Token her
Token ID607
Feature activation+4.53e-01

Pos logit contributions
ww+0.442
lor+0.416
burn+0.414
 subs+0.396
 tou+0.380
Neg logit contributions
years-0.336
 questions-0.334
ruits-0.317
year-0.304
nam-0.301
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
ww+0.868
lor+0.811
burn+0.811
 subs+0.773
actus+0.742
Neg logit contributions
 questions-0.663
years-0.661
ruits-0.623
year-0.597
nam-0.591
Token It
Token ID632
Feature activation+1.34e+00

Pos logit contributions
ww+0.243
lor+0.228
burn+0.225
 subs+0.217
 tou+0.208
Neg logit contributions
years-0.199
 questions-0.197
ruits-0.188
year-0.179
nam-0.178
Token was
Token ID373
Feature activation+7.18e-01

Pos logit contributions
ww+2.335
burn+2.208
lor+2.165
actus+1.991
 subs+1.977
Neg logit contributions
 questions-2.252
years-2.180
year-2.036
ruits-2.030
 three-1.991
Token a
Token ID257
Feature activation+1.32e-01

Pos logit contributions
years+0.094
 questions+0.093
ruits+0.089
year+0.084
nam+0.082
Neg logit contributions
ww-0.157
lor-0.149
burn-0.149
 subs-0.143
 tou-0.138
Token great
Token ID1049
Feature activation-7.14e-01

Pos logit contributions
ww+0.206
lor+0.190
burn+0.190
 subs+0.179
 tou+0.171
Neg logit contributions
years-0.240
 questions-0.237
ruits-0.228
year-0.222
nam-0.217
Token book
Token ID1492
Feature activation+1.86e-01

Pos logit contributions
years+1.453
 questions+1.373
ruits+1.368
year+1.355
nam+1.322
Neg logit contributions
ww-1.002
burn-0.897
lor-0.895
 subs-0.841
 tou-0.813
Token.
Token ID13
Feature activation-1.35e-01

Pos logit contributions
ww+0.341
lor+0.321
burn+0.319
 subs+0.304
 tou+0.292
Neg logit contributions
years-0.284
 questions-0.283
ruits-0.268
year-0.258
nam-0.255
Token It
Token ID632
Feature activation+1.38e+00

Pos logit contributions
years+0.208
 questions+0.205
ruits+0.193
nam+0.188
year+0.187
Neg logit contributions
ww-0.246
lor-0.230
burn-0.228
 subs-0.220
 tou-0.212
Token was
Token ID373
Feature activation+3.67e-02

Pos logit contributions
ww+2.489
burn+2.351
lor+2.338
actus+2.129
 subs+2.124
Neg logit contributions
 questions-2.233
years-2.225
ruits-2.053
year-2.037
nam-2.014
Token colourful
Token ID44799
Feature activation-8.40e-02

Pos logit contributions
ww+0.075
burn+0.071
lor+0.070
 subs+0.068
 tou+0.066
Neg logit contributions
years-0.048
 questions-0.048
ruits-0.045
year-0.043
nam-0.042
Token and
Token ID290
Feature activation+7.72e-01

Pos logit contributions
years+0.113
 questions+0.111
ruits+0.106
year+0.101
nam+0.099
Neg logit contributions
ww-0.170
lor-0.161
burn-0.160
 subs-0.153
actus-0.148
Token it
Token ID340
Feature activation+1.69e+00

Pos logit contributions
ww+1.386
lor+1.316
burn+1.305
 subs+1.230
actus+1.198
Neg logit contributions
years-1.218
 questions-1.215
ruits-1.138
nam-1.121
year-1.112
Token even
Token ID772
Feature activation+1.29e-01

Pos logit contributions
ww+3.226
lor+3.057
burn+3.042
actus+2.765
 subs+2.723
Neg logit contributions
 questions-2.541
years-2.534
nam-2.334
year-2.322
ruits-2.318
Token had
Token ID550
Feature activation+3.63e-01

Pos logit contributions
ww+0.221
burn+0.204
lor+0.204
 subs+0.195
 tou+0.186
Neg logit contributions
years-0.213
 questions-0.212
ruits-0.203
year-0.195
nam-0.192
Token It
Token ID632
Feature activation+1.27e+00

Pos logit contributions
years+1.938
 questions+1.885
ruits+1.781
nam+1.771
year+1.769
Neg logit contributions
ww-1.794
lor-1.692
burn-1.623
 subs-1.545
 tou-1.528
Token is
Token ID318
Feature activation+5.91e-01

Pos logit contributions
ww+2.098
burn+1.991
lor+1.951
 subs+1.809
actus+1.746
Neg logit contributions
 questions-2.197
years-2.139
ruits-2.009
year-1.973
adies-1.940
Token just
Token ID655
Feature activation-4.61e-01

Pos logit contributions
ww+1.184
lor+1.118
burn+1.113
 subs+1.061
actus+1.029
Neg logit contributions
 questions-0.815
years-0.807
ruits-0.758
year-0.726
nam-0.722
Token dark
Token ID3223
Feature activation-1.52e+00

Pos logit contributions
years+0.631
 questions+0.605
ruits+0.592
year+0.566
nam+0.550
Neg logit contributions
ww-0.919
lor-0.857
burn-0.856
 subs-0.826
 tou-0.811
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
years+2.622
ruits+2.434
nam+2.428
 questions+2.425
year+2.345
Neg logit contributions
ww-2.589
burn-2.459
lor-2.402
 subs-2.236
actus-2.209
Token Don
Token ID2094
Feature activation+1.34e+00

Pos logit contributions
years+1.992
 questions+1.913
nam+1.838
ruits+1.818
year+1.799
Neg logit contributions
ww-1.966
lor-1.857
burn-1.782
 subs-1.681
 chip-1.669
Token't
Token ID470
Feature activation-4.46e-03

Pos logit contributions
ww+2.025
burn+1.902
lor+1.893
 subs+1.770
actus+1.737
Neg logit contributions
 questions-2.501
years-2.411
ruits-2.348
year-2.275
adies-2.230
Token be
Token ID307
Feature activation-6.37e-01

Pos logit contributions
years+0.007
 questions+0.007
ruits+0.007
year+0.006
nam+0.006
Neg logit contributions
ww-0.008
burn-0.008
lor-0.008
 subs-0.007
 tou-0.007
Token scared
Token ID12008
Feature activation-6.90e-01

Pos logit contributions
years+0.814
 questions+0.804
ruits+0.790
year+0.741
adies+0.717
Neg logit contributions
ww-1.323
burn-1.235
lor-1.198
 subs-1.185
 tou-1.157
Token."
Token ID526
Feature activation-3.77e-01

Pos logit contributions
years+1.174
 questions+1.093
ruits+1.079
year+1.076
nam+1.060
Neg logit contributions
ww-1.199
burn-1.106
lor-1.061
 subs-1.029
 chip-0.993
Token
Token ID198
Feature activation-8.14e-02

Pos logit contributions
years+0.757
 questions+0.725
ruits+0.709
nam+0.698
year+0.693
Neg logit contributions
ww-0.521
lor-0.482
burn-0.464
 subs-0.446
actus-0.431
Token his
Token ID465
Feature activation-6.41e-01

Pos logit contributions
years+2.153
 questions+2.083
ruits+1.992
adies+1.948
year+1.946
Neg logit contributions
ww-2.114
burn-1.890
lor-1.856
 subs-1.849
 chip-1.769
Token mom
Token ID1995
Feature activation+6.64e-02

Pos logit contributions
years+1.013
 questions+0.970
ruits+0.970
year+0.963
adies+0.919
Neg logit contributions
ww-1.132
burn-1.064
lor-1.047
 subs-1.032
actus-0.996
Token decided
Token ID3066
Feature activation-1.69e-01

Pos logit contributions
ww+0.105
burn+0.098
lor+0.097
 subs+0.093
actus+0.088
Neg logit contributions
years-0.118
 questions-0.117
ruits-0.112
year-0.109
nam-0.108
Token it
Token ID340
Feature activation+9.12e-01

Pos logit contributions
years+0.296
 questions+0.289
ruits+0.280
year+0.273
nam+0.267
Neg logit contributions
ww-0.272
lor-0.254
burn-0.251
 subs-0.240
 tou-0.228
Token was
Token ID373
Feature activation+4.54e-01

Pos logit contributions
ww+1.545
lor+1.442
burn+1.423
 subs+1.329
actus+1.305
Neg logit contributions
 questions-1.545
years-1.519
ruits-1.456
year-1.408
nam-1.373
Token too
Token ID1165
Feature activation+3.26e-01

Pos logit contributions
ww+0.832
lor+0.781
burn+0.775
 subs+0.738
actus+0.713
Neg logit contributions
 questions-0.699
years-0.696
ruits-0.660
year-0.636
nam-0.628
Token late
Token ID2739
Feature activation-3.83e-01

Pos logit contributions
ww+0.612
lor+0.572
burn+0.571
 subs+0.545
actus+0.524
Neg logit contributions
years-0.491
 questions-0.489
ruits-0.463
year-0.446
nam-0.440
Token.
Token ID13
Feature activation-3.92e-01

Pos logit contributions
years+0.638
 questions+0.614
ruits+0.597
year+0.578
nam+0.575
Neg logit contributions
ww-0.645
lor-0.610
burn-0.606
 subs-0.578
 chip-0.545
Token His
Token ID2399
Feature activation-1.10e+00

Pos logit contributions
years+0.595
 questions+0.579
ruits+0.544
year+0.533
nam+0.527
Neg logit contributions
ww-0.735
lor-0.684
burn-0.676
 subs-0.658
 tou-0.635
Token mom
Token ID1995
Feature activation-2.94e-01

Pos logit contributions
years+1.866
year+1.752
ruits+1.728
 questions+1.702
adies+1.679
Neg logit contributions
ww-1.859
burn-1.769
 subs-1.719
lor-1.683
 chip-1.631
Token said
Token ID531
Feature activation-1.41e+00

Pos logit contributions
 questions+0.523
years+0.521
ruits+0.501
nam+0.487
year+0.481
Neg logit contributions
ww-0.462
burn-0.432
lor-0.426
 subs-0.411
 chip-0.383
Token she
Token ID673
Feature activation+4.77e-02

Pos logit contributions
ww+1.501
burn+1.423
lor+1.397
 subs+1.305
actus+1.286
Neg logit contributions
 questions-1.625
years-1.589
ruits-1.505
year-1.465
nam-1.463
Token had
Token ID550
Feature activation+8.17e-01

Pos logit contributions
ww+0.086
lor+0.081
burn+0.079
 subs+0.077
actus+0.074
Neg logit contributions
years-0.074
 questions-0.074
ruits-0.071
nam-0.068
year-0.068
Token experienced
Token ID5924
Feature activation+3.23e-01

Pos logit contributions
ww+1.727
burn+1.629
lor+1.600
 subs+1.530
actus+1.502
Neg logit contributions
years-1.050
 questions-1.050
ruits-0.977
year-0.940
nam-0.910
Token.
Token ID13
Feature activation+7.68e-01

Pos logit contributions
ww+0.570
lor+0.533
burn+0.531
 subs+0.505
actus+0.484
Neg logit contributions
years-0.515
 questions-0.513
ruits-0.488
year-0.470
nam-0.465
Token
Token ID220
Feature activation+8.47e-01

Pos logit contributions
ww+1.379
burn+1.289
lor+1.265
 subs+1.217
actus+1.180
Neg logit contributions
 questions-1.231
years-1.214
ruits-1.152
year-1.114
nam-1.091
Token
Token ID198
Feature activation+9.64e-01

Pos logit contributions
ww+1.425
burn+1.317
lor+1.286
 subs+1.238
actus+1.193
Neg logit contributions
 questions-1.446
years-1.432
ruits-1.379
year-1.325
nam-1.299
Token
Token ID198
Feature activation+1.10e+00

Pos logit contributions
ww+1.625
burn+1.531
lor+1.471
 subs+1.441
actus+1.383
Neg logit contributions
 questions-1.644
years-1.624
ruits-1.530
year-1.510
nam-1.443
TokenLittle
Token ID22253
Feature activation+9.83e-01

Pos logit contributions
ww+1.950
burn+1.851
lor+1.778
 subs+1.751
actus+1.707
Neg logit contributions
 questions-1.769
years-1.746
year-1.617
ruits-1.615
adies-1.525
Token did
Token ID750
Feature activation+7.41e-01

Pos logit contributions
ww+1.386
burn+1.287
lor+1.279
 subs+1.186
actus+1.156
Neg logit contributions
 questions-1.925
years-1.887
ruits-1.804
year-1.757
adies-1.732
Token she
Token ID673
Feature activation+3.17e-01

Pos logit contributions
ww+1.477
burn+1.398
lor+1.381
 subs+1.326
actus+1.299
Neg logit contributions
 questions-1.021
years-1.007
ruits-0.941
year-0.906
nam-0.897
Token know
Token ID760
Feature activation-1.04e+00

Pos logit contributions
ww+0.625
lor+0.588
burn+0.587
 subs+0.561
actus+0.540
Neg logit contributions
years-0.440
 questions-0.438
ruits-0.413
year-0.396
nam-0.391
Token duck
Token ID22045
Feature activation-2.90e-01

Pos logit contributions
ww+1.090
lor+1.012
burn+0.971
 subs+0.903
actus+0.809
Neg logit contributions
 questions-2.033
years-2.005
ruits-1.949
 three-1.891
year-1.886
Token saw
Token ID2497
Feature activation-3.20e-01

Pos logit contributions
years+0.411
 questions+0.406
ruits+0.384
nam+0.375
year+0.371
Neg logit contributions
ww-0.570
burn-0.537
lor-0.532
 subs-0.514
actus-0.503
Token that
Token ID326
Feature activation-1.97e-02

Pos logit contributions
years+0.387
 questions+0.382
ruits+0.365
year+0.341
nam+0.336
Neg logit contributions
ww-0.700
burn-0.656
lor-0.650
 subs-0.628
 tou-0.615
Token the
Token ID262
Feature activation+3.66e-01

Pos logit contributions
years+0.031
 questions+0.030
ruits+0.029
year+0.028
nam+0.027
Neg logit contributions
ww-0.036
burn-0.034
lor-0.033
 subs-0.032
 tou-0.031
Token river
Token ID7850
Feature activation+3.44e-01

Pos logit contributions
ww+0.471
lor+0.427
burn+0.425
 subs+0.396
actus+0.372
Neg logit contributions
years-0.764
 questions-0.763
ruits-0.734
year-0.714
nam-0.708
Token was
Token ID373
Feature activation-2.51e-02

Pos logit contributions
ww+0.530
lor+0.489
burn+0.487
 subs+0.460
actus+0.437
Neg logit contributions
years-0.626
 questions-0.624
ruits-0.597
year-0.579
nam-0.573
Token icy
Token ID30284
Feature activation+1.08e+00

Pos logit contributions
years+0.040
 questions+0.040
ruits+0.038
year+0.037
nam+0.036
Neg logit contributions
ww-0.044
burn-0.041
lor-0.041
 subs-0.040
 tou-0.038
Token.
Token ID13
Feature activation+7.87e-01

Pos logit contributions
ww+2.273
lor+2.211
burn+2.131
 subs+2.078
 tou+1.990
Neg logit contributions
 questions-1.370
years-1.327
ruits-1.264
year-1.218
 three-1.179
Token The
Token ID383
Feature activation+8.49e-03

Pos logit contributions
ww+1.739
burn+1.654
lor+1.645
 subs+1.578
actus+1.523
Neg logit contributions
 questions-0.926
years-0.913
ruits-0.861
year-0.811
nam-0.805
Token duck
Token ID22045
Feature activation-7.01e-01

Pos logit contributions
ww+0.009
burn+0.008
lor+0.008
 subs+0.007
 tou+0.007
Neg logit contributions
years-0.020
 questions-0.020
ruits-0.019
year-0.019
nam-0.018
Token didn
Token ID1422
Feature activation+1.29e+00

Pos logit contributions
years+1.056
 questions+1.046
ruits+1.012
nam+0.973
year+0.962
Neg logit contributions
ww-1.322
burn-1.215
lor-1.197
 subs-1.181
actus-1.162
Token and
Token ID290
Feature activation+4.05e-01

Pos logit contributions
ww+0.006
lor+0.006
burn+0.006
 subs+0.006
actus+0.005
Neg logit contributions
years-0.004
 questions-0.004
ruits-0.004
year-0.004
nam-0.004
Token went
Token ID1816
Feature activation+1.01e+00

Pos logit contributions
ww+0.706
lor+0.660
burn+0.658
 subs+0.623
actus+0.598
Neg logit contributions
years-0.659
 questions-0.658
ruits-0.624
year-0.603
nam-0.596
Token back
Token ID736
Feature activation-1.30e-01

Pos logit contributions
ww+1.791
lor+1.652
burn+1.615
 subs+1.531
actus+1.476
Neg logit contributions
 questions-1.745
years-1.702
ruits-1.583
year-1.570
nam-1.527
Token inside
Token ID2641
Feature activation-1.87e-01

Pos logit contributions
years+0.232
 questions+0.228
ruits+0.223
year+0.214
nam+0.213
Neg logit contributions
ww-0.204
burn-0.192
lor-0.188
 subs-0.181
 tou-0.173
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
years+0.301
 questions+0.295
ruits+0.287
year+0.275
nam+0.272
Neg logit contributions
ww-0.317
burn-0.299
lor-0.298
 subs-0.288
 tou-0.271
Token Suddenly
Token ID24975
Feature activation-4.19e-02

Pos logit contributions
ww+0.455
lor+0.426
burn+0.424
 subs+0.406
actus+0.390
Neg logit contributions
years-0.359
 questions-0.358
ruits-0.338
year-0.326
nam-0.322
Token,
Token ID11
Feature activation+3.71e-01

Pos logit contributions
years+0.057
 questions+0.055
ruits+0.053
year+0.051
nam+0.050
Neg logit contributions
ww-0.086
lor-0.080
burn-0.080
 subs-0.077
 tou-0.074
Token the
Token ID262
Feature activation-2.99e-03

Pos logit contributions
ww+0.560
lor+0.516
burn+0.514
 subs+0.484
actus+0.460
Neg logit contributions
years-0.693
 questions-0.692
ruits-0.662
year-0.642
nam-0.636
Token tank
Token ID6873
Feature activation+8.56e-01

Pos logit contributions
years+0.005
 questions+0.005
ruits+0.005
year+0.005
nam+0.005
Neg logit contributions
ww-0.005
burn-0.005
lor-0.005
 subs-0.004
actus-0.004
Token appeared
Token ID4120
Feature activation-6.41e-01

Pos logit contributions
ww+1.466
burn+1.354
lor+1.354
 subs+1.237
actus+1.209
Neg logit contributions
 questions-1.435
years-1.425
ruits-1.350
year-1.314
nam-1.283
Token again
Token ID757
Feature activation+8.48e-01

Pos logit contributions
years+0.986
 questions+0.981
ruits+0.952
nam+0.909
year+0.868
Neg logit contributions
ww-1.157
burn-1.117
lor-1.103
 subs-1.084
 tou-1.015
Token.
Token ID13
Feature activation-3.43e-01

Pos logit contributions
ww+1.727
lor+1.624
burn+1.603
 subs+1.525
actus+1.500
Neg logit contributions
 questions-1.218
years-1.192
ruits-1.130
year-1.082
nam-1.041
Token Peter
Token ID5613
Feature activation+5.43e-01

Pos logit contributions
years+0.519
 questions+0.510
ruits+0.490
nam+0.473
year+0.472
Neg logit contributions
ww-0.627
lor-0.592
burn-0.580
 subs-0.568
 chip-0.541
Token punched
Token ID25436
Feature activation-8.64e-01

Pos logit contributions
ww+0.970
lor+0.906
burn+0.903
 subs+0.853
actus+0.824
Neg logit contributions
years-0.857
 questions-0.857
ruits-0.809
year-0.782
nam-0.769
Token Sam
Token ID3409
Feature activation+3.64e-02

Pos logit contributions
years+1.362
ruits+1.334
 questions+1.295
year+1.268
nam+1.255
Neg logit contributions
ww-1.538
burn-1.404
lor-1.368
 subs-1.365
 chip-1.310
Token right
Token ID826
Feature activation-3.15e-02

Pos logit contributions
ww+0.068
burn+0.063
lor+0.063
 subs+0.061
 tou+0.058
Neg logit contributions
years-0.055
 questions-0.054
ruits-0.052
year-0.050
nam-0.049
Token back
Token ID736
Feature activation-5.38e-01

Pos logit contributions
years+0.058
 questions+0.058
ruits+0.056
nam+0.054
year+0.054
Neg logit contributions
ww-0.048
burn-0.044
lor-0.044
 subs-0.042
actus-0.039
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
years+0.820
 questions+0.784
ruits+0.773
year+0.747
nam+0.740
Neg logit contributions
ww-0.989
burn-0.932
lor-0.922
 subs-0.896
 chip-0.849
Token Sam
Token ID3409
Feature activation+5.39e-01

Pos logit contributions
ww+2.195
lor+2.112
burn+2.044
actus+1.932
 subs+1.902
Neg logit contributions
 questions-1.715
years-1.704
ruits-1.553
nam-1.540
year-1.531
Token gave
Token ID2921
Feature activation+2.21e-01

Pos logit contributions
ww+0.955
lor+0.892
burn+0.887
 subs+0.837
actus+0.812
Neg logit contributions
years-0.858
 questions-0.857
ruits-0.809
year-0.782
nam-0.771
Token up
Token ID510
Feature activation+1.98e-02

Pos logit contributions
ww+0.489
burn+0.463
lor+0.463
 subs+0.445
 tou+0.429
Neg logit contributions
years-0.254
 questions-0.252
ruits-0.237
year-0.224
nam-0.220
Token.
Token ID13
Feature activation+3.17e-01

Pos logit contributions
ww+0.036
lor+0.033
burn+0.033
 subs+0.032
 tou+0.030
Neg logit contributions
years-0.031
 questions-0.030
ruits-0.029
year-0.028
nam-0.028
Token park
Token ID3952
Feature activation+3.11e-02

Pos logit contributions
ww+0.107
lor+0.099
burn+0.099
 subs+0.094
actus+0.090
Neg logit contributions
years-0.117
 questions-0.116
ruits-0.112
year-0.108
nam-0.106
Token.
Token ID13
Feature activation-8.84e-02

Pos logit contributions
ww+0.057
lor+0.054
burn+0.054
 subs+0.051
 tou+0.049
Neg logit contributions
years-0.047
 questions-0.047
ruits-0.045
year-0.043
nam-0.042
Token
Token ID220
Feature activation+2.73e-01

Pos logit contributions
years+0.155
 questions+0.154
ruits+0.147
year+0.142
nam+0.141
Neg logit contributions
ww-0.143
lor-0.133
burn-0.131
 subs-0.125
 tou-0.118
Token
Token ID198
Feature activation+1.80e-01

Pos logit contributions
ww+0.425
lor+0.394
burn+0.391
 subs+0.370
actus+0.352
Neg logit contributions
years-0.492
 questions-0.490
ruits-0.468
year-0.454
nam-0.450
Token
Token ID198
Feature activation-3.33e-01

Pos logit contributions
ww+0.280
lor+0.260
burn+0.257
 subs+0.243
actus+0.231
Neg logit contributions
years-0.327
 questions-0.326
ruits-0.312
year-0.301
nam-0.301
TokenJohn
Token ID7554
Feature activation-1.78e-01

Pos logit contributions
years+0.464
 questions+0.464
ruits+0.442
nam+0.424
year+0.405
Neg logit contributions
ww-0.668
lor-0.632
burn-0.612
 subs-0.589
 tou-0.569
Token promised
Token ID8072
Feature activation+7.78e-01

Pos logit contributions
years+0.295
 questions+0.295
ruits+0.284
nam+0.275
year+0.272
Neg logit contributions
ww-0.303
lor-0.284
burn-0.283
 subs-0.270
actus-0.257
Token,
Token ID11
Feature activation+5.15e-01

Pos logit contributions
ww+1.536
burn+1.431
lor+1.429
 subs+1.363
actus+1.340
Neg logit contributions
 questions-1.109
years-1.084
ruits-1.022
year-0.983
nam-0.965
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
ww+1.068
lor+1.007
burn+1.004
 subs+0.961
actus+0.935
Neg logit contributions
 questions-0.670
years-0.666
ruits-0.623
year-0.598
nam-0.589
Token he
Token ID339
Feature activation+1.15e+00

Pos logit contributions
ww+2.025
lor+1.914
burn+1.909
 subs+1.802
actus+1.800
Neg logit contributions
 questions-1.471
years-1.460
ruits-1.357
year-1.344
nam-1.316
Token has
Token ID468
Feature activation-1.37e-01

Pos logit contributions
ww+2.192
lor+2.040
burn+2.038
 subs+1.918
actus+1.877
Neg logit contributions
 questions-1.697
years-1.682
ruits-1.568
year-1.554
 three-1.472
TokenAnna
Token ID31160
Feature activation+6.06e-01

Pos logit contributions
ww+1.132
burn+1.054
lor+1.045
 subs+0.992
actus+0.943
Neg logit contributions
years-1.371
 questions-1.366
ruits-1.303
year-1.274
nam-1.259
Token,
Token ID11
Feature activation-3.22e-01

Pos logit contributions
ww+1.309
lor+1.242
burn+1.240
 subs+1.188
actus+1.154
Neg logit contributions
 questions-0.731
years-0.723
ruits-0.680
year-0.637
nam-0.624
Token come
Token ID1282
Feature activation-1.82e-01

Pos logit contributions
years+0.531
 questions+0.511
ruits+0.495
year+0.490
nam+0.474
Neg logit contributions
ww-0.567
lor-0.522
burn-0.518
 subs-0.495
 tou-0.482
Token here
Token ID994
Feature activation-6.33e-01

Pos logit contributions
years+0.295
 questions+0.293
ruits+0.281
nam+0.271
year+0.270
Neg logit contributions
ww-0.320
burn-0.302
lor-0.298
 subs-0.284
 tou-0.276
Token.
Token ID13
Feature activation-8.98e-01

Pos logit contributions
years+1.083
 questions+1.027
nam+0.998
ruits+0.997
year+0.986
Neg logit contributions
ww-1.062
lor-0.995
burn-0.986
 subs-0.936
 chip-0.895
Token I
Token ID314
Feature activation-3.35e-01

Pos logit contributions
years+1.655
 questions+1.618
ruits+1.542
year+1.517
nam+1.491
Neg logit contributions
ww-1.419
lor-1.322
burn-1.264
 subs-1.198
 tou-1.191
Token have
Token ID423
Feature activation+2.82e-01

Pos logit contributions
years+0.513
 questions+0.497
ruits+0.472
nam+0.469
year+0.457
Neg logit contributions
ww-0.629
burn-0.587
lor-0.585
 subs-0.567
 tou-0.547
Token something
Token ID1223
Feature activation+1.92e-01

Pos logit contributions
ww+0.605
lor+0.571
burn+0.569
 subs+0.546
actus+0.528
Neg logit contributions
years-0.346
 questions-0.345
ruits-0.323
year-0.309
nam-0.302
Token for
Token ID329
Feature activation-6.58e-01

Pos logit contributions
ww+0.358
lor+0.334
burn+0.332
 subs+0.317
actus+0.306
Neg logit contributions
years-0.290
 questions-0.287
ruits-0.273
year-0.264
nam-0.259
Token you
Token ID345
Feature activation-1.75e+00

Pos logit contributions
years+1.369
 questions+1.345
ruits+1.311
year+1.308
nam+1.262
Neg logit contributions
ww-0.836
lor-0.744
burn-0.730
 subs-0.667
 tou-0.653
Token."
Token ID526
Feature activation+1.55e-01

Pos logit contributions
years+3.428
year+3.198
nam+3.173
 questions+3.078
ruits+3.029
Neg logit contributions
ww-2.535
lor-2.372
burn-2.308
 subs-2.090
actus-2.077
Token"
Token ID1
Feature activation+4.05e-01

Pos logit contributions
 questions+0.299
years+0.299
ruits+0.281
nam+0.274
year+0.270
Neg logit contributions
ww-0.318
lor-0.289
burn-0.284
 subs-0.273
 tou-0.262
TokenI
Token ID40
Feature activation-1.43e+00

Pos logit contributions
ww+0.556
lor+0.509
burn+0.507
 subs+0.474
actus+0.448
Neg logit contributions
years-0.814
 questions-0.812
ruits-0.780
year-0.759
nam-0.752
Token don
Token ID836
Feature activation+7.90e-01

Pos logit contributions
years+2.160
 questions+2.011
nam+1.965
ruits+1.874
year+1.859
Neg logit contributions
ww-2.892
 subs-2.609
burn-2.606
lor-2.578
 tou-2.517
Token't
Token ID470
Feature activation-1.57e+00

Pos logit contributions
ww+1.130
lor+1.038
burn+1.033
 subs+0.975
actus+0.929
Neg logit contributions
 questions-1.522
years-1.521
ruits-1.457
year-1.421
nam-1.400
Token know
Token ID760
Feature activation-1.81e+00

Pos logit contributions
years+1.791
 questions+1.604
ruits+1.574
year+1.541
nam+1.532
Neg logit contributions
ww-3.697
burn-3.335
 tou-3.270
 subs-3.215
lor-3.148
Token.
Token ID13
Feature activation-2.44e+00

Pos logit contributions
years+3.508
year+3.270
 questions+3.171
ruits+3.103
nam+2.969
Neg logit contributions
ww-2.861
lor-2.607
burn-2.500
 chip-2.413
 subs-2.383
Token Maybe
Token ID6674
Feature activation-1.35e+00

Pos logit contributions
years+4.359
 questions+4.255
year+4.041
ruits+4.001
nam+3.854
Neg logit contributions
ww-4.036
 chip-3.795
lor-3.771
burn-3.528
 tou-3.442
Token it
Token ID340
Feature activation+2.08e-01

Pos logit contributions
years+2.914
year+2.738
 questions+2.736
ruits+2.710
racted+2.514
Neg logit contributions
ww-1.746
burn-1.520
lor-1.489
 tou-1.429
 chip-1.409
Token is
Token ID318
Feature activation+3.32e-01

Pos logit contributions
ww+0.304
lor+0.279
burn+0.278
 subs+0.262
actus+0.248
Neg logit contributions
years-0.398
 questions-0.395
ruits-0.379
year-0.369
nam-0.366
Token a
Token ID257
Feature activation+8.42e-01

Pos logit contributions
ww+0.681
lor+0.641
burn+0.640
 subs+0.613
actus+0.591
Neg logit contributions
years-0.437
 questions-0.435
ruits-0.408
year-0.391
nam-0.385
Token toy
Token ID13373
Feature activation-1.04e-01

Pos logit contributions
ww+1.336
lor+1.269
burn+1.244
 subs+1.180
actus+1.125
Neg logit contributions
 questions-1.485
years-1.450
ruits-1.397
nam-1.352
year-1.344
Token toys
Token ID14958
Feature activation-7.08e-01

Pos logit contributions
years+2.662
year+2.527
 questions+2.399
ruits+2.352
adies+2.298
Neg logit contributions
ww-3.086
burn-2.873
 tou-2.795
actus-2.763
 chip-2.758
Token with
Token ID351
Feature activation-5.91e-01

Pos logit contributions
years+1.151
 questions+1.114
ruits+1.064
year+1.064
adies+1.036
Neg logit contributions
ww-1.235
burn-1.157
lor-1.133
 chip-1.090
 subs-1.085
Token his
Token ID465
Feature activation-3.38e-01

Pos logit contributions
years+0.787
 questions+0.770
ruits+0.737
year+0.735
adies+0.673
Neg logit contributions
ww-1.199
burn-1.130
lor-1.103
 subs-1.080
 tou-1.054
Token friend
Token ID1545
Feature activation-4.28e-01

Pos logit contributions
years+0.593
 questions+0.575
year+0.562
ruits+0.557
adies+0.533
Neg logit contributions
ww-0.550
burn-0.516
lor-0.499
 subs-0.490
 tou-0.470
Token.
Token ID13
Feature activation-4.28e-01

Pos logit contributions
years+0.626
 questions+0.604
year+0.585
ruits+0.579
nam+0.549
Neg logit contributions
ww-0.818
burn-0.780
lor-0.752
 subs-0.739
 chip-0.718
Token They
Token ID1119
Feature activation-2.27e+00

Pos logit contributions
years+0.576
 questions+0.554
ruits+0.527
year+0.520
nam+0.501
Neg logit contributions
ww-0.874
burn-0.810
lor-0.802
 subs-0.793
 chip-0.777
Token started
Token ID2067
Feature activation-3.02e-01

Pos logit contributions
ruits+4.082
 questions+3.995
years+3.918
adies+3.882
year+3.881
Neg logit contributions
ww-3.490
burn-3.394
 subs-3.350
 tou-3.077
 pel-3.028
Token to
Token ID284
Feature activation-2.01e+00

Pos logit contributions
years+0.549
 questions+0.543
ruits+0.531
year+0.517
adies+0.505
Neg logit contributions
ww-0.453
burn-0.435
lor-0.410
 subs-0.408
 chip-0.385
Token fight
Token ID1907
Feature activation-6.87e-01

Pos logit contributions
years+3.464
year+3.363
adies+3.181
ruits+3.166
 questions+3.108
Neg logit contributions
burn-3.627
ww-3.420
 chip-3.285
 subs-3.272
 tou-3.151
Token over
Token ID625
Feature activation-2.16e+00

Pos logit contributions
years+1.173
 questions+1.098
ruits+1.094
year+1.059
nam+1.046
Neg logit contributions
ww-1.159
burn-1.081
 subs-1.041
lor-1.013
 chip-0.992
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
years+3.816
ruits+3.685
year+3.678
 questions+3.584
adies+3.562
Neg logit contributions
ww-3.396
burn-3.309
 pel-3.131
 chip-3.098
 tou-3.078
Token the
Token ID262
Feature activation-3.85e-01

Pos logit contributions
ww+2.287
lor+2.168
burn+2.119
 subs+2.051
actus+1.973
Neg logit contributions
 questions-1.700
years-1.648
ruits-1.564
nam-1.485
year-1.477
Token bus
Token ID1323
Feature activation-5.86e-02

Pos logit contributions
years+0.806
 questions+0.781
ruits+0.755
year+0.753
nam+0.731
Neg logit contributions
ww-0.516
burn-0.462
lor-0.456
actus-0.429
 subs-0.424
Token.
Token ID13
Feature activation+8.37e-01

Pos logit contributions
years+0.091
 questions+0.090
ruits+0.086
year+0.083
nam+0.083
Neg logit contributions
ww-0.105
lor-0.099
burn-0.098
 subs-0.094
 tou-0.090
Token
Token ID198
Feature activation+2.73e-01

Pos logit contributions
ww+1.635
burn+1.532
lor+1.530
 subs+1.463
actus+1.408
Neg logit contributions
years-1.199
 questions-1.197
ruits-1.137
year-1.095
nam-1.066
Token
Token ID198
Feature activation-3.15e-01

Pos logit contributions
ww+0.470
lor+0.438
burn+0.435
 subs+0.413
actus+0.396
Neg logit contributions
years-0.450
 questions-0.449
ruits-0.427
year-0.412
nam-0.409
TokenThe
Token ID464
Feature activation-2.22e+00

Pos logit contributions
 questions+0.549
years+0.543
ruits+0.523
nam+0.504
year+0.492
Neg logit contributions
ww-0.523
lor-0.483
burn-0.473
 subs-0.447
 tou-0.427
Token moral
Token ID6573
Feature activation+9.42e-01

Pos logit contributions
years+4.298
ruits+3.965
nam+3.922
year+3.796
 questions+3.795
Neg logit contributions
ww-3.606
 chip-3.257
actus-3.194
burn-3.161
 subs-3.061
Token of
Token ID286
Feature activation+8.82e-01

Pos logit contributions
ww+1.827
burn+1.772
lor+1.741
 subs+1.675
actus+1.610
Neg logit contributions
 questions-1.358
years-1.324
ruits-1.254
year-1.241
 three-1.148
Token the
Token ID262
Feature activation-3.46e-01

Pos logit contributions
ww+1.657
lor+1.561
burn+1.555
 subs+1.503
actus+1.425
Neg logit contributions
years-1.331
 questions-1.329
ruits-1.244
year-1.217
nam-1.178
Token story
Token ID1621
Feature activation+6.84e-01

Pos logit contributions
years+0.733
 questions+0.705
ruits+0.691
nam+0.683
year+0.678
Neg logit contributions
ww-0.456
lor-0.399
burn-0.398
 subs-0.378
actus-0.362
Token is
Token ID318
Feature activation+1.56e+00

Pos logit contributions
ww+0.975
burn+0.910
lor+0.895
 subs+0.841
actus+0.797
Neg logit contributions
 questions-1.333
years-1.311
ruits-1.275
year-1.233
nam-1.205
Token<|endoftext|>
Token ID50256
Feature activation+1.47e+00

Pos logit contributions
ww+2.038
lor+1.926
burn+1.918
 subs+1.831
 tou+1.763
Neg logit contributions
 questions-1.399
years-1.387
ruits-1.338
year-1.265
nam-1.227
TokenOnce
Token ID7454
Feature activation-1.99e-01

Pos logit contributions
ww+2.785
burn+2.579
lor+2.476
 subs+2.435
 tou+2.311
Neg logit contributions
 questions-2.273
years-2.189
ruits-2.130
year-2.086
 three-2.029
Token upon
Token ID2402
Feature activation-1.02e+00

Pos logit contributions
years+0.139
 questions+0.131
ruits+0.115
year+0.109
nam+0.106
Neg logit contributions
ww-0.529
burn-0.505
lor-0.503
 subs-0.491
actus-0.480
Token a
Token ID257
Feature activation-2.72e-01

Pos logit contributions
years+1.403
 questions+1.367
ruits+1.309
year+1.198
nam+1.180
Neg logit contributions
ww-2.063
burn-1.963
lor-1.932
 subs-1.831
 tou-1.818
Token time
Token ID640
Feature activation-7.36e-01

Pos logit contributions
years+0.481
 questions+0.470
ruits+0.442
nam+0.442
year+0.438
Neg logit contributions
ww-0.452
burn-0.411
lor-0.410
 subs-0.388
actus-0.372
Token,
Token ID11
Feature activation-1.99e+00

Pos logit contributions
years+1.108
 questions+1.086
year+1.014
ruits+1.004
nam+0.985
Neg logit contributions
ww-1.414
lor-1.304
burn-1.285
 subs-1.255
 tou-1.187
Token there
Token ID612
Feature activation+6.98e-01

Pos logit contributions
 questions+3.339
years+3.318
ruits+3.000
racted+2.977
year+2.814
Neg logit contributions
ww-3.681
burn-3.300
 subs-3.132
 tou-3.074
lor-3.069
Token was
Token ID373
Feature activation-9.41e-01

Pos logit contributions
ww+1.107
lor+1.024
burn+1.023
 subs+0.954
actus+0.914
Neg logit contributions
 questions-1.240
years-1.236
ruits-1.191
year-1.148
nam-1.126
Token a
Token ID257
Feature activation-1.16e-02

Pos logit contributions
 questions+1.251
years+1.220
ruits+1.164
year+1.063
nam+1.059
Neg logit contributions
ww-1.979
burn-1.846
lor-1.760
 subs-1.712
 tou-1.706
Token little
Token ID1310
Feature activation+7.52e-01

Pos logit contributions
years+0.017
 questions+0.017
ruits+0.016
year+0.015
nam+0.015
Neg logit contributions
ww-0.023
burn-0.021
lor-0.021
 subs-0.020
actus-0.020
Token frog
Token ID21264
Feature activation-3.50e-02

Pos logit contributions
ww+1.189
lor+1.156
burn+1.110
 subs+1.069
 tou+0.994
Neg logit contributions
 questions-1.316
years-1.312
ruits-1.250
year-1.213
nam-1.209
Token<|endoftext|>
Token ID50256
Feature activation+1.29e+00

Pos logit contributions
ww+2.145
burn+2.031
lor+1.989
 subs+1.908
 tou+1.813
Neg logit contributions
 questions-1.823
years-1.823
ruits-1.739
year-1.683
nam-1.631
TokenOnce
Token ID7454
Feature activation-4.42e-01

Pos logit contributions
ww+2.348
burn+2.187
lor+2.105
 subs+2.048
 tou+1.959
Neg logit contributions
 questions-2.058
years-2.009
ruits-1.937
year-1.895
 three-1.839
Token upon
Token ID2402
Feature activation-1.51e+00

Pos logit contributions
years+0.306
 questions+0.283
ruits+0.246
year+0.238
nam+0.232
Neg logit contributions
ww-1.179
burn-1.120
lor-1.120
 subs-1.095
actus-1.073
Token a
Token ID257
Feature activation-4.03e-01

Pos logit contributions
years+2.271
 questions+2.180
ruits+2.136
year+1.931
nam+1.902
Neg logit contributions
ww-2.899
lor-2.726
burn-2.718
 subs-2.535
 tou-2.526
Token time
Token ID640
Feature activation-1.15e+00

Pos logit contributions
years+0.729
 questions+0.706
nam+0.670
ruits+0.666
year+0.666
Neg logit contributions
ww-0.662
burn-0.596
lor-0.595
 subs-0.563
actus-0.538
Token,
Token ID11
Feature activation-2.16e+00

Pos logit contributions
years+1.858
 questions+1.797
year+1.711
ruits+1.675
nam+1.658
Neg logit contributions
ww-2.119
lor-1.914
 subs-1.855
burn-1.854
 chip-1.742
Token there
Token ID612
Feature activation+5.53e-01

Pos logit contributions
 questions+3.763
years+3.721
ruits+3.392
racted+3.303
year+3.193
Neg logit contributions
ww-3.982
burn-3.401
 subs-3.305
 tou-3.265
lor-3.233
Token was
Token ID373
Feature activation-9.46e-01

Pos logit contributions
ww+0.857
burn+0.792
lor+0.791
 subs+0.740
actus+0.706
Neg logit contributions
 questions-1.003
years-1.002
ruits-0.961
year-0.929
nam-0.915
Token a
Token ID257
Feature activation-1.05e-01

Pos logit contributions
 questions+1.264
years+1.233
ruits+1.185
year+1.080
nam+1.068
Neg logit contributions
ww-1.981
burn-1.844
lor-1.760
 subs-1.714
 chip-1.713
Token big
Token ID1263
Feature activation-6.90e-03

Pos logit contributions
years+0.153
 questions+0.151
ruits+0.144
year+0.137
nam+0.134
Neg logit contributions
ww-0.206
burn-0.190
lor-0.187
 subs-0.181
 tou-0.178
Token party
Token ID2151
Feature activation+1.15e+00

Pos logit contributions
years+0.012
 questions+0.012
ruits+0.012
year+0.012
nam+0.011
Neg logit contributions
ww-0.011
burn-0.010
lor-0.010
 subs-0.009
actus-0.009