TOP ACTIVATIONS
MAX = 3.969

played basketball together until the sun went down . <|endoftext|> Once
in the park until the sun went down . <|endoftext|> Once
continued to play until the sun went down . It was
in the park until the sun came out and the fog
slide all day until the sun went down . They had
nice !" The lady smiled and said , " I 'm
that they played until the sun went down . From that
they all played until the sun went down . They were
laughing and playing until the sun set and it was time
you okay , my d arl ings ?" she asked .
. The little girl smiled and went on her way .
around the garden until the sun went down . They became
flew by and soon the sun went down and it was
the k ite until the sun went down . The end
." The sw an smiled and sw am away .
L ily smiled and hugged them . She said
ball all day until the sun went down . Max was
y flew away , happy and clean . <|endoftext|> Once upon
and free . Lily smiled and laughed . She loved to
they played together until the sun came out and the beach

MIN ACTIVATIONS
MIN = -3.944

surprise for you . Do you want to know what it
has many things inside . They see a candle , a
stories with them . Do you want to try ?"
try something different . He asked his farm friends to help
blocks in the basement . They are very happy . They
many shiny things inside . They are curious and want to
many modern toys inside . They are shiny and make noises
a lot of boxes . They want to see what is
fix your bike . Do you want me to try ?"
asks Tom , " Do you want ice cream ?"
box in the garage . They wonder what is inside .
made a plan . He asked his mom for help ,
with something else . Do you want to draw a picture
in the play room . They are very happy and start
toys in the park . They are very happy .
box in the yard . They want to see what is
real car on it . They are very excited . They
a new one . Do you want to help me ?"
see a big box . They want to know what is
box in the garage . They open the box and see

INTERVAL 1.991 - 3.969
CONTAINS 0.799%

. When she took a sip , she made a face
. He says , " This is my bowl . Go
flowers . Lily was so happy to see the flowers and
idea . " Let 's mail ourselves !" Lily
inside the church . " Don 't worry , we 'll

INTERVAL 0.012 - 1.991
CONTAINS 48.264%

Anna and Ben nodded . They said they would share
Thank you mom my , I love it so much !"
important to keep your promises and be kind to others -
fat soap . It was fun and y ummy and everyone
looks like one of the machines has gone on fire .

INTERVAL -1.966 - 0.012
CONTAINS 49.941%

cry . He yelled and b anged on the door ,
time there was an owner who had a fit number .
friends playing together . M ia smiled and
always be respectful , appreci ative and supportive of those around
" Ben , I want to go back ,"

INTERVAL -3.944 - -1.966
CONTAINS 0.996%

climb the tree and pick an olive just for fun .
a big jungle with lots of trees and animals . One
had a lot of cars , trucks , and trains ,
<|endoftext|> Once upon a time , there was a little girl
and knock them down . They also liked to pretend they
Token played
Token ID2826
Feature activation-1.10e-01

Pos logit contributions
 questions+0.499
years+0.486
ruits+0.469
year+0.438
adies+0.433
Neg logit contributions
ww-0.722
burn-0.678
lor-0.658
 subs-0.652
 tou-0.623
Token basketball
Token ID9669
Feature activation+1.13e+00

Pos logit contributions
years+0.175
 questions+0.172
ruits+0.166
year+0.159
nam+0.158
Neg logit contributions
ww-0.193
lor-0.180
burn-0.179
 subs-0.171
actus-0.164
Token together
Token ID1978
Feature activation+1.03e-01

Pos logit contributions
ww+2.236
lor+2.127
burn+2.113
 subs+2.003
 tou+1.953
Neg logit contributions
 questions-1.597
years-1.583
ruits-1.491
year-1.441
nam-1.386
Token until
Token ID1566
Feature activation+1.36e+00

Pos logit contributions
ww+0.211
lor+0.199
burn+0.199
 subs+0.191
actus+0.183
Neg logit contributions
years-0.136
 questions-0.135
ruits-0.127
year-0.122
nam-0.121
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
lor+2.370
ww+2.352
burn+2.195
actus+2.094
 subs+2.068
Neg logit contributions
years-2.177
 questions-2.163
nam-2.053
ruits-2.026
year-2.007
Token sun
Token ID4252
Feature activation+3.97e+00

Pos logit contributions
ww+1.916
lor+1.902
burn+1.775
 subs+1.638
actus+1.569
Neg logit contributions
 questions-2.613
years-2.519
ruits-2.420
 three-2.379
nam-2.367
Token went
Token ID1816
Feature activation+1.99e+00

Pos logit contributions
ww+10.204
lor+10.194
burn+8.729
rob+8.656
 hawk+8.462
Neg logit contributions
 three-4.025
 questions-3.845
 want-3.763
 liked-3.761
 anx-3.580
Token down
Token ID866
Feature activation+9.25e-01

Pos logit contributions
lor+4.130
ww+4.042
burn+3.654
 subs+3.589
actus+3.583
Neg logit contributions
 questions-2.721
years-2.643
nam-2.500
year-2.457
 three-2.420
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
ww+1.875
lor+1.776
burn+1.751
 subs+1.653
actus+1.624
Neg logit contributions
 questions-1.273
years-1.259
ruits-1.171
year-1.168
nam-1.107
Token<|endoftext|>
Token ID50256
Feature activation+1.45e+00

Pos logit contributions
ww+2.028
lor+1.938
burn+1.912
 subs+1.812
actus+1.760
Neg logit contributions
years-1.526
 questions-1.518
ruits-1.457
year-1.418
nam-1.369
TokenOnce
Token ID7454
Feature activation-3.66e-01

Pos logit contributions
ww+2.724
burn+2.519
lor+2.443
 subs+2.371
 tou+2.273
Neg logit contributions
 questions-2.236
years-2.170
ruits-2.105
year-2.054
 three-2.001
Token in
Token ID287
Feature activation+2.43e-01

Pos logit contributions
ww+0.640
lor+0.604
burn+0.603
 subs+0.578
actus+0.557
Neg logit contributions
years-0.394
 questions-0.393
ruits-0.368
year-0.352
nam-0.347
Token the
Token ID262
Feature activation+2.10e+00

Pos logit contributions
ww+0.431
burn+0.403
lor+0.402
 subs+0.383
 tou+0.366
Neg logit contributions
years-0.389
 questions-0.387
ruits-0.369
year-0.356
nam-0.350
Token park
Token ID3952
Feature activation+7.00e-02

Pos logit contributions
lor+3.711
ww+3.646
burn+3.542
 subs+3.282
ic+3.197
Neg logit contributions
 questions-3.544
years-3.273
 three-3.180
ruits-3.062
 ten-3.033
Token until
Token ID1566
Feature activation+1.22e+00

Pos logit contributions
ww+0.131
burn+0.123
lor+0.123
 subs+0.118
 tou+0.113
Neg logit contributions
years-0.104
 questions-0.103
ruits-0.098
year-0.094
nam-0.093
Token the
Token ID262
Feature activation+1.29e+00

Pos logit contributions
ww+2.117
lor+2.092
burn+1.961
actus+1.857
 subs+1.849
Neg logit contributions
years-1.974
 questions-1.965
ruits-1.857
nam-1.834
year-1.805
Token sun
Token ID4252
Feature activation+3.95e+00

Pos logit contributions
ww+1.721
lor+1.698
burn+1.578
 subs+1.460
actus+1.355
Neg logit contributions
 questions-2.649
years-2.537
ruits-2.477
 three-2.399
year-2.364
Token went
Token ID1816
Feature activation+1.80e+00

Pos logit contributions
ww+10.472
lor+10.029
rob+8.711
burn+8.576
www+8.499
Neg logit contributions
 three-3.820
 liked-3.733
 questions-3.661
 counted-3.529
 want-3.524
Token down
Token ID866
Feature activation+8.27e-01

Pos logit contributions
lor+3.579
ww+3.562
burn+3.211
actus+3.142
 subs+3.105
Neg logit contributions
 questions-2.585
years-2.510
nam-2.390
year-2.318
ruits-2.303
Token.
Token ID13
Feature activation+6.07e-01

Pos logit contributions
ww+1.632
lor+1.538
burn+1.520
 subs+1.432
actus+1.407
Neg logit contributions
 questions-1.181
years-1.172
ruits-1.093
year-1.083
nam-1.039
Token<|endoftext|>
Token ID50256
Feature activation+1.09e+00

Pos logit contributions
ww+1.070
lor+1.005
burn+1.003
 subs+0.950
actus+0.910
Neg logit contributions
years-0.969
 questions-0.969
ruits-0.928
year-0.894
nam-0.874
TokenOnce
Token ID7454
Feature activation-2.23e-01

Pos logit contributions
ww+1.932
burn+1.779
lor+1.751
 subs+1.688
 tou+1.601
Neg logit contributions
 questions-1.766
years-1.727
ruits-1.672
year-1.622
 three-1.562
Token continued
Token ID3767
Feature activation+1.53e+00

Pos logit contributions
 questions+0.213
years+0.210
ruits+0.202
year+0.190
nam+0.187
Neg logit contributions
ww-0.315
burn-0.297
lor-0.292
 subs-0.286
 tou-0.274
Token to
Token ID284
Feature activation-7.51e-02

Pos logit contributions
ww+3.181
lor+3.070
burn+2.861
actus+2.804
 subs+2.742
Neg logit contributions
 questions-2.139
years-2.078
ruits-1.870
 three-1.844
year-1.837
Token play
Token ID711
Feature activation+1.33e-01

Pos logit contributions
years+0.101
 questions+0.100
ruits+0.094
year+0.092
nam+0.088
Neg logit contributions
ww-0.153
burn-0.146
lor-0.143
 subs-0.139
 tou-0.134
Token until
Token ID1566
Feature activation+2.40e+00

Pos logit contributions
ww+0.253
lor+0.237
burn+0.236
 subs+0.226
actus+0.217
Neg logit contributions
years-0.197
 questions-0.195
ruits-0.186
year-0.178
nam-0.177
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
lor+4.656
ww+4.533
ic+4.165
burn+4.135
actus+3.957
Neg logit contributions
years-3.610
 questions-3.464
nam-3.421
 three-3.414
ruits-3.265
Token sun
Token ID4252
Feature activation+3.88e+00

Pos logit contributions
ww+1.598
lor+1.546
burn+1.495
 subs+1.376
 tou+1.306
Neg logit contributions
 questions-2.168
years-2.110
ruits-2.020
year-1.978
 three-1.969
Token went
Token ID1816
Feature activation+2.06e+00

Pos logit contributions
ww+9.902
lor+9.432
rob+8.425
www+8.073
burn+8.059
Neg logit contributions
 three-4.239
 questions-3.857
 curiously-3.784
 anx-3.759
 if-3.622
Token down
Token ID866
Feature activation+1.73e+00

Pos logit contributions
lor+4.300
ww+4.249
burn+3.873
actus+3.769
 subs+3.748
Neg logit contributions
 questions-2.810
years-2.672
 three-2.513
year-2.474
nam-2.470
Token.
Token ID13
Feature activation+2.18e+00

Pos logit contributions
ww+3.911
lor+3.737
burn+3.648
actus+3.398
 subs+3.383
Neg logit contributions
 questions-2.120
years-2.042
year-1.933
 three-1.883
ruits-1.852
Token It
Token ID632
Feature activation+2.78e+00

Pos logit contributions
ww+4.828
burn+4.610
lor+4.586
 subs+4.379
rob+4.362
Neg logit contributions
years-2.545
 questions-2.484
ruits-2.403
 One-2.356
 three-2.329
Token was
Token ID373
Feature activation+9.32e-01

Pos logit contributions
ww+5.800
lor+5.590
burn+5.460
actus+4.991
rob+4.949
Neg logit contributions
 three-3.632
years-3.601
 questions-3.590
 curiously-3.447
 especially-3.384
Token in
Token ID287
Feature activation-4.22e-02

Pos logit contributions
years+0.392
 questions+0.375
ruits+0.365
year+0.351
nam+0.347
Neg logit contributions
ww-0.509
burn-0.474
lor-0.473
 subs-0.453
 tou-0.441
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
years+0.070
 questions+0.069
ruits+0.067
year+0.065
nam+0.063
Neg logit contributions
ww-0.071
burn-0.067
lor-0.066
 subs-0.063
 tou-0.061
Token park
Token ID3952
Feature activation-5.14e-01

Pos logit contributions
ww+1.899
lor+1.842
burn+1.781
 subs+1.688
 chip+1.589
Neg logit contributions
 questions-2.228
years-2.146
ruits-2.046
year-1.998
nam-1.974
Token until
Token ID1566
Feature activation+1.45e+00

Pos logit contributions
years+0.707
 questions+0.686
ruits+0.659
year+0.640
nam+0.637
Neg logit contributions
ww-1.001
burn-0.958
lor-0.945
 subs-0.915
 chip-0.885
Token the
Token ID262
Feature activation+7.65e-01

Pos logit contributions
ww+2.505
lor+2.471
burn+2.307
 subs+2.184
actus+2.181
Neg logit contributions
years-2.418
 questions-2.407
ruits-2.255
nam-2.254
 three-2.223
Token sun
Token ID4252
Feature activation+3.85e+00

Pos logit contributions
ww+0.973
lor+0.914
burn+0.882
 subs+0.815
actus+0.761
Neg logit contributions
 questions-1.617
years-1.587
ruits-1.536
year-1.490
nam-1.478
Token came
Token ID1625
Feature activation+5.69e-01

Pos logit contributions
ww+9.731
lor+9.264
rob+8.202
burn+7.862
www+7.731
Neg logit contributions
 three-4.326
 questions-4.102
 liked-4.055
 curiously-3.837
 anx-3.790
Token out
Token ID503
Feature activation+4.21e-01

Pos logit contributions
ww+1.016
lor+0.952
burn+0.935
 subs+0.892
actus+0.864
Neg logit contributions
 questions-0.900
years-0.897
ruits-0.848
year-0.821
nam-0.813
Token and
Token ID290
Feature activation+1.66e+00

Pos logit contributions
ww+0.808
lor+0.759
burn+0.755
 subs+0.720
actus+0.695
Neg logit contributions
years-0.612
 questions-0.612
ruits-0.576
year-0.556
nam-0.546
Token the
Token ID262
Feature activation+7.82e-01

Pos logit contributions
ww+3.088
lor+3.054
burn+2.737
actus+2.660
 subs+2.641
Neg logit contributions
 questions-2.626
years-2.579
nam-2.538
ruits-2.415
 three-2.403
Token fog
Token ID19558
Feature activation+2.65e+00

Pos logit contributions
ww+1.077
lor+1.007
burn+0.977
 subs+0.908
actus+0.851
Neg logit contributions
 questions-1.573
years-1.545
ruits-1.488
year-1.441
nam-1.439
Token slide
Token ID10649
Feature activation+7.72e-01

Pos logit contributions
years+0.135
 questions+0.134
ruits+0.127
year+0.124
nam+0.120
Neg logit contributions
ww-0.135
burn-0.126
lor-0.122
 subs-0.118
 tou-0.114
Token all
Token ID477
Feature activation+1.37e+00

Pos logit contributions
ww+1.578
lor+1.511
burn+1.467
 subs+1.413
 tou+1.370
Neg logit contributions
 questions-1.039
years-1.024
ruits-0.968
year-0.926
nam-0.915
Token day
Token ID1110
Feature activation+1.03e+00

Pos logit contributions
lor+2.358
ww+2.343
burn+2.178
 subs+2.071
actus+2.029
Neg logit contributions
 questions-2.272
years-2.258
ruits-2.156
year-2.134
 three-2.047
Token until
Token ID1566
Feature activation+1.67e+00

Pos logit contributions
ww+2.060
lor+1.963
burn+1.945
 subs+1.837
 tou+1.799
Neg logit contributions
 questions-1.439
years-1.425
ruits-1.362
nam-1.292
year-1.284
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
lor+2.911
ww+2.875
burn+2.665
actus+2.574
 subs+2.542
Neg logit contributions
 questions-2.701
years-2.667
nam-2.541
 three-2.524
ruits-2.522
Token sun
Token ID4252
Feature activation+3.81e+00

Pos logit contributions
lor+1.815
ww+1.813
burn+1.674
 subs+1.543
actus+1.482
Neg logit contributions
 questions-2.803
years-2.639
ruits-2.632
 three-2.564
year-2.498
Token went
Token ID1816
Feature activation+1.90e+00

Pos logit contributions
ww+9.278
lor+9.256
rob+7.924
actus+7.760
burn+7.735
Neg logit contributions
 three-4.440
 questions-3.904
 liked-3.879
 counted-3.615
adies-3.606
Token down
Token ID866
Feature activation+1.15e+00

Pos logit contributions
lor+3.904
ww+3.833
burn+3.476
 subs+3.414
actus+3.359
Neg logit contributions
 questions-2.656
years-2.561
nam-2.410
year-2.365
ruits-2.351
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
ww+2.370
lor+2.251
burn+2.208
 subs+2.084
actus+2.045
Neg logit contributions
 questions-1.546
years-1.516
year-1.421
ruits-1.409
 three-1.342
Token They
Token ID1119
Feature activation-2.04e-01

Pos logit contributions
ww+2.769
lor+2.644
burn+2.593
 subs+2.489
actus+2.429
Neg logit contributions
 questions-1.557
years-1.504
ruits-1.454
nam-1.365
year-1.359
Token had
Token ID550
Feature activation+7.32e-01

Pos logit contributions
 questions+0.307
years+0.304
ruits+0.291
year+0.278
nam+0.270
Neg logit contributions
ww-0.381
burn-0.359
lor-0.347
 subs-0.342
 tou-0.329
Token nice
Token ID3621
Feature activation+4.15e-01

Pos logit contributions
ruits+1.611
 questions+1.611
years+1.506
adies+1.429
 whether+1.400
Neg logit contributions
ww-2.112
 subs-1.832
burn-1.809
 tou-1.797
lor-1.794
Token!"
Token ID2474
Feature activation+1.53e+00

Pos logit contributions
ww+0.754
lor+0.705
burn+0.703
 subs+0.670
actus+0.643
Neg logit contributions
 questions-0.640
years-0.640
ruits-0.606
year-0.584
nam-0.575
Token The
Token ID383
Feature activation+5.44e-01

Pos logit contributions
ww+3.539
lor+3.335
burn+3.309
 subs+3.202
 tou+3.073
Neg logit contributions
years-1.725
 questions-1.707
ruits-1.572
nam-1.512
year-1.490
Token lady
Token ID10846
Feature activation+5.01e-01

Pos logit contributions
ww+0.578
lor+0.516
burn+0.513
 subs+0.465
actus+0.429
Neg logit contributions
 questions-1.269
years-1.263
ruits-1.221
year-1.193
nam-1.181
Token smiled
Token ID13541
Feature activation+5.67e-01

Pos logit contributions
ww+1.006
lor+0.947
burn+0.943
 subs+0.900
actus+0.870
Neg logit contributions
years-0.685
 questions-0.681
ruits-0.638
year-0.618
nam-0.603
Token and
Token ID290
Feature activation+3.78e+00

Pos logit contributions
ww+1.220
lor+1.151
burn+1.143
 subs+1.096
actus+1.062
Neg logit contributions
 questions-0.699
years-0.698
ruits-0.643
year-0.624
nam-0.607
Token said
Token ID531
Feature activation+1.74e-01

Pos logit contributions
lor+9.008
ww+8.371
burn+8.232
ic+7.761
actus+7.733
Neg logit contributions
years-4.708
 three-4.667
 if-4.544
 questions-4.486
nam-4.454
Token,
Token ID11
Feature activation+6.05e-01

Pos logit contributions
ww+0.364
lor+0.342
burn+0.340
 subs+0.328
 tou+0.315
Neg logit contributions
years-0.225
 questions-0.224
ruits-0.211
year-0.200
nam-0.198
Token "
Token ID366
Feature activation+1.95e+00

Pos logit contributions
ww+1.225
lor+1.151
burn+1.147
 subs+1.097
actus+1.063
Neg logit contributions
years-0.814
 questions-0.813
ruits-0.761
year-0.734
nam-0.724
TokenI
Token ID40
Feature activation+2.59e-02

Pos logit contributions
ww+3.014
burn+2.917
lor+2.896
 subs+2.761
actus+2.707
Neg logit contributions
years-3.644
year-3.389
 questions-3.367
ruits-3.261
nam-3.249
Token'm
Token ID1101
Feature activation+1.03e+00

Pos logit contributions
ww+0.044
lor+0.041
burn+0.041
 subs+0.039
actus+0.037
Neg logit contributions
years-0.044
 questions-0.043
ruits-0.041
nam-0.040
year-0.040
Token that
Token ID326
Feature activation+1.10e+00

Pos logit contributions
ww+1.827
lor+1.751
burn+1.740
 subs+1.651
 tou+1.614
Neg logit contributions
 questions-1.279
years-1.256
ruits-1.190
year-1.130
nam-1.106
Token they
Token ID484
Feature activation+6.27e-01

Pos logit contributions
ww+1.797
lor+1.743
burn+1.638
 subs+1.563
actus+1.551
Neg logit contributions
 questions-1.933
years-1.913
ruits-1.842
nam-1.814
 three-1.747
Token played
Token ID2826
Feature activation+1.76e-01

Pos logit contributions
ww+1.194
lor+1.132
burn+1.119
 subs+1.063
actus+1.028
Neg logit contributions
years-0.924
 questions-0.915
ruits-0.866
year-0.834
nam-0.833
Token until
Token ID1566
Feature activation+1.80e+00

Pos logit contributions
ww+0.329
lor+0.307
burn+0.307
 subs+0.292
actus+0.281
Neg logit contributions
years-0.265
 questions-0.263
ruits-0.250
year-0.241
nam-0.237
Token the
Token ID262
Feature activation+1.78e+00

Pos logit contributions
lor+3.288
ww+3.223
burn+2.973
actus+2.905
 subs+2.867
Neg logit contributions
 questions-2.814
years-2.785
nam-2.712
 three-2.680
ruits-2.622
Token sun
Token ID4252
Feature activation+3.78e+00

Pos logit contributions
lor+2.567
ww+2.485
burn+2.366
 subs+2.147
rob+2.055
Neg logit contributions
 questions-3.562
years-3.362
ruits-3.282
 three-3.265
nam-3.159
Token went
Token ID1816
Feature activation+2.14e+00

Pos logit contributions
ww+9.226
lor+9.205
rob+7.938
actus+7.835
burn+7.690
Neg logit contributions
 three-4.194
 questions-4.056
 want-3.901
 liked-3.897
ruits-3.711
Token down
Token ID866
Feature activation+1.18e+00

Pos logit contributions
lor+4.587
ww+4.460
burn+4.006
actus+3.975
 subs+3.961
Neg logit contributions
 questions-2.888
years-2.690
nam-2.602
 three-2.567
year-2.501
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
ww+2.479
lor+2.360
burn+2.319
 subs+2.191
actus+2.159
Neg logit contributions
 questions-1.553
years-1.510
year-1.407
ruits-1.396
 three-1.345
Token From
Token ID3574
Feature activation+1.25e+00

Pos logit contributions
ww+2.180
lor+2.044
burn+2.040
 subs+1.944
actus+1.896
Neg logit contributions
 questions-1.558
years-1.555
ruits-1.488
year-1.434
nam-1.424
Token that
Token ID326
Feature activation+5.24e-01

Pos logit contributions
ww+2.591
lor+2.547
burn+2.491
 subs+2.426
 tou+2.285
Neg logit contributions
 questions-1.589
years-1.575
ruits-1.443
nam-1.405
year-1.394
Token they
Token ID484
Feature activation+4.95e-01

Pos logit contributions
ww+3.107
lor+3.017
burn+2.860
 subs+2.761
actus+2.755
Neg logit contributions
 questions-2.291
years-2.225
ruits-2.097
nam-2.084
year-2.009
Token all
Token ID477
Feature activation+3.71e-01

Pos logit contributions
ww+1.062
lor+1.006
burn+0.998
 subs+0.959
actus+0.929
Neg logit contributions
years-0.609
 questions-0.604
ruits-0.563
year-0.541
nam-0.532
Token played
Token ID2826
Feature activation-6.73e-04

Pos logit contributions
ww+0.716
lor+0.672
burn+0.670
 subs+0.640
actus+0.616
Neg logit contributions
years-0.533
 questions-0.532
ruits-0.502
year-0.483
nam-0.476
Token until
Token ID1566
Feature activation+1.88e+00

Pos logit contributions
years+0.001
 questions+0.001
ruits+0.001
year+0.001
nam+0.001
Neg logit contributions
ww-0.001
lor-0.001
burn-0.001
 subs-0.001
actus-0.001
Token the
Token ID262
Feature activation+1.48e+00

Pos logit contributions
lor+3.378
ww+3.340
burn+3.048
actus+3.020
 subs+2.949
Neg logit contributions
years-2.983
 questions-2.962
nam-2.885
 three-2.813
year-2.753
Token sun
Token ID4252
Feature activation+3.75e+00

Pos logit contributions
ww+1.971
lor+1.958
burn+1.831
 subs+1.689
actus+1.573
Neg logit contributions
 questions-3.054
years-2.920
ruits-2.833
 three-2.789
year-2.729
Token went
Token ID1816
Feature activation+2.29e+00

Pos logit contributions
ww+9.136
lor+8.763
 hawk+7.690
rob+7.677
actus+7.540
Neg logit contributions
 three-4.086
 questions-3.932
 liked-3.753
 want-3.689
 anx-3.657
Token down
Token ID866
Feature activation+1.64e+00

Pos logit contributions
lor+4.891
ww+4.791
rob+4.243
actus+4.240
burn+4.233
Neg logit contributions
 questions-3.045
years-2.861
nam-2.728
 three-2.719
year-2.686
Token.
Token ID13
Feature activation+9.54e-01

Pos logit contributions
ww+3.550
lor+3.380
burn+3.287
 subs+3.117
actus+3.098
Neg logit contributions
 questions-2.090
years-2.010
year-1.908
 three-1.842
ruits-1.842
Token They
Token ID1119
Feature activation+5.06e-01

Pos logit contributions
ww+1.940
burn+1.837
lor+1.834
 subs+1.751
actus+1.698
Neg logit contributions
years-1.270
 questions-1.265
ruits-1.214
year-1.169
nam-1.131
Token were
Token ID547
Feature activation+1.20e+00

Pos logit contributions
ww+1.060
lor+1.004
burn+0.996
 subs+0.954
actus+0.923
Neg logit contributions
years-0.650
 questions-0.645
ruits-0.604
year-0.582
nam-0.573
Token laughing
Token ID14376
Feature activation+1.93e-01

Pos logit contributions
ww+0.774
lor+0.727
burn+0.723
 subs+0.691
actus+0.665
Neg logit contributions
years-0.581
 questions-0.579
ruits-0.547
year-0.526
nam-0.519
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
ww+0.387
lor+0.363
burn+0.363
 subs+0.348
actus+0.335
Neg logit contributions
years-0.263
 questions-0.261
ruits-0.247
year-0.236
nam-0.233
Token playing
Token ID2712
Feature activation-2.81e-02

Pos logit contributions
ww+2.810
lor+2.766
burn+2.603
 subs+2.499
actus+2.494
Neg logit contributions
years-1.769
 questions-1.730
nam-1.591
ruits-1.574
year-1.531
Token until
Token ID1566
Feature activation+1.39e+00

Pos logit contributions
years+0.042
 questions+0.042
ruits+0.040
year+0.038
nam+0.038
Neg logit contributions
ww-0.052
lor-0.049
burn-0.048
 subs-0.046
actus-0.045
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
ww+2.445
lor+2.406
burn+2.300
actus+2.167
 subs+2.135
Neg logit contributions
 questions-2.253
years-2.229
ruits-2.094
 three-2.074
year-2.055
Token sun
Token ID4252
Feature activation+3.74e+00

Pos logit contributions
ww+1.338
lor+1.259
burn+1.217
 subs+1.119
actus+1.047
Neg logit contributions
 questions-2.079
years-2.017
ruits-1.955
year-1.893
nam-1.880
Token set
Token ID900
Feature activation+2.15e+00

Pos logit contributions
ww+8.911
lor+8.447
rob+7.440
burn+7.416
 hawk+7.277
Neg logit contributions
 three-4.624
 questions-4.444
 anx-4.062
 curiously-4.052
year-3.993
Token and
Token ID290
Feature activation+1.67e+00

Pos logit contributions
ww+4.704
lor+4.509
burn+4.289
actus+4.235
rob+4.136
Neg logit contributions
 questions-2.739
years-2.510
ruits-2.467
 three-2.450
year-2.429
Token it
Token ID340
Feature activation+2.70e+00

Pos logit contributions
ww+3.261
lor+3.209
burn+2.992
 subs+2.887
actus+2.879
Neg logit contributions
 questions-2.430
years-2.377
nam-2.259
 three-2.238
ruits-2.228
Token was
Token ID373
Feature activation+9.09e-01

Pos logit contributions
ww+5.485
lor+5.131
burn+5.023
 tou+4.612
actus+4.593
Neg logit contributions
 questions-3.914
 three-3.685
years-3.638
year-3.484
 anx-3.439
Token time
Token ID640
Feature activation-9.14e-02

Pos logit contributions
ww+1.691
lor+1.611
burn+1.553
 subs+1.484
actus+1.474
Neg logit contributions
 questions-1.368
years-1.353
ruits-1.288
year-1.248
nam-1.230
Token you
Token ID345
Feature activation-1.66e-01

Pos logit contributions
 questions+0.520
years+0.518
ruits+0.505
year+0.495
nam+0.477
Neg logit contributions
ww-0.313
burn-0.274
lor-0.273
 subs-0.255
 tou-0.241
Token okay
Token ID8788
Feature activation-4.96e-01

Pos logit contributions
 questions+0.171
years+0.171
ruits+0.161
year+0.150
nam+0.147
Neg logit contributions
ww-0.390
burn-0.376
lor-0.366
 subs-0.357
 tou-0.343
Token,
Token ID11
Feature activation+3.84e-01

Pos logit contributions
years+0.729
 questions+0.695
ruits+0.666
year+0.666
nam+0.643
Neg logit contributions
ww-0.981
burn-0.937
lor-0.890
 subs-0.872
 chip-0.845
Token my
Token ID616
Feature activation+4.57e-02

Pos logit contributions
ww+0.711
lor+0.667
burn+0.664
 subs+0.633
actus+0.608
Neg logit contributions
years-0.586
 questions-0.586
ruits-0.554
year-0.533
nam-0.527
Token d
Token ID288
Feature activation+5.26e-01

Pos logit contributions
ww+0.086
burn+0.080
lor+0.080
 subs+0.076
actus+0.074
Neg logit contributions
years-0.070
 questions-0.068
ruits-0.065
year-0.063
nam-0.063
Tokenarl
Token ID7063
Feature activation+3.73e+00

Pos logit contributions
ww+0.729
lor+0.667
burn+0.662
 subs+0.624
 tou+0.588
Neg logit contributions
years-1.048
 questions-1.045
ruits-1.007
year-0.978
nam-0.957
Tokenings
Token ID654
Feature activation+1.23e+00

Pos logit contributions
ww+5.214
lor+4.877
burn+4.534
 subs+4.474
 tou+4.347
Neg logit contributions
ruits-7.751
 questions-7.676
 ago-7.133
 if-6.944
 want-6.923
Token?"
Token ID1701
Feature activation-1.34e-01

Pos logit contributions
ww+2.704
lor+2.635
burn+2.589
 subs+2.521
 tou+2.426
Neg logit contributions
 questions-1.450
years-1.370
ruits-1.321
year-1.254
adies-1.184
Token she
Token ID673
Feature activation+2.16e-01

Pos logit contributions
years+0.260
 questions+0.256
ruits+0.249
year+0.242
nam+0.236
Neg logit contributions
ww-0.191
burn-0.174
lor-0.171
 subs-0.165
 tou-0.159
Token asked
Token ID1965
Feature activation-1.00e+00

Pos logit contributions
ww+0.436
burn+0.410
lor+0.410
 subs+0.393
 tou+0.378
Neg logit contributions
years-0.290
 questions-0.288
ruits-0.273
year-0.261
nam-0.255
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
years+1.778
 questions+1.717
ruits+1.682
year+1.646
nam+1.581
Neg logit contributions
burn-1.491
ww-1.472
lor-1.399
 subs-1.377
 chip-1.348
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
ww+1.079
lor+1.007
burn+1.005
 subs+0.959
actus+0.928
Neg logit contributions
 questions-0.794
years-0.792
ruits-0.744
year-0.719
nam-0.703
Token The
Token ID383
Feature activation-9.56e-01

Pos logit contributions
ww+5.295
burn+5.124
lor+4.735
 subs+4.697
actus+4.662
Neg logit contributions
years-3.320
 questions-3.305
year-3.144
ruits-3.124
 One-3.006
Token little
Token ID1310
Feature activation-7.69e-02

Pos logit contributions
years+1.517
 questions+1.405
ruits+1.383
nam+1.345
year+1.331
Neg logit contributions
ww-1.788
burn-1.624
lor-1.610
actus-1.581
 chip-1.579
Token girl
Token ID2576
Feature activation+1.69e+00

Pos logit contributions
years+0.146
 questions+0.144
ruits+0.138
year+0.134
nam+0.133
Neg logit contributions
ww-0.116
burn-0.106
lor-0.104
 subs-0.099
 tou-0.096
Token smiled
Token ID13541
Feature activation+5.00e-01

Pos logit contributions
ww+3.628
burn+3.461
lor+3.398
 subs+3.236
actus+3.143
Neg logit contributions
years-2.084
 questions-2.076
 three-1.860
ruits-1.849
year-1.831
Token and
Token ID290
Feature activation+3.72e+00

Pos logit contributions
ww+1.034
lor+0.974
burn+0.970
 subs+0.929
actus+0.900
Neg logit contributions
years-0.653
 questions-0.653
ruits-0.607
year-0.587
nam-0.574
Token went
Token ID1816
Feature activation+2.46e+00

Pos logit contributions
lor+8.526
burn+8.184
ww+8.092
actus+7.779
ic+7.729
Neg logit contributions
years-4.737
 three-4.627
year-4.421
 questions-4.413
 curious-4.273
Token on
Token ID319
Feature activation+6.89e-01

Pos logit contributions
ww+4.418
lor+4.019
burn+3.926
 subs+3.841
actus+3.750
Neg logit contributions
 questions-4.353
years-4.084
year-3.867
 three-3.838
ruits-3.764
Token her
Token ID607
Feature activation+7.91e-01

Pos logit contributions
ww+1.387
lor+1.302
burn+1.300
 subs+1.236
actus+1.212
Neg logit contributions
 questions-0.962
years-0.956
ruits-0.886
year-0.860
nam-0.847
Token way
Token ID835
Feature activation+7.85e-01

Pos logit contributions
ww+1.038
lor+0.948
burn+0.941
 subs+0.868
actus+0.826
Neg logit contributions
 questions-1.650
years-1.619
ruits-1.554
nam-1.517
year-1.496
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
ww+1.597
lor+1.490
burn+1.482
 subs+1.420
actus+1.369
Neg logit contributions
 questions-1.068
years-1.062
ruits-1.000
year-0.950
nam-0.945
Token around
Token ID1088
Feature activation-5.05e-01

Pos logit contributions
ww+3.130
lor+2.976
burn+2.926
 subs+2.815
actus+2.710
Neg logit contributions
 questions-1.845
years-1.742
ruits-1.645
 three-1.600
year-1.592
Token the
Token ID262
Feature activation+9.94e-01

Pos logit contributions
years+0.803
 questions+0.791
ruits+0.781
year+0.737
nam+0.730
Neg logit contributions
ww-0.869
lor-0.813
burn-0.811
 subs-0.784
 chip-0.752
Token garden
Token ID11376
Feature activation+5.90e-01

Pos logit contributions
ww+1.541
burn+1.433
lor+1.433
 subs+1.363
 tou+1.240
Neg logit contributions
 questions-1.839
years-1.809
ruits-1.740
nam-1.671
year-1.663
Token until
Token ID1566
Feature activation+1.62e+00

Pos logit contributions
ww+1.184
lor+1.107
burn+1.104
 subs+1.055
actus+1.020
Neg logit contributions
 questions-0.810
years-0.808
ruits-0.760
year-0.727
nam-0.716
Token the
Token ID262
Feature activation+8.17e-01

Pos logit contributions
ww+2.812
lor+2.758
burn+2.611
 subs+2.461
actus+2.441
Neg logit contributions
 questions-2.665
years-2.663
ruits-2.535
nam-2.530
 three-2.509
Token sun
Token ID4252
Feature activation+3.71e+00

Pos logit contributions
ww+1.068
lor+1.001
burn+0.975
 subs+0.907
 tou+0.838
Neg logit contributions
 questions-1.699
years-1.672
ruits-1.613
nam-1.566
year-1.558
Token went
Token ID1816
Feature activation+2.08e+00

Pos logit contributions
ww+9.179
lor+8.662
rob+7.786
burn+7.630
 masse+7.339
Neg logit contributions
 three-4.301
 liked-4.048
 questions-3.967
ruits-3.850
 curiously-3.842
Token down
Token ID866
Feature activation+9.44e-01

Pos logit contributions
ww+4.340
lor+4.313
burn+3.907
 subs+3.820
rob+3.794
Neg logit contributions
 questions-2.850
years-2.718
nam-2.575
year-2.529
ruits-2.521
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
ww+1.898
lor+1.779
burn+1.768
 subs+1.670
actus+1.627
Neg logit contributions
 questions-1.317
years-1.305
ruits-1.220
year-1.205
nam-1.151
Token They
Token ID1119
Feature activation+7.37e-01

Pos logit contributions
ww+2.624
burn+2.498
lor+2.457
 subs+2.343
actus+2.264
Neg logit contributions
 questions-1.993
years-1.989
ruits-1.930
year-1.841
nam-1.782
Token became
Token ID2627
Feature activation+1.23e+00

Pos logit contributions
ww+1.538
lor+1.461
burn+1.446
 subs+1.380
actus+1.335
Neg logit contributions
years-0.958
 questions-0.942
ruits-0.883
year-0.853
nam-0.846
Token flew
Token ID13112
Feature activation+1.36e+00

Pos logit contributions
ww+4.430
lor+4.303
burn+4.182
actus+3.904
 tou+3.754
Neg logit contributions
 questions-3.409
years-3.251
year-3.148
ruits-2.968
 three-2.957
Token by
Token ID416
Feature activation-5.14e-01

Pos logit contributions
ww+2.717
lor+2.619
burn+2.565
 subs+2.433
actus+2.390
Neg logit contributions
 questions-1.936
years-1.858
ruits-1.721
year-1.704
 three-1.680
Token and
Token ID290
Feature activation+1.10e+00

Pos logit contributions
years+0.751
 questions+0.732
ruits+0.714
nam+0.684
year+0.681
Neg logit contributions
ww-0.984
burn-0.920
lor-0.910
 subs-0.884
 chip-0.841
Token soon
Token ID2582
Feature activation+2.75e+00

Pos logit contributions
ww+2.092
lor+2.006
burn+1.964
 subs+1.847
actus+1.833
Neg logit contributions
years-1.636
 questions-1.630
ruits-1.506
nam-1.479
year-1.472
Token the
Token ID262
Feature activation+6.35e-01

Pos logit contributions
ww+6.161
burn+6.029
lor+5.989
actus+5.511
rob+5.459
Neg logit contributions
 questions-3.361
years-3.051
 if-3.020
 three-3.017
 ten-2.988
Token sun
Token ID4252
Feature activation+3.71e+00

Pos logit contributions
ww+0.862
lor+0.804
burn+0.785
 subs+0.734
 tou+0.686
Neg logit contributions
 questions-1.293
years-1.283
ruits-1.227
year-1.191
nam-1.186
Token went
Token ID1816
Feature activation+2.30e+00

Pos logit contributions
ww+8.848
lor+8.238
rob+7.376
burn+7.275
actus+7.132
Neg logit contributions
 three-4.542
 questions-4.359
 liked-4.203
year-4.165
 curiously-4.123
Token down
Token ID866
Feature activation+1.16e+00

Pos logit contributions
lor+4.850
ww+4.836
burn+4.420
actus+4.243
 subs+4.227
Neg logit contributions
 questions-3.085
years-2.948
nam-2.736
year-2.705
 three-2.679
Token and
Token ID290
Feature activation+1.90e+00

Pos logit contributions
ww+2.389
lor+2.244
burn+2.232
 subs+2.085
actus+2.048
Neg logit contributions
 questions-1.590
years-1.564
year-1.449
ruits-1.440
adies-1.367
Token it
Token ID340
Feature activation+3.18e+00

Pos logit contributions
ww+3.864
lor+3.838
burn+3.533
actus+3.429
 subs+3.385
Neg logit contributions
 questions-2.634
years-2.563
nam-2.444
 three-2.379
ruits-2.289
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
ww+6.912
burn+6.454
lor+6.410
 chip+5.736
 Cathy+5.735
Neg logit contributions
 questions-4.158
 three-3.998
years-3.986
 liked-3.848
 especially-3.799
Token the
Token ID262
Feature activation+9.02e-01

Pos logit contributions
ww+0.267
burn+0.250
lor+0.248
 subs+0.237
 tou+0.227
Neg logit contributions
years-0.239
 questions-0.239
ruits-0.228
year-0.219
nam-0.215
Token k
Token ID479
Feature activation+1.63e+00

Pos logit contributions
ww+0.940
lor+0.869
burn+0.810
 subs+0.747
actus+0.699
Neg logit contributions
 questions-2.101
years-2.077
ruits-2.016
nam-1.979
year-1.954
Tokenite
Token ID578
Feature activation+8.72e-01

Pos logit contributions
ww+2.100
lor+1.976
burn+1.840
 subs+1.826
 tou+1.653
Neg logit contributions
years-3.483
 questions-3.279
ruits-3.242
nam-3.226
year-3.212
Token until
Token ID1566
Feature activation+1.62e+00

Pos logit contributions
ww+1.729
lor+1.631
burn+1.594
 subs+1.534
actus+1.490
Neg logit contributions
 questions-1.219
years-1.207
ruits-1.134
nam-1.082
year-1.082
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
lor+2.919
ww+2.881
burn+2.656
actus+2.566
 subs+2.534
Neg logit contributions
years-2.538
 questions-2.513
nam-2.409
ruits-2.375
 three-2.348
Token sun
Token ID4252
Feature activation+3.71e+00

Pos logit contributions
ww+2.036
lor+2.002
burn+1.831
 subs+1.741
actus+1.691
Neg logit contributions
 questions-2.660
years-2.570
ruits-2.501
 three-2.481
nam-2.433
Token went
Token ID1816
Feature activation+2.11e+00

Pos logit contributions
ww+9.156
lor+8.719
rob+7.745
actus+7.593
 masse+7.468
Neg logit contributions
 three-4.084
 liked-3.735
 questions-3.693
 want-3.667
 if-3.620
Token down
Token ID866
Feature activation+1.36e+00

Pos logit contributions
lor+4.428
ww+4.355
burn+3.926
actus+3.885
 subs+3.854
Neg logit contributions
 questions-2.846
years-2.745
nam-2.606
year-2.540
 three-2.537
Token.
Token ID13
Feature activation+9.79e-01

Pos logit contributions
ww+2.884
lor+2.741
burn+2.687
 subs+2.514
actus+2.510
Neg logit contributions
 questions-1.792
years-1.771
year-1.657
ruits-1.618
 three-1.582
Token The
Token ID383
Feature activation+5.46e-01

Pos logit contributions
ww+1.893
lor+1.795
burn+1.786
 subs+1.698
actus+1.634
Neg logit contributions
years-1.404
 questions-1.387
ruits-1.334
year-1.289
nam-1.253
Token end
Token ID886
Feature activation+2.25e+00

Pos logit contributions
ww+1.197
lor+1.140
burn+1.127
 subs+1.085
actus+1.050
Neg logit contributions
 questions-0.645
years-0.642
ruits-0.598
year-0.568
nam-0.564
Token."
Token ID526
Feature activation+9.41e-01

Pos logit contributions
ww+1.246
lor+1.171
burn+1.162
 subs+1.118
 tou+1.072
Neg logit contributions
 questions-1.038
years-1.026
ruits-0.977
year-0.940
nam-0.913
Token The
Token ID383
Feature activation+1.10e+00

Pos logit contributions
ww+1.909
lor+1.790
burn+1.782
 subs+1.714
 tou+1.644
Neg logit contributions
years-1.289
 questions-1.286
ruits-1.215
year-1.154
nam-1.147
Token sw
Token ID1509
Feature activation+1.34e+00

Pos logit contributions
ww+1.653
lor+1.582
burn+1.550
 subs+1.450
actus+1.333
Neg logit contributions
 questions-2.103
years-2.062
ruits-1.959
year-1.911
 three-1.894
Tokenan
Token ID272
Feature activation+6.92e-01

Pos logit contributions
ww+1.808
burn+1.661
lor+1.630
 subs+1.530
 chip+1.429
Neg logit contributions
 questions-2.734
years-2.728
ruits-2.615
year-2.534
adies-2.488
Token smiled
Token ID13541
Feature activation+3.41e-01

Pos logit contributions
ww+1.379
lor+1.297
burn+1.292
 subs+1.225
actus+1.182
Neg logit contributions
 questions-0.956
years-0.950
ruits-0.886
year-0.856
nam-0.840
Token and
Token ID290
Feature activation+3.71e+00

Pos logit contributions
ww+0.719
lor+0.678
burn+0.677
 subs+0.649
actus+0.627
Neg logit contributions
years-0.430
 questions-0.429
ruits-0.401
year-0.383
nam-0.377
Token sw
Token ID1509
Feature activation+9.42e-01

Pos logit contributions
lor+8.797
burn+8.050
ww+8.048
ic+7.633
actus+7.381
Neg logit contributions
 three-4.792
years-4.561
 questions-4.511
 liked-4.489
 curious-4.397
Tokenam
Token ID321
Feature activation+1.14e+00

Pos logit contributions
ww+1.991
burn+1.879
lor+1.874
 subs+1.795
 tou+1.736
Neg logit contributions
years-1.196
 questions-1.178
ruits-1.113
year-1.066
adies-1.013
Token away
Token ID1497
Feature activation+1.11e+00

Pos logit contributions
ww+1.862
lor+1.752
burn+1.742
 subs+1.617
actus+1.573
Neg logit contributions
 questions-2.058
years-2.039
year-1.921
ruits-1.903
 three-1.857
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
ww+2.303
lor+2.171
burn+2.139
 subs+1.995
actus+1.986
Neg logit contributions
 questions-1.510
years-1.496
ruits-1.368
year-1.363
nam-1.333
Token
Token ID220
Feature activation+9.88e-01

Pos logit contributions
ww+4.574
burn+4.437
lor+4.315
 subs+4.125
actus+3.962
Neg logit contributions
 questions-3.022
years-2.994
ruits-2.826
year-2.821
 Some-2.750
Token
Token ID198
Feature activation+7.90e-01

Pos logit contributions
ww+1.939
lor+1.825
burn+1.814
 subs+1.724
actus+1.626
Neg logit contributions
 questions-1.812
years-1.766
ruits-1.706
year-1.622
nam-1.611
Token
Token ID198
Feature activation+6.20e-01

Pos logit contributions
ww+1.332
burn+1.246
lor+1.234
 subs+1.183
 tou+1.112
Neg logit contributions
 questions-1.347
years-1.334
ruits-1.272
year-1.238
nam-1.202
TokenL
Token ID43
Feature activation-4.69e-01

Pos logit contributions
ww+1.006
burn+0.941
lor+0.939
 subs+0.892
actus+0.846
Neg logit contributions
years-1.091
 questions-1.088
ruits-1.036
year-1.011
nam-0.990
Tokenily
Token ID813
Feature activation+4.15e-01

Pos logit contributions
years+0.491
 questions+0.474
nam+0.446
ruits+0.421
year+0.405
Neg logit contributions
ww-1.097
lor-1.044
burn-1.029
 subs-0.989
actus-0.973
Token smiled
Token ID13541
Feature activation+5.33e-01

Pos logit contributions
ww+0.871
lor+0.821
burn+0.819
 subs+0.785
actus+0.757
Neg logit contributions
years-0.528
 questions-0.528
ruits-0.493
year-0.471
nam-0.463
Token and
Token ID290
Feature activation+3.69e+00

Pos logit contributions
ww+1.119
lor+1.055
burn+1.047
 subs+1.005
actus+0.968
Neg logit contributions
 questions-0.689
years-0.683
ruits-0.637
year-0.613
nam-0.601
Token hugged
Token ID41130
Feature activation+1.16e+00

Pos logit contributions
lor+9.102
ww+8.788
burn+8.577
ic+8.189
 subs+7.819
Neg logit contributions
 three-4.146
 questions-4.138
years-4.134
 curious-4.123
 want-4.053
Token them
Token ID606
Feature activation+5.40e-02

Pos logit contributions
ww+2.431
lor+2.324
burn+2.204
 subs+2.178
actus+2.082
Neg logit contributions
 questions-1.607
years-1.535
ruits-1.456
nam-1.378
year-1.353
Token.
Token ID13
Feature activation+9.54e-02

Pos logit contributions
ww+0.088
burn+0.083
lor+0.081
 subs+0.077
actus+0.074
Neg logit contributions
years-0.093
 questions-0.092
ruits-0.089
year-0.086
nam-0.084
Token She
Token ID1375
Feature activation+1.34e+00

Pos logit contributions
ww+0.175
burn+0.163
lor+0.163
 subs+0.155
actus+0.149
Neg logit contributions
years-0.147
 questions-0.145
ruits-0.138
year-0.134
nam-0.132
Token said
Token ID531
Feature activation-2.63e-02

Pos logit contributions
ww+2.515
lor+2.351
burn+2.338
 subs+2.190
 tou+2.079
Neg logit contributions
 questions-2.073
years-2.041
ruits-1.918
year-1.871
 three-1.833
Token ball
Token ID2613
Feature activation-1.40e-01

Pos logit contributions
ww+0.427
lor+0.385
burn+0.383
 subs+0.355
actus+0.332
Neg logit contributions
years-0.766
 questions-0.765
ruits-0.737
year-0.718
nam-0.712
Token all
Token ID477
Feature activation+3.95e-01

Pos logit contributions
years+0.203
 questions+0.201
ruits+0.191
year+0.184
nam+0.183
Neg logit contributions
ww-0.266
burn-0.250
lor-0.248
 subs-0.240
actus-0.229
Token day
Token ID1110
Feature activation+1.25e+00

Pos logit contributions
ww+0.594
lor+0.549
burn+0.546
 subs+0.514
actus+0.488
Neg logit contributions
years-0.740
 questions-0.740
ruits-0.708
year-0.687
nam-0.679
Token until
Token ID1566
Feature activation+1.86e+00

Pos logit contributions
ww+2.572
lor+2.448
burn+2.430
 subs+2.325
 tou+2.263
Neg logit contributions
 questions-1.686
years-1.664
ruits-1.584
nam-1.510
year-1.474
Token the
Token ID262
Feature activation+7.09e-01

Pos logit contributions
lor+3.388
ww+3.356
burn+3.127
actus+3.015
 subs+2.992
Neg logit contributions
 questions-2.896
years-2.873
nam-2.751
 three-2.715
ruits-2.657
Token sun
Token ID4252
Feature activation+3.69e+00

Pos logit contributions
ww+0.887
lor+0.831
burn+0.800
 subs+0.745
actus+0.699
Neg logit contributions
 questions-1.503
years-1.485
ruits-1.436
nam-1.390
year-1.388
Token went
Token ID1816
Feature activation+1.94e+00

Pos logit contributions
ww+9.015
lor+8.856
rob+7.892
burn+7.547
 Cathy+7.350
Neg logit contributions
 three-4.177
 questions-3.803
 liked-3.774
 want-3.657
ruits-3.624
Token down
Token ID866
Feature activation+1.39e+00

Pos logit contributions
ww+3.922
lor+3.915
burn+3.512
 subs+3.462
actus+3.413
Neg logit contributions
 questions-2.726
years-2.631
nam-2.484
year-2.441
 three-2.415
Token.
Token ID13
Feature activation+7.68e-01

Pos logit contributions
ww+2.983
lor+2.832
burn+2.775
 subs+2.610
actus+2.586
Neg logit contributions
 questions-1.821
years-1.773
year-1.670
ruits-1.623
 three-1.590
Token Max
Token ID5436
Feature activation+1.42e+00

Pos logit contributions
ww+1.529
lor+1.442
burn+1.434
 subs+1.368
actus+1.324
Neg logit contributions
 questions-1.074
years-1.064
ruits-1.009
nam-0.963
year-0.962
Token was
Token ID373
Feature activation+3.11e-01

Pos logit contributions
ww+2.759
lor+2.652
burn+2.609
 subs+2.457
 tou+2.371
Neg logit contributions
 questions-1.992
years-1.988
ruits-1.836
 three-1.788
nam-1.737
Tokeny
Token ID88
Feature activation+1.18e+00

Pos logit contributions
ww+2.196
lor+2.087
burn+2.032
 subs+1.966
 tou+1.869
Neg logit contributions
years-1.988
 questions-1.980
ruits-1.874
year-1.835
 three-1.778
Token flew
Token ID13112
Feature activation+1.34e+00

Pos logit contributions
ww+2.376
lor+2.221
burn+2.216
 subs+2.109
 tou+2.040
Neg logit contributions
 questions-1.615
years-1.577
ruits-1.478
year-1.424
 three-1.419
Token away
Token ID1497
Feature activation+1.14e+00

Pos logit contributions
ww+2.301
lor+2.185
burn+2.077
 subs+1.960
actus+1.944
Neg logit contributions
 questions-2.334
years-2.303
year-2.155
ruits-2.119
 three-2.096
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
ww+2.522
lor+2.391
burn+2.343
 subs+2.197
actus+2.169
Neg logit contributions
years-1.394
 questions-1.392
year-1.245
ruits-1.243
nam-1.214
Token happy
Token ID3772
Feature activation+1.98e+00

Pos logit contributions
ww+2.831
lor+2.676
burn+2.565
 subs+2.472
actus+2.424
Neg logit contributions
years-1.684
 questions-1.664
nam-1.558
ruits-1.540
year-1.495
Token and
Token ID290
Feature activation+3.69e+00

Pos logit contributions
ww+4.614
lor+4.368
burn+4.293
 subs+4.044
rob+4.023
Neg logit contributions
 questions-2.352
years-2.198
 three-2.052
ruits-1.978
 if-1.973
Token clean
Token ID3424
Feature activation+1.93e+00

Pos logit contributions
lor+9.676
ww+9.142
burn+9.138
ic+8.554
rob+8.524
Neg logit contributions
years-3.968
 curious-3.649
nam-3.646
year-3.573
 want-3.493
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
ww+4.266
lor+3.942
burn+3.847
 subs+3.761
 tou+3.710
Neg logit contributions
 questions-2.531
years-2.353
ruits-2.164
year-2.137
 three-2.135
Token<|endoftext|>
Token ID50256
Feature activation+1.28e+00

Pos logit contributions
ww+4.390
burn+4.299
lor+4.098
 subs+4.043
 pel+3.834
Neg logit contributions
years-3.032
 questions-3.004
year-2.883
ruits-2.879
 three-2.745
TokenOnce
Token ID7454
Feature activation+8.68e-03

Pos logit contributions
ww+2.349
burn+2.175
lor+2.109
 subs+2.053
 tou+1.947
Neg logit contributions
 questions-2.036
years-1.986
ruits-1.913
year-1.862
 three-1.810
Token upon
Token ID2402
Feature activation-5.43e-01

Pos logit contributions
ww+0.024
lor+0.023
burn+0.023
 subs+0.023
actus+0.022
Neg logit contributions
years-0.005
 questions-0.005
ruits-0.004
year-0.004
nam-0.003
Token and
Token ID290
Feature activation+2.81e+00

Pos logit contributions
ww+2.470
lor+2.353
burn+2.328
 subs+2.220
actus+2.152
Neg logit contributions
 questions-1.240
years-1.181
ruits-1.101
year-1.039
 three-1.028
Token free
Token ID1479
Feature activation+1.08e+00

Pos logit contributions
lor+6.548
ww+6.439
burn+6.149
actus+5.957
ic+5.723
Neg logit contributions
years-3.471
nam-3.394
 questions-3.379
 want-3.095
year-3.068
Token.
Token ID13
Feature activation+9.09e-01

Pos logit contributions
ww+2.202
lor+2.057
burn+2.027
 subs+1.922
actus+1.900
Neg logit contributions
 questions-1.519
years-1.490
ruits-1.389
year-1.356
 three-1.317
Token Lily
Token ID20037
Feature activation+1.44e+00

Pos logit contributions
ww+1.737
burn+1.631
lor+1.616
 subs+1.537
actus+1.485
Neg logit contributions
 questions-1.355
years-1.345
ruits-1.287
year-1.236
nam-1.212
Token smiled
Token ID13541
Feature activation+4.87e-01

Pos logit contributions
ww+3.005
burn+2.783
lor+2.778
 subs+2.637
 tou+2.529
Neg logit contributions
years-1.915
 questions-1.904
ruits-1.733
 three-1.665
year-1.659
Token and
Token ID290
Feature activation+3.69e+00

Pos logit contributions
ww+1.042
lor+0.983
burn+0.978
 subs+0.940
actus+0.908
Neg logit contributions
 questions-0.605
years-0.603
ruits-0.560
year-0.538
nam-0.528
Token laughed
Token ID13818
Feature activation-3.88e-01

Pos logit contributions
lor+8.969
ww+8.699
ic+8.453
burn+8.249
actus+8.039
Neg logit contributions
years-4.485
nam-4.405
 three-4.262
 curious-4.176
 liked-4.076
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
years+0.617
 questions+0.600
ruits+0.594
nam+0.556
year+0.550
Neg logit contributions
ww-0.682
burn-0.648
lor-0.638
 subs-0.614
 tou-0.586
Token She
Token ID1375
Feature activation+1.82e+00

Pos logit contributions
ww+0.975
lor+0.914
burn+0.912
 subs+0.869
actus+0.835
Neg logit contributions
 questions-0.746
years-0.745
ruits-0.706
year-0.676
nam-0.668
Token loved
Token ID6151
Feature activation+1.43e+00

Pos logit contributions
ww+4.103
burn+3.923
lor+3.882
 subs+3.642
ic+3.570
Neg logit contributions
years-2.112
 questions-2.062
 three-1.843
ruits-1.831
year-1.825
Token to
Token ID284
Feature activation+7.86e-01

Pos logit contributions
ww+2.888
lor+2.794
burn+2.638
 subs+2.556
actus+2.515
Neg logit contributions
 questions-2.139
years-2.054
ruits-1.872
nam-1.851
 three-1.810
Token they
Token ID484
Feature activation-1.04e-01

Pos logit contributions
ww+2.489
lor+2.345
burn+2.333
 subs+2.199
actus+2.187
Neg logit contributions
years-2.097
 questions-2.093
ruits-1.947
nam-1.920
year-1.917
Token played
Token ID2826
Feature activation-3.61e-01

Pos logit contributions
 questions+0.131
years+0.129
ruits+0.123
year+0.114
nam+0.114
Neg logit contributions
ww-0.219
burn-0.206
lor-0.205
 subs-0.199
 tou-0.192
Token together
Token ID1978
Feature activation+1.48e-01

Pos logit contributions
years+0.572
 questions+0.550
ruits+0.538
nam+0.513
year+0.511
Neg logit contributions
ww-0.643
lor-0.608
burn-0.598
 subs-0.573
actus-0.554
Token until
Token ID1566
Feature activation+2.26e+00

Pos logit contributions
ww+0.305
burn+0.288
lor+0.288
 subs+0.277
actus+0.266
Neg logit contributions
years-0.193
 questions-0.190
ruits-0.180
year-0.172
nam-0.170
Token the
Token ID262
Feature activation+9.99e-01

Pos logit contributions
lor+4.518
ww+4.411
burn+4.146
actus+4.038
ic+3.969
Neg logit contributions
years-3.193
 questions-3.170
 three-3.132
nam-3.061
year-2.965
Token sun
Token ID4252
Feature activation+3.69e+00

Pos logit contributions
ww+1.366
lor+1.305
burn+1.291
 subs+1.170
 tou+1.095
Neg logit contributions
 questions-2.005
years-1.965
ruits-1.891
year-1.847
nam-1.816
Token came
Token ID1625
Feature activation+5.79e-01

Pos logit contributions
ww+9.112
lor+8.517
burn+7.857
 Cathy+7.648
rob+7.633
Neg logit contributions
 three-4.020
 questions-3.791
 anx-3.721
 want-3.582
adies-3.578
Token out
Token ID503
Feature activation+9.27e-01

Pos logit contributions
ww+1.034
lor+0.968
burn+0.953
 subs+0.908
actus+0.878
Neg logit contributions
 questions-0.914
years-0.909
ruits-0.862
year-0.833
nam-0.824
Token and
Token ID290
Feature activation+1.90e+00

Pos logit contributions
ww+1.836
lor+1.735
burn+1.702
 subs+1.613
actus+1.590
Neg logit contributions
 questions-1.307
years-1.288
ruits-1.204
year-1.185
nam-1.139
Token the
Token ID262
Feature activation+7.88e-01

Pos logit contributions
ww+3.701
lor+3.682
burn+3.337
actus+3.250
 subs+3.203
Neg logit contributions
 questions-2.787
nam-2.784
years-2.743
 three-2.605
ruits-2.568
Token beach
Token ID10481
Feature activation+2.19e+00

Pos logit contributions
ww+1.157
lor+1.083
burn+1.078
 subs+0.996
 tou+0.931
Neg logit contributions
 questions-1.502
years-1.488
ruits-1.425
nam-1.379
year-1.379
Token surprise
Token ID5975
Feature activation-8.81e-01

Pos logit contributions
ww+0.035
burn+0.032
lor+0.032
 subs+0.030
actus+0.029
Neg logit contributions
years-0.043
 questions-0.042
ruits-0.040
year-0.040
nam-0.039
Token for
Token ID329
Feature activation-4.67e-01

Pos logit contributions
years+1.363
 questions+1.265
ruits+1.230
nam+1.219
year+1.206
Neg logit contributions
ww-1.654
burn-1.545
lor-1.504
 subs-1.426
 chip-1.401
Token you
Token ID345
Feature activation-3.00e+00

Pos logit contributions
years+0.951
 questions+0.941
ruits+0.912
year+0.904
nam+0.866
Neg logit contributions
ww-0.622
burn-0.554
lor-0.550
 subs-0.503
 tou-0.492
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
years+6.540
year+6.046
nam+5.787
ruits+5.568
 questions+5.504
Neg logit contributions
ww-4.062
lor-3.715
burn-3.648
 chip-3.394
actus-3.261
Token Do
Token ID2141
Feature activation-2.92e-01

Pos logit contributions
years+2.475
 questions+2.428
nam+2.228
ruits+2.224
year+2.156
Neg logit contributions
ww-2.420
lor-2.292
burn-2.176
 tou-2.074
 chip-2.037
Token you
Token ID345
Feature activation-3.89e+00

Pos logit contributions
years+0.627
 questions+0.616
year+0.597
ruits+0.594
nam+0.567
Neg logit contributions
ww-0.372
lor-0.330
burn-0.322
 subs-0.296
 tou-0.286
Token want
Token ID765
Feature activation-2.18e-01

Pos logit contributions
years+5.965
nam+5.704
 curiously+5.008
 questions+4.848
year+4.834
Neg logit contributions
ww-8.219
burn-7.620
 tou-7.225
 subs-6.856
lor-6.766
Token to
Token ID284
Feature activation+1.59e-01

Pos logit contributions
years+0.375
 questions+0.366
ruits+0.355
year+0.345
nam+0.342
Neg logit contributions
ww-0.364
burn-0.336
lor-0.333
 subs-0.313
 chip-0.302
Token know
Token ID760
Feature activation-1.67e+00

Pos logit contributions
ww+0.322
burn+0.305
lor+0.303
 subs+0.290
 tou+0.281
Neg logit contributions
years-0.216
 questions-0.213
ruits-0.201
year-0.195
nam-0.190
Token what
Token ID644
Feature activation+3.24e-01

Pos logit contributions
years+2.659
year+2.396
 questions+2.380
ruits+2.319
nam+2.181
Neg logit contributions
ww-3.104
burn-2.968
lor-2.886
 chip-2.717
 tou-2.638
Token it
Token ID340
Feature activation+8.23e-02

Pos logit contributions
ww+0.566
lor+0.527
burn+0.526
 subs+0.500
actus+0.479
Neg logit contributions
years-0.525
 questions-0.523
ruits-0.497
year-0.480
nam-0.474
Token has
Token ID468
Feature activation-7.06e-01

Pos logit contributions
ww+0.346
lor+0.318
burn+0.316
 subs+0.298
actus+0.282
Neg logit contributions
years-0.464
 questions-0.462
ruits-0.443
year-0.431
nam-0.427
Token many
Token ID867
Feature activation-2.00e+00

Pos logit contributions
years+0.899
 questions+0.869
ruits+0.843
year+0.794
nam+0.767
Neg logit contributions
ww-1.489
lor-1.402
burn-1.378
 subs-1.342
 tou-1.337
Token things
Token ID1243
Feature activation+8.14e-01

Pos logit contributions
years+3.534
ruits+3.177
year+3.170
 questions+3.099
nam+3.044
Neg logit contributions
ww-3.466
actus-3.011
 chip-2.991
 tou-2.986
 subs-2.940
Token inside
Token ID2641
Feature activation-7.69e-01

Pos logit contributions
ww+1.555
lor+1.469
burn+1.455
 subs+1.384
actus+1.339
Neg logit contributions
 questions-1.203
years-1.177
ruits-1.131
year-1.074
nam-1.069
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
years+1.316
 questions+1.242
ruits+1.187
year+1.186
nam+1.160
Neg logit contributions
ww-1.273
lor-1.183
burn-1.167
 subs-1.098
 tou-1.080
Token They
Token ID1119
Feature activation-3.83e+00

Pos logit contributions
years+0.284
 questions+0.280
ruits+0.264
year+0.261
nam+0.257
Neg logit contributions
ww-0.263
lor-0.245
burn-0.242
 subs-0.230
actus-0.221
Token see
Token ID766
Feature activation-1.61e+00

Pos logit contributions
years+6.620
year+6.484
ruits+6.326
adies+6.113
 questions+5.858
Neg logit contributions
 tou-6.481
ww-6.064
 po-6.023
 subs-5.875
 bottle-5.835
Token a
Token ID257
Feature activation+7.74e-01

Pos logit contributions
years+2.576
ruits+2.441
year+2.399
 questions+2.372
adies+2.236
Neg logit contributions
ww-2.874
burn-2.697
lor-2.680
 chip-2.634
 tou-2.625
Token candle
Token ID26839
Feature activation+1.71e+00

Pos logit contributions
ww+1.221
lor+1.155
burn+1.143
 subs+1.079
actus+1.011
Neg logit contributions
 questions-1.368
years-1.344
ruits-1.295
nam-1.255
year-1.248
Token,
Token ID11
Feature activation-2.32e+00

Pos logit contributions
ww+3.740
lor+3.544
burn+3.439
 subs+3.363
actus+3.224
Neg logit contributions
 questions-2.063
ruits-1.925
years-1.923
 three-1.823
year-1.772
Token a
Token ID257
Feature activation+6.41e-01

Pos logit contributions
years+3.842
 questions+3.449
ruits+3.449
year+3.318
adies+3.140
Neg logit contributions
burn-4.081
ww-4.080
 chip-3.868
 tou-3.834
 pel-3.738
Token stories
Token ID3923
Feature activation-4.89e-01

Pos logit contributions
years+2.303
year+2.100
 questions+2.011
ruits+2.004
adies+1.920
Neg logit contributions
ww-2.596
burn-2.492
lor-2.360
 tou-2.286
actus-2.281
Token with
Token ID351
Feature activation-8.32e-01

Pos logit contributions
years+0.774
 questions+0.725
ruits+0.697
year+0.689
nam+0.670
Neg logit contributions
ww-0.899
burn-0.843
lor-0.842
 subs-0.786
 chip-0.766
Token them
Token ID606
Feature activation-1.42e+00

Pos logit contributions
years+1.474
 questions+1.413
year+1.397
ruits+1.383
adies+1.322
Neg logit contributions
ww-1.332
burn-1.210
lor-1.200
 tou-1.110
 subs-1.106
Token.
Token ID13
Feature activation-8.71e-01

Pos logit contributions
years+2.608
year+2.397
 questions+2.349
ruits+2.328
adies+2.255
Neg logit contributions
ww-2.333
burn-2.159
lor-2.097
 chip-1.975
actus-1.927
Token Do
Token ID2141
Feature activation-3.07e-01

Pos logit contributions
years+1.366
 questions+1.322
ruits+1.245
year+1.230
nam+1.225
Neg logit contributions
ww-1.589
lor-1.484
burn-1.463
 subs-1.359
 chip-1.357
Token you
Token ID345
Feature activation-3.82e+00

Pos logit contributions
years+0.653
 questions+0.633
year+0.621
ruits+0.613
nam+0.588
Neg logit contributions
ww-0.402
lor-0.357
burn-0.350
 subs-0.320
 tou-0.314
Token want
Token ID765
Feature activation-2.16e-01

Pos logit contributions
years+6.246
nam+5.283
year+5.096
 chambers+4.517
 questions+4.254
Neg logit contributions
ww-8.092
burn-7.463
 tou-7.095
lor-6.761
 subs-6.744
Token to
Token ID284
Feature activation-9.09e-02

Pos logit contributions
years+0.373
 questions+0.361
ruits+0.354
year+0.345
nam+0.338
Neg logit contributions
ww-0.356
burn-0.328
lor-0.326
 subs-0.306
actus-0.295
Token try
Token ID1949
Feature activation-8.81e-01

Pos logit contributions
years+0.119
 questions+0.115
ruits+0.110
year+0.109
nam+0.103
Neg logit contributions
ww-0.188
burn-0.179
lor-0.177
 subs-0.170
 tou-0.166
Token?"
Token ID1701
Feature activation-1.12e+00

Pos logit contributions
years+1.297
year+1.216
 questions+1.210
ruits+1.190
nam+1.097
Neg logit contributions
ww-1.679
burn-1.650
lor-1.592
actus-1.491
 chip-1.486
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
years+1.981
 questions+1.869
ruits+1.864
year+1.827
nam+1.719
Neg logit contributions
ww-1.843
burn-1.684
 tou-1.613
 subs-1.609
 chip-1.604
Token try
Token ID1949
Feature activation+2.29e-01

Pos logit contributions
ww+0.709
lor+0.668
burn+0.666
 subs+0.638
actus+0.615
Neg logit contributions
years-0.467
 questions-0.466
ruits-0.438
year-0.420
nam-0.413
Token something
Token ID1223
Feature activation+1.24e+00

Pos logit contributions
ww+0.423
lor+0.397
burn+0.397
 subs+0.377
actus+0.363
Neg logit contributions
years-0.349
 questions-0.344
ruits-0.328
year-0.317
nam-0.313
Token different
Token ID1180
Feature activation-3.04e-01

Pos logit contributions
ww+2.294
lor+2.189
burn+2.173
 subs+2.015
actus+1.976
Neg logit contributions
 questions-2.009
years-1.855
ruits-1.801
 whether-1.733
year-1.728
Token.
Token ID13
Feature activation-1.82e+00

Pos logit contributions
years+0.501
 questions+0.475
ruits+0.463
year+0.452
nam+0.446
Neg logit contributions
ww-0.524
burn-0.489
lor-0.489
 subs-0.472
 tou-0.446
Token He
Token ID679
Feature activation-1.35e-01

Pos logit contributions
years+2.820
nam+2.820
ruits+2.767
 questions+2.665
adies+2.576
Neg logit contributions
ww-3.234
lor-3.121
 subs-3.033
 chip-2.836
burn-2.834
Token asked
Token ID1965
Feature activation-3.80e+00

Pos logit contributions
years+0.207
 questions+0.207
ruits+0.198
nam+0.190
year+0.188
Neg logit contributions
ww-0.245
burn-0.229
lor-0.228
 subs-0.220
 tou-0.210
Token his
Token ID465
Feature activation-1.04e+00

Pos logit contributions
nam+7.574
ruits+7.304
years+6.699
racted+6.545
 ago+6.319
Neg logit contributions
ww-5.360
 subs-5.245
lor-5.050
ic-4.881
 cho-4.870
Token farm
Token ID5318
Feature activation-8.17e-01

Pos logit contributions
years+1.817
year+1.688
ruits+1.685
nam+1.661
adies+1.613
Neg logit contributions
ww-1.735
burn-1.605
lor-1.568
 subs-1.529
 chip-1.526
Token friends
Token ID2460
Feature activation-1.68e+00

Pos logit contributions
years+1.460
nam+1.326
ruits+1.323
year+1.323
 questions+1.311
Neg logit contributions
ww-1.293
lor-1.227
burn-1.208
 chip-1.137
 subs-1.135
Token to
Token ID284
Feature activation+4.08e-01

Pos logit contributions
years+2.875
nam+2.620
year+2.592
ruits+2.567
adies+2.496
Neg logit contributions
ww-2.696
lor-2.612
burn-2.607
 subs-2.433
 chip-2.405
Token help
Token ID1037
Feature activation-1.58e+00

Pos logit contributions
ww+0.844
lor+0.796
burn+0.793
 subs+0.760
actus+0.735
Neg logit contributions
 questions-0.532
years-0.531
ruits-0.498
year-0.475
nam-0.469
Token blocks
Token ID7021
Feature activation+3.05e-02

Pos logit contributions
years+4.021
ruits+3.885
year+3.680
adies+3.569
 questions+3.445
Neg logit contributions
ww-3.630
 bottle-3.214
 tou-3.189
 chip-3.179
rob-3.114
Token in
Token ID287
Feature activation-3.05e-01

Pos logit contributions
ww+0.056
burn+0.053
lor+0.053
 subs+0.050
actus+0.048
Neg logit contributions
years-0.046
 questions-0.046
ruits-0.043
year-0.042
nam-0.041
Token the
Token ID262
Feature activation+6.55e-01

Pos logit contributions
years+0.518
 questions+0.507
ruits+0.487
year+0.473
adies+0.456
Neg logit contributions
ww-0.509
lor-0.477
burn-0.476
 subs-0.448
 chip-0.434
Token basement
Token ID17012
Feature activation-3.98e-01

Pos logit contributions
ww+1.253
burn+1.174
lor+1.173
 subs+1.119
actus+1.067
Neg logit contributions
 questions-0.969
years-0.965
ruits-0.911
nam-0.873
year-0.872
Token.
Token ID13
Feature activation-6.26e-01

Pos logit contributions
years+0.677
 questions+0.652
ruits+0.621
year+0.610
nam+0.603
Neg logit contributions
ww-0.663
lor-0.629
burn-0.621
 subs-0.594
 tou-0.568
Token They
Token ID1119
Feature activation-3.78e+00

Pos logit contributions
years+1.025
 questions+0.981
nam+0.934
year+0.928
ruits+0.925
Neg logit contributions
ww-1.076
lor-1.004
burn-0.983
 subs-0.968
 chip-0.945
Token are
Token ID389
Feature activation-1.88e+00

Pos logit contributions
years+6.941
year+6.845
ruits+6.513
 questions+6.465
adies+6.216
Neg logit contributions
 tou-5.885
ww-5.759
 po-5.497
 bottle-5.450
 subs-5.437
Token very
Token ID845
Feature activation-1.35e+00

Pos logit contributions
years+3.064
ruits+3.041
 questions+2.966
year+2.909
adies+2.889
Neg logit contributions
ww-2.957
 tou-2.849
 subs-2.834
burn-2.781
 chip-2.781
Token happy
Token ID3772
Feature activation-7.81e-02

Pos logit contributions
years+1.809
 questions+1.770
ruits+1.687
adies+1.623
year+1.584
Neg logit contributions
ww-2.743
 tou-2.420
 subs-2.420
 pel-2.394
 chip-2.376
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
years+0.123
 questions+0.120
ruits+0.115
year+0.112
nam+0.110
Neg logit contributions
ww-0.141
burn-0.132
lor-0.131
 subs-0.126
actus-0.121
Token They
Token ID1119
Feature activation-2.61e+00

Pos logit contributions
years+0.667
 questions+0.643
year+0.613
ruits+0.607
nam+0.605
Neg logit contributions
ww-0.728
lor-0.676
burn-0.662
 subs-0.644
 chip-0.625
Token many
Token ID867
Feature activation-2.03e+00

Pos logit contributions
years+1.161
 questions+1.134
ruits+1.116
year+1.036
nam+0.986
Neg logit contributions
ww-1.905
lor-1.797
burn-1.760
 subs-1.736
 tou-1.719
Token shiny
Token ID22441
Feature activation-1.23e+00

Pos logit contributions
years+3.382
ruits+3.242
year+3.152
 questions+3.094
adies+3.069
Neg logit contributions
ww-3.537
 subs-3.109
actus-3.098
 chip-3.082
lor-3.073
Token things
Token ID1243
Feature activation+9.71e-01

Pos logit contributions
years+2.091
 questions+1.941
ruits+1.904
year+1.901
nam+1.785
Neg logit contributions
ww-2.180
burn-1.964
lor-1.944
actus-1.919
 subs-1.880
Token inside
Token ID2641
Feature activation-4.34e-01

Pos logit contributions
ww+1.924
lor+1.821
burn+1.799
 subs+1.712
actus+1.666
Neg logit contributions
 questions-1.372
years-1.334
ruits-1.283
year-1.217
nam-1.211
Token.
Token ID13
Feature activation+1.75e-01

Pos logit contributions
years+0.726
 questions+0.699
ruits+0.664
year+0.655
nam+0.644
Neg logit contributions
ww-0.733
lor-0.684
burn-0.677
 subs-0.640
 tou-0.622
Token They
Token ID1119
Feature activation-3.78e+00

Pos logit contributions
ww+0.293
lor+0.273
burn+0.270
 subs+0.257
actus+0.247
Neg logit contributions
years-0.297
 questions-0.295
ruits-0.280
year-0.272
nam-0.269
Token are
Token ID389
Feature activation-1.86e+00

Pos logit contributions
years+6.831
year+6.677
ruits+6.468
adies+6.213
 questions+6.053
Neg logit contributions
 tou-6.103
ww-5.787
 po-5.702
 subs-5.660
 bottle-5.603
Token curious
Token ID11040
Feature activation-1.60e+00

Pos logit contributions
years+2.986
ruits+2.984
adies+2.885
 questions+2.867
year+2.812
Neg logit contributions
ww-2.939
 tou-2.893
 subs-2.838
 pel-2.816
 chip-2.780
Token and
Token ID290
Feature activation-2.44e+00

Pos logit contributions
years+2.674
ruits+2.470
year+2.435
nam+2.402
 questions+2.357
Neg logit contributions
ww-2.752
burn-2.596
 subs-2.521
lor-2.496
 chip-2.413
Token want
Token ID765
Feature activation-6.07e-01

Pos logit contributions
years+2.878
ruits+2.696
adies+2.408
year+2.404
 questions+2.393
Neg logit contributions
ww-5.219
 tou-4.970
 pel-4.804
 subs-4.763
 po-4.554
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
years+1.178
ruits+1.129
 questions+1.126
year+1.101
nam+1.077
Neg logit contributions
ww-0.840
burn-0.788
lor-0.774
 subs-0.747
 chip-0.704
Token many
Token ID867
Feature activation-2.30e+00

Pos logit contributions
years+1.248
 questions+1.208
ruits+1.185
year+1.114
nam+1.043
Neg logit contributions
ww-2.020
lor-1.902
burn-1.862
 tou-1.831
 subs-1.828
Token modern
Token ID3660
Feature activation-2.23e+00

Pos logit contributions
years+4.116
year+3.815
ruits+3.766
adies+3.645
 questions+3.613
Neg logit contributions
ww-3.850
actus-3.434
 tou-3.344
 chip-3.308
 subs-3.264
Token toys
Token ID14958
Feature activation+3.69e-01

Pos logit contributions
years+4.165
year+3.820
ruits+3.653
 questions+3.583
nam+3.448
Neg logit contributions
ww-3.782
lor-3.241
actus-3.235
burn-3.224
 chip-3.195
Token inside
Token ID2641
Feature activation-7.93e-01

Pos logit contributions
ww+0.711
lor+0.667
burn+0.666
 subs+0.636
actus+0.611
Neg logit contributions
years-0.532
 questions-0.531
ruits-0.502
year-0.482
nam-0.475
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
years+1.403
 questions+1.329
ruits+1.263
year+1.254
nam+1.226
Neg logit contributions
ww-1.267
lor-1.176
burn-1.171
 subs-1.086
 tou-1.073
Token They
Token ID1119
Feature activation-3.77e+00

Pos logit contributions
ww+0.272
lor+0.253
burn+0.251
 subs+0.238
actus+0.229
Neg logit contributions
years-0.279
 questions-0.277
ruits-0.262
year-0.256
nam-0.252
Token are
Token ID389
Feature activation-1.84e+00

Pos logit contributions
years+6.606
year+6.473
ruits+6.070
adies+5.905
 questions+5.743
Neg logit contributions
 tou-6.356
ww-6.052
 po-5.926
 bottle-5.765
 subs-5.673
Token shiny
Token ID22441
Feature activation-1.42e+00

Pos logit contributions
years+2.893
ruits+2.835
adies+2.757
 questions+2.755
year+2.715
Neg logit contributions
ww-3.023
 tou-2.977
 chip-2.888
 subs-2.868
 pel-2.846
Token and
Token ID290
Feature activation-1.66e+00

Pos logit contributions
years+2.387
ruits+2.227
year+2.217
 questions+2.194
adies+2.010
Neg logit contributions
ww-2.526
burn-2.347
actus-2.330
lor-2.270
 chip-2.198
Token make
Token ID787
Feature activation-1.11e+00

Pos logit contributions
years+3.114
 questions+3.002
ruits+2.986
adies+2.928
year+2.896
Neg logit contributions
ww-2.429
 tou-2.155
 chip-2.082
burn-2.020
 subs-2.018
Token noises
Token ID26782
Feature activation-4.89e-01

Pos logit contributions
years+1.886
ruits+1.796
 questions+1.784
year+1.730
nam+1.673
Neg logit contributions
ww-1.805
burn-1.696
lor-1.695
 subs-1.638
 chip-1.593
Token a
Token ID257
Feature activation-1.88e-01

Pos logit contributions
ruits+2.839
years+2.813
 questions+2.721
adies+2.568
year+2.499
Neg logit contributions
ww-3.455
burn-3.252
 chip-3.203
 pel-3.165
 subs-3.163
Token lot
Token ID1256
Feature activation-3.79e-01

Pos logit contributions
years+0.348
 questions+0.337
ruits+0.327
year+0.320
nam+0.309
Neg logit contributions
ww-0.295
burn-0.270
lor-0.266
 subs-0.255
 tou-0.245
Token of
Token ID286
Feature activation-2.22e+00

Pos logit contributions
years+0.636
 questions+0.610
ruits+0.583
nam+0.579
year+0.558
Neg logit contributions
ww-0.652
lor-0.601
burn-0.594
 subs-0.571
 tou-0.554
Token boxes
Token ID10559
Feature activation-1.82e-01

Pos logit contributions
years+4.215
ruits+3.825
nam+3.652
year+3.627
adies+3.606
Neg logit contributions
ww-3.528
 chip-3.113
burn-3.105
 pel-3.087
 tou-3.034
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
years+0.296
 questions+0.292
ruits+0.276
year+0.267
nam+0.264
Neg logit contributions
ww-0.314
burn-0.298
lor-0.295
 subs-0.282
 tou-0.270
Token They
Token ID1119
Feature activation-3.75e+00

Pos logit contributions
years+2.458
 questions+2.234
nam+2.185
year+2.170
ruits+2.132
Neg logit contributions
ww-2.161
 chip-1.993
 subs-1.935
lor-1.927
burn-1.927
Token want
Token ID765
Feature activation-1.01e+00

Pos logit contributions
years+7.327
year+6.841
ruits+6.500
adies+6.119
 questions+5.953
Neg logit contributions
 tou-6.060
ww-6.053
 pel-5.686
 subs-5.634
 po-5.459
Token to
Token ID284
Feature activation-9.49e-01

Pos logit contributions
years+2.074
ruits+1.924
 questions+1.904
year+1.900
nam+1.867
Neg logit contributions
ww-1.327
burn-1.272
lor-1.180
 subs-1.178
 chip-1.161
Token see
Token ID766
Feature activation-1.64e+00

Pos logit contributions
years+1.427
 questions+1.333
ruits+1.282
year+1.262
nam+1.209
Neg logit contributions
ww-1.824
burn-1.761
 tou-1.688
 subs-1.668
lor-1.656
Token what
Token ID644
Feature activation+1.28e+00

Pos logit contributions
years+2.715
ruits+2.464
year+2.380
 questions+2.370
adies+2.278
Neg logit contributions
ww-2.891
burn-2.861
 chip-2.669
lor-2.663
 tou-2.655
Token is
Token ID318
Feature activation-3.65e-01

Pos logit contributions
ww+2.061
lor+1.977
burn+1.948
 subs+1.830
actus+1.780
Neg logit contributions
 questions-2.278
years-2.154
ruits-2.129
nam-2.043
year-2.032
Token fix
Token ID4259
Feature activation+3.79e-01

Pos logit contributions
years+0.977
 questions+0.941
ruits+0.910
year+0.905
nam+0.845
Neg logit contributions
ww-1.242
burn-1.202
lor-1.179
 subs-1.126
 tou-1.104
Token your
Token ID534
Feature activation-8.27e-01

Pos logit contributions
ww+0.599
lor+0.554
burn+0.552
 subs+0.522
actus+0.497
Neg logit contributions
years-0.676
 questions-0.676
ruits-0.644
year-0.624
nam-0.618
Token bike
Token ID7161
Feature activation-9.21e-01

Pos logit contributions
years+1.547
year+1.466
 questions+1.437
ruits+1.430
adies+1.356
Neg logit contributions
ww-1.301
burn-1.223
lor-1.176
 subs-1.135
 tou-1.124
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
years+1.577
 questions+1.505
ruits+1.459
year+1.451
adies+1.377
Neg logit contributions
ww-1.580
burn-1.498
lor-1.480
 chip-1.365
 subs-1.356
Token Do
Token ID2141
Feature activation-4.29e-01

Pos logit contributions
years+2.561
 questions+2.530
ruits+2.434
year+2.346
nam+2.328
Neg logit contributions
ww-2.110
lor-1.910
burn-1.874
 subs-1.742
 chip-1.730
Token you
Token ID345
Feature activation-3.75e+00

Pos logit contributions
years+0.931
 questions+0.912
year+0.899
ruits+0.891
nam+0.840
Neg logit contributions
ww-0.536
lor-0.474
burn-0.456
 subs-0.423
 tou-0.410
Token want
Token ID765
Feature activation+1.51e-02

Pos logit contributions
years+6.078
year+5.347
nam+5.308
ruits+4.907
 questions+4.760
Neg logit contributions
ww-7.678
burn-7.318
 subs-6.985
 tou-6.889
br-6.556
Token me
Token ID502
Feature activation-6.02e-01

Pos logit contributions
ww+0.024
burn+0.023
lor+0.022
 subs+0.021
 tou+0.020
Neg logit contributions
years-0.027
 questions-0.026
ruits-0.025
year-0.025
nam-0.024
Token to
Token ID284
Feature activation-3.09e-01

Pos logit contributions
years+1.134
 questions+1.101
ruits+1.084
year+1.056
nam+1.029
Neg logit contributions
ww-0.907
burn-0.878
lor-0.827
 subs-0.774
 tou-0.752
Token try
Token ID1949
Feature activation-7.60e-01

Pos logit contributions
years+0.451
 questions+0.438
ruits+0.424
year+0.419
nam+0.391
Neg logit contributions
ww-0.598
burn-0.573
lor-0.557
 subs-0.538
 tou-0.526
Token?"
Token ID1701
Feature activation-1.01e+00

Pos logit contributions
years+1.213
 questions+1.150
year+1.140
ruits+1.133
adies+1.052
Neg logit contributions
ww-1.370
burn-1.332
lor-1.279
 chip-1.189
actus-1.183
Token asks
Token ID7893
Feature activation-1.34e+00

Pos logit contributions
years+1.969
year+1.838
 questions+1.826
ruits+1.818
nam+1.789
Neg logit contributions
ww-1.767
burn-1.539
 subs-1.527
lor-1.518
actus-1.485
Token Tom
Token ID4186
Feature activation-1.38e+00

Pos logit contributions
years+2.268
year+2.030
 questions+2.006
nam+1.993
ruits+1.967
Neg logit contributions
ww-2.287
burn-2.170
lor-2.076
 chip-2.011
 tou-1.998
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
years+2.252
year+2.067
nam+1.976
 questions+1.922
ruits+1.911
Neg logit contributions
ww-2.472
burn-2.393
lor-2.336
 chip-2.266
 subs-2.183
Token "
Token ID366
Feature activation-1.45e+00

Pos logit contributions
years+2.835
 questions+2.671
ruits+2.648
year+2.591
adies+2.563
Neg logit contributions
ww-2.357
burn-2.196
lor-2.178
 chip-2.176
 subs-2.095
TokenDo
Token ID5211
Feature activation+4.97e-01

Pos logit contributions
 questions+2.616
years+2.580
ruits+2.444
year+2.373
nam+2.361
Neg logit contributions
ww-2.388
burn-2.274
lor-2.171
 chip-2.047
 subs-2.003
Token you
Token ID345
Feature activation-3.75e+00

Pos logit contributions
ww+0.721
burn+0.669
lor+0.669
 subs+0.626
actus+0.594
Neg logit contributions
 questions-0.950
years-0.948
ruits-0.910
year-0.877
nam-0.875
Token want
Token ID765
Feature activation+2.47e-01

Pos logit contributions
years+5.479
nam+4.864
year+4.643
 chambers+4.558
 curiously+4.330
Neg logit contributions
ww-8.553
burn-7.420
 tou-7.198
 subs-7.088
 chip-6.951
Token ice
Token ID4771
Feature activation+6.62e-01

Pos logit contributions
ww+0.435
burn+0.404
lor+0.404
 subs+0.383
actus+0.367
Neg logit contributions
years-0.399
 questions-0.396
ruits-0.377
year-0.365
nam-0.360
Token cream
Token ID8566
Feature activation+7.12e-01

Pos logit contributions
ww+0.805
lor+0.740
burn+0.723
 subs+0.687
 tou+0.637
Neg logit contributions
 questions-1.413
years-1.405
ruits-1.364
year-1.318
nam-1.309
Token?"
Token ID1701
Feature activation+3.52e-01

Pos logit contributions
ww+1.522
lor+1.452
burn+1.431
 subs+1.396
 tou+1.339
Neg logit contributions
 questions-0.860
years-0.850
ruits-0.806
year-0.754
nam-0.752
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
ww+0.744
lor+0.702
burn+0.700
 subs+0.672
actus+0.649
Neg logit contributions
years-0.446
 questions-0.445
ruits-0.416
year-0.398
nam-0.391
Token box
Token ID3091
Feature activation-1.89e-01

Pos logit contributions
years+0.076
 questions+0.075
ruits+0.072
year+0.071
nam+0.069
Neg logit contributions
ww-0.059
burn-0.054
lor-0.053
 subs-0.051
 tou-0.049
Token in
Token ID287
Feature activation-2.53e-01

Pos logit contributions
years+0.270
 questions+0.269
ruits+0.250
nam+0.239
year+0.237
Neg logit contributions
ww-0.364
burn-0.345
lor-0.342
 subs-0.329
 tou-0.317
Token the
Token ID262
Feature activation+8.83e-01

Pos logit contributions
years+0.431
 questions+0.423
ruits+0.407
year+0.393
adies+0.380
Neg logit contributions
ww-0.425
burn-0.396
lor-0.394
 subs-0.372
 tou-0.364
Token garage
Token ID15591
Feature activation-5.91e-01

Pos logit contributions
ww+1.950
lor+1.842
burn+1.837
 subs+1.772
actus+1.695
Neg logit contributions
 questions-1.050
years-1.037
ruits-0.960
nam-0.933
year-0.907
Token.
Token ID13
Feature activation-3.01e-01

Pos logit contributions
years+1.021
 questions+0.990
ruits+0.928
year+0.918
nam+0.907
Neg logit contributions
ww-0.963
lor-0.931
burn-0.915
 subs-0.870
 chip-0.835
Token They
Token ID1119
Feature activation-3.74e+00

Pos logit contributions
years+0.449
 questions+0.440
ruits+0.406
year+0.403
nam+0.394
Neg logit contributions
ww-0.570
lor-0.531
burn-0.529
 subs-0.509
 chip-0.494
Token wonder
Token ID4240
Feature activation-2.76e+00

Pos logit contributions
years+6.764
year+6.552
ruits+6.392
 questions+6.264
adies+6.104
Neg logit contributions
 tou-6.108
ww-5.960
 subs-5.547
 po-5.489
 bottle-5.393
Token what
Token ID644
Feature activation+1.33e+00

Pos logit contributions
years+3.733
year+3.055
ruits+2.824
adies+2.671
 Many+2.634
Neg logit contributions
ww-5.730
burn-5.621
 pel-5.391
 po-5.356
 tou-5.353
Token is
Token ID318
Feature activation+1.99e-01

Pos logit contributions
ww+2.211
burn+2.124
lor+2.081
 subs+1.964
actus+1.883
Neg logit contributions
 questions-2.394
years-2.231
ruits-2.219
nam-2.122
 three-2.091
Token inside
Token ID2641
Feature activation-6.41e-01

Pos logit contributions
ww+0.375
lor+0.351
burn+0.349
 subs+0.334
 tou+0.322
Neg logit contributions
years-0.295
 questions-0.292
ruits-0.277
year-0.267
nam-0.263
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
years+1.117
 questions+1.071
ruits+1.024
year+1.017
nam+0.990
Neg logit contributions
ww-1.035
lor-0.965
burn-0.965
 subs-0.897
 tou-0.880
Token made
Token ID925
Feature activation-4.16e-01

Pos logit contributions
years+0.253
 questions+0.253
ruits+0.243
year+0.229
nam+0.228
Neg logit contributions
ww-0.338
burn-0.314
lor-0.313
 subs-0.304
 tou-0.291
Token a
Token ID257
Feature activation+1.25e+00

Pos logit contributions
years+0.634
ruits+0.612
 questions+0.612
year+0.576
nam+0.575
Neg logit contributions
ww-0.753
lor-0.712
burn-0.708
 subs-0.684
 chip-0.647
Token plan
Token ID1410
Feature activation+7.85e-02

Pos logit contributions
ww+2.413
lor+2.339
burn+2.308
 subs+2.163
rob+2.109
Neg logit contributions
 questions-1.811
years-1.716
ruits-1.627
nam-1.618
 three-1.585
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
ww+0.136
lor+0.127
burn+0.127
 subs+0.121
 tou+0.115
Neg logit contributions
years-0.127
 questions-0.126
ruits-0.121
year-0.116
nam-0.116
Token He
Token ID679
Feature activation-1.06e+00

Pos logit contributions
years+2.459
 questions+2.339
ruits+2.246
adies+2.150
nam+2.138
Neg logit contributions
ww-2.526
lor-2.371
 subs-2.298
 chip-2.267
burn-2.238
Token asked
Token ID1965
Feature activation-3.74e+00

Pos logit contributions
 questions+1.742
years+1.727
ruits+1.694
nam+1.623
adies+1.586
Neg logit contributions
ww-1.823
burn-1.687
 subs-1.654
lor-1.651
 tou-1.538
Token his
Token ID465
Feature activation-8.41e-01

Pos logit contributions
years+7.858
ruits+7.415
nam+6.986
 ago+6.810
adies+6.775
Neg logit contributions
ww-4.916
 chip-4.586
lor-4.366
 Cathy-4.337
 subs-4.327
Token mom
Token ID1995
Feature activation-2.63e+00

Pos logit contributions
years+1.422
ruits+1.351
year+1.336
 questions+1.308
adies+1.303
Neg logit contributions
ww-1.412
burn-1.294
lor-1.289
 subs-1.269
 chip-1.248
Token for
Token ID329
Feature activation-1.50e+00

Pos logit contributions
years+4.828
nam+4.457
adies+4.420
year+4.386
ruits+4.318
Neg logit contributions
ww-3.879
burn-3.846
lor-3.784
 chip-3.748
 bottle-3.416
Token help
Token ID1037
Feature activation-1.47e+00

Pos logit contributions
ruits+2.255
years+2.237
 questions+2.135
year+2.080
adies+2.074
Neg logit contributions
ww-2.654
lor-2.495
burn-2.410
 subs-2.372
 chip-2.282
Token,
Token ID11
Feature activation-2.12e+00

Pos logit contributions
years+2.597
 questions+2.448
ruits+2.440
year+2.408
adies+2.402
Neg logit contributions
ww-2.235
burn-2.195
lor-2.122
 chip-2.071
 subs-2.019
Token with
Token ID351
Feature activation-7.53e-01

Pos logit contributions
years+2.349
ruits+2.247
 questions+2.220
year+2.137
nam+2.033
Neg logit contributions
ww-2.055
lor-1.961
burn-1.880
 tou-1.734
 chip-1.727
Token something
Token ID1223
Feature activation+5.20e-01

Pos logit contributions
years+1.536
 questions+1.498
ruits+1.474
year+1.474
adies+1.382
Neg logit contributions
ww-1.012
burn-0.894
lor-0.893
actus-0.810
 subs-0.798
Token else
Token ID2073
Feature activation-1.17e+00

Pos logit contributions
ww+1.074
lor+1.017
burn+1.016
 subs+0.971
actus+0.937
Neg logit contributions
 questions-0.677
years-0.670
ruits-0.631
year-0.602
nam-0.595
Token.
Token ID13
Feature activation-7.05e-01

Pos logit contributions
years+2.093
 questions+1.934
year+1.927
ruits+1.876
adies+1.815
Neg logit contributions
ww-1.956
burn-1.765
lor-1.761
 chip-1.624
actus-1.615
Token Do
Token ID2141
Feature activation-1.63e-01

Pos logit contributions
years+1.072
 questions+1.042
ruits+0.977
year+0.960
nam+0.944
Neg logit contributions
ww-1.326
lor-1.255
burn-1.236
 subs-1.160
 tou-1.143
Token you
Token ID345
Feature activation-3.74e+00

Pos logit contributions
years+0.344
 questions+0.337
ruits+0.327
year+0.327
nam+0.312
Neg logit contributions
ww-0.213
lor-0.191
burn-0.187
 subs-0.172
 tou-0.166
Token want
Token ID765
Feature activation-2.83e-02

Pos logit contributions
years+5.962
nam+5.074
year+5.025
 chambers+4.316
 curiously+4.178
Neg logit contributions
ww-7.802
burn-7.533
 tou-7.085
 subs-6.897
lor-6.896
Token to
Token ID284
Feature activation+4.86e-01

Pos logit contributions
years+0.050
 questions+0.049
ruits+0.047
year+0.046
nam+0.045
Neg logit contributions
ww-0.046
burn-0.042
lor-0.042
 subs-0.039
actus-0.038
Token draw
Token ID3197
Feature activation-7.68e-01

Pos logit contributions
ww+1.020
lor+0.962
burn+0.956
 subs+0.920
actus+0.889
Neg logit contributions
 questions-0.619
years-0.615
ruits-0.577
year-0.546
nam-0.544
Token a
Token ID257
Feature activation+7.42e-01

Pos logit contributions
years+1.177
ruits+1.123
 questions+1.121
year+1.075
nam+1.030
Neg logit contributions
ww-1.427
lor-1.344
burn-1.331
actus-1.233
 subs-1.229
Token picture
Token ID4286
Feature activation-5.17e-03

Pos logit contributions
ww+1.080
lor+1.008
burn+0.997
 subs+0.945
actus+0.881
Neg logit contributions
 questions-1.411
years-1.382
ruits-1.336
nam-1.294
year-1.290
Token in
Token ID287
Feature activation+7.01e-02

Pos logit contributions
ww+0.087
burn+0.081
lor+0.081
 subs+0.077
 tou+0.074
Neg logit contributions
years-0.072
 questions-0.071
ruits-0.067
year-0.065
nam-0.064
Token the
Token ID262
Feature activation+7.18e-01

Pos logit contributions
ww+0.122
lor+0.114
burn+0.114
 subs+0.108
 tou+0.104
Neg logit contributions
years-0.115
 questions-0.113
ruits-0.108
year-0.104
nam-0.102
Token play
Token ID711
Feature activation-1.01e+00

Pos logit contributions
ww+1.437
burn+1.350
lor+1.348
 subs+1.291
actus+1.230
Neg logit contributions
 questions-1.000
years-0.994
ruits-0.938
nam-0.898
year-0.892
Tokenroom
Token ID3823
Feature activation+5.49e-05

Pos logit contributions
years+1.639
 questions+1.572
nam+1.477
ruits+1.469
year+1.408
Neg logit contributions
ww-1.801
lor-1.766
burn-1.678
 subs-1.622
 tou-1.587
Token.
Token ID13
Feature activation-6.19e-01

Pos logit contributions
ww+0.000
lor+0.000
burn+0.000
 subs+0.000
actus+0.000
Neg logit contributions
years-0.000
 questions-0.000
ruits-0.000
year-0.000
nam-0.000
Token They
Token ID1119
Feature activation-3.73e+00

Pos logit contributions
years+0.982
 questions+0.945
nam+0.892
year+0.890
ruits+0.878
Neg logit contributions
ww-1.107
lor-1.036
burn-1.006
 subs-0.984
 chip-0.964
Token are
Token ID389
Feature activation-2.08e+00

Pos logit contributions
years+6.784
year+6.614
 questions+6.285
ruits+6.283
adies+6.187
Neg logit contributions
ww-6.008
 tou-5.903
 subs-5.553
 po-5.461
 pel-5.375
Token very
Token ID845
Feature activation-1.45e+00

Pos logit contributions
years+3.538
ruits+3.523
 questions+3.410
adies+3.355
year+3.323
Neg logit contributions
ww-3.152
 tou-3.103
 subs-3.050
 chip-3.020
 pel-3.013
Token happy
Token ID3772
Feature activation-2.20e-01

Pos logit contributions
years+1.801
 questions+1.772
ruits+1.657
adies+1.603
year+1.547
Neg logit contributions
ww-3.117
 pel-2.766
 tou-2.764
 subs-2.757
 chip-2.716
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
years+0.336
 questions+0.327
ruits+0.314
year+0.305
nam+0.299
Neg logit contributions
ww-0.409
burn-0.385
lor-0.380
 subs-0.367
actus-0.352
Token start
Token ID923
Feature activation-9.13e-01

Pos logit contributions
years+0.123
 questions+0.121
ruits+0.114
year+0.107
nam+0.102
Neg logit contributions
ww-0.252
burn-0.236
lor-0.234
 subs-0.230
 tou-0.224
Token toys
Token ID14958
Feature activation-3.86e-02

Pos logit contributions
years+4.280
ruits+4.139
year+3.954
adies+3.896
 questions+3.713
Neg logit contributions
ww-3.605
 chip-3.194
 bottle-3.171
 tou-3.168
rob-3.048
Token in
Token ID287
Feature activation+7.57e-02

Pos logit contributions
years+0.058
 questions+0.058
ruits+0.054
year+0.053
nam+0.052
Neg logit contributions
ww-0.071
burn-0.067
lor-0.067
 subs-0.064
 tou-0.062
Token the
Token ID262
Feature activation+9.45e-01

Pos logit contributions
ww+0.132
burn+0.123
lor+0.123
 subs+0.116
 tou+0.112
Neg logit contributions
years-0.124
 questions-0.122
ruits-0.116
year-0.113
nam-0.110
Token park
Token ID3952
Feature activation-7.08e-01

Pos logit contributions
ww+1.892
lor+1.774
burn+1.770
 subs+1.701
actus+1.615
Neg logit contributions
 questions-1.323
years-1.303
ruits-1.231
nam-1.193
year-1.162
Token.
Token ID13
Feature activation+9.01e-02

Pos logit contributions
years+1.255
 questions+1.210
year+1.126
ruits+1.124
nam+1.109
Neg logit contributions
ww-1.129
lor-1.079
burn-1.059
 subs-1.000
 tou-0.987
Token They
Token ID1119
Feature activation-3.72e+00

Pos logit contributions
ww+0.168
lor+0.157
burn+0.157
 subs+0.150
 tou+0.145
Neg logit contributions
years-0.136
 questions-0.135
ruits-0.126
year-0.123
nam-0.121
Token are
Token ID389
Feature activation-1.64e+00

Pos logit contributions
years+6.761
year+6.610
ruits+6.196
 questions+6.090
adies+6.055
Neg logit contributions
 tou-6.024
ww-5.812
 po-5.509
 bottle-5.336
 subs-5.263
Token very
Token ID845
Feature activation-1.15e+00

Pos logit contributions
years+2.632
ruits+2.604
 questions+2.554
adies+2.494
year+2.486
Neg logit contributions
ww-2.642
 tou-2.545
 subs-2.480
burn-2.480
 chip-2.467
Token happy
Token ID3772
Feature activation-2.07e-01

Pos logit contributions
years+1.507
 questions+1.487
ruits+1.402
adies+1.352
year+1.324
Neg logit contributions
ww-2.374
 tou-2.104
 subs-2.095
 pel-2.065
burn-2.062
Token.
Token ID13
Feature activation+5.68e-01

Pos logit contributions
years+0.314
 questions+0.306
ruits+0.293
year+0.285
nam+0.279
Neg logit contributions
ww-0.387
burn-0.365
lor-0.360
 subs-0.347
actus-0.334
Token
Token ID198
Feature activation+4.00e-01

Pos logit contributions
ww+0.953
lor+0.888
burn+0.885
 subs+0.838
actus+0.800
Neg logit contributions
 questions-0.970
years-0.966
ruits-0.929
year-0.890
nam-0.884
Token box
Token ID3091
Feature activation+4.76e-02

Pos logit contributions
ww+0.312
burn+0.286
lor+0.284
 subs+0.268
actus+0.255
Neg logit contributions
years-0.393
 questions-0.390
ruits-0.374
year-0.365
nam-0.358
Token in
Token ID287
Feature activation+2.59e-02

Pos logit contributions
ww+0.094
burn+0.089
lor+0.089
 subs+0.085
 tou+0.082
Neg logit contributions
years-0.066
 questions-0.066
ruits-0.061
year-0.058
nam-0.058
Token the
Token ID262
Feature activation+6.50e-01

Pos logit contributions
ww+0.045
burn+0.042
lor+0.042
 subs+0.040
 tou+0.038
Neg logit contributions
years-0.043
 questions-0.042
ruits-0.040
year-0.039
nam-0.038
Token yard
Token ID12699
Feature activation-4.89e-01

Pos logit contributions
ww+1.349
burn+1.271
lor+1.270
 subs+1.220
actus+1.166
Neg logit contributions
 questions-0.852
years-0.849
ruits-0.794
nam-0.761
year-0.759
Token.
Token ID13
Feature activation-4.60e-01

Pos logit contributions
years+0.828
 questions+0.810
ruits+0.754
nam+0.742
year+0.739
Neg logit contributions
ww-0.813
lor-0.780
burn-0.771
 subs-0.732
 chip-0.702
Token They
Token ID1119
Feature activation-3.71e+00

Pos logit contributions
years+0.726
 questions+0.717
ruits+0.662
year+0.654
nam+0.639
Neg logit contributions
ww-0.826
lor-0.774
burn-0.758
 subs-0.732
 chip-0.719
Token want
Token ID765
Feature activation-5.99e-01

Pos logit contributions
years+6.879
year+6.723
ruits+6.397
 questions+6.374
adies+6.167
Neg logit contributions
 tou-5.940
ww-5.575
 po-5.478
 bottle-5.371
 subs-5.197
Token to
Token ID284
Feature activation-5.58e-01

Pos logit contributions
years+1.193
 questions+1.132
ruits+1.116
year+1.104
nam+1.065
Neg logit contributions
ww-0.829
burn-0.781
lor-0.746
 subs-0.728
 tou-0.702
Token see
Token ID766
Feature activation-1.40e+00

Pos logit contributions
years+0.757
 questions+0.725
ruits+0.678
year+0.675
nam+0.629
Neg logit contributions
ww-1.144
burn-1.086
lor-1.064
 subs-1.041
 tou-1.035
Token what
Token ID644
Feature activation+1.74e+00

Pos logit contributions
years+1.936
ruits+1.722
year+1.695
 questions+1.688
adies+1.570
Neg logit contributions
ww-2.794
burn-2.749
lor-2.649
 tou-2.599
 chip-2.560
Token is
Token ID318
Feature activation-3.05e-02

Pos logit contributions
ww+2.943
lor+2.840
burn+2.834
 subs+2.627
actus+2.532
Neg logit contributions
 questions-3.044
ruits-2.838
years-2.791
 three-2.707
nam-2.693
Token real
Token ID1103
Feature activation+8.26e-02

Pos logit contributions
ww+0.471
lor+0.435
burn+0.434
 subs+0.411
actus+0.392
Neg logit contributions
years-0.518
 questions-0.515
ruits-0.493
year-0.477
nam-0.471
Token car
Token ID1097
Feature activation+3.17e-01

Pos logit contributions
ww+0.132
burn+0.122
lor+0.121
 subs+0.115
actus+0.111
Neg logit contributions
years-0.147
 questions-0.146
ruits-0.140
year-0.136
nam-0.134
Token on
Token ID319
Feature activation-3.57e-02

Pos logit contributions
ww+0.582
lor+0.544
burn+0.543
 subs+0.517
actus+0.497
Neg logit contributions
years-0.483
 questions-0.481
ruits-0.456
year-0.440
nam-0.434
Token it
Token ID340
Feature activation-2.77e-01

Pos logit contributions
years+0.065
 questions+0.064
ruits+0.061
year+0.060
nam+0.058
Neg logit contributions
ww-0.056
lor-0.052
burn-0.052
 subs-0.049
 tou-0.047
Token.
Token ID13
Feature activation-6.04e-02

Pos logit contributions
years+0.461
 questions+0.445
ruits+0.427
year+0.419
nam+0.414
Neg logit contributions
ww-0.474
lor-0.446
burn-0.439
 subs-0.421
actus-0.402
Token They
Token ID1119
Feature activation-3.70e+00

Pos logit contributions
years+0.098
 questions+0.096
ruits+0.091
year+0.089
nam+0.089
Neg logit contributions
ww-0.106
lor-0.099
burn-0.098
 subs-0.094
actus-0.091
Token are
Token ID389
Feature activation-1.73e+00

Pos logit contributions
years+7.175
year+6.882
ruits+6.607
adies+6.098
 questions+6.060
Neg logit contributions
 tou-5.779
ww-5.487
 subs-5.448
 po-5.318
burn-5.299
Token very
Token ID845
Feature activation-1.33e+00

Pos logit contributions
years+2.636
ruits+2.620
 questions+2.502
adies+2.444
year+2.444
Neg logit contributions
ww-2.953
 tou-2.880
burn-2.877
 subs-2.864
 pel-2.827
Token excited
Token ID6568
Feature activation-5.02e-02

Pos logit contributions
years+1.588
 questions+1.534
ruits+1.465
adies+1.369
year+1.349
Neg logit contributions
ww-2.943
 subs-2.651
 tou-2.643
 pel-2.642
burn-2.588
Token.
Token ID13
Feature activation+3.36e-01

Pos logit contributions
years+0.079
 questions+0.076
ruits+0.073
year+0.072
nam+0.071
Neg logit contributions
ww-0.091
burn-0.086
lor-0.085
 subs-0.081
actus-0.078
Token They
Token ID1119
Feature activation-2.75e+00

Pos logit contributions
ww+0.637
lor+0.597
burn+0.596
 subs+0.568
actus+0.547
Neg logit contributions
years-0.499
 questions-0.498
ruits-0.471
year-0.453
nam-0.447
Token a
Token ID257
Feature activation+1.50e+00

Pos logit contributions
ww+0.333
lor+0.310
burn+0.310
 subs+0.294
actus+0.282
Neg logit contributions
years-0.304
 questions-0.301
ruits-0.289
year-0.278
nam-0.273
Token new
Token ID649
Feature activation-1.66e-01

Pos logit contributions
ww+2.562
lor+2.481
burn+2.401
 subs+2.278
actus+2.149
Neg logit contributions
 questions-2.545
years-2.366
ruits-2.271
nam-2.269
 three-2.260
Token one
Token ID530
Feature activation+2.46e-01

Pos logit contributions
years+0.311
 questions+0.303
ruits+0.295
year+0.287
nam+0.279
Neg logit contributions
ww-0.255
burn-0.235
lor-0.234
 subs-0.220
actus-0.212
Token.
Token ID13
Feature activation-5.86e-01

Pos logit contributions
ww+0.452
lor+0.421
burn+0.421
 subs+0.401
actus+0.385
Neg logit contributions
years-0.377
 questions-0.374
ruits-0.355
year-0.342
nam-0.337
Token Do
Token ID2141
Feature activation-1.74e-01

Pos logit contributions
years+0.887
 questions+0.865
ruits+0.818
year+0.790
nam+0.789
Neg logit contributions
ww-1.101
lor-1.038
burn-1.026
 subs-0.975
 tou-0.949
Token you
Token ID345
Feature activation-3.69e+00

Pos logit contributions
years+0.369
 questions+0.363
ruits+0.352
year+0.352
nam+0.334
Neg logit contributions
ww-0.223
lor-0.199
burn-0.197
 subs-0.182
 tou-0.175
Token want
Token ID765
Feature activation-1.90e-01

Pos logit contributions
years+5.648
nam+5.079
year+4.735
 questions+4.438
 curiously+4.332
Neg logit contributions
ww-7.596
burn-7.197
 subs-6.797
 tou-6.629
 bottle-6.502
Token to
Token ID284
Feature activation-4.53e-01

Pos logit contributions
years+0.333
 questions+0.326
ruits+0.318
year+0.308
nam+0.302
Neg logit contributions
ww-0.309
burn-0.285
lor-0.283
 subs-0.268
 tou-0.256
Token help
Token ID1037
Feature activation-1.33e-01

Pos logit contributions
years+0.666
 questions+0.638
ruits+0.617
year+0.615
nam+0.577
Neg logit contributions
ww-0.871
burn-0.837
lor-0.809
 subs-0.785
 tou-0.767
Token me
Token ID502
Feature activation-8.97e-01

Pos logit contributions
years+0.199
 questions+0.194
ruits+0.186
year+0.182
nam+0.176
Neg logit contributions
ww-0.250
burn-0.236
lor-0.232
 subs-0.219
actus-0.215
Token?"
Token ID1701
Feature activation+8.75e-02

Pos logit contributions
years+1.350
 questions+1.263
ruits+1.238
year+1.225
adies+1.158
Neg logit contributions
ww-1.681
burn-1.670
lor-1.598
 subs-1.501
 chip-1.500
Token see
Token ID766
Feature activation-1.43e+00

Pos logit contributions
 questions+3.183
years+3.177
year+2.981
ruits+2.951
nam+2.769
Neg logit contributions
ww-3.979
 pel-3.730
burn-3.727
 tou-3.616
 subs-3.578
Token a
Token ID257
Feature activation+1.54e+00

Pos logit contributions
years+2.092
ruits+1.978
 questions+1.932
year+1.881
adies+1.761
Neg logit contributions
ww-2.791
burn-2.676
 chip-2.524
lor-2.521
 pel-2.480
Token big
Token ID1263
Feature activation+4.17e-01

Pos logit contributions
lor+2.595
ww+2.546
burn+2.468
 subs+2.350
ic+2.170
Neg logit contributions
 questions-2.554
years-2.434
nam-2.349
ruits-2.343
 three-2.330
Token box
Token ID3091
Feature activation+2.94e-01

Pos logit contributions
ww+0.653
lor+0.607
burn+0.603
 subs+0.570
actus+0.541
Neg logit contributions
 questions-0.748
years-0.748
ruits-0.714
year-0.691
nam-0.685
Token.
Token ID13
Feature activation-4.68e-01

Pos logit contributions
ww+0.558
lor+0.523
burn+0.522
 subs+0.498
actus+0.479
Neg logit contributions
years-0.432
 questions-0.431
ruits-0.407
year-0.391
nam-0.386
Token They
Token ID1119
Feature activation-3.69e+00

Pos logit contributions
years+0.755
 questions+0.737
ruits+0.684
year+0.683
nam+0.667
Neg logit contributions
ww-0.818
lor-0.766
burn-0.764
 subs-0.731
 chip-0.716
Token want
Token ID765
Feature activation-9.61e-01

Pos logit contributions
years+6.492
year+6.097
ruits+6.058
 questions+5.988
adies+5.621
Neg logit contributions
ww-6.211
 tou-5.988
 subs-5.653
 po-5.595
 pel-5.586
Token to
Token ID284
Feature activation-6.01e-01

Pos logit contributions
years+1.990
 questions+1.855
ruits+1.855
year+1.835
nam+1.798
Neg logit contributions
ww-1.250
burn-1.151
lor-1.131
 subs-1.090
 chip-1.053
Token know
Token ID760
Feature activation-2.24e+00

Pos logit contributions
years+0.764
 questions+0.734
ruits+0.683
year+0.672
nam+0.627
Neg logit contributions
ww-1.277
burn-1.207
lor-1.187
 subs-1.170
 tou-1.164
Token what
Token ID644
Feature activation+1.77e+00

Pos logit contributions
years+3.176
year+2.602
ruits+2.598
 questions+2.507
adies+2.437
Neg logit contributions
ww-4.511
burn-4.353
lor-4.338
 chip-4.266
 tou-4.224
Token is
Token ID318
Feature activation+1.08e-01

Pos logit contributions
ww+3.064
burn+3.009
lor+2.937
 subs+2.752
actus+2.689
Neg logit contributions
 questions-3.058
ruits-2.796
years-2.765
 three-2.697
year-2.657
Token box
Token ID3091
Feature activation-4.13e-02

Pos logit contributions
ww+0.455
lor+0.418
burn+0.418
 subs+0.393
actus+0.374
Neg logit contributions
years-0.560
 questions-0.558
ruits-0.534
year-0.519
nam-0.512
Token in
Token ID287
Feature activation+5.20e-02

Pos logit contributions
years+0.059
 questions+0.059
ruits+0.054
nam+0.052
year+0.052
Neg logit contributions
ww-0.080
burn-0.076
lor-0.075
 subs-0.072
 tou-0.070
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
ww+0.090
burn+0.084
lor+0.084
 subs+0.079
 tou+0.077
Neg logit contributions
years-0.086
 questions-0.084
ruits-0.081
year-0.078
nam-0.076
Token garage
Token ID15591
Feature activation-2.91e-01

Pos logit contributions
ww+2.134
lor+2.009
burn+2.000
 subs+1.930
actus+1.828
Neg logit contributions
 questions-1.331
years-1.310
ruits-1.223
nam-1.200
year-1.159
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
years+0.484
 questions+0.473
ruits+0.445
year+0.437
nam+0.432
Neg logit contributions
ww-0.492
lor-0.472
burn-0.464
 subs-0.443
actus-0.424
Token They
Token ID1119
Feature activation-3.67e+00

Pos logit contributions
ww+0.424
lor+0.398
burn+0.397
 subs+0.379
actus+0.365
Neg logit contributions
years-0.312
 questions-0.311
ruits-0.292
year-0.282
nam-0.277
Token open
Token ID1280
Feature activation-3.55e-01

Pos logit contributions
years+6.435
year+6.234
 questions+6.037
ruits+6.017
adies+5.820
Neg logit contributions
 tou-6.110
ww-5.937
 subs-5.509
 po-5.478
 bottle-5.427
Token the
Token ID262
Feature activation-3.63e-01

Pos logit contributions
years+0.677
 questions+0.663
ruits+0.641
year+0.628
nam+0.607
Neg logit contributions
ww-0.510
burn-0.474
lor-0.471
 subs-0.444
 tou-0.427
Token box
Token ID3091
Feature activation-6.45e-01

Pos logit contributions
years+0.782
 questions+0.756
ruits+0.738
year+0.726
nam+0.706
Neg logit contributions
ww-0.448
lor-0.417
burn-0.395
 subs-0.379
actus-0.365
Token and
Token ID290
Feature activation-2.77e+00

Pos logit contributions
years+0.963
 questions+0.929
ruits+0.868
nam+0.853
year+0.841
Neg logit contributions
ww-1.203
lor-1.154
burn-1.135
 subs-1.097
 chip-1.054
Token see
Token ID766
Feature activation-1.61e+00

Pos logit contributions
years+4.914
year+4.560
ruits+4.534
 questions+4.366
adies+3.945
Neg logit contributions
ww-4.382
 tou-4.302
 subs-4.105
 po-4.092
 disappears-4.050
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
ww+2.103
lor+1.977
burn+1.960
 subs+1.873
actus+1.818
Neg logit contributions
 questions-1.507
years-1.436
ruits-1.384
year-1.295
 whether-1.281
Token When
Token ID1649
Feature activation+1.70e+00

Pos logit contributions
ww+2.097
lor+1.945
burn+1.925
 subs+1.845
 tou+1.772
Neg logit contributions
 questions-1.855
years-1.811
ruits-1.781
nam-1.685
year-1.658
Token she
Token ID673
Feature activation+1.26e+00

Pos logit contributions
lor+3.260
ww+3.234
burn+2.914
actus+2.880
 subs+2.879
Neg logit contributions
 questions-2.555
years-2.438
nam-2.418
ruits-2.367
 three-2.298
Token took
Token ID1718
Feature activation-1.04e-01

Pos logit contributions
ww+2.331
lor+2.224
burn+2.172
 subs+2.007
actus+1.979
Neg logit contributions
years-1.932
 questions-1.914
ruits-1.799
year-1.765
 three-1.725
Token a
Token ID257
Feature activation+1.40e+00

Pos logit contributions
years+0.148
 questions+0.145
ruits+0.141
year+0.134
nam+0.131
Neg logit contributions
ww-0.202
burn-0.191
lor-0.187
 subs-0.181
 tou-0.174
Token sip
Token ID31145
Feature activation+2.18e+00

Pos logit contributions
ww+2.141
lor+2.104
burn+1.990
 subs+1.878
 tou+1.822
Neg logit contributions
 questions-2.589
years-2.474
ruits-2.434
 three-2.335
year-2.320
Token,
Token ID11
Feature activation+1.96e-01

Pos logit contributions
lor+5.405
ww+5.223
burn+5.110
 tou+4.905
 subs+4.879
Neg logit contributions
 questions-2.197
years-1.937
 three-1.901
ruits-1.831
year-1.779
Token she
Token ID673
Feature activation+9.49e-02

Pos logit contributions
ww+0.313
burn+0.290
lor+0.288
 subs+0.274
 tou+0.260
Neg logit contributions
years-0.346
 questions-0.345
ruits-0.330
year-0.319
nam-0.312
Token made
Token ID925
Feature activation+4.48e-01

Pos logit contributions
ww+0.168
burn+0.157
lor+0.156
 subs+0.150
actus+0.144
Neg logit contributions
years-0.151
 questions-0.150
ruits-0.143
year-0.138
nam-0.136
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
ww+1.014
lor+0.961
burn+0.956
 subs+0.922
actus+0.893
Neg logit contributions
 questions-0.496
years-0.496
ruits-0.457
year-0.435
nam-0.427
Token face
Token ID1986
Feature activation+1.52e-01

Pos logit contributions
ww+2.612
lor+2.578
burn+2.455
 subs+2.330
rob+2.276
Neg logit contributions
 questions-2.115
years-2.002
ruits-1.902
nam-1.878
 three-1.867
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
ww+0.867
lor+0.813
burn+0.809
 subs+0.769
actus+0.742
Neg logit contributions
 questions-0.712
years-0.707
ruits-0.670
year-0.647
nam-0.637
Token He
Token ID679
Feature activation+6.32e-01

Pos logit contributions
ww+0.582
lor+0.547
burn+0.545
 subs+0.522
actus+0.503
Neg logit contributions
years-0.409
 questions-0.406
ruits-0.384
year-0.368
nam-0.363
Token says
Token ID1139
Feature activation+1.87e-01

Pos logit contributions
ww+1.072
lor+1.003
burn+0.997
 subs+0.937
actus+0.894
Neg logit contributions
 questions-1.069
years-1.056
ruits-1.001
year-0.965
nam-0.959
Token,
Token ID11
Feature activation-3.18e-01

Pos logit contributions
ww+0.368
lor+0.344
burn+0.344
 subs+0.329
 tou+0.317
Neg logit contributions
years-0.265
 questions-0.262
ruits-0.249
year-0.239
nam-0.236
Token "
Token ID366
Feature activation+1.24e+00

Pos logit contributions
years+0.490
 questions+0.471
ruits+0.460
year+0.447
nam+0.439
Neg logit contributions
ww-0.578
lor-0.540
burn-0.540
 subs-0.521
 chip-0.500
TokenThis
Token ID1212
Feature activation+2.33e+00

Pos logit contributions
ww+1.956
lor+1.875
burn+1.838
 subs+1.730
actus+1.669
Neg logit contributions
years-2.219
 questions-2.203
ruits-2.061
year-2.027
nam-2.020
Token is
Token ID318
Feature activation+1.26e+00

Pos logit contributions
ww+3.959
burn+3.875
lor+3.865
 subs+3.610
actus+3.480
Neg logit contributions
 questions-3.969
ruits-3.586
 three-3.536
 want-3.515
 ten-3.485
Token my
Token ID616
Feature activation+1.34e-01

Pos logit contributions
lor+2.466
ww+2.462
burn+2.371
actus+2.259
 subs+2.235
Neg logit contributions
 questions-1.850
years-1.781
ruits-1.674
nam-1.657
 ten-1.588
Token bowl
Token ID9396
Feature activation+7.52e-01

Pos logit contributions
ww+0.202
burn+0.186
lor+0.184
 subs+0.175
actus+0.167
Neg logit contributions
years-0.251
 questions-0.247
ruits-0.237
year-0.233
nam-0.228
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
ww+1.394
lor+1.320
burn+1.304
 subs+1.247
 tou+1.200
Neg logit contributions
 questions-1.131
years-1.122
ruits-1.078
year-1.027
nam-1.004
Token Go
Token ID1514
Feature activation+7.63e-02

Pos logit contributions
ww+0.376
lor+0.350
burn+0.348
 subs+0.330
 tou+0.316
Neg logit contributions
years-0.351
 questions-0.350
ruits-0.331
year-0.321
nam-0.318
Token flowers
Token ID12734
Feature activation+1.29e+00

Pos logit contributions
ww+0.391
lor+0.361
burn+0.360
 subs+0.340
actus+0.325
Neg logit contributions
years-0.448
 questions-0.446
ruits-0.427
year-0.414
nam-0.409
Token.
Token ID13
Feature activation+2.92e-01

Pos logit contributions
ww+2.663
lor+2.452
burn+2.429
 subs+2.373
 tou+2.259
Neg logit contributions
 questions-1.752
years-1.721
ruits-1.642
year-1.564
nam-1.546
Token Lily
Token ID20037
Feature activation+9.92e-01

Pos logit contributions
ww+0.572
lor+0.537
burn+0.536
 subs+0.512
actus+0.493
Neg logit contributions
years-0.415
 questions-0.414
ruits-0.390
year-0.374
nam-0.369
Token was
Token ID373
Feature activation+2.23e-01

Pos logit contributions
ww+1.900
lor+1.779
burn+1.771
 subs+1.680
 tou+1.628
Neg logit contributions
years-1.460
 questions-1.446
ruits-1.356
year-1.301
 three-1.273
Token so
Token ID523
Feature activation-2.22e-02

Pos logit contributions
ww+0.457
burn+0.432
lor+0.431
 subs+0.413
 tou+0.398
Neg logit contributions
 questions-0.292
years-0.291
ruits-0.275
year-0.261
nam-0.257
Token happy
Token ID3772
Feature activation+2.06e+00

Pos logit contributions
 questions+0.023
years+0.022
ruits+0.021
year+0.019
nam+0.019
Neg logit contributions
ww-0.053
burn-0.050
lor-0.049
 subs-0.048
 tou-0.047
Token to
Token ID284
Feature activation+5.16e-01

Pos logit contributions
ww+4.435
lor+4.199
burn+4.123
 subs+3.921
rob+3.872
Neg logit contributions
 questions-2.832
years-2.587
 whether-2.502
 if-2.489
ruits-2.453
Token see
Token ID766
Feature activation+7.12e-01

Pos logit contributions
ww+1.106
lor+1.046
burn+1.041
 subs+0.999
actus+0.969
Neg logit contributions
 questions-0.635
years-0.633
ruits-0.592
year-0.563
nam-0.549
Token the
Token ID262
Feature activation+2.97e-01

Pos logit contributions
ww+1.381
lor+1.290
burn+1.273
 subs+1.225
actus+1.182
Neg logit contributions
 questions-1.060
years-1.045
ruits-0.980
nam-0.948
year-0.947
Token flowers
Token ID12734
Feature activation+8.17e-01

Pos logit contributions
ww+0.544
lor+0.508
burn+0.507
 subs+0.484
actus+0.465
Neg logit contributions
years-0.457
 questions-0.455
ruits-0.432
year-0.417
nam-0.411
Token and
Token ID290
Feature activation+3.76e-01

Pos logit contributions
ww+1.569
lor+1.460
burn+1.445
 subs+1.377
actus+1.330
Neg logit contributions
 questions-1.206
years-1.195
ruits-1.131
year-1.089
nam-1.075
Token idea
Token ID2126
Feature activation+2.45e-01

Pos logit contributions
years+2.771
nam+2.525
ruits+2.486
 questions+2.450
year+2.401
Neg logit contributions
ww-3.205
burn-2.870
 tou-2.801
 subs-2.751
lor-2.727
Token.
Token ID13
Feature activation+6.30e-01

Pos logit contributions
ww+0.445
lor+0.416
burn+0.415
 subs+0.396
 tou+0.379
Neg logit contributions
years-0.379
 questions-0.375
ruits-0.357
year-0.344
nam-0.341
Token
Token ID198
Feature activation+6.34e-01

Pos logit contributions
ww+1.164
burn+1.090
lor+1.090
 subs+1.034
actus+0.994
Neg logit contributions
 questions-0.973
years-0.965
ruits-0.922
year-0.886
nam-0.871
Token
Token ID198
Feature activation+6.82e-01

Pos logit contributions
ww+1.042
burn+0.973
lor+0.964
 subs+0.917
actus+0.871
Neg logit contributions
 questions-1.101
years-1.091
ruits-1.048
year-1.016
nam-0.985
Token"
Token ID1
Feature activation+9.57e-01

Pos logit contributions
ww+1.233
burn+1.172
lor+1.168
 subs+1.113
actus+1.069
Neg logit contributions
years-1.057
 questions-1.057
ruits-1.004
year-0.973
nam-0.945
TokenLet
Token ID5756
Feature activation+2.30e+00

Pos logit contributions
ww+1.717
lor+1.623
burn+1.596
 subs+1.518
actus+1.481
Neg logit contributions
 questions-1.546
years-1.524
ruits-1.466
year-1.417
nam-1.395
Token's
Token ID338
Feature activation-4.64e-01

Pos logit contributions
ww+4.780
lor+4.764
burn+4.591
 subs+4.490
rob+4.395
Neg logit contributions
 questions-3.102
 want-2.705
 three-2.704
ruits-2.643
adies-2.634
Token mail
Token ID6920
Feature activation+2.35e-01

Pos logit contributions
years+0.713
 questions+0.687
ruits+0.656
year+0.651
nam+0.609
Neg logit contributions
ww-0.872
burn-0.833
lor-0.813
 subs-0.780
 tou-0.773
Token ourselves
Token ID6731
Feature activation-9.29e-01

Pos logit contributions
ww+0.428
lor+0.400
burn+0.399
 subs+0.379
 tou+0.365
Neg logit contributions
years-0.365
 questions-0.361
ruits-0.344
year-0.332
nam-0.328
Token!"
Token ID2474
Feature activation+6.92e-01

Pos logit contributions
years+1.527
 questions+1.407
year+1.362
ruits+1.349
nam+1.305
Neg logit contributions
ww-1.683
burn-1.589
lor-1.578
 chip-1.456
 tou-1.456
Token Lily
Token ID20037
Feature activation-1.37e-01

Pos logit contributions
ww+1.249
lor+1.190
burn+1.180
 subs+1.110
actus+1.072
Neg logit contributions
 questions-1.099
years-1.083
ruits-1.032
nam-0.991
year-0.989
Token inside
Token ID2641
Feature activation+3.98e-03

Pos logit contributions
years+0.382
 questions+0.371
ruits+0.353
year+0.348
nam+0.339
Neg logit contributions
ww-0.390
lor-0.370
burn-0.368
 subs-0.348
actus-0.332
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
ww+0.007
lor+0.007
burn+0.007
 subs+0.006
 tou+0.006
Neg logit contributions
years-0.006
 questions-0.006
ruits-0.006
year-0.006
nam-0.006
Token church
Token ID4928
Feature activation+4.37e-01

Pos logit contributions
years+0.375
 questions+0.362
ruits+0.354
year+0.342
nam+0.334
Neg logit contributions
ww-0.386
lor-0.358
burn-0.356
 subs-0.338
actus-0.332
Token.
Token ID13
Feature activation+9.72e-01

Pos logit contributions
ww+0.803
lor+0.750
burn+0.749
 subs+0.713
actus+0.685
Neg logit contributions
 questions-0.665
years-0.664
ruits-0.629
year-0.605
nam-0.598
Token "
Token ID366
Feature activation+3.05e-01

Pos logit contributions
ww+1.873
burn+1.775
lor+1.736
 subs+1.670
actus+1.599
Neg logit contributions
 questions-1.419
years-1.418
ruits-1.342
year-1.294
nam-1.282
TokenDon
Token ID3987
Feature activation+2.75e+00

Pos logit contributions
ww+0.442
lor+0.405
burn+0.404
 subs+0.379
actus+0.359
Neg logit contributions
years-0.585
 questions-0.585
ruits-0.561
year-0.543
nam-0.538
Token't
Token ID470
Feature activation-3.32e-01

Pos logit contributions
ww+4.971
lor+4.684
burn+4.672
actus+4.607
rob+4.366
Neg logit contributions
 questions-4.226
 three-4.054
years-4.050
year-3.924
 One-3.870
Token worry
Token ID5490
Feature activation+2.92e-02

Pos logit contributions
years+0.353
 questions+0.338
ruits+0.320
year+0.317
nam+0.281
Neg logit contributions
ww-0.792
burn-0.750
lor-0.727
 subs-0.721
 tou-0.704
Token,
Token ID11
Feature activation+9.61e-02

Pos logit contributions
ww+0.062
burn+0.058
lor+0.058
 subs+0.055
 tou+0.053
Neg logit contributions
years-0.038
 questions-0.037
ruits-0.035
year-0.034
nam-0.033
Token we
Token ID356
Feature activation-1.14e+00

Pos logit contributions
ww+0.141
lor+0.129
burn+0.129
 subs+0.121
 tou+0.116
Neg logit contributions
years-0.184
 questions-0.182
ruits-0.175
year-0.170
nam-0.168
Token'll
Token ID1183
Feature activation+6.13e-01

Pos logit contributions
years+2.299
 questions+2.189
ruits+2.145
nam+2.123
year+2.118
Neg logit contributions
ww-1.607
burn-1.536
lor-1.483
 subs-1.453
 tou-1.386
Token
Token ID198
Feature activation-4.21e-01

Pos logit contributions
ww+0.388
lor+0.359
burn+0.356
 subs+0.336
actus+0.322
Neg logit contributions
years-0.454
 questions-0.451
ruits-0.432
year-0.419
nam-0.415
TokenAnna
Token ID31160
Feature activation+2.29e-01

Pos logit contributions
years+0.802
 questions+0.795
ruits+0.765
nam+0.737
year+0.736
Neg logit contributions
ww-0.639
lor-0.584
burn-0.556
 subs-0.516
actus-0.512
Token and
Token ID290
Feature activation+2.32e-01

Pos logit contributions
ww+0.480
lor+0.454
burn+0.452
 subs+0.434
actus+0.420
Neg logit contributions
years-0.289
 questions-0.287
ruits-0.270
year-0.258
nam-0.255
Token Ben
Token ID3932
Feature activation-7.02e-01

Pos logit contributions
ww+0.415
burn+0.388
lor+0.387
 subs+0.369
 tou+0.354
Neg logit contributions
years-0.366
 questions-0.363
ruits-0.347
year-0.336
nam-0.329
Token nodded
Token ID14464
Feature activation-8.84e-01

Pos logit contributions
years+1.118
 questions+1.097
ruits+1.072
year+1.022
adies+0.987
Neg logit contributions
ww-1.226
burn-1.166
lor-1.164
 subs-1.124
 tou-1.077
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
years+1.560
ruits+1.477
 questions+1.445
nam+1.402
year+1.391
Neg logit contributions
ww-1.389
burn-1.386
lor-1.303
 subs-1.292
 tou-1.228
Token They
Token ID1119
Feature activation-1.58e-01

Pos logit contributions
ww+2.112
burn+1.986
lor+1.984
 subs+1.899
actus+1.844
Neg logit contributions
 questions-1.351
years-1.288
ruits-1.255
year-1.181
nam-1.161
Token said
Token ID531
Feature activation-3.70e-01

Pos logit contributions
years+0.249
 questions+0.249
ruits+0.238
year+0.227
nam+0.222
Neg logit contributions
ww-0.280
burn-0.263
lor-0.260
 subs-0.252
 tou-0.243
Token they
Token ID484
Feature activation-1.58e+00

Pos logit contributions
years+0.694
 questions+0.667
ruits+0.653
year+0.634
nam+0.624
Neg logit contributions
ww-0.564
burn-0.520
lor-0.512
 subs-0.490
 tou-0.464
Token would
Token ID561
Feature activation-9.34e-01

Pos logit contributions
years+2.349
 questions+2.292
ruits+2.200
nam+2.142
adies+2.131
Neg logit contributions
burn-2.837
ww-2.825
 subs-2.721
lor-2.644
actus-2.628
Token share
Token ID2648
Feature activation-4.53e-01

Pos logit contributions
years+1.485
 questions+1.348
year+1.347
ruits+1.295
nam+1.266
Neg logit contributions
ww-1.711
burn-1.673
lor-1.598
 subs-1.549
 tou-1.533
TokenThank
Token ID10449
Feature activation+2.11e+00

Pos logit contributions
ww+2.292
lor+2.164
burn+2.099
 subs+2.023
actus+1.971
Neg logit contributions
years-3.190
 questions-2.996
year-2.984
ruits-2.930
nam-2.915
Token you
Token ID345
Feature activation+1.02e-01

Pos logit contributions
burn+3.490
ww+3.400
 subs+3.334
lor+3.318
actus+3.215
Neg logit contributions
years-3.756
 questions-3.665
year-3.356
adies-3.321
ruits-3.308
Token mom
Token ID1995
Feature activation-4.51e-01

Pos logit contributions
ww+0.206
burn+0.193
lor+0.193
 subs+0.184
actus+0.177
Neg logit contributions
years-0.138
 questions-0.137
ruits-0.130
year-0.124
nam-0.123
Tokenmy
Token ID1820
Feature activation-2.66e-01

Pos logit contributions
years+0.527
 questions+0.512
ruits+0.482
year+0.469
nam+0.460
Neg logit contributions
ww-1.000
burn-0.955
lor-0.932
 subs-0.885
 chip-0.881
Token,
Token ID11
Feature activation+7.23e-01

Pos logit contributions
years+0.429
 questions+0.420
ruits+0.404
year+0.393
nam+0.390
Neg logit contributions
ww-0.475
burn-0.445
lor-0.436
 subs-0.411
 chip-0.401
Token I
Token ID314
Feature activation+4.54e-01

Pos logit contributions
ww+1.438
lor+1.362
burn+1.351
 subs+1.298
actus+1.250
Neg logit contributions
 questions-1.007
years-0.992
ruits-0.939
year-0.898
nam-0.894
Token love
Token ID1842
Feature activation+2.08e+00

Pos logit contributions
ww+0.955
lor+0.903
burn+0.899
 subs+0.862
actus+0.834
Neg logit contributions
 questions-0.576
years-0.575
ruits-0.539
year-0.515
nam-0.502
Token it
Token ID340
Feature activation+1.88e-01

Pos logit contributions
lor+3.793
ww+3.767
burn+3.568
 subs+3.516
 tou+3.359
Neg logit contributions
 questions-3.447
years-3.397
 whether-3.063
 three-3.059
nam-2.989
Token so
Token ID523
Feature activation+3.06e-01

Pos logit contributions
ww+0.309
burn+0.285
lor+0.285
 subs+0.269
actus+0.257
Neg logit contributions
years-0.324
 questions-0.323
ruits-0.308
year-0.299
nam-0.295
Token much
Token ID881
Feature activation+9.48e-01

Pos logit contributions
ww+0.607
lor+0.569
burn+0.568
 subs+0.543
actus+0.524
Neg logit contributions
years-0.426
 questions-0.426
ruits-0.401
year-0.384
nam-0.379
Token!"
Token ID2474
Feature activation+1.28e+00

Pos logit contributions
ww+1.603
burn+1.524
lor+1.521
 subs+1.458
 tou+1.398
Neg logit contributions
 questions-1.576
years-1.526
ruits-1.491
year-1.424
adies-1.391
Token important
Token ID1593
Feature activation+8.57e-01

Pos logit contributions
years+0.464
 questions+0.443
ruits+0.433
year+0.418
nam+0.415
Neg logit contributions
ww-0.547
lor-0.510
burn-0.503
 subs-0.484
 tou-0.467
Token to
Token ID284
Feature activation+3.00e-01

Pos logit contributions
ww+1.510
burn+1.426
lor+1.416
 subs+1.341
actus+1.303
Neg logit contributions
 questions-1.422
years-1.372
ruits-1.313
year-1.271
nam-1.234
Token keep
Token ID1394
Feature activation+8.23e-01

Pos logit contributions
ww+0.620
lor+0.585
burn+0.583
 subs+0.559
 tou+0.539
Neg logit contributions
years-0.389
 questions-0.387
ruits-0.363
year-0.348
nam-0.342
Token your
Token ID534
Feature activation+1.48e-01

Pos logit contributions
ww+1.566
lor+1.471
burn+1.465
 subs+1.405
actus+1.359
Neg logit contributions
 questions-1.243
years-1.226
ruits-1.141
year-1.102
nam-1.101
Token promises
Token ID10497
Feature activation+7.49e-01

Pos logit contributions
ww+0.268
lor+0.250
burn+0.249
 subs+0.238
 tou+0.228
Neg logit contributions
years-0.232
 questions-0.228
ruits-0.219
year-0.212
nam-0.208
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
ww+1.550
lor+1.455
burn+1.454
 subs+1.398
actus+1.349
Neg logit contributions
 questions-0.981
years-0.958
ruits-0.913
year-0.859
nam-0.855
Token be
Token ID307
Feature activation+6.16e-01

Pos logit contributions
ww+2.287
lor+2.167
burn+2.163
 subs+2.062
actus+2.035
Neg logit contributions
 questions-1.486
years-1.420
ruits-1.344
nam-1.300
year-1.282
Token kind
Token ID1611
Feature activation+4.12e-01

Pos logit contributions
ww+1.264
lor+1.194
burn+1.189
 subs+1.139
actus+1.113
Neg logit contributions
years-0.818
 questions-0.816
ruits-0.755
year-0.735
nam-0.723
Token to
Token ID284
Feature activation+6.89e-01

Pos logit contributions
ww+0.775
lor+0.726
burn+0.725
 subs+0.691
actus+0.665
Neg logit contributions
 questions-0.611
years-0.611
ruits-0.576
year-0.555
nam-0.546
Token others
Token ID1854
Feature activation+3.29e-01

Pos logit contributions
ww+1.265
lor+1.185
burn+1.182
 subs+1.129
actus+1.088
Neg logit contributions
 questions-1.076
years-1.062
ruits-1.001
nam-0.962
year-0.962
Token -
Token ID532
Feature activation+3.36e-01

Pos logit contributions
ww+0.626
lor+0.587
burn+0.585
 subs+0.559
actus+0.537
Neg logit contributions
years-0.480
 questions-0.478
ruits-0.452
year-0.435
nam-0.428
Token fat
Token ID3735
Feature activation+5.73e-01

Pos logit contributions
years+0.639
 questions+0.636
ruits+0.610
year+0.576
nam+0.566
Neg logit contributions
ww-0.639
burn-0.572
lor-0.553
 subs-0.547
 tou-0.537
Token soap
Token ID19533
Feature activation+1.22e+00

Pos logit contributions
ww+0.740
lor+0.681
burn+0.672
 subs+0.632
 tou+0.582
Neg logit contributions
 questions-1.181
years-1.174
ruits-1.134
year-1.099
nam-1.088
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
ww+2.407
lor+2.253
burn+2.240
 subs+2.164
 tou+2.062
Neg logit contributions
 questions-1.686
years-1.668
ruits-1.603
year-1.516
 three-1.512
Token It
Token ID632
Feature activation+1.08e+00

Pos logit contributions
 questions+1.854
years+1.843
ruits+1.697
nam+1.652
year+1.644
Neg logit contributions
ww-2.031
burn-1.814
lor-1.803
 chip-1.764
 subs-1.752
Token was
Token ID373
Feature activation+3.29e-01

Pos logit contributions
ww+2.070
lor+1.989
burn+1.979
 subs+1.849
 tou+1.779
Neg logit contributions
years-1.568
 questions-1.547
ruits-1.450
year-1.434
nam-1.374
Token fun
Token ID1257
Feature activation+3.36e-01

Pos logit contributions
ww+0.629
lor+0.589
burn+0.588
 subs+0.561
actus+0.540
Neg logit contributions
years-0.483
 questions-0.482
ruits-0.456
year-0.438
nam-0.432
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
ww+0.592
lor+0.551
burn+0.550
 subs+0.523
actus+0.501
Neg logit contributions
years-0.545
 questions-0.543
ruits-0.516
year-0.498
nam-0.492
Token y
Token ID331
Feature activation+1.05e+00

Pos logit contributions
ww+1.965
lor+1.938
burn+1.881
 subs+1.792
actus+1.733
Neg logit contributions
years-1.673
 questions-1.660
ruits-1.566
nam-1.545
year-1.529
Tokenummy
Token ID13513
Feature activation+1.12e+00

Pos logit contributions
ww+1.452
lor+1.372
burn+1.353
 subs+1.273
 tou+1.217
Neg logit contributions
years-2.066
 questions-2.021
ruits-1.983
year-1.915
nam-1.875
Token and
Token ID290
Feature activation+1.41e+00

Pos logit contributions
ww+2.173
lor+2.112
burn+2.084
 subs+1.980
 tou+1.907
Neg logit contributions
 questions-1.603
years-1.543
ruits-1.481
year-1.410
 three-1.383
Token everyone
Token ID2506
Feature activation+2.73e-01

Pos logit contributions
lor+2.680
ww+2.650
burn+2.557
 subs+2.457
actus+2.386
Neg logit contributions
years-2.085
 questions-2.053
nam-1.962
ruits-1.935
year-1.885
Token looks
Token ID3073
Feature activation-9.77e-01

Pos logit contributions
ww+1.200
burn+1.120
lor+1.119
 subs+1.045
actus+1.017
Neg logit contributions
 questions-1.292
years-1.276
ruits-1.214
year-1.182
nam-1.158
Token like
Token ID588
Feature activation+5.72e-01

Pos logit contributions
years+1.249
ruits+1.211
 questions+1.185
year+1.107
nam+1.090
Neg logit contributions
ww-2.042
burn-1.885
 subs-1.855
lor-1.825
 tou-1.778
Token one
Token ID530
Feature activation+4.39e-01

Pos logit contributions
ww+1.238
lor+1.177
burn+1.172
 subs+1.122
actus+1.095
Neg logit contributions
 questions-0.687
years-0.683
ruits-0.632
year-0.605
nam-0.598
Token of
Token ID286
Feature activation+7.15e-01

Pos logit contributions
ww+0.778
lor+0.729
burn+0.726
 subs+0.690
actus+0.662
Neg logit contributions
 questions-0.698
years-0.697
ruits-0.661
year-0.639
nam-0.630
Token the
Token ID262
Feature activation-3.05e-02

Pos logit contributions
ww+1.340
lor+1.260
burn+1.254
 subs+1.198
actus+1.162
Neg logit contributions
 questions-1.084
years-1.078
ruits-1.014
year-0.977
nam-0.972
Token machines
Token ID8217
Feature activation+1.51e-01

Pos logit contributions
years+0.051
 questions+0.049
ruits+0.048
year+0.046
nam+0.045
Neg logit contributions
ww-0.053
burn-0.049
lor-0.048
 subs-0.047
 tou-0.045
Token has
Token ID468
Feature activation+3.14e-01

Pos logit contributions
ww+0.277
burn+0.258
lor+0.258
 subs+0.247
actus+0.236
Neg logit contributions
years-0.230
 questions-0.227
ruits-0.216
year-0.209
nam-0.205
Token gone
Token ID3750
Feature activation+5.25e-02

Pos logit contributions
ww+0.608
lor+0.571
burn+0.570
 subs+0.544
actus+0.523
Neg logit contributions
years-0.447
 questions-0.446
ruits-0.421
year-0.404
nam-0.399
Token on
Token ID319
Feature activation+3.93e-01

Pos logit contributions
ww+0.096
burn+0.089
lor+0.089
 subs+0.085
actus+0.081
Neg logit contributions
years-0.081
 questions-0.079
ruits-0.076
year-0.074
nam-0.072
Token fire
Token ID2046
Feature activation+6.86e-01

Pos logit contributions
ww+0.725
lor+0.678
burn+0.676
 subs+0.644
actus+0.619
Neg logit contributions
years-0.595
 questions-0.595
ruits-0.563
year-0.541
nam-0.535
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
ww+1.278
lor+1.190
burn+1.186
 subs+1.135
 tou+1.094
Neg logit contributions
 questions-1.035
years-1.023
ruits-0.977
year-0.931
nam-0.917
Token cry
Token ID3960
Feature activation-3.79e-01

Pos logit contributions
years+0.950
ruits+0.904
 questions+0.900
year+0.868
adies+0.825
Neg logit contributions
ww-1.102
burn-1.072
 subs-1.039
lor-1.020
 tou-0.992
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
years+0.633
 questions+0.613
ruits+0.605
nam+0.575
year+0.573
Neg logit contributions
ww-0.642
burn-0.610
lor-0.598
 subs-0.581
 tou-0.545
Token He
Token ID679
Feature activation-2.43e-01

Pos logit contributions
years+2.188
 questions+2.147
ruits+2.087
nam+2.044
year+2.002
Neg logit contributions
ww-2.207
lor-2.122
 subs-2.025
 chip-1.927
actus-1.899
Token yelled
Token ID22760
Feature activation-9.02e-01

Pos logit contributions
years+0.400
 questions+0.398
ruits+0.394
nam+0.371
year+0.368
Neg logit contributions
ww-0.415
lor-0.385
burn-0.379
 subs-0.373
 tou-0.354
Token and
Token ID290
Feature activation+7.62e-01

Pos logit contributions
years+1.406
ruits+1.353
 questions+1.319
nam+1.311
year+1.265
Neg logit contributions
ww-1.615
lor-1.502
burn-1.500
 subs-1.473
 chip-1.381
Token b
Token ID275
Feature activation-3.13e-01

Pos logit contributions
ww+1.470
lor+1.392
burn+1.381
 subs+1.297
actus+1.287
Neg logit contributions
years-1.115
 questions-1.113
ruits-1.024
nam-1.006
year-1.004
Tokenanged
Token ID5102
Feature activation-1.21e+00

Pos logit contributions
 questions+0.382
years+0.364
ruits+0.349
nam+0.343
year+0.328
Neg logit contributions
ww-0.683
burn-0.647
lor-0.639
 subs-0.618
actus-0.594
Token on
Token ID319
Feature activation-1.23e+00

Pos logit contributions
ruits+2.183
years+2.165
 questions+2.082
nam+2.016
year+1.976
Neg logit contributions
ww-1.869
burn-1.750
 subs-1.725
lor-1.673
 pel-1.581
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
years+2.375
ruits+2.307
 questions+2.199
year+2.191
nam+2.138
Neg logit contributions
ww-1.767
lor-1.624
burn-1.611
 subs-1.604
 chip-1.532
Token door
Token ID3420
Feature activation-6.50e-01

Pos logit contributions
years+2.497
ruits+2.408
 questions+2.326
year+2.297
nam+2.272
Neg logit contributions
ww-1.709
 subs-1.531
burn-1.530
lor-1.526
actus-1.523
Token,
Token ID11
Feature activation-3.43e-01

Pos logit contributions
years+0.991
 questions+0.963
ruits+0.950
nam+0.899
year+0.893
Neg logit contributions
ww-1.175
burn-1.124
lor-1.111
 subs-1.108
 chip-1.032
Token time
Token ID640
Feature activation-1.28e+00

Pos logit contributions
years+0.789
 questions+0.768
nam+0.732
ruits+0.728
year+0.722
Neg logit contributions
ww-0.715
burn-0.643
lor-0.639
 subs-0.607
actus-0.576
Token there
Token ID612
Feature activation+4.84e-01

Pos logit contributions
years+2.086
 questions+2.017
year+1.938
ruits+1.886
nam+1.878
Neg logit contributions
ww-2.360
lor-2.113
 subs-2.042
burn-2.039
 chip-1.928
Token was
Token ID373
Feature activation-8.89e-01

Pos logit contributions
ww+0.740
lor+0.683
burn+0.682
 subs+0.639
actus+0.609
Neg logit contributions
years-0.885
 questions-0.885
ruits-0.846
year-0.820
nam-0.809
Token an
Token ID281
Feature activation-2.04e+00

Pos logit contributions
 questions+1.169
years+1.146
ruits+1.094
nam+1.013
year+1.004
Neg logit contributions
ww-1.880
burn-1.760
lor-1.673
 subs-1.636
 chip-1.608
Token owner
Token ID4870
Feature activation-1.07e+00

Pos logit contributions
ruits+3.575
years+3.574
 questions+3.543
nam+3.331
year+3.296
Neg logit contributions
ww-3.365
 tou-2.957
burn-2.953
 chip-2.910
actus-2.887
Token who
Token ID508
Feature activation-7.54e-01

Pos logit contributions
years+1.568
 questions+1.522
ruits+1.449
nam+1.444
year+1.388
Neg logit contributions
ww-2.013
burn-1.949
lor-1.884
 chip-1.785
 tou-1.784
Token had
Token ID550
Feature activation+1.33e-01

Pos logit contributions
years+1.033
 questions+1.014
ruits+0.974
nam+0.925
adies+0.911
Neg logit contributions
ww-1.518
lor-1.401
burn-1.400
 subs-1.363
 tou-1.326
Token a
Token ID257
Feature activation+5.81e-02

Pos logit contributions
ww+0.303
burn+0.288
lor+0.287
 subs+0.276
 tou+0.268
Neg logit contributions
years-0.144
 questions-0.144
ruits-0.134
year-0.126
nam-0.122
Token fit
Token ID4197
Feature activation-9.05e-02

Pos logit contributions
ww+0.095
burn+0.087
lor+0.086
 subs+0.082
 tou+0.079
Neg logit contributions
years-0.102
 questions-0.100
ruits-0.097
year-0.094
nam-0.092
Token number
Token ID1271
Feature activation-8.67e-01

Pos logit contributions
years+0.141
 questions+0.139
ruits+0.133
year+0.128
nam+0.126
Neg logit contributions
ww-0.163
burn-0.153
lor-0.152
 subs-0.145
actus-0.139
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
years+1.419
 questions+1.376
ruits+1.347
nam+1.290
adies+1.281
Neg logit contributions
ww-1.473
lor-1.373
burn-1.362
 subs-1.297
 chip-1.263
Token friends
Token ID2460
Feature activation-4.95e-01

Pos logit contributions
years+0.071
 questions+0.070
ruits+0.067
year+0.065
nam+0.063
Neg logit contributions
ww-0.090
burn-0.083
lor-0.083
 subs-0.081
 tou-0.078
Token playing
Token ID2712
Feature activation-3.11e-01

Pos logit contributions
years+0.778
 questions+0.770
ruits+0.736
year+0.710
nam+0.709
Neg logit contributions
ww-0.874
burn-0.811
lor-0.805
 subs-0.786
 tou-0.747
Token together
Token ID1978
Feature activation-1.69e-02

Pos logit contributions
years+0.504
 questions+0.495
ruits+0.476
year+0.457
nam+0.446
Neg logit contributions
ww-0.537
lor-0.499
burn-0.495
 subs-0.473
 tou-0.462
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
years+0.027
 questions+0.026
ruits+0.025
year+0.025
nam+0.024
Neg logit contributions
ww-0.030
burn-0.028
lor-0.028
 subs-0.027
 tou-0.026
Token
Token ID220
Feature activation-2.71e-01

Pos logit contributions
years+2.185
 questions+2.058
ruits+2.011
nam+1.983
year+1.969
Neg logit contributions
ww-2.131
lor-2.040
burn-1.906
 subs-1.893
 chip-1.878
Token
Token ID198
Feature activation-6.16e-01

Pos logit contributions
years+0.526
 questions+0.511
ruits+0.488
year+0.483
nam+0.482
Neg logit contributions
ww-0.383
lor-0.363
burn-0.352
 subs-0.331
actus-0.314
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
years+1.234
 questions+1.208
ruits+1.177
nam+1.166
year+1.128
Neg logit contributions
ww-0.845
lor-0.788
burn-0.735
 subs-0.704
 chip-0.670
TokenM
Token ID44
Feature activation-3.89e-01

Pos logit contributions
 questions+2.619
years+2.604
ruits+2.593
nam+2.508
year+2.322
Neg logit contributions
ww-2.324
lor-2.108
burn-1.923
 subs-1.884
 chip-1.820
Tokenia
Token ID544
Feature activation-5.27e-01

Pos logit contributions
years+0.661
 questions+0.641
nam+0.629
ruits+0.611
year+0.596
Neg logit contributions
ww-0.668
lor-0.618
burn-0.599
 subs-0.584
actus-0.560
Token smiled
Token ID13541
Feature activation-6.40e-01

Pos logit contributions
years+0.807
 questions+0.798
ruits+0.772
nam+0.747
year+0.747
Neg logit contributions
ww-0.958
lor-0.891
burn-0.883
 subs-0.858
actus-0.828
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
years+0.924
ruits+0.897
 questions+0.883
year+0.824
nam+0.824
Neg logit contributions
ww-1.219
burn-1.165
lor-1.140
 subs-1.128
 tou-1.072
Token always
Token ID1464
Feature activation-2.18e-01

Pos logit contributions
ww+0.301
lor+0.284
burn+0.283
 subs+0.270
 tou+0.261
Neg logit contributions
years-0.200
 questions-0.196
ruits-0.185
year-0.179
nam-0.176
Token be
Token ID307
Feature activation+1.51e-01

Pos logit contributions
years+0.297
 questions+0.283
ruits+0.272
year+0.267
nam+0.257
Neg logit contributions
ww-0.441
burn-0.414
lor-0.413
 subs-0.396
 tou-0.386
Token respectful
Token ID25923
Feature activation-1.68e-01

Pos logit contributions
ww+0.318
burn+0.300
lor+0.299
 subs+0.288
 tou+0.278
Neg logit contributions
years-0.189
 questions-0.188
ruits-0.178
year-0.169
nam-0.165
Token,
Token ID11
Feature activation-4.17e-01

Pos logit contributions
years+0.250
 questions+0.243
ruits+0.233
nam+0.226
year+0.225
Neg logit contributions
ww-0.315
lor-0.295
burn-0.294
 subs-0.281
actus-0.267
Token appreci
Token ID5763
Feature activation+1.14e+00

Pos logit contributions
years+0.583
 questions+0.561
ruits+0.542
year+0.527
nam+0.510
Neg logit contributions
ww-0.822
burn-0.766
lor-0.758
 subs-0.737
 tou-0.714
Tokenative
Token ID876
Feature activation-1.41e-01

Pos logit contributions
ww+1.585
burn+1.478
lor+1.380
 subs+1.359
actus+1.297
Neg logit contributions
 questions-2.352
years-2.295
ruits-2.203
year-2.188
adies-2.108
Token and
Token ID290
Feature activation-1.08e-01

Pos logit contributions
years+0.215
 questions+0.209
ruits+0.201
nam+0.195
year+0.194
Neg logit contributions
ww-0.261
lor-0.244
burn-0.242
 subs-0.232
actus-0.221
Token supportive
Token ID16443
Feature activation+6.91e-01

Pos logit contributions
years+0.152
 questions+0.149
ruits+0.144
year+0.137
nam+0.133
Neg logit contributions
ww-0.211
burn-0.196
lor-0.196
 subs-0.189
 tou-0.182
Token of
Token ID286
Feature activation-6.77e-02

Pos logit contributions
ww+1.305
lor+1.228
burn+1.228
 subs+1.164
actus+1.128
Neg logit contributions
 questions-1.038
years-1.016
ruits-0.967
year-0.921
nam-0.903
Token those
Token ID883
Feature activation-1.11e+00

Pos logit contributions
years+0.105
 questions+0.103
ruits+0.099
year+0.095
nam+0.094
Neg logit contributions
ww-0.122
lor-0.114
burn-0.113
 subs-0.108
 tou-0.104
Token around
Token ID1088
Feature activation+3.28e-01

Pos logit contributions
years+1.687
ruits+1.581
 questions+1.541
year+1.530
adies+1.501
Neg logit contributions
ww-2.016
lor-1.855
burn-1.794
 subs-1.788
 tou-1.729
Token
Token ID198
Feature activation-6.98e-02

Pos logit contributions
ww+0.415
lor+0.385
burn+0.382
 subs+0.362
actus+0.345
Neg logit contributions
years-0.463
 questions-0.461
ruits-0.440
year-0.427
nam-0.422
Token
Token ID198
Feature activation-5.15e-01

Pos logit contributions
years+0.133
 questions+0.131
ruits+0.126
nam+0.123
year+0.122
Neg logit contributions
ww-0.102
lor-0.095
burn-0.092
 subs-0.087
actus-0.084
Token"
Token ID1
Feature activation+1.30e+00

Pos logit contributions
years+0.818
 questions+0.815
ruits+0.764
nam+0.741
year+0.733
Neg logit contributions
ww-0.956
lor-0.873
burn-0.837
 subs-0.805
actus-0.791
TokenBen
Token ID11696
Feature activation+1.44e+00

Pos logit contributions
ww+2.390
lor+2.306
burn+2.222
 subs+2.115
actus+2.104
Neg logit contributions
 questions-1.993
years-1.973
ruits-1.868
year-1.829
nam-1.807
Token,
Token ID11
Feature activation-3.19e-01

Pos logit contributions
ww+3.291
lor+3.166
burn+3.144
 subs+3.019
actus+2.913
Neg logit contributions
 questions-1.549
years-1.439
ruits-1.427
 want-1.287
adies-1.286
Token I
Token ID314
Feature activation-5.27e-01

Pos logit contributions
years+0.493
 questions+0.473
ruits+0.452
year+0.446
nam+0.435
Neg logit contributions
ww-0.599
burn-0.553
lor-0.552
 subs-0.529
 tou-0.515
Token want
Token ID765
Feature activation+7.23e-01

Pos logit contributions
years+0.660
 questions+0.624
nam+0.592
ruits+0.588
year+0.576
Neg logit contributions
ww-1.156
burn-1.089
lor-1.071
 subs-1.053
 tou-1.009
Token to
Token ID284
Feature activation-2.42e-02

Pos logit contributions
ww+1.286
lor+1.211
burn+1.196
 subs+1.143
actus+1.098
Neg logit contributions
 questions-1.171
years-1.148
ruits-1.092
nam-1.044
year-1.044
Token go
Token ID467
Feature activation-3.51e-01

Pos logit contributions
years+0.032
 questions+0.031
ruits+0.030
year+0.029
nam+0.028
Neg logit contributions
ww-0.050
burn-0.047
lor-0.047
 subs-0.045
 tou-0.044
Token back
Token ID736
Feature activation-6.08e-01

Pos logit contributions
years+0.639
 questions+0.622
ruits+0.609
year+0.590
nam+0.584
Neg logit contributions
ww-0.553
burn-0.526
lor-0.511
 subs-0.480
 tou-0.469
Token,"
Token ID553
Feature activation+5.55e-01

Pos logit contributions
years+1.035
 questions+0.992
ruits+0.973
year+0.949
nam+0.930
Neg logit contributions
ww-1.045
burn-0.983
lor-0.970
 subs-0.894
actus-0.891
Token climb
Token ID12080
Feature activation-1.10e-01

Pos logit contributions
years+1.097
 questions+1.024
year+0.985
ruits+0.968
nam+0.910
Neg logit contributions
ww-1.621
burn-1.538
lor-1.516
 tou-1.472
 subs-1.466
Token the
Token ID262
Feature activation+8.79e-01

Pos logit contributions
years+0.211
 questions+0.208
ruits+0.203
year+0.196
nam+0.194
Neg logit contributions
ww-0.158
burn-0.146
lor-0.144
 subs-0.135
actus-0.130
Token tree
Token ID5509
Feature activation-1.09e+00

Pos logit contributions
ww+1.097
lor+1.014
burn+1.011
 subs+0.930
actus+0.836
Neg logit contributions
 questions-1.865
years-1.864
ruits-1.785
nam-1.734
year-1.729
Token and
Token ID290
Feature activation-9.21e-01

Pos logit contributions
years+1.837
 questions+1.740
ruits+1.711
year+1.649
nam+1.640
Neg logit contributions
ww-1.826
burn-1.775
lor-1.755
 subs-1.662
 chip-1.634
Token pick
Token ID2298
Feature activation+7.94e-01

Pos logit contributions
years+1.295
 questions+1.233
ruits+1.217
year+1.171
adies+1.091
Neg logit contributions
ww-1.822
burn-1.719
lor-1.686
 subs-1.665
 tou-1.663
Token an
Token ID281
Feature activation-2.00e+00

Pos logit contributions
ww+1.508
burn+1.421
lor+1.420
 subs+1.358
 tou+1.287
Neg logit contributions
 questions-1.173
years-1.164
ruits-1.093
year-1.063
nam-1.037
Token olive
Token ID19450
Feature activation+3.23e-01

Pos logit contributions
years+3.940
 questions+3.916
ruits+3.842
adies+3.690
nam+3.671
Neg logit contributions
ww-2.895
 chip-2.457
 tou-2.428
burn-2.351
actus-2.339
Token just
Token ID655
Feature activation-9.78e-01

Pos logit contributions
ww+0.589
lor+0.550
burn+0.549
 subs+0.523
actus+0.502
Neg logit contributions
years-0.499
 questions-0.498
ruits-0.472
year-0.455
nam-0.449
Token for
Token ID329
Feature activation-5.32e-01

Pos logit contributions
years+1.433
 questions+1.337
ruits+1.314
year+1.271
nam+1.232
Neg logit contributions
ww-1.855
burn-1.761
lor-1.723
 subs-1.720
 tou-1.663
Token fun
Token ID1257
Feature activation+3.24e-01

Pos logit contributions
years+0.736
 questions+0.725
ruits+0.720
year+0.682
nam+0.663
Neg logit contributions
ww-1.046
lor-0.967
burn-0.966
 subs-0.937
 tou-0.905
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
ww+0.594
lor+0.556
burn+0.555
 subs+0.528
actus+0.507
Neg logit contributions
years-0.502
 questions-0.500
ruits-0.474
year-0.457
nam-0.451
Token a
Token ID257
Feature activation-1.03e-01

Pos logit contributions
 questions+1.084
years+1.055
ruits+1.013
year+0.927
nam+0.918
Neg logit contributions
ww-1.764
burn-1.642
lor-1.581
 subs-1.534
 chip-1.528
Token big
Token ID1263
Feature activation+5.42e-02

Pos logit contributions
years+0.152
 questions+0.150
ruits+0.143
year+0.137
nam+0.134
Neg logit contributions
ww-0.198
burn-0.183
lor-0.179
 subs-0.174
 tou-0.171
Token jungle
Token ID20712
Feature activation+6.84e-02

Pos logit contributions
ww+0.084
burn+0.077
lor+0.076
 subs+0.072
actus+0.070
Neg logit contributions
years-0.099
 questions-0.098
ruits-0.095
year-0.092
nam-0.091
Token with
Token ID351
Feature activation-6.18e-01

Pos logit contributions
ww+0.123
lor+0.114
burn+0.113
 subs+0.109
 tou+0.105
Neg logit contributions
years-0.108
 questions-0.107
ruits-0.102
year-0.098
nam-0.097
Token lots
Token ID6041
Feature activation-1.32e-01

Pos logit contributions
 questions+0.717
ruits+0.698
years+0.695
year+0.618
nam+0.612
Neg logit contributions
ww-1.384
lor-1.258
burn-1.253
 subs-1.225
 tou-1.212
Token of
Token ID286
Feature activation-1.99e+00

Pos logit contributions
years+0.210
 questions+0.207
ruits+0.199
nam+0.192
year+0.190
Neg logit contributions
ww-0.234
lor-0.214
burn-0.213
 subs-0.206
actus-0.201
Token trees
Token ID7150
Feature activation+4.18e-01

Pos logit contributions
years+3.939
ruits+3.849
nam+3.697
year+3.647
 questions+3.619
Neg logit contributions
ww-2.913
 tou-2.586
actus-2.394
 pel-2.385
 chip-2.363
Token and
Token ID290
Feature activation-2.20e+00

Pos logit contributions
ww+0.808
lor+0.758
burn+0.755
 subs+0.721
actus+0.694
Neg logit contributions
 questions-0.599
years-0.599
ruits-0.565
year-0.543
nam-0.535
Token animals
Token ID4695
Feature activation-5.04e-01

Pos logit contributions
ruits+4.436
years+4.368
 questions+4.249
year+4.081
illed+4.003
Neg logit contributions
ww-3.179
 tou-2.879
burn-2.720
 pel-2.714
 chip-2.691
Token.
Token ID13
Feature activation-2.30e+00

Pos logit contributions
years+0.853
 questions+0.825
ruits+0.800
year+0.773
nam+0.770
Neg logit contributions
ww-0.840
burn-0.789
lor-0.780
 subs-0.751
 tou-0.726
Token One
Token ID1881
Feature activation-5.10e-01

Pos logit contributions
years+3.918
 questions+3.600
year+3.516
ruits+3.400
adies+3.195
Neg logit contributions
ww-4.073
 chip-3.801
 Cathy-3.710
burn-3.668
 tou-3.635
Token had
Token ID550
Feature activation-9.20e-01

Pos logit contributions
 questions+2.820
years+2.805
ruits+2.717
nam+2.643
year+2.596
Neg logit contributions
ww-2.448
burn-2.234
 subs-2.183
lor-2.161
 tou-2.095
Token a
Token ID257
Feature activation+4.94e-01

Pos logit contributions
 questions+1.142
years+1.139
ruits+1.136
year+1.054
adies+0.996
Neg logit contributions
ww-1.949
burn-1.830
lor-1.816
 subs-1.754
 tou-1.692
Token lot
Token ID1256
Feature activation-5.39e-01

Pos logit contributions
ww+0.784
lor+0.731
burn+0.724
 subs+0.685
actus+0.651
Neg logit contributions
 questions-0.879
years-0.876
ruits-0.836
year-0.808
nam-0.805
Token of
Token ID286
Feature activation-2.20e+00

Pos logit contributions
years+0.891
 questions+0.869
ruits+0.838
nam+0.834
year+0.796
Neg logit contributions
ww-0.915
lor-0.845
burn-0.824
 subs-0.799
actus-0.770
Token cars
Token ID5006
Feature activation-1.40e-01

Pos logit contributions
years+4.224
ruits+3.982
adies+3.976
year+3.892
 whether+3.636
Neg logit contributions
ww-3.338
burn-2.855
 subs-2.786
 tou-2.746
 chip-2.746
Token,
Token ID11
Feature activation-2.36e+00

Pos logit contributions
years+0.192
 questions+0.189
ruits+0.179
year+0.173
nam+0.171
Neg logit contributions
ww-0.281
burn-0.264
lor-0.262
 subs-0.253
 tou-0.243
Token trucks
Token ID13960
Feature activation-1.24e-02

Pos logit contributions
years+3.555
year+3.478
ruits+3.218
adies+3.147
 questions+3.139
Neg logit contributions
burn-4.493
ww-4.376
 chip-4.308
 fle-4.073
 tou-4.055
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
years+0.016
 questions+0.016
ruits+0.015
year+0.015
nam+0.015
Neg logit contributions
ww-0.025
burn-0.024
lor-0.024
 subs-0.023
 tou-0.022
Token and
Token ID290
Feature activation-1.93e+00

Pos logit contributions
years+3.587
year+3.473
ruits+3.394
 questions+3.303
adies+3.104
Neg logit contributions
burn-3.594
ww-3.534
 chip-3.231
 tou-3.165
 subs-3.165
Token trains
Token ID13404
Feature activation+2.06e-02

Pos logit contributions
ruits+2.960
years+2.931
year+2.818
 questions+2.764
adies+2.731
Neg logit contributions
ww-3.580
burn-3.356
 chip-3.161
 subs-3.117
 tou-3.045
Token,
Token ID11
Feature activation-1.81e+00

Pos logit contributions
ww+0.038
burn+0.036
lor+0.036
 subs+0.034
 tou+0.033
Neg logit contributions
years-0.031
 questions-0.030
ruits-0.029
year-0.028
nam-0.028
Token<|endoftext|>
Token ID50256
Feature activation+1.24e+00

Pos logit contributions
ww+2.718
burn+2.555
lor+2.545
 subs+2.418
actus+2.319
Neg logit contributions
 questions-2.150
years-2.117
ruits-2.038
year-1.947
 whether-1.883
TokenOnce
Token ID7454
Feature activation-3.38e-01

Pos logit contributions
ww+2.256
burn+2.090
lor+2.024
 subs+1.967
 tou+1.871
Neg logit contributions
 questions-1.974
years-1.922
ruits-1.854
year-1.816
 three-1.749
Token upon
Token ID2402
Feature activation-1.47e+00

Pos logit contributions
years+0.230
 questions+0.215
ruits+0.188
year+0.179
nam+0.174
Neg logit contributions
ww-0.905
burn-0.864
lor-0.861
 subs-0.841
actus-0.826
Token a
Token ID257
Feature activation-5.38e-01

Pos logit contributions
years+2.179
 questions+2.103
ruits+2.054
year+1.854
nam+1.833
Neg logit contributions
ww-2.822
burn-2.688
lor-2.652
 tou-2.472
 subs-2.464
Token time
Token ID640
Feature activation-9.34e-01

Pos logit contributions
years+0.981
 questions+0.952
nam+0.908
year+0.897
ruits+0.896
Neg logit contributions
ww-0.876
burn-0.789
lor-0.780
 subs-0.742
actus-0.711
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
years+1.455
 questions+1.422
year+1.345
ruits+1.320
nam+1.294
Neg logit contributions
ww-1.766
lor-1.608
burn-1.574
 subs-1.550
 chip-1.461
Token there
Token ID612
Feature activation+7.95e-01

Pos logit contributions
 questions+3.587
years+3.579
ruits+3.239
racted+3.196
year+3.013
Neg logit contributions
ww-3.864
burn-3.384
 subs-3.252
lor-3.234
 tou-3.206
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
ww+1.279
burn+1.188
lor+1.184
 subs+1.105
actus+1.059
Neg logit contributions
 questions-1.395
years-1.389
ruits-1.340
year-1.287
nam-1.262
Token a
Token ID257
Feature activation-1.43e-01

Pos logit contributions
 questions+1.350
years+1.317
ruits+1.257
year+1.153
nam+1.144
Neg logit contributions
ww-2.105
burn-1.957
lor-1.868
 chip-1.813
 subs-1.809
Token little
Token ID1310
Feature activation+6.89e-01

Pos logit contributions
years+0.207
 questions+0.204
ruits+0.195
year+0.186
nam+0.181
Neg logit contributions
ww-0.280
burn-0.259
lor-0.254
 subs-0.247
 tou-0.243
Token girl
Token ID2576
Feature activation-1.12e-01

Pos logit contributions
ww+1.098
lor+1.058
burn+1.023
 subs+0.983
 tou+0.918
Neg logit contributions
 questions-1.201
years-1.198
ruits-1.141
year-1.106
nam-1.102
Token and
Token ID290
Feature activation-3.29e+00

Pos logit contributions
ww+1.052
lor+0.988
burn+0.983
 subs+0.940
actus+0.904
Neg logit contributions
 questions-0.762
years-0.755
ruits-0.719
year-0.685
nam-0.676
Token knock
Token ID10643
Feature activation-1.07e+00

Pos logit contributions
years+6.066
year+5.719
ruits+5.683
adies+5.290
illed+5.258
Neg logit contributions
ww-5.123
burn-5.084
 chip-5.036
 tou-4.881
 pel-4.787
Token them
Token ID606
Feature activation-6.28e-01

Pos logit contributions
years+2.124
 questions+1.995
ruits+1.967
year+1.926
nam+1.881
Neg logit contributions
ww-1.476
burn-1.436
lor-1.292
 subs-1.254
actus-1.238
Token down
Token ID866
Feature activation-1.17e+00

Pos logit contributions
years+1.224
 questions+1.163
ruits+1.148
year+1.117
nam+1.079
Neg logit contributions
ww-0.889
burn-0.853
 subs-0.811
lor-0.811
 tou-0.762
Token.
Token ID13
Feature activation-1.74e+00

Pos logit contributions
years+2.163
ruits+1.999
 questions+1.962
nam+1.907
year+1.891
Neg logit contributions
ww-1.787
burn-1.673
lor-1.641
 subs-1.628
 chip-1.531
Token They
Token ID1119
Feature activation-2.98e+00

Pos logit contributions
years+3.117
 questions+2.906
year+2.795
nam+2.792
ruits+2.593
Neg logit contributions
ww-2.877
burn-2.676
 chip-2.667
 pel-2.621
 subs-2.617
Token also
Token ID635
Feature activation-2.59e+00

Pos logit contributions
 questions+6.552
years+6.229
ruits+6.118
adies+6.117
year+5.922
Neg logit contributions
ww-3.362
 subs-3.306
 pel-3.229
burn-3.222
 tou-3.016
Token liked
Token ID8288
Feature activation-1.04e+00

Pos logit contributions
 questions+5.490
ruits+5.310
years+5.284
adies+5.165
 curiously+5.025
Neg logit contributions
ww-3.090
burn-2.877
 pel-2.852
 subs-2.852
rob-2.694
Token to
Token ID284
Feature activation-2.56e+00

Pos logit contributions
years+2.067
ruits+2.011
 questions+1.989
year+1.954
adies+1.934
Neg logit contributions
ww-1.351
burn-1.257
 subs-1.196
 chip-1.167
 pel-1.141
Token pretend
Token ID16614
Feature activation+4.65e-01

Pos logit contributions
years+4.984
year+4.648
ruits+4.397
 questions+4.347
adies+4.120
Neg logit contributions
ww-3.838
burn-3.750
 chip-3.647
 pel-3.607
 subs-3.591
Token they
Token ID484
Feature activation-1.24e+00

Pos logit contributions
ww+0.692
lor+0.637
burn+0.635
 subs+0.595
actus+0.567
Neg logit contributions
 questions-0.878
years-0.875
ruits-0.838
year-0.812
nam-0.805