TOP ACTIVATIONS
MAX = 2.628

redundant ) get out of bed ready to p ounce .
clock , leap out of bed , dress , force -
at the B DP + Bott leneck Buffer Size point (
of exploiting it . <|endoftext|> Bed side is a series about
need to get you into bed ." And then I realize
ki and Stella were completely soaked . There ,
underdog status , thus ing r ati ating yourself with fans
photo journal ist Sean O ' Connell ( played by Sean
first met , and O ' Neill attempted to advise Carter
of returning them to their bed and the sexual side of
before 8 : 30 a . m . threatens children 's
alone . O ' Brian is as fascinated by
vow to meet in the Sword - Art Festival finals .
com and V odka O ' Clock Podcast . Author of
Ha ! Fuck off ! he
undrum . Within a few hours we received a definitive answer
stories behind him , O ' Brian accepted an offer from
want to get you into bed , there are no words
to a 2012 Pew Research Center survey of Hispanics , just
- W restling - C ou ples Cherry D arts 200

MIN ACTIVATIONS
MIN = -2.557

of protest . <|endoftext|> Last Chance to See the Small ,
LLC . Click here for reprint permission . <|endoftext|> The S
LLC . Click here for reprint permission . <|endoftext|> While we
4 garlic cloves and the ginger . Cut the white onion
Em or Hang Em . P aying
Email This Story Send email to this address Enter Your
jc ' OR RIGHT ( path , 5 ) = '.
's been dumped on Maggie Rita 's across the street ,
L ared o Sen . Judith Z aff ir ini ,
aren t as forgiving . Most chefs won
the onion , garlic and ginger . 5 .
js ' OR RIGHT ( path , 10 ) = '
. E arl Morse K Z 8 E suggest
j ' OR RIGHT ( path , 5 ) = '.
He discussed two options with the family at this
right field that scored Michael Morse to tie and Ian Desmond
An 1888 photo of with her teacher ,, has been
Now we cut the bell peppers into small - medium sized
Grant ing retro active immunity to companies that went
think ... <|endoftext|> What could have been : Darren Ar on

INTERVAL 1.332 - 2.628
CONTAINS 0.263%

little bit like a spinning cup at Disneyland ),
logged over 6 500 n autical reactor - years to 2015
daughter to school . In one late - night podcast interview
Orlando gay bar he would later soak in blood .
emissions , so poll uters typically pay none of these costs

INTERVAL 0.035 - 1.332
CONTAINS 46.642%

the allegation that Brah ms slaughtered fel ines , transc ribed
might reassure Americans afraid of Trump that there s
or your setup is wrong . I have been in the
how do you keep an even ke el ?
men . Then I ll say ,

INTERVAL -1.261 - 0.035
CONTAINS 52.699%

Sea premiered was reminiscent of when Boy
, on average , simply to determine whether to accept a
In later desperation , I also gave a link to
could work for every conceivable type of company . How do
oses over the entire palm opl ant ar surfaces , beginning

INTERVAL -2.557 - -1.261
CONTAINS 0.397%

f ); RAW _ ASS ERT ( it -> vec ()
Un iting Care told the ABC demand for financial help had
indicated doubts about the child s paternity .
value per play ), and fourth in running back success rate
. Advertisement Continue reading the main story
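
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX: (2.628 - (-2.557)) / 4 is about 1.296, which matches the displayed edges to within rounding. The sketch below reproduces those edges and shows one way a CONTAINS percentage could be computed. The function names `interval_edges` and `bin_fractions` are hypothetical and not taken from the dashboard, and the binning rule is an assumption inferred from the numbers shown.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins)]
    edges.append(hi)  # pin the last edge to hi to avoid float drift
    return edges


def bin_fractions(values, edges):
    """Fraction of values falling in each bin.

    Bins are half-open [lower, upper), except the last, which also
    includes its upper edge. Values outside the edges are not counted
    in any bin but still count toward the total.
    """
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break
    return [c / len(values) for c in counts]


# MIN and MAX as displayed in the dump (already rounded to 3 decimals).
edges = interval_edges(-2.557, 2.628)
for lower, upper in zip(edges, edges[1:]):
    print(f"INTERVAL {lower:.3f} - {upper:.3f}")
```

Because the displayed MIN and MAX are themselves rounded, the recomputed edges can differ from the dump in the last decimal place.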
Token redundant
Token ID30806
Feature activation+9.02e-02

Pos logit contributions
theless+0.776
env+0.652
lis+0.603
terday+0.602
 Haas+0.592
Neg logit contributions
hower-0.930
pees-0.905
-0.841
ombat-0.841
pee-0.827
Token)
Token ID8
Feature activation+4.84e-01

Pos logit contributions
hower+0.349
pees+0.341
pee+0.314
+0.313
ombat+0.313
Neg logit contributions
theless-0.264
env-0.223
terday-0.201
lis-0.199
 Seym-0.197
Token get
Token ID651
Feature activation+5.59e-01

Pos logit contributions
hower+1.708
pees+1.672
+1.529
pee+1.524
ombat+1.521
Neg logit contributions
theless-1.603
env-1.358
lis-1.237
terday-1.236
 Seym-1.225
Token out
Token ID503
Feature activation+5.92e-01

Pos logit contributions
hower+2.049
pees+1.999
+1.837
pee+1.833
ombat+1.828
Neg logit contributions
theless-1.765
env-1.505
lis-1.352
terday-1.346
 Haas-1.344
Token of
Token ID286
Feature activation+4.89e-01

Pos logit contributions
hower+2.006
pees+1.953
pee+1.786
ombat+1.781
+1.775
Neg logit contributions
theless-2.018
env-1.744
lis-1.607
terday-1.587
 Seym-1.583
Token bed
Token ID3996
Feature activation+2.63e+00

Pos logit contributions
hower+1.779
pees+1.740
pee+1.593
ombat+1.590
+1.580
Neg logit contributions
theless-1.549
env-1.325
 Haas-1.207
terday-1.197
 Seym-1.196
Token ready
Token ID3492
Feature activation+6.91e-01

Pos logit contributions
hower+8.362
pees+8.118
pee+7.559
legate+7.234
apor+7.167
Neg logit contributions
theless-8.744
env-7.394
 Seym-7.104
 Haas-6.812
terday-6.795
Token to
Token ID284
Feature activation+3.97e-01

Pos logit contributions
hower+2.653
pees+2.604
pee+2.412
ombat+2.388
legate+2.375
Neg logit contributions
theless-2.044
env-1.696
 Seym-1.536
terday-1.536
lis-1.517
Token p
Token ID279
Feature activation+3.52e-01

Pos logit contributions
hower+1.420
pees+1.387
+1.274
ombat+1.262
pee+1.261
Neg logit contributions
theless-1.291
env-1.091
terday-0.995
lis-0.993
 Haas-0.980
Tokenounce
Token ID8652
Feature activation+2.40e-01

Pos logit contributions
hower+1.212
pees+1.189
+1.096
pee+1.075
ombat+1.068
Neg logit contributions
theless-1.165
env-1.012
lis-0.926
terday-0.917
 Haas-0.901
Token.
Token ID13
Feature activation+5.52e-01

Pos logit contributions
hower+0.832
pees+0.824
ombat+0.751
pee+0.750
+0.750
Neg logit contributions
theless-0.792
env-0.675
terday-0.628
lis-0.622
 Seym-0.616
Token clock
Token ID8801
Feature activation-8.03e-01

Pos logit contributions
theless+1.009
env+0.841
terday+0.774
lis+0.754
 Haas+0.748
Neg logit contributions
pees-1.424
hower-1.407
ombat-1.315
-1.295
pee-1.292
Token,
Token ID11
Feature activation-4.36e-02

Pos logit contributions
theless+2.465
env+2.070
terday+1.943
lis+1.883
 Haas+1.877
Neg logit contributions
pees-2.892
hower-2.866
ombat-2.658
-2.631
pee-2.598
Token leap
Token ID16470
Feature activation+2.49e-01

Pos logit contributions
theless+0.139
env+0.118
 Haas+0.108
terday+0.108
lis+0.107
Neg logit contributions
hower-0.157
pees-0.154
ombat-0.141
-0.140
pee-0.140
Token out
Token ID503
Feature activation+5.16e-01

Pos logit contributions
hower+0.906
pees+0.892
ombat+0.818
pee+0.815
+0.810
Neg logit contributions
theless-0.786
env-0.663
terday-0.608
lis-0.601
ylum-0.595
Token of
Token ID286
Feature activation+5.41e-01

Pos logit contributions
hower+1.679
pees+1.646
pee+1.491
ombat+1.487
+1.472
Neg logit contributions
theless-1.834
env-1.582
terday-1.460
 Seym-1.453
lis-1.452
Token bed
Token ID3996
Feature activation+2.42e+00

Pos logit contributions
hower+1.636
pees+1.607
ombat+1.447
pee+1.442
+1.433
Neg logit contributions
theless-2.027
env-1.777
 Haas-1.640
lis-1.635
 Seym-1.626
Token,
Token ID11
Feature activation+4.22e-01

Pos logit contributions
hower+7.334
pees+7.182
pee+6.649
legate+6.289
+6.242
Neg logit contributions
theless-8.490
env-7.339
 Seym-6.985
terday-6.806
ylum-6.789
Token dress
Token ID6576
Feature activation+9.23e-01

Pos logit contributions
hower+1.542
pees+1.494
ombat+1.373
+1.372
pee+1.358
Neg logit contributions
theless-1.339
env-1.142
terday-1.037
lis-1.033
 Haas-1.030
Token,
Token ID11
Feature activation+2.12e-02

Pos logit contributions
hower+3.335
pees+3.262
pee+2.998
ombat+2.983
+2.962
Neg logit contributions
theless-2.947
env-2.491
terday-2.275
 Seym-2.266
lis-2.254
Token force
Token ID2700
Feature activation-2.25e-01

Pos logit contributions
hower+0.077
pees+0.074
ombat+0.068
+0.068
legate+0.067
Neg logit contributions
theless-0.066
env-0.058
 Haas-0.052
lis-0.052
terday-0.052
Token-
Token ID12
Feature activation-4.27e-01

Pos logit contributions
theless+0.657
env+0.564
lis+0.507
 Haas+0.504
terday+0.503
Neg logit contributions
hower-0.860
pees-0.831
ombat-0.774
-0.760
legate-0.757
Token at
Token ID379
Feature activation-1.20e-01

Pos logit contributions
hower+1.498
pees+1.466
pee+1.342
ombat+1.327
+1.327
Neg logit contributions
theless-1.392
env-1.186
terday-1.084
 Seym-1.079
lis-1.077
Token the
Token ID262
Feature activation-3.85e-01

Pos logit contributions
theless+0.332
env+0.277
 Haas+0.244
terday+0.243
 Seym+0.243
Neg logit contributions
hower-0.486
pees-0.476
pee-0.441
-0.433
ombat-0.432
Token B
Token ID347
Feature activation+3.52e-01

Pos logit contributions
theless+1.182
env+1.007
 Haas+0.898
terday+0.876
 Seym+0.874
Neg logit contributions
hower-1.465
pees-1.447
pee-1.317
-1.286
ombat-1.284
TokenDP
Token ID6322
Feature activation+1.87e-01

Pos logit contributions
hower+1.119
pees+1.094
pee+0.993
ombat+0.983
+0.977
Neg logit contributions
theless-1.274
env-1.100
terday-1.015
 Seym-1.013
lis-1.010
Token +
Token ID1343
Feature activation+2.28e-01

Pos logit contributions
hower+0.667
pees+0.655
pee+0.598
+0.590
ombat+0.589
Neg logit contributions
theless-0.612
env-0.526
lis-0.472
 Seym-0.472
terday-0.469
Token Bott
Token ID14835
Feature activation+2.41e+00

Pos logit contributions
hower+0.832
pees+0.816
pee+0.746
ombat+0.734
legate+0.733
Neg logit contributions
theless-0.733
env-0.626
terday-0.561
 Seym-0.559
 Haas-0.556
Tokenleneck
Token ID43163
Feature activation+8.53e-01

Pos logit contributions
hower+6.982
pees+6.797
legate+6.384
pee+6.268
ombat+6.223
Neg logit contributions
theless-8.931
env-7.549
 Seym-7.402
terday-7.225
 Haas-7.121
Token Buffer
Token ID47017
Feature activation+1.82e-01

Pos logit contributions
hower+2.985
pees+2.922
pee+2.682
ombat+2.658
+2.642
Neg logit contributions
theless-2.816
env-2.391
terday-2.193
 Seym-2.189
lis-2.174
Token Size
Token ID12849
Feature activation+9.97e-01

Pos logit contributions
hower+0.586
pees+0.575
pee+0.518
+0.510
ombat+0.510
Neg logit contributions
theless-0.654
env-0.573
 Haas-0.522
 Seym-0.518
lis-0.518
Token point
Token ID966
Feature activation-2.49e-03

Pos logit contributions
hower+3.171
pees+3.100
pee+2.817
ombat+2.789
+2.771
Neg logit contributions
theless-3.606
env-3.112
terday-2.878
 Seym-2.873
lis-2.856
Token (
Token ID357
Feature activation+5.01e-01

Pos logit contributions
theless+0.008
env+0.007
lis+0.007
terday+0.006
 Haas+0.006
Neg logit contributions
hower-0.009
pees-0.009
pee-0.008
ombat-0.008
-0.008
Token of
Token ID286
Feature activation-2.86e-01

Pos logit contributions
theless+3.534
env+2.989
terday+2.857
lis+2.768
 Seym+2.737
Neg logit contributions
hower-3.459
pees-3.452
pee-3.139
ombat-3.075
-2.999
Token exploiting
Token ID29440
Feature activation+1.21e-01

Pos logit contributions
theless+0.973
env+0.830
terday+0.762
 Manson+0.761
 Haas+0.758
Neg logit contributions
hower-0.974
pees-0.962
-0.877
ombat-0.876
pee-0.857
Token it
Token ID340
Feature activation+9.15e-01

Pos logit contributions
hower+0.475
pees+0.465
pee+0.429
ombat+0.429
+0.424
Neg logit contributions
theless-0.352
env-0.293
terday-0.265
 Haas-0.262
 Seym-0.262
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
hower+3.274
pees+3.206
pee+2.955
ombat+2.928
legate+2.910
Neg logit contributions
theless-2.929
env-2.475
 Seym-2.267
terday-2.263
lis-2.244
Token<|endoftext|>
Token ID50256
Feature activation+3.41e-01

Pos logit contributions
hower+0.470
pees+0.465
pee+0.425
ombat+0.419
+0.416
Neg logit contributions
theless-0.405
env-0.349
 Haas-0.322
terday-0.314
 Manson-0.314
TokenBed
Token ID45896
Feature activation+2.35e+00

Pos logit contributions
hower+1.168
pees+1.128
ombat+1.026
pee+1.023
+1.021
Neg logit contributions
theless-1.154
env-1.016
lis-0.928
terday-0.916
 Haas-0.899
Tokenside
Token ID1589
Feature activation+1.47e-01

Pos logit contributions
hower+7.757
pees+7.730
pee+7.143
legate+6.936
ombat+6.859
Neg logit contributions
theless-7.869
env-6.448
terday-6.156
 Seym-6.094
ylum-5.985
Token is
Token ID318
Feature activation-7.93e-04

Pos logit contributions
hower+0.503
pees+0.491
ombat+0.452
+0.448
pee+0.445
Neg logit contributions
theless-0.497
env-0.424
lis-0.391
 Haas-0.391
 Seym-0.386
Token a
Token ID257
Feature activation-5.01e-01

Pos logit contributions
theless+0.002
env+0.002
lis+0.002
terday+0.002
 Haas+0.002
Neg logit contributions
hower-0.003
pees-0.003
-0.003
ombat-0.003
pee-0.003
Token series
Token ID2168
Feature activation-5.49e-01

Pos logit contributions
theless+1.574
env+1.339
 Haas+1.208
lis+1.199
terday+1.191
Neg logit contributions
hower-1.816
pees-1.789
-1.657
ombat-1.635
pee-1.627
Token about
Token ID546
Feature activation-5.57e-01

Pos logit contributions
theless+1.777
env+1.562
lis+1.440
ylum+1.412
terday+1.410
Neg logit contributions
pees-1.880
hower-1.870
-1.736
pee-1.700
ombat-1.663
Token need
Token ID761
Feature activation+5.35e-02

Pos logit contributions
theless+0.875
env+0.749
lis+0.678
terday+0.674
 Haas+0.673
Neg logit contributions
hower-1.008
pees-0.974
-0.911
ombat-0.896
pee-0.882
Token to
Token ID284
Feature activation+6.48e-01

Pos logit contributions
hower+0.211
pees+0.206
ombat+0.192
pee+0.191
+0.191
Neg logit contributions
theless-0.152
env-0.127
terday-0.114
lis-0.114
 Seym-0.112
Token get
Token ID651
Feature activation+7.94e-01

Pos logit contributions
hower+2.186
pees+2.120
ombat+1.926
+1.925
pee+1.922
Neg logit contributions
theless-2.239
env-1.931
terday-1.771
lis-1.769
 Haas-1.746
Token you
Token ID345
Feature activation-6.12e-02

Pos logit contributions
hower+2.763
pees+2.709
pee+2.486
ombat+2.461
legate+2.447
Neg logit contributions
theless-2.628
env-2.231
terday-2.050
 Seym-2.048
lis-2.031
Token into
Token ID656
Feature activation+4.03e-01

Pos logit contributions
theless+0.190
env+0.159
 Haas+0.143
terday+0.142
lis+0.141
Neg logit contributions
hower-0.233
pees-0.224
-0.211
ombat-0.209
pee-0.205
Token bed
Token ID3996
Feature activation+2.23e+00

Pos logit contributions
hower+1.183
pees+1.139
ombat+1.027
+1.026
pee+1.022
Neg logit contributions
theless-1.569
env-1.375
terday-1.279
 Seym-1.276
lis-1.271
Token."
Token ID526
Feature activation+2.82e-01

Pos logit contributions
hower+6.738
pees+6.646
pee+6.183
legate+5.920
 Oo+5.772
Neg logit contributions
theless-7.862
env-6.616
 Seym-6.349
terday-6.225
ylum-6.129
Token And
Token ID843
Feature activation-3.12e-01

Pos logit contributions
hower+1.053
pees+1.037
ombat+0.960
pee+0.952
+0.944
Neg logit contributions
theless-0.861
env-0.735
lis-0.664
 Haas-0.660
terday-0.659
Token then
Token ID788
Feature activation+3.94e-01

Pos logit contributions
theless+0.987
env+0.843
lis+0.769
 Haas+0.757
 Seym+0.751
Neg logit contributions
hower-1.139
pees-1.116
ombat-1.053
-1.031
pee-1.016
Token I
Token ID314
Feature activation-2.95e-01

Pos logit contributions
hower+1.390
pees+1.362
ombat+1.257
pee+1.245
+1.240
Neg logit contributions
theless-1.295
env-1.109
terday-1.014
lis-1.012
 Seym-1.002
Token realize
Token ID6537
Feature activation+8.49e-01

Pos logit contributions
theless+0.918
env+0.790
lis+0.707
terday+0.698
 Haas+0.698
Neg logit contributions
hower-1.092
pees-1.058
ombat-0.984
-0.983
pee-0.954
Tokenki
Token ID4106
Feature activation+4.33e-01

Pos logit contributions
theless+1.171
env+1.065
lis+0.995
sov+0.918
ylum+0.907
Neg logit contributions
hower-1.629
pees-1.565
-1.519
ombat-1.464
 Dimension-1.448
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
hower+1.513
pees+1.477
pee+1.357
ombat+1.348
legate+1.336
Neg logit contributions
theless-1.432
env-1.218
 Seym-1.117
terday-1.114
lis-1.108
Token Stella
Token ID45856
Feature activation+5.38e-02

Pos logit contributions
theless+0.344
env+0.290
 Seym+0.271
 Haas+0.269
terday+0.268
Neg logit contributions
hower-0.377
pees-0.368
ombat-0.340
pee-0.338
-0.333
Token were
Token ID547
Feature activation+6.22e-01

Pos logit contributions
hower+0.210
pees+0.205
ombat+0.189
pee+0.189
+0.185
Neg logit contributions
theless-0.159
env-0.132
 Haas-0.121
terday-0.120
lis-0.118
Token completely
Token ID3190
Feature activation+5.19e-01

Pos logit contributions
hower+2.174
pees+2.080
+1.928
ombat+1.920
pee+1.898
Neg logit contributions
theless-2.117
env-1.819
 Haas-1.654
lis-1.647
terday-1.629
Token soaked
Token ID33657
Feature activation+2.22e+00

Pos logit contributions
hower+1.780
pees+1.708
ombat+1.572
+1.570
pee+1.554
Neg logit contributions
theless-1.794
env-1.542
 Haas-1.402
lis-1.388
terday-1.387
Token.
Token ID13
Feature activation+6.36e-01

Pos logit contributions
hower+6.681
pees+6.599
pee+6.045
legate+5.830
ombat+5.827
Neg logit contributions
theless-7.865
env-6.817
 Seym-6.415
lis-6.295
terday-6.286
Token\n
Token ID198
Feature activation+1.99e-01

Pos logit contributions
hower+2.334
pees+2.291
pee+2.102
ombat+2.097
legate+2.079
Neg logit contributions
theless-2.009
env-1.707
 Haas-1.552
lis-1.547
terday-1.547
Token\n
Token ID198
Feature activation+2.34e-01

Pos logit contributions
hower+0.757
pees+0.741
pee+0.680
ombat+0.678
+0.674
Neg logit contributions
theless-0.591
env-0.508
lis-0.455
terday-0.450
 Haas-0.448
TokenThere
Token ID1858
Feature activation-5.41e-02

Pos logit contributions
hower+0.850
pees+0.818
ombat+0.755
pee+0.745
legate+0.737
Neg logit contributions
theless-0.746
env-0.653
lis-0.593
terday-0.581
 Haas-0.573
Token,
Token ID11
Feature activation+4.54e-01

Pos logit contributions
theless+0.148
env+0.120
terday+0.109
lis+0.107
 Haas+0.105
Neg logit contributions
hower-0.222
pees-0.218
ombat-0.202
pee-0.202
-0.200
Token underdog
Token ID44288
Feature activation+3.47e-01

Pos logit contributions
hower+1.039
pees+1.027
pee+0.943
ombat+0.935
+0.924
Neg logit contributions
theless-0.845
env-0.715
terday-0.646
 Haas-0.645
lis-0.638
Token status
Token ID3722
Feature activation+1.20e-01

Pos logit contributions
hower+1.284
pees+1.269
pee+1.165
ombat+1.153
legate+1.142
Neg logit contributions
theless-1.081
env-0.915
terday-0.830
lis-0.824
 Haas-0.824
Token,
Token ID11
Feature activation+1.05e-01

Pos logit contributions
hower+0.436
pees+0.428
pee+0.393
+0.392
ombat+0.389
Neg logit contributions
theless-0.376
env-0.321
lis-0.292
 Haas-0.291
terday-0.290
Token thus
Token ID4145
Feature activation+5.00e-01

Pos logit contributions
hower+0.411
pees+0.400
+0.368
pee+0.367
ombat+0.366
Neg logit contributions
theless-0.308
env-0.260
 Haas-0.233
lis-0.232
 Seym-0.232
Token ing
Token ID5347
Feature activation+2.29e-01

Pos logit contributions
hower+1.703
pees+1.665
pee+1.510
ombat+1.507
+1.506
Neg logit contributions
theless-1.727
env-1.477
terday-1.357
lis-1.344
 Haas-1.337
Tokenr
Token ID81
Feature activation+2.17e+00

Pos logit contributions
hower+0.922
pees+0.891
pee+0.824
+0.819
ombat+0.819
Neg logit contributions
theless-0.641
env-0.553
lis-0.509
ylum-0.474
 Haas-0.469
Tokenati
Token ID7246
Feature activation+9.84e-01

Pos logit contributions
pees+7.207
hower+7.100
pee+6.679
ombat+6.505
legate+6.492
Neg logit contributions
theless-7.231
env-5.964
 Seym-5.819
terday-5.612
 Haas-5.541
Tokenating
Token ID803
Feature activation+3.65e-01

Pos logit contributions
hower+2.844
pees+2.742
pee+2.453
+2.440
ombat+2.421
Neg logit contributions
theless-3.848
env-3.419
terday-3.163
lis-3.151
 Haas-3.131
Token yourself
Token ID3511
Feature activation+4.65e-01

Pos logit contributions
hower+1.326
pees+1.294
pee+1.188
ombat+1.183
+1.176
Neg logit contributions
theless-1.166
env-0.985
terday-0.895
 Seym-0.894
lis-0.891
Token with
Token ID351
Feature activation+3.22e-01

Pos logit contributions
hower+1.575
pees+1.541
pee+1.409
ombat+1.395
+1.388
Neg logit contributions
theless-1.587
env-1.356
terday-1.252
 Seym-1.244
lis-1.239
Token fans
Token ID3296
Feature activation+7.90e-01

Pos logit contributions
hower+1.227
pees+1.198
pee+1.106
ombat+1.104
+1.095
Neg logit contributions
theless-0.973
env-0.821
lis-0.739
terday-0.737
 Haas-0.735
Token photo
Token ID4590
Feature activation-2.57e-01

Pos logit contributions
hower+1.011
pees+0.990
pee+0.912
ombat+0.905
legate+0.900
Neg logit contributions
theless-0.816
env-0.684
terday-0.617
 Haas-0.616
 Seym-0.615
Tokenjournal
Token ID24891
Feature activation+1.10e-01

Pos logit contributions
theless+0.972
env+0.871
lis+0.796
 Haas+0.789
terday+0.778
Neg logit contributions
hower-0.798
pees-0.758
pee-0.677
-0.675
ombat-0.672
Tokenist
Token ID396
Feature activation+5.58e-02

Pos logit contributions
hower+0.391
pees+0.382
ombat+0.349
pee+0.347
+0.344
Neg logit contributions
theless-0.359
env-0.308
lis-0.278
terday-0.278
 Haas-0.277
Token Sean
Token ID11465
Feature activation+3.80e-01

Pos logit contributions
hower+0.206
pees+0.201
ombat+0.185
pee+0.184
+0.183
Neg logit contributions
theless-0.174
env-0.148
 Haas-0.136
 Seym-0.135
terday-0.134
Token O
Token ID440
Feature activation+8.74e-01

Pos logit contributions
hower+1.390
pees+1.370
+1.265
ombat+1.259
pee+1.255
Neg logit contributions
theless-1.186
env-1.007
 Haas-0.928
lis-0.917
 Manson-0.908
Token'
Token ID6
Feature activation+2.09e+00

Pos logit contributions
hower+2.625
pees+2.565
pee+2.314
ombat+2.288
+2.286
Neg logit contributions
theless-3.318
env-2.889
terday-2.674
lis-2.673
 Seym-2.671
TokenConnell
Token ID15559
Feature activation+3.92e-01

Pos logit contributions
hower+6.203
pees+5.897
pee+5.431
legate+5.424
ombat+5.261
Neg logit contributions
theless-7.783
env-6.575
terday-6.450
 Seym-6.295
ylum-6.133
Token (
Token ID357
Feature activation+6.79e-01

Pos logit contributions
hower+1.471
pees+1.437
pee+1.328
ombat+1.314
+1.312
Neg logit contributions
theless-1.206
env-1.015
terday-0.915
lis-0.914
 Haas-0.906
Tokenplayed
Token ID21542
Feature activation+1.01e-01

Pos logit contributions
hower+2.251
pees+2.202
pee+2.011
ombat+1.992
+1.979
Neg logit contributions
theless-2.360
env-2.022
terday-1.866
 Seym-1.862
lis-1.850
Token by
Token ID416
Feature activation+1.32e-01

Pos logit contributions
hower+0.347
pees+0.341
pee+0.310
ombat+0.309
+0.306
Neg logit contributions
theless-0.342
env-0.293
 Seym-0.269
terday-0.269
lis-0.268
Token Sean
Token ID11465
Feature activation+2.76e-01

Pos logit contributions
hower+0.485
pees+0.478
pee+0.437
ombat+0.436
+0.433
Neg logit contributions
theless-0.416
env-0.353
 Haas-0.321
lis-0.319
 Seym-0.319
Token first
Token ID717
Feature activation-8.18e-03

Pos logit contributions
theless+1.586
env+1.321
terday+1.177
 Haas+1.168
lis+1.154
Neg logit contributions
hower-2.049
pees-2.039
-1.877
pee-1.870
ombat-1.865
Token met
Token ID1138
Feature activation+1.15e-01

Pos logit contributions
theless+0.024
env+0.020
terday+0.018
 Haas+0.018
lis+0.017
Neg logit contributions
hower-0.032
pees-0.032
ombat-0.029
pee-0.029
-0.029
Token,
Token ID11
Feature activation+3.46e-01

Pos logit contributions
hower+0.405
pees+0.404
ombat+0.372
pee+0.370
+0.365
Neg logit contributions
theless-0.374
env-0.320
terday-0.294
 Haas-0.292
 Seym-0.292
Token and
Token ID290
Feature activation+2.50e-01

Pos logit contributions
hower+1.296
pees+1.284
pee+1.189
ombat+1.184
+1.179
Neg logit contributions
theless-1.055
env-0.882
 Haas-0.805
terday-0.801
lis-0.798
Token O
Token ID440
Feature activation+9.52e-01

Pos logit contributions
hower+0.911
pees+0.907
pee+0.838
ombat+0.837
+0.836
Neg logit contributions
theless-0.786
env-0.658
terday-0.605
lis-0.597
 Haas-0.597
Token'
Token ID6
Feature activation+2.08e+00

Pos logit contributions
hower+2.714
pees+2.646
pee+2.382
ombat+2.354
legate+2.339
Neg logit contributions
theless-3.757
env-3.270
 Seym-3.064
terday-3.059
lis-3.025
TokenNeill
Token ID26538
Feature activation+3.16e-01

Pos logit contributions
hower+4.604
pees+4.422
pee+3.929
legate+3.800
ombat+3.744
Neg logit contributions
theless-9.369
 Seym-8.110
env-8.067
terday-7.841
ylum-7.638
Token attempted
Token ID7482
Feature activation-3.70e-01

Pos logit contributions
hower+1.216
pees+1.200
ombat+1.109
pee+1.108
+1.103
Neg logit contributions
theless-0.940
env-0.780
terday-0.704
lis-0.698
 Haas-0.697
Token to
Token ID284
Feature activation+8.64e-02

Pos logit contributions
theless+1.009
env+0.834
terday+0.743
 Seym+0.741
lis+0.738
Neg logit contributions
hower-1.493
pees-1.472
pee-1.365
ombat-1.358
-1.350
Token advise
Token ID18595
Feature activation+2.37e-01

Pos logit contributions
hower+0.307
pees+0.301
+0.276
pee+0.273
ombat+0.271
Neg logit contributions
theless-0.284
env-0.242
terday-0.220
lis-0.219
 Haas-0.216
Token Carter
Token ID10831
Feature activation-5.36e-01

Pos logit contributions
hower+0.889
pees+0.883
pee+0.816
ombat+0.812
+0.805
Neg logit contributions
theless-0.722
env-0.604
lis-0.546
terday-0.544
 Haas-0.544
Token of
Token ID286
Feature activation-7.12e-01

Pos logit contributions
theless+1.302
env+1.135
lis+1.054
 Seym+1.043
 Haas+1.040
Neg logit contributions
hower-1.331
pees-1.311
pee-1.191
ombat-1.181
legate-1.181
Token returning
Token ID8024
Feature activation-2.22e-01

Pos logit contributions
theless+2.120
env+1.776
 Haas+1.636
lis+1.630
 Manson+1.588
Neg logit contributions
hower-2.732
pees-2.644
ombat-2.449
-2.426
pee-2.416
Token them
Token ID606
Feature activation-2.11e-01

Pos logit contributions
theless+0.624
env+0.518
lis+0.470
 Haas+0.460
terday+0.459
Neg logit contributions
hower-0.899
pees-0.874
ombat-0.807
pee-0.806
-0.803
Token to
Token ID284
Feature activation-2.55e-01

Pos logit contributions
theless+0.596
env+0.496
terday+0.443
lis+0.441
ylum+0.434
Neg logit contributions
hower-0.849
pees-0.829
pee-0.766
ombat-0.764
-0.763
Token their
Token ID511
Feature activation+6.53e-01

Pos logit contributions
theless+0.754
env+0.634
lis+0.578
 Haas+0.573
 Seym+0.571
Neg logit contributions
hower-0.993
pees-0.954
ombat-0.890
-0.883
pee-0.879
Token bed
Token ID3996
Feature activation+2.08e+00

Pos logit contributions
hower+2.442
pees+2.382
pee+2.197
ombat+2.188
legate+2.171
Neg logit contributions
theless-2.014
env-1.696
terday-1.544
 Seym-1.533
lis-1.526
Token and
Token ID290
Feature activation+2.97e-01

Pos logit contributions
pees+7.263
hower+7.243
pee+6.653
legate+6.432
 Dimension+6.395
Neg logit contributions
theless-6.565
env-5.314
terday-5.096
 Seym-5.079
 Haas-4.953
Token the
Token ID262
Feature activation+2.54e-01

Pos logit contributions
hower+1.035
pees+1.008
ombat+0.922
+0.921
pee+0.916
Neg logit contributions
theless-1.004
env-0.863
lis-0.784
terday-0.784
 Seym-0.779
Token sexual
Token ID3206
Feature activation+1.29e+00

Pos logit contributions
hower+0.925
pees+0.901
ombat+0.827
pee+0.822
+0.819
Neg logit contributions
theless-0.812
env-0.693
lis-0.629
terday-0.629
 Seym-0.623
Token side
Token ID1735
Feature activation-2.08e-02

Pos logit contributions
hower+4.619
pees+4.529
pee+4.193
ombat+4.127
legate+4.111
Neg logit contributions
theless-4.094
env-3.420
 Seym-3.198
terday-3.154
lis-3.110
Token of
Token ID286
Feature activation-1.99e-01

Pos logit contributions
theless+0.073
env+0.063
lis+0.058
terday+0.057
 Haas+0.057
Neg logit contributions
hower-0.069
pees-0.068
-0.061
pee-0.061
ombat-0.061
Token before
Token ID878
Feature activation+2.96e-01

Pos logit contributions
hower+1.866
pees+1.824
pee+1.647
ombat+1.646
+1.633
Neg logit contributions
theless-2.329
env-2.018
terday-1.891
 Seym-1.860
lis-1.859
Token 8
Token ID807
Feature activation+5.79e-01

Pos logit contributions
hower+1.093
pees+1.078
ombat+0.993
pee+0.990
legate+0.984
Neg logit contributions
theless-0.916
env-0.769
terday-0.711
lis-0.708
 Haas-0.705
Token:
Token ID25
Feature activation+2.77e-01

Pos logit contributions
hower+1.954
pees+1.904
ombat+1.745
pee+1.738
+1.720
Neg logit contributions
theless-1.989
env-1.720
terday-1.576
lis-1.567
 Haas-1.553
Token30
Token ID1270
Feature activation+1.35e+00

Pos logit contributions
hower+1.045
pees+1.013
ombat+0.941
pee+0.929
legate+0.927
Neg logit contributions
theless-0.843
env-0.716
lis-0.652
 Haas-0.646
terday-0.643
Token a
Token ID257
Feature activation+3.48e-01

Pos logit contributions
hower+5.013
pees+4.919
pee+4.541
ombat+4.476
legate+4.475
Neg logit contributions
theless-4.142
env-3.458
 Seym-3.161
terday-3.150
lis-3.109
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
hower+1.232
pees+1.206
pee+1.099
+1.098
ombat+1.095
Neg logit contributions
theless-1.121
env-0.973
lis-0.887
terday-0.879
 Haas-0.876
Tokenm
Token ID76
Feature activation+1.46e+00

Pos logit contributions
hower+5.401
pees+5.328
pee+4.905
legate+4.768
quet+4.745
Neg logit contributions
theless-8.573
 Seym-7.485
env-7.035
terday-6.919
etheless-6.775
Token.
Token ID13
Feature activation+8.64e-01

Pos logit contributions
hower+4.511
pees+4.430
pee+4.043
ombat+4.010
legate+3.986
Neg logit contributions
theless-5.380
env-4.571
 Seym-4.248
terday-4.199
lis-4.174
Token threatens
Token ID20201
Feature activation+8.23e-01

Pos logit contributions
hower+3.210
pees+3.144
pee+2.897
ombat+2.884
+2.867
Neg logit contributions
theless-2.673
env-2.243
terday-2.050
 Seym-2.033
lis-2.027
Token children
Token ID1751
Feature activation+3.00e-01

Pos logit contributions
hower+3.176
pees+3.120
pee+2.884
ombat+2.865
+2.851
Neg logit contributions
theless-2.420
env-2.019
terday-1.830
 Seym-1.816
lis-1.808
Token's
Token ID338
Feature activation-7.66e-01

Pos logit contributions
hower+1.136
pees+1.114
pee+1.031
ombat+1.021
+1.015
Neg logit contributions
theless-0.901
env-0.750
terday-0.681
 Seym-0.681
lis-0.674
Token alone
Token ID3436
Feature activation-3.94e-02

Pos logit contributions
hower+0.485
pees+0.475
+0.434
ombat+0.431
pee+0.430
Neg logit contributions
theless-0.439
env-0.375
terday-0.337
 Haas-0.336
lis-0.336
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
theless+0.126
env+0.106
terday+0.097
lis+0.096
 Haas+0.095
Neg logit contributions
hower-0.143
pees-0.141
ombat-0.128
pee-0.128
-0.128
Token\n
Token ID198
Feature activation+2.29e-01

Pos logit contributions
hower+0.050
pees+0.049
ombat+0.045
pee+0.045
legate+0.044
Neg logit contributions
theless-0.039
env-0.034
 Haas-0.031
 Manson-0.030
lis-0.030
Token\n
Token ID198
Feature activation+2.01e-01

Pos logit contributions
hower+0.861
pees+0.843
pee+0.774
ombat+0.769
+0.766
Neg logit contributions
theless-0.695
env-0.594
lis-0.533
terday-0.532
 Haas-0.526
TokenO
Token ID46
Feature activation+2.88e-01

Pos logit contributions
hower+0.725
pees+0.696
ombat+0.636
pee+0.628
+0.627
Neg logit contributions
theless-0.646
env-0.577
lis-0.519
terday-0.504
URE-0.502
Token'
Token ID6
Feature activation+2.07e+00

Pos logit contributions
hower+0.829
pees+0.809
pee+0.728
ombat+0.720
legate+0.714
Neg logit contributions
theless-1.130
env-0.985
terday-0.920
 Seym-0.917
lis-0.911
TokenBrian
Token ID24761
Feature activation-1.28e-01

Pos logit contributions
hower+6.497
pees+6.365
pee+6.007
legate+5.797
ombat+5.796
Neg logit contributions
theless-7.215
env-5.964
 Seym-5.962
terday-5.751
ylum-5.627
Token is
Token ID318
Feature activation-2.30e-01

Pos logit contributions
theless+0.373
env+0.307
 Haas+0.275
lis+0.275
terday+0.273
Neg logit contributions
hower-0.505
pees-0.494
pee-0.456
ombat-0.454
-0.448
Token as
Token ID355
Feature activation-6.95e-01

Pos logit contributions
theless+0.689
env+0.589
 Haas+0.526
lis+0.521
terday+0.517
Neg logit contributions
hower-0.891
pees-0.869
-0.796
ombat-0.792
pee-0.791
Token fascinated
Token ID30103
Feature activation+5.02e-01

Pos logit contributions
theless+2.257
env+1.931
 Haas+1.745
ylum+1.742
lis+1.729
Neg logit contributions
hower-2.498
pees-2.439
-2.226
ombat-2.188
pee-2.181
Token by
Token ID416
Feature activation+4.88e-01

Pos logit contributions
hower+1.640
pees+1.602
pee+1.463
ombat+1.447
legate+1.437
Neg logit contributions
theless-1.770
env-1.519
terday-1.403
 Seym-1.401
lis-1.392
Token vow
Token ID23268
Feature activation-3.24e-01

Pos logit contributions
hower+1.230
pees+1.208
pee+1.113
ombat+1.111
+1.098
Neg logit contributions
theless-0.929
env-0.776
terday-0.695
 Seym-0.689
lis-0.689
Token to
Token ID284
Feature activation-4.55e-01

Pos logit contributions
theless+0.933
env+0.791
lis+0.721
terday+0.699
 Seym+0.698
Neg logit contributions
hower-1.258
pees-1.244
ombat-1.163
pee-1.140
-1.131
Token meet
Token ID1826
Feature activation-2.04e-01

Pos logit contributions
theless+1.416
env+1.165
 Haas+1.083
lis+1.075
 Seym+1.059
Neg logit contributions
hower-1.708
pees-1.644
ombat-1.550
pee-1.497
-1.493
Token in
Token ID287
Feature activation+2.26e-01

Pos logit contributions
theless+0.605
env+0.507
 Haas+0.460
lis+0.457
 Seym+0.455
Neg logit contributions
hower-0.794
pees-0.774
ombat-0.719
pee-0.715
legate-0.699
Token the
Token ID262
Feature activation+9.76e-02

Pos logit contributions
hower+0.878
pees+0.858
pee+0.790
ombat+0.787
legate+0.776
Neg logit contributions
theless-0.667
env-0.564
 Haas-0.509
 Seym-0.507
lis-0.502
Token Sword
Token ID11535
Feature activation+2.07e+00

Pos logit contributions
hower+0.355
pees+0.349
pee+0.319
ombat+0.316
legate+0.310
Neg logit contributions
theless-0.314
env-0.269
 Haas-0.244
 Seym-0.243
lis-0.240
Token-
Token ID12
Feature activation+8.89e-01

Pos logit contributions
hower+4.880
pees+4.758
 Dimension+4.335
pee+4.243
quet+4.230
Neg logit contributions
theless-8.891
env-7.497
terday-7.256
 Seym-7.103
lis-7.012
TokenArt
Token ID8001
Feature activation+6.92e-01

Pos logit contributions
hower+1.809
pees+1.768
pee+1.528
ombat+1.497
+1.486
Neg logit contributions
theless-4.213
env-3.725
terday-3.558
 Seym-3.542
 Haas-3.509
Token Festival
Token ID11117
Feature activation+6.87e-01

Pos logit contributions
hower+1.957
pees+1.910
pee+1.729
+1.702
ombat+1.700
Neg logit contributions
theless-2.735
env-2.376
terday-2.225
 Seym-2.210
lis-2.200
Token finals
Token ID18576
Feature activation+6.91e-02

Pos logit contributions
hower+2.599
pees+2.549
pee+2.353
ombat+2.335
+2.317
Neg logit contributions
theless-2.080
env-1.740
terday-1.573
 Seym-1.573
lis-1.564
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
hower+0.276
pees+0.274
ombat+0.252
pee+0.252
+0.249
Neg logit contributions
theless-0.195
env-0.161
terday-0.143
lis-0.143
 Haas-0.143
Tokencom
Token ID785
Feature activation-1.70e-01

Pos logit contributions
theless+0.326
env+0.273
 Seym+0.243
lis+0.243
terday+0.240
Neg logit contributions
hower-0.473
pees-0.463
pee-0.429
-0.426
ombat-0.425
Token and
Token ID290
Feature activation-2.94e-01

Pos logit contributions
theless+0.535
env+0.458
lis+0.420
 Haas+0.413
 Seym+0.412
Neg logit contributions
hower-0.619
pees-0.600
pee-0.550
-0.548
ombat-0.546
Token V
Token ID569
Feature activation+1.64e-01

Pos logit contributions
theless+0.883
env+0.747
lis+0.671
 Haas+0.670
terday+0.662
Neg logit contributions
hower-1.113
pees-1.101
pee-1.011
-0.993
ombat-0.987
Tokenodka
Token ID28601
Feature activation+5.13e-01

Pos logit contributions
hower+0.361
pees+0.343
pee+0.293
+0.292
legate+0.283
Neg logit contributions
theless-0.759
env-0.687
lis-0.642
terday-0.638
 Haas-0.635
Token O
Token ID440
Feature activation+6.57e-01

Pos logit contributions
hower+1.795
pees+1.758
pee+1.610
+1.589
legate+1.586
Neg logit contributions
theless-1.686
env-1.452
terday-1.319
 Seym-1.316
lis-1.310
Token'
Token ID6
Feature activation+2.05e+00

Pos logit contributions
hower+1.952
pees+1.897
pee+1.704
+1.686
ombat+1.685
Neg logit contributions
theless-2.518
env-2.216
lis-2.047
terday-2.039
 Seym-2.027
TokenClock
Token ID44758
Feature activation+7.83e-01

Pos logit contributions
hower+4.666
pees+4.523
pee+4.122
legate+4.044
ombat+3.987
Neg logit contributions
theless-9.079
 Seym-7.829
env-7.673
terday-7.558
ylum-7.384
Token Podcast
Token ID16036
Feature activation-1.69e-01

Pos logit contributions
hower+2.104
pees+2.049
pee+1.833
ombat+1.815
+1.793
Neg logit contributions
theless-3.215
env-2.819
terday-2.643
 Seym-2.635
lis-2.620
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
theless+0.554
env+0.475
 Seym+0.435
lis+0.432
terday+0.432
Neg logit contributions
hower-0.589
pees-0.577
pee-0.529
ombat-0.521
-0.520
Token Author
Token ID6434
Feature activation-2.00e-01

Pos logit contributions
hower+0.717
pees+0.703
pee+0.645
+0.639
ombat+0.636
Neg logit contributions
theless-0.587
env-0.500
 Seym-0.453
lis-0.452
 Haas-0.448
Token of
Token ID286
Feature activation-4.93e-01

Pos logit contributions
theless+0.673
env+0.603
lis+0.558
 Haas+0.548
terday+0.535
Neg logit contributions
hower-0.671
pees-0.654
pee-0.591
-0.587
ombat-0.585
Token
Token ID198
Feature activation-1.53e-01

Pos logit contributions
hower+0.480
pees+0.469
pee+0.430
ombat+0.428
+0.427
Neg logit contributions
theless-0.368
env-0.318
lis-0.285
terday-0.281
 Haas-0.279
Token
Token ID447
Feature activation-5.55e-02

Pos logit contributions
theless+0.498
env+0.435
lis+0.401
terday+0.390
 Haas+0.384
Neg logit contributions
hower-0.539
pees-0.518
ombat-0.475
-0.471
pee-0.470
Token
Token ID250
Feature activation-1.77e-01

Pos logit contributions
theless+0.202
env+0.175
terday+0.162
 Seym+0.162
lis+0.161
Neg logit contributions
hower-0.175
pees-0.170
pee-0.155
ombat-0.153
legate-0.152
TokenHa
Token ID23303
Feature activation-3.21e-01

Pos logit contributions
theless+0.610
env+0.538
lis+0.500
terday+0.484
ylum+0.477
Neg logit contributions
hower-0.591
pees-0.565
-0.520
ombat-0.519
pee-0.508
Token!
Token ID0
Feature activation+2.96e-01

Pos logit contributions
theless+1.005
env+0.847
terday+0.792
 Haas+0.763
lis+0.760
Neg logit contributions
hower-1.182
pees-1.150
pee-1.047
ombat-1.043
-1.040
Token Fuck
Token ID25617
Feature activation+2.04e+00

Pos logit contributions
hower+1.022
pees+1.000
pee+0.910
ombat+0.909
+0.904
Neg logit contributions
theless-0.992
env-0.844
terday-0.774
lis-0.774
 Haas-0.773
Token off
Token ID572
Feature activation+2.52e-01

Pos logit contributions
hower+6.571
pees+6.483
pee+6.060
legate+5.909
ombat+5.902
Neg logit contributions
theless-6.856
env-5.740
 Seym-5.531
terday-5.352
lis-5.166
Token!
Token ID0
Feature activation+1.80e-01

Pos logit contributions
hower+0.871
pees+0.851
pee+0.777
ombat+0.771
+0.771
Neg logit contributions
theless-0.840
env-0.719
terday-0.660
lis-0.657
 Seym-0.657
Token
Token ID447
Feature activation-7.25e-01

Pos logit contributions
hower+0.632
pees+0.615
ombat+0.559
pee+0.559
legate+0.559
Neg logit contributions
theless-0.592
env-0.506
 Haas-0.463
lis-0.463
 Seym-0.461
Token
Token ID251
Feature activation-2.02e-01

Pos logit contributions
theless+1.920
env+1.674
 Haas+1.557
lis+1.520
ylum+1.506
Neg logit contributions
hower-2.856
pees-2.726
pee-2.538
ombat-2.520
apor-2.513
Token he
Token ID339
Feature activation+1.29e-01

Pos logit contributions
theless+0.621
env+0.527
lis+0.484
 Haas+0.479
terday+0.476
Neg logit contributions
hower-0.747
pees-0.728
pee-0.669
legate-0.668
ombat-0.664
Tokenundrum
Token ID46859
Feature activation-1.80e-01

Pos logit contributions
hower+0.346
pees+0.346
+0.311
pee+0.307
ombat+0.301
Neg logit contributions
theless-0.443
env-0.408
lis-0.381
 Haas-0.368
ylum-0.364
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
theless+0.547
env+0.470
terday+0.425
lis+0.419
 Haas+0.416
Neg logit contributions
hower-0.668
pees-0.662
pee-0.606
ombat-0.605
-0.602
Token Within
Token ID12511
Feature activation+1.17e+00

Pos logit contributions
theless+0.837
env+0.730
 Haas+0.656
lis+0.654
terday+0.652
Neg logit contributions
hower-1.039
pees-1.011
-0.937
ombat-0.931
pee-0.921
Token a
Token ID257
Feature activation+7.51e-01

Pos logit contributions
hower+4.218
pees+4.133
pee+3.807
ombat+3.774
legate+3.756
Neg logit contributions
theless-3.666
env-3.081
 Seym-2.819
terday-2.815
lis-2.792
Token few
Token ID1178
Feature activation+1.31e+00

Pos logit contributions
hower+2.983
pees+2.928
pee+2.717
ombat+2.696
+2.682
Neg logit contributions
theless-2.121
env-1.747
terday-1.574
 Seym-1.569
lis-1.556
Token hours
Token ID2250
Feature activation+2.02e+00

Pos logit contributions
hower+3.540
pees+3.448
pee+3.078
ombat+3.040
+3.020
Neg logit contributions
theless-5.351
env-4.697
terday-4.395
 Seym-4.387
lis-4.368
Token we
Token ID356
Feature activation+2.72e-01

Pos logit contributions
hower+6.102
pees+5.943
pee+5.372
legate+5.329
ombat+5.283
Neg logit contributions
theless-7.354
env-6.311
 Seym-5.832
ylum-5.777
terday-5.752
Token received
Token ID2722
Feature activation+7.30e-01

Pos logit contributions
hower+1.083
pees+1.058
pee+0.981
ombat+0.969
+0.968
Neg logit contributions
theless-0.784
env-0.648
terday-0.583
 Haas-0.569
lis-0.567
Token a
Token ID257
Feature activation+6.36e-01

Pos logit contributions
hower+2.815
pees+2.757
pee+2.553
ombat+2.532
+2.524
Neg logit contributions
theless-2.163
env-1.802
terday-1.630
 Seym-1.626
lis-1.611
Token definitive
Token ID17347
Feature activation+8.34e-01

Pos logit contributions
hower+2.286
pees+2.247
pee+2.053
ombat+2.050
+2.038
Neg logit contributions
theless-2.062
env-1.742
 Haas-1.592
 Seym-1.586
terday-1.586
Token answer
Token ID3280
Feature activation+7.53e-01

Pos logit contributions
hower+2.941
pees+2.886
pee+2.629
ombat+2.623
+2.607
Neg logit contributions
theless-2.755
env-2.347
terday-2.147
 Haas-2.142
 Seym-2.129
Token stories
Token ID3923
Feature activation-2.73e-01

Pos logit contributions
theless+1.008
env+0.877
lis+0.814
 Seym+0.808
 Haas+0.805
Neg logit contributions
hower-0.824
pees-0.781
-0.709
pee-0.708
ombat-0.704
Token behind
Token ID2157
Feature activation+6.71e-02

Pos logit contributions
theless+0.868
env+0.743
 Haas+0.675
terday+0.672
lis+0.666
Neg logit contributions
hower-0.982
pees-0.966
ombat-0.891
-0.882
pee-0.870
Token him
Token ID683
Feature activation+9.50e-01

Pos logit contributions
hower+0.259
pees+0.254
ombat+0.235
pee+0.234
legate+0.230
Neg logit contributions
theless-0.197
env-0.167
terday-0.149
 Haas-0.148
lis-0.148
Token,
Token ID11
Feature activation+9.64e-02

Pos logit contributions
hower+3.147
pees+3.078
pee+2.811
ombat+2.785
+2.767
Neg logit contributions
theless-3.308
env-2.836
terday-2.616
 Seym-2.610
lis-2.594
Token O
Token ID440
Feature activation+7.34e-01

Pos logit contributions
hower+0.353
pees+0.350
ombat+0.321
pee+0.320
+0.315
Neg logit contributions
theless-0.304
env-0.256
 Haas-0.232
terday-0.232
 Seym-0.230
Token'
Token ID6
Feature activation+2.02e+00

Pos logit contributions
hower+2.118
pees+2.063
pee+1.856
ombat+1.837
+1.824
Neg logit contributions
theless-2.870
env-2.507
terday-2.335
 Seym-2.330
lis-2.321
TokenBrian
Token ID24761
Feature activation+3.61e-01

Pos logit contributions
hower+6.294
pees+6.209
pee+5.863
legate+5.628
ombat+5.628
Neg logit contributions
theless-7.143
 Seym-5.912
env-5.872
terday-5.671
ylum-5.560
Token accepted
Token ID6292
Feature activation-1.08e-01

Pos logit contributions
hower+1.392
pees+1.369
pee+1.262
ombat+1.257
+1.243
Neg logit contributions
theless-1.083
env-0.899
terday-0.804
lis-0.803
 Haas-0.800
Token an
Token ID281
Feature activation+3.75e-01

Pos logit contributions
theless+0.294
env+0.244
 Haas+0.214
ylum+0.213
terday+0.212
Neg logit contributions
hower-0.445
pees-0.437
pee-0.407
ombat-0.404
-0.402
Token offer
Token ID2897
Feature activation-3.42e-01

Pos logit contributions
hower+1.374
pees+1.345
pee+1.236
ombat+1.226
+1.216
Neg logit contributions
theless-1.190
env-1.030
 Haas-0.918
lis-0.916
 Seym-0.915
Token from
Token ID422
Feature activation-5.62e-01

Pos logit contributions
theless+0.953
env+0.809
 Haas+0.722
terday+0.714
ylum+0.709
Neg logit contributions
hower-1.357
pees-1.345
pee-1.241
ombat-1.240
-1.232
Token want
Token ID765
Feature activation+3.40e-01

Pos logit contributions
theless+3.072
env+2.601
terday+2.312
 Haas+2.287
lis+2.272
Neg logit contributions
hower-4.263
pees-4.119
-3.948
ombat-3.885
formance-3.744
Token to
Token ID284
Feature activation+2.68e-01

Pos logit contributions
hower+1.311
pees+1.282
pee+1.186
ombat+1.183
+1.176
Neg logit contributions
theless-1.007
env-0.840
terday-0.759
 Seym-0.756
lis-0.754
Token get
Token ID651
Feature activation+3.81e-01

Pos logit contributions
hower+0.941
pees+0.915
+0.845
ombat+0.838
pee+0.824
Neg logit contributions
theless-0.889
env-0.763
lis-0.700
terday-0.698
 Haas-0.689
Token you
Token ID345
Feature activation-6.38e-01

Pos logit contributions
hower+1.392
pees+1.358
ombat+1.249
+1.249
pee+1.245
Neg logit contributions
theless-1.214
env-1.034
 Haas-0.931
lis-0.930
terday-0.930
Token into
Token ID656
Feature activation+1.97e-01

Pos logit contributions
theless+1.872
env+1.558
 Haas+1.401
lis+1.372
ylum+1.369
Neg logit contributions
hower-2.485
pees-2.434
-2.330
ombat-2.283
pee-2.198
Token bed
Token ID3996
Feature activation+2.02e+00

Pos logit contributions
hower+0.484
pees+0.465
+0.415
ombat+0.414
pee+0.406
Neg logit contributions
theless-0.857
env-0.763
 Seym-0.715
lis-0.713
 Haas-0.712
Token,
Token ID11
Feature activation+8.94e-02

Pos logit contributions
hower+6.089
pees+5.915
pee+5.463
legate+5.268
+5.160
Neg logit contributions
theless-7.204
env-6.200
 Seym-5.878
ylum-5.726
terday-5.722
Token there
Token ID612
Feature activation-1.91e-01

Pos logit contributions
hower+0.308
pees+0.297
ombat+0.276
+0.274
pee+0.270
Neg logit contributions
theless-0.303
env-0.259
 Haas-0.238
 Seym-0.238
terday-0.238
Token are
Token ID389
Feature activation+1.75e-01

Pos logit contributions
theless+0.541
env+0.448
terday+0.402
lis+0.401
 Haas+0.396
Neg logit contributions
hower-0.758
pees-0.736
ombat-0.684
pee-0.682
-0.681
Token no
Token ID645
Feature activation+7.87e-01

Pos logit contributions
hower+0.677
pees+0.651
ombat+0.607
pee+0.606
+0.605
Neg logit contributions
theless-0.523
env-0.437
 Haas-0.396
lis-0.394
terday-0.393
Token words
Token ID2456
Feature activation-4.66e-01

Pos logit contributions
hower+2.675
pees+2.601
pee+2.377
ombat+2.374
+2.346
Neg logit contributions
theless-2.711
env-2.328
terday-2.139
 Seym-2.118
lis-2.113
Token to
Token ID284
Feature activation+4.22e-01

Pos logit contributions
hower+1.568
pees+1.540
pee+1.429
ombat+1.418
legate+1.412
Neg logit contributions
theless-1.135
env-0.930
 Seym-0.846
terday-0.842
lis-0.829
Token a
Token ID257
Feature activation+4.80e-01

Pos logit contributions
hower+1.680
pees+1.651
pee+1.532
ombat+1.523
+1.510
Neg logit contributions
theless-1.203
env-0.993
terday-0.894
 Seym-0.888
lis-0.886
Token 2012
Token ID2321
Feature activation+5.24e-01

Pos logit contributions
hower+1.734
pees+1.699
pee+1.563
ombat+1.552
+1.541
Neg logit contributions
theless-1.531
env-1.293
terday-1.181
 Seym-1.177
lis-1.171
Token Pew
Token ID21805
Feature activation+2.42e-01

Pos logit contributions
hower+1.869
pees+1.833
pee+1.683
ombat+1.671
+1.657
Neg logit contributions
theless-1.702
env-1.443
terday-1.318
 Seym-1.313
lis-1.309
Token Research
Token ID4992
Feature activation+1.06e+00

Pos logit contributions
hower+0.818
pees+0.808
pee+0.739
ombat+0.732
+0.727
Neg logit contributions
theless-0.822
env-0.707
terday-0.648
lis-0.648
 Haas-0.645
Token Center
Token ID3337
Feature activation+2.01e+00

Pos logit contributions
hower+2.952
pees+2.867
pee+2.567
ombat+2.542
legate+2.528
Neg logit contributions
theless-4.233
env-3.702
 Seym-3.465
terday-3.456
lis-3.432
Token survey
Token ID5526
Feature activation+5.13e-01

Pos logit contributions
hower+6.920
pees+6.677
legate+6.231
 Abstract+6.209
 Dimension+6.193
Neg logit contributions
theless-6.357
env-5.201
 Seym-4.981
terday-4.754
ylum-4.665
Token of
Token ID286
Feature activation+6.26e-01

Pos logit contributions
hower+1.714
pees+1.678
pee+1.534
ombat+1.521
+1.511
Neg logit contributions
theless-1.771
env-1.515
terday-1.400
 Seym-1.391
lis-1.387
Token Hispanics
Token ID30714
Feature activation+4.14e-01

Pos logit contributions
hower+2.175
pees+2.128
pee+1.951
ombat+1.940
+1.926
Neg logit contributions
theless-2.092
env-1.778
terday-1.635
 Seym-1.622
lis-1.617
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
hower+1.456
pees+1.416
pee+1.307
ombat+1.300
+1.291
Neg logit contributions
theless-1.360
env-1.165
terday-1.083
lis-1.066
 Haas-1.059
Token just
Token ID655
Feature activation-6.50e-01

Pos logit contributions
hower+0.390
pees+0.381
ombat+0.353
pee+0.351
+0.350
Neg logit contributions
theless-0.307
env-0.256
terday-0.236
 Haas-0.234
lis-0.231
Token-
Token ID12
Feature activation+5.07e-01

Pos logit contributions
hower+0.356
pees+0.350
pee+0.315
ombat+0.314
+0.311
Neg logit contributions
theless-0.375
env-0.329
 Haas-0.301
lis-0.301
terday-0.299
TokenW
Token ID54
Feature activation+1.14e-01

Pos logit contributions
hower+1.662
pees+1.619
pee+1.464
ombat+1.461
+1.447
Neg logit contributions
theless-1.779
env-1.557
lis-1.427
terday-1.422
 Seym-1.413
Tokenrestling
Token ID48839
Feature activation+1.35e+00

Pos logit contributions
hower+0.373
pees+0.363
+0.332
pee+0.327
 Dimension+0.319
Neg logit contributions
theless-0.403
env-0.354
lis-0.327
ylum-0.319
terday-0.319
Token-
Token ID12
Feature activation+4.58e-01

Pos logit contributions
hower+4.463
pees+4.363
pee+3.995
ombat+3.947
legate+3.924
Neg logit contributions
theless-4.680
env-3.996
terday-3.694
 Seym-3.687
lis-3.652
TokenC
Token ID34
Feature activation-4.49e-02

Pos logit contributions
hower+1.505
pees+1.464
ombat+1.322
pee+1.321
+1.310
Neg logit contributions
theless-1.606
env-1.410
lis-1.292
terday-1.284
 Seym-1.275
Tokenou
Token ID280
Feature activation+1.98e+00

Pos logit contributions
theless+0.157
env+0.139
lis+0.128
ylum+0.125
terday+0.124
Neg logit contributions
hower-0.146
pees-0.141
-0.129
pee-0.127
ombat-0.126
Tokenples
Token ID2374
Feature activation+5.31e-01

Pos logit contributions
pees+6.161
hower+6.093
pee+5.650
legate+5.452
ombat+5.451
Neg logit contributions
theless-7.289
env-6.080
terday-5.838
 Seym-5.748
 Haas-5.593
Token Cherry
Token ID23165
Feature activation-1.53e+00

Pos logit contributions
hower+1.715
pees+1.693
ombat+1.529
+1.523
pee+1.521
Neg logit contributions
theless-1.864
env-1.632
 Haas-1.504
lis-1.503
 Seym-1.497
Token D
Token ID360
Feature activation+1.27e-01

Pos logit contributions
theless+4.937
env+4.530
lis+4.276
 Manson+4.242
 Haas+4.210
Neg logit contributions
hower-4.658
pees-4.556
ombat-4.323
-4.267
legate-4.187
Tokenarts
Token ID5889
Feature activation+4.98e-01

Pos logit contributions
hower+0.384
pees+0.370
+0.337
ombat+0.332
pee+0.332
Neg logit contributions
theless-0.471
env-0.424
lis-0.392
 Seym-0.390
 Haas-0.388
Token 200
Token ID939
Feature activation+5.10e-01

Pos logit contributions
hower+1.832
pees+1.799
pee+1.655
ombat+1.646
+1.638
Neg logit contributions
theless-1.548
env-1.308
 Seym-1.189
terday-1.189
lis-1.183
Token of
Token ID286
Feature activation-4.71e-01

Pos logit contributions
theless+1.793
env+1.546
ylum+1.389
lis+1.379
terday+1.375
Neg logit contributions
hower-2.082
pees-2.049
-1.933
ombat-1.910
legate-1.867
Token protest
Token ID5402
Feature activation-2.36e-01

Pos logit contributions
theless+1.474
env+1.240
 Haas+1.131
 citizenship+1.130
lis+1.125
Neg logit contributions
hower-1.721
pees-1.701
-1.571
ombat-1.550
pee-1.533
Token.
Token ID13
Feature activation+2.12e-01

Pos logit contributions
theless+0.733
env+0.628
terday+0.574
lis+0.572
ylum+0.569
Neg logit contributions
hower-0.851
pees-0.832
-0.771
ombat-0.765
legate-0.759
Token<|endoftext|>
Token ID50256
Feature activation+4.02e-01

Pos logit contributions
hower+0.769
pees+0.753
pee+0.685
ombat+0.681
+0.678
Neg logit contributions
theless-0.668
env-0.578
 Haas-0.533
lis-0.523
 Manson-0.519
TokenLast
Token ID5956
Feature activation-1.24e+00

Pos logit contributions
hower+1.372
pees+1.328
ombat+1.206
pee+1.204
+1.202
Neg logit contributions
theless-1.362
env-1.196
lis-1.092
terday-1.079
 Haas-1.060
Token Chance
Token ID11809
Feature activation-2.56e+00

Pos logit contributions
theless+3.328
env+2.822
terday+2.474
lis+2.464
ylum+2.461
Neg logit contributions
hower-4.859
pees-4.847
ombat-4.510
pee-4.463
legate-4.380
Token to
Token ID284
Feature activation-6.98e-01

Pos logit contributions
theless+7.613
env+6.878
 Haas+6.368
 Hurricanes+6.312
lis+6.290
Neg logit contributions
hower-8.356
pees-7.904
-7.501
ombat-7.266
formance-7.207
Token See
Token ID4091
Feature activation-8.89e-01

Pos logit contributions
theless+2.290
env+2.048
 Haas+1.900
lis+1.874
 Manson+1.864
Neg logit contributions
hower-2.356
pees-2.233
-2.044
ombat-2.032
legate-1.999
Token the
Token ID262
Feature activation-1.71e-01

Pos logit contributions
theless+2.734
env+2.406
 Haas+2.188
terday+2.178
 Manson+2.157
Neg logit contributions
hower-3.200
pees-3.118
ombat-2.898
-2.843
legate-2.836
Token Small
Token ID10452
Feature activation-1.60e-01

Pos logit contributions
theless+0.534
env+0.466
 Haas+0.431
lis+0.423
 Manson+0.420
Neg logit contributions
hower-0.622
pees-0.609
ombat-0.553
pee-0.553
legate-0.546
Token,
Token ID11
Feature activation-1.69e-01

Pos logit contributions
theless+0.545
env+0.488
 Haas+0.451
lis+0.451
ylum+0.440
Neg logit contributions
hower-0.533
pees-0.516
ombat-0.467
-0.466
pee-0.466
Token LLC
Token ID11419
Feature activation-3.96e-04

Pos logit contributions
theless+0.693
env+0.597
 Haas+0.538
lis+0.533
 Seym+0.532
Neg logit contributions
hower-0.816
pees-0.784
ombat-0.729
legate-0.723
pee-0.721
Token.
Token ID13
Feature activation-1.06e-01

Pos logit contributions
theless+0.001
env+0.001
 Haas+0.001
lis+0.001
terday+0.001
Neg logit contributions
hower-0.001
pees-0.001
-0.001
pee-0.001
ombat-0.001
Token Click
Token ID6914
Feature activation+1.12e-01

Pos logit contributions
theless+0.352
env+0.308
 Haas+0.283
terday+0.280
lis+0.279
Neg logit contributions
hower-0.367
pees-0.355
pee-0.324
ombat-0.324
legate-0.323
Token here
Token ID994
Feature activation-4.97e-01

Pos logit contributions
hower+0.330
pees+0.321
+0.289
ombat+0.288
pee+0.286
Neg logit contributions
theless-0.425
env-0.379
terday-0.353
 Haas-0.349
lis-0.347
Token for
Token ID329
Feature activation+2.37e-02

Pos logit contributions
theless+1.448
env+1.210
terday+1.084
 Haas+1.077
lis+1.077
Neg logit contributions
hower-1.917
pees-1.880
pee-1.735
-1.730
ombat-1.722
Token reprint
Token ID26589
Feature activation-2.40e+00

Pos logit contributions
hower+0.087
pees+0.085
pee+0.078
ombat+0.078
+0.077
Neg logit contributions
theless-0.075
env-0.063
terday-0.057
 Seym-0.057
lis-0.057
Token permission
Token ID7170
Feature activation-1.37e+00

Pos logit contributions
theless+2.571
env+2.044
 Haas+1.643
 citizenship+1.620
iates+1.376
Neg logit contributions
hower-12.648
pees-12.530
-12.001
pee-11.929
ascal-11.629
Token.
Token ID13
Feature activation-3.55e-01

Pos logit contributions
theless+3.150
env+2.797
 Haas+2.663
lis+2.579
 Manson+2.526
Neg logit contributions
hower-5.613
pees-5.426
-5.212
ascal-5.077
pee-5.052
Token<|endoftext|>
Token ID50256
Feature activation+2.75e-01

Pos logit contributions
theless+0.977
env+0.834
 Haas+0.766
lis+0.746
terday+0.734
Neg logit contributions
hower-1.406
pees-1.376
ombat-1.272
legate-1.269
-1.265
TokenThe
Token ID464
Feature activation-2.41e-01

Pos logit contributions
hower+0.946
pees+0.913
ombat+0.830
+0.826
pee+0.826
Neg logit contributions
theless-0.929
env-0.820
lis-0.749
terday-0.737
ylum-0.723
Token S
Token ID311
Feature activation+9.86e-02

Pos logit contributions
theless+0.781
env+0.672
 Haas+0.619
 Seym+0.610
terday+0.610
Neg logit contributions
hower-0.858
pees-0.839
pee-0.767
ombat-0.763
-0.761
Token LLC
Token ID11419
Feature activation-1.55e-01

Pos logit contributions
theless+0.702
env+0.606
 Haas+0.544
lis+0.542
 Seym+0.540
Neg logit contributions
hower-0.805
pees-0.774
ombat-0.720
pee-0.713
legate-0.711
Token.
Token ID13
Feature activation-5.31e-02

Pos logit contributions
theless+0.472
env+0.410
 Haas+0.370
lis+0.367
terday+0.366
Neg logit contributions
hower-0.574
pees-0.552
-0.510
pee-0.506
ombat-0.501
Token Click
Token ID6914
Feature activation+1.38e-01

Pos logit contributions
theless+0.176
env+0.154
 Haas+0.142
lis+0.140
terday+0.140
Neg logit contributions
hower-0.183
pees-0.178
pee-0.162
ombat-0.162
legate-0.161
Token here
Token ID994
Feature activation-4.52e-01

Pos logit contributions
hower+0.406
pees+0.393
+0.353
ombat+0.351
pee+0.349
Neg logit contributions
theless-0.529
env-0.471
terday-0.437
 Haas-0.432
lis-0.430
Token for
Token ID329
Feature activation+3.81e-02

Pos logit contributions
theless+1.336
env+1.120
terday+1.006
lis+1.000
 Haas+0.995
Neg logit contributions
hower-1.731
pees-1.698
pee-1.566
-1.558
ombat-1.554
Token reprint
Token ID26589
Feature activation-2.37e+00

Pos logit contributions
hower+0.141
pees+0.138
pee+0.127
+0.126
ombat+0.126
Neg logit contributions
theless-0.119
env-0.100
terday-0.091
 Seym-0.090
 Haas-0.090
Token permission
Token ID7170
Feature activation-1.33e+00

Pos logit contributions
theless+2.584
env+2.116
 citizenship+1.624
 Haas+1.599
URE+1.367
Neg logit contributions
hower-12.415
pees-12.266
-11.790
pee-11.683
ÍÍ-11.378
Token.
Token ID13
Feature activation-3.96e-01

Pos logit contributions
theless+3.198
env+2.860
 Haas+2.657
lis+2.627
 Manson+2.505
Neg logit contributions
hower-5.349
pees-5.137
-4.914
pee-4.768
ascal-4.767
Token<|endoftext|>
Token ID50256
Feature activation+3.32e-01

Pos logit contributions
theless+1.075
env+0.919
 Haas+0.845
lis+0.821
terday+0.809
Neg logit contributions
hower-1.582
pees-1.547
ombat-1.431
legate-1.430
pee-1.421
TokenWhile
Token ID3633
Feature activation+3.98e-01

Pos logit contributions
hower+1.138
pees+1.099
ombat+1.000
pee+0.996
+0.996
Neg logit contributions
theless-1.120
env-0.987
lis-0.901
terday-0.888
ylum-0.872
Token we
Token ID356
Feature activation-9.24e-01

Pos logit contributions
hower+1.442
pees+1.405
pee+1.292
ombat+1.287
+1.280
Neg logit contributions
theless-1.281
env-1.091
terday-1.000
 Haas-0.993
lis-0.991
Token 4
Token ID604
Feature activation-4.35e-01

Pos logit contributions
hower+0.225
pees+0.219
ombat+0.204
pee+0.203
+0.203
Neg logit contributions
theless-0.163
env-0.137
 Haas-0.124
lis-0.123
 Seym-0.122
Token garlic
Token ID17644
Feature activation-1.31e+00

Pos logit contributions
theless+1.486
env+1.302
lis+1.217
 Haas+1.193
 Seym+1.164
Neg logit contributions
hower-1.458
pees-1.393
ombat-1.295
pee-1.271
legate-1.260
Token cloves
Token ID45025
Feature activation-1.15e+00

Pos logit contributions
theless+1.883
env+1.320
lis+1.264
 Haas+1.210
ured+1.157
Neg logit contributions
hower-6.797
pees-6.589
pee-6.221
ombat-6.149
formance-6.126
Token and
Token ID290
Feature activation+2.83e-02

Pos logit contributions
theless+3.306
env+2.782
lis+2.603
 Haas+2.539
 Manson+2.515
Neg logit contributions
hower-4.313
pees-4.231
-3.913
ombat-3.879
legate-3.869
Token the
Token ID262
Feature activation+2.01e-01

Pos logit contributions
hower+0.107
pees+0.105
pee+0.096
ombat+0.096
+0.095
Neg logit contributions
theless-0.086
env-0.074
 Haas-0.067
lis-0.067
terday-0.066
Token ginger
Token ID25072
Feature activation-2.34e+00

Pos logit contributions
hower+0.747
pees+0.731
pee+0.672
ombat+0.668
+0.665
Neg logit contributions
theless-0.629
env-0.536
 Haas-0.486
lis-0.483
terday-0.480
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
theless+5.873
env+5.269
lis+5.040
bread+4.848
 Haas+4.758
Neg logit contributions
hower-8.975
pees-8.556
ombat-7.906
-7.894
legate-7.840
Token Cut
Token ID9712
Feature activation+2.33e-01

Pos logit contributions
theless+0.032
env+0.028
 Haas+0.026
lis+0.026
 Manson+0.025
Neg logit contributions
hower-0.039
pees-0.038
pee-0.035
ombat-0.035
legate-0.034
Token the
Token ID262
Feature activation+1.67e-01

Pos logit contributions
hower+0.877
pees+0.859
pee+0.793
ombat+0.790
+0.786
Neg logit contributions
theless-0.709
env-0.597
lis-0.540
terday-0.539
 Seym-0.534
Token white
Token ID2330
Feature activation-5.50e-01

Pos logit contributions
hower+0.681
pees+0.668
pee+0.620
ombat+0.616
+0.614
Neg logit contributions
theless-0.463
env-0.384
 Haas-0.343
lis-0.342
 Seym-0.337
Token onion
Token ID21670
Feature activation-1.21e+00

Pos logit contributions
theless+1.507
env+1.310
 Haas+1.168
lis+1.166
 Manson+1.143
Neg logit contributions
hower-2.220
pees-2.162
pee-1.992
ombat-1.990
-1.965
TokenEm
Token ID10161
Feature activation-1.26e+00

Pos logit contributions
theless+0.832
env+0.712
lis+0.665
terday+0.650
 Haas+0.639
Neg logit contributions
hower-0.955
pees-0.926
-0.853
ombat-0.844
pee-0.841
Token or
Token ID393
Feature activation+6.60e-03

Pos logit contributions
theless+4.128
env+3.663
lis+3.527
terday+3.466
ylum+3.345
Neg logit contributions
hower-4.051
pees-3.814
-3.714
ombat-3.670
 Dimension-3.487
Token Hang
Token ID24300
Feature activation-1.75e-01

Pos logit contributions
hower+0.023
pees+0.023
+0.021
pee+0.021
ombat+0.021
Neg logit contributions
theless-0.021
env-0.018
 Haas-0.017
lis-0.017
terday-0.017
Token
Token ID564
Feature activation-6.53e-01

Pos logit contributions
theless+0.559
env+0.493
lis+0.466
 Haas+0.455
terday+0.443
Neg logit contributions
hower-0.606
pees-0.583
-0.539
ombat-0.531
pee-0.526
Token
Token ID247
Feature activation-5.82e-01

Pos logit contributions
theless+2.193
env+1.929
lis+1.863
ylum+1.847
 Haas+1.826
Neg logit contributions
hower-2.095
pees-2.020
ombat-1.845
pee-1.828
apor-1.814
TokenEm
Token ID10161
Feature activation-2.33e+00

Pos logit contributions
theless+1.959
env+1.740
lis+1.635
 Haas+1.604
terday+1.582
Neg logit contributions
hower-1.903
pees-1.823
-1.671
ombat-1.660
legate-1.644
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
theless+6.001
env+5.278
lis+5.137
 Haas+5.069
 Hurricanes+4.944
Neg logit contributions
hower-8.336
pees-7.653
-7.461
legate-7.287
apor-7.147
Token
Token ID564
Feature activation-1.05e+00

Pos logit contributions
theless+0.814
env+0.698
 Haas+0.633
lis+0.624
terday+0.619
Neg logit contributions
hower-1.126
pees-1.100
ombat-1.013
pee-1.010
legate-1.007
Token
Token ID250
Feature activation-3.94e-01

Pos logit contributions
theless+3.012
env+2.695
lis+2.547
 Haas+2.482
terday+2.409
Neg logit contributions
hower-3.891
pees-3.616
ombat-3.388
apor-3.374
pee-3.330
TokenP
Token ID47
Feature activation-8.08e-01

Pos logit contributions
theless+1.287
env+1.138
lis+1.044
terday+1.021
 Haas+1.009
Neg logit contributions
hower-1.385
pees-1.325
ombat-1.220
-1.200
pee-1.195
Tokenaying
Token ID8369
Feature activation-4.87e-01

Pos logit contributions
theless+2.718
env+2.392
lis+2.215
URE+2.171
ylum+2.163
Neg logit contributions
hower-2.714
pees-2.585
-2.371
pee-2.332
 Dimension-2.315
Token
Token ID198
Feature activation+1.69e-01

Pos logit contributions
hower+0.709
pees+0.694
pee+0.635
ombat+0.633
+0.632
Neg logit contributions
theless-0.546
env-0.471
lis-0.422
terday-0.417
 Haas-0.414
TokenEmail
Token ID15333
Feature activation-6.86e-01

Pos logit contributions
hower+0.657
pees+0.636
+0.596
ombat+0.584
pee+0.578
Neg logit contributions
theless-0.492
env-0.419
lis-0.379
terday-0.367
URE-0.366
Token This
Token ID770
Feature activation-5.91e-01

Pos logit contributions
theless+1.938
env+1.666
 Haas+1.575
lis+1.536
 Manson+1.501
Neg logit contributions
hower-2.660
-2.533
pees-2.524
pee-2.371
ombat-2.340
Token Story
Token ID8362
Feature activation-9.00e-01

Pos logit contributions
theless+2.366
env+2.102
 Seym+2.024
lis+2.008
 Haas+2.004
Neg logit contributions
pees-1.611
hower-1.584
-1.419
pee-1.359
ombat-1.319
Token Send
Token ID16290
Feature activation-6.08e-01

Pos logit contributions
theless+2.300
env+1.772
lis+1.639
 Haas+1.633
 Manson+1.581
Neg logit contributions
hower-3.870
pees-3.756
-3.514
ombat-3.480
pee-3.449
Token email
Token ID3053
Feature activation-2.32e+00

Pos logit contributions
theless+2.011
env+1.713
terday+1.570
 Seym+1.560
lis+1.557
Neg logit contributions
hower-2.116
pees-2.072
pee-1.900
ombat-1.888
-1.879
Token to
Token ID284
Feature activation-8.99e-01

Pos logit contributions
theless+3.880
env+3.284
 Haas+2.722
lis+2.638
 alternatively+2.575
Neg logit contributions
hower-11.214
-10.889
pees-10.491
pee-10.104
ascal-10.030
Token this
Token ID428
Feature activation-5.51e-01

Pos logit contributions
theless+2.501
env+2.040
lis+1.917
 Haas+1.887
 Manson+1.824
Neg logit contributions
hower-3.694
pees-3.465
-3.267
pee-3.230
ombat-3.195
Token address
Token ID2209
Feature activation-1.18e+00

Pos logit contributions
theless+1.658
env+1.402
terday+1.267
 Seym+1.258
lis+1.253
Neg logit contributions
hower-2.090
pees-2.044
pee-1.881
-1.866
ombat-1.863
Token Enter
Token ID6062
Feature activation-6.76e-01

Pos logit contributions
theless+3.206
env+2.911
 Haas+2.622
lis+2.551
 Seym+2.528
Neg logit contributions
hower-4.624
pees-4.372
-4.192
pee-4.060
legate-4.037
Token Your
Token ID3406
Feature activation+7.49e-01

Pos logit contributions
theless+2.394
env+2.068
 Seym+1.906
terday+1.901
lis+1.887
Neg logit contributions
hower-2.205
pees-2.136
pee-1.943
-1.942
ombat-1.930
Tokenjc
Token ID48055
Feature activation-3.86e-01

Pos logit contributions
hower+1.145
pees+1.100
pee+1.001
ombat+0.997
+0.997
Neg logit contributions
theless-1.231
env-1.090
lis-0.992
terday-0.986
 Haas-0.970
Token'
Token ID6
Feature activation+2.80e-01

Pos logit contributions
theless+1.411
env+1.263
lis+1.152
terday+1.149
 Seym+1.143
Neg logit contributions
hower-1.184
pees-1.143
ombat-1.035
pee-1.028
-1.027
Token OR
Token ID6375
Feature activation-9.74e-01

Pos logit contributions
hower+1.008
pees+0.994
pee+0.916
ombat+0.903
legate+0.898
Neg logit contributions
theless-0.886
env-0.765
lis-0.689
terday-0.689
 Haas-0.685
Token RIGHT
Token ID33621
Feature activation-9.77e-01

Pos logit contributions
theless+2.329
env+1.993
 Haas+1.774
lis+1.718
terday+1.689
Neg logit contributions
pees-4.100
hower-4.066
pee-3.818
-3.783
legate-3.757
Token (
Token ID357
Feature activation-3.04e-01

Pos logit contributions
theless+2.862
env+2.672
 Haas+2.401
URE+2.350
lis+2.327
Neg logit contributions
hower-3.538
pees-3.511
pee-3.174
legate-3.154
ombat-3.085
Token path
Token ID3108
Feature activation-2.30e+00

Pos logit contributions
theless+0.883
env+0.770
terday+0.677
 Haas+0.676
lis+0.670
Neg logit contributions
hower-1.162
pees-1.153
pee-1.063
-1.053
ombat-1.050
Token,
Token ID837
Feature activation-1.48e+00

Pos logit contributions
env+4.365
theless+4.115
lis+3.790
 Haas+3.513
URE+3.507
Neg logit contributions
hower-9.818
pees-9.483
��-8.822
ombat-8.783
��-8.778
Token 5
Token ID642
Feature activation-1.10e+00

Pos logit contributions
theless+3.513
env+2.920
lis+2.587
 Haas+2.559
terday+2.496
Neg logit contributions
hower-6.324
pees-6.143
pee-5.727
ombat-5.697
legate-5.665
Token )
Token ID1267
Feature activation-7.53e-01

Pos logit contributions
theless+3.150
env+2.732
lis+2.438
terday+2.435
 Haas+2.410
Neg logit contributions
hower-4.138
pees-4.088
pee-3.771
ombat-3.727
legate-3.725
Token =
Token ID796
Feature activation-9.57e-01

Pos logit contributions
theless+2.198
env+1.883
lis+1.688
 Seym+1.686
 Haas+1.679
Neg logit contributions
hower-2.846
pees-2.797
pee-2.577
legate-2.555
ombat-2.552
Token '.
Token ID45302
Feature activation-1.01e+00

Pos logit contributions
theless+2.537
env+2.234
 Haas+1.987
lis+1.974
terday+1.969
Neg logit contributions
hower-3.746
pees-3.736
pee-3.477
ombat-3.455
-3.449
Token's
Token ID338
Feature activation-8.40e-01

Pos logit contributions
theless+1.144
env+0.939
terday+0.826
lis+0.826
 Haas+0.821
Neg logit contributions
hower-1.746
pees-1.691
ombat-1.600
-1.580
pee-1.575
Token been
Token ID587
Feature activation+4.82e-01

Pos logit contributions
theless+2.450
env+2.094
 Manson+1.941
 Haas+1.926
terday+1.865
Neg logit contributions
hower-3.202
pees-3.117
ombat-2.901
pee-2.871
-2.848
Token dumped
Token ID24105
Feature activation+1.19e-01

Pos logit contributions
hower+1.812
pees+1.772
pee+1.636
ombat+1.625
+1.619
Neg logit contributions
theless-1.471
env-1.238
terday-1.119
 Seym-1.117
lis-1.112
Token on
Token ID319
Feature activation+8.70e-02

Pos logit contributions
hower+0.417
pees+0.410
ombat+0.376
pee+0.376
+0.373
Neg logit contributions
theless-0.393
env-0.336
terday-0.308
lis-0.303
 Haas-0.303
Token Maggie
Token ID33014
Feature activation-1.39e+00

Pos logit contributions
hower+0.334
pees+0.328
pee+0.303
ombat+0.302
legate+0.299
Neg logit contributions
theless-0.260
env-0.217
terday-0.197
 Seym-0.196
 Haas-0.196
Token Rita
Token ID42605
Feature activation-2.28e+00

Pos logit contributions
theless+4.040
env+3.393
 Haas+3.242
 Manson+3.229
lis+3.124
Neg logit contributions
pees-5.084
hower-5.068
ombat-4.743
pee-4.677
-4.650
Token's
Token ID338
Feature activation-9.37e-01

Pos logit contributions
theless+6.366
env+5.526
 Manson+5.279
 Haas+5.241
lis+5.020
Neg logit contributions
hower-7.970
pees-7.831
ombat-7.447
pee-7.237
-7.195
Token across
Token ID1973
Feature activation+6.98e-03

Pos logit contributions
theless+2.942
env+2.508
 Haas+2.278
 Manson+2.256
terday+2.255
Neg logit contributions
hower-3.315
pees-3.288
ombat-3.016
-3.011
pee-2.996
Token the
Token ID262
Feature activation+1.81e-01

Pos logit contributions
hower+0.028
pees+0.027
pee+0.025
ombat+0.025
+0.025
Neg logit contributions
theless-0.020
env-0.016
 Haas-0.015
terday-0.015
lis-0.015
Token street
Token ID4675
Feature activation+1.45e-01

Pos logit contributions
hower+0.791
pees+0.780
pee+0.725
+0.721
ombat+0.721
Neg logit contributions
theless-0.439
env-0.354
 Haas-0.309
 Seym-0.308
lis-0.307
Token,
Token ID11
Feature activation-2.80e-02

Pos logit contributions
hower+0.514
pees+0.502
pee+0.461
ombat+0.459
+0.457
Neg logit contributions
theless-0.473
env-0.403
terday-0.370
lis-0.366
 Seym-0.366
Token L
Token ID406
Feature activation-2.86e-01

Pos logit contributions
hower+0.247
pees+0.242
pee+0.224
ombat+0.223
+0.222
Neg logit contributions
theless-0.203
env-0.170
 Haas-0.154
terday-0.154
lis-0.153
Tokenared
Token ID1144
Feature activation+7.61e-02

Pos logit contributions
theless+0.967
env+0.872
lis+0.803
URE+0.779
terday+0.779
Neg logit contributions
hower-0.953
pees-0.914
pee-0.826
-0.820
ombat-0.818
Tokeno
Token ID78
Feature activation+3.42e-01

Pos logit contributions
hower+0.234
pees+0.229
+0.206
pee+0.206
ombat+0.203
Neg logit contributions
theless-0.281
env-0.248
lis-0.229
 Haas-0.228
terday-0.226
Token Sen
Token ID2311
Feature activation-6.43e-01

Pos logit contributions
hower+1.242
pees+1.212
ombat+1.109
pee+1.106
+1.103
Neg logit contributions
theless-1.091
env-0.927
lis-0.845
 Haas-0.842
terday-0.839
Token.
Token ID13
Feature activation-3.58e-01

Pos logit contributions
theless+1.907
env+1.725
lis+1.625
 Haas+1.515
iates+1.513
Neg logit contributions
hower-2.371
pees-2.274
-2.109
pee-2.087
ombat-2.070
Token Judith
Token ID39987
Feature activation-2.25e+00

Pos logit contributions
theless+1.073
env+0.929
 Haas+0.845
lis+0.836
 Manson+0.821
Neg logit contributions
hower-1.359
pees-1.310
ombat-1.204
pee-1.201
-1.199
Token Z
Token ID1168
Feature activation-5.98e-01

Pos logit contributions
theless+6.406
env+5.830
 Haas+5.773
 Manson+5.629
lis+5.500
Neg logit contributions
pees-7.693
hower-7.656
ombat-7.015
-6.974
pee-6.924
Tokenaff
Token ID2001
Feature activation+1.42e-01

Pos logit contributions
theless+1.948
env+1.817
lis+1.654
ylum+1.593
 Haas+1.566
Neg logit contributions
hower-2.006
pees-1.928
-1.810
pee-1.742
 Dimension-1.731
Tokenir
Token ID343
Feature activation-4.94e-03

Pos logit contributions
hower+0.483
pees+0.469
+0.432
ombat+0.428
pee+0.427
Neg logit contributions
theless-0.473
env-0.420
lis-0.389
 Haas-0.378
 Seym-0.376
Tokenini
Token ID5362
Feature activation-6.00e-02

Pos logit contributions
theless+0.016
env+0.015
lis+0.014
 Haas+0.013
 Seym+0.013
Neg logit contributions
hower-0.017
pees-0.016
-0.015
ombat-0.015
pee-0.015
Token,
Token ID11
Feature activation-3.13e-01

Pos logit contributions
theless+0.195
env+0.170
 Haas+0.156
lis+0.156
terday+0.153
Neg logit contributions
hower-0.210
pees-0.205
pee-0.188
-0.187
ombat-0.186
Token aren
Token ID3588
Feature activation-2.27e-01

Pos logit contributions
hower+2.289
pees+2.233
pee+2.042
+2.030
ombat+2.025
Neg logit contributions
theless-2.231
env-1.897
terday-1.742
 Seym-1.739
lis-1.738
Token
Token ID447
Feature activation+4.05e-01

Pos logit contributions
theless+0.661
env+0.585
lis+0.535
terday+0.508
 Haas+0.508
Neg logit contributions
hower-0.862
pees-0.836
ombat-0.766
pee-0.764
-0.759
Token
Token ID247
Feature activation+4.89e-01

Pos logit contributions
hower+1.188
pees+1.157
pee+1.041
ombat+1.036
+1.021
Neg logit contributions
theless-1.555
env-1.358
lis-1.268
terday-1.267
 Haas-1.256
Tokent
Token ID83
Feature activation-4.98e-02

Pos logit contributions
hower+1.934
pees+1.896
pee+1.751
ombat+1.746
+1.739
Neg logit contributions
theless-1.409
env-1.159
lis-1.039
terday-1.037
 Seym-1.026
Token as
Token ID355
Feature activation-4.20e-01

Pos logit contributions
theless+0.152
env+0.130
terday+0.117
lis+0.116
 Haas+0.115
Neg logit contributions
hower-0.188
pees-0.181
-0.170
ombat-0.167
pee-0.166
Token forgiving
Token ID43486
Feature activation-2.25e+00

Pos logit contributions
theless+1.385
env+1.214
lis+1.096
terday+1.076
ylum+1.068
Neg logit contributions
hower-1.462
pees-1.406
-1.341
ombat-1.277
legate-1.270
Token.
Token ID13
Feature activation-2.99e-01

Pos logit contributions
theless+5.667
env+4.882
lis+4.459
terday+4.417
 Haas+4.344
Neg logit contributions
hower-8.455
pees-8.232
-7.686
pee-7.576
legate-7.545
Token Most
Token ID4042
Feature activation+7.66e-01

Pos logit contributions
theless+0.903
env+0.782
 Haas+0.729
lis+0.714
 Manson+0.714
Neg logit contributions
hower-1.110
pees-1.090
pee-0.991
legate-0.990
ombat-0.987
Token chefs
Token ID36236
Feature activation+4.98e-01

Pos logit contributions
hower+2.671
pees+2.601
pee+2.376
ombat+2.360
+2.354
Neg logit contributions
theless-2.576
env-2.189
terday-2.021
lis-2.002
 Seym-2.002
Token won
Token ID1839
Feature activation+5.06e-01

Pos logit contributions
hower+1.695
pees+1.647
ombat+1.507
pee+1.498
+1.498
Neg logit contributions
theless-1.696
env-1.469
terday-1.348
lis-1.345
 Seym-1.329
Token
Token ID447
Feature activation-3.25e-02

Pos logit contributions
hower+1.969
pees+1.926
pee+1.785
ombat+1.771
+1.759
Neg logit contributions
theless-1.479
env-1.233
terday-1.109
 Seym-1.106
lis-1.106
Token the
Token ID262
Feature activation-2.09e-02

Pos logit contributions
theless+0.261
env+0.220
lis+0.195
 Haas+0.194
terday+0.193
Neg logit contributions
hower-0.361
pees-0.352
ombat-0.327
pee-0.327
-0.325
Token onion
Token ID21670
Feature activation-9.86e-01

Pos logit contributions
theless+0.062
env+0.052
lis+0.046
 Haas+0.046
terday+0.046
Neg logit contributions
hower-0.082
pees-0.080
pee-0.074
ombat-0.074
-0.074
Token,
Token ID11
Feature activation-3.66e-02

Pos logit contributions
theless+2.811
env+2.423
lis+2.272
 Haas+2.161
 Manson+2.119
Neg logit contributions
hower-3.790
pees-3.664
ombat-3.412
-3.385
pee-3.381
Token garlic
Token ID17644
Feature activation-5.47e-01

Pos logit contributions
theless+0.080
env+0.064
lis+0.055
 Haas+0.054
terday+0.054
Neg logit contributions
hower-0.169
pees-0.166
ombat-0.156
pee-0.156
-0.154
Token and
Token ID290
Feature activation-2.69e-01

Pos logit contributions
theless+1.728
env+1.477
lis+1.354
 Haas+1.339
terday+1.336
Neg logit contributions
hower-1.968
pees-1.929
pee-1.767
ombat-1.752
-1.739
Token ginger
Token ID25072
Feature activation-2.21e+00

Pos logit contributions
theless+0.569
env+0.452
 Haas+0.385
lis+0.376
terday+0.372
Neg logit contributions
hower-1.264
pees-1.244
ombat-1.168
pee-1.167
-1.153
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
theless+5.552
env+4.787
lis+4.353
 Haas+4.152
 Manson+4.142
Neg logit contributions
hower-8.597
pees-8.299
ombat-7.918
-7.915
pee-7.577
Token
Token ID198
Feature activation+4.85e-01

Pos logit contributions
hower+0.917
pees+0.904
pee+0.828
ombat+0.826
+0.819
Neg logit contributions
theless-0.775
env-0.665
 Haas-0.604
lis-0.604
terday-0.592
Token
Token ID198
Feature activation+3.77e-01

Pos logit contributions
hower+1.779
pees+1.743
pee+1.604
ombat+1.592
+1.584
Neg logit contributions
theless-1.517
env-1.281
terday-1.165
lis-1.157
 Seym-1.155
Token5
Token ID20
Feature activation-5.07e-01

Pos logit contributions
hower+1.340
pees+1.311
pee+1.205
ombat+1.196
+1.189
Neg logit contributions
theless-1.224
env-1.039
terday-0.949
 Seym-0.946
lis-0.943
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
theless+1.413
env+1.298
lis+1.173
 Haas+1.132
 Manson+1.103
Neg logit contributions
hower-1.951
pees-1.899
ombat-1.740
-1.735
pee-1.725
Tokenjs
Token ID8457
Feature activation+7.01e-02

Pos logit contributions
hower+1.044
pees+1.013
pee+0.931
ombat+0.931
+0.930
Neg logit contributions
theless-0.894
env-0.772
lis-0.691
terday-0.688
 Seym-0.676
Token'
Token ID6
Feature activation+2.50e-01

Pos logit contributions
hower+0.201
pees+0.196
pee+0.176
ombat+0.174
+0.173
Neg logit contributions
theless-0.277
env-0.241
terday-0.225
 Seym-0.225
lis-0.223
Token OR
Token ID6375
Feature activation-7.72e-01

Pos logit contributions
hower+0.900
pees+0.887
pee+0.817
ombat+0.806
legate+0.802
Neg logit contributions
theless-0.792
env-0.685
terday-0.616
lis-0.616
 Seym-0.612
Token RIGHT
Token ID33621
Feature activation-9.06e-01

Pos logit contributions
theless+1.800
env+1.482
 Haas+1.300
lis+1.280
terday+1.269
Neg logit contributions
pees-3.338
hower-3.322
pee-3.126
legate-3.076
-3.072
Token (
Token ID357
Feature activation-2.88e-01

Pos logit contributions
theless+2.748
env+2.529
 Haas+2.290
URE+2.240
lis+2.233
Neg logit contributions
hower-3.247
pees-3.196
pee-2.885
legate-2.862
ombat-2.813
Token path
Token ID3108
Feature activation-2.20e+00

Pos logit contributions
theless+0.845
env+0.731
terday+0.647
 Haas+0.645
lis+0.640
Neg logit contributions
hower-1.095
pees-1.086
pee-1.001
-0.991
ombat-0.987
Token,
Token ID837
Feature activation-1.51e+00

Pos logit contributions
env+4.161
theless+3.926
lis+3.569
URE+3.375
 Haas+3.367
Neg logit contributions
hower-9.551
pees-9.202
��-8.549
ombat-8.515
��-8.511
Token 10
Token ID838
Feature activation-2.28e-01

Pos logit contributions
theless+3.693
env+3.211
lis+2.807
 Haas+2.788
URE+2.685
Neg logit contributions
hower-6.243
pees-6.080
pee-5.657
ombat-5.638
-5.629
Token )
Token ID1267
Feature activation-4.08e-01

Pos logit contributions
theless+0.718
env+0.606
terday+0.553
 Seym+0.551
lis+0.548
Neg logit contributions
hower-0.833
pees-0.817
pee-0.752
ombat-0.746
-0.741
Token =
Token ID796
Feature activation-7.96e-01

Pos logit contributions
theless+1.231
env+1.050
lis+0.945
 Seym+0.944
terday+0.941
Neg logit contributions
hower-1.519
pees-1.492
pee-1.375
ombat-1.362
legate-1.360
Token '
Token ID705
Feature activation+2.20e-01

Pos logit contributions
theless+2.178
env+1.900
 Haas+1.693
terday+1.692
lis+1.689
Neg logit contributions
hower-3.077
pees-3.069
pee-2.856
ombat-2.830
-2.823
Token.
Token ID13
Feature activation+4.85e-01

Pos logit contributions
theless+1.013
env+0.862
lis+0.782
terday+0.777
 Haas+0.770
Neg logit contributions
hower-1.174
pees-1.146
pee-1.045
-1.038
ombat-1.034
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
hower+1.633
pees+1.595
pee+1.452
ombat+1.444
+1.435
Neg logit contributions
theless-1.677
env-1.447
lis-1.324
terday-1.323
 Seym-1.317
Token
Token ID198
Feature activation+1.39e-01

Pos logit contributions
hower+0.448
pees+0.439
pee+0.402
ombat+0.401
+0.399
Neg logit contributions
theless-0.344
env-0.296
lis-0.265
terday-0.262
 Haas-0.261
TokenE
Token ID36
Feature activation-6.59e-01

Pos logit contributions
hower+0.482
pees+0.462
ombat+0.423
+0.418
pee+0.417
Neg logit contributions
theless-0.466
env-0.412
lis-0.375
terday-0.368
URE-0.360
Tokenarl
Token ID7063
Feature activation+1.71e-01

Pos logit contributions
theless+2.103
env+1.827
lis+1.677
terday+1.651
URE+1.635
Neg logit contributions
hower-2.283
pees-2.206
-2.070
ombat-2.060
legate-2.008
Token Morse
Token ID44049
Feature activation-2.16e+00

Pos logit contributions
hower+0.613
pees+0.608
pee+0.557
ombat+0.555
+0.551
Neg logit contributions
theless-0.554
env-0.466
 Haas-0.425
lis-0.424
terday-0.422
Token K
Token ID509
Feature activation+4.95e-01

Pos logit contributions
theless+6.449
env+5.366
 Haas+5.083
lis+5.055
 Manson+5.011
Neg logit contributions
pees-7.344
hower-7.066
-6.737
ombat-6.645
apor-6.618
TokenZ
Token ID57
Feature activation-1.79e-01

Pos logit contributions
hower+1.688
pees+1.647
+1.495
pee+1.492
legate+1.480
Neg logit contributions
theless-1.676
env-1.450
lis-1.334
 Seym-1.318
 Haas-1.311
Token8
Token ID23
Feature activation-1.79e-01

Pos logit contributions
theless+0.603
env+0.521
 Haas+0.478
lis+0.478
terday+0.473
Neg logit contributions
hower-0.604
pees-0.593
pee-0.539
legate-0.535
-0.532
TokenE
Token ID36
Feature activation-9.07e-02

Pos logit contributions
theless+0.607
env+0.525
lis+0.482
 Haas+0.478
terday+0.477
Neg logit contributions
hower-0.608
pees-0.596
pee-0.539
ombat-0.538
legate-0.535
Token suggest
Token ID1950
Feature activation-3.80e-01

Pos logit contributions
theless+0.302
env+0.260
lis+0.237
terday+0.237
 Haas+0.236
Neg logit contributions
hower-0.312
pees-0.306
pee-0.278
ombat-0.278
legate-0.276
Tokenj
Token ID73
Feature activation+3.37e-01

Pos logit contributions
hower+2.756
pees+2.716
pee+2.442
ombat+2.420
legate+2.403
Neg logit contributions
theless-3.721
env-3.194
 Seym-3.017
terday-3.008
lis-2.961
Token'
Token ID6
Feature activation+5.78e-01

Pos logit contributions
hower+1.047
pees+1.015
ombat+0.919
pee+0.918
+0.909
Neg logit contributions
theless-1.240
env-1.088
terday-1.000
lis-0.997
 Seym-0.994
Token OR
Token ID6375
Feature activation-6.53e-01

Pos logit contributions
hower+2.051
pees+2.010
pee+1.848
ombat+1.831
legate+1.818
Neg logit contributions
theless-1.875
env-1.612
terday-1.465
lis-1.455
 Seym-1.452
Token RIGHT
Token ID33621
Feature activation-7.56e-01

Pos logit contributions
theless+1.913
env+1.714
 Haas+1.533
terday+1.507
lis+1.495
Neg logit contributions
hower-2.435
pees-2.415
pee-2.222
-2.222
ombat-2.203
Token (
Token ID357
Feature activation-2.29e-01

Pos logit contributions
theless+2.743
env+2.399
terday+2.202
lis+2.183
 Haas+2.183
Neg logit contributions
hower-2.358
pees-2.312
pee-2.095
legate-2.064
ombat-2.057
Token path
Token ID3108
Feature activation-2.15e+00

Pos logit contributions
theless+0.668
env+0.595
terday+0.520
 Haas+0.519
lis+0.516
Neg logit contributions
hower-0.867
pees-0.854
pee-0.785
ombat-0.784
-0.783
Token,
Token ID837
Feature activation-1.44e+00

Pos logit contributions
theless+5.174
env+4.798
lis+4.212
terday+4.083
 Haas+4.045
Neg logit contributions
hower-8.439
pees-8.271
pee-7.638
ombat-7.627
legate-7.615
Token 5
Token ID642
Feature activation-1.07e+00

Pos logit contributions
theless+3.680
env+3.337
 Haas+2.901
lis+2.832
URE+2.748
Neg logit contributions
hower-5.681
pees-5.537
-5.207
ombat-5.189
pee-5.129
Token )
Token ID1267
Feature activation-5.95e-01

Pos logit contributions
theless+3.065
env+2.637
lis+2.359
terday+2.347
 Seym+2.337
Neg logit contributions
hower-4.076
pees-4.041
pee-3.725
ombat-3.693
legate-3.683
Token =
Token ID796
Feature activation-9.40e-01

Pos logit contributions
theless+1.758
env+1.500
lis+1.348
 Seym+1.345
terday+1.343
Neg logit contributions
hower-2.238
pees-2.207
pee-2.029
ombat-2.011
legate-2.010
Token '.
Token ID45302
Feature activation-9.11e-01

Pos logit contributions
theless+2.462
env+2.163
 Haas+1.928
terday+1.908
lis+1.897
Neg logit contributions
hower-3.713
pees-3.694
pee-3.437
-3.434
ombat-3.431
Token
Token ID198
Feature activation-3.06e-01

Pos logit contributions
theless+0.084
env+0.071
terday+0.064
 Haas+0.064
lis+0.064
Neg logit contributions
hower-0.105
pees-0.104
pee-0.095
ombat-0.095
legate-0.094
Token
Token ID198
Feature activation-1.86e-01

Pos logit contributions
theless+0.841
env+0.742
lis+0.659
 Haas+0.642
terday+0.636
Neg logit contributions
hower-1.224
pees-1.197
pee-1.095
ombat-1.092
-1.092
TokenHe
Token ID1544
Feature activation+5.76e-01

Pos logit contributions
theless+0.596
env+0.520
lis+0.472
terday+0.465
ylum+0.458
Neg logit contributions
hower-0.678
pees-0.652
ombat-0.600
pee-0.590
-0.589
Token discussed
Token ID6693
Feature activation-2.26e-01

Pos logit contributions
hower+2.171
pees+2.125
pee+1.961
ombat+1.952
+1.939
Neg logit contributions
theless-1.755
env-1.464
terday-1.330
lis-1.318
 Seym-1.314
Token two
Token ID734
Feature activation+3.52e-01

Pos logit contributions
theless+0.673
env+0.571
 Haas+0.522
lis+0.517
 Manson+0.517
Neg logit contributions
hower-0.861
pees-0.844
ombat-0.778
pee-0.773
-0.770
Token options
Token ID3689
Feature activation-2.13e+00

Pos logit contributions
hower+1.379
pees+1.345
ombat+1.244
pee+1.243
+1.231
Neg logit contributions
theless-1.032
env-0.864
terday-0.774
lis-0.772
 Haas-0.772
Token with
Token ID351
Feature activation-3.41e-01

Pos logit contributions
theless+5.277
env+4.518
lis+4.277
terday+4.196
ylum+4.132
Neg logit contributions
hower-8.240
pees-8.090
ombat-7.500
pee-7.437
-7.412
Token the
Token ID262
Feature activation-3.39e-01

Pos logit contributions
theless+0.972
env+0.805
 Haas+0.724
 Seym+0.721
terday+0.720
Neg logit contributions
hower-1.364
pees-1.326
ombat-1.233
pee-1.222
-1.215
Token family
Token ID1641
Feature activation-4.79e-01

Pos logit contributions
theless+0.891
env+0.726
lis+0.651
 Haas+0.651
 Seym+0.647
Neg logit contributions
hower-1.438
pees-1.407
ombat-1.308
-1.291
pee-1.289
Token at
Token ID379
Feature activation-4.31e-01

Pos logit contributions
theless+1.445
env+1.247
terday+1.130
lis+1.130
 Haas+1.117
Neg logit contributions
hower-1.797
pees-1.768
ombat-1.615
pee-1.612
-1.597
Token this
Token ID428
Feature activation-3.22e-01

Pos logit contributions
theless+1.201
env+1.018
 Haas+0.929
 Seym+0.917
lis+0.910
Neg logit contributions
hower-1.723
pees-1.664
pee-1.539
ombat-1.538
-1.536
Token right
Token ID826
Feature activation+5.27e-01

Pos logit contributions
hower+0.616
pees+0.607
ombat+0.563
pee+0.562
+0.556
Neg logit contributions
theless-0.418
env-0.345
 Haas-0.311
lis-0.310
terday-0.310
Token field
Token ID2214
Feature activation-6.72e-01

Pos logit contributions
hower+1.794
pees+1.755
pee+1.596
ombat+1.590
+1.581
Neg logit contributions
theless-1.791
env-1.552
terday-1.425
 Seym-1.411
 Haas-1.410
Token that
Token ID326
Feature activation+5.50e-01

Pos logit contributions
theless+1.973
env+1.658
terday+1.529
 Haas+1.521
lis+1.476
Neg logit contributions
hower-2.570
pees-2.489
ombat-2.321
-2.295
pee-2.271
Token scored
Token ID7781
Feature activation+4.93e-01

Pos logit contributions
hower+2.108
pees+2.070
pee+1.914
ombat+1.900
+1.886
Neg logit contributions
theless-1.640
env-1.367
terday-1.239
 Seym-1.232
lis-1.229
Token Michael
Token ID3899
Feature activation+4.10e-02

Pos logit contributions
hower+1.835
pees+1.800
pee+1.661
ombat+1.648
+1.638
Neg logit contributions
theless-1.523
env-1.277
terday-1.160
 Seym-1.158
lis-1.157
Token Morse
Token ID44049
Feature activation-2.12e+00

Pos logit contributions
hower+0.156
pees+0.152
+0.144
pee+0.142
ombat+0.140
Neg logit contributions
theless-0.123
env-0.107
 Haas-0.099
lis-0.095
 Seym-0.095
Token to
Token ID284
Feature activation+4.31e-01

Pos logit contributions
theless+5.368
env+4.657
 Haas+4.402
lis+4.350
 Manson+4.231
Neg logit contributions
hower-7.894
pees-7.777
ombat-7.395
-7.300
pee-7.184
Token tie
Token ID9839
Feature activation+1.62e+00

Pos logit contributions
hower+1.511
pees+1.473
pee+1.351
ombat+1.347
+1.336
Neg logit contributions
theless-1.432
env-1.221
terday-1.117
lis-1.116
 Haas-1.105
Token and
Token ID290
Feature activation+4.16e-01

Pos logit contributions
hower+5.618
pees+5.518
pee+5.055
legate+5.013
ombat+5.010
Neg logit contributions
theless-5.209
env-4.380
 Seym-4.107
terday-4.034
ylum-3.967
Token Ian
Token ID12930
Feature activation+4.16e-01

Pos logit contributions
hower+1.530
pees+1.497
pee+1.378
ombat+1.376
+1.359
Neg logit contributions
theless-1.319
env-1.112
lis-1.014
 Haas-1.013
terday-1.009
Token Desmond
Token ID41045
Feature activation-6.05e-01

Pos logit contributions
hower+1.655
pees+1.631
+1.525
ombat+1.517
pee+1.504
Neg logit contributions
theless-1.177
env-0.974
 Haas-0.912
lis-0.884
 Manson-0.880
Token
Token ID198
Feature activation-7.36e-01

Pos logit contributions
theless+1.635
env+1.403
lis+1.276
terday+1.269
 Seym+1.263
Neg logit contributions
hower-1.918
pees-1.891
ombat-1.723
pee-1.721
legate-1.701
TokenAn
Token ID2025
Feature activation-8.77e-01

Pos logit contributions
theless+2.755
env+2.557
lis+2.372
URE+2.299
terday+2.279
Neg logit contributions
hower-2.193
pees-2.073
ombat-1.882
-1.820
pee-1.796
Token 1888
Token ID49584
Feature activation+6.36e-01

Pos logit contributions
theless+2.761
env+2.491
lis+2.232
terday+2.224
 Seym+2.212
Neg logit contributions
hower-3.066
pees-3.024
ombat-2.770
pee-2.749
-2.684
Token photo
Token ID4590
Feature activation-2.53e-01

Pos logit contributions
hower+2.666
pees+2.623
pee+2.437
ombat+2.430
+2.410
Neg logit contributions
theless-1.662
env-1.349
 Seym-1.197
terday-1.196
lis-1.188
Token of
Token ID286
Feature activation-3.70e-01

Pos logit contributions
theless+0.817
env+0.716
 Haas+0.650
lis+0.644
ylum+0.638
Neg logit contributions
hower-0.898
pees-0.879
ombat-0.805
-0.797
legate-0.796
Tokenwith
Token ID4480
Feature activation-2.12e+00

Pos logit contributions
theless+1.032
env+0.846
 Seym+0.763
 Haas+0.760
lis+0.751
Neg logit contributions
hower-1.481
pees-1.477
ombat-1.366
pee-1.362
-1.335
Token her
Token ID607
Feature activation-7.64e-01

Pos logit contributions
theless+5.867
env+4.983
lis+4.451
 Haas+4.343
 Manson+4.292
Neg logit contributions
hower-7.762
pees-7.688
ombat-7.290
pee-7.035
-7.031
Token teacher
Token ID4701
Feature activation+3.32e-01

Pos logit contributions
theless+2.185
env+1.840
terday+1.653
 Haas+1.649
 Manson+1.632
Neg logit contributions
hower-2.983
pees-2.946
ombat-2.758
-2.695
pee-2.669
Token,,
Token ID9832
Feature activation-1.36e+00

Pos logit contributions
hower+1.207
pees+1.188
ombat+1.098
pee+1.087
+1.074
Neg logit contributions
theless-1.060
env-0.896
terday-0.819
 Seym-0.811
lis-0.810
Token has
Token ID468
Feature activation-4.39e-01

Pos logit contributions
theless+4.282
env+3.629
lis+3.318
 Haas+3.254
terday+3.249
Neg logit contributions
hower-4.748
pees-4.696
ombat-4.484
-4.244
pee-4.211
Token been
Token ID587
Feature activation+3.84e-01

Pos logit contributions
theless+1.025
env+0.828
lis+0.732
 Seym+0.710
 Haas+0.707
Neg logit contributions
hower-1.967
pees-1.921
ombat-1.794
pee-1.791
-1.783
Token Now
Token ID2735
Feature activation-1.25e-01

Pos logit contributions
theless+0.340
env+0.300
 Haas+0.277
lis+0.275
 Manson+0.273
Neg logit contributions
hower-0.355
pees-0.351
ombat-0.320
pee-0.318
legate-0.311
Token we
Token ID356
Feature activation-6.63e-01

Pos logit contributions
theless+0.389
env+0.332
lis+0.304
terday+0.304
 Haas+0.297
Neg logit contributions
hower-0.458
pees-0.452
ombat-0.418
pee-0.415
-0.410
Token cut
Token ID2005
Feature activation+2.79e-01

Pos logit contributions
theless+2.017
env+1.703
lis+1.607
terday+1.571
 Haas+1.530
Neg logit contributions
hower-2.399
pees-2.364
ombat-2.196
pee-2.165
-2.162
Token the
Token ID262
Feature activation+1.06e-01

Pos logit contributions
hower+1.074
pees+1.053
pee+0.975
ombat+0.974
+0.965
Neg logit contributions
theless-0.822
env-0.688
terday-0.621
lis-0.618
 Seym-0.615
Token bell
Token ID8966
Feature activation-7.15e-01

Pos logit contributions
hower+0.424
pees+0.417
pee+0.386
ombat+0.386
+0.383
Neg logit contributions
theless-0.296
env-0.247
 Haas-0.221
lis-0.221
terday-0.219
Token peppers
Token ID31804
Feature activation-2.12e+00

Pos logit contributions
theless+1.258
env+1.025
lis+0.940
 Haas+0.853
terday+0.797
Neg logit contributions
hower-3.492
pees-3.343
-3.184
ombat-3.159
formance-3.148
Token into
Token ID656
Feature activation-4.71e-01

Pos logit contributions
theless+5.511
env+4.586
terday+4.346
lis+4.311
 Manson+4.185
Neg logit contributions
hower-8.080
pees-7.834
ombat-7.226
-7.218
legate-7.160
Token small
Token ID1402
Feature activation-3.54e-01

Pos logit contributions
theless+1.412
env+1.197
 Haas+1.091
lis+1.076
 Manson+1.074
Neg logit contributions
hower-1.800
pees-1.732
ombat-1.604
pee-1.594
-1.592
Token-
Token ID12
Feature activation-2.94e-01

Pos logit contributions
theless+1.153
env+0.986
lis+0.909
 Haas+0.908
terday+0.892
Neg logit contributions
hower-1.270
pees-1.216
pee-1.113
legate-1.108
ombat-1.108
Tokenmedium
Token ID24132
Feature activation-4.62e-01

Pos logit contributions
theless+1.032
env+0.909
lis+0.848
terday+0.813
 Haas+0.813
Neg logit contributions
hower-0.953
pees-0.924
-0.844
pee-0.832
legate-0.828
Token sized
Token ID19943
Feature activation-2.34e-01

Pos logit contributions
theless+1.638
env+1.415
lis+1.330
 Haas+1.326
terday+1.311
Neg logit contributions
hower-1.516
pees-1.457
pee-1.318
legate-1.313
-1.303
Token
Token ID447
Feature activation+1.36e-01

Pos logit contributions
theless+1.708
env+1.558
lis+1.400
URE+1.341
terday+1.324
Neg logit contributions
hower-2.153
pees-2.039
ombat-1.873
-1.859
pee-1.839
Token
Token ID250
Feature activation+1.31e-01

Pos logit contributions
hower+0.475
pees+0.467
pee+0.432
ombat+0.423
legate+0.422
Neg logit contributions
theless-0.447
env-0.385
terday-0.352
lis-0.351
 Haas-0.347
TokenGrant
Token ID45431
Feature activation-3.23e-01

Pos logit contributions
hower+0.470
pees+0.459
pee+0.422
ombat+0.420
legate+0.417
Neg logit contributions
theless-0.423
env-0.357
lis-0.327
terday-0.326
 Seym-0.322
Tokening
Token ID278
Feature activation-5.28e-01

Pos logit contributions
theless+0.993
env+0.835
 Haas+0.755
terday+0.749
lis+0.744
Neg logit contributions
hower-1.198
pees-1.180
ombat-1.089
pee-1.080
-1.077
Token retro
Token ID12175
Feature activation-1.54e+00

Pos logit contributions
theless+1.579
env+1.321
 Haas+1.213
ylum+1.201
 citizenship+1.190
Neg logit contributions
hower-2.018
pees-1.964
ombat-1.814
pee-1.813
-1.800
Tokenactive
Token ID5275
Feature activation-2.12e+00

Pos logit contributions
theless+4.734
env+4.360
lis+3.913
 citizenship+3.872
terday+3.836
Neg logit contributions
hower-5.345
pees-5.083
-4.737
pee-4.622
quet-4.622
Token immunity
Token ID16989
Feature activation-1.17e+00

Pos logit contributions
theless+5.601
 citizenship+4.960
env+4.835
 Haas+4.372
 Manson+4.306
Neg logit contributions
hower-8.320
pees-7.836
pee-7.270
-7.261
ombat-7.180
Token to
Token ID284
Feature activation-7.86e-01

Pos logit contributions
theless+3.069
env+2.460
terday+2.262
lis+2.238
 Haas+2.236
Neg logit contributions
hower-4.830
pees-4.650
-4.378
ombat-4.353
pee-4.329
Token companies
Token ID2706
Feature activation-1.82e-01

Pos logit contributions
theless+2.279
env+1.885
 Haas+1.775
 citizenship+1.726
lis+1.718
Neg logit contributions
hower-3.038
pees-2.906
-2.731
ombat-2.722
pee-2.677
Token that
Token ID326
Feature activation-5.42e-01

Pos logit contributions
theless+0.582
env+0.497
 Haas+0.467
terday+0.459
lis+0.458
Neg logit contributions
hower-0.654
pees-0.632
-0.580
ombat-0.580
legate-0.577
Token went
Token ID1816
Feature activation-6.86e-01

Pos logit contributions
theless+1.676
env+1.407
terday+1.307
 Haas+1.285
lis+1.262
Neg logit contributions
hower-2.008
pees-1.943
-1.787
pee-1.777
ombat-1.772
Token think
Token ID892
Feature activation-3.04e-01

Pos logit contributions
theless+3.541
env+3.065
terday+2.822
 Haas+2.778
 Hurricanes+2.757
Neg logit contributions
hower-4.848
pees-4.737
-4.561
Û-4.380
ombat-4.336
Token...
Token ID986
Feature activation-3.46e-01

Pos logit contributions
theless+0.976
env+0.841
terday+0.780
 Haas+0.778
lis+0.766
Neg logit contributions
hower-1.080
pees-1.061
ombat-0.965
-0.964
pee-0.963
Token<|endoftext|>
Token ID50256
Feature activation+2.77e-01

Pos logit contributions
theless+1.021
env+0.856
 Haas+0.784
terday+0.776
lis+0.770
Neg logit contributions
hower-1.314
pees-1.303
ombat-1.193
-1.192
pee-1.189
TokenWhat
Token ID2061
Feature activation-1.72e-02

Pos logit contributions
hower+0.953
pees+0.920
ombat+0.836
+0.833
pee+0.832
Neg logit contributions
theless-0.934
env-0.825
lis-0.753
terday-0.741
 Haas-0.727
Token could
Token ID714
Feature activation-1.01e+00

Pos logit contributions
theless+0.051
env+0.044
 Haas+0.039
lis+0.039
terday+0.039
Neg logit contributions
hower-0.065
pees-0.064
ombat-0.059
-0.059
pee-0.059
Token have
Token ID423
Feature activation-2.11e+00

Pos logit contributions
theless+2.869
env+2.579
 Haas+2.321
lis+2.310
terday+2.299
Neg logit contributions
pees-3.722
hower-3.718
-3.423
pee-3.415
ombat-3.385
Token been
Token ID587
Feature activation-1.01e+00

Pos logit contributions
theless+4.382
env+3.805
terday+3.374
 Haas+3.364
lis+3.279
Neg logit contributions
hower-9.077
pees-8.985
pee-8.397
-8.325
formance-8.276
Token:
Token ID25
Feature activation-1.40e+00

Pos logit contributions
theless+2.865
env+2.424
 Haas+2.201
lis+2.186
terday+2.156
Neg logit contributions
hower-3.890
pees-3.850
pee-3.563
-3.533
ombat-3.505
Token Darren
Token ID26203
Feature activation+1.18e-01

Pos logit contributions
theless+4.440
env+3.720
 Haas+3.512
 Seym+3.444
 Manson+3.436
Neg logit contributions
hower-4.769
pees-4.738
pee-4.302
ombat-4.292
-4.290
Token Ar
Token ID943
Feature activation-3.62e-03

Pos logit contributions
hower+0.448
pees+0.438
pee+0.405
ombat+0.404
+0.397
Neg logit contributions
theless-0.355
env-0.300
 Haas-0.275
lis-0.272
 Seym-0.265
Tokenon
Token ID261
Feature activation-5.07e-01

Pos logit contributions
theless+0.013
env+0.012
lis+0.011
iates+0.010
ylum+0.010
Neg logit contributions
hower-0.011
pees-0.011
-0.011
ombat-0.010
 Dimension-0.010
Token little
Token ID1310
Feature activation+1.69e-01

Pos logit contributions
hower+1.741
pees+1.711
ombat+1.565
pee+1.562
+1.550
Neg logit contributions
theless-1.590
env-1.353
lis-1.233
terday-1.229
 Seym-1.228
Token bit
Token ID1643
Feature activation-4.78e-01

Pos logit contributions
hower+0.582
pees+0.569
ombat+0.522
+0.517
pee+0.512
Neg logit contributions
theless-0.567
env-0.488
terday-0.446
 Haas-0.443
lis-0.441
Token like
Token ID588
Feature activation+5.68e-01

Pos logit contributions
theless+1.620
env+1.406
terday+1.291
lis+1.276
 Haas+1.269
Neg logit contributions
hower-1.600
pees-1.562
ombat-1.446
-1.430
formance-1.412
Token a
Token ID257
Feature activation-2.09e-01

Pos logit contributions
hower+2.035
pees+1.993
pee+1.831
ombat+1.820
+1.809
Neg logit contributions
theless-1.830
env-1.547
terday-1.418
 Seym-1.411
lis-1.404
Token spinning
Token ID19493
Feature activation-2.82e-01

Pos logit contributions
theless+0.672
env+0.575
terday+0.520
lis+0.518
 Haas+0.514
Neg logit contributions
hower-0.750
pees-0.737
ombat-0.678
-0.669
pee-0.661
Token cup
Token ID6508
Feature activation+1.34e+00

Pos logit contributions
theless+0.916
env+0.815
lis+0.732
 Haas+0.723
terday+0.721
Neg logit contributions
pees-0.978
hower-0.970
ombat-0.879
pee-0.879
-0.869
Token at
Token ID379
Feature activation-5.47e-02

Pos logit contributions
hower+4.248
pees+4.137
pee+3.792
ombat+3.725
legate+3.723
Neg logit contributions
theless-4.853
env-4.158
terday-3.867
 Seym-3.860
 Haas-3.811
Token Disneyland
Token ID36312
Feature activation+2.26e-01

Pos logit contributions
theless+0.161
env+0.136
 Haas+0.123
lis+0.123
terday+0.122
Neg logit contributions
hower-0.210
pees-0.204
-0.190
ombat-0.189
pee-0.188
Token
Token ID447
Feature activation-6.39e-01

Pos logit contributions
hower+0.803
pees+0.786
pee+0.721
ombat+0.719
+0.712
Neg logit contributions
theless-0.731
env-0.625
terday-0.570
lis-0.568
 Seym-0.564
Token
Token ID251
Feature activation-3.69e-01

Pos logit contributions
theless+2.052
env+1.815
lis+1.757
ylum+1.747
 Haas+1.726
Neg logit contributions
hower-2.121
pees-2.041
ombat-1.883
apor-1.863
pee-1.861
Token),
Token ID828
Feature activation+3.16e-01

Pos logit contributions
theless+1.091
env+0.920
lis+0.847
terday+0.836
 Haas+0.811
Neg logit contributions
hower-1.404
pees-1.379
pee-1.268
ombat-1.264
-1.254
Token logged
Token ID18832
Feature activation+5.07e-02

Pos logit contributions
hower+0.628
pees+0.617
pee+0.569
ombat+0.565
legate+0.558
Neg logit contributions
theless-0.467
env-0.393
 Haas-0.354
 Seym-0.349
lis-0.349
Token over
Token ID625
Feature activation-1.81e-01

Pos logit contributions
hower+0.191
pees+0.187
pee+0.172
ombat+0.171
+0.170
Neg logit contributions
theless-0.155
env-0.130
 Haas-0.118
terday-0.118
 Seym-0.117
Token 6
Token ID718
Feature activation-1.42e-01

Pos logit contributions
theless+0.586
env+0.498
 Haas+0.453
terday+0.453
lis+0.451
Neg logit contributions
hower-0.645
pees-0.633
pee-0.581
-0.580
ombat-0.579
Token500
Token ID4059
Feature activation+1.67e-01

Pos logit contributions
theless+0.488
env+0.423
terday+0.389
lis+0.386
 Haas+0.385
Neg logit contributions
hower-0.476
pees-0.465
pee-0.420
legate-0.419
ombat-0.419
Token n
Token ID299
Feature activation+4.27e-01

Pos logit contributions
hower+0.625
pees+0.613
pee+0.564
legate+0.557
ombat+0.556
Neg logit contributions
theless-0.515
env-0.433
terday-0.392
 Seym-0.390
lis-0.389
Tokenautical
Token ID37073
Feature activation+1.42e+00

Pos logit contributions
hower+1.305
pees+1.231
+1.120
legate+1.117
ombat+1.117
Neg logit contributions
theless-1.596
env-1.430
lis-1.314
terday-1.304
ylum-1.299
Token reactor
Token ID21905
Feature activation-1.06e-01

Pos logit contributions
hower+3.994
pees+3.881
pee+3.480
ombat+3.471
+3.464
Neg logit contributions
theless-5.593
env-4.868
 Seym-4.558
terday-4.554
ylum-4.489
Token-
Token ID12
Feature activation+3.26e-02

Pos logit contributions
theless+0.347
env+0.303
lis+0.273
 Haas+0.273
 Seym+0.272
Neg logit contributions
hower-0.365
pees-0.361
pee-0.333
ombat-0.327
legate-0.326
Tokenyears
Token ID19002
Feature activation-1.57e-01

Pos logit contributions
hower+0.100
pees+0.098
pee+0.088
ombat+0.088
+0.087
Neg logit contributions
theless-0.122
env-0.106
terday-0.098
 Seym-0.097
lis-0.097
Token to
Token ID284
Feature activation+4.31e-01

Pos logit contributions
theless+0.537
env+0.458
terday+0.421
lis+0.418
 Seym+0.416
Neg logit contributions
hower-0.537
pees-0.531
pee-0.483
ombat-0.478
legate-0.475
Token 2015
Token ID1853
Feature activation-1.55e-02

Pos logit contributions
hower+1.548
pees+1.516
pee+1.393
ombat+1.382
+1.375
Neg logit contributions
theless-1.385
env-1.174
terday-1.071
lis-1.066
 Seym-1.064
Token daughter
Token ID4957
Feature activation-1.14e-03

Pos logit contributions
hower+1.177
pees+1.147
pee+1.057
ombat+1.051
+1.048
Neg logit contributions
theless-0.976
env-0.822
terday-0.746
 Haas-0.742
lis-0.742
Token to
Token ID284
Feature activation-1.88e-01

Pos logit contributions
theless+0.003
env+0.003
lis+0.003
 Haas+0.003
terday+0.003
Neg logit contributions
hower-0.004
pees-0.004
-0.004
ombat-0.004
pee-0.004
Token school
Token ID1524
Feature activation+3.39e-01

Pos logit contributions
theless+0.587
env+0.498
 Haas+0.464
lis+0.460
terday+0.453
Neg logit contributions
hower-0.680
pees-0.662
ombat-0.615
-0.612
pee-0.606
Token.
Token ID13
Feature activation+1.65e-01

Pos logit contributions
hower+1.219
pees+1.191
pee+1.092
ombat+1.090
+1.086
Neg logit contributions
theless-1.087
env-0.924
terday-0.847
lis-0.841
 Seym-0.835
Token In
Token ID554
Feature activation+4.03e-01

Pos logit contributions
hower+0.606
pees+0.598
ombat+0.548
pee+0.547
legate+0.540
Neg logit contributions
theless-0.513
env-0.442
 Haas-0.407
terday-0.399
lis-0.398
Token one
Token ID530
Feature activation+1.46e+00

Pos logit contributions
hower+1.496
pees+1.463
pee+1.347
ombat+1.345
+1.340
Neg logit contributions
theless-1.248
env-1.057
 Haas-0.955
lis-0.953
terday-0.953
Token late
Token ID2739
Feature activation+5.57e-01

Pos logit contributions
hower+4.758
pees+4.634
pee+4.245
ombat+4.208
legate+4.190
Neg logit contributions
theless-5.125
env-4.343
 Seym-4.054
terday-4.030
lis-4.001
Token-
Token ID12
Feature activation-5.47e-01

Pos logit contributions
hower+1.862
pees+1.822
pee+1.658
ombat+1.657
+1.644
Neg logit contributions
theless-1.931
env-1.669
terday-1.531
lis-1.529
 Haas-1.521
Tokennight
Token ID3847
Feature activation+4.96e-01

Pos logit contributions
theless+2.049
env+1.862
lis+1.697
terday+1.658
 Haas+1.629
Neg logit contributions
hower-1.611
pees-1.570
-1.454
ombat-1.448
 Dimension-1.389
Token podcast
Token ID9905
Feature activation+8.13e-02

Pos logit contributions
hower+1.746
pees+1.720
ombat+1.562
pee+1.560
+1.552
Neg logit contributions
theless-1.640
env-1.408
terday-1.281
 Haas-1.274
lis-1.266
Token interview
Token ID2720
Feature activation-2.12e-01

Pos logit contributions
hower+0.282
pees+0.282
ombat+0.257
pee+0.255
+0.254
Neg logit contributions
theless-0.265
env-0.228
terday-0.210
lis-0.209
 Haas-0.207
Token Orlando
Token ID13117
Feature activation+3.81e-01

Pos logit contributions
theless+0.144
env+0.124
 Manson+0.114
 Haas+0.112
terday+0.111
Neg logit contributions
hower-0.175
pees-0.168
ombat-0.154
-0.153
pee-0.152
Token gay
Token ID5650
Feature activation+6.40e-01

Pos logit contributions
hower+1.513
pees+1.476
pee+1.345
ombat+1.343
+1.339
Neg logit contributions
theless-1.100
env-0.940
ylum-0.847
lis-0.839
 Haas-0.834
Token bar
Token ID2318
Feature activation+5.03e-01

Pos logit contributions
hower+2.477
pees+2.412
ombat+2.208
pee+2.205
+2.203
Neg logit contributions
theless-1.890
env-1.615
terday-1.448
lis-1.439
ylum-1.430
Token he
Token ID339
Feature activation+4.15e-01

Pos logit contributions
hower+1.658
pees+1.612
pee+1.461
ombat+1.451
+1.448
Neg logit contributions
theless-1.763
env-1.531
terday-1.408
lis-1.404
 Seym-1.400
Token would
Token ID561
Feature activation+1.50e-01

Pos logit contributions
hower+1.643
pees+1.593
pee+1.470
ombat+1.464
+1.463
Neg logit contributions
theless-1.196
env-1.000
terday-0.906
lis-0.893
 Haas-0.886
Token later
Token ID1568
Feature activation+1.37e+00

Pos logit contributions
hower+0.546
pees+0.531
+0.485
pee+0.479
ombat+0.477
Neg logit contributions
theless-0.473
env-0.413
terday-0.376
lis-0.372
 Manson-0.369
Token soak
Token ID33182
Feature activation+1.30e+00

Pos logit contributions
hower+4.637
pees+4.555
pee+4.184
ombat+4.147
legate+4.132
Neg logit contributions
theless-4.591
env-3.890
 Seym-3.621
terday-3.598
lis-3.558
Token in
Token ID287
Feature activation+9.24e-01

Pos logit contributions
hower+4.922
pees+4.830
pee+4.466
ombat+4.423
legate+4.401
Neg logit contributions
theless-3.930
env-3.272
terday-2.977
 Seym-2.971
lis-2.950
Token blood
Token ID2910
Feature activation+4.01e-01

Pos logit contributions
hower+3.452
pees+3.384
pee+3.121
ombat+3.097
+3.080
Neg logit contributions
theless-2.834
env-2.378
terday-2.161
 Seym-2.155
lis-2.140
Token.
Token ID13
Feature activation+7.54e-02

Pos logit contributions
hower+1.461
pees+1.414
+1.303
ombat+1.300
pee+1.292
Neg logit contributions
theless-1.268
env-1.078
terday-0.989
lis-0.977
ylum-0.975
Token\n
Token ID198
Feature activation-1.49e-01

Pos logit contributions
hower+0.293
pees+0.288
pee+0.263
ombat+0.262
legate+0.261
Neg logit contributions
theless-0.220
env-0.188
 Haas-0.173
 Manson-0.171
terday-0.168
Token emissions
Token ID8971
Feature activation-6.15e-01

Pos logit contributions
theless+2.142
env+1.798
terday+1.567
 Haas+1.565
lis+1.527
Neg logit contributions
hower-3.340
pees-3.279
pee-3.047
-3.018
ombat-3.010
Token,
Token ID11
Feature activation-2.09e-02

Pos logit contributions
theless+1.878
env+1.619
terday+1.471
 Haas+1.461
lis+1.437
Neg logit contributions
hower-2.253
pees-2.208
ombat-2.056
legate-2.044
-2.035
Token so
Token ID523
Feature activation+4.50e-01

Pos logit contributions
theless+0.062
env+0.052
 Haas+0.047
 Seym+0.046
terday+0.046
Neg logit contributions
hower-0.081
pees-0.079
ombat-0.073
pee-0.073
-0.072
Token poll
Token ID3278
Feature activation-5.85e-02

Pos logit contributions
hower+1.668
pees+1.637
pee+1.507
ombat+1.504
+1.485
Neg logit contributions
theless-1.404
env-1.188
terday-1.072
 Seym-1.068
lis-1.064
Tokenuters
Token ID5843
Feature activation+4.40e-01

Pos logit contributions
theless+0.175
env+0.152
terday+0.134
lis+0.132
icity+0.131
Neg logit contributions
hower-0.221
pees-0.211
ombat-0.198
-0.196
 Dimension-0.195
Token typically
Token ID6032
Feature activation+1.43e+00

Pos logit contributions
hower+1.598
pees+1.546
ombat+1.428
+1.424
pee+1.416
Neg logit contributions
theless-1.406
env-1.195
terday-1.090
 Haas-1.088
lis-1.068
Token pay
Token ID1414
Feature activation+6.19e-01

Pos logit contributions
hower+4.628
pees+4.575
pee+4.177
ombat+4.110
legate+4.090
Neg logit contributions
theless-4.978
env-4.253
 Seym-4.014
terday-3.943
lis-3.941
Token none
Token ID4844
Feature activation+1.09e+00

Pos logit contributions
hower+2.324
pees+2.270
pee+2.097
ombat+2.087
+2.074
Neg logit contributions
theless-1.891
env-1.590
terday-1.440
 Seym-1.437
lis-1.426
Token of
Token ID286
Feature activation-2.96e-01

Pos logit contributions
hower+3.644
pees+3.576
pee+3.268
ombat+3.226
legate+3.206
Neg logit contributions
theless-3.712
env-3.172
 Seym-2.932
terday-2.924
lis-2.901
Token these
Token ID777
Feature activation+4.79e-01

Pos logit contributions
theless+0.813
env+0.686
terday+0.601
 Haas+0.600
lis+0.592
Neg logit contributions
hower-1.191
pees-1.159
ombat-1.092
-1.085
pee-1.082
Token costs
Token ID3484
Feature activation+4.71e-01

Pos logit contributions
hower+1.908
pees+1.863
pee+1.734
ombat+1.723
legate+1.713
Neg logit contributions
theless-1.367
env-1.130
terday-1.005
 Seym-1.001
lis-0.996
Token the
Token ID262
Feature activation+1.97e-01

Pos logit contributions
theless+0.095
env+0.080
 Haas+0.073
terday+0.073
 Manson+0.072
Neg logit contributions
hower-0.126
pees-0.123
pee-0.114
ombat-0.113
-0.112
Token allegation
Token ID24389
Feature activation+9.46e-01

Pos logit contributions
hower+0.695
pees+0.674
ombat+0.615
pee+0.614
+0.611
Neg logit contributions
theless-0.654
env-0.566
terday-0.516
 Seym-0.515
lis-0.512
Token that
Token ID326
Feature activation-1.04e-02

Pos logit contributions
hower+2.924
pees+2.859
pee+2.595
ombat+2.565
legate+2.547
Neg logit contributions
theless-3.496
env-3.018
terday-2.803
 Seym-2.799
lis-2.775
Token Brah
Token ID36415
Feature activation+6.36e-01

Pos logit contributions
theless+0.038
env+0.033
 Haas+0.031
 Seym+0.031
 Manson+0.031
Neg logit contributions
hower-0.033
pees-0.032
pee-0.029
ombat-0.029
-0.028
Tokenms
Token ID907
Feature activation-1.53e-01

Pos logit contributions
hower+1.750
pees+1.681
+1.512
pee+1.503
ombat+1.499
Neg logit contributions
theless-2.589
env-2.287
lis-2.133
terday-2.120
 Haas-2.115
Token slaughtered
Token ID33850
Feature activation+2.12e-01

Pos logit contributions
theless+0.455
env+0.382
terday+0.352
 Manson+0.345
lis+0.344
Neg logit contributions
hower-0.586
pees-0.571
pee-0.526
ombat-0.523
legate-0.521
Token fel
Token ID10756
Feature activation-3.15e-01

Pos logit contributions
hower+0.823
pees+0.795
pee+0.732
+0.727
ombat+0.726
Neg logit contributions
theless-0.632
env-0.537
 Seym-0.487
 Haas-0.481
terday-0.481
Tokenines
Token ID1127
Feature activation+6.06e-02

Pos logit contributions
theless+1.291
env+1.151
lis+1.078
icity+1.055
 Haas+1.049
Neg logit contributions
hower-0.866
pees-0.796
-0.721
ombat-0.713
pee-0.713
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
hower+0.226
pees+0.219
+0.201
pee+0.200
ombat+0.200
Neg logit contributions
theless-0.190
env-0.160
terday-0.148
 Haas-0.145
lis-0.144
Token transc
Token ID23589
Feature activation+1.78e-01

Pos logit contributions
theless+0.633
env+0.535
 Haas+0.489
 Seym+0.484
 Manson+0.484
Neg logit contributions
hower-0.850
pees-0.807
pee-0.746
-0.740
ombat-0.739
Tokenribed
Token ID8725
Feature activation+3.13e-01

Pos logit contributions
hower+0.747
pees+0.723
+0.678
pee+0.672
legate+0.661
Neg logit contributions
theless-0.469
env-0.387
lis-0.342
terday-0.336
 Seym-0.334
Token might
Token ID1244
Feature activation-1.02e+00

Pos logit contributions
theless+0.402
env+0.333
lis+0.301
terday+0.300
 Haas+0.297
Neg logit contributions
hower-0.520
pees-0.507
ombat-0.468
pee-0.468
-0.463
Token reassure
Token ID37762
Feature activation-8.25e-01

Pos logit contributions
theless+3.153
env+2.757
lis+2.518
terday+2.507
 Haas+2.400
Neg logit contributions
pees-3.623
hower-3.583
-3.294
pee-3.264
ombat-3.240
Token Americans
Token ID3399
Feature activation+1.16e-01

Pos logit contributions
theless+2.321
env+1.975
 Haas+1.800
lis+1.764
 Manson+1.747
Neg logit contributions
hower-3.241
pees-3.137
ombat-2.942
-2.888
pee-2.879
Token afraid
Token ID7787
Feature activation-8.19e-01

Pos logit contributions
hower+0.387
pees+0.379
ombat+0.345
pee+0.343
+0.343
Neg logit contributions
theless-0.396
env-0.342
terday-0.318
lis-0.317
 Haas-0.316
Token of
Token ID286
Feature activation-3.11e-01

Pos logit contributions
theless+2.631
env+2.257
lis+2.098
 Haas+2.072
terday+2.070
Neg logit contributions
hower-2.845
pees-2.837
pee-2.580
ombat-2.526
legate-2.510
Token Trump
Token ID1301
Feature activation+3.67e-01

Pos logit contributions
theless+0.883
env+0.744
lis+0.673
 Haas+0.664
 Manson+0.663
Neg logit contributions
hower-1.219
pees-1.199
pee-1.104
ombat-1.100
-1.084
Token that
Token ID326
Feature activation-1.93e-01

Pos logit contributions
hower+1.287
pees+1.268
pee+1.162
ombat+1.156
+1.139
Neg logit contributions
theless-1.213
env-1.033
lis-0.945
terday-0.941
 Seym-0.930
Token there
Token ID612
Feature activation-4.78e-01

Pos logit contributions
theless+0.585
env+0.496
terday+0.450
lis+0.450
 Haas+0.448
Neg logit contributions
hower-0.726
pees-0.714
pee-0.656
ombat-0.651
-0.642
Token
Token ID447
Feature activation-5.82e-01

Pos logit contributions
theless+1.326
env+1.098
lis+0.992
terday+0.979
ylum+0.964
Neg logit contributions
hower-1.916
pees-1.870
ombat-1.735
pee-1.731
legate-1.702
Token
Token ID247
Feature activation-5.90e-01

Pos logit contributions
theless+1.944
env+1.692
lis+1.650
 Haas+1.631
ylum+1.602
Neg logit contributions
hower-1.911
pees-1.836
ombat-1.693
pee-1.669
apor-1.640
Tokens
Token ID82
Feature activation-6.11e-01

Pos logit contributions
theless+1.480
env+1.175
lis+1.106
terday+1.046
ylum+1.034
Neg logit contributions
hower-2.529
pees-2.470
pee-2.298
ombat-2.282
-2.257
Token or
Token ID393
Feature activation+3.04e-01

Pos logit contributions
hower+0.088
pees+0.087
pee+0.080
legate+0.079
+0.079
Neg logit contributions
theless-0.076
env-0.064
terday-0.059
lis-0.058
 Haas-0.058
Token your
Token ID534
Feature activation+2.07e-01

Pos logit contributions
hower+1.070
pees+1.058
pee+0.972
legate+0.957
+0.953
Neg logit contributions
theless-1.002
env-0.854
terday-0.778
lis-0.774
 Seym-0.773
Token setup
Token ID9058
Feature activation+2.77e-01

Pos logit contributions
hower+0.729
pees+0.722
pee+0.661
legate+0.654
+0.654
Neg logit contributions
theless-0.675
env-0.577
terday-0.528
 Haas-0.524
lis-0.524
Token is
Token ID318
Feature activation+5.40e-01

Pos logit contributions
hower+1.098
pees+1.081
pee+0.999
+0.988
ombat+0.986
Neg logit contributions
theless-0.784
env-0.655
terday-0.587
 Seym-0.581
lis-0.580
Token wrong
Token ID2642
Feature activation+8.79e-01

Pos logit contributions
hower+1.866
pees+1.830
pee+1.668
ombat+1.654
+1.650
Neg logit contributions
theless-1.810
env-1.549
lis-1.409
 Seym-1.405
 Haas-1.404
Token.
Token ID13
Feature activation+4.48e-02

Pos logit contributions
hower+3.001
pees+2.937
pee+2.689
ombat+2.663
+2.648
Neg logit contributions
theless-2.978
env-2.541
terday-2.338
 Seym-2.332
lis-2.317
Token I
Token ID314
Feature activation-1.02e-01

Pos logit contributions
hower+0.155
pees+0.153
pee+0.140
legate+0.137
ombat+0.137
Neg logit contributions
theless-0.149
env-0.128
 Seym-0.117
lis-0.117
 Haas-0.117
Token have
Token ID423
Feature activation-1.33e-01

Pos logit contributions
theless+0.309
env+0.262
terday+0.238
lis+0.237
 Seym+0.236
Neg logit contributions
hower-0.377
pees-0.374
pee-0.342
-0.340
ombat-0.339
Token been
Token ID587
Feature activation+3.67e-01

Pos logit contributions
theless+0.372
env+0.308
lis+0.276
 Seym+0.275
terday+0.273
Neg logit contributions
hower-0.535
pees-0.525
pee-0.486
-0.484
ombat-0.482
Token in
Token ID287
Feature activation+2.28e-01

Pos logit contributions
hower+1.164
pees+1.149
+1.042
pee+1.035
ombat+1.023
Neg logit contributions
theless-1.329
env-1.166
 Seym-1.063
lis-1.060
terday-1.058
Token the
Token ID262
Feature activation+7.95e-02

Pos logit contributions
hower+0.861
pees+0.843
pee+0.780
+0.775
legate+0.773
Neg logit contributions
theless-0.691
env-0.587
terday-0.531
 Haas-0.523
lis-0.522
Token how
Token ID703
Feature activation+1.22e+00

Pos logit contributions
hower+1.359
pees+1.317
+1.213
ombat+1.208
pee+1.199
Neg logit contributions
theless-1.244
env-1.061
lis-0.961
terday-0.958
 Haas-0.956
Token do
Token ID466
Feature activation+2.17e-01

Pos logit contributions
hower+3.943
pees+3.861
pee+3.524
ombat+3.479
legate+3.457
Neg logit contributions
theless-4.355
env-3.749
 Seym-3.472
terday-3.472
lis-3.441
Token you
Token ID345
Feature activation+4.08e-01

Pos logit contributions
hower+0.799
pees+0.779
pee+0.716
ombat+0.713
+0.710
Neg logit contributions
theless-0.680
env-0.576
terday-0.522
 Seym-0.521
lis-0.521
Token keep
Token ID1394
Feature activation-4.10e-01

Pos logit contributions
hower+1.549
pees+1.509
+1.395
pee+1.378
ombat+1.375
Neg logit contributions
theless-1.248
env-1.059
terday-0.954
lis-0.951
 Haas-0.924
Token an
Token ID281
Feature activation+1.34e-01

Pos logit contributions
theless+1.242
env+1.059
 Haas+0.960
 Manson+0.947
lis+0.946
Neg logit contributions
hower-1.580
pees-1.503
ombat-1.401
-1.400
pee-1.391
Token even
Token ID772
Feature activation+1.15e-01

Pos logit contributions
hower+0.550
pees+0.531
pee+0.491
ombat+0.490
legate+0.488
Neg logit contributions
theless-0.370
env-0.318
lis-0.276
 Haas-0.275
terday-0.274
Token ke
Token ID885
Feature activation+5.00e-01

Pos logit contributions
hower+0.408
pees+0.399
+0.363
ombat+0.362
pee+0.362
Neg logit contributions
theless-0.376
env-0.324
lis-0.293
 Haas-0.292
terday-0.291
Tokenel
Token ID417
Feature activation+3.65e-01

Pos logit contributions
hower+1.998
pees+1.950
+1.816
ombat+1.808
pee+1.800
Neg logit contributions
theless-1.388
env-1.179
lis-1.062
terday-1.015
ylum-1.012
Token?
Token ID30
Feature activation+5.55e-01

Pos logit contributions
hower+1.291
pees+1.262
ombat+1.159
+1.156
pee+1.152
Neg logit contributions
theless-1.190
env-1.019
lis-0.925
terday-0.925
 Seym-0.912
Token\n
Token ID198
Feature activation+6.93e-01

Pos logit contributions
hower+2.017
pees+1.974
pee+1.814
ombat+1.803
+1.796
Neg logit contributions
theless-1.754
env-1.495
lis-1.353
terday-1.353
 Seym-1.352
Token\n
Token ID198
Feature activation+3.67e-01

Pos logit contributions
hower+2.441
pees+2.392
pee+2.211
ombat+2.184
legate+2.179
Neg logit contributions
theless-2.271
env-1.887
 Seym-1.784
terday-1.754
lis-1.715
Token men
Token ID1450
Feature activation+9.06e-01

Pos logit contributions
theless+0.462
env+0.383
 Haas+0.345
lis+0.345
 Seym+0.343
Neg logit contributions
hower-0.638
pees-0.620
-0.581
ombat-0.581
pee-0.572
Token.
Token ID13
Feature activation+5.35e-04

Pos logit contributions
hower+3.042
pees+2.976
pee+2.721
ombat+2.697
+2.680
Neg logit contributions
theless-3.114
env-2.663
terday-2.453
 Seym-2.448
lis-2.432
Token Then
Token ID3244
Feature activation+5.31e-01

Pos logit contributions
hower+0.002
pees+0.002
ombat+0.002
pee+0.002
legate+0.002
Neg logit contributions
theless-0.002
env-0.001
lis-0.001
 Haas-0.001
 Manson-0.001
Token I
Token ID314
Feature activation-3.76e-01

Pos logit contributions
hower+1.826
pees+1.787
pee+1.633
ombat+1.629
+1.620
Neg logit contributions
theless-1.794
env-1.533
terday-1.409
lis-1.402
 Seym-1.397
Token
Token ID447
Feature activation-5.12e-02

Pos logit contributions
theless+1.197
env+1.027
lis+0.951
terday+0.920
 Haas+0.920
Neg logit contributions
hower-1.336
pees-1.302
ombat-1.200
-1.199
legate-1.186
Token
Token ID247
Feature activation+1.58e-01

Pos logit contributions
theless+0.190
env+0.168
lis+0.160
 Haas+0.156
ylum+0.155
Neg logit contributions
hower-0.155
pees-0.150
pee-0.134
ombat-0.134
-0.131
Tokenll
Token ID297
Feature activation+6.21e-01

Pos logit contributions
hower+0.648
pees+0.635
pee+0.589
ombat+0.586
+0.584
Neg logit contributions
theless-0.428
env-0.350
lis-0.313
terday-0.312
 Seym-0.309
Token say
Token ID910
Feature activation+6.93e-02

Pos logit contributions
hower+2.022
pees+1.966
+1.781
ombat+1.779
pee+1.778
Neg logit contributions
theless-2.205
env-1.910
lis-1.764
terday-1.754
 Haas-1.734
Token,
Token ID11
Feature activation-2.67e-01

Pos logit contributions
hower+0.246
pees+0.240
ombat+0.220
+0.219
pee+0.219
Neg logit contributions
theless-0.226
env-0.194
terday-0.177
lis-0.176
 Haas-0.175
Token
Token ID564
Feature activation-5.33e-01

Pos logit contributions
theless+0.742
env+0.617
lis+0.559
 Haas+0.554
terday+0.551
Neg logit contributions
hower-1.067
pees-1.038
-0.965
ombat-0.965
legate-0.955
Token
Token ID246
Feature activation-2.64e-01

Pos logit contributions
theless+1.784
env+1.540
lis+1.441
 Haas+1.422
terday+1.417
Neg logit contributions
hower-1.789
pees-1.724
ombat-1.572
pee-1.572
legate-1.555
Token Sea
Token ID6896
Feature activation+1.30e+00

Pos logit contributions
hower+1.147
pees+1.121
pee+1.021
ombat+1.011
+1.005
Neg logit contributions
theless-1.251
env-1.077
 Seym-0.993
lis-0.993
terday-0.992
Token
Token ID447
Feature activation+7.59e-02

Pos logit contributions
hower+4.699
pees+4.584
pee+4.229
ombat+4.179
legate+4.178
Neg logit contributions
theless-4.079
env-3.448
 Seym-3.182
terday-3.138
 Haas-3.090
Token
Token ID251
Feature activation-2.85e-01

Pos logit contributions
hower+0.250
pees+0.246
pee+0.224
ombat+0.222
legate+0.217
Neg logit contributions
theless-0.263
env-0.225
lis-0.212
terday-0.212
 Haas-0.209
Token premiered
Token ID44119
Feature activation+2.11e-01

Pos logit contributions
theless+0.868
env+0.725
lis+0.675
terday+0.655
 Haas+0.650
Neg logit contributions
hower-1.066
pees-1.051
pee-0.972
ombat-0.964
-0.952
Token was
Token ID373
Feature activation+2.73e-01

Pos logit contributions
hower+0.764
pees+0.749
pee+0.690
+0.685
ombat+0.685
Neg logit contributions
theless-0.681
env-0.572
terday-0.528
lis-0.524
 Haas-0.516
Token reminiscent
Token ID24960
Feature activation-2.05e-01

Pos logit contributions
hower+1.011
pees+0.986
pee+0.909
+0.906
ombat+0.899
Neg logit contributions
theless-0.849
env-0.718
lis-0.653
terday-0.652
 Seym-0.644
Token of
Token ID286
Feature activation+1.28e-02

Pos logit contributions
theless+0.756
env+0.654
terday+0.608
 Seym+0.605
lis+0.605
Neg logit contributions
hower-0.639
pees-0.625
pee-0.567
ombat-0.559
-0.557
Token when
Token ID618
Feature activation+8.30e-02

Pos logit contributions
hower+0.049
pees+0.048
pee+0.045
ombat+0.044
+0.044
Neg logit contributions
theless-0.038
env-0.032
lis-0.029
terday-0.029
 Haas-0.029
Token
Token ID564
Feature activation-7.23e-01

Pos logit contributions
hower+0.305
pees+0.300
pee+0.276
ombat+0.274
+0.272
Neg logit contributions
theless-0.260
env-0.220
lis-0.201
terday-0.201
 Haas-0.199
Token
Token ID250
Feature activation-2.60e-01

Pos logit contributions
theless+2.310
env+2.055
lis+1.889
terday+1.862
 Haas+1.855
Neg logit contributions
hower-2.512
pees-2.391
ombat-2.200
pee-2.176
legate-2.155
TokenBoy
Token ID26554
Feature activation-1.12e-01

Pos logit contributions
theless+0.881
env+0.773
lis+0.707
terday+0.696
 Haas+0.687
Neg logit contributions
hower-0.885
pees-0.855
ombat-0.786
-0.774
pee-0.771
Token,
Token ID11
Feature activation-3.24e-01

Pos logit contributions
theless+1.401
env+1.167
terday+1.042
lis+1.033
 Seym+1.029
Neg logit contributions
hower-2.115
pees-2.047
pee-1.906
ombat-1.902
-1.888
Token on
Token ID319
Feature activation-2.46e-01

Pos logit contributions
theless+0.965
env+0.827
terday+0.757
 Haas+0.756
ylum+0.742
Neg logit contributions
hower-1.225
pees-1.170
ombat-1.097
-1.091
pee-1.085
Token average
Token ID2811
Feature activation+6.54e-01

Pos logit contributions
theless+0.565
env+0.456
terday+0.399
 Haas+0.390
lis+0.385
Neg logit contributions
hower-1.108
pees-1.079
ombat-1.020
-1.013
pee-1.011
Token,
Token ID11
Feature activation-2.68e-01

Pos logit contributions
hower+1.996
pees+1.931
pee+1.750
ombat+1.731
legate+1.727
Neg logit contributions
theless-2.433
env-2.092
 Seym-1.939
lis-1.926
terday-1.922
Token simply
Token ID2391
Feature activation-4.58e-01

Pos logit contributions
theless+0.774
env+0.639
terday+0.576
 Seym+0.575
lis+0.571
Neg logit contributions
hower-1.047
pees-1.031
pee-0.955
ombat-0.946
-0.943
Token to
Token ID284
Feature activation+1.18e-02

Pos logit contributions
theless+1.214
env+0.997
terday+0.904
ylum+0.879
 Haas+0.876
Neg logit contributions
hower-1.895
pees-1.831
ombat-1.722
-1.711
pee-1.702
Token determine
Token ID5004
Feature activation+5.21e-02

Pos logit contributions
hower+0.042
pees+0.040
+0.037
pee+0.037
ombat+0.037
Neg logit contributions
theless-0.039
env-0.033
lis-0.030
terday-0.030
 Haas-0.030
Token whether
Token ID1771
Feature activation+3.14e-01

Pos logit contributions
hower+0.206
pees+0.201
ombat+0.187
pee+0.187
+0.185
Neg logit contributions
theless-0.149
env-0.123
terday-0.112
ylum-0.110
lis-0.110
Token to
Token ID284
Feature activation-7.64e-01

Pos logit contributions
hower+1.195
pees+1.163
pee+1.076
ombat+1.073
+1.066
Neg logit contributions
theless-0.948
env-0.792
terday-0.720
lis-0.715
 Haas-0.713
Token accept
Token ID2453
Feature activation-4.46e-01

Pos logit contributions
theless+2.434
env+2.126
lis+1.970
terday+1.916
 Haas+1.896
Neg logit contributions
hower-2.711
pees-2.614
pee-2.378
ombat-2.369
-2.365
Token a
Token ID257
Feature activation+1.72e-01

Pos logit contributions
theless+1.390
env+1.175
 Haas+1.080
ylum+1.079
lis+1.067
Neg logit contributions
hower-1.648
pees-1.585
ombat-1.494
pee-1.468
-1.459
Token\n
Token ID198
Feature activation+3.84e-01

Pos logit contributions
hower+0.762
pees+0.745
pee+0.684
ombat+0.681
+0.678
Neg logit contributions
theless-0.603
env-0.517
lis-0.464
terday-0.460
 Haas-0.457
TokenIn
Token ID818
Feature activation+1.12e-01

Pos logit contributions
hower+1.305
pees+1.259
ombat+1.150
pee+1.144
+1.141
Neg logit contributions
theless-1.315
env-1.154
lis-1.056
terday-1.040
ylum-1.024
Token later
Token ID1568
Feature activation+2.84e-01

Pos logit contributions
hower+0.420
pees+0.413
pee+0.381
ombat+0.380
+0.379
Neg logit contributions
theless-0.339
env-0.290
lis-0.263
terday-0.261
 Haas-0.259
Token desperation
Token ID31265
Feature activation-3.08e-01

Pos logit contributions
hower+0.982
pees+0.956
pee+0.877
ombat+0.864
+0.858
Neg logit contributions
theless-0.966
env-0.824
terday-0.757
lis-0.753
 Seym-0.752
Token,
Token ID11
Feature activation-9.67e-02

Pos logit contributions
theless+1.000
env+0.859
terday+0.810
lis+0.799
 Haas+0.789
Neg logit contributions
hower-1.081
pees-1.056
ombat-0.967
pee-0.966
-0.959
Token I
Token ID314
Feature activation-4.45e-01

Pos logit contributions
theless+0.309
env+0.262
terday+0.241
 Seym+0.241
lis+0.240
Neg logit contributions
hower-0.351
pees-0.343
pee-0.316
ombat-0.315
-0.313
Token also
Token ID635
Feature activation+4.63e-01

Pos logit contributions
theless+1.328
env+1.103
lis+1.005
terday+0.996
 Haas+0.981
Neg logit contributions
hower-1.694
pees-1.665
-1.529
ombat-1.529
pee-1.525
Token gave
Token ID2921
Feature activation-5.85e-01

Pos logit contributions
hower+1.794
pees+1.761
pee+1.626
ombat+1.615
+1.614
Neg logit contributions
theless-1.366
env-1.136
terday-1.027
lis-1.020
 Seym-1.016
Token a
Token ID257
Feature activation-1.15e-01

Pos logit contributions
theless+1.700
env+1.464
 Seym+1.305
lis+1.298
 Haas+1.286
Neg logit contributions
hower-2.252
pees-2.177
ombat-2.043
pee-2.033
-2.016
Token link
Token ID2792
Feature activation+3.95e-01

Pos logit contributions
theless+0.356
env+0.302
 Haas+0.275
lis+0.275
terday+0.272
Neg logit contributions
hower-0.427
pees-0.421
ombat-0.386
pee-0.384
-0.382
Token to
Token ID284
Feature activation+2.57e-01

Pos logit contributions
hower+1.555
pees+1.524
pee+1.410
ombat+1.402
+1.401
Neg logit contributions
theless-1.133
env-0.945
terday-0.851
 Seym-0.842
lis-0.842
Token could
Token ID714
Feature activation+3.93e-02

Pos logit contributions
hower+0.056
pees+0.054
ombat+0.050
pee+0.050
+0.049
Neg logit contributions
theless-0.048
env-0.042
lis-0.038
 Haas-0.038
terday-0.038
Token work
Token ID670
Feature activation-6.17e-01

Pos logit contributions
hower+0.140
pees+0.138
pee+0.126
+0.125
ombat+0.124
Neg logit contributions
theless-0.125
env-0.109
lis-0.100
 Haas-0.099
terday-0.098
Token for
Token ID329
Feature activation-4.97e-01

Pos logit contributions
theless+1.897
env+1.649
 Haas+1.503
terday+1.483
lis+1.473
Neg logit contributions
hower-2.240
pees-2.205
ombat-2.042
-2.038
pee-2.027
Token every
Token ID790
Feature activation+9.45e-02

Pos logit contributions
theless+1.437
env+1.229
 Haas+1.124
lis+1.088
 Manson+1.077
Neg logit contributions
hower-1.933
pees-1.856
ombat-1.739
pee-1.737
-1.734
Token conceivable
Token ID40324
Feature activation-2.79e-01

Pos logit contributions
hower+0.346
pees+0.341
ombat+0.316
pee+0.312
+0.307
Neg logit contributions
theless-0.298
env-0.254
 Haas-0.230
terday-0.230
lis-0.228
Token type
Token ID2099
Feature activation-3.06e-01

Pos logit contributions
theless+0.816
env+0.697
 Haas+0.625
terday+0.614
lis+0.607
Neg logit contributions
hower-1.070
pees-1.068
pee-0.982
ombat-0.979
-0.962
Token of
Token ID286
Feature activation-4.70e-01

Pos logit contributions
theless+1.124
env+0.977
terday+0.903
 Seym+0.902
lis+0.900
Neg logit contributions
hower-0.947
pees-0.929
pee-0.841
ombat-0.830
legate-0.824
Token company
Token ID1664
Feature activation-6.45e-02

Pos logit contributions
theless+1.337
env+1.137
 Haas+1.036
 citizenship+0.991
lis+0.986
Neg logit contributions
hower-1.825
pees-1.813
ombat-1.669
pee-1.664
-1.638
Token.
Token ID13
Feature activation-2.39e-01

Pos logit contributions
theless+0.206
env+0.180
 Haas+0.162
lis+0.162
terday+0.161
Neg logit contributions
hower-0.230
pees-0.226
pee-0.206
ombat-0.205
-0.203
Token How
Token ID1374
Feature activation+7.84e-01

Pos logit contributions
theless+0.758
env+0.665
 Haas+0.612
 Manson+0.595
lis+0.595
Neg logit contributions
hower-0.866
pees-0.841
ombat-0.773
pee-0.766
legate-0.761
Token do
Token ID466
Feature activation-3.82e-01

Pos logit contributions
hower+2.836
pees+2.769
pee+2.540
ombat+2.534
+2.521
Neg logit contributions
theless-2.521
env-2.130
terday-1.934
 Seym-1.927
lis-1.926
Tokenoses
Token ID4629
Feature activation+2.61e-01

Pos logit contributions
theless+1.428
env+1.270
lis+1.150
ured+1.103
iates+1.094
Neg logit contributions
hower-2.015
pees-1.928
pee-1.782
-1.775
legate-1.733
Token over
Token ID625
Feature activation-2.92e-01

Pos logit contributions
hower+0.915
pees+0.894
pee+0.816
+0.814
ombat+0.810
Neg logit contributions
theless-0.863
env-0.743
lis-0.685
terday-0.680
 Haas-0.668
Token the
Token ID262
Feature activation-2.36e-01

Pos logit contributions
theless+0.817
env+0.692
lis+0.614
terday+0.607
 Haas+0.606
Neg logit contributions
hower-1.171
pees-1.144
pee-1.057
-1.049
legate-1.039
Token entire
Token ID2104
Feature activation-4.94e-01

Pos logit contributions
theless+0.742
env+0.642
 Haas+0.584
lis+0.580
 Seym+0.574
Neg logit contributions
hower-0.872
pees-0.851
pee-0.775
-0.766
legate-0.762
Token palm
Token ID18057
Feature activation+5.12e-01

Pos logit contributions
theless+1.587
env+1.386
 Haas+1.277
lis+1.252
 Manson+1.243
Neg logit contributions
hower-1.778
pees-1.746
pee-1.576
-1.556
legate-1.548
Tokenopl
Token ID20106
Feature activation-3.42e-01

Pos logit contributions
hower+1.823
pees+1.796
pee+1.639
ombat+1.619
+1.613
Neg logit contributions
theless-1.663
env-1.419
lis-1.295
terday-1.289
 Seym-1.286
Tokenant
Token ID415
Feature activation+1.28e-01

Pos logit contributions
theless+1.243
env+1.116
ylum+1.044
lis+1.022
ured+1.001
Neg logit contributions
hower-1.091
pees-1.056
-0.956
pee-0.932
 Dimension-0.902
Tokenar
Token ID283
Feature activation-4.09e-01

Pos logit contributions
hower+0.357
pees+0.353
pee+0.311
+0.306
ombat+0.305
Neg logit contributions
theless-0.514
env-0.453
lis-0.424
ylum-0.422
terday-0.420
Token surfaces
Token ID16649
Feature activation-1.35e-01

Pos logit contributions
theless+1.263
env+1.102
 Haas+1.005
lis+1.000
ylum+0.981
Neg logit contributions
hower-1.514
pees-1.494
pee-1.355
-1.330
ombat-1.322
Token,
Token ID11
Feature activation-5.99e-02

Pos logit contributions
theless+0.436
env+0.373
lis+0.343
terday+0.341
ylum+0.338
Neg logit contributions
hower-0.481
pees-0.472
pee-0.432
-0.428
legate-0.426
Token beginning
Token ID3726
Feature activation-2.63e-01

Pos logit contributions
theless+0.181
env+0.153
 Haas+0.139
lis+0.138
 Seym+0.137
Neg logit contributions
hower-0.229
pees-0.224
pee-0.205
ombat-0.203
legate-0.203
Tokenf
Token ID69
Feature activation-1.39e-01

Pos logit contributions
hower+0.886
pees+0.861
+0.794
ombat+0.792
pee+0.792
Neg logit contributions
theless-0.702
env-0.603
lis-0.542
terday-0.534
ylum-0.524
Token);
Token ID1776
Feature activation+9.45e-01

Pos logit contributions
theless+0.364
env+0.312
lis+0.273
terday+0.267
 Haas+0.263
Neg logit contributions
hower-0.572
pees-0.560
pee-0.516
-0.514
ombat-0.512
Token RAW
Token ID33782
Feature activation+1.58e-01

Pos logit contributions
hower+2.710
pees+2.641
pee+2.375
ombat+2.349
+2.331
Neg logit contributions
theless-3.713
env-3.242
terday-3.024
 Seym-3.019
lis-3.002
Token_
Token ID62
Feature activation+7.38e-01

Pos logit contributions
hower+0.635
pees+0.619
pee+0.569
ombat+0.566
+0.565
Neg logit contributions
theless-0.436
env-0.375
lis-0.330
terday-0.326
 Haas-0.323
TokenASS
Token ID10705
Feature activation+2.04e-01

Pos logit contributions
hower+2.465
pees+2.410
pee+2.202
ombat+2.182
+2.169
Neg logit contributions
theless-2.550
env-2.185
terday-2.012
 Seym-2.006
lis-1.997
TokenERT
Token ID17395
Feature activation-1.44e+00

Pos logit contributions
hower+0.574
pees+0.555
pee+0.494
ombat+0.486
+0.483
Neg logit contributions
theless-0.809
env-0.724
terday-0.671
lis-0.667
URE-0.662
Token(
Token ID7
Feature activation-6.10e-01

Pos logit contributions
theless+3.843
env+3.788
URE+3.410
lis+3.201
ured+3.111
Neg logit contributions
hower-5.461
pees-5.221
ombat-4.701
pee-4.647
-4.629
Tokenit
Token ID270
Feature activation-1.25e+00

Pos logit contributions
theless+1.773
env+1.685
lis+1.453
sov+1.396
terday+1.377
Neg logit contributions
hower-2.316
pees-2.175
-2.038
ombat-2.009
Û-1.953
Token->
Token ID3784
Feature activation-7.28e-01

Pos logit contributions
env+2.654
theless+2.595
lis+2.264
URE+2.034
 Haas+2.015
Neg logit contributions
hower-5.550
pees-5.310
ombat-4.940
-4.888
��-4.869
Tokenvec
Token ID35138
Feature activation-7.01e-01

Pos logit contributions
theless+1.942
env+1.769
lis+1.516
terday+1.423
sov+1.408
Neg logit contributions
hower-2.942
pees-2.812
-2.642
ombat-2.600
 Dimension-2.589
Token()
Token ID3419
Feature activation-9.43e-01

Pos logit contributions
theless+2.582
env+2.418
lis+2.165
 Haas+2.140
terday+2.125
Neg logit contributions
hower-2.113
pees-2.025
ombat-1.804
-1.784
pee-1.781
Token Un
Token ID791
Feature activation-5.16e-01

Pos logit contributions
theless+1.397
env+1.186
lis+1.082
terday+1.072
ylum+1.071
Neg logit contributions
hower-1.565
pees-1.535
pee-1.410
-1.387
ombat-1.386
Tokeniting
Token ID1780
Feature activation-7.21e-01

Pos logit contributions
theless+1.607
env+1.568
lis+1.431
iates+1.420
icity+1.395
Neg logit contributions
hower-1.741
pees-1.671
-1.551
pee-1.509
Û-1.476
Token Care
Token ID7276
Feature activation+4.05e-01

Pos logit contributions
theless+2.223
env+1.861
lis+1.704
terday+1.703
 Seym+1.690
Neg logit contributions
hower-2.664
pees-2.553
pee-2.361
-2.354
ombat-2.334
Token told
Token ID1297
Feature activation-4.41e-01

Pos logit contributions
hower+1.447
pees+1.413
pee+1.300
ombat+1.286
+1.277
Neg logit contributions
theless-1.317
env-1.122
terday-1.021
lis-1.021
 Seym-1.012
Token the
Token ID262
Feature activation-3.16e-01

Pos logit contributions
theless+1.252
env+1.044
terday+0.957
 Seym+0.948
 Haas+0.936
Neg logit contributions
hower-1.758
pees-1.711
pee-1.597
-1.567
ombat-1.540
Token ABC
Token ID9738
Feature activation-1.35e+00

Pos logit contributions
theless+0.840
env+0.703
terday+0.626
 Seym+0.625
lis+0.617
Neg logit contributions
hower-1.318
pees-1.286
pee-1.194
ombat-1.180
-1.179
Token demand
Token ID3512
Feature activation+7.91e-01

Pos logit contributions
theless+3.813
env+3.243
 Haas+3.012
ylum+2.988
terday+2.987
Neg logit contributions
hower-5.196
pees-5.100
pee-4.743
ombat-4.592
-4.586
Token for
Token ID329
Feature activation+2.92e-01

Pos logit contributions
hower+2.965
pees+2.907
pee+2.686
ombat+2.664
+2.648
Neg logit contributions
theless-2.408
env-2.011
terday-1.829
 Seym-1.827
lis-1.811
Token financial
Token ID3176
Feature activation-5.22e-01

Pos logit contributions
hower+1.137
pees+1.110
pee+1.028
ombat+1.024
+1.017
Neg logit contributions
theless-0.858
env-0.717
terday-0.645
 Seym-0.640
lis-0.639
Token help
Token ID1037
Feature activation-2.60e-01

Pos logit contributions
theless+1.407
env+1.201
 Haas+1.043
 citizenship+1.037
terday+1.030
Neg logit contributions
hower-2.144
pees-2.082
pee-1.946
-1.926
ombat-1.922
Token had
Token ID550
Feature activation+1.27e-02

Pos logit contributions
theless+0.730
env+0.608
ylum+0.548
terday+0.547
 Haas+0.546
Neg logit contributions
hower-1.033
pees-1.010
-0.937
ombat-0.936
pee-0.935
Token indicated
Token ID8203
Feature activation-2.43e-01

Pos logit contributions
theless+1.085
env+0.930
lis+0.853
terday+0.841
ylum+0.829
Neg logit contributions
hower-1.307
pees-1.265
legate-1.155
pee-1.155
ombat-1.150
Token doubts
Token ID17188
Feature activation-2.09e-01

Pos logit contributions
theless+0.759
env+0.642
 Seym+0.582
terday+0.582
lis+0.580
Neg logit contributions
hower-0.901
pees-0.891
pee-0.816
ombat-0.811
legate-0.799
Token about
Token ID546
Feature activation+1.22e-02

Pos logit contributions
theless+0.669
env+0.565
terday+0.533
lis+0.523
ylum+0.515
Neg logit contributions
hower-0.744
pees-0.734
ombat-0.676
-0.675
pee-0.673
Token the
Token ID262
Feature activation-6.71e-02

Pos logit contributions
hower+0.048
pees+0.047
ombat+0.043
pee+0.043
+0.043
Neg logit contributions
theless-0.035
env-0.030
 Seym-0.027
lis-0.026
terday-0.026
Token child
Token ID1200
Feature activation-1.95e-01

Pos logit contributions
theless+0.202
env+0.170
 Seym+0.153
terday+0.153
 Haas+0.152
Neg logit contributions
hower-0.257
pees-0.251
pee-0.230
ombat-0.230
legate-0.228
Token
Token ID447
Feature activation-1.28e+00

Pos logit contributions
theless+0.568
env+0.473
lis+0.421
terday+0.414
ylum+0.414
Neg logit contributions
hower-0.770
pees-0.755
ombat-0.697
pee-0.692
legate-0.682
Token
Token ID247
Feature activation-4.35e-01

Pos logit contributions
theless+4.297
env+3.730
lis+3.593
 Haas+3.544
ylum+3.540
Neg logit contributions
hower-4.070
pees-3.968
ombat-3.656
pee-3.593
apor-3.520
Tokens
Token ID82
Feature activation-3.09e-01

Pos logit contributions
theless+1.209
env+1.013
lis+0.912
terday+0.890
ylum+0.879
Neg logit contributions
hower-1.748
pees-1.699
ombat-1.576
pee-1.563
-1.551
Token paternity
Token ID50064
Feature activation+2.46e-01

Pos logit contributions
theless+0.874
env+0.724
terday+0.660
 citizenship+0.640
 Haas+0.639
Neg logit contributions
hower-1.228
pees-1.220
pee-1.118
ombat-1.118
legate-1.108
Token.
Token ID13
Feature activation-7.02e-02

Pos logit contributions
hower+0.850
pees+0.837
pee+0.765
ombat+0.762
+0.759
Neg logit contributions
theless-0.823
env-0.699
terday-0.648
lis-0.640
 Haas-0.636
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
theless+0.217
env+0.185
 Haas+0.168
lis+0.167
 Seym+0.166
Neg logit contributions
hower-0.264
pees-0.259
pee-0.235
ombat-0.234
legate-0.233
Tokenvalue
Token ID8367
Feature activation+2.76e-01

Pos logit contributions
hower+3.372
pees+3.299
pee+3.017
ombat+2.988
+2.969
Neg logit contributions
theless-3.463
env-2.961
terday-2.730
 Seym-2.725
lis-2.705
Token per
Token ID583
Feature activation+7.82e-01

Pos logit contributions
hower+0.972
pees+0.948
ombat+0.874
pee+0.872
+0.868
Neg logit contributions
theless-0.907
env-0.778
terday-0.709
lis-0.699
 Haas-0.698
Token play
Token ID711
Feature activation+1.03e+00

Pos logit contributions
hower+2.833
pees+2.785
pee+2.556
ombat+2.553
+2.540
Neg logit contributions
theless-2.480
env-2.108
terday-1.913
 Seym-1.902
lis-1.900
Token),
Token ID828
Feature activation+2.76e-01

Pos logit contributions
hower+3.824
pees+3.747
pee+3.461
ombat+3.425
legate+3.410
Neg logit contributions
theless-3.166
env-2.654
 Seym-2.417
terday-2.416
lis-2.393
Token and
Token ID290
Feature activation-1.35e-01

Pos logit contributions
hower+0.981
pees+0.957
pee+0.877
ombat+0.873
+0.868
Neg logit contributions
theless-0.901
env-0.769
terday-0.701
 Seym-0.699
lis-0.697
Token fourth
Token ID5544
Feature activation-1.86e+00

Pos logit contributions
theless+0.435
env+0.372
terday+0.337
 Haas+0.336
lis+0.336
Neg logit contributions
hower-0.485
pees-0.474
pee-0.434
ombat-0.433
-0.430
Token in
Token ID287
Feature activation-5.52e-01

Pos logit contributions
theless+5.173
env+4.484
terday+4.126
lis+4.108
 Haas+3.941
Neg logit contributions
hower-6.867
pees-6.786
ombat-6.368
pee-6.197
legate-6.049
Token running
Token ID2491
Feature activation-6.37e-01

Pos logit contributions
theless+1.683
env+1.453
 Haas+1.329
lis+1.312
terday+1.295
Neg logit contributions
hower-2.023
pees-1.952
ombat-1.824
-1.796
pee-1.795
Token back
Token ID736
Feature activation+1.64e-02

Pos logit contributions
theless+2.080
env+1.862
 Haas+1.706
lis+1.679
terday+1.670
Neg logit contributions
hower-2.242
pees-2.091
pee-1.924
ombat-1.918
-1.904
Token success
Token ID1943
Feature activation-6.20e-01

Pos logit contributions
hower+0.064
pees+0.061
ombat+0.057
pee+0.057
+0.057
Neg logit contributions
theless-0.048
env-0.041
lis-0.037
terday-0.037
 Haas-0.036
Token rate
Token ID2494
Feature activation-2.28e-01

Pos logit contributions
theless+2.067
env+1.880
lis+1.731
URE+1.720
 Seym+1.692
Neg logit contributions
hower-2.013
pees-1.911
-1.806
ombat-1.782
legate-1.782
Token.
Token ID13
Feature activation-5.31e-02

Pos logit contributions
hower+3.046
pees+2.972
ombat+2.706
pee+2.692
legate+2.676
Neg logit contributions
theless-3.670
env-3.145
terday-2.867
 Seym-2.856
lis-2.830
Token
Token ID198
Feature activation-6.60e-02

Pos logit contributions
theless+0.159
env+0.134
lis+0.123
 Haas+0.122
terday+0.122
Neg logit contributions
hower-0.203
pees-0.198
pee-0.183
legate-0.180
-0.180
Token
Token ID198
Feature activation-9.26e-02

Pos logit contributions
theless+0.186
env+0.163
lis+0.145
 Haas+0.143
terday+0.142
Neg logit contributions
hower-0.261
pees-0.255
pee-0.233
ombat-0.233
-0.232
TokenAdvertisement
Token ID4723
Feature activation+5.09e-01

Pos logit contributions
theless+0.302
env+0.268
lis+0.248
terday+0.240
 Haas+0.235
Neg logit contributions
hower-0.333
pees-0.315
pee-0.286
-0.286
ombat-0.286
Token Continue
Token ID10054
Feature activation-5.83e-01

Pos logit contributions
hower+1.527
pees+1.493
pee+1.345
+1.336
ombat+1.331
Neg logit contributions
theless-1.940
env-1.697
lis-1.575
terday-1.570
 Seym-1.559
Token reading
Token ID3555
Feature activation-1.64e+00

Pos logit contributions
theless+1.974
env+1.712
terday+1.571
lis+1.564
 Haas+1.546
Neg logit contributions
hower-2.003
pees-1.928
-1.772
ombat-1.756
pee-1.752
Token the
Token ID262
Feature activation+5.81e-01

Pos logit contributions
theless+3.230
env+2.856
lis+2.472
 Haas+2.461
URE+2.320
Neg logit contributions
hower-7.454
pees-7.321
-6.868
ombat-6.841
pee-6.792
Token main
Token ID1388
Feature activation+9.30e-01

Pos logit contributions
hower+1.906
pees+1.863
pee+1.700
ombat+1.684
+1.673
Neg logit contributions
theless-2.041
env-1.751
terday-1.617
 Seym-1.616
lis-1.603
Token story
Token ID1621
Feature activation-7.77e-01

Pos logit contributions
hower+2.858
pees+2.776
pee+2.536
legate+2.502
ombat+2.500
Neg logit contributions
theless-3.447
env-2.940
 Seym-2.859
terday-2.772
lis-2.716
Token
Token ID198
Feature activation-4.82e-01

Pos logit contributions
theless+1.955
env+1.733
lis+1.525
 Haas+1.496
URE+1.449
Neg logit contributions
hower-3.228
pees-3.167
ombat-2.934
-2.915
pee-2.886
Token
Token ID198
Feature activation-5.04e-01

Pos logit contributions
theless+1.248
env+1.127
lis+0.987
 Haas+0.964
terday+0.939
Neg logit contributions
hower-1.991
pees-1.951
ombat-1.784
-1.783
pee-1.779