TOP ACTIVATIONS
MAX = 3.565

age . Some how , no one at Cardinals
View more <|endoftext|> About R acing Apex is
. How do you view the Cowboys ' season
, I 'd go some place and masturb ate and use
View more <|endoftext|> About R acing Apex is a
. How do you maintain a semblance of normal
Democratic strategist , said both parties could walk away from the
China to Expand Military Reach with a Fleet of the
fiction . Any ways , this latest leak has
_ state . un wrap ()) } } pub fn apply
par liam ents have the power to legisl ate with respect
K OD I 17 K RY P TON Setting
things slow down quite a bit but I can happily play
UG instead . Y og ventures raised nearly $
и я , т о
, the Commonwealth has the power to levy income tax which
the machine at the Franklin Institute . While working on the
curly braces though , in order to differentiate between string liter
CNN reports two people are confirmed dead , including an 8
к а » с р о

MIN ACTIVATIONS
MIN = -3.861

U . S . Cons ulate in Ad ana and the
s US Cons ulate to protest Trump
two federal PACs Liber ty 2 . 0 and Oklahoma
ve seen it so far . PA X
has repeatedly criticized President Trump Donald John Trump House committee believes
youtube . com / watch ? v = 2 g 7
10 . 13 71 / journal . p one . 00
the aid of Victor L own es , the crown prince
ballot ended up more or less mirror ing the Rab id
She cuts me off by enc ir cling my
s undeniable how well Arduino has enabled no v
in its episode " Sil ent Ro ar : Search ing
when a TC RI truck drove by , and a strange
M DB 's response so far was that it needed the
radio debate hosted by La Opinion and public radio station KP
as Bac ile 's Inn oc ence of Muslims are taken
situation . Somebody always pops off a baby in the middle
Washington Examiner , Donald Trump Donald John Trump READ : Cohen
on banks . <|endoftext|> Ill ust rious Illum ination : Incredible
26 citations to drivers so far for violating the transit lanes

INTERVAL 1.708 to 3.565
CONTAINS 0.346%

to just lean back and cheer for the apocalyptic disasters of
not in the radius of ignore effects , or they have
( don t forget to also add their respective
а к е я в л я е
as I add the comparison code like in the above code

INTERVAL -0.148 to 1.708
CONTAINS 61.353%

Development of the F - 35 fighter jet will
carry needs to match total circulation in the grid . Likewise
Most importantly however , we can restore safe and healthy habitats
curtain - like slit in the bottom rather than bursting
hes were dry , I gave it another fe int highlight

INTERVAL -2.004 to -0.148
CONTAINS 38.161%

troops in Iraq and Syria again , The Daily
gap with debt . Polit icians attract votes by promising large
EM M ITT SM ITH , FL
2 tires specifically designed for off - road performance .
ons , meanwhile , represent density deviation from that ambient .

INTERVAL -3.861 to -2.004
CONTAINS 0.141%

rock - favorite favorite favorite favorite - December 13 , 2010
- called ethnic music ? Yes , I 'm very interested
To find out if their creation works , they
She cuts me off by enc ir cling my
the world I 'm let loose upon by the creators ."
Tokenage
Token ID496
Feature activation-4.59e-02

Pos logit contributions
Sov+0.430
 plur+0.417
edin+0.412
repre+0.379
 differently+0.371
Neg logit contributions
aband-0.513
ipel-0.473
 trap-0.449
opsis-0.446
hemer-0.422
Token.
Token ID13
Feature activation+4.66e-01

Pos logit contributions
aband+0.038
ipel+0.034
 trap+0.033
opsis+0.032
hemer+0.031
Neg logit contributions
Sov-0.035
 plur-0.033
edin-0.033
repre-0.031
 differently-0.030
Token
Token ID198
Feature activation+3.79e-01

Pos logit contributions
Sov+0.373
 plur+0.363
edin+0.360
repre+0.333
 differently+0.328
Neg logit contributions
aband-0.358
ipel-0.327
 trap-0.309
opsis-0.306
hemer-0.288
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
Sov+0.277
 plur+0.268
edin+0.266
repre+0.245
 differently+0.240
Neg logit contributions
aband-0.318
ipel-0.293
 trap-0.277
opsis-0.276
hemer-0.261
TokenSome
Token ID4366
Feature activation+1.72e-01

Pos logit contributions
Sov+0.174
 plur+0.168
edin+0.167
repre+0.155
 differently+0.151
Neg logit contributions
aband-0.178
ipel-0.163
 trap-0.154
opsis-0.153
hemer-0.144
Tokenhow
Token ID4919
Feature activation+3.56e+00

Pos logit contributions
Sov+0.134
 plur+0.130
edin+0.129
repre+0.119
 differently+0.117
Neg logit contributions
aband-0.135
ipel-0.124
 trap-0.117
opsis-0.116
hemer-0.110
Token,
Token ID11
Feature activation+4.74e-01

Pos logit contributions
Sov+2.704
 plur+2.659
edin+2.616
repre+2.427
 differently+2.420
Neg logit contributions
aband-2.852
ipel-2.606
opsis-2.452
 trap-2.436
��-2.298
Token no
Token ID645
Feature activation+5.89e-03

Pos logit contributions
Sov+0.341
 plur+0.329
edin+0.325
repre+0.300
 differently+0.292
Neg logit contributions
aband-0.403
ipel-0.371
 trap-0.354
opsis-0.350
hemer-0.332
Token one
Token ID530
Feature activation-9.74e-01

Pos logit contributions
Sov+0.004
 plur+0.004
edin+0.004
repre+0.004
 differently+0.004
Neg logit contributions
aband-0.005
ipel-0.005
 trap-0.005
opsis-0.004
hemer-0.004
Token at
Token ID379
Feature activation+4.80e-01

Pos logit contributions
aband+0.812
ipel+0.739
 trap+0.714
opsis+0.701
hemer+0.663
Neg logit contributions
Sov-0.723
 plur-0.689
edin-0.688
repre-0.637
 differently-0.614
Token Cardinals
Token ID18071
Feature activation+1.02e+00

Pos logit contributions
Sov+0.422
 plur+0.411
edin+0.407
repre+0.381
 differently+0.374
Neg logit contributions
aband-0.332
ipel-0.300
 trap-0.282
opsis-0.279
hemer-0.260
Token
Token ID198
Feature activation-8.58e-02

Pos logit contributions
aband+0.132
ipel+0.121
 trap+0.115
opsis+0.114
hemer+0.107
Neg logit contributions
Sov-0.121
 plur-0.118
edin-0.116
repre-0.108
 differently-0.105
TokenView
Token ID7680
Feature activation-2.70e-01

Pos logit contributions
aband+0.081
ipel+0.075
 trap+0.072
opsis+0.071
hemer+0.068
Neg logit contributions
Sov-0.054
 plur-0.052
edin-0.051
repre-0.046
 differently-0.045
Token more
Token ID517
Feature activation-2.16e-01

Pos logit contributions
aband+0.233
ipel+0.217
 trap+0.208
opsis+0.203
hemer+0.194
Neg logit contributions
Sov-0.190
 plur-0.183
edin-0.180
repre-0.168
 differently-0.160
Token<|endoftext|>
Token ID50256
Feature activation-8.80e-03

Pos logit contributions
aband+0.234
ipel+0.219
opsis+0.210
 trap+0.210
hemer+0.201
Neg logit contributions
Sov-0.106
 plur-0.101
edin-0.098
repre-0.087
 differently-0.085
TokenAbout
Token ID8585
Feature activation+6.41e-01

Pos logit contributions
aband+0.007
ipel+0.006
 trap+0.006
opsis+0.006
hemer+0.006
Neg logit contributions
Sov-0.007
 plur-0.007
edin-0.007
repre-0.006
 differently-0.006
Token
Token ID198
Feature activation+3.29e+00

Pos logit contributions
Sov+0.485
 plur+0.471
edin+0.465
repre+0.430
 differently+0.422
Neg logit contributions
aband-0.522
ipel-0.479
 trap-0.452
opsis-0.449
hemer-0.425
Token
Token ID198
Feature activation+2.99e+00

Pos logit contributions
Sov+2.192
edin+2.112
 plur+2.103
 differently+1.903
repre+1.892
Neg logit contributions
aband-2.946
ipel-2.772
opsis-2.567
 trap-2.559
��-2.505
TokenR
Token ID49
Feature activation+4.99e-02

Pos logit contributions
Sov+2.321
 plur+2.206
edin+2.201
repre+2.052
 differently+1.973
Neg logit contributions
aband-2.376
ipel-2.196
 trap-2.072
opsis-2.043
hemer-1.928
Tokenacing
Token ID4092
Feature activation+3.18e-01

Pos logit contributions
Sov+0.041
 plur+0.040
edin+0.040
repre+0.037
 differently+0.036
Neg logit contributions
aband-0.037
ipel-0.034
 trap-0.032
opsis-0.032
hemer-0.030
Token Apex
Token ID49440
Feature activation-5.18e-01

Pos logit contributions
Sov+0.240
 plur+0.233
edin+0.230
repre+0.213
 differently+0.209
Neg logit contributions
aband-0.260
ipel-0.238
 trap-0.226
opsis-0.224
hemer-0.212
Token is
Token ID318
Feature activation-9.99e-02

Pos logit contributions
aband+0.449
ipel+0.415
 trap+0.395
opsis+0.392
hemer+0.371
Neg logit contributions
Sov-0.365
 plur-0.352
edin-0.348
repre-0.321
 differently-0.312
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
Sov+0.031
 plur+0.030
edin+0.029
repre+0.027
 differently+0.026
Neg logit contributions
aband-0.035
ipel-0.033
 trap-0.031
opsis-0.031
hemer-0.029
Token
Token ID198
Feature activation-3.10e-01

Pos logit contributions
Sov+0.163
 plur+0.159
edin+0.157
repre+0.146
 differently+0.143
Neg logit contributions
aband-0.156
ipel-0.142
 trap-0.135
opsis-0.133
hemer-0.125
Token
Token ID198
Feature activation-1.54e-01

Pos logit contributions
aband+0.256
ipel+0.236
 trap+0.224
opsis+0.222
hemer+0.210
Neg logit contributions
Sov-0.230
 plur-0.223
edin-0.220
repre-0.204
 differently-0.199
TokenHow
Token ID2437
Feature activation+6.54e-01

Pos logit contributions
aband+0.119
ipel+0.108
 trap+0.103
opsis+0.101
hemer+0.095
Neg logit contributions
Sov-0.124
 plur-0.120
edin-0.119
repre-0.111
 differently-0.108
Token do
Token ID466
Feature activation-6.37e-02

Pos logit contributions
Sov+0.465
 plur+0.451
edin+0.445
repre+0.409
 differently+0.402
Neg logit contributions
aband-0.561
ipel-0.516
 trap-0.491
opsis-0.487
hemer-0.462
Token you
Token ID345
Feature activation+3.09e+00

Pos logit contributions
aband+0.055
ipel+0.051
 trap+0.048
opsis+0.048
hemer+0.045
Neg logit contributions
Sov-0.045
 plur-0.044
edin-0.043
repre-0.040
 differently-0.039
Token view
Token ID1570
Feature activation+1.70e+00

Pos logit contributions
Sov+2.052
 plur+2.037
edin+2.001
 differently+1.833
repre+1.814
Neg logit contributions
aband-2.761
ipel-2.564
opsis-2.407
 trap-2.369
��-2.303
Token the
Token ID262
Feature activation+7.82e-01

Pos logit contributions
Sov+1.310
 plur+1.289
edin+1.263
 differently+1.173
repre+1.171
Neg logit contributions
aband-1.349
ipel-1.239
opsis-1.162
 trap-1.155
hemer-1.094
Token Cowboys
Token ID16535
Feature activation+6.79e-01

Pos logit contributions
Sov+0.596
 plur+0.580
edin+0.571
repre+0.530
 differently+0.522
Neg logit contributions
aband-0.631
ipel-0.580
 trap-0.546
opsis-0.545
hemer-0.514
Token'
Token ID6
Feature activation+8.90e-01

Pos logit contributions
Sov+0.458
 plur+0.443
edin+0.438
repre+0.401
 differently+0.386
Neg logit contributions
aband-0.606
ipel-0.561
 trap-0.538
opsis-0.530
hemer-0.505
Token season
Token ID1622
Feature activation+7.41e-01

Pos logit contributions
Sov+0.696
 plur+0.677
edin+0.668
repre+0.621
 differently+0.611
Neg logit contributions
aband-0.700
ipel-0.641
 trap-0.604
opsis-0.601
hemer-0.566
Token,
Token ID11
Feature activation-1.89e-02

Pos logit contributions
aband+0.275
ipel+0.252
 trap+0.239
opsis+0.236
hemer+0.223
Neg logit contributions
Sov-0.274
 plur-0.265
edin-0.262
repre-0.244
 differently-0.238
Token I
Token ID314
Feature activation-4.82e-01

Pos logit contributions
aband+0.015
ipel+0.014
 trap+0.013
opsis+0.013
hemer+0.012
Neg logit contributions
Sov-0.014
 plur-0.014
edin-0.014
repre-0.013
 differently-0.012
Token'd
Token ID1549
Feature activation-1.24e+00

Pos logit contributions
aband+0.462
ipel+0.430
 trap+0.413
opsis+0.408
hemer+0.390
Neg logit contributions
Sov-0.296
 plur-0.283
edin-0.279
repre-0.254
 differently-0.246
Token go
Token ID467
Feature activation-8.42e-01

Pos logit contributions
aband+0.996
ipel+0.909
 trap+0.881
opsis+0.857
hemer+0.810
Neg logit contributions
Sov-0.951
 plur-0.909
edin-0.902
repre-0.840
 differently-0.810
Token some
Token ID617
Feature activation-1.86e-01

Pos logit contributions
aband+0.699
ipel+0.641
 trap+0.614
opsis+0.604
hemer+0.570
Neg logit contributions
Sov-0.628
 plur-0.605
edin-0.598
repre-0.555
 differently-0.536
Tokenplace
Token ID5372
Feature activation+3.09e+00

Pos logit contributions
aband+0.196
ipel+0.184
 trap+0.176
opsis+0.176
hemer+0.168
Neg logit contributions
Sov-0.095
 plur-0.091
edin-0.090
repre-0.080
 differently-0.077
Token and
Token ID290
Feature activation+5.05e-01

Pos logit contributions
Sov+2.170
 plur+2.126
edin+2.077
 differently+1.946
repre+1.919
Neg logit contributions
aband-2.639
ipel-2.454
opsis-2.317
 trap-2.282
��-2.193
Token masturb
Token ID22938
Feature activation-6.15e-01

Pos logit contributions
Sov+0.388
 plur+0.376
edin+0.371
repre+0.344
 differently+0.337
Neg logit contributions
aband-0.406
ipel-0.372
 trap-0.353
opsis-0.349
hemer-0.329
Tokenate
Token ID378
Feature activation-1.36e-01

Pos logit contributions
aband+0.546
ipel+0.505
 trap+0.481
opsis+0.477
hemer+0.453
Neg logit contributions
Sov-0.420
 plur-0.406
edin-0.400
repre-0.367
 differently-0.359
Token and
Token ID290
Feature activation+6.62e-01

Pos logit contributions
aband+0.115
ipel+0.105
 trap+0.101
opsis+0.099
hemer+0.095
Neg logit contributions
Sov-0.099
 plur-0.095
edin-0.094
repre-0.087
 differently-0.083
Token use
Token ID779
Feature activation-5.33e-01

Pos logit contributions
Sov+0.497
 plur+0.481
edin+0.475
repre+0.440
 differently+0.430
Neg logit contributions
aband-0.543
ipel-0.498
 trap-0.475
opsis-0.469
hemer-0.444
TokenView
Token ID7680
Feature activation-2.70e-01

Pos logit contributions
aband+0.081
ipel+0.075
 trap+0.072
opsis+0.071
hemer+0.068
Neg logit contributions
Sov-0.054
 plur-0.052
edin-0.051
repre-0.046
 differently-0.045
Token more
Token ID517
Feature activation-2.16e-01

Pos logit contributions
aband+0.233
ipel+0.217
 trap+0.208
opsis+0.203
hemer+0.194
Neg logit contributions
Sov-0.190
 plur-0.183
edin-0.180
repre-0.168
 differently-0.160
Token<|endoftext|>
Token ID50256
Feature activation-8.80e-03

Pos logit contributions
aband+0.234
ipel+0.219
opsis+0.210
 trap+0.210
hemer+0.201
Neg logit contributions
Sov-0.106
 plur-0.101
edin-0.098
repre-0.087
 differently-0.085
TokenAbout
Token ID8585
Feature activation+6.41e-01

Pos logit contributions
aband+0.007
ipel+0.006
 trap+0.006
opsis+0.006
hemer+0.006
Neg logit contributions
Sov-0.007
 plur-0.007
edin-0.007
repre-0.006
 differently-0.006
Token
Token ID198
Feature activation+3.29e+00

Pos logit contributions
Sov+0.485
 plur+0.471
edin+0.465
repre+0.430
 differently+0.422
Neg logit contributions
aband-0.522
ipel-0.479
 trap-0.452
opsis-0.449
hemer-0.425
Token
Token ID198
Feature activation+2.99e+00

Pos logit contributions
Sov+2.192
edin+2.112
 plur+2.103
 differently+1.903
repre+1.892
Neg logit contributions
aband-2.946
ipel-2.772
opsis-2.567
 trap-2.559
��-2.505
TokenR
Token ID49
Feature activation+4.99e-02

Pos logit contributions
Sov+2.321
 plur+2.206
edin+2.201
repre+2.052
 differently+1.973
Neg logit contributions
aband-2.376
ipel-2.196
 trap-2.072
opsis-2.043
hemer-1.928
Tokenacing
Token ID4092
Feature activation+3.18e-01

Pos logit contributions
Sov+0.041
 plur+0.040
edin+0.040
repre+0.037
 differently+0.036
Neg logit contributions
aband-0.037
ipel-0.034
 trap-0.032
opsis-0.032
hemer-0.030
Token Apex
Token ID49440
Feature activation-5.18e-01

Pos logit contributions
Sov+0.240
 plur+0.233
edin+0.230
repre+0.213
 differently+0.209
Neg logit contributions
aband-0.260
ipel-0.238
 trap-0.226
opsis-0.224
hemer-0.212
Token is
Token ID318
Feature activation-9.99e-02

Pos logit contributions
aband+0.449
ipel+0.415
 trap+0.395
opsis+0.392
hemer+0.371
Neg logit contributions
Sov-0.365
 plur-0.352
edin-0.348
repre-0.321
 differently-0.312
Token a
Token ID257
Feature activation-3.80e-01

Pos logit contributions
aband+0.089
ipel+0.083
 trap+0.079
opsis+0.078
hemer+0.074
Neg logit contributions
Sov-0.068
 plur-0.065
edin-0.064
repre-0.059
 differently-0.058
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
Sov+0.400
 plur+0.387
edin+0.383
repre+0.356
 differently+0.347
Neg logit contributions
aband-0.384
ipel-0.351
 trap-0.333
opsis-0.329
hemer-0.310
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
aband+0.106
ipel+0.097
 trap+0.091
opsis+0.090
hemer+0.085
Neg logit contributions
Sov-0.122
 plur-0.118
edin-0.117
repre-0.110
 differently-0.107
Token
Token ID198
Feature activation+2.77e-01

Pos logit contributions
Sov+0.117
 plur+0.113
edin+0.112
repre+0.103
 differently+0.101
Neg logit contributions
aband-0.134
ipel-0.123
 trap-0.116
opsis-0.116
hemer-0.110
TokenHow
Token ID2437
Feature activation+6.90e-01

Pos logit contributions
Sov+0.217
 plur+0.210
edin+0.208
repre+0.193
 differently+0.189
Neg logit contributions
aband-0.219
ipel-0.200
 trap-0.190
opsis-0.188
hemer-0.177
Token do
Token ID466
Feature activation+8.80e-02

Pos logit contributions
Sov+0.491
 plur+0.476
edin+0.470
repre+0.432
 differently+0.424
Neg logit contributions
aband-0.592
ipel-0.545
 trap-0.518
opsis-0.515
hemer-0.488
Token you
Token ID345
Feature activation+2.98e+00

Pos logit contributions
Sov+0.063
 plur+0.061
edin+0.060
repre+0.056
 differently+0.054
Neg logit contributions
aband-0.075
ipel-0.069
 trap-0.066
opsis-0.065
hemer-0.062
Token maintain
Token ID5529
Feature activation+7.67e-01

Pos logit contributions
Sov+2.104
 plur+2.082
edin+2.052
 differently+1.886
repre+1.869
Neg logit contributions
aband-2.541
ipel-2.348
opsis-2.196
 trap-2.162
��-2.104
Token a
Token ID257
Feature activation+5.72e-01

Pos logit contributions
Sov+0.551
 plur+0.530
edin+0.525
repre+0.485
 differently+0.471
Neg logit contributions
aband-0.653
ipel-0.603
 trap-0.577
opsis-0.569
hemer-0.539
Token semblance
Token ID45960
Feature activation+1.41e+00

Pos logit contributions
Sov+0.460
 plur+0.446
edin+0.441
repre+0.411
 differently+0.402
Neg logit contributions
aband-0.438
ipel-0.400
 trap-0.380
opsis-0.375
hemer-0.353
Token of
Token ID286
Feature activation+4.48e-01

Pos logit contributions
Sov+0.960
 plur+0.941
edin+0.924
repre+0.841
 differently+0.838
Neg logit contributions
aband-1.247
ipel-1.152
 trap-1.086
opsis-1.085
hemer-1.031
Token normal
Token ID3487
Feature activation+1.83e+00

Pos logit contributions
Sov+0.353
 plur+0.342
edin+0.339
repre+0.315
 differently+0.308
Neg logit contributions
aband-0.350
ipel-0.320
 trap-0.305
opsis-0.301
hemer-0.283
Token Democratic
Token ID4390
Feature activation+2.69e-01

Pos logit contributions
aband+0.176
ipel+0.160
 trap+0.150
opsis+0.149
hemer+0.140
Neg logit contributions
Sov-0.193
 plur-0.189
edin-0.186
repre-0.174
 differently-0.171
Token strategist
Token ID25651
Feature activation-3.84e-01

Pos logit contributions
Sov+0.240
 plur+0.234
edin+0.232
repre+0.217
 differently+0.214
Neg logit contributions
aband-0.182
ipel-0.165
 trap-0.153
opsis-0.153
hemer-0.142
Token,
Token ID11
Feature activation+4.24e-01

Pos logit contributions
aband+0.302
ipel+0.277
 trap+0.262
opsis+0.259
hemer+0.244
Neg logit contributions
Sov-0.301
 plur-0.292
edin-0.289
repre-0.268
 differently-0.262
Token said
Token ID531
Feature activation-3.60e-01

Pos logit contributions
Sov+0.312
 plur+0.302
edin+0.299
repre+0.276
 differently+0.270
Neg logit contributions
aband-0.354
ipel-0.326
 trap-0.310
opsis-0.307
hemer-0.290
Token both
Token ID1111
Feature activation-9.62e-01

Pos logit contributions
aband+0.282
ipel+0.257
 trap+0.244
opsis+0.241
hemer+0.227
Neg logit contributions
Sov-0.284
 plur-0.275
edin-0.272
repre-0.253
 differently-0.247
Token parties
Token ID4671
Feature activation+2.97e+00

Pos logit contributions
aband+0.710
ipel+0.645
 trap+0.608
opsis+0.602
hemer+0.565
Neg logit contributions
Sov-0.801
 plur-0.779
edin-0.771
repre-0.719
 differently-0.706
Token could
Token ID714
Feature activation+4.09e-01

Pos logit contributions
Sov+2.101
 plur+2.073
edin+2.028
 differently+1.877
repre+1.859
Neg logit contributions
aband-2.536
ipel-2.339
opsis-2.202
 trap-2.167
hemer-2.087
Token walk
Token ID2513
Feature activation-5.07e-01

Pos logit contributions
Sov+0.309
 plur+0.300
edin+0.296
repre+0.274
 differently+0.269
Neg logit contributions
aband-0.333
ipel-0.306
 trap-0.290
opsis-0.288
hemer-0.272
Token away
Token ID1497
Feature activation+3.98e-01

Pos logit contributions
aband+0.423
ipel+0.389
 trap+0.374
opsis+0.366
hemer+0.346
Neg logit contributions
Sov-0.373
 plur-0.358
edin-0.357
repre-0.332
 differently-0.315
Token from
Token ID422
Feature activation+6.57e-01

Pos logit contributions
Sov+0.283
 plur+0.274
edin+0.270
repre+0.249
 differently+0.243
Neg logit contributions
aband-0.342
ipel-0.315
 trap-0.300
opsis-0.297
hemer-0.282
Token the
Token ID262
Feature activation+5.66e-01

Pos logit contributions
Sov+0.494
 plur+0.483
edin+0.473
repre+0.440
 differently+0.434
Neg logit contributions
aband-0.536
ipel-0.494
opsis-0.464
 trap-0.464
hemer-0.439
Token
Token ID198
Feature activation+4.34e-01

Pos logit contributions
Sov+0.595
 plur+0.575
edin+0.571
repre+0.522
 differently+0.515
Neg logit contributions
aband-0.706
ipel-0.654
 trap-0.615
opsis-0.613
��-0.581
TokenChina
Token ID14581
Feature activation+2.39e-01

Pos logit contributions
Sov+0.338
 plur+0.326
edin+0.323
repre+0.300
 differently+0.292
Neg logit contributions
aband-0.345
ipel-0.317
 trap-0.301
opsis-0.297
hemer-0.280
Token to
Token ID284
Feature activation+5.35e-01

Pos logit contributions
Sov+0.180
 plur+0.175
edin+0.172
repre+0.159
 differently+0.156
Neg logit contributions
aband-0.195
ipel-0.179
 trap-0.170
opsis-0.169
hemer-0.159
Token Expand
Token ID49368
Feature activation+1.40e+00

Pos logit contributions
Sov+0.428
 plur+0.418
edin+0.410
repre+0.378
 differently+0.376
Neg logit contributions
aband-0.412
ipel-0.378
opsis-0.354
 trap-0.353
hemer-0.334
Token Military
Token ID12842
Feature activation+1.41e+00

Pos logit contributions
Sov+1.144
 plur+1.111
edin+1.101
repre+1.017
 differently+1.005
Neg logit contributions
aband-1.056
ipel-0.962
 trap-0.909
opsis-0.901
hemer-0.852
Token Reach
Token ID25146
Feature activation+2.93e+00

Pos logit contributions
Sov+1.153
 plur+1.121
edin+1.109
repre+1.032
 differently+1.014
Neg logit contributions
aband-1.056
ipel-0.961
 trap-0.907
opsis-0.898
hemer-0.845
Token with
Token ID351
Feature activation+1.29e+00

Pos logit contributions
Sov+2.134
 plur+2.074
edin+2.062
repre+1.886
 differently+1.867
Neg logit contributions
aband-2.431
ipel-2.220
 trap-2.126
opsis-2.105
hemer-2.000
Token a
Token ID257
Feature activation+1.07e+00

Pos logit contributions
Sov+1.039
 plur+1.013
edin+0.997
repre+0.924
 differently+0.915
Neg logit contributions
aband-0.975
ipel-0.892
 trap-0.842
opsis-0.834
hemer-0.790
Token Fleet
Token ID20001
Feature activation+7.53e-01

Pos logit contributions
Sov+0.851
 plur+0.827
edin+0.817
repre+0.759
 differently+0.746
Neg logit contributions
aband-0.823
ipel-0.751
 trap-0.710
opsis-0.703
hemer-0.663
Token of
Token ID286
Feature activation+6.20e-01

Pos logit contributions
Sov+0.544
 plur+0.526
edin+0.520
repre+0.479
 differently+0.470
Neg logit contributions
aband-0.638
ipel-0.586
 trap-0.559
opsis-0.553
hemer-0.525
Token the
Token ID262
Feature activation+4.22e-01

Pos logit contributions
Sov+0.481
 plur+0.467
edin+0.461
repre+0.427
 differently+0.421
Neg logit contributions
aband-0.491
ipel-0.450
 trap-0.427
opsis-0.423
hemer-0.399
Token fiction
Token ID10165
Feature activation-4.41e-01

Pos logit contributions
aband+0.311
ipel+0.282
 trap+0.265
opsis+0.262
hemer+0.245
Neg logit contributions
Sov-0.381
 plur-0.371
edin-0.367
repre-0.344
 differently-0.337
Token.
Token ID13
Feature activation+8.28e-01

Pos logit contributions
aband+0.371
ipel+0.342
 trap+0.326
opsis+0.322
hemer+0.305
Neg logit contributions
Sov-0.322
 plur-0.311
edin-0.308
repre-0.284
 differently-0.277
Token
Token ID198
Feature activation-9.07e-02

Pos logit contributions
Sov+0.653
 plur+0.634
edin+0.627
repre+0.582
 differently+0.571
Neg logit contributions
aband-0.647
ipel-0.592
 trap-0.560
opsis-0.555
hemer-0.523
Token
Token ID198
Feature activation-1.40e-01

Pos logit contributions
aband+0.074
ipel+0.068
 trap+0.065
opsis+0.064
hemer+0.061
Neg logit contributions
Sov-0.068
 plur-0.066
edin-0.065
repre-0.060
 differently-0.059
TokenAny
Token ID7149
Feature activation+2.88e-01

Pos logit contributions
aband+0.115
ipel+0.105
 trap+0.100
opsis+0.099
hemer+0.094
Neg logit contributions
Sov-0.104
 plur-0.102
edin-0.100
repre-0.092
 differently-0.091
Tokenways
Token ID1322
Feature activation+2.90e+00

Pos logit contributions
Sov+0.230
 plur+0.223
edin+0.221
repre+0.205
 differently+0.201
Neg logit contributions
aband-0.223
ipel-0.203
 trap-0.193
opsis-0.191
hemer-0.179
Token,
Token ID11
Feature activation+4.53e-01

Pos logit contributions
Sov+2.311
 plur+2.244
edin+2.225
repre+2.072
 differently+2.035
Neg logit contributions
aband-2.236
ipel-2.030
 trap-1.923
opsis-1.900
hemer-1.796
Token this
Token ID428
Feature activation-4.24e-01

Pos logit contributions
Sov+0.302
 plur+0.292
edin+0.288
repre+0.264
 differently+0.258
Neg logit contributions
aband-0.408
ipel-0.378
 trap-0.360
opsis-0.357
hemer-0.340
Token latest
Token ID3452
Feature activation-5.64e-01

Pos logit contributions
aband+0.361
ipel+0.332
 trap+0.319
opsis+0.315
hemer+0.297
Neg logit contributions
Sov-0.306
 plur-0.295
edin-0.291
repre-0.269
 differently-0.262
Token leak
Token ID13044
Feature activation-2.50e-01

Pos logit contributions
aband+0.380
ipel+0.341
 trap+0.323
opsis+0.318
hemer+0.295
Neg logit contributions
Sov-0.506
 plur-0.491
edin-0.487
repre-0.457
 differently-0.448
Token has
Token ID468
Feature activation-5.56e-01

Pos logit contributions
aband+0.212
ipel+0.195
 trap+0.187
opsis+0.185
hemer+0.174
Neg logit contributions
Sov-0.182
 plur-0.175
edin-0.173
repre-0.159
 differently-0.155
Token_
Token ID62
Feature activation+8.81e-01

Pos logit contributions
Sov+1.566
 plur+1.530
edin+1.518
repre+1.427
 differently+1.404
Neg logit contributions
aband-1.036
ipel-0.920
 trap-0.855
opsis-0.845
hemer-0.781
Tokenstate
Token ID5219
Feature activation+9.40e-02

Pos logit contributions
Sov+0.560
 plur+0.542
edin+0.538
repre+0.490
 differently+0.476
Neg logit contributions
aband-0.819
ipel-0.759
 trap-0.726
opsis-0.721
hemer-0.687
Token.
Token ID764
Feature activation-4.38e-01

Pos logit contributions
Sov+0.072
 plur+0.070
edin+0.069
repre+0.064
 differently+0.062
Neg logit contributions
aband-0.076
ipel-0.069
 trap-0.066
opsis-0.065
hemer-0.062
Tokenun
Token ID403
Feature activation+8.89e-01

Pos logit contributions
aband+0.358
ipel+0.329
 trap+0.312
opsis+0.309
hemer+0.292
Neg logit contributions
Sov-0.330
 plur-0.320
edin-0.316
repre-0.292
 differently-0.286
Tokenwrap
Token ID37150
Feature activation+5.56e-01

Pos logit contributions
Sov+0.693
 plur+0.673
edin+0.666
repre+0.618
 differently+0.605
Neg logit contributions
aband-0.702
ipel-0.643
 trap-0.609
opsis-0.603
hemer-0.568
Token ())
Token ID32865
Feature activation+2.89e+00

Pos logit contributions
Sov+0.470
 plur+0.457
edin+0.452
repre+0.422
 differently+0.415
Neg logit contributions
aband-0.404
ipel-0.366
 trap-0.345
opsis-0.341
hemer-0.319
Token }
Token ID1782
Feature activation-3.58e-01

Pos logit contributions
Sov+1.962
 plur+1.922
edin+1.887
repre+1.722
 differently+1.713
Neg logit contributions
aband-2.558
ipel-2.350
opsis-2.218
 trap-2.206
hemer-2.102
Token }
Token ID1782
Feature activation-5.65e-01

Pos logit contributions
aband+0.323
ipel+0.300
 trap+0.287
opsis+0.284
hemer+0.270
Neg logit contributions
Sov-0.238
 plur-0.229
edin-0.226
repre-0.208
 differently-0.202
Token pub
Token ID2240
Feature activation-1.48e-01

Pos logit contributions
aband+0.477
ipel+0.440
 trap+0.420
opsis+0.415
hemer+0.393
Neg logit contributions
Sov-0.410
 plur-0.395
edin-0.390
repre-0.361
 differently-0.352
Token fn
Token ID24714
Feature activation-1.33e+00

Pos logit contributions
aband+0.130
ipel+0.120
 trap+0.116
opsis+0.114
hemer+0.108
Neg logit contributions
Sov-0.103
 plur-0.099
edin-0.097
repre-0.090
 differently-0.087
Token apply
Token ID4174
Feature activation-4.47e-01

Pos logit contributions
aband+0.952
ipel+0.869
 trap+0.837
opsis+0.811
 sleep+0.757
Neg logit contributions
Sov-1.127
 plur-1.077
edin-1.069
repre-1.003
 differently-0.968
Token par
Token ID1582
Feature activation+6.63e-01

Pos logit contributions
Sov+0.866
 plur+0.848
edin+0.830
repre+0.775
 differently+0.768
Neg logit contributions
aband-0.831
ipel-0.763
 trap-0.713
opsis-0.712
hemer-0.671
Tokenliam
Token ID5058
Feature activation+1.47e+00

Pos logit contributions
Sov+0.591
 plur+0.582
edin+0.576
repre+0.540
 differently+0.530
Neg logit contributions
aband-0.441
ipel-0.398
 trap-0.375
opsis-0.368
��-0.341
Tokenents
Token ID658
Feature activation-2.06e-01

Pos logit contributions
Sov+1.290
 plur+1.280
edin+1.265
 differently+1.183
repre+1.179
Neg logit contributions
aband-0.981
ipel-0.893
 trap-0.830
opsis-0.825
��-0.766
Token have
Token ID423
Feature activation+2.57e-01

Pos logit contributions
aband+0.174
ipel+0.161
 trap+0.152
opsis+0.151
hemer+0.143
Neg logit contributions
Sov-0.149
 plur-0.145
edin-0.143
repre-0.132
 differently-0.130
Token the
Token ID262
Feature activation-1.70e-01

Pos logit contributions
Sov+0.205
 plur+0.200
edin+0.197
repre+0.183
 differently+0.180
Neg logit contributions
aband-0.199
ipel-0.182
 trap-0.171
opsis-0.170
hemer-0.160
Token power
Token ID1176
Feature activation+2.88e+00

Pos logit contributions
aband+0.102
ipel+0.091
 trap+0.085
opsis+0.083
hemer+0.076
Neg logit contributions
Sov-0.166
 plur-0.161
edin-0.160
repre-0.151
 differently-0.148
Token to
Token ID284
Feature activation+8.27e-01

Pos logit contributions
Sov+1.697
 plur+1.665
edin+1.624
repre+1.484
 differently+1.470
Neg logit contributions
aband-2.795
ipel-2.583
 trap-2.465
opsis-2.453
hemer-2.340
Token legisl
Token ID3560
Feature activation+1.40e+00

Pos logit contributions
Sov+0.623
 plur+0.612
edin+0.601
repre+0.555
 differently+0.551
Neg logit contributions
aband-0.672
ipel-0.617
opsis-0.579
 trap-0.578
hemer-0.546
Tokenate
Token ID378
Feature activation+8.85e-02

Pos logit contributions
Sov+0.885
 plur+0.869
edin+0.854
repre+0.781
 differently+0.767
Neg logit contributions
aband-1.284
ipel-1.205
 trap-1.141
opsis-1.141
uggets-1.078
Token with
Token ID351
Feature activation+6.04e-01

Pos logit contributions
Sov+0.058
 plur+0.057
edin+0.056
repre+0.051
 differently+0.050
Neg logit contributions
aband-0.081
ipel-0.075
 trap-0.071
opsis-0.071
hemer-0.067
Token respect
Token ID2461
Feature activation-1.11e+00

Pos logit contributions
Sov+0.456
 plur+0.446
edin+0.437
repre+0.406
 differently+0.400
Neg logit contributions
aband-0.491
ipel-0.451
 trap-0.426
opsis-0.423
hemer-0.400
Token K
Token ID509
Feature activation+1.38e+00

Pos logit contributions
Sov+0.467
 plur+0.455
edin+0.451
repre+0.424
 differently+0.416
Neg logit contributions
aband-0.332
ipel-0.298
 trap-0.279
opsis-0.275
hemer-0.255
TokenOD
Token ID3727
Feature activation+6.78e-01

Pos logit contributions
Sov+0.943
 plur+0.912
edin+0.902
repre+0.822
 differently+0.807
Neg logit contributions
aband-1.218
ipel-1.128
 trap-1.072
opsis-1.061
hemer-1.010
TokenI
Token ID40
Feature activation+4.37e-01

Pos logit contributions
Sov+0.519
 plur+0.503
edin+0.497
repre+0.461
 differently+0.451
Neg logit contributions
aband-0.546
ipel-0.501
 trap-0.475
opsis-0.470
hemer-0.444
Token 17
Token ID1596
Feature activation+8.77e-01

Pos logit contributions
Sov+0.339
 plur+0.328
edin+0.324
repre+0.302
 differently+0.293
Neg logit contributions
aband-0.348
ipel-0.320
 trap-0.303
opsis-0.300
hemer-0.282
Token K
Token ID509
Feature activation+5.78e-01

Pos logit contributions
Sov+0.726
 plur+0.705
edin+0.698
repre+0.651
 differently+0.638
Neg logit contributions
aband-0.651
ipel-0.593
 trap-0.559
opsis-0.553
hemer-0.519
TokenRY
Token ID18276
Feature activation+2.87e+00

Pos logit contributions
Sov+0.457
 plur+0.444
edin+0.439
repre+0.408
 differently+0.400
Neg logit contributions
aband-0.451
ipel-0.412
 trap-0.390
opsis-0.386
hemer-0.364
TokenP
Token ID47
Feature activation+6.52e-01

Pos logit contributions
Sov+2.376
 plur+2.262
edin+2.242
 differently+2.073
repre+2.065
Neg logit contributions
aband-2.171
ipel-1.948
 trap-1.834
opsis-1.790
hemer-1.739
TokenTON
Token ID11357
Feature activation+1.01e+00

Pos logit contributions
Sov+0.540
 plur+0.525
edin+0.519
repre+0.485
 differently+0.473
Neg logit contributions
aband-0.484
ipel-0.442
 trap-0.417
opsis-0.413
hemer-0.387
Token
Token ID198
Feature activation+9.52e-02

Pos logit contributions
Sov+0.764
 plur+0.740
edin+0.731
repre+0.678
 differently+0.661
Neg logit contributions
aband-0.820
ipel-0.753
 trap-0.715
opsis-0.708
hemer-0.668
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
Sov+0.071
 plur+0.069
edin+0.068
repre+0.063
 differently+0.062
Neg logit contributions
aband-0.078
ipel-0.072
 trap-0.068
opsis-0.067
hemer-0.064
TokenSetting
Token ID34149
Feature activation-1.58e-01

Pos logit contributions
aband+0.210
ipel+0.192
 trap+0.181
opsis+0.180
hemer+0.170
Neg logit contributions
Sov-0.202
 plur-0.198
edin-0.195
repre-0.181
 differently-0.178
Token things
Token ID1243
Feature activation+2.71e-01

Pos logit contributions
Sov+0.799
 plur+0.773
edin+0.764
repre+0.704
 differently+0.687
Neg logit contributions
aband-0.958
ipel-0.883
 trap-0.840
opsis-0.833
hemer-0.789
Token slow
Token ID3105
Feature activation+1.10e+00

Pos logit contributions
Sov+0.199
 plur+0.192
edin+0.189
repre+0.175
 differently+0.169
Neg logit contributions
aband-0.228
ipel-0.209
 trap-0.202
opsis-0.198
hemer-0.187
Token down
Token ID866
Feature activation+8.13e-01

Pos logit contributions
Sov+0.837
 plur+0.811
edin+0.802
repre+0.743
 differently+0.728
Neg logit contributions
aband-0.886
ipel-0.812
 trap-0.771
opsis-0.763
hemer-0.721
Token quite
Token ID2407
Feature activation-5.82e-02

Pos logit contributions
Sov+0.538
 plur+0.516
edin+0.509
repre+0.467
 differently+0.447
Neg logit contributions
aband-0.742
ipel-0.685
 trap-0.660
opsis-0.651
hemer-0.619
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
aband+0.051
ipel+0.047
 trap+0.046
opsis+0.045
hemer+0.042
Neg logit contributions
Sov-0.040
 plur-0.039
edin-0.038
repre-0.035
 differently-0.033
Token bit
Token ID1643
Feature activation+2.86e+00

Pos logit contributions
Sov+0.752
 plur+0.728
edin+0.719
repre+0.664
 differently+0.649
Neg logit contributions
aband-0.876
ipel-0.807
 trap-0.767
opsis-0.760
hemer-0.720
Token but
Token ID475
Feature activation+2.41e-01

Pos logit contributions
Sov+1.946
 plur+1.884
edin+1.865
repre+1.709
 differently+1.693
Neg logit contributions
aband-2.530
ipel-2.343
 trap-2.220
opsis-2.210
hemer-2.100
Token I
Token ID314
Feature activation-7.93e-01

Pos logit contributions
Sov+0.168
 plur+0.161
edin+0.159
repre+0.147
 differently+0.141
Neg logit contributions
aband-0.211
ipel-0.195
 trap-0.188
opsis-0.184
hemer-0.175
Token can
Token ID460
Feature activation-4.57e-01

Pos logit contributions
aband+0.674
ipel+0.620
 trap+0.597
opsis+0.586
hemer+0.556
Neg logit contributions
Sov-0.574
 plur-0.549
edin-0.542
repre-0.503
 differently-0.487
Token happily
Token ID18177
Feature activation-4.28e-01

Pos logit contributions
aband+0.379
ipel+0.348
 trap+0.337
opsis+0.331
hemer+0.312
Neg logit contributions
Sov-0.339
 plur-0.324
edin-0.320
repre-0.299
 differently-0.288
Token play
Token ID711
Feature activation+8.69e-01

Pos logit contributions
aband+0.348
ipel+0.321
 trap+0.310
opsis+0.303
hemer+0.286
Neg logit contributions
Sov-0.324
 plur-0.311
edin-0.307
repre-0.286
 differently-0.276
TokenUG
Token ID7340
Feature activation+5.66e-01

Pos logit contributions
Sov+0.562
 plur+0.547
edin+0.541
repre+0.503
 differently+0.493
Neg logit contributions
aband-0.536
ipel-0.488
 trap-0.463
opsis-0.457
hemer-0.430
Token instead
Token ID2427
Feature activation+1.23e+00

Pos logit contributions
Sov+0.437
 plur+0.424
edin+0.419
repre+0.389
 differently+0.381
Neg logit contributions
aband-0.452
ipel-0.414
 trap-0.392
opsis-0.388
hemer-0.366
Token.
Token ID13
Feature activation+2.80e-03

Pos logit contributions
Sov+0.886
 plur+0.857
edin+0.846
repre+0.780
 differently+0.762
Neg logit contributions
aband-1.053
ipel-0.970
 trap-0.924
opsis-0.916
hemer-0.867
Token
Token ID198
Feature activation+5.52e-02

Pos logit contributions
Sov+0.002
 plur+0.002
edin+0.002
repre+0.002
 differently+0.002
Neg logit contributions
aband-0.002
ipel-0.002
 trap-0.002
opsis-0.002
hemer-0.002
Token
Token ID198
Feature activation+8.19e-02

Pos logit contributions
Sov+0.041
 plur+0.040
edin+0.039
repre+0.036
 differently+0.035
Neg logit contributions
aband-0.046
ipel-0.042
 trap-0.040
opsis-0.040
hemer-0.037
TokenY
Token ID56
Feature activation+2.83e+00

Pos logit contributions
Sov+0.064
 plur+0.062
edin+0.062
repre+0.057
 differently+0.056
Neg logit contributions
aband-0.065
ipel-0.059
 trap-0.056
opsis-0.055
hemer-0.052
Tokenog
Token ID519
Feature activation+2.12e+00

Pos logit contributions
Sov+2.437
 plur+2.367
edin+2.362
repre+2.195
 differently+2.156
Neg logit contributions
aband-1.975
ipel-1.788
 trap-1.694
opsis-1.668
��-1.566
Tokenventures
Token ID10065
Feature activation+8.77e-01

Pos logit contributions
Sov+1.276
 plur+1.225
edin+1.211
repre+1.097
 differently+1.066
Neg logit contributions
aband-2.033
ipel-1.897
 trap-1.818
opsis-1.803
hemer-1.724
Token raised
Token ID4376
Feature activation+6.39e-02

Pos logit contributions
Sov+0.638
 plur+0.620
edin+0.611
repre+0.563
 differently+0.554
Neg logit contributions
aband-0.738
ipel-0.679
 trap-0.644
opsis-0.639
hemer-0.606
Token nearly
Token ID3016
Feature activation+6.08e-01

Pos logit contributions
Sov+0.044
 plur+0.042
edin+0.041
repre+0.038
 differently+0.037
Neg logit contributions
aband-0.057
ipel-0.052
 trap-0.050
opsis-0.050
hemer-0.047
Token $
Token ID720
Feature activation+7.69e-01

Pos logit contributions
Sov+0.386
 plur+0.370
edin+0.364
repre+0.333
 differently+0.321
Neg logit contributions
aband-0.569
ipel-0.528
 trap-0.508
opsis-0.501
hemer-0.477
Token
Token ID141
Feature activation-1.15e+00

Pos logit contributions
aband+0.796
ipel+0.744
 trap+0.713
opsis+0.707
hemer+0.675
Neg logit contributions
Sov-0.487
 plur-0.469
edin-0.463
repre-0.421
 differently-0.406
Token
Token ID228
Feature activation-2.80e-01

Pos logit contributions
aband+0.571
ipel+0.494
 trap+0.450
opsis+0.442
hemer+0.398
Neg logit contributions
Sov-1.239
 plur-1.213
edin-1.203
repre-1.141
 differently-1.124
Tokenи
Token ID18849
Feature activation-4.41e-01

Pos logit contributions
aband+0.285
ipel+0.267
 trap+0.256
opsis+0.254
hemer+0.243
Neg logit contributions
Sov-0.155
 plur-0.148
edin-0.146
repre-0.131
 differently-0.127
Tokenя
Token ID40623
Feature activation-6.40e-01

Pos logit contributions
aband+0.452
ipel+0.423
 trap+0.407
opsis+0.404
hemer+0.387
Neg logit contributions
Sov-0.239
 plur-0.230
edin-0.225
repre-0.204
 differently-0.195
Token,
Token ID11
Feature activation+2.34e-01

Pos logit contributions
aband+0.598
ipel+0.557
 trap+0.533
opsis+0.527
hemer+0.503
Neg logit contributions
Sov-0.404
 plur-0.389
edin-0.385
repre-0.352
 differently-0.340
Token
Token ID220
Feature activation+2.82e+00

Pos logit contributions
Sov+0.148
 plur+0.142
edin+0.141
repre+0.128
 differently+0.125
Neg logit contributions
aband-0.219
ipel-0.204
 trap-0.195
opsis-0.193
hemer-0.184
Token
Token ID141
Feature activation-7.61e-01

Pos logit contributions
Sov+1.389
 plur+1.314
edin+1.286
repre+1.121
 differently+1.089
Neg logit contributions
aband-3.048
ipel-2.843
 trap-2.752
opsis-2.723
uggets-2.604
Token
Token ID229
Feature activation+4.11e-01

Pos logit contributions
aband+0.374
ipel+0.325
 trap+0.296
opsis+0.290
hemer+0.262
Neg logit contributions
Sov-0.819
 plur-0.801
edin-0.796
repre-0.755
 differently-0.744
Tokenт
Token ID20375
Feature activation+7.56e-02

Pos logit contributions
Sov+0.223
 plur+0.213
edin+0.209
repre+0.189
 differently+0.183
Neg logit contributions
aband-0.423
ipel-0.396
 trap-0.380
opsis-0.376
hemer-0.362
Tokenо
Token ID15166
Feature activation+6.52e-01

Pos logit contributions
Sov+0.035
 plur+0.033
edin+0.033
repre+0.029
 differently+0.028
Neg logit contributions
aband-0.084
ipel-0.079
 trap-0.076
opsis-0.075
hemer-0.072
Token
Token ID12466
Feature activation-1.19e-01

Pos logit contributions
Sov+0.376
 plur+0.361
edin+0.355
repre+0.320
 differently+0.311
Neg logit contributions
aband-0.649
ipel-0.605
 trap-0.580
opsis-0.576
hemer-0.550
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
Sov+0.472
 plur+0.459
edin+0.454
repre+0.422
 differently+0.414
Neg logit contributions
aband-0.451
ipel-0.411
 trap-0.389
opsis-0.385
hemer-0.362
Token the
Token ID262
Feature activation-5.77e-01

Pos logit contributions
aband+0.829
ipel+0.759
 trap+0.726
opsis+0.714
hemer+0.675
Neg logit contributions
Sov-0.764
 plur-0.730
edin-0.729
repre-0.676
 differently-0.652
Token Commonwealth
Token ID17165
Feature activation+1.48e-01

Pos logit contributions
aband+0.491
ipel+0.452
 trap+0.432
opsis+0.426
hemer+0.404
Neg logit contributions
Sov-0.415
 plur-0.399
edin-0.396
repre-0.365
 differently-0.355
Token has
Token ID468
Feature activation-5.94e-01

Pos logit contributions
Sov+0.121
 plur+0.117
edin+0.116
repre+0.108
 differently+0.106
Neg logit contributions
aband-0.112
ipel-0.102
 trap-0.097
opsis-0.096
hemer-0.090
Token the
Token ID262
Feature activation-4.72e-01

Pos logit contributions
aband+0.487
ipel+0.447
 trap+0.425
opsis+0.420
hemer+0.397
Neg logit contributions
Sov-0.445
 plur-0.430
edin-0.427
repre-0.395
 differently-0.384
Token power
Token ID1176
Feature activation+2.81e+00

Pos logit contributions
aband+0.320
ipel+0.289
 trap+0.273
opsis+0.268
hemer+0.249
Neg logit contributions
Sov-0.421
 plur-0.408
edin-0.406
repre-0.381
 differently-0.372
Token to
Token ID284
Feature activation+7.12e-01

Pos logit contributions
Sov+1.683
 plur+1.643
edin+1.606
repre+1.467
 differently+1.451
Neg logit contributions
aband-2.705
ipel-2.508
 trap-2.391
opsis-2.380
hemer-2.267
Token levy
Token ID35783
Feature activation+7.85e-01

Pos logit contributions
Sov+0.553
 plur+0.542
edin+0.533
repre+0.493
 differently+0.489
Neg logit contributions
aband-0.562
ipel-0.515
opsis-0.483
 trap-0.483
hemer-0.454
Token income
Token ID3739
Feature activation+6.07e-01

Pos logit contributions
Sov+0.574
 plur+0.559
edin+0.550
repre+0.507
 differently+0.500
Neg logit contributions
aband-0.656
ipel-0.605
 trap-0.573
opsis-0.571
hemer-0.539
Token tax
Token ID1687
Feature activation-1.08e-02

Pos logit contributions
Sov+0.389
 plur+0.377
edin+0.369
repre+0.337
 differently+0.332
Neg logit contributions
aband-0.565
ipel-0.525
opsis-0.499
 trap-0.499
hemer-0.473
Token which
Token ID543
Feature activation+4.23e-01

Pos logit contributions
aband+0.009
ipel+0.009
 trap+0.008
opsis+0.008
hemer+0.008
Neg logit contributions
Sov-0.008
 plur-0.007
edin-0.007
repre-0.007
 differently-0.007
Token the
Token ID262
Feature activation-6.66e-02

Pos logit contributions
Sov+0.042
 plur+0.041
edin+0.040
repre+0.037
 differently+0.037
Neg logit contributions
aband-0.043
ipel-0.039
 trap-0.037
opsis-0.037
hemer-0.034
Token machine
Token ID4572
Feature activation-5.32e-01

Pos logit contributions
aband+0.052
ipel+0.047
 trap+0.045
opsis+0.044
hemer+0.042
Neg logit contributions
Sov-0.053
 plur-0.051
edin-0.051
repre-0.047
 differently-0.046
Token at
Token ID379
Feature activation-5.33e-01

Pos logit contributions
aband+0.440
ipel+0.405
 trap+0.383
opsis+0.380
hemer+0.360
Neg logit contributions
Sov-0.395
 plur-0.384
edin-0.378
repre-0.350
 differently-0.344
Token the
Token ID262
Feature activation-3.66e-01

Pos logit contributions
aband+0.422
ipel+0.386
 trap+0.366
opsis+0.362
hemer+0.342
Neg logit contributions
Sov-0.415
 plur-0.402
edin-0.398
repre-0.369
 differently-0.362
Token Franklin
Token ID14021
Feature activation+7.01e-01

Pos logit contributions
aband+0.290
ipel+0.266
 trap+0.252
opsis+0.249
hemer+0.235
Neg logit contributions
Sov-0.284
 plur-0.275
edin-0.272
repre-0.253
 differently-0.247
Token Institute
Token ID5136
Feature activation+2.79e+00

Pos logit contributions
Sov+0.549
 plur+0.533
edin+0.527
repre+0.489
 differently+0.479
Neg logit contributions
aband-0.552
ipel-0.505
 trap-0.478
opsis-0.474
hemer-0.446
Token.
Token ID13
Feature activation-4.04e-01

Pos logit contributions
Sov+1.855
 plur+1.809
edin+1.762
 differently+1.628
repre+1.625
Neg logit contributions
aband-2.495
ipel-2.296
opsis-2.162
 trap-2.157
��-2.073
Token While
Token ID2893
Feature activation-1.32e+00

Pos logit contributions
aband+0.287
ipel+0.260
 trap+0.245
opsis+0.242
hemer+0.226
Neg logit contributions
Sov-0.347
 plur-0.338
edin-0.334
repre-0.313
 differently-0.306
Token working
Token ID1762
Feature activation+5.55e-01

Pos logit contributions
aband+1.079
ipel+0.991
 trap+0.952
opsis+0.932
hemer+0.883
Neg logit contributions
Sov-0.983
 plur-0.944
edin-0.937
repre-0.872
 differently-0.841
Token on
Token ID319
Feature activation+7.78e-01

Pos logit contributions
Sov+0.352
 plur+0.340
edin+0.335
repre+0.305
 differently+0.297
Neg logit contributions
aband-0.519
ipel-0.482
 trap-0.460
opsis-0.457
hemer-0.435
Token the
Token ID262
Feature activation-3.01e-01

Pos logit contributions
Sov+0.570
 plur+0.554
edin+0.545
repre+0.505
 differently+0.495
Neg logit contributions
aband-0.652
ipel-0.599
 trap-0.567
opsis-0.565
hemer-0.535
Token curly
Token ID45731
Feature activation+5.15e-01

Pos logit contributions
aband+0.039
ipel+0.036
 trap+0.034
opsis+0.034
hemer+0.032
Neg logit contributions
Sov-0.038
 plur-0.037
edin-0.037
repre-0.034
 differently-0.033
Token braces
Token ID47241
Feature activation+2.50e-01

Pos logit contributions
Sov+0.430
 plur+0.418
edin+0.414
repre+0.386
 differently+0.379
Neg logit contributions
aband-0.378
ipel-0.344
 trap-0.324
opsis-0.321
hemer-0.301
Token though
Token ID996
Feature activation-4.53e-01

Pos logit contributions
Sov+0.183
 plur+0.175
edin+0.174
repre+0.161
 differently+0.155
Neg logit contributions
aband-0.210
ipel-0.194
 trap-0.185
opsis-0.183
hemer-0.172
Token,
Token ID11
Feature activation+8.46e-02

Pos logit contributions
aband+0.353
ipel+0.324
 trap+0.307
opsis+0.303
hemer+0.286
Neg logit contributions
Sov-0.359
 plur-0.346
edin-0.343
repre-0.319
 differently-0.310
Token in
Token ID287
Feature activation+8.96e-01

Pos logit contributions
Sov+0.058
 plur+0.055
edin+0.054
repre+0.050
 differently+0.048
Neg logit contributions
aband-0.075
ipel-0.070
 trap-0.067
opsis-0.066
hemer-0.063
Token order
Token ID1502
Feature activation+2.79e+00

Pos logit contributions
Sov+0.670
 plur+0.647
edin+0.641
repre+0.593
 differently+0.579
Neg logit contributions
aband-0.737
ipel-0.677
 trap-0.644
opsis-0.638
hemer-0.602
Token to
Token ID284
Feature activation+7.31e-01

Pos logit contributions
Sov+1.714
 plur+1.659
edin+1.633
repre+1.483
 differently+1.456
Neg logit contributions
aband-2.661
ipel-2.466
 trap-2.348
opsis-2.332
hemer-2.227
Token differentiate
Token ID28754
Feature activation+4.85e-02

Pos logit contributions
Sov+0.586
 plur+0.569
edin+0.563
repre+0.524
 differently+0.513
Neg logit contributions
aband-0.561
ipel-0.512
 trap-0.485
opsis-0.480
hemer-0.451
Token between
Token ID1022
Feature activation+5.67e-01

Pos logit contributions
Sov+0.032
 plur+0.031
edin+0.031
repre+0.028
 differently+0.027
Neg logit contributions
aband-0.044
ipel-0.041
 trap-0.039
opsis-0.039
hemer-0.037
Token string
Token ID4731
Feature activation+3.77e-01

Pos logit contributions
Sov+0.437
 plur+0.422
edin+0.419
repre+0.388
 differently+0.379
Neg logit contributions
aband-0.454
ipel-0.416
 trap-0.395
opsis-0.390
hemer-0.369
Token liter
Token ID4187
Feature activation+3.40e-01

Pos logit contributions
Sov+0.284
 plur+0.276
edin+0.272
repre+0.252
 differently+0.247
Neg logit contributions
aband-0.308
ipel-0.283
 trap-0.268
opsis-0.266
hemer-0.251
Token CNN
Token ID8100
Feature activation+1.10e+00

Pos logit contributions
Sov+0.033
 plur+0.032
edin+0.032
repre+0.030
 differently+0.029
Neg logit contributions
aband-0.026
ipel-0.024
 trap-0.022
opsis-0.022
hemer-0.021
Token reports
Token ID3136
Feature activation+1.60e+00

Pos logit contributions
Sov+0.865
 plur+0.842
edin+0.836
repre+0.771
 differently+0.762
Neg logit contributions
aband-0.859
ipel-0.783
 trap-0.742
opsis-0.736
hemer-0.693
Token two
Token ID734
Feature activation+2.17e-01

Pos logit contributions
Sov+1.078
 plur+1.044
edin+1.029
repre+0.940
 differently+0.930
Neg logit contributions
aband-1.434
ipel-1.325
 trap-1.259
opsis-1.254
hemer-1.195
Token people
Token ID661
Feature activation+1.09e+00

Pos logit contributions
Sov+0.186
 plur+0.181
edin+0.179
repre+0.167
 differently+0.164
Neg logit contributions
aband-0.155
ipel-0.141
 trap-0.132
opsis-0.131
hemer-0.122
Token are
Token ID389
Feature activation-3.76e-02

Pos logit contributions
Sov+0.779
 plur+0.758
edin+0.749
repre+0.685
 differently+0.682
Neg logit contributions
aband-0.928
ipel-0.856
 trap-0.808
opsis-0.805
hemer-0.766
Token confirmed
Token ID4999
Feature activation+2.77e+00

Pos logit contributions
aband+0.027
ipel+0.025
 trap+0.023
opsis+0.023
hemer+0.021
Neg logit contributions
Sov-0.032
 plur-0.031
edin-0.031
repre-0.029
 differently-0.028
Token dead
Token ID2636
Feature activation+6.96e-01

Pos logit contributions
Sov+2.524
 plur+2.480
edin+2.442
repre+2.290
 differently+2.284
Neg logit contributions
aband-1.816
ipel-1.635
 trap-1.516
opsis-1.509
hemer-1.402
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
Sov+0.489
 plur+0.473
edin+0.466
repre+0.430
 differently+0.418
Neg logit contributions
aband-0.604
ipel-0.557
 trap-0.531
opsis-0.526
hemer-0.499
Token including
Token ID1390
Feature activation+8.07e-01

Pos logit contributions
Sov+0.951
 plur+0.924
edin+0.913
repre+0.840
 differently+0.826
Neg logit contributions
aband-1.121
ipel-1.032
 trap-0.978
opsis-0.973
hemer-0.923
Token an
Token ID281
Feature activation-4.27e-01

Pos logit contributions
Sov+0.585
 plur+0.568
edin+0.560
repre+0.517
 differently+0.507
Neg logit contributions
aband-0.681
ipel-0.628
 trap-0.595
opsis-0.592
hemer-0.561
Token 8
Token ID807
Feature activation-3.06e-01

Pos logit contributions
aband+0.293
ipel+0.264
 trap+0.248
opsis+0.245
hemer+0.228
Neg logit contributions
Sov-0.378
 plur-0.367
edin-0.364
repre-0.341
 differently-0.334
Tokenк
Token ID31583
Feature activation-6.69e-01

Pos logit contributions
Sov+0.444
 plur+0.425
edin+0.417
repre+0.372
 differently+0.360
Neg logit contributions
aband-0.870
ipel-0.814
 trap-0.782
opsis-0.777
hemer-0.744
Tokenа
Token ID16142
Feature activation-5.32e-01

Pos logit contributions
aband+0.672
ipel+0.628
 trap+0.601
opsis+0.596
hemer+0.571
Neg logit contributions
Sov-0.377
 plur-0.363
edin-0.357
repre-0.322
 differently-0.311
Token
Token ID141
Feature activation-7.08e-01

Pos logit contributions
aband+0.538
ipel+0.504
 trap+0.483
opsis+0.479
hemer+0.459
Neg logit contributions
Sov-0.294
 plur-0.282
edin-0.278
repre-0.252
 differently-0.242
Token
Token ID227
Feature activation-2.76e-01

Pos logit contributions
aband+0.348
ipel+0.300
 trap+0.273
opsis+0.269
hemer+0.241
Neg logit contributions
Sov-0.764
 plur-0.747
edin-0.741
repre-0.703
 differently-0.693
Token»
Token ID17730
Feature activation+9.93e-01

Pos logit contributions
aband+0.251
ipel+0.232
 trap+0.222
opsis+0.220
hemer+0.209
Neg logit contributions
Sov-0.182
 plur-0.176
edin-0.174
repre-0.159
 differently-0.155
Token
Token ID220
Feature activation+2.77e+00

Pos logit contributions
Sov+0.676
 plur+0.653
edin+0.644
repre+0.590
 differently+0.578
Neg logit contributions
aband-0.885
ipel-0.817
 trap-0.779
opsis-0.773
hemer-0.734
Tokenс
Token ID21727
Feature activation+7.54e-01

Pos logit contributions
Sov+1.363
 plur+1.293
edin+1.267
repre+1.104
 differently+1.075
Neg logit contributions
aband-2.998
ipel-2.799
 trap-2.706
opsis-2.683
uggets-2.562
Token
Token ID12466
Feature activation-1.27e-02

Pos logit contributions
Sov+0.411
 plur+0.393
edin+0.387
repre+0.346
 differently+0.336
Neg logit contributions
aband-0.774
ipel-0.723
 trap-0.694
opsis-0.690
hemer-0.660
Token
Token ID123
Feature activation+1.23e+00

Pos logit contributions
aband+0.009
ipel+0.008
 trap+0.008
opsis+0.007
hemer+0.007
Neg logit contributions
Sov-0.011
 plur-0.011
edin-0.011
repre-0.010
 differently-0.010
Tokenр
Token ID21169
Feature activation+4.99e-01

Pos logit contributions
Sov+0.689
 plur+0.661
edin+0.651
repre+0.583
 differently+0.566
Neg logit contributions
aband-1.247
ipel-1.164
 trap-1.118
opsis-1.111
hemer-1.059
Tokenо
Token ID15166
Feature activation+1.44e+00

Pos logit contributions
Sov+0.247
 plur+0.236
edin+0.232
repre+0.204
 differently+0.197
Neg logit contributions
aband-0.536
ipel-0.503
 trap-0.484
opsis-0.480
hemer-0.459
Token U
Token ID471
Feature activation+2.45e-01

Pos logit contributions
aband+0.077
ipel+0.071
 trap+0.067
opsis+0.066
hemer+0.063
Neg logit contributions
Sov-0.070
 plur-0.068
edin-0.067
repre-0.062
 differently-0.061
Token.
Token ID13
Feature activation+4.69e-01

Pos logit contributions
Sov+0.173
 plur+0.168
edin+0.166
repre+0.153
 differently+0.149
Neg logit contributions
aband-0.211
ipel-0.194
 trap-0.185
opsis-0.183
hemer-0.174
TokenS
Token ID50
Feature activation+2.02e-01

Pos logit contributions
Sov+0.397
 plur+0.386
edin+0.382
repre+0.357
 differently+0.350
Neg logit contributions
aband-0.340
ipel-0.309
 trap-0.291
opsis-0.287
hemer-0.269
Token.
Token ID13
Feature activation+2.19e-01

Pos logit contributions
Sov+0.142
 plur+0.137
edin+0.136
repre+0.125
 differently+0.122
Neg logit contributions
aband-0.175
ipel-0.161
 trap-0.153
opsis-0.152
hemer-0.144
Token Cons
Token ID3515
Feature activation+1.09e+00

Pos logit contributions
Sov+0.159
 plur+0.154
edin+0.152
repre+0.140
 differently+0.138
Neg logit contributions
aband-0.185
ipel-0.170
 trap-0.161
opsis-0.161
hemer-0.152
Tokenulate
Token ID5039
Feature activation-3.86e+00

Pos logit contributions
Sov+0.676
 plur+0.648
edin+0.641
repre+0.583
 differently+0.565
Neg logit contributions
aband-1.021
ipel-0.949
 trap-0.913
opsis-0.905
hemer-0.862
Token in
Token ID287
Feature activation-5.14e-01

Pos logit contributions
aband+3.389
ipel+3.149
 trap+3.070
opsis+2.978
 sleep+2.809
Neg logit contributions
Sov-2.613
 plur-2.504
edin-2.453
repre-2.271
 differently-2.168
Token Ad
Token ID1215
Feature activation+1.09e+00

Pos logit contributions
aband+0.402
ipel+0.368
 trap+0.347
opsis+0.345
hemer+0.325
Neg logit contributions
Sov-0.404
 plur-0.393
edin-0.389
repre-0.361
 differently-0.354
Tokenana
Token ID2271
Feature activation-9.41e-01

Pos logit contributions
Sov+0.870
edin+0.855
 plur+0.845
repre+0.782
 differently+0.759
Neg logit contributions
aband-0.830
ipel-0.760
 trap-0.720
opsis-0.708
hemer-0.667
Token and
Token ID290
Feature activation-6.27e-02

Pos logit contributions
aband+0.773
ipel+0.714
 trap+0.679
opsis+0.672
hemer+0.632
Neg logit contributions
Sov-0.704
 plur-0.681
edin-0.670
repre-0.625
 differently-0.605
Token the
Token ID262
Feature activation+3.14e-01

Pos logit contributions
aband+0.048
ipel+0.044
 trap+0.041
opsis+0.041
hemer+0.039
Neg logit contributions
Sov-0.051
 plur-0.049
edin-0.049
repre-0.045
 differently-0.045
Token
Token ID447
Feature activation+3.45e-02

Pos logit contributions
aband+0.462
ipel+0.424
 trap+0.404
opsis+0.398
hemer+0.376
Neg logit contributions
Sov-0.425
 plur-0.410
edin-0.407
repre-0.375
 differently-0.366
Token
Token ID247
Feature activation+3.81e-02

Pos logit contributions
Sov+0.040
 plur+0.039
edin+0.039
repre+0.037
 differently+0.036
Neg logit contributions
aband-0.015
ipel-0.012
 trap-0.011
opsis-0.011
hemer-0.009
Tokens
Token ID82
Feature activation+4.91e-01

Pos logit contributions
Sov+0.028
 plur+0.027
edin+0.027
repre+0.025
 differently+0.024
Neg logit contributions
aband-0.032
ipel-0.029
 trap-0.028
opsis-0.027
hemer-0.026
Token US
Token ID1294
Feature activation-5.48e-02

Pos logit contributions
Sov+0.396
 plur+0.386
edin+0.380
repre+0.355
 differently+0.349
Neg logit contributions
aband-0.374
ipel-0.343
 trap-0.322
opsis-0.321
hemer-0.301
Token Cons
Token ID3515
Feature activation+6.29e-01

Pos logit contributions
aband+0.044
ipel+0.041
 trap+0.038
opsis+0.038
hemer+0.036
Neg logit contributions
Sov-0.042
 plur-0.041
edin-0.040
repre-0.037
 differently-0.036
Tokenulate
Token ID5039
Feature activation-3.39e+00

Pos logit contributions
Sov+0.402
 plur+0.387
edin+0.382
repre+0.348
 differently+0.339
Neg logit contributions
aband-0.585
ipel-0.543
 trap-0.519
opsis-0.516
hemer-0.491
Token to
Token ID284
Feature activation-3.93e-01

Pos logit contributions
aband+2.915
ipel+2.721
 trap+2.615
opsis+2.555
hemer+2.425
Neg logit contributions
Sov-2.347
 plur-2.242
edin-2.216
repre-2.023
 differently-1.962
Token protest
Token ID5402
Feature activation-5.77e-03

Pos logit contributions
aband+0.291
ipel+0.265
 trap+0.249
opsis+0.247
hemer+0.232
Neg logit contributions
Sov-0.327
 plur-0.318
edin-0.314
repre-0.293
 differently-0.288
Token Trump
Token ID1301
Feature activation+9.31e-01

Pos logit contributions
aband+0.004
ipel+0.004
 trap+0.004
opsis+0.004
hemer+0.004
Neg logit contributions
Sov-0.005
 plur-0.005
edin-0.005
repre-0.004
 differently-0.004
Token
Token ID447
Feature activation-5.59e-01

Pos logit contributions
Sov+0.710
 plur+0.689
edin+0.680
repre+0.632
 differently+0.618
Neg logit contributions
aband-0.752
ipel-0.689
 trap-0.653
opsis-0.647
hemer-0.611
Token
Token ID247
Feature activation-1.36e-01

Pos logit contributions
aband+0.221
ipel+0.185
 trap+0.163
opsis+0.159
hemer+0.138
Neg logit contributions
Sov-0.655
 plur-0.642
edin-0.639
repre-0.610
 differently-0.601
Token two
Token ID734
Feature activation-6.78e-01

Pos logit contributions
aband+0.196
ipel+0.178
 trap+0.169
opsis+0.167
hemer+0.157
Neg logit contributions
Sov-0.215
 plur-0.209
edin-0.207
repre-0.193
 differently-0.189
Token federal
Token ID2717
Feature activation-5.14e-01

Pos logit contributions
aband+0.485
ipel+0.440
 trap+0.414
opsis+0.409
hemer+0.383
Neg logit contributions
Sov-0.580
 plur-0.564
edin-0.558
repre-0.522
 differently-0.511
Token PACs
Token ID43838
Feature activation-3.84e-01

Pos logit contributions
aband+0.382
ipel+0.348
 trap+0.327
opsis+0.325
hemer+0.305
Neg logit contributions
Sov-0.424
 plur-0.413
edin-0.408
repre-0.381
 differently-0.374
Token
Token ID960
Feature activation-1.44e-01

Pos logit contributions
aband+0.322
ipel+0.297
 trap+0.283
opsis+0.280
hemer+0.265
Neg logit contributions
Sov-0.281
 plur-0.272
edin-0.268
repre-0.248
 differently-0.242
TokenLiber
Token ID31199
Feature activation-1.16e-01

Pos logit contributions
aband+0.114
ipel+0.104
 trap+0.098
opsis+0.097
hemer+0.092
Neg logit contributions
Sov-0.113
 plur-0.110
edin-0.108
repre-0.101
 differently-0.099
Tokenty
Token ID774
Feature activation-3.24e+00

Pos logit contributions
aband+0.113
ipel+0.105
 trap+0.100
opsis+0.100
hemer+0.095
Neg logit contributions
Sov-0.070
 plur-0.067
edin-0.066
repre-0.060
 differently-0.058
Token 2
Token ID362
Feature activation-4.61e-01

Pos logit contributions
aband+2.639
ipel+2.399
 trap+2.290
opsis+2.266
uggets+2.131
Neg logit contributions
Sov-2.470
 plur-2.392
edin-2.351
repre-2.176
 differently-2.111
Token.
Token ID13
Feature activation-1.24e-01

Pos logit contributions
aband+0.388
ipel+0.357
 trap+0.339
opsis+0.336
hemer+0.318
Neg logit contributions
Sov-0.336
 plur-0.325
edin-0.321
repre-0.297
 differently-0.290
Token0
Token ID15
Feature activation-4.00e-01

Pos logit contributions
aband+0.107
ipel+0.098
 trap+0.094
opsis+0.093
hemer+0.088
Neg logit contributions
Sov-0.088
 plur-0.085
edin-0.084
repre-0.078
 differently-0.075
Token and
Token ID290
Feature activation-4.08e-01

Pos logit contributions
aband+0.323
ipel+0.297
 trap+0.281
opsis+0.278
hemer+0.263
Neg logit contributions
Sov-0.304
 plur-0.295
edin-0.291
repre-0.270
 differently-0.265
Token Oklahoma
Token ID10433
Feature activation+7.21e-01

Pos logit contributions
aband+0.319
ipel+0.292
 trap+0.276
opsis+0.274
hemer+0.258
Neg logit contributions
Sov-0.322
 plur-0.312
edin-0.309
repre-0.287
 differently-0.281
Token
Token ID247
Feature activation+1.28e-01

Pos logit contributions
aband+0.264
ipel+0.231
 trap+0.213
opsis+0.209
hemer+0.192
Neg logit contributions
Sov-0.488
 plur-0.477
edin-0.473
repre-0.447
 differently-0.440
Tokenve
Token ID303
Feature activation-4.90e-01

Pos logit contributions
Sov+0.095
 plur+0.092
edin+0.091
repre+0.084
 differently+0.082
Neg logit contributions
aband-0.107
ipel-0.098
 trap-0.093
opsis-0.092
hemer-0.087
Token seen
Token ID1775
Feature activation+1.07e-01

Pos logit contributions
aband+0.359
ipel+0.327
 trap+0.307
opsis+0.305
hemer+0.286
Neg logit contributions
Sov-0.410
 plur-0.400
edin-0.395
repre-0.369
 differently-0.362
Token it
Token ID340
Feature activation+5.14e-02

Pos logit contributions
Sov+0.076
 plur+0.074
edin+0.073
repre+0.067
 differently+0.065
Neg logit contributions
aband-0.091
ipel-0.084
 trap-0.080
opsis-0.079
hemer-0.075
Token so
Token ID523
Feature activation-6.82e-01

Pos logit contributions
Sov+0.037
 plur+0.036
edin+0.035
repre+0.033
 differently+0.032
Neg logit contributions
aband-0.044
ipel-0.040
 trap-0.039
opsis-0.038
hemer-0.036
Token far
Token ID1290
Feature activation-3.18e+00

Pos logit contributions
aband+0.576
ipel+0.531
 trap+0.505
opsis+0.500
hemer+0.473
Neg logit contributions
Sov-0.495
 plur-0.479
edin-0.474
repre-0.437
 differently-0.427
Token.
Token ID13
Feature activation+3.73e-01

Pos logit contributions
aband+2.635
ipel+2.443
 trap+2.338
opsis+2.305
hemer+2.183
Neg logit contributions
Sov-2.345
 plur-2.243
edin-2.217
repre-2.056
 differently-1.976
Token
Token ID198
Feature activation-6.23e-01

Pos logit contributions
Sov+0.294
 plur+0.285
edin+0.283
repre+0.262
 differently+0.257
Neg logit contributions
aband-0.291
ipel-0.266
 trap-0.252
opsis-0.249
hemer-0.235
Token
Token ID198
Feature activation-4.25e-01

Pos logit contributions
aband+0.506
ipel+0.463
 trap+0.442
opsis+0.436
hemer+0.412
Neg logit contributions
Sov-0.472
 plur-0.458
edin-0.451
repre-0.419
 differently-0.408
TokenPA
Token ID4537
Feature activation+5.72e-01

Pos logit contributions
aband+0.300
ipel+0.271
 trap+0.255
opsis+0.252
hemer+0.236
Neg logit contributions
Sov-0.368
 plur-0.358
edin-0.355
repre-0.332
 differently-0.326
TokenX
Token ID55
Feature activation-2.39e-03

Pos logit contributions
Sov+0.444
 plur+0.428
edin+0.423
repre+0.391
 differently+0.383
Neg logit contributions
aband-0.454
ipel-0.417
 trap-0.395
opsis-0.388
hemer-0.368
Token has
Token ID468
Feature activation+1.17e-02

Pos logit contributions
aband+0.837
ipel+0.770
 trap+0.737
opsis+0.727
hemer+0.685
Neg logit contributions
Sov-0.743
 plur-0.715
edin-0.705
repre-0.656
 differently-0.632
Token repeatedly
Token ID7830
Feature activation-6.85e-01

Pos logit contributions
Sov+0.009
 plur+0.009
edin+0.009
repre+0.008
 differently+0.008
Neg logit contributions
aband-0.009
ipel-0.009
 trap-0.008
opsis-0.008
hemer-0.008
Token criticized
Token ID12318
Feature activation-1.34e+00

Pos logit contributions
aband+0.512
ipel+0.466
 trap+0.444
opsis+0.436
hemer+0.409
Neg logit contributions
Sov-0.564
 plur-0.545
edin-0.540
repre-0.504
 differently-0.490
Token President
Token ID1992
Feature activation-1.25e-01

Pos logit contributions
aband+1.000
ipel+0.909
 trap+0.861
opsis+0.849
hemer+0.797
Neg logit contributions
Sov-1.110
 plur-1.077
edin-1.068
repre-0.995
 differently-0.973
Token Trump
Token ID1301
Feature activation-5.76e-01

Pos logit contributions
aband+0.080
ipel+0.071
 trap+0.067
opsis+0.066
hemer+0.061
Neg logit contributions
Sov-0.116
 plur-0.114
edin-0.112
repre-0.106
 differently-0.104
Token Donald
Token ID3759
Feature activation-3.02e+00

Pos logit contributions
aband+0.485
ipel+0.446
 trap+0.426
opsis+0.421
hemer+0.399
Neg logit contributions
Sov-0.420
 plur-0.405
edin-0.400
repre-0.369
 differently-0.359
Token John
Token ID1757
Feature activation-4.61e-01

Pos logit contributions
aband+1.834
ipel+1.631
 trap+1.521
opsis+1.501
hemer+1.382
Neg logit contributions
Sov-2.909
 plur-2.837
edin-2.812
repre-2.650
 differently-2.605
Token Trump
Token ID1301
Feature activation-1.39e+00

Pos logit contributions
aband+0.339
ipel+0.311
opsis+0.286
 trap+0.285
hemer+0.269
Neg logit contributions
Sov-0.384
 plur-0.375
edin-0.373
repre-0.345
 differently-0.343
TokenHouse
Token ID18102
Feature activation-2.10e+00

Pos logit contributions
aband+1.135
ipel+1.036
 trap+0.989
opsis+0.980
hemer+0.930
Neg logit contributions
Sov-1.035
 plur-1.009
edin-0.998
repre-0.920
 differently-0.904
Token committee
Token ID5583
Feature activation-9.06e-01

Pos logit contributions
aband+1.373
ipel+1.232
 trap+1.176
opsis+1.150
��+1.073
Neg logit contributions
Sov-1.903
edin-1.867
 plur-1.860
repre-1.715
 differently-1.687
Token believes
Token ID5804
Feature activation-7.19e-01

Pos logit contributions
aband+0.525
ipel+0.470
 trap+0.445
opsis+0.436
��+0.400
Neg logit contributions
Sov-0.892
 plur-0.865
edin-0.860
repre-0.806
 Modified-0.788
Tokenyoutube
Token ID11604
Feature activation-5.97e-01

Pos logit contributions
Sov+0.045
 plur+0.044
edin+0.044
repre+0.041
 differently+0.041
Neg logit contributions
aband-0.023
ipel-0.020
 trap-0.018
opsis-0.018
hemer-0.016
Token.
Token ID13
Feature activation+6.16e-01

Pos logit contributions
aband+0.513
ipel+0.473
 trap+0.451
opsis+0.447
hemer+0.423
Neg logit contributions
Sov-0.424
 plur-0.410
edin-0.404
repre-0.373
 differently-0.364
Tokencom
Token ID785
Feature activation+8.29e-01

Pos logit contributions
Sov+0.488
 plur+0.472
edin+0.469
repre+0.435
 differently+0.426
Neg logit contributions
aband-0.478
ipel-0.438
 trap-0.415
opsis-0.409
hemer-0.386
Token/
Token ID14
Feature activation-1.65e-01

Pos logit contributions
Sov+0.578
 plur+0.560
edin+0.554
repre+0.508
 differently+0.497
Neg logit contributions
aband-0.722
ipel-0.668
 trap-0.632
opsis-0.629
hemer-0.599
Tokenwatch
Token ID8340
Feature activation+4.16e-05

Pos logit contributions
aband+0.126
ipel+0.114
 trap+0.108
opsis+0.106
hemer+0.100
Neg logit contributions
Sov-0.134
 plur-0.130
edin-0.128
repre-0.119
 differently-0.118
Token?
Token ID30
Feature activation-3.02e+00

Pos logit contributions
Sov+0.000
 plur+0.000
edin+0.000
repre+0.000
 differently+0.000
Neg logit contributions
aband-0.000
ipel-0.000
 trap-0.000
opsis-0.000
hemer-0.000
Tokenv
Token ID85
Feature activation-6.71e-01

Pos logit contributions
aband+2.282
ipel+2.076
 trap+1.977
opsis+1.944
hemer+1.815
Neg logit contributions
Sov-2.438
 plur-2.392
edin-2.310
repre-2.154
 differently-2.118
Token=
Token ID28
Feature activation+8.64e-02

Pos logit contributions
aband+0.492
ipel+0.451
 trap+0.422
opsis+0.418
hemer+0.390
Neg logit contributions
Sov-0.563
 plur-0.546
edin-0.536
repre-0.499
 differently-0.492
Token2
Token ID17
Feature activation+3.17e-01

Pos logit contributions
Sov+0.067
 plur+0.065
edin+0.064
repre+0.059
 differently+0.058
Neg logit contributions
aband-0.069
ipel-0.063
 trap-0.060
opsis-0.059
hemer-0.056
Tokeng
Token ID70
Feature activation+8.82e-01

Pos logit contributions
Sov+0.248
 plur+0.241
edin+0.238
repre+0.221
 differently+0.217
Neg logit contributions
aband-0.250
ipel-0.228
 trap-0.216
opsis-0.214
hemer-0.202
Token7
Token ID22
Feature activation+9.69e-01

Pos logit contributions
Sov+0.675
 plur+0.651
edin+0.646
repre+0.598
 differently+0.582
Neg logit contributions
aband-0.711
ipel-0.652
 trap-0.620
opsis-0.613
hemer-0.579
Token10
Token ID940
Feature activation-5.71e-01

Pos logit contributions
aband+0.489
ipel+0.451
 trap+0.434
opsis+0.431
hemer+0.412
Neg logit contributions
Sov-0.290
 plur-0.276
edin-0.273
repre-0.247
 differently-0.240
Token.
Token ID13
Feature activation-7.89e-01

Pos logit contributions
aband+0.477
ipel+0.436
 trap+0.418
opsis+0.414
hemer+0.391
Neg logit contributions
Sov-0.419
 plur-0.405
edin-0.398
repre-0.374
 differently-0.360
Token13
Token ID1485
Feature activation-6.59e-02

Pos logit contributions
aband+0.732
ipel+0.667
 trap+0.646
opsis+0.643
hemer+0.609
Neg logit contributions
Sov-0.505
 plur-0.487
edin-0.479
repre-0.442
 differently-0.423
Token71
Token ID4869
Feature activation+1.98e-01

Pos logit contributions
aband+0.052
ipel+0.046
 trap+0.045
opsis+0.043
hemer+0.041
Neg logit contributions
Sov-0.053
 plur-0.050
edin-0.049
repre-0.047
 differently-0.045
Token/
Token ID14
Feature activation+9.51e-01

Pos logit contributions
Sov+0.140
 plur+0.135
edin+0.133
repre+0.122
 differently+0.120
Neg logit contributions
aband-0.171
ipel-0.159
 trap-0.150
opsis-0.149
hemer-0.142
Tokenjournal
Token ID24891
Feature activation-3.01e+00

Pos logit contributions
Sov+0.842
 plur+0.821
edin+0.812
repre+0.759
 differently+0.749
Neg logit contributions
aband-0.650
ipel-0.587
 trap-0.551
opsis-0.544
hemer-0.508
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
aband+2.421
ipel+2.232
opsis+2.109
 trap+2.103
hemer+1.978
Neg logit contributions
Sov-2.290
 plur-2.220
edin-2.159
repre-2.059
 differently-1.979
Tokenp
Token ID79
Feature activation-4.41e-01

Pos logit contributions
aband+0.771
ipel+0.696
opsis+0.655
 trap+0.653
hemer+0.600
Neg logit contributions
Sov-0.967
 plur-0.932
edin-0.922
repre-0.865
 differently-0.855
Tokenone
Token ID505
Feature activation-1.20e+00

Pos logit contributions
aband+0.387
ipel+0.358
 trap+0.338
opsis+0.336
hemer+0.316
Neg logit contributions
Sov-0.313
 plur-0.297
edin-0.291
repre-0.271
 differently-0.268
Token.
Token ID13
Feature activation-7.13e-02

Pos logit contributions
aband+1.033
ipel+0.952
 trap+0.907
opsis+0.899
hemer+0.851
Neg logit contributions
Sov-0.857
 plur-0.830
edin-0.818
repre-0.756
 differently-0.737
Token00
Token ID405
Feature activation-5.37e-01

Pos logit contributions
aband+0.044
ipel+0.040
 trap+0.037
opsis+0.036
hemer+0.034
Neg logit contributions
Sov-0.067
 plur-0.066
edin-0.065
repre-0.061
 differently-0.060
Token the
Token ID262
Feature activation+1.63e-01

Pos logit contributions
aband+0.263
ipel+0.241
 trap+0.229
opsis+0.226
hemer+0.214
Neg logit contributions
Sov-0.246
 plur-0.239
edin-0.236
repre-0.219
 differently-0.214
Token aid
Token ID6133
Feature activation+1.36e+00

Pos logit contributions
Sov+0.124
 plur+0.120
edin+0.119
repre+0.110
 differently+0.107
Neg logit contributions
aband-0.132
ipel-0.121
 trap-0.115
opsis-0.114
hemer-0.107
Token of
Token ID286
Feature activation-7.77e-02

Pos logit contributions
Sov+0.910
 plur+0.888
edin+0.873
repre+0.798
 differently+0.792
Neg logit contributions
aband-1.215
ipel-1.125
 trap-1.057
opsis-1.054
hemer-1.012
Token Victor
Token ID12622
Feature activation+2.54e-01

Pos logit contributions
aband+0.061
ipel+0.056
 trap+0.053
opsis+0.053
hemer+0.050
Neg logit contributions
Sov-0.061
 plur-0.059
edin-0.058
repre-0.054
 differently-0.053
Token L
Token ID406
Feature activation-6.31e-01

Pos logit contributions
Sov+0.202
 plur+0.196
edin+0.194
repre+0.180
 differently+0.177
Neg logit contributions
aband-0.197
ipel-0.180
 trap-0.170
opsis-0.169
hemer-0.159
Tokenown
Token ID593
Feature activation-3.01e+00

Pos logit contributions
aband+0.510
ipel+0.468
 trap+0.442
opsis+0.439
hemer+0.415
Neg logit contributions
Sov-0.483
 plur-0.467
edin-0.460
repre-0.428
 differently-0.420
Tokenes
Token ID274
Feature activation-1.16e+00

Pos logit contributions
aband+2.587
ipel+2.381
 trap+2.255
opsis+2.242
hemer+2.139
Neg logit contributions
Sov-2.128
 plur-2.052
edin-1.973
repre-1.869
 differently-1.805
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
aband+0.927
ipel+0.850
 trap+0.805
opsis+0.798
hemer+0.752
Neg logit contributions
Sov-0.890
 plur-0.863
edin-0.852
repre-0.793
 differently-0.773
Token the
Token ID262
Feature activation-3.39e-01

Pos logit contributions
aband+0.884
ipel+0.811
 trap+0.773
opsis+0.762
hemer+0.719
Neg logit contributions
Sov-0.824
 plur-0.795
edin-0.789
repre-0.731
 differently-0.711
Token crown
Token ID12389
Feature activation-4.61e-01

Pos logit contributions
aband+0.261
ipel+0.238
 trap+0.226
opsis+0.223
hemer+0.210
Neg logit contributions
Sov-0.272
 plur-0.264
edin-0.261
repre-0.243
 differently-0.238
Token prince
Token ID19716
Feature activation-1.73e-01

Pos logit contributions
aband+0.364
ipel+0.333
 trap+0.315
opsis+0.312
hemer+0.294
Neg logit contributions
Sov-0.359
 plur-0.349
edin-0.345
repre-0.320
 differently-0.314
Token ballot
Token ID11100
Feature activation-3.44e-01

Pos logit contributions
aband+0.473
ipel+0.432
 trap+0.412
opsis+0.407
hemer+0.380
Neg logit contributions
Sov-0.541
 plur-0.527
edin-0.526
repre-0.488
 differently-0.473
Token ended
Token ID4444
Feature activation-1.06e+00

Pos logit contributions
aband+0.289
ipel+0.266
 trap+0.254
opsis+0.251
hemer+0.238
Neg logit contributions
Sov-0.251
 plur-0.243
edin-0.240
repre-0.221
 differently-0.216
Token up
Token ID510
Feature activation+2.95e-01

Pos logit contributions
aband+0.835
ipel+0.763
 trap+0.731
opsis+0.715
hemer+0.676
Neg logit contributions
Sov-0.826
 plur-0.794
edin-0.785
repre-0.732
 differently-0.698
Token more
Token ID517
Feature activation-1.46e-02

Pos logit contributions
Sov+0.196
 plur+0.189
edin+0.186
repre+0.171
 differently+0.165
Neg logit contributions
aband-0.267
ipel-0.247
 trap-0.236
opsis-0.234
hemer-0.222
Token or
Token ID393
Feature activation+2.05e-01

Pos logit contributions
aband+0.012
ipel+0.011
 trap+0.011
opsis+0.011
hemer+0.010
Neg logit contributions
Sov-0.011
 plur-0.010
edin-0.010
repre-0.009
 differently-0.009
Token less
Token ID1342
Feature activation-3.01e+00

Pos logit contributions
Sov+0.116
 plur+0.111
edin+0.110
repre+0.098
 differently+0.097
Neg logit contributions
aband-0.205
ipel-0.192
 trap-0.184
opsis-0.183
hemer-0.175
Token mirror
Token ID10162
Feature activation+1.13e-01

Pos logit contributions
aband+2.484
ipel+2.292
 trap+2.223
opsis+2.158
hemer+2.050
Neg logit contributions
Sov-2.194
edin-2.101
 plur-2.086
repre-1.947
 Modified-1.840
Tokening
Token ID278
Feature activation+4.73e-01

Pos logit contributions
Sov+0.086
 plur+0.083
edin+0.082
repre+0.076
 differently+0.074
Neg logit contributions
aband-0.091
ipel-0.084
 trap-0.080
opsis-0.079
hemer-0.074
Token the
Token ID262
Feature activation-8.27e-02

Pos logit contributions
Sov+0.372
 plur+0.361
edin+0.357
repre+0.332
 differently+0.325
Neg logit contributions
aband-0.371
ipel-0.339
 trap-0.321
opsis-0.318
hemer-0.300
Token Rab
Token ID28559
Feature activation-1.82e-01

Pos logit contributions
aband+0.066
ipel+0.060
 trap+0.057
opsis+0.057
hemer+0.053
Neg logit contributions
Sov-0.064
 plur-0.062
edin-0.061
repre-0.057
 differently-0.056
Tokenid
Token ID312
Feature activation-6.15e-01

Pos logit contributions
aband+0.127
ipel+0.115
 trap+0.108
opsis+0.107
hemer+0.099
Neg logit contributions
Sov-0.160
 plur-0.155
edin-0.154
repre-0.144
 differently-0.141
Token\n
Token ID198
Feature activation-5.70e-01

Pos logit contributions
aband+0.850
ipel+0.773
 trap+0.733
opsis+0.720
hemer+0.677
Neg logit contributions
Sov-0.967
 plur-0.937
edin-0.927
repre-0.867
 differently-0.846
Token\n
Token ID198
Feature activation-3.24e-01

Pos logit contributions
aband+0.476
ipel+0.438
 trap+0.415
opsis+0.412
hemer+0.390
Neg logit contributions
Sov-0.421
 plur-0.405
edin-0.402
repre-0.370
 differently-0.363
TokenShe
Token ID3347
Feature activation-7.22e-01

Pos logit contributions
aband+0.248
ipel+0.226
 trap+0.214
opsis+0.212
hemer+0.199
Neg logit contributions
Sov-0.261
 plur-0.254
edin-0.251
repre-0.234
 differently-0.229
Token cuts
Token ID6630
Feature activation+6.78e-01

Pos logit contributions
aband+0.560
ipel+0.511
 trap+0.486
opsis+0.479
hemer+0.450
Neg logit contributions
Sov-0.575
 plur-0.557
edin-0.551
repre-0.513
 differently-0.501
Token me
Token ID502
Feature activation-2.29e-01

Pos logit contributions
Sov+0.541
 plur+0.524
edin+0.517
repre+0.482
 differently+0.468
Neg logit contributions
aband-0.525
ipel-0.480
 trap-0.457
opsis-0.449
hemer-0.423
Token off
Token ID572
Feature activation-3.00e+00

Pos logit contributions
aband+0.235
ipel+0.220
 trap+0.212
opsis+0.210
hemer+0.200
Neg logit contributions
Sov-0.127
 plur-0.119
edin-0.116
repre-0.105
 differently-0.099
Token by
Token ID416
Feature activation-4.93e-01

Pos logit contributions
aband+2.427
ipel+2.214
 trap+2.182
opsis+2.119
 sleep+2.010
Neg logit contributions
Sov-2.293
 plur-2.167
edin-2.140
repre-1.990
 Modified-1.885
Token enc
Token ID2207
Feature activation-4.43e-01

Pos logit contributions
aband+0.387
ipel+0.355
 trap+0.336
opsis+0.332
hemer+0.313
Neg logit contributions
Sov-0.386
 plur-0.375
edin-0.370
repre-0.344
 differently-0.337
Tokenir
Token ID343
Feature activation+1.32e-01

Pos logit contributions
aband+0.350
ipel+0.319
 trap+0.304
opsis+0.300
hemer+0.284
Neg logit contributions
Sov-0.352
 plur-0.337
edin-0.332
repre-0.309
 differently-0.303
Tokencling
Token ID8493
Feature activation+3.99e-01

Pos logit contributions
Sov+0.094
 plur+0.090
edin+0.089
repre+0.082
 differently+0.080
Neg logit contributions
aband-0.114
ipel-0.105
 trap-0.100
opsis-0.099
hemer-0.094
Token my
Token ID616
Feature activation-4.00e-01

Pos logit contributions
Sov+0.332
 plur+0.322
edin+0.319
repre+0.298
 differently+0.292
Neg logit contributions
aband-0.295
ipel-0.269
 trap-0.254
opsis-0.251
hemer-0.235
Token
Token ID447
Feature activation-4.24e-01

Pos logit contributions
aband+0.403
ipel+0.369
 trap+0.352
opsis+0.348
hemer+0.329
Neg logit contributions
Sov-0.370
 plur-0.357
edin-0.353
repre-0.327
 differently-0.319
Token
Token ID247
Feature activation-4.97e-01

Pos logit contributions
aband+0.170
ipel+0.142
 trap+0.125
opsis+0.123
hemer+0.105
Neg logit contributions
Sov-0.496
 plur-0.487
edin-0.484
repre-0.462
 differently-0.455
Tokens
Token ID82
Feature activation-5.10e-01

Pos logit contributions
aband+0.380
ipel+0.345
 trap+0.327
opsis+0.323
hemer+0.305
Neg logit contributions
Sov-0.403
 plur-0.389
edin-0.385
repre-0.356
 differently-0.350
Token undeniable
Token ID36332
Feature activation-6.81e-02

Pos logit contributions
aband+0.451
ipel+0.417
 trap+0.402
opsis+0.395
hemer+0.375
Neg logit contributions
Sov-0.348
 plur-0.334
edin-0.331
repre-0.303
 differently-0.294
Token how
Token ID703
Feature activation+3.05e-01

Pos logit contributions
aband+0.062
ipel+0.057
 trap+0.055
opsis+0.054
hemer+0.052
Neg logit contributions
Sov-0.045
 plur-0.043
edin-0.043
repre-0.039
 differently-0.038
Token well
Token ID880
Feature activation-3.00e+00

Pos logit contributions
Sov+0.230
 plur+0.223
edin+0.220
repre+0.204
 differently+0.200
Neg logit contributions
aband-0.249
ipel-0.229
 trap-0.217
opsis-0.215
hemer-0.203
Token Arduino
Token ID27634
Feature activation+1.40e-01

Pos logit contributions
aband+2.453
ipel+2.268
 trap+2.199
opsis+2.149
hemer+2.023
Neg logit contributions
Sov-2.235
 plur-2.110
edin-2.088
repre-1.945
 differently-1.865
Token has
Token ID468
Feature activation-1.41e-01

Pos logit contributions
Sov+0.106
 plur+0.102
edin+0.101
repre+0.094
 differently+0.092
Neg logit contributions
aband-0.115
ipel-0.105
 trap-0.100
opsis-0.099
hemer-0.094
Token enabled
Token ID9343
Feature activation+2.16e-01

Pos logit contributions
aband+0.121
ipel+0.111
 trap+0.106
opsis+0.105
hemer+0.099
Neg logit contributions
Sov-0.101
 plur-0.098
edin-0.097
repre-0.089
 differently-0.087
Token no
Token ID645
Feature activation+4.10e-01

Pos logit contributions
Sov+0.173
 plur+0.168
edin+0.166
repre+0.154
 differently+0.152
Neg logit contributions
aband-0.166
ipel-0.151
 trap-0.143
opsis-0.142
hemer-0.133
Tokenv
Token ID85
Feature activation+4.86e-01

Pos logit contributions
Sov+0.308
 plur+0.299
edin+0.295
repre+0.273
 differently+0.268
Neg logit contributions
aband-0.336
ipel-0.309
 trap-0.292
opsis-0.291
hemer-0.274
Token in
Token ID287
Feature activation+2.12e-01

Pos logit contributions
aband+0.111
ipel+0.103
 trap+0.098
opsis+0.097
hemer+0.091
Neg logit contributions
Sov-0.099
 plur-0.096
edin-0.095
repre-0.088
 differently-0.085
Token its
Token ID663
Feature activation-4.79e-01

Pos logit contributions
Sov+0.165
 plur+0.161
edin+0.158
repre+0.147
 differently+0.145
Neg logit contributions
aband-0.168
ipel-0.153
 trap-0.144
opsis-0.144
hemer-0.136
Token episode
Token ID4471
Feature activation+3.90e-01

Pos logit contributions
aband+0.366
ipel+0.334
 trap+0.314
opsis+0.312
hemer+0.294
Neg logit contributions
Sov-0.386
 plur-0.376
edin-0.370
repre-0.345
 differently-0.340
Token "
Token ID366
Feature activation+5.31e-01

Pos logit contributions
Sov+0.309
 plur+0.300
edin+0.296
repre+0.275
 differently+0.271
Neg logit contributions
aband-0.304
ipel-0.276
 trap-0.261
opsis-0.258
hemer-0.244
TokenSil
Token ID15086
Feature activation-6.59e-01

Pos logit contributions
Sov+0.398
 plur+0.383
edin+0.378
repre+0.350
 differently+0.343
Neg logit contributions
aband-0.438
ipel-0.402
 trap-0.381
opsis-0.378
hemer-0.357
Tokenent
Token ID298
Feature activation-2.98e+00

Pos logit contributions
aband+0.477
ipel+0.433
 trap+0.406
opsis+0.405
hemer+0.379
Neg logit contributions
Sov-0.559
 plur-0.544
edin-0.538
repre-0.504
 differently-0.494
Token Ro
Token ID5564
Feature activation+2.91e-02

Pos logit contributions
aband+2.399
ipel+2.207
 trap+2.101
opsis+2.087
hemer+1.960
Neg logit contributions
Sov-2.271
 plur-2.200
edin-2.184
repre-2.025
 differently-1.960
Tokenar
Token ID283
Feature activation-4.56e-01

Pos logit contributions
Sov+0.024
 plur+0.023
edin+0.023
repre+0.021
 differently+0.021
Neg logit contributions
aband-0.022
ipel-0.020
 trap-0.019
opsis-0.019
hemer-0.017
Token:
Token ID25
Feature activation+5.82e-01

Pos logit contributions
aband+0.387
ipel+0.356
 trap+0.339
opsis+0.336
hemer+0.318
Neg logit contributions
Sov-0.329
 plur-0.319
edin-0.315
repre-0.291
 differently-0.284
Token Search
Token ID11140
Feature activation+7.75e-01

Pos logit contributions
Sov+0.478
 plur+0.464
edin+0.458
repre+0.426
 differently+0.421
Neg logit contributions
aband-0.437
ipel-0.396
 trap-0.372
opsis-0.369
hemer-0.347
Tokening
Token ID278
Feature activation+2.17e-01

Pos logit contributions
Sov+0.565
 plur+0.549
edin+0.544
repre+0.499
 differently+0.491
Neg logit contributions
aband-0.648
ipel-0.597
 trap-0.567
opsis-0.561
hemer-0.533
Token when
Token ID618
Feature activation-1.63e-01

Pos logit contributions
Sov+0.032
 plur+0.030
edin+0.030
repre+0.028
 differently+0.027
Neg logit contributions
aband-0.035
ipel-0.032
 trap-0.031
opsis-0.030
hemer-0.029
Token a
Token ID257
Feature activation-1.43e+00

Pos logit contributions
aband+0.131
ipel+0.120
 trap+0.114
opsis+0.112
hemer+0.106
Neg logit contributions
Sov-0.125
 plur-0.122
edin-0.120
repre-0.111
 differently-0.109
Token TC
Token ID17283
Feature activation+3.07e-01

Pos logit contributions
aband+1.055
ipel+0.961
 trap+0.917
opsis+0.896
hemer+0.842
Neg logit contributions
Sov-1.186
 plur-1.149
edin-1.140
repre-1.062
 differently-1.036
TokenRI
Token ID7112
Feature activation+7.08e-01

Pos logit contributions
Sov+0.243
 plur+0.236
edin+0.233
repre+0.217
 differently+0.213
Neg logit contributions
aband-0.238
ipel-0.218
 trap-0.206
opsis-0.204
hemer-0.192
Token truck
Token ID7779
Feature activation-1.10e+00

Pos logit contributions
Sov+0.577
 plur+0.562
edin+0.555
repre+0.517
 differently+0.509
Neg logit contributions
aband-0.533
ipel-0.485
 trap-0.455
opsis-0.453
hemer-0.426
Token drove
Token ID10357
Feature activation-2.93e+00

Pos logit contributions
aband+0.865
ipel+0.787
 trap+0.755
opsis+0.740
hemer+0.698
Neg logit contributions
Sov-0.874
 plur-0.845
edin-0.834
repre-0.777
 differently-0.757
Token by
Token ID416
Feature activation-3.99e-01

Pos logit contributions
aband+2.428
ipel+2.195
 trap+2.190
opsis+2.074
 traps+1.986
Neg logit contributions
Sov-2.206
 plur-2.088
edin-2.083
repre-1.939
 differently-1.794
Token,
Token ID11
Feature activation-1.03e-01

Pos logit contributions
aband+0.323
ipel+0.296
 trap+0.283
opsis+0.278
hemer+0.263
Neg logit contributions
Sov-0.304
 plur-0.294
edin-0.291
repre-0.269
 differently-0.262
Token and
Token ID290
Feature activation-1.89e-01

Pos logit contributions
aband+0.083
ipel+0.076
 trap+0.072
opsis+0.071
hemer+0.067
Neg logit contributions
Sov-0.080
 plur-0.077
edin-0.077
repre-0.071
 differently-0.069
Token a
Token ID257
Feature activation-4.17e-01

Pos logit contributions
aband+0.152
ipel+0.140
 trap+0.133
opsis+0.131
hemer+0.124
Neg logit contributions
Sov-0.145
 plur-0.140
edin-0.139
repre-0.129
 differently-0.126
Token strange
Token ID6283
Feature activation-4.80e-01

Pos logit contributions
aband+0.305
ipel+0.277
 trap+0.262
opsis+0.258
hemer+0.242
Neg logit contributions
Sov-0.351
 plur-0.341
edin-0.337
repre-0.315
 differently-0.308
TokenM
Token ID44
Feature activation-8.65e-02

Pos logit contributions
aband+0.437
ipel+0.392
 trap+0.365
opsis+0.360
hemer+0.336
Neg logit contributions
Sov-0.619
 plur-0.603
edin-0.598
repre-0.563
 differently-0.552
TokenDB
Token ID11012
Feature activation-1.28e-01

Pos logit contributions
aband+0.036
ipel+0.031
 trap+0.027
opsis+0.027
��+0.023
Neg logit contributions
Sov-0.100
edin-0.097
 plur-0.097
repre-0.092
 differently-0.091
Token's
Token ID338
Feature activation+1.29e-01

Pos logit contributions
aband+0.110
ipel+0.101
 trap+0.097
opsis+0.096
hemer+0.091
Neg logit contributions
Sov-0.091
 plur-0.088
edin-0.087
repre-0.081
 differently-0.078
Token response
Token ID2882
Feature activation-1.73e-01

Pos logit contributions
Sov+0.100
 plur+0.097
edin+0.096
repre+0.089
 differently+0.087
Neg logit contributions
aband-0.103
ipel-0.095
 trap-0.089
opsis-0.089
hemer-0.084
Token so
Token ID523
Feature activation-7.07e-01

Pos logit contributions
aband+0.155
ipel+0.144
 trap+0.138
opsis+0.136
hemer+0.129
Neg logit contributions
Sov-0.116
 plur-0.112
edin-0.110
repre-0.101
 differently-0.098
Token far
Token ID1290
Feature activation-2.93e+00

Pos logit contributions
aband+0.582
ipel+0.535
 trap+0.510
opsis+0.504
hemer+0.478
Neg logit contributions
Sov-0.522
 plur-0.510
edin-0.503
repre-0.466
 differently-0.450
Token was
Token ID373
Feature activation+2.13e-01

Pos logit contributions
aband+2.530
ipel+2.321
 trap+2.262
opsis+2.225
hemer+2.100
Neg logit contributions
Sov-2.050
 plur-1.951
edin-1.927
repre-1.786
 Modified-1.713
Token that
Token ID326
Feature activation-1.21e+00

Pos logit contributions
Sov+0.159
 plur+0.154
edin+0.152
repre+0.141
 differently+0.138
Neg logit contributions
aband-0.175
ipel-0.161
 trap-0.153
opsis-0.151
hemer-0.143
Token it
Token ID340
Feature activation-1.28e+00

Pos logit contributions
aband+1.071
ipel+0.989
 trap+0.948
opsis+0.931
hemer+0.889
Neg logit contributions
Sov-0.827
 plur-0.791
edin-0.784
repre-0.723
 differently-0.695
Token needed
Token ID2622
Feature activation-8.80e-01

Pos logit contributions
aband+1.098
ipel+1.004
 trap+0.971
opsis+0.952
hemer+0.899
Neg logit contributions
Sov-0.922
 plur-0.880
edin-0.876
repre-0.813
 differently-0.776
Token the
Token ID262
Feature activation-9.40e-02

Pos logit contributions
aband+0.779
ipel+0.720
 trap+0.690
opsis+0.679
hemer+0.647
Neg logit contributions
Sov-0.602
 plur-0.579
edin-0.571
repre-0.528
 differently-0.509
Token radio
Token ID5243
Feature activation-1.36e-01

Pos logit contributions
aband+0.699
ipel+0.630
 trap+0.595
opsis+0.582
hemer+0.540
Neg logit contributions
Sov-0.968
 plur-0.937
edin-0.933
repre-0.874
 differently-0.855
Token debate
Token ID4384
Feature activation+6.67e-01

Pos logit contributions
aband+0.106
ipel+0.097
 trap+0.091
opsis+0.091
hemer+0.085
Neg logit contributions
Sov-0.107
 plur-0.104
edin-0.103
repre-0.096
 differently-0.094
Token hosted
Token ID12007
Feature activation-2.85e-01

Pos logit contributions
Sov+0.462
 plur+0.448
edin+0.441
repre+0.405
 differently+0.398
Neg logit contributions
aband-0.586
ipel-0.540
 trap-0.513
opsis-0.510
hemer-0.485
Token by
Token ID416
Feature activation-5.12e-01

Pos logit contributions
aband+0.249
ipel+0.231
 trap+0.221
opsis+0.218
hemer+0.207
Neg logit contributions
Sov-0.197
 plur-0.190
edin-0.188
repre-0.173
 differently-0.167
Token La
Token ID4689
Feature activation-6.64e-01

Pos logit contributions
aband+0.375
ipel+0.341
 trap+0.320
opsis+0.319
hemer+0.299
Neg logit contributions
Sov-0.428
 plur-0.419
edin-0.412
repre-0.385
 differently-0.380
Token Opinion
Token ID29525
Feature activation-2.93e+00

Pos logit contributions
aband+0.437
ipel+0.393
 trap+0.368
opsis+0.363
hemer+0.338
Neg logit contributions
Sov-0.605
 plur-0.588
edin-0.582
repre-0.547
 differently-0.538
Token and
Token ID290
Feature activation-3.15e-01

Pos logit contributions
aband+2.337
ipel+2.147
 trap+2.064
opsis+2.022
hemer+1.905
Neg logit contributions
Sov-2.237
 plur-2.158
edin-2.152
repre-1.993
 differently-1.918
Token public
Token ID1171
Feature activation-2.86e-01

Pos logit contributions
aband+0.239
ipel+0.218
 trap+0.204
opsis+0.204
hemer+0.192
Neg logit contributions
Sov-0.256
 plur-0.250
edin-0.246
repre-0.230
 differently-0.227
Token radio
Token ID5243
Feature activation-2.52e-01

Pos logit contributions
aband+0.177
ipel+0.158
 trap+0.147
opsis+0.145
hemer+0.134
Neg logit contributions
Sov-0.272
 plur-0.265
edin-0.263
repre-0.247
 differently-0.243
Token station
Token ID4429
Feature activation+3.71e-01

Pos logit contributions
aband+0.198
ipel+0.181
 trap+0.171
opsis+0.169
hemer+0.160
Neg logit contributions
Sov-0.199
 plur-0.193
edin-0.191
repre-0.177
 differently-0.174
Token KP
Token ID45814
Feature activation-1.83e+00

Pos logit contributions
Sov+0.312
 plur+0.305
edin+0.300
repre+0.280
 differently+0.277
Neg logit contributions
aband-0.271
ipel-0.246
 trap-0.230
opsis-0.229
hemer-0.215
Token as
Token ID355
Feature activation+1.01e+00

Pos logit contributions
aband+0.820
ipel+0.761
 trap+0.727
opsis+0.721
hemer+0.687
Neg logit contributions
Sov-0.558
 plur-0.537
edin-0.530
repre-0.483
 differently-0.470
Token Bac
Token ID35583
Feature activation+2.99e-01

Pos logit contributions
Sov+0.823
 plur+0.803
edin+0.794
repre+0.743
 differently+0.728
Neg logit contributions
aband-0.753
ipel-0.685
 trap-0.645
opsis-0.640
hemer-0.602
Tokenile
Token ID576
Feature activation-3.60e-03

Pos logit contributions
Sov+0.227
 plur+0.221
edin+0.219
repre+0.203
 differently+0.198
Neg logit contributions
aband-0.241
ipel-0.220
 trap-0.210
opsis-0.207
��-0.196
Token's
Token ID338
Feature activation-1.15e-01

Pos logit contributions
aband+0.003
ipel+0.003
 trap+0.003
opsis+0.002
hemer+0.002
Neg logit contributions
Sov-0.003
 plur-0.003
edin-0.003
repre-0.002
 differently-0.002
Token Inn
Token ID10849
Feature activation+3.03e-01

Pos logit contributions
aband+0.093
ipel+0.085
 trap+0.081
opsis+0.080
hemer+0.076
Neg logit contributions
Sov-0.087
 plur-0.085
edin-0.084
repre-0.077
 differently-0.076
Tokenoc
Token ID420
Feature activation-2.92e+00

Pos logit contributions
Sov+0.209
 plur+0.202
edin+0.200
repre+0.184
 differently+0.179
Neg logit contributions
aband-0.266
ipel-0.246
 trap-0.234
opsis-0.233
hemer-0.221
Tokenence
Token ID594
Feature activation-8.04e-01

Pos logit contributions
aband+2.307
ipel+2.083
 trap+1.979
opsis+1.962
hemer+1.853
Neg logit contributions
Sov-2.297
 plur-2.213
edin-2.200
repre-2.020
 differently-1.994
Token of
Token ID286
Feature activation-5.57e-01

Pos logit contributions
aband+0.664
ipel+0.613
 trap+0.587
opsis+0.576
hemer+0.544
Neg logit contributions
Sov-0.597
 plur-0.574
edin-0.572
repre-0.529
 differently-0.512
Token Muslims
Token ID7045
Feature activation-2.39e-01

Pos logit contributions
aband+0.490
ipel+0.455
 trap+0.433
opsis+0.430
hemer+0.408
Neg logit contributions
Sov-0.383
 plur-0.372
edin-0.364
repre-0.335
 differently-0.327
Token are
Token ID389
Feature activation-4.81e-01

Pos logit contributions
aband+0.192
ipel+0.176
 trap+0.168
opsis+0.166
hemer+0.156
Neg logit contributions
Sov-0.184
 plur-0.178
edin-0.176
repre-0.163
 differently-0.159
Token taken
Token ID2077
Feature activation+2.87e-01

Pos logit contributions
aband+0.401
ipel+0.368
 trap+0.352
opsis+0.347
hemer+0.328
Neg logit contributions
Sov-0.355
 plur-0.343
edin-0.340
repre-0.313
 differently-0.305
Token situation
Token ID3074
Feature activation+8.84e-03

Pos logit contributions
aband+0.259
ipel+0.234
 trap+0.220
opsis+0.217
hemer+0.203
Neg logit contributions
Sov-0.320
 plur-0.311
edin-0.308
repre-0.288
 differently-0.283
Token.
Token ID13
Feature activation+5.13e-01

Pos logit contributions
Sov+0.006
 plur+0.006
edin+0.006
repre+0.006
 differently+0.005
Neg logit contributions
aband-0.008
ipel-0.007
 trap-0.007
opsis-0.007
hemer-0.006
Token Somebody
Token ID43141
Feature activation-1.05e-01

Pos logit contributions
Sov+0.405
 plur+0.392
edin+0.389
repre+0.360
 differently+0.355
Neg logit contributions
aband-0.401
ipel-0.367
 trap-0.347
opsis-0.344
hemer-0.324
Token always
Token ID1464
Feature activation-1.15e-01

Pos logit contributions
aband+0.089
ipel+0.082
 trap+0.078
opsis+0.077
hemer+0.073
Neg logit contributions
Sov-0.077
 plur-0.074
edin-0.073
repre-0.068
 differently-0.066
Token pops
Token ID26384
Feature activation+1.52e-01

Pos logit contributions
aband+0.092
ipel+0.084
 trap+0.079
opsis+0.079
hemer+0.074
Neg logit contributions
Sov-0.090
 plur-0.087
edin-0.086
repre-0.080
 differently-0.078
Token off
Token ID572
Feature activation-2.92e+00

Pos logit contributions
Sov+0.112
 plur+0.108
edin+0.107
repre+0.099
 differently+0.096
Neg logit contributions
aband-0.127
ipel-0.116
 trap-0.111
opsis-0.110
hemer-0.104
Token a
Token ID257
Feature activation+3.79e-01

Pos logit contributions
aband+2.413
ipel+2.193
 trap+2.145
opsis+2.071
 sleep+1.980
Neg logit contributions
Sov-2.172
 plur-2.068
edin-2.052
repre-1.910
 Modified-1.807
Token baby
Token ID5156
Feature activation-2.28e-01

Pos logit contributions
Sov+0.334
 plur+0.326
edin+0.322
repre+0.302
 differently+0.297
Neg logit contributions
aband-0.261
ipel-0.236
 trap-0.220
opsis-0.219
hemer-0.204
Token in
Token ID287
Feature activation-8.81e-02

Pos logit contributions
aband+0.190
ipel+0.175
 trap+0.167
opsis+0.164
hemer+0.156
Neg logit contributions
Sov-0.168
 plur-0.163
edin-0.161
repre-0.149
 differently-0.145
Token the
Token ID262
Feature activation-5.73e-02

Pos logit contributions
aband+0.070
ipel+0.063
 trap+0.060
opsis+0.060
hemer+0.056
Neg logit contributions
Sov-0.069
 plur-0.067
edin-0.066
repre-0.061
 differently-0.060
Token middle
Token ID3504
Feature activation-5.46e-02

Pos logit contributions
aband+0.049
ipel+0.046
 trap+0.043
opsis+0.043
hemer+0.041
Neg logit contributions
Sov-0.041
 plur-0.039
edin-0.039
repre-0.036
 differently-0.035
Token Washington
Token ID2669
Feature activation+7.80e-02

Pos logit contributions
aband+0.027
ipel+0.025
 trap+0.023
opsis+0.023
hemer+0.021
Neg logit contributions
Sov-0.030
 plur-0.030
edin-0.029
repre-0.027
 differently-0.027
Token Examiner
Token ID35855
Feature activation-2.82e-01

Pos logit contributions
Sov+0.046
 plur+0.044
edin+0.043
repre+0.039
 differently+0.038
Neg logit contributions
aband-0.077
ipel-0.071
 trap-0.068
opsis-0.068
hemer-0.065
Token,
Token ID11
Feature activation-4.99e-01

Pos logit contributions
aband+0.215
ipel+0.195
 trap+0.184
opsis+0.182
hemer+0.171
Neg logit contributions
Sov-0.229
 plur-0.222
edin-0.220
repre-0.205
 differently-0.202
Token Donald
Token ID3759
Feature activation-2.75e-01

Pos logit contributions
aband+0.369
ipel+0.335
 trap+0.315
opsis+0.313
hemer+0.293
Neg logit contributions
Sov-0.415
 plur-0.405
edin-0.400
repre-0.373
 differently-0.366
Token Trump
Token ID1301
Feature activation-2.55e-01

Pos logit contributions
aband+0.186
ipel+0.168
 trap+0.159
opsis+0.157
hemer+0.146
Neg logit contributions
Sov-0.244
 plur-0.237
edin-0.235
repre-0.221
 differently-0.217
Token Donald
Token ID3759
Feature activation-2.91e+00

Pos logit contributions
aband+0.206
ipel+0.189
 trap+0.180
opsis+0.178
hemer+0.168
Neg logit contributions
Sov-0.194
 plur-0.188
edin-0.185
repre-0.172
 differently-0.168
Token John
Token ID1757
Feature activation-1.18e-01

Pos logit contributions
aband+1.728
ipel+1.535
 trap+1.442
opsis+1.419
hemer+1.304
Neg logit contributions
Sov-2.835
 plur-2.760
edin-2.736
repre-2.583
 differently-2.534
Token Trump
Token ID1301
Feature activation-9.71e-01

Pos logit contributions
aband+0.088
ipel+0.081
opsis+0.074
 trap+0.073
hemer+0.070
Neg logit contributions
Sov-0.097
 plur-0.095
edin-0.094
 differently-0.087
repre-0.087
TokenREAD
Token ID15675
Feature activation-9.92e-01

Pos logit contributions
aband+0.792
ipel+0.725
 trap+0.691
opsis+0.684
hemer+0.648
Neg logit contributions
Sov-0.728
 plur-0.708
edin-0.700
repre-0.647
 differently-0.634
Token:
Token ID25
Feature activation-1.46e+00

Pos logit contributions
aband+0.660
ipel+0.592
 trap+0.551
opsis+0.546
hemer+0.508
Neg logit contributions
Sov-0.901
 plur-0.880
edin-0.869
repre-0.813
 differently-0.806
Token Cohen
Token ID17154
Feature activation+1.96e-01

Pos logit contributions
aband+1.023
ipel+0.933
 trap+0.880
opsis+0.867
uggets+0.809
Neg logit contributions
Sov-1.256
 plur-1.227
edin-1.207
repre-1.137
 differently-1.100
Token on
Token ID319
Feature activation+1.03e-01

Pos logit contributions
Sov+0.573
 plur+0.554
edin+0.547
repre+0.503
 differently+0.491
Neg logit contributions
aband-0.721
ipel-0.666
 trap-0.634
opsis-0.629
hemer-0.597
Token banks
Token ID6341
Feature activation-9.95e-01

Pos logit contributions
Sov+0.078
 plur+0.075
edin+0.074
repre+0.069
 differently+0.067
Neg logit contributions
aband-0.084
ipel-0.077
 trap-0.073
opsis-0.072
hemer-0.068
Token.
Token ID13
Feature activation-9.23e-02

Pos logit contributions
aband+0.829
ipel+0.765
 trap+0.730
opsis+0.719
hemer+0.681
Neg logit contributions
Sov-0.732
 plur-0.705
edin-0.694
repre-0.643
 differently-0.623
Token<|endoftext|>
Token ID50256
Feature activation-8.39e-02

Pos logit contributions
aband+0.072
ipel+0.066
 trap+0.063
opsis+0.062
hemer+0.058
Neg logit contributions
Sov-0.072
 plur-0.070
edin-0.069
repre-0.065
 differently-0.063
TokenIll
Token ID21478
Feature activation+1.02e+00

Pos logit contributions
aband+0.066
ipel+0.061
 trap+0.058
opsis+0.057
hemer+0.054
Neg logit contributions
Sov-0.065
 plur-0.063
edin-0.063
repre-0.058
 differently-0.057
Tokenust
Token ID436
Feature activation-2.91e+00

Pos logit contributions
Sov+0.864
 plur+0.844
edin+0.833
repre+0.780
 differently+0.765
Neg logit contributions
aband-0.733
ipel-0.662
 trap-0.626
opsis-0.618
hemer-0.579
Tokenrious
Token ID32527
Feature activation+2.52e-01

Pos logit contributions
aband+1.855
ipel+1.641
 trap+1.549
opsis+1.537
hemer+1.424
Neg logit contributions
Sov-2.705
 plur-2.626
edin-2.605
repre-2.433
 differently-2.413
Token Illum
Token ID39256
Feature activation-7.68e-01

Pos logit contributions
Sov+0.201
 plur+0.196
edin+0.193
repre+0.180
 differently+0.177
Neg logit contributions
aband-0.194
ipel-0.177
 trap-0.167
opsis-0.166
hemer-0.156
Tokenination
Token ID1883
Feature activation-9.28e-01

Pos logit contributions
aband+0.583
ipel+0.532
 trap+0.502
opsis+0.498
hemer+0.468
Neg logit contributions
Sov-0.624
 plur-0.606
edin-0.599
repre-0.558
 differently-0.548
Token:
Token ID25
Feature activation+7.29e-02

Pos logit contributions
aband+0.751
ipel+0.690
 trap+0.656
opsis+0.649
hemer+0.612
Neg logit contributions
Sov-0.705
 plur-0.683
edin-0.676
repre-0.626
 differently-0.612
Token Incredible
Token ID33832
Feature activation-3.73e-01

Pos logit contributions
Sov+0.061
 plur+0.060
edin+0.059
repre+0.055
 differently+0.054
Neg logit contributions
aband-0.053
ipel-0.048
 trap-0.045
opsis-0.045
hemer-0.042
Token 26
Token ID2608
Feature activation-5.12e-01

Pos logit contributions
aband+0.298
ipel+0.272
 trap+0.255
opsis+0.254
hemer+0.238
Neg logit contributions
Sov-0.320
 plur-0.312
edin-0.308
repre-0.287
 differently-0.283
Token citations
Token ID33499
Feature activation-1.23e+00

Pos logit contributions
aband+0.376
ipel+0.341
 trap+0.319
opsis+0.319
hemer+0.298
Neg logit contributions
Sov-0.427
 plur-0.418
edin-0.412
repre-0.384
 differently-0.381
Token to
Token ID284
Feature activation+1.44e-01

Pos logit contributions
aband+1.108
ipel+1.026
 trap+0.988
opsis+0.974
hemer+0.926
Neg logit contributions
Sov-0.824
 plur-0.788
edin-0.778
repre-0.711
 differently-0.687
Token drivers
Token ID6643
Feature activation-9.09e-01

Pos logit contributions
Sov+0.118
 plur+0.115
edin+0.113
repre+0.105
 differently+0.104
Neg logit contributions
aband-0.109
ipel-0.099
 trap-0.093
opsis-0.093
hemer-0.087
Token so
Token ID523
Feature activation-9.13e-01

Pos logit contributions
aband+0.724
ipel+0.660
 trap+0.633
opsis+0.623
hemer+0.587
Neg logit contributions
Sov-0.706
 plur-0.681
edin-0.675
repre-0.626
 differently-0.607
Token far
Token ID1290
Feature activation-2.87e+00

Pos logit contributions
aband+0.758
ipel+0.695
 trap+0.666
opsis+0.654
hemer+0.622
Neg logit contributions
Sov-0.674
 plur-0.655
edin-0.647
repre-0.599
 differently-0.580
Token for
Token ID329
Feature activation+3.50e-01

Pos logit contributions
aband+2.657
ipel+2.462
 trap+2.385
opsis+2.365
hemer+2.232
Neg logit contributions
Sov-1.862
 plur-1.766
edin-1.738
repre-1.585
 differently-1.508
Token violating
Token ID18134
Feature activation+4.58e-01

Pos logit contributions
Sov+0.288
 plur+0.282
edin+0.277
repre+0.259
 differently+0.257
Neg logit contributions
aband-0.259
ipel-0.237
 trap-0.221
opsis-0.221
hemer-0.207
Token the
Token ID262
Feature activation-4.11e-01

Pos logit contributions
Sov+0.373
 plur+0.367
edin+0.360
repre+0.333
 differently+0.332
Neg logit contributions
aband-0.343
ipel-0.317
opsis-0.293
 trap-0.293
hemer-0.276
Token transit
Token ID11168
Feature activation-9.14e-01

Pos logit contributions
aband+0.299
ipel+0.272
 trap+0.254
opsis+0.253
hemer+0.237
Neg logit contributions
Sov-0.346
 plur-0.338
edin-0.334
repre-0.311
 differently-0.307
Token lanes
Token ID15296
Feature activation-1.19e+00

Pos logit contributions
aband+0.657
ipel+0.596
 trap+0.559
opsis+0.555
hemer+0.519
Neg logit contributions
Sov-0.778
 plur-0.758
edin-0.749
repre-0.700
 differently-0.688
Token to
Token ID284
Feature activation-1.99e-01

Pos logit contributions
aband+0.644
ipel+0.596
 trap+0.576
opsis+0.565
hemer+0.540
Neg logit contributions
Sov-0.477
 plur-0.457
edin-0.451
repre-0.418
 differently-0.400
Token just
Token ID655
Feature activation+1.80e-01

Pos logit contributions
aband+0.153
ipel+0.139
 trap+0.134
opsis+0.131
hemer+0.123
Neg logit contributions
Sov-0.160
 plur-0.154
edin-0.153
repre-0.143
 differently-0.139
Token lean
Token ID10904
Feature activation+2.14e-01

Pos logit contributions
Sov+0.141
 plur+0.137
edin+0.135
repre+0.126
 differently+0.123
Neg logit contributions
aband-0.142
ipel-0.130
 trap-0.123
opsis-0.122
hemer-0.115
Token back
Token ID736
Feature activation+3.13e-01

Pos logit contributions
Sov+0.146
 plur+0.140
edin+0.139
repre+0.128
 differently+0.122
Neg logit contributions
aband-0.191
ipel-0.177
 trap-0.169
opsis-0.166
hemer-0.158
Token and
Token ID290
Feature activation+9.69e-01

Pos logit contributions
Sov+0.228
 plur+0.220
edin+0.218
repre+0.201
 differently+0.193
Neg logit contributions
aband-0.264
ipel-0.244
 trap-0.232
opsis-0.229
hemer-0.217
Token cheer
Token ID14042
Feature activation+1.80e+00

Pos logit contributions
Sov+0.741
 plur+0.721
edin+0.711
repre+0.658
 differently+0.647
Neg logit contributions
aband-0.779
ipel-0.714
 trap-0.673
opsis-0.669
hemer-0.633
Token for
Token ID329
Feature activation-8.27e-01

Pos logit contributions
Sov+1.251
 plur+1.211
edin+1.195
repre+1.096
 differently+1.077
Neg logit contributions
aband-1.573
ipel-1.453
 trap-1.383
opsis-1.374
hemer-1.302
Token the
Token ID262
Feature activation+5.82e-01

Pos logit contributions
aband+0.665
ipel+0.607
 trap+0.580
opsis+0.565
hemer+0.536
Neg logit contributions
Sov-0.637
 plur-0.613
edin-0.611
repre-0.571
 differently-0.545
Token apocalyptic
Token ID41792
Feature activation+3.60e-01

Pos logit contributions
Sov+0.434
 plur+0.419
edin+0.416
repre+0.385
 differently+0.375
Neg logit contributions
aband-0.481
ipel-0.441
 trap-0.420
opsis-0.414
hemer-0.393
Token disasters
Token ID24193
Feature activation-6.81e-01

Pos logit contributions
Sov+0.280
 plur+0.271
edin+0.269
repre+0.250
 differently+0.243
Neg logit contributions
aband-0.284
ipel-0.260
 trap-0.248
opsis-0.244
hemer-0.230
Token of
Token ID286
Feature activation-1.07e-01

Pos logit contributions
aband+0.584
ipel+0.545
 trap+0.521
opsis+0.509
hemer+0.484
Neg logit contributions
Sov-0.482
 plur-0.464
edin-0.460
repre-0.425
 differently-0.404
Token not
Token ID407
Feature activation+8.43e-02

Pos logit contributions
Sov+0.292
 plur+0.284
edin+0.280
repre+0.259
 differently+0.254
Neg logit contributions
aband-0.326
ipel-0.301
 trap-0.284
opsis-0.283
hemer-0.267
Token in
Token ID287
Feature activation-2.86e-01

Pos logit contributions
Sov+0.062
 plur+0.060
edin+0.059
repre+0.055
 differently+0.053
Neg logit contributions
aband-0.071
ipel-0.065
 trap-0.062
opsis-0.061
hemer-0.058
Token the
Token ID262
Feature activation-9.00e-02

Pos logit contributions
aband+0.214
ipel+0.194
 trap+0.186
opsis+0.182
hemer+0.171
Neg logit contributions
Sov-0.236
 plur-0.228
edin-0.226
repre-0.211
 differently-0.206
Token radius
Token ID16874
Feature activation+9.30e-01

Pos logit contributions
aband+0.071
ipel+0.065
 trap+0.061
opsis+0.061
hemer+0.058
Neg logit contributions
Sov-0.070
 plur-0.068
edin-0.068
repre-0.063
 differently-0.062
Token of
Token ID286
Feature activation-8.78e-02

Pos logit contributions
Sov+0.717
 plur+0.697
edin+0.688
repre+0.638
 differently+0.627
Neg logit contributions
aband-0.741
ipel-0.679
 trap-0.642
opsis-0.637
hemer-0.602
Token ignore
Token ID8856
Feature activation+1.90e+00

Pos logit contributions
aband+0.069
ipel+0.064
 trap+0.060
opsis+0.060
hemer+0.056
Neg logit contributions
Sov-0.068
 plur-0.067
edin-0.066
repre-0.061
 differently-0.060
Token effects
Token ID3048
Feature activation+7.82e-02

Pos logit contributions
Sov+1.463
 plur+1.439
edin+1.416
repre+1.312
 differently+1.296
Neg logit contributions
aband-1.505
ipel-1.387
opsis-1.294
 trap-1.288
hemer-1.230
Token,
Token ID11
Feature activation+2.82e-01

Pos logit contributions
Sov+0.060
 plur+0.058
edin+0.058
repre+0.054
 differently+0.052
Neg logit contributions
aband-0.062
ipel-0.057
 trap-0.054
opsis-0.054
hemer-0.051
Token or
Token ID393
Feature activation+2.77e-01

Pos logit contributions
Sov+0.190
 plur+0.183
edin+0.180
repre+0.166
 differently+0.160
Neg logit contributions
aband-0.253
ipel-0.234
 trap-0.225
opsis-0.222
hemer-0.210
Token they
Token ID484
Feature activation+2.57e-01

Pos logit contributions
Sov+0.200
 plur+0.194
edin+0.192
repre+0.177
 differently+0.173
Neg logit contributions
aband-0.235
ipel-0.216
 trap-0.206
opsis-0.204
hemer-0.193
Token have
Token ID423
Feature activation+6.25e-01

Pos logit contributions
Sov+0.187
 plur+0.181
edin+0.179
repre+0.165
 differently+0.161
Neg logit contributions
aband-0.216
ipel-0.199
 trap-0.189
opsis-0.188
hemer-0.178
Token (
Token ID357
Feature activation+3.22e-01

Pos logit contributions
Sov+0.086
 plur+0.083
edin+0.082
repre+0.076
 differently+0.075
Neg logit contributions
aband-0.094
ipel-0.086
 trap-0.082
opsis-0.081
hemer-0.077
Tokendon
Token ID9099
Feature activation-5.59e-01

Pos logit contributions
Sov+0.254
 plur+0.247
edin+0.244
repre+0.227
 differently+0.223
Neg logit contributions
aband-0.250
ipel-0.229
 trap-0.217
opsis-0.214
hemer-0.202
Token
Token ID447
Feature activation-2.05e-01

Pos logit contributions
aband+0.404
ipel+0.365
 trap+0.345
opsis+0.341
hemer+0.319
Neg logit contributions
Sov-0.475
 plur-0.460
edin-0.455
repre-0.426
 differently-0.417
Token
Token ID247
Feature activation+9.66e-01

Pos logit contributions
aband+0.083
ipel+0.069
 trap+0.061
opsis+0.060
hemer+0.052
Neg logit contributions
Sov-0.240
 plur-0.235
edin-0.234
repre-0.222
 differently-0.220
Tokent
Token ID83
Feature activation-5.51e-01

Pos logit contributions
Sov+0.659
 plur+0.637
edin+0.628
repre+0.577
 differently+0.563
Neg logit contributions
aband-0.857
ipel-0.792
 trap-0.756
opsis-0.749
hemer-0.712
Token forget
Token ID6044
Feature activation+2.07e+00

Pos logit contributions
aband+0.423
ipel+0.387
 trap+0.372
opsis+0.362
hemer+0.342
Neg logit contributions
Sov-0.445
 plur-0.426
edin-0.423
repre-0.394
 differently-0.383
Token to
Token ID284
Feature activation+6.89e-02

Pos logit contributions
Sov+1.388
 plur+1.353
edin+1.328
repre+1.219
 differently+1.197
Neg logit contributions
aband-1.858
ipel-1.706
 trap-1.620
opsis-1.611
hemer-1.535
Token also
Token ID635
Feature activation+1.52e-02

Pos logit contributions
Sov+0.055
 plur+0.053
edin+0.053
repre+0.049
 differently+0.048
Neg logit contributions
aband-0.053
ipel-0.049
 trap-0.046
opsis-0.045
hemer-0.043
Token add
Token ID751
Feature activation-1.05e-01

Pos logit contributions
Sov+0.012
 plur+0.012
edin+0.012
repre+0.011
 differently+0.011
Neg logit contributions
aband-0.011
ipel-0.010
 trap-0.010
opsis-0.010
hemer-0.009
Token their
Token ID511
Feature activation+6.00e-01

Pos logit contributions
aband+0.088
ipel+0.081
 trap+0.078
opsis+0.076
hemer+0.072
Neg logit contributions
Sov-0.078
 plur-0.074
edin-0.074
repre-0.068
 differently-0.066
Token respective
Token ID11756
Feature activation+6.15e-01

Pos logit contributions
Sov+0.454
 plur+0.440
edin+0.435
repre+0.402
 differently+0.394
Neg logit contributions
aband-0.488
ipel-0.448
 trap-0.425
opsis-0.421
hemer-0.397
Tokenа
Token ID16142
Feature activation+1.74e-02

Pos logit contributions
Sov+0.188
 plur+0.179
edin+0.176
repre+0.155
 differently+0.149
Neg logit contributions
aband-0.423
ipel-0.397
 trap-0.382
opsis-0.380
hemer-0.365
Tokenк
Token ID31583
Feature activation-1.82e-01

Pos logit contributions
Sov+0.009
 plur+0.009
edin+0.009
repre+0.008
 differently+0.008
Neg logit contributions
aband-0.018
ipel-0.017
 trap-0.016
opsis-0.016
hemer-0.015
Token
Token ID140
Feature activation-1.18e-01

Pos logit contributions
aband+0.181
ipel+0.169
 trap+0.162
opsis+0.161
hemer+0.154
Neg logit contributions
Sov-0.104
 plur-0.100
edin-0.098
repre-0.089
 differently-0.086
Token
Token ID114
Feature activation+5.91e-01

Pos logit contributions
aband+0.064
ipel+0.056
 trap+0.052
opsis+0.051
hemer+0.046
Neg logit contributions
Sov-0.121
 plur-0.118
edin-0.117
repre-0.111
 differently-0.109
Tokenе
Token ID16843
Feature activation-2.63e-01

Pos logit contributions
Sov+0.276
 plur+0.263
edin+0.257
repre+0.226
 differently+0.217
Neg logit contributions
aband-0.652
ipel-0.613
 trap-0.590
opsis-0.586
hemer-0.563
Token
Token ID220
Feature activation+1.96e+00

Pos logit contributions
aband+0.263
ipel+0.246
 trap+0.236
opsis+0.234
hemer+0.224
Neg logit contributions
Sov-0.148
 plur-0.143
edin-0.140
repre-0.127
 differently-0.122
Tokenя
Token ID40623
Feature activation+7.13e-01

Pos logit contributions
Sov+0.999
 plur+0.948
edin+0.932
repre+0.820
 differently+0.797
Neg logit contributions
aband-2.078
ipel-1.942
 trap-1.872
opsis-1.857
hemer-1.774
Tokenв
Token ID38857
Feature activation+2.06e-01

Pos logit contributions
Sov+0.356
 plur+0.339
edin+0.333
repre+0.295
 differently+0.285
Neg logit contributions
aband-0.764
ipel-0.716
 trap-0.689
opsis-0.684
hemer-0.656
Tokenл
Token ID30143
Feature activation-9.10e-02

Pos logit contributions
Sov+0.133
 plur+0.128
edin+0.126
repre+0.115
 differently+0.112
Neg logit contributions
aband-0.191
ipel-0.177
 trap-0.169
opsis-0.168
hemer-0.160
Tokenя
Token ID40623
Feature activation+9.09e-01

Pos logit contributions
aband+0.099
ipel+0.093
 trap+0.089
opsis+0.089
hemer+0.085
Neg logit contributions
Sov-0.044
 plur-0.042
edin-0.041
repre-0.036
 differently-0.035
Tokenе
Token ID16843
Feature activation-5.94e-02

Pos logit contributions
Sov+0.451
 plur+0.430
edin+0.422
repre+0.373
 differently+0.361
Neg logit contributions
aband-0.976
ipel-0.915
 trap-0.880
opsis-0.874
hemer-0.839
Token as
Token ID355
Feature activation+6.54e-01

Pos logit contributions
Sov+0.542
 plur+0.521
edin+0.515
repre+0.470
 differently+0.456
Neg logit contributions
aband-0.765
ipel-0.710
 trap-0.680
opsis-0.673
hemer-0.642
Token I
Token ID314
Feature activation-5.01e-01

Pos logit contributions
Sov+0.486
 plur+0.470
edin+0.464
repre+0.430
 differently+0.418
Neg logit contributions
aband-0.541
ipel-0.498
 trap-0.475
opsis-0.468
hemer-0.443
Token add
Token ID751
Feature activation-1.05e+00

Pos logit contributions
aband+0.397
ipel+0.365
 trap+0.349
opsis+0.342
hemer+0.323
Neg logit contributions
Sov-0.391
 plur-0.376
edin-0.373
repre-0.347
 differently-0.337
Token the
Token ID262
Feature activation+2.12e-01

Pos logit contributions
aband+0.881
ipel+0.817
 trap+0.783
opsis+0.768
hemer+0.726
Neg logit contributions
Sov-0.773
 plur-0.741
edin-0.737
repre-0.683
 differently-0.654
Token comparison
Token ID7208
Feature activation-5.15e-01

Pos logit contributions
Sov+0.162
 plur+0.157
edin+0.155
repre+0.144
 differently+0.141
Neg logit contributions
aband-0.171
ipel-0.157
 trap-0.149
opsis-0.147
hemer-0.139
Token code
Token ID2438
Feature activation+1.92e+00

Pos logit contributions
aband+0.443
ipel+0.410
 trap+0.393
opsis+0.388
hemer+0.367
Neg logit contributions
Sov-0.365
 plur-0.353
edin-0.348
repre-0.321
 differently-0.311
Token like
Token ID588
Feature activation+9.26e-01

Pos logit contributions
Sov+1.333
 plur+1.290
edin+1.273
repre+1.172
 differently+1.154
Neg logit contributions
aband-1.678
ipel-1.543
 trap-1.467
opsis-1.458
hemer-1.386
Token in
Token ID287
Feature activation+3.65e-01

Pos logit contributions
Sov+0.567
 plur+0.544
edin+0.538
repre+0.488
 differently+0.471
Neg logit contributions
aband-0.886
ipel-0.826
 trap-0.794
opsis-0.786
hemer-0.749
Token the
Token ID262
Feature activation+2.29e-01

Pos logit contributions
Sov+0.283
 plur+0.274
edin+0.271
repre+0.252
 differently+0.246
Neg logit contributions
aband-0.291
ipel-0.267
 trap-0.253
opsis-0.251
hemer-0.236
Token above
Token ID2029
Feature activation-6.07e-01

Pos logit contributions
Sov+0.156
 plur+0.151
edin+0.149
repre+0.137
 differently+0.133
Neg logit contributions
aband-0.203
ipel-0.188
 trap-0.179
opsis-0.177
hemer-0.169
Token code
Token ID2438
Feature activation+1.16e+00

Pos logit contributions
aband+0.498
ipel+0.458
 trap+0.441
opsis+0.434
hemer+0.408
Neg logit contributions
Sov-0.453
 plur-0.438
edin-0.434
repre-0.401
 differently-0.389
Token
Token ID198
Feature activation+8.47e-01

Pos logit contributions
aband+0.279
ipel+0.255
 trap+0.242
opsis+0.240
hemer+0.227
Neg logit contributions
Sov-0.264
 plur-0.256
edin-0.254
repre-0.235
 differently-0.230
Token
Token ID198
Feature activation+8.77e-02

Pos logit contributions
Sov+0.620
 plur+0.595
edin+0.590
repre+0.545
 differently+0.532
Neg logit contributions
aband-0.713
ipel-0.658
 trap-0.624
opsis-0.618
hemer-0.585
TokenDevelopment
Token ID41206
Feature activation+1.34e+00

Pos logit contributions
Sov+0.068
 plur+0.066
edin+0.065
repre+0.061
 differently+0.059
Neg logit contributions
aband-0.069
ipel-0.064
 trap-0.060
opsis-0.060
hemer-0.056
Token of
Token ID286
Feature activation+5.10e-01

Pos logit contributions
Sov+1.000
 plur+0.972
edin+0.961
repre+0.887
 differently+0.871
Neg logit contributions
aband-1.103
ipel-1.013
 trap-0.959
opsis-0.952
hemer-0.902
Token the
Token ID262
Feature activation-3.74e-01

Pos logit contributions
Sov+0.386
 plur+0.376
edin+0.370
repre+0.343
 differently+0.336
Neg logit contributions
aband-0.414
ipel-0.380
 trap-0.360
opsis-0.357
hemer-0.337
Token F
Token ID376
Feature activation+6.49e-01

Pos logit contributions
aband+0.299
ipel+0.274
 trap+0.261
opsis+0.257
hemer+0.243
Neg logit contributions
Sov-0.288
 plur-0.278
edin-0.276
repre-0.256
 differently-0.250
Token-
Token ID12
Feature activation+2.45e-01

Pos logit contributions
Sov+0.518
 plur+0.503
edin+0.497
repre+0.463
 differently+0.454
Neg logit contributions
aband-0.501
ipel-0.457
 trap-0.432
opsis-0.428
hemer-0.403
Token35
Token ID2327
Feature activation+8.28e-01

Pos logit contributions
Sov+0.181
 plur+0.175
edin+0.173
repre+0.160
 differently+0.156
Neg logit contributions
aband-0.205
ipel-0.188
 trap-0.179
opsis-0.177
hemer-0.168
Token fighter
Token ID10543
Feature activation+8.88e-01

Pos logit contributions
Sov+0.591
 plur+0.572
edin+0.564
repre+0.519
 differently+0.512
Neg logit contributions
aband-0.709
ipel-0.653
 trap-0.621
opsis-0.617
hemer-0.585
Token jet
Token ID12644
Feature activation+4.24e-01

Pos logit contributions
Sov+0.639
 plur+0.617
edin+0.610
repre+0.562
 differently+0.554
Neg logit contributions
aband-0.759
ipel-0.696
 trap-0.660
opsis-0.659
hemer-0.623
Token will
Token ID481
Feature activation+2.87e-01

Pos logit contributions
Sov+0.305
 plur+0.296
edin+0.292
repre+0.269
 differently+0.264
Neg logit contributions
aband-0.361
ipel-0.332
 trap-0.315
opsis-0.313
hemer-0.297
Token carry
Token ID3283
Feature activation-7.65e-01

Pos logit contributions
aband+0.568
ipel+0.514
 trap+0.487
opsis+0.477
hemer+0.446
Neg logit contributions
Sov-0.717
 plur-0.695
edin-0.689
repre-0.646
 differently-0.631
Token needs
Token ID2476
Feature activation+5.28e-01

Pos logit contributions
aband+0.669
ipel+0.617
 trap+0.592
opsis+0.583
hemer+0.553
Neg logit contributions
Sov-0.534
 plur-0.513
edin-0.509
repre-0.468
 differently-0.452
Token to
Token ID284
Feature activation+5.23e-01

Pos logit contributions
Sov+0.336
 plur+0.324
edin+0.319
repre+0.291
 differently+0.283
Neg logit contributions
aband-0.492
ipel-0.457
 trap-0.437
opsis-0.433
hemer-0.413
Token match
Token ID2872
Feature activation-5.09e-01

Pos logit contributions
Sov+0.394
 plur+0.382
edin+0.377
repre+0.349
 differently+0.341
Neg logit contributions
aband-0.428
ipel-0.393
 trap-0.373
opsis-0.369
hemer-0.349
Token total
Token ID2472
Feature activation-4.32e-01

Pos logit contributions
aband+0.402
ipel+0.368
 trap+0.350
opsis+0.346
hemer+0.326
Neg logit contributions
Sov-0.396
 plur-0.384
edin-0.381
repre-0.353
 differently-0.344
Token circulation
Token ID19133
Feature activation+3.15e-01

Pos logit contributions
aband+0.290
ipel+0.261
 trap+0.245
opsis+0.241
hemer+0.225
Neg logit contributions
Sov-0.389
 plur-0.379
edin-0.376
repre-0.352
 differently-0.345
Token in
Token ID287
Feature activation-6.87e-02

Pos logit contributions
Sov+0.219
 plur+0.212
edin+0.209
repre+0.192
 differently+0.187
Neg logit contributions
aband-0.276
ipel-0.255
 trap-0.243
opsis-0.240
hemer-0.228
Token the
Token ID262
Feature activation-1.82e-01

Pos logit contributions
aband+0.054
ipel+0.050
 trap+0.047
opsis+0.047
hemer+0.044
Neg logit contributions
Sov-0.054
 plur-0.052
edin-0.051
repre-0.048
 differently-0.046
Token grid
Token ID10706
Feature activation+1.44e-01

Pos logit contributions
aband+0.144
ipel+0.132
 trap+0.125
opsis+0.123
hemer+0.117
Neg logit contributions
Sov-0.141
 plur-0.136
edin-0.135
repre-0.126
 differently-0.122
Token.
Token ID13
Feature activation+6.14e-03

Pos logit contributions
Sov+0.105
 plur+0.102
edin+0.101
repre+0.093
 differently+0.090
Neg logit contributions
aband-0.120
ipel-0.111
 trap-0.106
opsis-0.104
hemer-0.099
Token Likewise
Token ID22660
Feature activation+4.86e-01

Pos logit contributions
Sov+0.005
 plur+0.005
edin+0.005
repre+0.005
 differently+0.005
Neg logit contributions
aband-0.004
ipel-0.004
 trap-0.004
opsis-0.004
hemer-0.003
Token Most
Token ID4042
Feature activation+2.20e-01

Pos logit contributions
Sov+0.390
 plur+0.378
edin+0.374
repre+0.346
 differently+0.338
Neg logit contributions
aband-0.430
ipel-0.395
 trap-0.375
opsis-0.372
hemer-0.351
Token importantly
Token ID11003
Feature activation-4.95e-01

Pos logit contributions
Sov+0.136
 plur+0.131
edin+0.129
repre+0.118
 differently+0.115
Neg logit contributions
aband-0.209
ipel-0.194
 trap-0.185
opsis-0.184
hemer-0.176
Token however
Token ID2158
Feature activation+1.28e+00

Pos logit contributions
aband+0.367
ipel+0.335
 trap+0.318
opsis+0.314
hemer+0.293
Neg logit contributions
Sov-0.410
 plur-0.397
edin-0.395
repre-0.367
 differently-0.358
Token,
Token ID11
Feature activation-8.48e-01

Pos logit contributions
Sov+1.030
 plur+1.004
edin+0.989
repre+0.923
 differently+0.907
Neg logit contributions
aband-0.977
ipel-0.888
 trap-0.837
opsis-0.829
hemer-0.782
Token we
Token ID356
Feature activation-2.01e-01

Pos logit contributions
aband+0.750
ipel+0.696
 trap+0.671
opsis+0.659
hemer+0.624
Neg logit contributions
Sov-0.580
 plur-0.553
edin-0.553
repre-0.507
 differently-0.486
Token can
Token ID460
Feature activation+1.14e-01

Pos logit contributions
aband+0.163
ipel+0.150
 trap+0.142
opsis+0.141
hemer+0.133
Neg logit contributions
Sov-0.152
 plur-0.148
edin-0.146
repre-0.135
 differently-0.132
Token restore
Token ID11169
Feature activation-4.19e-01

Pos logit contributions
Sov+0.089
 plur+0.086
edin+0.085
repre+0.079
 differently+0.077
Neg logit contributions
aband-0.090
ipel-0.082
 trap-0.078
opsis-0.077
hemer-0.073
Token safe
Token ID3338
Feature activation-1.24e-03

Pos logit contributions
aband+0.332
ipel+0.305
 trap+0.291
opsis+0.286
hemer+0.269
Neg logit contributions
Sov-0.326
edin-0.313
 plur-0.313
repre-0.290
 differently-0.281
Token and
Token ID290
Feature activation+3.47e-01

Pos logit contributions
aband+0.001
ipel+0.001
 trap+0.001
opsis+0.001
hemer+0.001
Neg logit contributions
Sov-0.001
 plur-0.001
edin-0.001
repre-0.001
 differently-0.001
Token healthy
Token ID5448
Feature activation+9.29e-01

Pos logit contributions
Sov+0.256
 plur+0.246
edin+0.245
repre+0.226
 differently+0.220
Neg logit contributions
aband-0.288
ipel-0.265
 trap-0.253
opsis-0.250
hemer-0.236
Token habitats
Token ID35308
Feature activation+1.01e+00

Pos logit contributions
Sov+0.723
 plur+0.706
edin+0.695
repre+0.645
 differently+0.635
Neg logit contributions
aband-0.734
ipel-0.672
 trap-0.633
opsis-0.628
hemer-0.594
Token curtain
Token ID29461
Feature activation-1.40e-02

Pos logit contributions
Sov+0.124
 plur+0.120
edin+0.119
repre+0.110
 differently+0.108
Neg logit contributions
aband-0.139
ipel-0.128
 trap-0.122
opsis-0.120
hemer-0.114
Token-
Token ID12
Feature activation+7.69e-02

Pos logit contributions
aband+0.011
ipel+0.010
 trap+0.010
opsis+0.009
hemer+0.009
Neg logit contributions
Sov-0.011
 plur-0.011
edin-0.011
repre-0.010
 differently-0.010
Tokenlike
Token ID2339
Feature activation+8.91e-01

Pos logit contributions
Sov+0.066
 plur+0.064
edin+0.064
repre+0.060
 differently+0.058
Neg logit contributions
aband-0.055
ipel-0.049
 trap-0.047
opsis-0.046
hemer-0.043
Token slit
Token ID40724
Feature activation-5.78e-01

Pos logit contributions
Sov+0.745
 plur+0.727
edin+0.720
repre+0.670
 differently+0.661
Neg logit contributions
aband-0.652
ipel-0.594
opsis-0.554
 trap-0.551
hemer-0.519
Token in
Token ID287
Feature activation+1.22e-01

Pos logit contributions
aband+0.494
ipel+0.457
 trap+0.438
opsis+0.430
hemer+0.408
Neg logit contributions
Sov-0.415
 plur-0.401
edin-0.393
repre-0.364
 differently-0.353
Token the
Token ID262
Feature activation-1.62e-02

Pos logit contributions
Sov+0.098
 plur+0.096
edin+0.095
repre+0.088
 differently+0.086
Neg logit contributions
aband-0.093
ipel-0.085
 trap-0.080
opsis-0.080
hemer-0.075
Token bottom
Token ID4220
Feature activation+7.10e-01

Pos logit contributions
aband+0.013
ipel+0.012
 trap+0.011
opsis+0.011
hemer+0.010
Neg logit contributions
Sov-0.013
 plur-0.012
edin-0.012
repre-0.011
 differently-0.011
Token
Token ID784
Feature activation+6.97e-01

Pos logit contributions
Sov+0.497
 plur+0.480
edin+0.475
repre+0.436
 differently+0.426
Neg logit contributions
aband-0.618
ipel-0.571
 trap-0.543
opsis-0.539
hemer-0.511
Token rather
Token ID2138
Feature activation+1.20e+00

Pos logit contributions
Sov+0.487
 plur+0.472
edin+0.466
repre+0.428
 differently+0.419
Neg logit contributions
aband-0.608
ipel-0.561
 trap-0.534
opsis-0.530
hemer-0.503
Token than
Token ID621
Feature activation-6.09e-01

Pos logit contributions
Sov+0.859
 plur+0.834
edin+0.823
repre+0.757
 differently+0.747
Neg logit contributions
aband-1.020
ipel-0.940
 trap-0.888
opsis-0.885
hemer-0.838
Token bursting
Token ID37493
Feature activation-9.94e-01

Pos logit contributions
aband+0.496
ipel+0.455
 trap+0.434
opsis+0.428
hemer+0.404
Neg logit contributions
Sov-0.461
 plur-0.445
edin-0.440
repre-0.409
 differently-0.398
Tokenhes
Token ID956
Feature activation+6.23e-01

Pos logit contributions
Sov+0.764
 plur+0.746
edin+0.739
repre+0.697
 differently+0.685
Neg logit contributions
aband-0.470
ipel-0.418
 trap-0.388
opsis-0.382
hemer-0.352
Token were
Token ID547
Feature activation-1.36e-01

Pos logit contributions
Sov+0.485
 plur+0.471
edin+0.465
repre+0.432
 differently+0.423
Neg logit contributions
aband-0.493
ipel-0.452
 trap-0.428
opsis-0.424
hemer-0.399
Token dry
Token ID5894
Feature activation-7.89e-01

Pos logit contributions
aband+0.120
ipel+0.111
 trap+0.107
opsis+0.105
hemer+0.100
Neg logit contributions
Sov-0.093
 plur-0.090
edin-0.089
repre-0.081
 differently-0.079
Token,
Token ID11
Feature activation+3.74e-02

Pos logit contributions
aband+0.608
ipel+0.556
 trap+0.527
opsis+0.519
hemer+0.490
Neg logit contributions
Sov-0.632
 plur-0.614
edin-0.605
repre-0.564
 differently-0.551
Token I
Token ID314
Feature activation-7.86e-01

Pos logit contributions
Sov+0.030
 plur+0.029
edin+0.029
repre+0.027
 differently+0.026
Neg logit contributions
aband-0.029
ipel-0.026
 trap-0.025
opsis-0.025
hemer-0.023
Token gave
Token ID2921
Feature activation-9.73e-02

Pos logit contributions
aband+0.622
ipel+0.571
 trap+0.544
opsis+0.535
hemer+0.506
Neg logit contributions
Sov-0.613
 plur-0.591
edin-0.585
repre-0.544
 differently-0.529
Token it
Token ID340
Feature activation-3.89e-01

Pos logit contributions
aband+0.083
ipel+0.077
 trap+0.073
opsis+0.072
hemer+0.069
Neg logit contributions
Sov-0.070
 plur-0.067
edin-0.066
repre-0.061
 differently-0.060
Token another
Token ID1194
Feature activation-2.76e-01

Pos logit contributions
aband+0.342
ipel+0.316
 trap+0.303
opsis+0.299
hemer+0.284
Neg logit contributions
Sov-0.267
 plur-0.257
edin-0.255
repre-0.234
 differently-0.227
Token fe
Token ID730
Feature activation+6.00e-01

Pos logit contributions
aband+0.221
ipel+0.202
 trap+0.192
opsis+0.190
hemer+0.179
Neg logit contributions
Sov-0.212
 plur-0.206
edin-0.203
repre-0.189
 differently-0.185
Tokenint
Token ID600
Feature activation-1.10e+00

Pos logit contributions
Sov+0.467
 plur+0.453
edin+0.448
repre+0.416
 differently+0.407
Neg logit contributions
aband-0.474
ipel-0.434
 trap-0.411
opsis-0.407
hemer-0.384
Token highlight
Token ID7238
Feature activation+2.08e-01

Pos logit contributions
aband+0.902
ipel+0.836
 trap+0.803
opsis+0.789
hemer+0.749
Neg logit contributions
Sov-0.830
 plur-0.792
edin-0.781
repre-0.721
 differently-0.701
Token troops
Token ID6553
Feature activation-4.27e-01

Pos logit contributions
aband+0.430
ipel+0.392
 trap+0.370
opsis+0.366
hemer+0.343
Neg logit contributions
Sov-0.477
 plur-0.464
edin-0.459
repre-0.428
 differently-0.420
Token in
Token ID287
Feature activation-1.67e-03

Pos logit contributions
aband+0.378
ipel+0.350
 trap+0.334
opsis+0.330
hemer+0.313
Neg logit contributions
Sov-0.293
 plur-0.282
edin-0.279
repre-0.256
 differently-0.249
Token Iraq
Token ID3908
Feature activation+1.37e-01

Pos logit contributions
aband+0.001
ipel+0.001
 trap+0.001
opsis+0.001
hemer+0.001
Neg logit contributions
Sov-0.001
 plur-0.001
edin-0.001
repre-0.001
 differently-0.001
Token and
Token ID290
Feature activation+3.71e-01

Pos logit contributions
Sov+0.100
 plur+0.097
edin+0.096
repre+0.088
 differently+0.086
Neg logit contributions
aband-0.115
ipel-0.106
 trap-0.101
opsis-0.100
hemer-0.095
Token Syria
Token ID4392
Feature activation+4.62e-01

Pos logit contributions
Sov+0.293
 plur+0.287
edin+0.282
repre+0.262
 differently+0.259
Neg logit contributions
aband-0.289
ipel-0.264
 trap-0.249
opsis-0.247
hemer-0.234
Token again
Token ID757
Feature activation-4.19e-01

Pos logit contributions
Sov+0.340
 plur+0.329
edin+0.325
repre+0.300
 differently+0.293
Neg logit contributions
aband-0.386
ipel-0.355
 trap-0.337
opsis-0.334
hemer-0.316
Token,
Token ID11
Feature activation+1.66e-01

Pos logit contributions
aband+0.349
ipel+0.321
 trap+0.305
opsis+0.302
hemer+0.286
Neg logit contributions
Sov-0.309
 plur-0.299
edin-0.295
repre-0.273
 differently-0.265
Token
Token ID447
Feature activation+7.13e-01

Pos logit contributions
Sov+0.119
 plur+0.115
edin+0.114
repre+0.105
 differently+0.103
Neg logit contributions
aband-0.141
ipel-0.130
 trap-0.124
opsis-0.122
hemer-0.116
Token
Token ID251
Feature activation+5.72e-01

Pos logit contributions
Sov+0.813
 plur+0.796
edin+0.791
repre+0.753
 differently+0.743
Neg logit contributions
aband-0.306
ipel-0.259
 trap-0.231
opsis-0.226
hemer-0.199
Token The
Token ID383
Feature activation+2.48e-01

Pos logit contributions
Sov+0.412
 plur+0.399
edin+0.394
repre+0.364
 differently+0.356
Neg logit contributions
aband-0.486
ipel-0.447
 trap-0.425
opsis-0.422
hemer-0.400
Token Daily
Token ID6714
Feature activation+7.00e-02

Pos logit contributions
Sov+0.202
 plur+0.197
edin+0.194
repre+0.181
 differently+0.178
Neg logit contributions
aband-0.189
ipel-0.172
 trap-0.161
opsis-0.161
hemer-0.151
Token gap
Token ID7625
Feature activation+3.92e-01

Pos logit contributions
aband+0.250
ipel+0.231
 trap+0.221
opsis+0.219
hemer+0.208
Neg logit contributions
Sov-0.181
 plur-0.175
edin-0.172
repre-0.158
 differently-0.153
Token with
Token ID351
Feature activation-1.95e-01

Pos logit contributions
Sov+0.268
 plur+0.259
edin+0.256
repre+0.235
 differently+0.228
Neg logit contributions
aband-0.348
ipel-0.322
 trap-0.307
opsis-0.304
hemer-0.289
Token debt
Token ID5057
Feature activation+9.17e-01

Pos logit contributions
aband+0.159
ipel+0.147
 trap+0.139
opsis+0.138
hemer+0.130
Neg logit contributions
Sov-0.146
 plur-0.142
edin-0.140
repre-0.129
 differently-0.128
Token.
Token ID13
Feature activation+9.30e-02

Pos logit contributions
Sov+0.640
 plur+0.622
edin+0.612
repre+0.564
 differently+0.554
Neg logit contributions
aband-0.799
ipel-0.735
 trap-0.698
opsis-0.697
hemer-0.660
Token Polit
Token ID7793
Feature activation-4.17e-01

Pos logit contributions
Sov+0.078
 plur+0.076
edin+0.075
repre+0.070
 differently+0.069
Neg logit contributions
aband-0.068
ipel-0.062
 trap-0.058
opsis-0.057
hemer-0.054
Tokenicians
Token ID5106
Feature activation-2.80e-01

Pos logit contributions
aband+0.271
ipel+0.242
 trap+0.225
opsis+0.223
hemer+0.207
Neg logit contributions
Sov-0.386
 plur-0.377
edin-0.372
repre-0.351
 differently-0.344
Token attract
Token ID4729
Feature activation-1.39e-02

Pos logit contributions
aband+0.223
ipel+0.204
 trap+0.195
opsis+0.191
hemer+0.180
Neg logit contributions
Sov-0.217
 plur-0.209
edin-0.208
repre-0.193
 differently-0.187
Token votes
Token ID5690
Feature activation+9.61e-02

Pos logit contributions
aband+0.011
ipel+0.011
 trap+0.010
opsis+0.010
hemer+0.009
Neg logit contributions
Sov-0.010
 plur-0.010
edin-0.010
repre-0.009
 differently-0.009
Token by
Token ID416
Feature activation+3.32e-01

Pos logit contributions
Sov+0.067
 plur+0.065
edin+0.064
repre+0.059
 differently+0.057
Neg logit contributions
aband-0.084
ipel-0.078
 trap-0.074
opsis-0.073
hemer-0.070
Token promising
Token ID11781
Feature activation+2.90e-02

Pos logit contributions
Sov+0.248
 plur+0.241
edin+0.237
repre+0.219
 differently+0.216
Neg logit contributions
aband-0.273
ipel-0.252
 trap-0.238
opsis-0.237
hemer-0.224
Token large
Token ID1588
Feature activation-2.62e-01

Pos logit contributions
Sov+0.020
 plur+0.019
edin+0.019
repre+0.018
 differently+0.017
Neg logit contributions
aband-0.025
ipel-0.023
 trap-0.022
opsis-0.022
hemer-0.021
Token
Token ID447
Feature activation+2.03e-01

Pos logit contributions
Sov+0.424
 plur+0.412
edin+0.409
repre+0.381
 differently+0.375
Neg logit contributions
aband-0.373
ipel-0.339
 trap-0.319
opsis-0.316
hemer-0.297
Token
Token ID251
Feature activation+1.04e-01

Pos logit contributions
Sov+0.236
 plur+0.231
edin+0.229
repre+0.218
 differently+0.216
Neg logit contributions
aband-0.084
ipel-0.070
 trap-0.062
opsis-0.061
hemer-0.053
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
Sov+0.079
 plur+0.076
edin+0.075
repre+0.070
 differently+0.068
Neg logit contributions
aband-0.084
ipel-0.077
 trap-0.073
opsis-0.073
hemer-0.069
Token
Token ID198
Feature activation-5.75e-02

Pos logit contributions
aband+0.093
ipel+0.085
 trap+0.081
opsis+0.080
hemer+0.076
Neg logit contributions
Sov-0.083
 plur-0.080
edin-0.079
repre-0.073
 differently-0.072
TokenEM
Token ID3620
Feature activation+1.87e-02

Pos logit contributions
aband+0.046
ipel+0.042
 trap+0.040
opsis+0.039
hemer+0.037
Neg logit contributions
Sov-0.045
 plur-0.043
edin-0.043
repre-0.040
 differently-0.039
TokenM
Token ID44
Feature activation-5.10e-01

Pos logit contributions
Sov+0.016
 plur+0.015
edin+0.015
repre+0.014
 differently+0.014
Neg logit contributions
aband-0.014
ipel-0.012
 trap-0.012
opsis-0.012
hemer-0.011
TokenITT
Token ID22470
Feature activation+1.23e+00

Pos logit contributions
aband+0.385
ipel+0.351
 trap+0.334
opsis+0.329
hemer+0.307
Neg logit contributions
Sov-0.414
 plur-0.402
edin-0.398
repre-0.371
 differently-0.362
Token SM
Token ID9447
Feature activation+2.49e-01

Pos logit contributions
Sov+0.998
 plur+0.970
edin+0.959
repre+0.891
 differently+0.878
Neg logit contributions
aband-0.941
ipel-0.858
 trap-0.808
opsis-0.800
hemer-0.755
TokenITH
Token ID10554
Feature activation+2.37e-01

Pos logit contributions
Sov+0.229
 plur+0.223
edin+0.221
repre+0.208
 differently+0.204
Neg logit contributions
aband-0.163
ipel-0.146
 trap-0.137
opsis-0.135
hemer-0.125
Token,
Token ID11
Feature activation+6.00e-01

Pos logit contributions
Sov+0.192
 plur+0.187
edin+0.185
repre+0.172
 differently+0.169
Neg logit contributions
aband-0.179
ipel-0.163
 trap-0.154
opsis-0.153
hemer-0.144
Token FL
Token ID9977
Feature activation+1.40e+00

Pos logit contributions
Sov+0.478
 plur+0.465
edin+0.459
repre+0.426
 differently+0.420
Neg logit contributions
aband-0.464
ipel-0.424
 trap-0.399
opsis-0.396
hemer-0.373
Token2
Token ID17
Feature activation-5.71e-01

Pos logit contributions
aband+0.419
ipel+0.384
 trap+0.365
opsis+0.359
hemer+0.338
Neg logit contributions
Sov-0.406
 plur-0.394
edin-0.390
repre-0.361
 differently-0.352
Token tires
Token ID17920
Feature activation+5.22e-02

Pos logit contributions
aband+0.443
ipel+0.405
 trap+0.384
opsis+0.379
hemer+0.356
Neg logit contributions
Sov-0.453
 plur-0.440
edin-0.436
repre-0.405
 differently-0.396
Token specifically
Token ID5734
Feature activation+2.95e-02

Pos logit contributions
Sov+0.038
 plur+0.036
edin+0.036
repre+0.033
 differently+0.032
Neg logit contributions
aband-0.044
ipel-0.041
 trap-0.039
opsis-0.039
hemer-0.037
Token designed
Token ID3562
Feature activation+7.99e-01

Pos logit contributions
Sov+0.027
 plur+0.026
edin+0.026
repre+0.024
 differently+0.024
Neg logit contributions
aband-0.019
ipel-0.017
 trap-0.016
opsis-0.016
hemer-0.015
Token for
Token ID329
Feature activation+4.24e-01

Pos logit contributions
Sov+0.523
 plur+0.505
edin+0.498
repre+0.455
 differently+0.443
Neg logit contributions
aband-0.732
ipel-0.678
 trap-0.648
opsis-0.643
hemer-0.611
Token off
Token ID572
Feature activation-2.21e-01

Pos logit contributions
Sov+0.280
 plur+0.270
edin+0.267
repre+0.244
 differently+0.238
Neg logit contributions
aband-0.385
ipel-0.357
 trap-0.341
opsis-0.338
hemer-0.321
Token-
Token ID12
Feature activation-2.63e-01

Pos logit contributions
aband+0.173
ipel+0.158
 trap+0.150
opsis+0.148
hemer+0.139
Neg logit contributions
Sov-0.174
 plur-0.169
edin-0.167
repre-0.155
 differently-0.152
Tokenroad
Token ID6344
Feature activation-2.43e-01

Pos logit contributions
aband+0.241
ipel+0.223
 trap+0.213
opsis+0.211
hemer+0.201
Neg logit contributions
Sov-0.173
 plur-0.166
edin-0.164
repre-0.150
 differently-0.146
Token performance
Token ID2854
Feature activation-1.54e-01

Pos logit contributions
aband+0.183
ipel+0.166
 trap+0.158
opsis+0.156
hemer+0.146
Neg logit contributions
Sov-0.199
 plur-0.193
edin-0.191
repre-0.178
 differently-0.174
Token.
Token ID13
Feature activation+1.61e-01

Pos logit contributions
aband+0.130
ipel+0.120
 trap+0.115
opsis+0.113
hemer+0.107
Neg logit contributions
Sov-0.111
 plur-0.107
edin-0.106
repre-0.098
 differently-0.095
Token
Token ID198
Feature activation+5.26e-01

Pos logit contributions
Sov+0.126
 plur+0.122
edin+0.120
repre+0.112
 differently+0.109
Neg logit contributions
aband-0.128
ipel-0.117
 trap-0.111
opsis-0.109
hemer-0.103
Tokenons
Token ID684
Feature activation-5.64e-02

Pos logit contributions
Sov+0.505
 plur+0.492
edin+0.487
repre+0.456
 differently+0.449
Neg logit contributions
aband-0.378
ipel-0.340
 trap-0.318
opsis-0.315
hemer-0.293
Token,
Token ID11
Feature activation+2.47e-01

Pos logit contributions
aband+0.046
ipel+0.043
 trap+0.041
opsis+0.040
hemer+0.038
Neg logit contributions
Sov-0.042
 plur-0.041
edin-0.040
repre-0.037
 differently-0.036
Token meanwhile
Token ID15066
Feature activation+2.92e-01

Pos logit contributions
Sov+0.171
 plur+0.165
edin+0.163
repre+0.150
 differently+0.146
Neg logit contributions
aband-0.217
ipel-0.201
 trap-0.191
opsis-0.189
hemer-0.180
Token,
Token ID11
Feature activation-4.93e-01

Pos logit contributions
Sov+0.235
 plur+0.230
edin+0.226
repre+0.211
 differently+0.208
Neg logit contributions
aband-0.223
ipel-0.203
 trap-0.190
opsis-0.190
uggets-0.178
Token represent
Token ID2380
Feature activation-6.15e-01

Pos logit contributions
aband+0.389
ipel+0.356
 trap+0.340
opsis+0.334
hemer+0.316
Neg logit contributions
Sov-0.385
 plur-0.371
edin-0.368
repre-0.342
 differently-0.333
Token density
Token ID12109
Feature activation-5.79e-01

Pos logit contributions
aband+0.492
ipel+0.451
 trap+0.429
opsis+0.423
hemer+0.400
Neg logit contributions
Sov-0.473
 plur-0.458
edin-0.455
repre-0.421
 differently-0.411
Token deviation
Token ID28833
Feature activation-6.23e-01

Pos logit contributions
aband+0.512
ipel+0.473
 trap+0.456
opsis+0.446
hemer+0.426
Neg logit contributions
Sov-0.397
edin-0.380
 plur-0.380
repre-0.348
 differently-0.331
Token from
Token ID422
Feature activation+9.40e-04

Pos logit contributions
aband+0.523
ipel+0.481
 trap+0.459
opsis+0.453
hemer+0.430
Neg logit contributions
Sov-0.457
 plur-0.440
edin-0.436
repre-0.403
 differently-0.390
Token that
Token ID326
Feature activation+1.93e-01

Pos logit contributions
Sov+0.001
 plur+0.001
edin+0.001
repre+0.001
 differently+0.001
Neg logit contributions
aband-0.001
ipel-0.001
 trap-0.001
opsis-0.001
hemer-0.001
Token ambient
Token ID25237
Feature activation-9.44e-01

Pos logit contributions
Sov+0.150
 plur+0.146
edin+0.144
repre+0.134
 differently+0.132
Neg logit contributions
aband-0.152
ipel-0.139
 trap-0.131
opsis-0.131
hemer-0.123
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
aband+0.691
ipel+0.626
 trap+0.595
opsis+0.585
hemer+0.549
Neg logit contributions
Sov-0.792
 plur-0.766
edin-0.761
repre-0.710
 differently-0.693
Tokenrock
Token ID10823
Feature activation+6.10e-01

Pos logit contributions
aband+0.103
ipel+0.095
 trap+0.090
opsis+0.089
hemer+0.084
Neg logit contributions
Sov-0.094
 plur-0.091
edin-0.090
repre-0.083
 differently-0.082
Token -
Token ID532
Feature activation-4.94e-01

Pos logit contributions
Sov+0.439
 plur+0.424
edin+0.421
repre+0.387
 differently+0.379
Neg logit contributions
aband-0.517
ipel-0.477
 trap-0.453
opsis-0.450
hemer-0.426
Token favorite
Token ID4004
Feature activation-1.26e+00

Pos logit contributions
aband+0.405
ipel+0.371
 trap+0.350
opsis+0.349
hemer+0.329
Neg logit contributions
Sov-0.370
 plur-0.361
edin-0.356
repre-0.328
 differently-0.324
Token favorite
Token ID4004
Feature activation-1.63e+00

Pos logit contributions
aband+0.987
ipel+0.903
 trap+0.866
opsis+0.849
hemer+0.802
Neg logit contributions
Sov-0.987
 plur-0.951
edin-0.946
repre-0.876
 differently-0.854
Token favorite
Token ID4004
Feature activation-1.98e+00

Pos logit contributions
aband+1.266
ipel+1.158
 trap+1.114
opsis+1.088
hemer+1.028
Neg logit contributions
Sov-1.295
 plur-1.243
edin-1.240
repre-1.149
 differently-1.119
Token favorite
Token ID4004
Feature activation-2.28e+00

Pos logit contributions
aband+1.534
ipel+1.403
 trap+1.354
opsis+1.320
hemer+1.248
Neg logit contributions
Sov-1.579
 plur-1.511
edin-1.508
repre-1.397
 differently-1.361
Token -
Token ID532
Feature activation-6.67e-01

Pos logit contributions
aband+1.792
ipel+1.642
 trap+1.589
opsis+1.547
hemer+1.464
Neg logit contributions
Sov-1.778
 plur-1.699
edin-1.695
repre-1.568
 differently-1.526
Token December
Token ID3426
Feature activation-9.22e-02

Pos logit contributions
aband+0.501
ipel+0.456
 trap+0.430
opsis+0.426
hemer+0.400
Neg logit contributions
Sov-0.547
 plur-0.532
edin-0.526
repre-0.490
 differently-0.481
Token 13
Token ID1511
Feature activation+2.36e-01

Pos logit contributions
aband+0.078
ipel+0.071
 trap+0.068
opsis+0.067
hemer+0.064
Neg logit contributions
Sov-0.067
 plur-0.065
edin-0.064
repre-0.059
 differently-0.058
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
Sov+0.184
 plur+0.180
edin+0.179
 differently+0.165
repre+0.164
Neg logit contributions
aband-0.185
ipel-0.170
 trap-0.159
opsis-0.158
[undecodable bytes]-0.150
Token 2010
Token ID3050
Feature activation+3.57e-01

Pos logit contributions
aband+0.950
ipel+0.873
 trap+0.832
opsis+0.824
hemer+0.778
Neg logit contributions
Sov-0.845
 plur-0.815
edin-0.805
repre-0.746
 differently-0.724
Token-
Token ID12
Feature activation-1.91e-01

Pos logit contributions
aband+0.465
ipel+0.425
 trap+0.402
opsis+0.398
hemer+0.375
Neg logit contributions
Sov-0.487
 plur-0.474
edin-0.469
repre-0.436
 differently-0.426
Tokencalled
Token ID7174
Feature activation-8.82e-01

Pos logit contributions
aband+0.123
ipel+0.111
 trap+0.103
opsis+0.102
hemer+0.095
Neg logit contributions
Sov-0.176
 plur-0.171
edin-0.170
repre-0.160
 differently-0.157
Token ethnic
Token ID9450
Feature activation-7.50e-01

Pos logit contributions
aband+0.664
ipel+0.604
 trap+0.574
opsis+0.565
hemer+0.531
Neg logit contributions
Sov-0.721
 plur-0.699
edin-0.694
repre-0.646
 differently-0.632
Token music
Token ID2647
Feature activation+5.41e-01

Pos logit contributions
aband+0.548
ipel+0.495
 trap+0.477
opsis+0.463
hemer+0.433
Neg logit contributions
Sov-0.628
 plur-0.609
edin-0.608
repre-0.567
 differently-0.549
Token?
Token ID30
Feature activation+6.36e-02

Pos logit contributions
Sov+0.338
 plur+0.324
edin+0.322
repre+0.293
 differently+0.278
Neg logit contributions
aband-0.512
ipel-0.475
 trap-0.459
opsis-0.451
hemer-0.429
Token Yes
Token ID3363
Feature activation-2.02e+00

Pos logit contributions
Sov+0.050
 plur+0.048
edin+0.048
repre+0.044
 differently+0.043
Neg logit contributions
aband-0.050
ipel-0.046
 trap-0.044
opsis-0.043
hemer-0.041
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
aband+1.613
ipel+1.476
 trap+1.421
opsis+1.394
hemer+1.312
Neg logit contributions
Sov-1.567
 plur-1.498
edin-1.491
repre-1.375
 differently-1.323
Token I
Token ID314
Feature activation+3.24e-01

Pos logit contributions
aband+0.094
ipel+0.086
 trap+0.083
opsis+0.081
hemer+0.077
Neg logit contributions
Sov-0.082
 plur-0.079
edin-0.078
repre-0.072
 differently-0.070
Token'm
Token ID1101
Feature activation+2.01e-01

Pos logit contributions
Sov+0.254
 plur+0.246
edin+0.243
repre+0.226
 differently+0.221
Neg logit contributions
aband-0.256
ipel-0.234
 trap-0.222
opsis-0.220
hemer-0.207
Token very
Token ID845
Feature activation+8.65e-02

Pos logit contributions
Sov+0.136
 plur+0.131
edin+0.130
repre+0.119
 differently+0.114
Neg logit contributions
aband-0.179
ipel-0.166
 trap-0.159
opsis-0.157
hemer-0.149
Token interested
Token ID4609
Feature activation-2.74e-01

Pos logit contributions
Sov+0.061
 plur+0.059
edin+0.058
repre+0.054
 differently+0.052
Neg logit contributions
aband-0.075
ipel-0.069
 trap-0.066
opsis-0.065
hemer-0.062
Token
Token ID198
Feature activation-5.59e-01

Pos logit contributions
aband+0.004
ipel+0.003
 trap+0.003
opsis+0.003
hemer+0.003
Neg logit contributions
Sov-0.003
 plur-0.003
edin-0.003
repre-0.003
 differently-0.003
Token
Token ID198
Feature activation-2.08e-01

Pos logit contributions
aband+0.461
ipel+0.423
 trap+0.402
opsis+0.398
hemer+0.376
Neg logit contributions
Sov-0.418
 plur-0.405
edin-0.400
repre-0.370
 differently-0.362
TokenTo
Token ID2514
Feature activation-5.83e-01

Pos logit contributions
aband+0.160
ipel+0.147
 trap+0.139
opsis+0.137
hemer+0.129
Neg logit contributions
Sov-0.165
 plur-0.161
edin-0.159
repre-0.148
 differently-0.145
Token find
Token ID1064
Feature activation-3.71e-01

Pos logit contributions
aband+0.471
ipel+0.433
 trap+0.417
opsis+0.408
hemer+0.384
Neg logit contributions
Sov-0.443
 plur-0.427
edin-0.423
repre-0.394
 differently-0.383
Token out
Token ID503
Feature activation-6.31e-03

Pos logit contributions
aband+0.270
ipel+0.245
 trap+0.231
opsis+0.229
hemer+0.214
Neg logit contributions
Sov-0.312
 plur-0.303
edin-0.300
repre-0.280
 differently-0.275
Token if
Token ID611
Feature activation-2.16e+00

Pos logit contributions
aband+0.005
ipel+0.005
 trap+0.005
opsis+0.005
hemer+0.004
Neg logit contributions
Sov-0.004
 plur-0.004
edin-0.004
repre-0.004
 differently-0.004
Token their
Token ID511
Feature activation-1.36e+00

Pos logit contributions
aband+1.794
ipel+1.656
 trap+1.607
opsis+1.576
hemer+1.474
Neg logit contributions
Sov-1.579
edin-1.513
 plur-1.499
repre-1.398
 differently-1.334
Token creation
Token ID6282
Feature activation+2.72e-01

Pos logit contributions
aband+1.059
ipel+0.967
 trap+0.937
opsis+0.912
hemer+0.853
Neg logit contributions
Sov-1.062
edin-1.020
 plur-1.018
repre-0.946
 differently-0.916
Token works
Token ID2499
Feature activation-5.26e-01

Pos logit contributions
Sov+0.197
 plur+0.190
edin+0.188
repre+0.174
 differently+0.169
Neg logit contributions
aband-0.230
ipel-0.212
 trap-0.202
opsis-0.200
hemer-0.189
Token,
Token ID11
Feature activation-5.44e-01

Pos logit contributions
aband+0.425
ipel+0.390
 trap+0.373
opsis+0.367
hemer+0.346
Neg logit contributions
Sov-0.401
 plur-0.387
edin-0.384
repre-0.357
 differently-0.343
Token they
Token ID484
Feature activation-1.12e+00

Pos logit contributions
aband+0.416
ipel+0.379
 trap+0.360
opsis+0.356
hemer+0.334
Neg logit contributions
Sov-0.438
 plur-0.424
edin-0.421
repre-0.392
 differently-0.382
Token
Token ID198
Feature activation-5.70e-01

Pos logit contributions
aband+0.850
ipel+0.773
 trap+0.733
opsis+0.720
hemer+0.677
Neg logit contributions
Sov-0.967
 plur-0.937
edin-0.927
repre-0.867
 differently-0.846
Token
Token ID198
Feature activation-3.24e-01

Pos logit contributions
aband+0.476
ipel+0.438
 trap+0.415
opsis+0.412
hemer+0.390
Neg logit contributions
Sov-0.421
 plur-0.405
edin-0.402
repre-0.370
 differently-0.363
TokenShe
Token ID3347
Feature activation-7.22e-01

Pos logit contributions
aband+0.248
ipel+0.226
 trap+0.214
opsis+0.212
hemer+0.199
Neg logit contributions
Sov-0.261
 plur-0.254
edin-0.251
repre-0.234
 differently-0.229
Token cuts
Token ID6630
Feature activation+6.78e-01

Pos logit contributions
aband+0.560
ipel+0.511
 trap+0.486
opsis+0.479
hemer+0.450
Neg logit contributions
Sov-0.575
 plur-0.557
edin-0.551
repre-0.513
 differently-0.501
Token me
Token ID502
Feature activation-2.29e-01

Pos logit contributions
Sov+0.541
 plur+0.524
edin+0.517
repre+0.482
 differently+0.468
Neg logit contributions
aband-0.525
ipel-0.480
 trap-0.457
opsis-0.449
hemer-0.423
Token off
Token ID572
Feature activation-3.00e+00

Pos logit contributions
aband+0.235
ipel+0.220
 trap+0.212
opsis+0.210
hemer+0.200
Neg logit contributions
Sov-0.127
 plur-0.119
edin-0.116
repre-0.105
 differently-0.099
Token by
Token ID416
Feature activation-4.93e-01

Pos logit contributions
aband+2.427
ipel+2.214
 trap+2.182
opsis+2.119
 sleep+2.010
Neg logit contributions
Sov-2.293
 plur-2.167
edin-2.140
repre-1.990
 Modified-1.885
Token enc
Token ID2207
Feature activation-4.43e-01

Pos logit contributions
aband+0.387
ipel+0.355
 trap+0.336
opsis+0.332
hemer+0.313
Neg logit contributions
Sov-0.386
 plur-0.375
edin-0.370
repre-0.344
 differently-0.337
Tokenir
Token ID343
Feature activation+1.32e-01

Pos logit contributions
aband+0.350
ipel+0.319
 trap+0.304
opsis+0.300
hemer+0.284
Neg logit contributions
Sov-0.352
 plur-0.337
edin-0.332
repre-0.309
 differently-0.303
Tokencling
Token ID8493
Feature activation+3.99e-01

Pos logit contributions
Sov+0.094
 plur+0.090
edin+0.089
repre+0.082
 differently+0.080
Neg logit contributions
aband-0.114
ipel-0.105
 trap-0.100
opsis-0.099
hemer-0.094
Token my
Token ID616
Feature activation-4.00e-01

Pos logit contributions
Sov+0.332
 plur+0.322
edin+0.319
repre+0.298
 differently+0.292
Neg logit contributions
aband-0.295
ipel-0.269
 trap-0.254
opsis-0.251
hemer-0.235
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
aband+0.510
ipel+0.469
 trap+0.448
opsis+0.441
hemer+0.417
Neg logit contributions
Sov-0.466
 plur-0.450
edin-0.447
repre-0.414
 differently-0.400
Token world
Token ID995
Feature activation-5.91e-01

Pos logit contributions
aband+0.087
ipel+0.080
 trap+0.077
opsis+0.075
hemer+0.071
Neg logit contributions
Sov-0.081
 plur-0.078
edin-0.078
repre-0.072
 differently-0.070
Token I
Token ID314
Feature activation+1.83e-01

Pos logit contributions
aband+0.484
ipel+0.447
 trap+0.425
opsis+0.419
hemer+0.396
Neg logit contributions
Sov-0.444
 plur-0.429
edin-0.425
repre-0.395
 differently-0.377
Token'm
Token ID1101
Feature activation+3.05e-01

Pos logit contributions
Sov+0.136
 plur+0.132
edin+0.130
repre+0.121
 differently+0.118
Neg logit contributions
aband-0.151
ipel-0.139
 trap-0.132
opsis-0.131
hemer-0.123
Token let
Token ID1309
Feature activation+1.31e+00

Pos logit contributions
Sov+0.220
 plur+0.211
edin+0.209
repre+0.195
 differently+0.187
Neg logit contributions
aband-0.258
ipel-0.238
 trap-0.228
opsis-0.224
hemer-0.213
Token loose
Token ID9155
Feature activation-2.46e+00

Pos logit contributions
Sov+0.918
 plur+0.895
edin+0.885
repre+0.805
 differently+0.797
Neg logit contributions
aband-1.138
ipel-1.046
 trap-0.995
opsis-0.986
hemer-0.939
Token upon
Token ID2402
Feature activation-1.40e-01

Pos logit contributions
aband+2.129
ipel+1.984
 trap+1.925
opsis+1.865
 sleep+1.776
Neg logit contributions
Sov-1.733
 plur-1.631
edin-1.616
repre-1.519
 Modified-1.447
Token by
Token ID416
Feature activation-6.48e-02

Pos logit contributions
aband+0.122
ipel+0.113
 trap+0.108
opsis+0.107
hemer+0.101
Neg logit contributions
Sov-0.098
 plur-0.095
edin-0.093
repre-0.086
 differently-0.084
Token the
Token ID262
Feature activation+4.39e-01

Pos logit contributions
aband+0.051
ipel+0.047
 trap+0.044
opsis+0.044
hemer+0.041
Neg logit contributions
Sov-0.051
 plur-0.049
edin-0.049
repre-0.045
 differently-0.044
Token creators
Token ID16294
Feature activation+8.01e-01

Pos logit contributions
Sov+0.345
 plur+0.336
edin+0.331
repre+0.307
 differently+0.303
Neg logit contributions
aband-0.345
ipel-0.314
 trap-0.296
opsis-0.296
hemer-0.278
Token."
Token ID526
Feature activation+1.99e-01

Pos logit contributions
Sov+0.569
 plur+0.550
edin+0.543
repre+0.500
 differently+0.489
Neg logit contributions
aband-0.688
ipel-0.635
 trap-0.604
opsis-0.599
hemer-0.568