TOP ACTIVATIONS
MAX = 8.392

Xavier Rhodes , Vikings 1 Next up : at Washington Redskins
abduct ing children - not only from their homes , but
to seize the Su wa ł ki region by the evening
en Ramsey , Jaguars 1 Next up : vs . Los
Medicine With Dr . Steve 200 80 204 O & A
Jimmy Smith , Ravens 1 Next up : Bye week (
new type ), new not only in its being , but
your egg repl acer and set aside . 2 . Steam
and employees - past or present - could face jail sentences
GDP have already changed not only their legal headquarters but ,
turn : self . clone (), pieces : self . pieces
bourne - 3000 , vic First posted <|endoftext|> R
arrangement appears to be not just the challenge to U .
self . copy _ board (); // !! !!! self .
- technology , q ld First posted <|endoftext|> The
1 Josh Norman , Redskins Next up : vs . Minnesota
366 . Topics : community - and - soc
among gay teens ? Not only did they want to escape
and the App oint ments Clause , the relevant portion of
- interest , and not only an obligation , to denounce

MIN ACTIVATIONS
MIN = -7.944

Korea 's founder Kim Il Sung and his descend ents .
wheelchair ( Figure 2 A , [ Action Junior 3 ,
submarines ( SS BN / SS GN ) in operation and
with the command , sl s logs . To see the
said . Every two years or so it comes
examining the structure , how and what makes the drawing work
. See also [ edit ] <|endoftext|> I checked
help teach others to follow in the same footsteps . He
that and really analyze how and what makes that work ,
- powered attack submarines ( SS N ), plus some diesel
Korea 's founder Kim Il Sung and his descend ents .
invoke it locally with sl s invoke local - f test
thread / 54 33 95 / lost - val ley -
shock doctrine experiment . An arch
type , which used SR - I OV and the n
Despite assurances by Japanese authorities about low
R obby M ook ( left ) was asked
unaware of how we got here . I
ically , with better standardized test scores and better quality of
) , 0 ) black hat = c v 2 .

INTERVAL 4.308 - 8.392
CONTAINS 0.366%

The one thing that Bob told me in training in
le S é nat Imp é rial , le contr at
Trump could learn several lessons from former President Harry Truman .
it was ok , a quick search revealed that they had
't see this as too big of a concern though since

INTERVAL 0.224 - 4.308
CONTAINS 43.517%

anyone with an Amazon Prime subscription to share shipping benefits and
rock in 1979 , unfortunately about the same time Jeffrey M
and her husband Douglas Fair banks wanted the Daniels ancestor to
. P ., Self ie ), but , as a female
Gr ong Gr ong was the one that stood out over

INTERVAL -3.860 - 0.224
CONTAINS 55.582%

it calls Ro java - the Kurdish areas of Syria -
An historical marker in May esville , Sum ter County ,
v 2 . Ga ussian Bl ur ( gray , (
50 homeless each week , says He vil aine
mad geek cred like wearing a TR ON - inspired watch

INTERVAL -7.944 - -3.860
CONTAINS 0.535%

each designed for 50 - year service life with one mid
od ongo s Cavern ,
res ize ( image , height = 600 ) gray =
30 kg / m ² ). In
6 ) C S –– One of the Mag
Token Xavier
Token ID30825
Feature activation+1.60e+00

Pos logit contributions
agement+1.991
gil+1.898
 Email+1.873
arge+1.829
ODY+1.824
Neg logit contributions
ixel-2.115
jri-2.060
 popul-2.031
 condem-1.895
usra-1.889
Token Rhodes
Token ID25656
Feature activation+1.16e+00

Pos logit contributions
agement+1.066
gil+1.015
 Email+0.990
arge+0.987
ODY+0.977
Neg logit contributions
ixel-1.132
jri-1.087
 popul-1.081
 forfe-1.016
 )))-1.014
Token,
Token ID11
Feature activation-6.03e-01

Pos logit contributions
agement+0.734
gil+0.690
 Email+0.679
arge+0.673
ODY+0.672
Neg logit contributions
ixel-0.863
 popul-0.836
jri-0.832
usra-0.779
 )))-0.777
Token Vikings
Token ID16420
Feature activation+1.13e+00

Pos logit contributions
ixel+0.347
jri+0.331
 popul+0.331
usra+0.303
 )))+0.302
Neg logit contributions
agement-0.482
gil-0.459
 Email-0.453
arge-0.451
ODY-0.451
Token 1
Token ID352
Feature activation+3.90e+00

Pos logit contributions
agement+0.669
gil+0.628
 Email+0.619
arge+0.612
ODY+0.612
Neg logit contributions
ixel-0.888
 popul-0.860
jri-0.859
usra-0.805
 )))-0.801
Token Next
Token ID7406
Feature activation+8.39e+00

Pos logit contributions
agement+2.349
 Email+2.255
gil+2.229
ODY+2.219
arge+2.203
Neg logit contributions
ixel-2.924
jri-2.848
 popul-2.831
usra-2.661
 forfe-2.633
Token up
Token ID510
Feature activation+3.80e+00

Pos logit contributions
agement+5.291
 Email+5.137
ODY+5.039
gil+5.010
arge+4.943
Neg logit contributions
ixel-5.840
jri-5.667
 popul-5.599
usra-5.365
 condem-5.275
Token:
Token ID25
Feature activation+9.27e-01

Pos logit contributions
agement+2.263
 Email+2.136
gil+2.134
arge+2.096
TPS+2.075
Neg logit contributions
ixel-2.928
jri-2.844
 popul-2.838
 condem-2.696
usra-2.660
Token at
Token ID379
Feature activation+1.62e+00

Pos logit contributions
agement+0.667
gil+0.630
 Email+0.627
ODY+0.623
arge+0.621
Neg logit contributions
ixel-0.602
 popul-0.579
jri-0.578
usra-0.535
 )))-0.530
Token Washington
Token ID2669
Feature activation+2.09e-01

Pos logit contributions
agement+1.186
gil+1.129
 Email+1.120
arge+1.111
ODY+1.106
Neg logit contributions
ixel-1.028
jri-0.992
 popul-0.984
usra-0.907
 condem-0.891
Token Redskins
Token ID22038
Feature activation-1.89e+00

Pos logit contributions
agement+0.204
gil+0.196
 Email+0.195
ODY+0.194
arge+0.193
Neg logit contributions
ixel-0.083
 popul-0.078
jri-0.077
usra-0.067
 )))-0.067
Token abduct
Token ID24827
Feature activation-5.29e-01

Pos logit contributions
ixel+0.409
jri+0.392
 popul+0.390
 )))+0.360
usra+0.359
Neg logit contributions
agement-0.516
gil-0.487
 Email-0.480
ODY-0.477
arge-0.477
Tokening
Token ID278
Feature activation-5.64e-01

Pos logit contributions
ixel+0.419
 popul+0.407
jri+0.404
 )))+0.383
 forfe+0.382
Neg logit contributions
agement-0.308
gil-0.286
arge-0.279
ODY-0.279
 Email-0.279
Token children
Token ID1751
Feature activation+1.10e+00

Pos logit contributions
ixel+0.331
 popul+0.318
jri+0.315
 )))+0.292
 forfe+0.291
Neg logit contributions
agement-0.444
gil-0.422
arge-0.414
ODY-0.414
 Email-0.414
Token -
Token ID532
Feature activation+1.69e+00

Pos logit contributions
agement+0.750
gil+0.705
 Email+0.694
arge+0.693
ODY+0.690
Neg logit contributions
ixel-0.755
jri-0.727
 popul-0.725
 )))-0.677
usra-0.675
Token not
Token ID407
Feature activation+2.13e+00

Pos logit contributions
agement+1.158
gil+1.083
 Email+1.064
ODY+1.062
arge+1.059
Neg logit contributions
ixel-1.163
 popul-1.119
jri-1.117
 )))-1.044
usra-1.036
Token only
Token ID691
Feature activation+8.38e+00

Pos logit contributions
agement+1.270
gil+1.172
arge+1.148
ODY+1.148
 Email+1.144
Neg logit contributions
ixel-1.674
jri-1.609
 popul-1.591
 )))-1.511
usra-1.490
Token from
Token ID422
Feature activation+1.09e+00

Pos logit contributions
agement+5.757
 Email+5.318
arge+5.291
gil+5.272
ODY+5.206
Neg logit contributions
ixel-5.428
jri-5.198
 popul-5.025
OHN-4.825
 )))-4.777
Token their
Token ID511
Feature activation+1.91e+00

Pos logit contributions
agement+0.731
gil+0.683
 Email+0.672
arge+0.672
ODY+0.669
Neg logit contributions
ixel-0.765
 popul-0.734
jri-0.730
 )))-0.686
 forfe-0.682
Token homes
Token ID5682
Feature activation+1.25e+00

Pos logit contributions
agement+1.399
gil+1.315
 Email+1.305
arge+1.298
ODY+1.295
Neg logit contributions
ixel-1.229
jri-1.182
 popul-1.169
 )))-1.093
 forfe-1.079
Token,
Token ID11
Feature activation+3.26e+00

Pos logit contributions
agement+0.772
gil+0.721
arge+0.704
 Email+0.704
ODY+0.704
Neg logit contributions
ixel-0.952
 popul-0.923
jri-0.917
 )))-0.865
 forfe-0.863
Token but
Token ID475
Feature activation+1.74e+00

Pos logit contributions
agement+1.762
gil+1.605
 Email+1.584
arge+1.575
ODY+1.566
Neg logit contributions
ixel-2.735
jri-2.633
 popul-2.618
 )))-2.494
usra-2.459
Token to
Token ID284
Feature activation+1.36e+00

Pos logit contributions
agement+0.416
gil+0.392
 Email+0.385
arge+0.383
ODY+0.382
Neg logit contributions
ixel-0.419
jri-0.404
 popul-0.403
 )))-0.375
usra-0.375
Token seize
Token ID21031
Feature activation-5.47e-01

Pos logit contributions
agement+0.934
gil+0.874
 Email+0.864
arge+0.860
ODY+0.849
Neg logit contributions
ixel-0.937
jri-0.906
 popul-0.903
usra-0.841
 )))-0.836
Token the
Token ID262
Feature activation-2.63e-01

Pos logit contributions
ixel+0.382
 popul+0.368
jri+0.367
 )))+0.344
usra+0.342
Neg logit contributions
agement-0.370
gil-0.348
 Email-0.341
arge-0.341
ODY-0.340
Token Su
Token ID1778
Feature activation+1.26e+00

Pos logit contributions
ixel+0.185
 popul+0.179
jri+0.178
 )))+0.167
 forfe+0.166
Neg logit contributions
agement-0.177
gil-0.166
arge-0.163
 Email-0.163
ODY-0.163
Tokenwa
Token ID10247
Feature activation+1.43e+00

Pos logit contributions
agement+0.566
gil+0.518
arge+0.500
 Email+0.497
ODY+0.496
Neg logit contributions
ixel-1.158
 popul-1.132
jri-1.122
 )))-1.073
 forfe-1.071
Tokenł
Token ID41615
Feature activation+8.23e+00

Pos logit contributions
agement+0.844
gil+0.789
arge+0.771
ODY+0.769
 Email+0.766
Neg logit contributions
ixel-1.115
 popul-1.082
jri-1.075
 )))-1.018
 forfe-1.017
Tokenki
Token ID4106
Feature activation+2.94e+00

Pos logit contributions
agement+5.017
arge+4.737
gil+4.655
ODY+4.636
 Email+4.581
Neg logit contributions
ixel-5.853
 popul-5.781
jri-5.550
 forfe-5.391
usra-5.383
Token region
Token ID3814
Feature activation+1.48e+00

Pos logit contributions
agement+1.910
gil+1.787
arge+1.768
 Email+1.768
ODY+1.744
Neg logit contributions
ixel-2.114
jri-2.044
 popul-2.023
 )))-1.908
usra-1.906
Token by
Token ID416
Feature activation-1.97e+00

Pos logit contributions
agement+0.975
gil+0.916
arge+0.899
 Email+0.897
ODY+0.894
Neg logit contributions
ixel-1.055
jri-1.017
 popul-1.009
 )))-0.952
usra-0.947
Token the
Token ID262
Feature activation-6.08e-02

Pos logit contributions
ixel+1.385
 popul+1.347
jri+1.329
 forfe+1.255
 )))+1.254
Neg logit contributions
agement-1.312
gil-1.236
ODY-1.210
arge-1.209
 Email-1.204
Token evening
Token ID6180
Feature activation-6.91e-02

Pos logit contributions
ixel+0.042
 popul+0.041
jri+0.041
 )))+0.038
 forfe+0.038
Neg logit contributions
agement-0.041
gil-0.039
 Email-0.038
arge-0.038
ODY-0.038
Tokenen
Token ID268
Feature activation+1.38e+00

Pos logit contributions
agement+1.318
gil+1.265
arge+1.233
 Email+1.228
ODY+1.201
Neg logit contributions
ixel-1.330
 popul-1.298
jri-1.282
 forfe-1.207
 condem-1.205
Token Ramsey
Token ID30552
Feature activation+1.74e+00

Pos logit contributions
agement+0.644
gil+0.595
 Email+0.581
arge+0.570
ODY+0.568
Neg logit contributions
ixel-1.253
jri-1.217
 popul-1.213
 )))-1.154
 forfe-1.152
Token,
Token ID11
Feature activation+8.92e-01

Pos logit contributions
agement+1.081
gil+1.013
 Email+1.003
arge+0.992
ODY+0.989
Neg logit contributions
ixel-1.307
 popul-1.268
jri-1.261
usra-1.182
 condem-1.179
Token Jaguars
Token ID28392
Feature activation+7.24e-01

Pos logit contributions
agement+0.695
gil+0.660
 Email+0.657
ODY+0.649
arge+0.648
Neg logit contributions
ixel-0.527
jri-0.505
 popul-0.504
usra-0.462
 )))-0.456
Token 1
Token ID352
Feature activation+4.13e+00

Pos logit contributions
agement+0.440
gil+0.415
 Email+0.410
ODY+0.405
arge+0.405
Neg logit contributions
ixel-0.551
 popul-0.537
jri-0.534
usra-0.500
 )))-0.497
Token Next
Token ID7406
Feature activation+8.21e+00

Pos logit contributions
agement+2.316
 Email+2.230
gil+2.183
ODY+2.165
arge+2.152
Neg logit contributions
ixel-3.278
jri-3.203
 popul-3.181
usra-2.994
 forfe-2.955
Token up
Token ID510
Feature activation+4.52e+00

Pos logit contributions
agement+4.928
 Email+4.706
arge+4.596
gil+4.586
ODY+4.542
Neg logit contributions
ixel-5.972
jri-5.795
 popul-5.750
usra-5.510
 condem-5.473
Token:
Token ID25
Feature activation+1.24e+00

Pos logit contributions
agement+2.634
 Email+2.486
gil+2.483
arge+2.430
TPS+2.414
Neg logit contributions
ixel-3.538
jri-3.450
 popul-3.442
 condem-3.333
usra-3.222
Token vs
Token ID3691
Feature activation+1.45e+00

Pos logit contributions
agement+0.854
gil+0.804
 Email+0.803
ODY+0.794
arge+0.793
Neg logit contributions
ixel-0.841
 popul-0.811
jri-0.807
usra-0.752
 )))-0.744
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
agement+0.919
gil+0.865
 Email+0.851
arge+0.845
ODY+0.841
Neg logit contributions
ixel-1.075
 popul-1.042
jri-1.038
 condem-0.971
usra-0.965
Token Los
Token ID5401
Feature activation+1.39e+00

Pos logit contributions
agement+0.841
gil+0.796
 Email+0.792
arge+0.785
ODY+0.783
Neg logit contributions
ixel-0.696
jri-0.669
 popul-0.664
usra-0.611
 condem-0.602
Token Medicine
Token ID11558
Feature activation+2.13e+00

Pos logit contributions
ixel+0.054
 popul+0.052
jri+0.052
 )))+0.049
 forfe+0.048
Neg logit contributions
agement-0.052
gil-0.049
 Email-0.048
arge-0.048
ODY-0.048
Token With
Token ID2080
Feature activation-9.15e-01

Pos logit contributions
agement+1.536
gil+1.449
 Email+1.437
arge+1.420
ODY+1.412
Neg logit contributions
ixel-1.389
 popul-1.345
jri-1.330
 forfe-1.238
 )))-1.237
Token Dr
Token ID1583
Feature activation+7.39e-01

Pos logit contributions
ixel+0.647
 popul+0.626
jri+0.621
 )))+0.584
 forfe+0.583
Neg logit contributions
agement-0.610
gil-0.572
arge-0.560
ODY-0.560
 Email-0.559
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
agement+0.492
gil+0.464
arge+0.454
 Email+0.454
ODY+0.452
Neg logit contributions
ixel-0.523
 popul-0.506
jri-0.502
 )))-0.472
 forfe-0.471
Token Steve
Token ID6542
Feature activation+8.82e-01

Pos logit contributions
agement+1.374
gil+1.300
 Email+1.285
arge+1.269
ODY+1.260
Neg logit contributions
ixel-1.526
 popul-1.476
jri-1.468
 )))-1.366
 condem-1.365
Token 200
Token ID939
Feature activation+7.79e+00

Pos logit contributions
agement+0.610
gil+0.573
 Email+0.566
arge+0.561
ODY+0.558
Neg logit contributions
ixel-0.602
 popul-0.583
jri-0.580
 )))-0.540
 forfe-0.539
Token80
Token ID1795
Feature activation-1.53e-01

Pos logit contributions
agement+3.598
 Email+3.305
ODY+3.261
gil+3.216
arge+3.084
Neg logit contributions
ixel-6.808
 popul-6.770
jri-6.718
 forfe-6.396
usra-6.369
Token204
Token ID18638
Feature activation+6.32e-01

Pos logit contributions
ixel+0.097
 popul+0.094
jri+0.093
 )))+0.086
 forfe+0.086
Neg logit contributions
agement-0.114
gil-0.107
 Email-0.105
ODY-0.105
arge-0.105
Token O
Token ID440
Feature activation-7.13e-01

Pos logit contributions
agement+0.433
gil+0.408
 Email+0.405
ODY+0.402
arge+0.398
Neg logit contributions
ixel-0.437
jri-0.422
 popul-0.420
 condem-0.390
usra-0.389
Token&
Token ID5
Feature activation-6.35e-01

Pos logit contributions
ixel+0.554
 popul+0.536
jri+0.532
 )))+0.507
 forfe+0.504
Neg logit contributions
agement-0.424
gil-0.395
arge-0.386
ODY-0.385
 Email-0.385
TokenA
Token ID32
Feature activation+9.61e-02

Pos logit contributions
ixel+0.381
 popul+0.366
jri+0.362
 )))+0.340
 forfe+0.337
Neg logit contributions
agement-0.489
gil-0.464
arge-0.456
ODY-0.454
 Email-0.454
Token Jimmy
Token ID12963
Feature activation-1.12e+00

Pos logit contributions
agement+1.205
gil+1.149
 Email+1.133
ODY+1.112
arge+1.110
Neg logit contributions
ixel-1.219
jri-1.179
 popul-1.170
usra-1.084
 condem-1.078
Token Smith
Token ID4176
Feature activation+1.71e+00

Pos logit contributions
ixel+1.068
 popul+1.048
jri+1.047
 )))+0.995
 forfe+0.993
Neg logit contributions
agement-0.465
gil-0.426
 Email-0.411
arge-0.408
ODY-0.398
Token,
Token ID11
Feature activation+3.55e-01

Pos logit contributions
agement+1.068
gil+1.002
 Email+0.990
arge+0.983
ODY+0.977
Neg logit contributions
ixel-1.285
 popul-1.249
jri-1.232
usra-1.160
 condem-1.157
Token Ravens
Token ID20091
Feature activation-1.91e-01

Pos logit contributions
agement+0.240
gil+0.225
 Email+0.225
ODY+0.221
arge+0.221
Neg logit contributions
ixel-0.248
jri-0.239
 popul-0.238
usra-0.222
 )))-0.219
Token 1
Token ID352
Feature activation+3.54e+00

Pos logit contributions
ixel+0.145
 popul+0.141
jri+0.140
 )))+0.132
 forfe+0.131
Neg logit contributions
agement-0.117
gil-0.110
 Email-0.108
arge-0.107
ODY-0.107
Token Next
Token ID7406
Feature activation+7.72e+00

Pos logit contributions
agement+2.072
 Email+1.988
gil+1.957
ODY+1.938
arge+1.927
Neg logit contributions
ixel-2.732
jri-2.671
 popul-2.648
usra-2.482
 forfe-2.461
Token up
Token ID510
Feature activation+3.75e+00

Pos logit contributions
agement+4.746
 Email+4.531
gil+4.437
arge+4.419
ODY+4.391
Neg logit contributions
ixel-5.550
jri-5.377
 popul-5.324
usra-5.080
 condem-5.059
Token:
Token ID25
Feature activation+9.06e-01

Pos logit contributions
agement+2.240
gil+2.112
 Email+2.103
arge+2.065
TPS+2.052
Neg logit contributions
ixel-2.890
jri-2.811
 popul-2.809
 condem-2.695
usra-2.623
Token Bye
Token ID47843
Feature activation+2.01e+00

Pos logit contributions
agement+0.605
gil+0.568
 Email+0.564
ODY+0.561
arge+0.560
Neg logit contributions
ixel-0.636
 popul-0.614
jri-0.612
usra-0.570
 )))-0.566
Token week
Token ID1285
Feature activation+3.90e+00

Pos logit contributions
agement+1.413
 Email+1.322
gil+1.318
ODY+1.310
arge+1.303
Neg logit contributions
ixel-1.344
jri-1.305
 popul-1.305
usra-1.213
 )))-1.208
Token (
Token ID357
Feature activation+8.91e-01

Pos logit contributions
agement+2.389
 Email+2.250
gil+2.236
arge+2.210
ODY+2.208
Neg logit contributions
ixel-2.923
 popul-2.834
jri-2.816
usra-2.657
 )))-2.622
Token new
Token ID649
Feature activation-1.52e+00

Pos logit contributions
ixel+0.126
 popul+0.121
jri+0.120
 )))+0.112
 forfe+0.112
Neg logit contributions
agement-0.135
gil-0.128
arge-0.125
 Email-0.125
ODY-0.125
Token type
Token ID2099
Feature activation+3.97e-01

Pos logit contributions
ixel+0.889
 popul+0.851
jri+0.846
 )))+0.782
 forfe+0.780
Neg logit contributions
agement-1.201
gil-1.140
arge-1.120
 Email-1.120
ODY-1.120
Token),
Token ID828
Feature activation-8.56e-01

Pos logit contributions
agement+0.265
gil+0.249
arge+0.244
 Email+0.244
ODY+0.244
Neg logit contributions
ixel-0.280
 popul-0.269
jri-0.268
 )))-0.251
 forfe-0.251
Tokennew
Token ID3605
Feature activation-9.33e-01

Pos logit contributions
ixel+0.607
 popul+0.588
jri+0.583
 )))+0.548
 forfe+0.547
Neg logit contributions
agement-0.568
gil-0.534
arge-0.522
 Email-0.522
ODY-0.522
Token not
Token ID407
Feature activation+6.99e-01

Pos logit contributions
ixel+0.611
 popul+0.588
jri+0.584
 )))+0.546
 forfe+0.544
Neg logit contributions
agement-0.671
gil-0.634
arge-0.622
 Email-0.621
ODY-0.621
Token only
Token ID691
Feature activation+7.68e+00

Pos logit contributions
agement+0.450
gil+0.421
arge+0.412
ODY+0.412
 Email+0.410
Neg logit contributions
ixel-0.515
 popul-0.491
jri-0.489
 )))-0.464
 forfe-0.460
Token in
Token ID287
Feature activation+1.70e+00

Pos logit contributions
agement+5.220
gil+4.913
ODY+4.890
arge+4.859
 Email+4.834
Neg logit contributions
ixel-5.000
jri-4.606
 popul-4.471
 )))-4.432
usra-4.357
Token its
Token ID663
Feature activation+2.47e+00

Pos logit contributions
agement+1.133
gil+1.064
ODY+1.045
 Email+1.044
arge+1.038
Neg logit contributions
ixel-1.200
jri-1.153
 popul-1.144
 )))-1.082
 forfe-1.069
Token being
Token ID852
Feature activation+2.48e+00

Pos logit contributions
agement+1.774
gil+1.679
 Email+1.659
arge+1.652
ODY+1.650
Neg logit contributions
ixel-1.596
jri-1.530
 popul-1.501
 )))-1.419
usra-1.404
Token,
Token ID11
Feature activation+2.81e+00

Pos logit contributions
agement+1.554
gil+1.449
ODY+1.419
arge+1.416
 Email+1.412
Neg logit contributions
ixel-1.849
jri-1.765
 popul-1.757
 )))-1.674
usra-1.668
Tokenbut
Token ID4360
Feature activation+9.63e-02

Pos logit contributions
agement+1.436
gil+1.322
arge+1.281
ODY+1.279
 Email+1.270
Neg logit contributions
ixel-2.421
jri-2.341
 popul-2.331
 )))-2.232
 forfe-2.210
Token your
Token ID534
Feature activation+3.93e-01

Pos logit contributions
agement+0.647
gil+0.607
 Email+0.594
ODY+0.593
arge+0.593
Neg logit contributions
ixel-0.712
 popul-0.688
jri-0.685
 )))-0.643
 forfe-0.640
Token egg
Token ID5935
Feature activation+2.35e+00

Pos logit contributions
agement+0.281
gil+0.265
 Email+0.260
arge+0.260
ODY+0.259
Neg logit contributions
ixel-0.259
 popul-0.250
jri-0.248
 )))-0.232
 forfe-0.231
Token repl
Token ID2186
Feature activation+2.35e+00

Pos logit contributions
agement+2.286
gil+2.196
 Email+2.161
ODY+2.156
arge+2.154
Neg logit contributions
ixel-0.931
jri-0.873
 popul-0.854
 )))-0.771
usra-0.761
Tokenacer
Token ID11736
Feature activation+1.99e+00

Pos logit contributions
agement+0.987
gil+0.875
arge+0.860
 Email+0.838
ODY+0.826
Neg logit contributions
ixel-2.235
 popul-2.212
jri-2.183
 condem-2.096
 forfe-2.094
Token and
Token ID290
Feature activation+3.02e+00

Pos logit contributions
agement+1.323
gil+1.242
 Email+1.223
arge+1.215
ODY+1.215
Neg logit contributions
ixel-1.408
 popul-1.356
jri-1.353
 )))-1.267
usra-1.263
Token set
Token ID900
Feature activation+7.63e+00

Pos logit contributions
agement+2.100
gil+1.966
arge+1.947
 Email+1.946
TPS+1.939
Neg logit contributions
ixel-2.041
jri-1.959
 popul-1.950
usra-1.824
 )))-1.811
Token aside
Token ID7263
Feature activation+5.46e+00

Pos logit contributions
agement+4.532
gil+4.084
 Email+4.050
TPS+4.023
arge+3.989
Neg logit contributions
ixel-5.937
 popul-5.627
jri-5.564
 condem-5.337
usra-5.242
Token.
Token ID13
Feature activation+2.72e+00

Pos logit contributions
agement+3.201
gil+2.996
 Email+2.995
arge+2.933
TPS+2.898
Neg logit contributions
ixel-4.216
 popul-4.078
jri-4.030
usra-3.832
 )))-3.808
Token 2
Token ID362
Feature activation+2.42e+00

Pos logit contributions
agement+1.747
gil+1.647
 Email+1.621
arge+1.607
ODY+1.597
Neg logit contributions
ixel-1.986
 popul-1.928
jri-1.911
 forfe-1.796
usra-1.791
Token.
Token ID13
Feature activation-7.15e-01

Pos logit contributions
agement+1.509
gil+1.428
 Email+1.404
arge+1.390
TPS+1.379
Neg logit contributions
ixel-1.816
 popul-1.772
jri-1.763
 condem-1.663
 forfe-1.640
Token Steam
Token ID9094
Feature activation+3.47e+00

Pos logit contributions
ixel+0.488
 popul+0.469
jri+0.467
 )))+0.442
 forfe+0.439
Neg logit contributions
agement-0.492
gil-0.460
ODY-0.454
arge-0.448
TPS-0.444
Token and
Token ID290
Feature activation+7.47e-01

Pos logit contributions
ixel+0.388
 popul+0.373
jri+0.372
 )))+0.348
usra+0.346
Neg logit contributions
agement-0.400
gil-0.377
 Email-0.370
arge-0.370
ODY-0.369
Token employees
Token ID4409
Feature activation+8.42e-01

Pos logit contributions
agement+0.486
gil+0.453
 Email+0.446
arge+0.444
ODY+0.441
Neg logit contributions
ixel-0.543
jri-0.524
 popul-0.521
 )))-0.490
usra-0.489
Token -
Token ID532
Feature activation+2.27e+00

Pos logit contributions
agement+0.592
gil+0.556
 Email+0.547
arge+0.546
ODY+0.544
Neg logit contributions
ixel-0.565
jri-0.545
 popul-0.543
usra-0.505
 )))-0.503
Token past
Token ID1613
Feature activation-4.75e-01

Pos logit contributions
agement+1.481
gil+1.386
 Email+1.364
ODY+1.356
arge+1.353
Neg logit contributions
ixel-1.625
 popul-1.566
jri-1.560
 )))-1.466
usra-1.464
Token or
Token ID393
Feature activation+1.01e+00

Pos logit contributions
ixel+0.328
 popul+0.317
jri+0.314
 )))+0.295
 forfe+0.294
Neg logit contributions
agement-0.325
gil-0.306
ODY-0.300
arge-0.300
 Email-0.299
Token present
Token ID1944
Feature activation+7.61e+00

Pos logit contributions
agement+0.738
gil+0.695
 Email+0.687
arge+0.681
ODY+0.676
Neg logit contributions
ixel-0.642
jri-0.613
 popul-0.610
usra-0.571
 )))-0.569
Token -
Token ID532
Feature activation+2.19e+00

Pos logit contributions
agement+3.575
 Email+3.351
gil+3.266
arge+3.252
TPS+3.174
Neg logit contributions
ixel-6.512
jri-6.217
 popul-6.198
usra-6.007
 )))-5.999
Token could
Token ID714
Feature activation+6.96e-02

Pos logit contributions
agement+1.552
gil+1.457
 Email+1.434
ODY+1.423
arge+1.423
Neg logit contributions
ixel-1.465
jri-1.415
 popul-1.407
usra-1.302
 )))-1.300
Token face
Token ID1986
Feature activation+1.15e+00

Pos logit contributions
agement+0.051
gil+0.048
 Email+0.048
arge+0.048
ODY+0.047
Neg logit contributions
ixel-0.044
 popul-0.043
jri-0.043
usra-0.039
 )))-0.039
Token jail
Token ID7356
Feature activation-1.95e-01

Pos logit contributions
agement+0.784
gil+0.734
 Email+0.724
arge+0.720
ODY+0.714
Neg logit contributions
ixel-0.804
jri-0.775
 popul-0.771
 )))-0.722
usra-0.719
Token sentences
Token ID13439
Feature activation+3.53e-01

Pos logit contributions
ixel+0.135
jri+0.130
 popul+0.129
 )))+0.121
usra+0.120
Neg logit contributions
agement-0.135
gil-0.126
 Email-0.124
arge-0.124
ODY-0.123
Token GDP
Token ID12396
Feature activation+2.14e+00

Pos logit contributions
ixel+0.867
 popul+0.839
jri+0.835
 )))+0.785
 forfe+0.783
Neg logit contributions
agement-0.759
gil-0.712
arge-0.696
 Email-0.696
ODY-0.695
Token have
Token ID423
Feature activation+1.89e+00

Pos logit contributions
agement+1.421
gil+1.343
arge+1.314
 Email+1.313
ODY+1.305
Neg logit contributions
ixel-1.504
jri-1.455
 popul-1.436
usra-1.363
 )))-1.353
Token already
Token ID1541
Feature activation-3.19e-01

Pos logit contributions
agement+1.378
gil+1.307
arge+1.297
 Email+1.295
TPS+1.271
Neg logit contributions
ixel-1.211
jri-1.181
 popul-1.126
 )))-1.071
usra-1.071
Token changed
Token ID3421
Feature activation+1.39e+00

Pos logit contributions
ixel+0.200
jri+0.193
 popul+0.189
 )))+0.177
usra+0.177
Neg logit contributions
agement-0.238
gil-0.226
arge-0.223
 Email-0.222
ODY-0.219
Token not
Token ID407
Feature activation+6.00e-01

Pos logit contributions
agement+0.890
gil+0.834
 Email+0.819
arge+0.819
ODY+0.812
Neg logit contributions
ixel-1.013
jri-0.973
 popul-0.971
 )))-0.918
usra-0.911
Token only
Token ID691
Feature activation+7.56e+00

Pos logit contributions
agement+0.359
gil+0.334
arge+0.327
ODY+0.326
 Email+0.326
Neg logit contributions
ixel-0.468
 popul-0.452
jri-0.451
 )))-0.427
usra-0.423
Token their
Token ID511
Feature activation+3.11e+00

Pos logit contributions
agement+4.569
 Email+4.246
arge+4.220
gil+4.204
ODY+4.143
Neg logit contributions
ixel-5.586
jri-5.354
 popul-5.201
usra-5.093
 )))-5.083
Token legal
Token ID2742
Feature activation+2.02e+00

Pos logit contributions
agement+1.981
gil+1.853
 Email+1.849
arge+1.844
ODY+1.810
Neg logit contributions
ixel-2.263
jri-2.203
 popul-2.141
 )))-2.043
usra-2.033
Token headquarters
Token ID10043
Feature activation+2.80e+00

Pos logit contributions
agement+1.287
gil+1.206
arge+1.200
 Email+1.193
ODY+1.188
Neg logit contributions
ixel-1.482
jri-1.432
 popul-1.411
 )))-1.339
usra-1.327
Token but
Token ID475
Feature activation+2.14e+00

Pos logit contributions
agement+1.665
gil+1.556
arge+1.533
 Email+1.531
ODY+1.519
Neg logit contributions
ixel-2.160
 popul-2.087
jri-2.085
 )))-1.973
usra-1.964
Token,
Token ID11
Feature activation+2.32e+00

Pos logit contributions
agement+1.330
gil+1.241
 Email+1.221
arge+1.218
ODY+1.207
Neg logit contributions
ixel-1.613
jri-1.556
 popul-1.548
usra-1.463
 )))-1.459
Token turn
Token ID1210
Feature activation+4.47e+00

Pos logit contributions
agement+2.196
 Email+2.091
gil+2.065
ODY+2.054
arge+2.045
Neg logit contributions
ixel-2.249
jri-2.216
 popul-2.198
usra-2.055
 forfe-2.027
Token :
Token ID1058
Feature activation+4.55e+00

Pos logit contributions
agement+2.947
gil+2.789
 Email+2.743
arge+2.732
ODY+2.727
Neg logit contributions
ixel-3.050
jri-3.019
 popul-3.014
usra-2.825
 forfe-2.789
Token self
Token ID2116
Feature activation+4.07e+00

Pos logit contributions
agement+2.741
 Email+2.634
gil+2.548
arge+2.535
ODY+2.525
Neg logit contributions
jri-3.328
ixel-3.307
 popul-3.265
usra-3.122
OHN-3.047
Token.
Token ID764
Feature activation+6.04e+00

Pos logit contributions
agement+3.786
gil+3.634
 Email+3.604
arge+3.583
ODY+3.570
Neg logit contributions
ixel-1.712
jri-1.684
 popul-1.650
usra-1.457
 forfe-1.448
Tokenclone
Token ID21018
Feature activation+4.99e+00

Pos logit contributions
agement+3.681
gil+3.472
arge+3.461
 Email+3.383
ODY+3.364
Neg logit contributions
jri-4.299
ixel-4.277
 popul-4.277
usra-4.042
OHN-3.961
Token (),
Token ID29994
Feature activation+7.48e+00

Pos logit contributions
agement+3.673
gil+3.471
 Email+3.448
ODY+3.410
arge+3.402
Neg logit contributions
ixel-3.027
 popul-2.990
jri-2.972
usra-2.777
 forfe-2.745
Token pieces
Token ID5207
Feature activation+9.96e-01

Pos logit contributions
agement+4.568
 Email+4.440
arge+4.391
ODY+4.310
gil+4.237
Neg logit contributions
jri-5.203
ixel-5.109
 popul-5.012
usra-4.861
 )))-4.712
Token :
Token ID1058
Feature activation+1.90e+00

Pos logit contributions
agement+0.664
gil+0.624
arge+0.616
 Email+0.613
ODY+0.612
Neg logit contributions
ixel-0.700
 popul-0.679
jri-0.677
usra-0.633
 forfe-0.630
Token self
Token ID2116
Feature activation+1.38e+00

Pos logit contributions
agement+1.016
 Email+0.943
gil+0.937
arge+0.927
ODY+0.927
Neg logit contributions
ixel-1.569
jri-1.546
 popul-1.524
usra-1.452
 )))-1.434
Token.
Token ID764
Feature activation+5.53e+00

Pos logit contributions
agement+1.367
gil+1.313
 Email+1.297
arge+1.295
ODY+1.293
Neg logit contributions
ixel-0.526
jri-0.495
 popul-0.493
usra-0.427
 forfe-0.425
Tokenpieces
Token ID34154
Feature activation+2.17e+00

Pos logit contributions
agement+3.589
gil+3.421
arge+3.372
ODY+3.289
Boss+3.280
Neg logit contributions
 popul-3.733
ixel-3.724
jri-3.712
usra-3.481
 )))-3.460
Tokenbourne
Token ID12544
Feature activation+5.70e-01

Pos logit contributions
agement+1.079
gil+0.997
arge+0.966
 Email+0.946
ODY+0.934
Neg logit contributions
ixel-1.952
jri-1.920
 popul-1.905
 condem-1.814
 forfe-1.807
Token-
Token ID12
Feature activation+5.05e-01

Pos logit contributions
agement+0.448
gil+0.425
arge+0.417
 Email+0.417
ODY+0.417
Neg logit contributions
ixel-0.335
 popul-0.322
jri-0.320
 )))-0.296
 forfe-0.295
Token3000
Token ID23924
Feature activation+4.09e+00

Pos logit contributions
agement+0.285
gil+0.266
ODY+0.265
 Email+0.263
TPS+0.262
Neg logit contributions
ixel-0.401
 popul-0.389
jri-0.384
 )))-0.373
usra-0.366
Token,
Token ID11
Feature activation+3.09e+00

Pos logit contributions
agement+2.505
gil+2.347
 Email+2.326
arge+2.300
ODY+2.261
Neg logit contributions
ixel-3.083
 popul-2.998
jri-2.979
 )))-2.804
usra-2.804
Token vic
Token ID20429
Feature activation+1.83e+00

Pos logit contributions
agement+2.300
gil+2.178
 Email+2.149
arge+2.135
ODY+2.124
Neg logit contributions
ixel-1.952
jri-1.848
 popul-1.828
 )))-1.708
 forfe-1.699
Token
Token ID198
Feature activation+7.43e+00

Pos logit contributions
agement+1.164
gil+1.090
 Email+1.072
arge+1.069
ODY+1.055
Neg logit contributions
ixel-1.353
 popul-1.308
jri-1.299
 )))-1.228
 forfe-1.222
Token
Token ID198
Feature activation+6.58e+00

Pos logit contributions
agement+3.688
 Email+3.558
gil+3.433
TPS+3.354
arge+3.340
Neg logit contributions
ixel-6.461
 condem-6.365
jri-6.310
 popul-6.234
 conduc-6.095
TokenFirst
Token ID5962
Feature activation+3.18e-01

Pos logit contributions
agement+1.741
 Email+1.586
arge+1.549
TPS+1.496
gil+1.451
Neg logit contributions
ixel-7.206
 popul-7.091
jri-6.830
 forfe-6.781
 condem-6.762
Token posted
Token ID4481
Feature activation+3.76e+00

Pos logit contributions
agement+0.242
gil+0.231
ODY+0.225
arge+0.224
 Email+0.224
Neg logit contributions
ixel-0.193
 popul-0.187
jri-0.185
 )))-0.173
 forfe-0.172
Token<|endoftext|>
Token ID50256
Feature activation+3.04e+00

Pos logit contributions
agement+2.109
 Email+2.071
gil+1.999
arge+1.975
ODY+1.895
Neg logit contributions
ixel-2.985
 popul-2.885
jri-2.867
 condem-2.828
 disse-2.745
TokenR
Token ID49
Feature activation+1.04e+00

Pos logit contributions
agement+1.954
gil+1.846
ODY+1.811
TPS+1.806
arge+1.797
Neg logit contributions
ixel-2.209
 popul-2.161
jri-2.124
 forfe-2.009
 condem-1.992
Token arrangement
Token ID13888
Feature activation-1.68e+00

Pos logit contributions
agement+0.578
gil+0.540
 Email+0.531
arge+0.528
ODY+0.527
Neg logit contributions
ixel-0.674
 popul-0.651
jri-0.647
 )))-0.610
usra-0.609
Token appears
Token ID3568
Feature activation+1.80e+00

Pos logit contributions
ixel+1.078
 popul+1.049
jri+1.038
 forfe+0.970
 )))+0.969
Neg logit contributions
agement-1.226
gil-1.160
arge-1.138
ODY-1.137
 Email-1.133
Token to
Token ID284
Feature activation+4.55e-01

Pos logit contributions
agement+1.002
gil+0.929
 Email+0.910
arge+0.904
ODY+0.901
Neg logit contributions
ixel-1.479
 popul-1.416
jri-1.411
 )))-1.342
usra-1.341
Token be
Token ID307
Feature activation+2.99e+00

Pos logit contributions
agement+0.332
gil+0.313
 Email+0.308
arge+0.308
ODY+0.306
Neg logit contributions
ixel-0.293
 popul-0.281
jri-0.279
 forfe-0.260
 )))-0.260
Token not
Token ID407
Feature activation+3.32e+00

Pos logit contributions
agement+1.969
gil+1.832
 Email+1.811
ODY+1.799
arge+1.793
Neg logit contributions
ixel-2.159
 popul-2.055
jri-2.041
 )))-1.939
usra-1.921
Token just
Token ID655
Feature activation+7.40e+00

Pos logit contributions
agement+2.130
gil+1.979
 Email+1.956
ODY+1.955
arge+1.947
Neg logit contributions
ixel-2.461
 popul-2.329
jri-2.320
 )))-2.209
usra-2.175
Token the
Token ID262
Feature activation+4.59e+00

Pos logit contributions
agement+4.765
ODY+4.352
 Email+4.352
gil+4.328
arge+4.303
Neg logit contributions
ixel-5.400
jri-4.922
 popul-4.922
 )))-4.786
OHN-4.714
Token challenge
Token ID4427
Feature activation+3.09e+00

Pos logit contributions
agement+3.302
 Email+3.070
gil+3.052
arge+3.026
TPS+3.013
Neg logit contributions
ixel-3.006
jri-2.794
 popul-2.771
 )))-2.653
usra-2.603
Token to
Token ID284
Feature activation+3.15e+00

Pos logit contributions
agement+2.000
gil+1.843
 Email+1.835
arge+1.820
ODY+1.815
Neg logit contributions
ixel-2.244
jri-2.123
 popul-2.120
 )))-2.013
usra-2.003
Token U
Token ID471
Feature activation+2.59e-01

Pos logit contributions
agement+2.104
gil+1.954
 Email+1.938
ODY+1.915
arge+1.912
Neg logit contributions
ixel-2.232
jri-2.100
 popul-2.092
usra-1.983
 )))-1.978
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
agement+0.175
gil+0.164
ODY+0.161
arge+0.160
 Email+0.160
Neg logit contributions
ixel-0.181
 popul-0.175
jri-0.174
 )))-0.164
 forfe-0.163
Token self
Token ID2116
Feature activation+4.90e-01

Pos logit contributions
agement+1.648
 Email+1.543
gil+1.516
arge+1.501
ODY+1.501
Neg logit contributions
ixel-2.628
jri-2.603
 popul-2.561
usra-2.444
 )))-2.412
Token.
Token ID764
Feature activation+2.74e+00

Pos logit contributions
agement+0.494
gil+0.474
 Email+0.468
arge+0.468
ODY+0.467
Neg logit contributions
ixel-0.179
 popul-0.168
jri-0.167
 )))-0.145
 forfe-0.145
Tokencopy
Token ID30073
Feature activation+3.56e+00

Pos logit contributions
agement+1.541
gil+1.439
arge+1.413
 Email+1.390
ODY+1.389
Neg logit contributions
ixel-2.161
 popul-2.131
jri-2.126
 forfe-1.999
usra-1.997
Token_
Token ID62
Feature activation+5.00e+00

Pos logit contributions
agement+2.613
gil+2.464
 Email+2.427
ODY+2.418
arge+2.410
Neg logit contributions
ixel-2.215
 popul-2.164
jri-2.161
usra-2.006
 forfe-2.005
Tokenboard
Token ID3526
Feature activation+1.72e+00

Pos logit contributions
agement+2.568
gil+2.390
arge+2.308
ODY+2.283
TPS+2.254
Neg logit contributions
ixel-4.162
 popul-4.123
jri-4.077
 forfe-3.899
usra-3.866
Token ();
Token ID13979
Feature activation+7.36e+00

Pos logit contributions
agement+1.215
gil+1.142
 Email+1.124
ODY+1.121
arge+1.120
Neg logit contributions
ixel-1.146
 popul-1.111
jri-1.108
usra-1.027
 forfe-1.027
Token //
Token ID3373
Feature activation+2.28e+00

Pos logit contributions
agement+4.219
 Email+4.200
arge+4.043
ODY+3.935
Boss+3.927
Neg logit contributions
jri-5.451
ixel-5.381
 popul-5.208
usra-5.015
 )))-4.939
Token!!
Token ID37867
Feature activation-9.62e-02

Pos logit contributions
agement+1.480
gil+1.400
 Email+1.392
arge+1.380
ODY+1.370
Neg logit contributions
ixel-1.620
jri-1.583
 popul-1.574
usra-1.477
 forfe-1.462
Token!!!
Token ID10185
Feature activation+2.56e+00

Pos logit contributions
ixel+0.067
 popul+0.065
jri+0.065
 )))+0.060
 forfe+0.060
Neg logit contributions
agement-0.065
gil-0.061
 Email-0.060
arge-0.060
ODY-0.060
Token self
Token ID2116
Feature activation+1.26e+00

Pos logit contributions
agement+1.728
 Email+1.619
gil+1.610
arge+1.605
ODY+1.600
Neg logit contributions
ixel-1.757
 popul-1.710
jri-1.698
usra-1.591
 forfe-1.580
Token.
Token ID764
Feature activation+4.15e+00

Pos logit contributions
agement+1.230
gil+1.177
 Email+1.166
arge+1.162
ODY+1.161
Neg logit contributions
ixel-0.495
jri-0.468
 popul-0.467
usra-0.406
 forfe-0.402
Token-
Token ID12
Feature activation-1.22e+00

Pos logit contributions
ixel+0.642
 popul+0.621
jri+0.611
 )))+0.565
 forfe+0.564
Neg logit contributions
agement-0.897
gil-0.853
ODY-0.843
 Email-0.842
TPS-0.837
Tokentechnology
Token ID45503
Feature activation+2.94e+00

Pos logit contributions
ixel+0.844
 popul+0.827
jri+0.811
usra+0.769
 )))+0.765
Neg logit contributions
agement-0.819
ODY-0.771
gil-0.766
 Email-0.764
arge-0.759
Token,
Token ID11
Feature activation+2.59e+00

Pos logit contributions
agement+1.825
gil+1.687
arge+1.680
 Email+1.665
ODY+1.632
Neg logit contributions
ixel-2.228
 popul-2.148
jri-2.143
 condem-2.018
usra-2.017
Token q
Token ID10662
Feature activation+1.24e+00

Pos logit contributions
agement+1.805
gil+1.697
 Email+1.670
arge+1.666
ODY+1.660
Neg logit contributions
ixel-1.766
 popul-1.677
jri-1.674
 )))-1.571
 forfe-1.562
Tokenld
Token ID335
Feature activation+3.51e+00

Pos logit contributions
agement+0.917
gil+0.875
arge+0.855
 Email+0.851
ODY+0.842
Neg logit contributions
ixel-0.787
 popul-0.757
jri-0.751
 forfe-0.701
 condem-0.697
Token
Token ID198
Feature activation+7.32e+00

Pos logit contributions
agement+2.162
gil+2.034
 Email+2.011
arge+1.998
ODY+1.963
Neg logit contributions
ixel-2.641
 popul-2.555
jri-2.527
 )))-2.404
usra-2.386
Token
Token ID198
Feature activation+5.71e+00

Pos logit contributions
agement+3.693
 Email+3.542
gil+3.459
arge+3.356
TPS+3.350
Neg logit contributions
ixel-6.316
 condem-6.162
jri-6.137
 popul-6.074
 conduc-5.889
TokenFirst
Token ID5962
Feature activation+2.92e-01

Pos logit contributions
agement+1.599
arge+1.443
 Email+1.441
TPS+1.372
gil+1.372
Neg logit contributions
ixel-6.159
 popul-6.041
jri-5.847
 forfe-5.804
 condem-5.780
Token posted
Token ID4481
Feature activation+3.43e+00

Pos logit contributions
agement+0.219
gil+0.207
arge+0.203
ODY+0.203
 Email+0.203
Neg logit contributions
ixel-0.182
 popul-0.176
jri-0.174
 )))-0.162
 forfe-0.162
Token<|endoftext|>
Token ID50256
Feature activation+3.13e+00

Pos logit contributions
agement+1.952
 Email+1.907
gil+1.846
arge+1.824
TPS+1.764
Neg logit contributions
ixel-2.696
 popul-2.602
jri-2.574
 condem-2.534
usra-2.472
TokenThe
Token ID464
Feature activation-8.70e-01

Pos logit contributions
agement+2.019
gil+1.907
ODY+1.871
TPS+1.867
arge+1.858
Neg logit contributions
ixel-2.277
 popul-2.227
jri-2.189
 forfe-2.071
 condem-2.053
Token 1
Token ID352
Feature activation+1.18e+00

Pos logit contributions
agement+1.252
gil+1.160
 Email+1.158
ODY+1.156
arge+1.133
Neg logit contributions
ixel-1.692
 popul-1.647
jri-1.643
usra-1.539
 condem-1.527
Token Josh
Token ID8518
Feature activation-2.52e-01

Pos logit contributions
agement+0.793
gil+0.752
 Email+0.740
ODY+0.738
arge+0.732
Neg logit contributions
ixel-0.830
 popul-0.806
jri-0.800
usra-0.746
 forfe-0.742
Token Norman
Token ID19636
Feature activation+6.17e-01

Pos logit contributions
ixel+0.164
 popul+0.158
jri+0.157
 )))+0.146
 forfe+0.146
Neg logit contributions
agement-0.182
gil-0.171
ODY-0.169
 Email-0.168
arge-0.168
Token,
Token ID11
Feature activation-2.71e-01

Pos logit contributions
agement+0.394
gil+0.370
 Email+0.364
ODY+0.362
arge+0.362
Neg logit contributions
ixel-0.454
 popul-0.439
jri-0.436
 )))-0.409
usra-0.408
Token Redskins
Token ID22038
Feature activation+8.42e-01

Pos logit contributions
ixel+0.188
 popul+0.182
jri+0.181
 )))+0.169
 forfe+0.169
Neg logit contributions
agement-0.184
gil-0.173
 Email-0.170
ODY-0.170
arge-0.169
Token Next
Token ID7406
Feature activation+7.32e+00

Pos logit contributions
agement+0.528
gil+0.496
 Email+0.492
ODY+0.486
arge+0.485
Neg logit contributions
ixel-0.626
 popul-0.606
jri-0.606
usra-0.565
 )))-0.564
Token up
Token ID510
Feature activation+1.40e+00

Pos logit contributions
agement+4.896
 Email+4.825
ODY+4.697
gil+4.668
arge+4.603
Neg logit contributions
ixel-4.768
 popul-4.751
jri-4.639
usra-4.420
 forfe-4.242
Token:
Token ID25
Feature activation-4.41e-02

Pos logit contributions
agement+0.891
gil+0.843
 Email+0.834
ODY+0.827
arge+0.825
Neg logit contributions
ixel-1.019
 popul-0.989
jri-0.980
usra-0.926
 )))-0.918
Token vs
Token ID3691
Feature activation+5.95e-01

Pos logit contributions
ixel+0.029
 popul+0.028
jri+0.028
usra+0.026
 )))+0.026
Neg logit contributions
agement-0.031
gil-0.029
 Email-0.029
ODY-0.029
arge-0.029
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
agement+0.391
gil+0.368
 Email+0.361
arge+0.360
ODY+0.359
Neg logit contributions
ixel-0.426
 popul-0.412
jri-0.409
usra-0.383
 forfe-0.382
Token Minnesota
Token ID8919
Feature activation-5.82e-01

Pos logit contributions
agement+0.132
gil+0.125
 Email+0.123
arge+0.122
ODY+0.122
Neg logit contributions
ixel-0.115
 popul-0.111
jri-0.111
usra-0.102
 forfe-0.102
Token 366
Token ID44856
Feature activation+1.45e+00

Pos logit contributions
agement+1.410
 Email+1.300
gil+1.297
arge+1.271
ODY+1.268
Neg logit contributions
ixel-2.119
jri-2.060
 popul-2.052
usra-1.942
 forfe-1.927
Token.
Token ID13
Feature activation+5.92e-01

Pos logit contributions
agement+0.919
 Email+0.863
gil+0.861
arge+0.848
ODY+0.840
Neg logit contributions
ixel-1.070
jri-1.036
 popul-1.030
usra-0.970
 )))-0.967
Token
Token ID198
Feature activation+1.55e+00

Pos logit contributions
agement+0.388
gil+0.364
 Email+0.360
arge+0.356
ODY+0.356
Neg logit contributions
ixel-0.428
 popul-0.412
jri-0.411
 forfe-0.384
 )))-0.384
Token
Token ID198
Feature activation+9.82e-01

Pos logit contributions
agement+0.969
gil+0.908
 Email+0.893
arge+0.887
ODY+0.883
Neg logit contributions
ixel-1.169
 popul-1.134
jri-1.129
 forfe-1.055
 condem-1.055
TokenTopics
Token ID25902
Feature activation-6.13e-01

Pos logit contributions
agement+0.532
gil+0.493
 Email+0.480
arge+0.480
ODY+0.480
Neg logit contributions
ixel-0.819
 popul-0.797
jri-0.789
 forfe-0.750
 )))-0.748
Token:
Token ID25
Feature activation+7.29e+00

Pos logit contributions
ixel+0.439
 popul+0.425
jri+0.421
 )))+0.396
 forfe+0.395
Neg logit contributions
agement-0.404
gil-0.379
arge-0.371
ODY-0.371
 Email-0.371
Token community
Token ID2055
Feature activation+2.14e+00

Pos logit contributions
agement+4.699
gil+4.366
 Email+4.356
arge+4.299
ODY+4.214
Neg logit contributions
ixel-5.266
jri-4.913
 popul-4.700
usra-4.647
OHN-4.597
Token-
Token ID12
Feature activation+4.53e+00

Pos logit contributions
agement+1.585
gil+1.485
arge+1.479
 Email+1.454
ODY+1.442
Neg logit contributions
ixel-1.361
jri-1.296
 popul-1.289
 condem-1.199
usra-1.198
Tokenand
Token ID392
Feature activation+2.70e+00

Pos logit contributions
agement+2.709
gil+2.502
mot+2.420
arge+2.403
 Email+2.359
Neg logit contributions
ixel-3.529
jri-3.458
 popul-3.379
OHN-3.312
 forfe-3.277
Token-
Token ID12
Feature activation+2.94e-01

Pos logit contributions
agement+2.051
gil+1.936
 Email+1.909
arge+1.907
ODY+1.889
Neg logit contributions
ixel-1.666
 popul-1.592
jri-1.581
 forfe-1.459
 )))-1.459
Tokensoc
Token ID35634
Feature activation-8.18e-01

Pos logit contributions
agement+0.312
gil+0.300
ODY+0.297
 Email+0.297
arge+0.297
Neg logit contributions
ixel-0.091
 popul-0.084
jri-0.082
 )))-0.071
usra-0.069
Token among
Token ID1871
Feature activation-5.92e-01

Pos logit contributions
ixel+1.662
 popul+1.627
jri+1.601
 forfe+1.514
 )))+1.509
Neg logit contributions
agement-1.756
gil-1.657
ODY-1.617
arge-1.617
 Email-1.606
Token gay
Token ID5650
Feature activation-1.03e+00

Pos logit contributions
ixel+0.396
 popul+0.382
jri+0.379
 )))+0.355
 forfe+0.354
Neg logit contributions
agement-0.418
gil-0.394
arge-0.386
 Email-0.386
ODY-0.386
Token teens
Token ID15508
Feature activation+2.73e-01

Pos logit contributions
ixel+0.550
 popul+0.525
jri+0.521
 )))+0.478
 forfe+0.477
Neg logit contributions
agement-0.859
gil-0.818
arge-0.804
 Email-0.804
ODY-0.803
Token?
Token ID30
Feature activation+3.81e-01

Pos logit contributions
agement+0.184
gil+0.173
 Email+0.170
arge+0.169
ODY+0.169
Neg logit contributions
ixel-0.192
 popul-0.185
jri-0.184
 )))-0.172
usra-0.172
Token Not
Token ID1892
Feature activation+3.01e+00

Pos logit contributions
agement+0.249
gil+0.234
 Email+0.230
ODY+0.229
arge+0.228
Neg logit contributions
ixel-0.275
 popul-0.265
jri-0.264
 forfe-0.247
 )))-0.247
Token only
Token ID691
Feature activation+7.22e+00

Pos logit contributions
agement+1.820
gil+1.700
ODY+1.681
arge+1.674
 Email+1.668
Neg logit contributions
ixel-2.307
jri-2.210
 popul-2.189
 )))-2.074
 forfe-2.059
Token did
Token ID750
Feature activation+1.69e+00

Pos logit contributions
agement+4.094
 Email+3.811
ODY+3.793
gil+3.774
arge+3.744
Neg logit contributions
ixel-5.697
jri-5.444
 popul-5.290
 )))-5.078
usra-5.070
Token they
Token ID484
Feature activation+4.69e-02

Pos logit contributions
agement+1.100
gil+1.029
 Email+1.014
ODY+1.009
arge+1.009
Neg logit contributions
ixel-1.227
jri-1.178
 popul-1.176
 )))-1.103
usra-1.100
Token want
Token ID765
Feature activation+1.54e+00

Pos logit contributions
agement+0.031
gil+0.030
 Email+0.029
arge+0.029
ODY+0.029
Neg logit contributions
ixel-0.033
 popul-0.032
jri-0.032
 )))-0.030
 forfe-0.030
Token to
Token ID284
Feature activation+4.16e-01

Pos logit contributions
agement+0.906
gil+0.843
 Email+0.823
arge+0.823
ODY+0.823
Neg logit contributions
ixel-1.211
 popul-1.170
jri-1.165
 )))-1.099
 forfe-1.096
Token escape
Token ID6654
Feature activation-2.08e+00

Pos logit contributions
agement+0.282
gil+0.265
 Email+0.261
arge+0.260
ODY+0.259
Neg logit contributions
ixel-0.290
 popul-0.280
jri-0.279
usra-0.260
 )))-0.259
Token and
Token ID290
Feature activation+4.02e-01

Pos logit contributions
agement+0.032
gil+0.030
ODY+0.030
 Email+0.030
arge+0.030
Neg logit contributions
ixel-0.034
 popul-0.033
jri-0.033
 )))-0.031
usra-0.031
Token the
Token ID262
Feature activation+2.50e-01

Pos logit contributions
agement+0.266
gil+0.251
arge+0.246
 Email+0.246
ODY+0.243
Neg logit contributions
ixel-0.286
jri-0.275
 popul-0.273
 )))-0.257
usra-0.257
Token App
Token ID2034
Feature activation-1.75e+00

Pos logit contributions
agement+0.172
gil+0.162
 Email+0.158
arge+0.158
ODY+0.158
Neg logit contributions
ixel-0.171
 popul-0.165
jri-0.164
 )))-0.154
 forfe-0.153
Tokenoint
Token ID1563
Feature activation-1.12e+00

Pos logit contributions
ixel+1.228
 popul+1.177
jri+1.158
 forfe+1.102
 diss+1.087
Neg logit contributions
agement-1.172
gil-1.095
ODY-1.086
TPS-1.082
 Email-1.072
Tokenments
Token ID902
Feature activation-1.89e+00

Pos logit contributions
ixel+1.089
 popul+1.069
jri+1.049
 forfe+1.015
 )))+1.012
Neg logit contributions
agement-0.440
gil-0.396
ODY-0.383
TPS-0.382
arge-0.381
Token Clause
Token ID28081
Feature activation+7.20e+00

Pos logit contributions
ixel+1.333
 popul+1.295
jri+1.278
 )))+1.202
 forfe+1.200
Neg logit contributions
agement-1.258
gil-1.185
ODY-1.156
arge-1.155
 Email-1.153
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
agement+4.153
arge+3.870
gil+3.869
ODY+3.761
 Email+3.751
Neg logit contributions
ixel-5.429
jri-5.262
 popul-5.106
usra-5.008
 )))-4.897
Token the
Token ID262
Feature activation-3.61e-02

Pos logit contributions
agement+1.089
gil+1.023
arge+1.001
 Email+0.996
ODY+0.995
Neg logit contributions
ixel-1.263
jri-1.220
 popul-1.206
usra-1.143
 )))-1.136
Token relevant
Token ID5981
Feature activation-1.45e+00

Pos logit contributions
ixel+0.025
 popul+0.025
jri+0.024
 )))+0.023
 forfe+0.023
Neg logit contributions
agement-0.024
gil-0.023
arge-0.022
ODY-0.022
 Email-0.022
Token portion
Token ID6903
Feature activation+3.65e+00

Pos logit contributions
ixel+0.983
 popul+0.962
jri+0.939
 diss+0.895
 forfe+0.895
Neg logit contributions
agement-1.004
gil-0.946
arge-0.927
ODY-0.924
TPS-0.921
Token of
Token ID286
Feature activation+1.64e-01

Pos logit contributions
agement+2.270
gil+2.121
 Email+2.075
arge+2.072
ODY+2.063
Neg logit contributions
ixel-2.714
jri-2.617
 popul-2.570
usra-2.468
 )))-2.446
Token-
Token ID12
Feature activation+2.95e+00

Pos logit contributions
agement+1.806
gil+1.681
arge+1.658
 Email+1.647
ODY+1.636
Neg logit contributions
ixel-1.645
jri-1.584
 popul-1.563
 )))-1.452
usra-1.444
Tokeninterest
Token ID9446
Feature activation+8.72e-01

Pos logit contributions
agement+1.931
gil+1.824
arge+1.765
 Email+1.745
ODY+1.733
Neg logit contributions
ixel-2.139
 popul-2.064
jri-2.058
 )))-1.919
usra-1.908
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
agement+0.555
gil+0.519
arge+0.509
 Email+0.508
ODY+0.507
Neg logit contributions
ixel-0.644
 popul-0.621
jri-0.619
 )))-0.582
usra-0.579
Token and
Token ID290
Feature activation+5.01e-01

Pos logit contributions
agement+0.769
gil+0.721
 Email+0.707
arge+0.707
ODY+0.705
Neg logit contributions
ixel-0.812
jri-0.779
 popul-0.779
 )))-0.730
usra-0.728
Token not
Token ID407
Feature activation+2.82e-01

Pos logit contributions
agement+0.330
gil+0.309
 Email+0.303
arge+0.303
ODY+0.303
Neg logit contributions
ixel-0.359
 popul-0.347
jri-0.345
 )))-0.324
 forfe-0.322
Token only
Token ID691
Feature activation+7.19e+00

Pos logit contributions
agement+0.180
gil+0.169
ODY+0.165
arge+0.165
 Email+0.165
Neg logit contributions
ixel-0.207
 popul-0.201
jri-0.199
 )))-0.188
 forfe-0.187
Token an
Token ID281
Feature activation+1.73e+00

Pos logit contributions
agement+4.295
 Email+3.963
gil+3.929
arge+3.916
ODY+3.842
Neg logit contributions
ixel-5.503
jri-5.265
 popul-5.023
usra-4.888
OHN-4.861
Token obligation
Token ID12990
Feature activation+2.95e+00

Pos logit contributions
agement+1.196
gil+1.114
 Email+1.110
arge+1.099
ODY+1.098
Neg logit contributions
ixel-1.189
jri-1.151
 popul-1.135
usra-1.060
 )))-1.057
Token,
Token ID11
Feature activation+1.83e+00

Pos logit contributions
agement+1.782
gil+1.650
arge+1.630
ODY+1.623
 Email+1.620
Neg logit contributions
ixel-2.270
 popul-2.182
jri-2.178
usra-2.055
 )))-2.055
Token to
Token ID284
Feature activation+7.48e-01

Pos logit contributions
agement+1.094
gil+1.018
arge+0.996
 Email+0.995
ODY+0.994
Neg logit contributions
ixel-1.425
 popul-1.380
jri-1.373
 )))-1.297
 forfe-1.292
Token denounce
Token ID38639
Feature activation-5.55e-01

Pos logit contributions
agement+0.478
gil+0.446
 Email+0.440
arge+0.438
ODY+0.435
Neg logit contributions
ixel-0.547
jri-0.532
 popul-0.528
usra-0.493
 )))-0.492
Token Korea
Token ID4969
Feature activation-3.25e+00

Pos logit contributions
ixel+1.168
 popul+1.130
jri+1.107
 forfe+1.025
usra+1.021
Neg logit contributions
agement-1.867
gil-1.767
arge-1.752
ODY-1.749
 Email-1.741
Token's
Token ID338
Feature activation-1.61e+00

Pos logit contributions
ixel+2.342
 popul+2.315
jri+2.273
 diss+2.149
 forfe+2.142
Neg logit contributions
agement-2.077
gil-1.939
arge-1.920
ODY-1.890
 Email-1.884
Token founder
Token ID9119
Feature activation-3.71e-01

Pos logit contributions
ixel+0.780
 popul+0.751
jri+0.738
 )))+0.673
 forfe+0.672
Neg logit contributions
agement-1.426
gil-1.360
arge-1.340
 Email-1.338
ODY-1.338
Token Kim
Token ID6502
Feature activation-4.60e+00

Pos logit contributions
ixel+0.256
 popul+0.246
jri+0.245
 )))+0.230
 forfe+0.229
Neg logit contributions
agement-0.254
gil-0.239
 Email-0.234
ODY-0.234
arge-0.234
Token Il
Token ID13778
Feature activation-4.08e+00

Pos logit contributions
ixel+3.746
 popul+3.730
jri+3.686
 diss+3.538
 forfe+3.530
Neg logit contributions
agement-2.440
arge-2.252
ODY-2.233
gil-2.225
TPS-2.202
Token Sung
Token ID29802
Feature activation-7.94e+00

Pos logit contributions
ixel+2.179
 popul+2.131
jri+2.094
 diss+1.974
 forfe+1.963
Neg logit contributions
agement-3.320
ODY-3.171
gil-3.158
arge-3.156
TPS-3.098
Token and
Token ID290
Feature activation-2.51e+00

Pos logit contributions
 popul+4.663
ixel+4.583
jri+4.508
 diss+4.318
+4.193
Neg logit contributions
agement-5.898
gil-5.539
arge-5.494
TPS-5.484
ODY-5.425
Token his
Token ID465
Feature activation-2.44e+00

Pos logit contributions
ixel+1.764
 popul+1.734
jri+1.703
 forfe+1.611
 diss+1.607
Neg logit contributions
agement-1.666
gil-1.562
arge-1.541
 Email-1.535
ODY-1.532
Token descend
Token ID15350
Feature activation-1.77e+00

Pos logit contributions
ixel+1.482
 popul+1.451
jri+1.415
 diss+1.333
 forfe+1.329
Neg logit contributions
agement-1.852
gil-1.751
ODY-1.722
arge-1.721
TPS-1.714
Tokenents
Token ID658
Feature activation-1.14e+00

Pos logit contributions
ixel+1.117
 popul+1.085
jri+1.054
 )))+0.997
 forfe+0.992
Neg logit contributions
agement-1.295
gil-1.233
ODY-1.214
 Email-1.211
TPS-1.207
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
ixel+0.829
 popul+0.807
jri+0.799
 )))+0.751
 forfe+0.751
Neg logit contributions
agement-0.732
gil-0.685
arge-0.671
ODY-0.670
 Email-0.669
Token wheelchair
Token ID27181
Feature activation-2.75e+00

Pos logit contributions
agement+0.184
gil+0.173
 Email+0.170
arge+0.170
ODY+0.169
Neg logit contributions
ixel-0.194
jri-0.187
 popul-0.187
 )))-0.175
usra-0.174
Token (
Token ID357
Feature activation-2.97e-02

Pos logit contributions
ixel+1.926
 popul+1.858
jri+1.837
 forfe+1.733
 )))+1.730
Neg logit contributions
agement-1.853
gil-1.739
ODY-1.703
 Email-1.698
arge-1.696
TokenFigure
Token ID11337
Feature activation-5.18e+00

Pos logit contributions
ixel+0.021
 popul+0.021
jri+0.021
 forfe+0.019
 )))+0.019
Neg logit contributions
agement-0.019
gil-0.018
ODY-0.018
arge-0.018
 Email-0.018
Token 2
Token ID362
Feature activation-2.39e+00

Pos logit contributions
ixel+3.546
 popul+3.430
jri+3.322
 )))+3.244
 forfe+3.230
Neg logit contributions
agement-3.522
gil-3.289
arge-3.223
ODY-3.206
TPS-3.187
TokenA
Token ID32
Feature activation-4.48e+00

Pos logit contributions
ixel+1.585
 popul+1.525
jri+1.512
 )))+1.420
 forfe+1.418
Neg logit contributions
agement-1.699
gil-1.598
arge-1.566
ODY-1.561
 Email-1.560
Token,
Token ID11
Feature activation-7.76e+00

Pos logit contributions
ixel+2.804
 popul+2.666
jri+2.648
 forfe+2.476
 )))+2.474
Neg logit contributions
agement-3.344
gil-3.150
arge-3.092
TPS-3.078
ODY-3.070
Token [
Token ID685
Feature activation-2.37e+00

Pos logit contributions
ixel+5.717
 popul+5.450
jri+5.371
 )))+5.324
 diss+5.291
Neg logit contributions
agement-4.639
gil-4.364
arge-4.225
TPS-4.142
ODY-4.095
TokenAction
Token ID12502
Feature activation-6.79e-01

Pos logit contributions
ixel+1.781
 popul+1.725
jri+1.716
 )))+1.635
usra+1.614
Neg logit contributions
agement-1.463
gil-1.366
arge-1.341
ODY-1.333
 Email-1.330
Token Junior
Token ID20000
Feature activation+1.13e+00

Pos logit contributions
ixel+0.487
 popul+0.471
jri+0.468
 )))+0.440
 forfe+0.439
Neg logit contributions
agement-0.446
gil-0.419
arge-0.410
ODY-0.409
 Email-0.409
Token 3
Token ID513
Feature activation+2.21e+00

Pos logit contributions
agement+0.783
gil+0.732
 Email+0.732
arge+0.724
ODY+0.723
Neg logit contributions
ixel-0.758
 popul-0.734
jri-0.730
usra-0.682
 forfe-0.677
Token,
Token ID11
Feature activation-8.88e-01

Pos logit contributions
agement+1.548
gil+1.456
ODY+1.449
 Email+1.449
arge+1.441
Neg logit contributions
ixel-1.463
 popul-1.425
jri-1.406
usra-1.320
 forfe-1.318
Token submarines
Token ID34031
Feature activation+8.09e-01

Pos logit contributions
agement+0.216
gil+0.204
arge+0.202
 Email+0.201
ODY+0.200
Neg logit contributions
ixel-0.185
jri-0.177
 popul-0.177
 )))-0.164
usra-0.163
Token (
Token ID357
Feature activation+2.85e-01

Pos logit contributions
agement+0.533
gil+0.500
arge+0.494
 Email+0.493
ODY+0.489
Neg logit contributions
ixel-0.576
jri-0.558
 popul-0.555
 )))-0.520
usra-0.516
TokenSS
Token ID5432
Feature activation-4.21e+00

Pos logit contributions
agement+0.177
gil+0.165
arge+0.161
ODY+0.161
 Email+0.161
Neg logit contributions
ixel-0.215
 popul-0.209
jri-0.207
 )))-0.195
 forfe-0.195
TokenBN
Token ID15766
Feature activation-1.59e+00

Pos logit contributions
ixel+2.473
 popul+2.381
jri+2.355
 forfe+2.194
 )))+2.191
Neg logit contributions
agement-3.275
gil-3.106
ODY-3.034
 Email-3.024
arge-3.020
Token/
Token ID14
Feature activation-3.11e+00

Pos logit contributions
ixel+1.129
 popul+1.092
jri+1.082
 )))+1.019
 forfe+1.018
Neg logit contributions
agement-1.050
gil-0.986
ODY-0.964
arge-0.963
 Email-0.962
TokenSS
Token ID5432
Feature activation-7.65e+00

Pos logit contributions
ixel+1.668
 popul+1.591
jri+1.583
 )))+1.463
usra+1.456
Neg logit contributions
agement-2.572
gil-2.449
 Email-2.403
arge-2.394
ODY-2.394
TokenGN
Token ID16630
Feature activation-1.78e+00

Pos logit contributions
ixel+4.381
 popul+4.123
jri+4.094
 )))+3.904
OHN+3.872
Neg logit contributions
agement-5.858
gil-5.490
 Email-5.405
arge-5.328
ODY-5.307
Token)
Token ID8
Feature activation+8.15e-01

Pos logit contributions
ixel+1.294
 popul+1.255
jri+1.243
 )))+1.172
 forfe+1.171
Neg logit contributions
agement-1.149
gil-1.078
ODY-1.053
arge-1.052
 Email-1.051
Token in
Token ID287
Feature activation+1.22e-01

Pos logit contributions
agement+0.569
gil+0.533
 Email+0.528
arge+0.528
ODY+0.521
Neg logit contributions
ixel-0.551
jri-0.533
 popul-0.528
 )))-0.491
usra-0.489
Token operation
Token ID4905
Feature activation+3.31e-02

Pos logit contributions
agement+0.075
gil+0.070
arge+0.068
 Email+0.068
ODY+0.068
Neg logit contributions
ixel-0.093
 popul-0.090
jri-0.090
 )))-0.085
 forfe-0.085
Token and
Token ID290
Feature activation+1.47e+00

Pos logit contributions
agement+0.021
gil+0.020
arge+0.019
 Email+0.019
ODY+0.019
Neg logit contributions
ixel-0.024
jri-0.024
 popul-0.024
 )))-0.022
usra-0.022
Token with
Token ID351
Feature activation+1.21e+00

Pos logit contributions
ixel+0.039
 popul+0.037
jri+0.037
 )))+0.035
 forfe+0.035
Neg logit contributions
agement-0.040
gil-0.037
 Email-0.037
arge-0.037
ODY-0.037
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
agement+0.804
gil+0.754
 Email+0.744
arge+0.741
ODY+0.739
Neg logit contributions
ixel-0.856
 popul-0.826
jri-0.822
 forfe-0.769
usra-0.769
Token command
Token ID3141
Feature activation-2.90e+00

Pos logit contributions
ixel+0.729
 popul+0.710
jri+0.702
 )))+0.660
 forfe+0.657
Neg logit contributions
agement-0.729
gil-0.687
ODY-0.672
arge-0.670
 Email-0.667
Token,
Token ID11
Feature activation-1.45e+00

Pos logit contributions
ixel+1.964
 popul+1.920
jri+1.895
 )))+1.778
 diss+1.777
Neg logit contributions
agement-1.990
gil-1.889
ODY-1.837
arge-1.832
TPS-1.824
Token sl
Token ID1017
Feature activation-3.73e+00

Pos logit contributions
ixel+0.981
 popul+0.954
jri+0.944
 )))+0.887
 forfe+0.884
Neg logit contributions
agement-1.006
gil-0.949
ODY-0.928
arge-0.926
 Email-0.921
Tokens
Token ID82
Feature activation-7.58e+00

Pos logit contributions
ixel+2.856
 popul+2.733
jri+2.722
 )))+2.602
usra+2.588
Neg logit contributions
agement-2.243
gil-2.080
ODY-2.070
 Email-2.029
TPS-2.027
Token logs
Token ID17259
Feature activation-2.67e+00

Pos logit contributions
ixel+4.822
 popul+4.747
jri+4.625
 )))+4.485
 diss+4.429
Neg logit contributions
agement-5.225
gil-4.956
ODY-4.798
TPS-4.755
 Email-4.700
Token.
Token ID764
Feature activation-8.62e-01

Pos logit contributions
ixel+1.734
 popul+1.688
jri+1.656
 )))+1.578
 forfe+1.547
Neg logit contributions
agement-1.919
gil-1.811
ODY-1.768
arge-1.764
 Email-1.763
Token To
Token ID1675
Feature activation+1.82e+00

Pos logit contributions
ixel+0.610
 popul+0.593
jri+0.586
 )))+0.556
 forfe+0.552
Neg logit contributions
agement-0.571
gil-0.536
ODY-0.525
arge-0.523
 Email-0.518
Token see
Token ID766
Feature activation+6.39e-01

Pos logit contributions
agement+1.151
gil+1.072
 Email+1.069
arge+1.062
ODY+1.039
Neg logit contributions
ixel-1.348
 popul-1.302
jri-1.301
usra-1.217
 forfe-1.212
Token the
Token ID262
Feature activation-2.05e-01

Pos logit contributions
agement+0.429
gil+0.402
 Email+0.396
arge+0.395
ODY+0.394
Neg logit contributions
ixel-0.450
 popul-0.434
jri-0.432
 )))-0.404
 forfe-0.404
Token said
Token ID531
Feature activation-1.01e+00

Pos logit contributions
ixel+0.980
 popul+0.956
jri+0.947
 forfe+0.893
 )))+0.892
Neg logit contributions
agement-0.912
gil-0.858
ODY-0.842
arge-0.841
 Email-0.838
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
ixel+0.691
 popul+0.673
jri+0.666
 forfe+0.625
 )))+0.624
Neg logit contributions
agement-0.686
gil-0.646
arge-0.635
 Email-0.631
ODY-0.631
Token
Token ID564
Feature activation+2.09e+00

Pos logit contributions
ixel+0.170
 popul+0.165
jri+0.163
 )))+0.153
 forfe+0.153
Neg logit contributions
agement-0.175
gil-0.164
arge-0.161
ODY-0.161
 Email-0.161
Token
Token ID250
Feature activation-4.46e-01

Pos logit contributions
agement+1.236
gil+1.157
 Email+1.128
arge+1.125
ODY+1.124
Neg logit contributions
ixel-1.630
 popul-1.587
jri-1.575
 )))-1.483
 forfe-1.480
TokenEvery
Token ID6109
Feature activation+7.80e-01

Pos logit contributions
ixel+0.331
 popul+0.321
jri+0.319
 )))+0.302
 forfe+0.300
Neg logit contributions
agement-0.280
gil-0.261
arge-0.257
ODY-0.255
 Email-0.255
Token two
Token ID734
Feature activation-7.16e+00

Pos logit contributions
agement+0.499
gil+0.468
ODY+0.458
 Email+0.458
arge+0.457
Neg logit contributions
ixel-0.572
 popul-0.555
jri-0.550
 )))-0.518
 forfe-0.516
Token years
Token ID812
Feature activation-3.59e+00

Pos logit contributions
ixel+3.445
 popul+3.389
jri+3.251
 diss+3.147
 forfe+3.136
Neg logit contributions
agement-6.142
gil-5.850
ODY-5.793
TPS-5.732
arge-5.675
Token or
Token ID393
Feature activation-2.79e+00

Pos logit contributions
ixel+2.407
 popul+2.349
jri+2.321
 forfe+2.176
 )))+2.168
Neg logit contributions
agement-2.485
gil-2.331
arge-2.281
ODY-2.277
 Email-2.267
Token so
Token ID523
Feature activation+4.57e-01

Pos logit contributions
ixel+2.052
 popul+2.011
jri+1.984
 forfe+1.883
 )))+1.882
Neg logit contributions
agement-1.745
gil-1.639
ODY-1.607
 Email-1.596
arge-1.596
Token it
Token ID340
Feature activation-1.24e+00

Pos logit contributions
agement+0.301
gil+0.283
 Email+0.277
arge+0.277
ODY+0.277
Neg logit contributions
ixel-0.326
 popul-0.316
jri-0.313
 )))-0.295
 forfe-0.294
Token comes
Token ID2058
Feature activation-1.09e+00

Pos logit contributions
ixel+0.859
 popul+0.833
jri+0.825
 forfe+0.775
 )))+0.775
Neg logit contributions
agement-0.835
gil-0.786
ODY-0.772
arge-0.770
 Email-0.769
Token examining
Token ID17247
Feature activation-6.30e-01

Pos logit contributions
ixel+1.316
 popul+1.280
jri+1.263
 )))+1.198
 forfe+1.195
Neg logit contributions
agement-1.175
gil-1.103
arge-1.078
 Email-1.077
ODY-1.073
Token the
Token ID262
Feature activation+7.56e-01

Pos logit contributions
ixel+0.456
 popul+0.441
jri+0.438
 )))+0.412
 forfe+0.411
Neg logit contributions
agement-0.410
gil-0.385
arge-0.377
 Email-0.376
ODY-0.376
Token structure
Token ID4645
Feature activation-1.34e+00

Pos logit contributions
agement+0.524
gil+0.495
ODY+0.487
 Email+0.486
arge+0.483
Neg logit contributions
ixel-0.513
 popul-0.493
jri-0.492
 forfe-0.459
 )))-0.459
Token,
Token ID11
Feature activation-1.46e+00

Pos logit contributions
ixel+0.922
 popul+0.894
jri+0.885
 )))+0.831
 forfe+0.827
Neg logit contributions
agement-0.916
gil-0.863
arge-0.845
 Email-0.843
ODY-0.841
Token how
Token ID703
Feature activation-2.28e-01

Pos logit contributions
ixel+1.010
 popul+0.979
jri+0.968
 )))+0.910
 forfe+0.908
Neg logit contributions
agement-0.996
gil-0.938
 Email-0.918
arge-0.917
ODY-0.916
Token and
Token ID290
Feature activation-6.85e+00

Pos logit contributions
ixel+0.169
 popul+0.165
jri+0.163
 )))+0.154
 forfe+0.153
Neg logit contributions
agement-0.144
gil-0.134
arge-0.132
 Email-0.131
ODY-0.131
Token what
Token ID644
Feature activation-1.76e+00

Pos logit contributions
ixel+4.712
 popul+4.696
jri+4.495
 diss+4.445
 fasc+4.380
Neg logit contributions
agement-4.336
gil-4.113
TPS-4.041
 Email-4.037
ODY-4.007
Token makes
Token ID1838
Feature activation+1.11e+00

Pos logit contributions
ixel+1.230
 popul+1.199
jri+1.182
 )))+1.115
 diss+1.110
Neg logit contributions
agement-1.184
gil-1.112
arge-1.091
 Email-1.087
ODY-1.082
Token the
Token ID262
Feature activation-5.39e-02

Pos logit contributions
agement+0.691
gil+0.647
ODY+0.635
 Email+0.633
arge+0.630
Neg logit contributions
ixel-0.834
 popul-0.805
jri-0.804
 forfe-0.755
 )))-0.754
Token drawing
Token ID8263
Feature activation-2.33e+00

Pos logit contributions
ixel+0.037
 popul+0.036
jri+0.036
 )))+0.034
 forfe+0.034
Neg logit contributions
agement-0.037
gil-0.035
ODY-0.034
 Email-0.034
arge-0.034
Token work
Token ID670
Feature activation-1.20e+00

Pos logit contributions
ixel+1.598
 popul+1.550
jri+1.516
 )))+1.445
 diss+1.442
Neg logit contributions
agement-1.601
gil-1.505
arge-1.473
ODY-1.470
TPS-1.468
Token.
Token ID13
Feature activation-2.45e-01

Pos logit contributions
agement+0.045
gil+0.042
arge+0.041
 Email+0.041
ODY+0.041
Neg logit contributions
ixel-0.047
 popul-0.045
jri-0.045
 )))-0.042
 forfe-0.042
Token
Token ID198
Feature activation-6.59e-01

Pos logit contributions
ixel+0.174
 popul+0.168
jri+0.168
 )))+0.157
 forfe+0.157
Neg logit contributions
agement-0.163
gil-0.153
 Email-0.151
ODY-0.150
arge-0.150
Token
Token ID198
Feature activation-3.79e-01

Pos logit contributions
ixel+0.460
 popul+0.445
jri+0.439
 )))+0.422
 forfe+0.418
Neg logit contributions
agement-0.440
gil-0.412
ODY-0.405
arge-0.405
 Email-0.399
TokenSee
Token ID6214
Feature activation-9.34e-01

Pos logit contributions
ixel+0.280
 popul+0.271
jri+0.269
 )))+0.254
 forfe+0.253
Neg logit contributions
agement-0.241
gil-0.225
arge-0.220
 Email-0.220
ODY-0.220
Token also
Token ID635
Feature activation-1.45e+00

Pos logit contributions
ixel+0.684
 popul+0.661
jri+0.659
 )))+0.618
 forfe+0.617
Neg logit contributions
agement-0.601
gil-0.565
 Email-0.553
ODY-0.551
arge-0.551
Token [
Token ID685
Feature activation-6.84e+00

Pos logit contributions
ixel+0.754
 popul+0.728
jri+0.706
 forfe+0.654
 )))+0.652
Neg logit contributions
agement-1.237
gil-1.176
arge-1.157
ODY-1.153
TPS-1.153
Token edit
Token ID4370
Feature activation-2.45e+00

Pos logit contributions
ixel+3.728
 popul+3.466
 diss+3.402
 )))+3.372
+3.349
Neg logit contributions
agement-5.363
ODY-5.092
TPS-5.004
gil-4.935
arge-4.910
Token ]
Token ID2361
Feature activation-2.14e+00

Pos logit contributions
ixel+0.938
 popul+0.909
jri+0.853
 )))+0.840
 diss+0.803
Neg logit contributions
agement-2.387
gil-2.285
ODY-2.262
arge-2.252
TPS-2.231
Token<|endoftext|>
Token ID50256
Feature activation+2.62e+00

Pos logit contributions
ixel+1.458
 popul+1.427
jri+1.395
 forfe+1.321
 )))+1.321
Neg logit contributions
agement-1.455
gil-1.367
arge-1.338
ODY-1.328
 Email-1.324
TokenI
Token ID40
Feature activation+4.42e-01

Pos logit contributions
agement+1.690
gil+1.595
ODY+1.563
TPS+1.558
arge+1.554
Neg logit contributions
ixel-1.903
 popul-1.860
jri-1.831
 forfe-1.730
 condem-1.714
Token checked
Token ID10667
Feature activation-8.75e-01

Pos logit contributions
agement+0.286
gil+0.269
 Email+0.263
arge+0.263
ODY+0.262
Neg logit contributions
ixel-0.320
 popul-0.312
jri-0.310
 )))-0.290
 forfe-0.289
Token help
Token ID1037
Feature activation+1.37e+00

Pos logit contributions
agement+0.598
gil+0.560
 Email+0.556
arge+0.554
ODY+0.548
Neg logit contributions
ixel-0.619
jri-0.601
 popul-0.599
usra-0.560
 )))-0.558
Token teach
Token ID4545
Feature activation+1.41e-01

Pos logit contributions
agement+0.854
gil+0.797
 Email+0.788
arge+0.787
ODY+0.782
Neg logit contributions
ixel-1.023
jri-0.994
 popul-0.993
usra-0.934
 )))-0.931
Token others
Token ID1854
Feature activation+1.15e+00

Pos logit contributions
agement+0.085
gil+0.080
 Email+0.078
arge+0.078
ODY+0.078
Neg logit contributions
ixel-0.109
 popul-0.105
jri-0.105
 )))-0.099
 forfe-0.099
Token to
Token ID284
Feature activation+3.61e-01

Pos logit contributions
agement+0.738
gil+0.693
 Email+0.680
arge+0.679
ODY+0.678
Neg logit contributions
ixel-0.831
 popul-0.805
jri-0.804
 )))-0.753
usra-0.752
Token follow
Token ID1061
Feature activation-9.69e-01

Pos logit contributions
agement+0.245
gil+0.232
 Email+0.228
arge+0.227
ODY+0.226
Neg logit contributions
ixel-0.247
 popul-0.241
jri-0.241
usra-0.224
 forfe-0.223
Token in
Token ID287
Feature activation-6.83e+00

Pos logit contributions
ixel+0.703
 popul+0.682
jri+0.674
 )))+0.635
 forfe+0.635
Neg logit contributions
agement-0.629
gil-0.588
arge-0.577
ODY-0.575
 Email-0.571
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
ixel+4.361
 popul+4.240
jri+4.015
 forfe+3.947
 )))+3.946
Neg logit contributions
agement-4.794
gil-4.517
arge-4.477
TPS-4.416
ODY-4.390
Token same
Token ID976
Feature activation-7.34e-01

Pos logit contributions
ixel+1.110
 popul+1.070
jri+1.059
 )))+0.986
 forfe+0.983
Neg logit contributions
agement-1.387
gil-1.314
arge-1.291
ODY-1.290
 Email-1.285
Token footsteps
Token ID27146
Feature activation-3.98e+00

Pos logit contributions
ixel+0.457
 popul+0.440
jri+0.437
 )))+0.406
 forfe+0.405
Neg logit contributions
agement-0.551
gil-0.522
arge-0.512
 Email-0.512
ODY-0.511
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
ixel+2.696
 popul+2.616
jri+2.543
 diss+2.400
 )))+2.399
Neg logit contributions
agement-2.768
gil-2.592
arge-2.544
TPS-2.531
ODY-2.529
Token He
Token ID679
Feature activation-3.73e-01

Pos logit contributions
ixel+0.234
 popul+0.227
jri+0.225
 )))+0.211
 forfe+0.211
Neg logit contributions
agement-0.223
gil-0.210
arge-0.206
ODY-0.205
 Email-0.205
Token that
Token ID326
Feature activation+5.36e-01

Pos logit contributions
ixel+0.733
 popul+0.713
jri+0.705
 )))+0.666
 forfe+0.663
Neg logit contributions
agement-0.657
gil-0.614
arge-0.601
ODY-0.600
 Email-0.599
Token and
Token ID290
Feature activation-1.99e+00

Pos logit contributions
agement+0.351
gil+0.330
 Email+0.324
ODY+0.324
arge+0.323
Neg logit contributions
ixel-0.384
 popul-0.371
jri-0.369
 forfe-0.347
 )))-0.347
Token really
Token ID1107
Feature activation+6.36e-01

Pos logit contributions
ixel+1.449
 popul+1.407
jri+1.388
 )))+1.316
 forfe+1.310
Neg logit contributions
agement-1.285
gil-1.204
ODY-1.179
 Email-1.175
arge-1.174
Token analyze
Token ID16602
Feature activation-6.81e-01

Pos logit contributions
agement+0.399
gil+0.372
arge+0.365
 Email+0.364
ODY+0.362
Neg logit contributions
ixel-0.475
jri-0.460
 popul-0.459
 )))-0.429
 forfe-0.429
Token how
Token ID703
Feature activation+1.05e+00

Pos logit contributions
ixel+0.500
 popul+0.485
jri+0.480
 )))+0.453
 forfe+0.452
Neg logit contributions
agement-0.436
gil-0.408
arge-0.399
 Email-0.399
ODY-0.398
Token and
Token ID290
Feature activation-6.77e+00

Pos logit contributions
agement+0.681
gil+0.640
 Email+0.626
ODY+0.625
arge+0.624
Neg logit contributions
ixel-0.761
jri-0.734
 popul-0.734
 )))-0.686
 forfe-0.686
Token what
Token ID644
Feature activation-9.77e-01

Pos logit contributions
ixel+4.815
 popul+4.755
jri+4.561
 diss+4.509
 )))+4.448
Neg logit contributions
agement-4.168
gil-3.940
TPS-3.910
 Email-3.890
ODY-3.860
Token makes
Token ID1838
Feature activation+1.30e+00

Pos logit contributions
ixel+0.686
 popul+0.667
jri+0.658
 )))+0.620
 forfe+0.617
Neg logit contributions
agement-0.654
gil-0.614
arge-0.602
 Email-0.600
ODY-0.600
Token that
Token ID326
Feature activation+7.43e-01

Pos logit contributions
agement+0.801
gil+0.753
ODY+0.737
 Email+0.736
arge+0.732
Neg logit contributions
ixel-0.973
jri-0.940
 popul-0.938
 forfe-0.882
 )))-0.881
Token work
Token ID670
Feature activation-5.33e-01

Pos logit contributions
agement+0.509
gil+0.479
 Email+0.469
ODY+0.469
arge+0.469
Neg logit contributions
ixel-0.512
 popul-0.495
jri-0.492
 )))-0.460
 forfe-0.460
Token,
Token ID11
Feature activation-1.78e+00

Pos logit contributions
ixel+0.377
 popul+0.364
jri+0.362
 )))+0.340
 forfe+0.339
Neg logit contributions
agement-0.356
gil-0.335
arge-0.327
 Email-0.327
ODY-0.327
Token-
Token ID12
Feature activation-2.09e+00

Pos logit contributions
ixel+0.136
 popul+0.130
jri+0.129
 )))+0.120
 forfe+0.119
Neg logit contributions
agement-0.171
gil-0.161
arge-0.159
 Email-0.158
ODY-0.158
Tokenpowered
Token ID12293
Feature activation+1.42e+00

Pos logit contributions
ixel+1.340
 popul+1.294
jri+1.283
 )))+1.199
 forfe+1.198
Neg logit contributions
agement-1.510
gil-1.428
ODY-1.409
 Email-1.407
arge-1.395
Token attack
Token ID1368
Feature activation-1.41e+00

Pos logit contributions
agement+0.939
gil+0.878
arge+0.875
 Email+0.870
ODY+0.859
Neg logit contributions
ixel-1.020
jri-0.982
 popul-0.973
 )))-0.913
usra-0.909
Token submarines
Token ID34031
Feature activation+3.98e-01

Pos logit contributions
ixel+0.775
 popul+0.744
jri+0.735
 forfe+0.678
 )))+0.677
Neg logit contributions
agement-1.162
gil-1.106
ODY-1.087
 Email-1.086
arge-1.084
Token (
Token ID357
Feature activation-9.16e-01

Pos logit contributions
agement+0.257
gil+0.240
arge+0.237
 Email+0.236
ODY+0.235
Neg logit contributions
ixel-0.290
jri-0.281
 popul-0.280
 )))-0.262
usra-0.261
TokenSS
Token ID5432
Feature activation-6.63e+00

Pos logit contributions
ixel+0.582
 popul+0.562
jri+0.557
 )))+0.519
 forfe+0.518
Neg logit contributions
agement-0.676
gil-0.639
arge-0.627
ODY-0.627
 Email-0.627
TokenN
Token ID45
Feature activation-3.97e+00

Pos logit contributions
ixel+3.436
 popul+3.249
jri+3.220
 forfe+3.018
 )))+3.011
Neg logit contributions
agement-5.507
gil-5.231
 Email-5.102
ODY-5.063
arge-5.047
Token),
Token ID828
Feature activation+2.59e-01

Pos logit contributions
ixel+2.855
 popul+2.754
jri+2.726
 )))+2.593
 forfe+2.588
Neg logit contributions
agement-2.560
gil-2.403
arge-2.345
ODY-2.344
 Email-2.335
Token plus
Token ID5556
Feature activation+3.55e-01

Pos logit contributions
agement+0.161
gil+0.150
 Email+0.148
arge+0.148
ODY+0.147
Neg logit contributions
ixel-0.194
jri-0.188
 popul-0.188
 )))-0.176
usra-0.175
Token some
Token ID617
Feature activation-5.40e-01

Pos logit contributions
agement+0.207
gil+0.192
 Email+0.188
arge+0.188
ODY+0.186
Neg logit contributions
ixel-0.282
jri-0.272
 popul-0.272
 )))-0.257
usra-0.255
Token diesel
Token ID21866
Feature activation-8.12e-02

Pos logit contributions
ixel+0.418
 popul+0.406
jri+0.403
 )))+0.381
 forfe+0.380
Neg logit contributions
agement-0.324
gil-0.302
arge-0.295
 Email-0.295
ODY-0.295
Token Korea
Token ID4969
Feature activation-1.91e+00

Pos logit contributions
ixel+1.236
 popul+1.202
jri+1.183
 forfe+1.106
 )))+1.101
Neg logit contributions
agement-1.466
gil-1.383
arge-1.360
ODY-1.360
 Email-1.357
Token's
Token ID338
Feature activation+5.53e-01

Pos logit contributions
ixel+1.376
 popul+1.343
jri+1.328
 forfe+1.248
 diss+1.245
Neg logit contributions
agement-1.244
gil-1.162
arge-1.142
ODY-1.141
 Email-1.136
Token founder
Token ID9119
Feature activation-2.58e-01

Pos logit contributions
agement+0.408
gil+0.388
 Email+0.382
ODY+0.378
arge+0.377
Neg logit contributions
ixel-0.352
jri-0.336
 popul-0.334
 )))-0.311
usra-0.310
Token Kim
Token ID6502
Feature activation-3.09e+00

Pos logit contributions
ixel+0.181
 popul+0.174
jri+0.174
 )))+0.163
usra+0.162
Neg logit contributions
agement-0.174
gil-0.164
 Email-0.160
ODY-0.160
arge-0.159
Token Il
Token ID13778
Feature activation-1.97e+00

Pos logit contributions
ixel+2.617
 popul+2.588
jri+2.561
 diss+2.446
 forfe+2.442
Neg logit contributions
agement-1.573
arge-1.435
gil-1.427
ODY-1.421
 Email-1.404
Token Sung
Token ID29802
Feature activation-6.58e+00

Pos logit contributions
ixel+1.186
 popul+1.141
jri+1.132
 )))+1.049
 forfe+1.048
Neg logit contributions
agement-1.518
gil-1.438
arge-1.413
ODY-1.412
 Email-1.412
Token and
Token ID290
Feature activation-5.75e-02

Pos logit contributions
ixel+4.179
 popul+4.156
jri+4.129
 diss+3.921
 forfe+3.829
Neg logit contributions
agement-4.636
gil-4.293
TPS-4.277
arge-4.257
ODY-4.248
Token his
Token ID465
Feature activation+3.69e-01

Pos logit contributions
ixel+0.039
 popul+0.038
jri+0.038
 )))+0.035
usra+0.035
Neg logit contributions
agement-0.040
gil-0.037
 Email-0.037
arge-0.036
ODY-0.036
Token descend
Token ID15350
Feature activation-6.69e-01

Pos logit contributions
agement+0.272
gil+0.257
 Email+0.255
ODY+0.251
arge+0.251
Neg logit contributions
ixel-0.235
jri-0.223
 popul-0.222
 )))-0.208
 forfe-0.206
Tokenents
Token ID658
Feature activation-1.80e-01

Pos logit contributions
ixel+0.441
 popul+0.426
jri+0.422
 )))+0.394
 forfe+0.394
Neg logit contributions
agement-0.477
gil-0.450
ODY-0.442
 Email-0.441
arge-0.441
Token.
Token ID13
Feature activation+3.44e-01

Pos logit contributions
ixel+0.130
 popul+0.125
jri+0.125
 )))+0.117
 forfe+0.117
Neg logit contributions
agement-0.117
gil-0.110
 Email-0.107
arge-0.107
ODY-0.107
Token invoke
Token ID26342
Feature activation-1.70e+00

Pos logit contributions
ixel+0.135
 popul+0.131
jri+0.130
 )))+0.123
 forfe+0.122
Neg logit contributions
agement-0.119
gil-0.111
ODY-0.109
arge-0.109
 Email-0.109
Token it
Token ID340
Feature activation+5.07e-01

Pos logit contributions
ixel+1.193
 popul+1.165
jri+1.143
 )))+1.090
 forfe+1.081
Neg logit contributions
agement-1.138
gil-1.071
arge-1.045
ODY-1.044
 Email-1.034
Token locally
Token ID15726
Feature activation+1.37e+00

Pos logit contributions
agement+0.331
gil+0.311
ODY+0.304
arge+0.304
 Email+0.303
Neg logit contributions
ixel-0.365
 popul-0.354
jri-0.350
 )))-0.330
 forfe-0.329
Token with
Token ID351
Feature activation+4.57e-01

Pos logit contributions
agement+0.902
gil+0.845
 Email+0.830
arge+0.829
ODY+0.828
Neg logit contributions
ixel-0.985
 popul-0.952
jri-0.949
usra-0.888
 )))-0.886
Token sl
Token ID1017
Feature activation-3.14e+00

Pos logit contributions
agement+0.309
gil+0.291
 Email+0.285
arge+0.285
ODY+0.285
Neg logit contributions
ixel-0.318
 popul-0.308
jri-0.305
 )))-0.286
 forfe-0.286
Tokens
Token ID82
Feature activation-6.58e+00

Pos logit contributions
ixel+2.415
 popul+2.345
jri+2.320
 )))+2.209
 forfe+2.191
Neg logit contributions
agement-1.893
gil-1.760
ODY-1.729
 Email-1.711
TPS-1.706
Token invoke
Token ID26342
Feature activation-1.44e+00

Pos logit contributions
ixel+4.248
 popul+4.159
jri+4.032
 )))+3.915
 diss+3.891
Neg logit contributions
agement-4.599
gil-4.291
ODY-4.210
TPS-4.156
 Email-4.126
Token local
Token ID1957
Feature activation-1.87e+00

Pos logit contributions
ixel+0.923
 popul+0.898
jri+0.882
 )))+0.834
 forfe+0.823
Neg logit contributions
agement-1.054
gil-0.996
ODY-0.974
arge-0.972
 Email-0.968
Token -
Token ID532
Feature activation-2.88e+00

Pos logit contributions
ixel+1.120
 popul+1.081
jri+1.065
 )))+0.999
 forfe+0.987
Neg logit contributions
agement-1.447
gil-1.371
ODY-1.344
 Email-1.343
arge-1.343
Tokenf
Token ID69
Feature activation-3.25e+00

Pos logit contributions
ixel+2.024
 popul+1.952
jri+1.937
 )))+1.829
usra+1.807
Neg logit contributions
agement-1.925
gil-1.799
ODY-1.763
arge-1.759
 Email-1.758
Token test
Token ID1332
Feature activation-3.62e+00

Pos logit contributions
ixel+1.847
 popul+1.783
jri+1.737
 )))+1.656
 diss+1.624
Neg logit contributions
agement-2.598
gil-2.454
ODY-2.408
arge-2.407
 Email-2.401
Tokenthread
Token ID16663
Feature activation-4.23e+00

Pos logit contributions
ixel+2.264
jri+2.155
 popul+2.150
 )))+2.016
 forfe+1.994
Neg logit contributions
agement-2.182
gil-2.036
ODY-2.018
arge-2.016
 Email-2.012
Token/
Token ID14
Feature activation-3.13e+00

Pos logit contributions
ixel+2.063
 popul+1.908
jri+1.889
 )))+1.837
+1.780
Neg logit contributions
agement-3.645
ODY-3.459
gil-3.443
arge-3.431
TPS-3.411
Token54
Token ID4051
Feature activation-8.27e-01

Pos logit contributions
ixel+2.632
jri+2.557
 popul+2.543
 )))+2.442
usra+2.423
Neg logit contributions
agement-1.610
gil-1.461
arge-1.457
ODY-1.456
 Email-1.447
Token33
Token ID2091
Feature activation-1.07e+00

Pos logit contributions
ixel+0.637
 popul+0.616
jri+0.613
 )))+0.583
 forfe+0.577
Neg logit contributions
agement-0.496
gil-0.463
arge-0.455
ODY-0.454
 Email-0.452
Token95
Token ID3865
Feature activation-3.87e+00

Pos logit contributions
ixel+0.674
 popul+0.649
jri+0.645
 )))+0.603
 forfe+0.599
Neg logit contributions
agement-0.792
gil-0.749
arge-0.736
ODY-0.736
 Email-0.733
Token/
Token ID14
Feature activation-6.58e+00

Pos logit contributions
ixel+1.939
 popul+1.817
jri+1.798
 )))+1.736
 forfe+1.674
Neg logit contributions
agement-3.303
gil-3.148
arge-3.113
ODY-3.101
TPS-3.094
Tokenlost
Token ID33224
Feature activation-1.55e-02

Pos logit contributions
ixel+4.831
 popul+4.580
jri+4.551
 diss+4.283
+4.278
Neg logit contributions
agement-4.047
ODY-3.711
 Email-3.697
TPS-3.674
arge-3.665
Token-
Token ID12
Feature activation+7.18e-01

Pos logit contributions
ixel+0.009
 popul+0.009
jri+0.009
 )))+0.008
 forfe+0.008
Neg logit contributions
agement-0.012
gil-0.011
arge-0.011
 Email-0.011
ODY-0.011
Tokenval
Token ID2100
Feature activation-5.22e-01

Pos logit contributions
agement+0.390
gil+0.363
arge+0.352
ODY+0.349
 Email+0.349
Neg logit contributions
ixel-0.597
 popul-0.582
jri-0.578
 )))-0.548
 forfe-0.547
Tokenley
Token ID1636
Feature activation-4.15e+00

Pos logit contributions
ixel+0.415
 popul+0.404
jri+0.401
 )))+0.379
 forfe+0.378
Neg logit contributions
agement-0.301
gil-0.280
arge-0.274
 Email-0.273
ODY-0.273
Token-
Token ID12
Feature activation-2.41e+00

Pos logit contributions
ixel+2.368
 popul+2.273
jri+2.241
 forfe+2.081
 )))+2.075
Neg logit contributions
agement-3.285
gil-3.125
ODY-3.087
TPS-3.067
arge-3.066
Token
Token ID250
Feature activation+2.51e-01

Pos logit contributions
agement+0.712
gil+0.660
 Email+0.645
arge+0.643
ODY+0.639
Neg logit contributions
ixel-1.060
 popul-1.034
jri-1.029
 forfe-0.972
 )))-0.972
Tokenshock
Token ID39563
Feature activation-2.53e+00

Pos logit contributions
agement+0.170
gil+0.159
arge+0.156
 Email+0.155
ODY+0.155
Neg logit contributions
ixel-0.177
 popul-0.171
jri-0.169
 )))-0.159
 forfe-0.159
Token doctrine
Token ID13115
Feature activation+8.78e-01

Pos logit contributions
ixel+1.706
 popul+1.663
jri+1.639
 )))+1.536
 forfe+1.526
Neg logit contributions
agement-1.754
gil-1.657
ODY-1.621
 Email-1.619
arge-1.615
Token
Token ID447
Feature activation-1.10e+00

Pos logit contributions
agement+0.569
gil+0.532
arge+0.525
 Email+0.523
ODY+0.520
Neg logit contributions
ixel-0.639
 popul-0.611
jri-0.611
 forfe-0.575
 )))-0.574
Token
Token ID251
Feature activation-7.50e-01

Pos logit contributions
ixel+0.924
 popul+0.897
jri+0.890
 )))+0.846
 forfe+0.843
Neg logit contributions
agement-0.592
gil-0.548
ODY-0.534
arge-0.532
 Email-0.531
Token experiment
Token ID6306
Feature activation-6.57e+00

Pos logit contributions
ixel+0.528
 popul+0.509
jri+0.506
 )))+0.475
 forfe+0.475
Neg logit contributions
agement-0.504
gil-0.473
arge-0.464
 Email-0.464
ODY-0.463
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
ixel+4.314
 popul+4.308
jri+4.203
 diss+3.971
+3.971
Neg logit contributions
agement-4.451
gil-4.225
ODY-4.134
TPS-4.118
 Email-4.095
Token
Token ID198
Feature activation-7.48e-01

Pos logit contributions
ixel+0.710
 popul+0.689
jri+0.682
 )))+0.642
 forfe+0.639
Neg logit contributions
agement-0.683
gil-0.642
arge-0.629
ODY-0.628
 Email-0.627
Token
Token ID198
Feature activation-5.74e-01

Pos logit contributions
ixel+0.517
 popul+0.500
jri+0.492
 )))+0.475
 forfe+0.469
Neg logit contributions
agement-0.504
gil-0.472
arge-0.464
ODY-0.464
 Email-0.458
TokenAn
Token ID2025
Feature activation-1.91e+00

Pos logit contributions
ixel+0.436
 popul+0.423
jri+0.420
 )))+0.397
 forfe+0.395
Neg logit contributions
agement-0.352
gil-0.328
 Email-0.322
arge-0.321
ODY-0.320
Tokenarch
Token ID998
Feature activation-2.45e+00

Pos logit contributions
ixel+0.735
 popul+0.693
jri+0.682
 )))+0.602
 forfe+0.601
Neg logit contributions
agement-1.883
gil-1.807
ODY-1.780
arge-1.779
 Email-1.777
Token type
Token ID2099
Feature activation-2.52e+00

Pos logit contributions
ixel+2.472
 popul+2.405
jri+2.356
 forfe+2.195
 )))+2.188
Neg logit contributions
agement-3.090
gil-2.929
ODY-2.885
 Email-2.868
arge-2.868
Token,
Token ID11
Feature activation-2.37e+00

Pos logit contributions
ixel+1.737
 popul+1.684
jri+1.667
 forfe+1.561
 )))+1.560
Neg logit contributions
agement-1.719
gil-1.621
ODY-1.591
arge-1.586
 Email-1.583
Token which
Token ID543
Feature activation-3.06e+00

Pos logit contributions
ixel+1.625
 popul+1.582
jri+1.551
 forfe+1.468
 )))+1.461
Neg logit contributions
agement-1.627
gil-1.533
ODY-1.502
arge-1.495
 Email-1.485
Token used
Token ID973
Feature activation-6.44e-01

Pos logit contributions
ixel+1.968
 popul+1.904
jri+1.864
 forfe+1.765
 diss+1.760
Neg logit contributions
agement-2.221
gil-2.096
ODY-2.073
arge-2.053
TPS-2.043
Token SR
Token ID16808
Feature activation-2.59e+00

Pos logit contributions
ixel+0.423
 popul+0.408
jri+0.405
 )))+0.378
 forfe+0.377
Neg logit contributions
agement-0.461
gil-0.435
arge-0.427
 Email-0.427
ODY-0.427
Token-
Token ID12
Feature activation-6.53e+00

Pos logit contributions
ixel+1.518
 popul+1.450
jri+1.441
 forfe+1.334
 )))+1.330
Neg logit contributions
agement-2.036
gil-1.940
ODY-1.898
arge-1.894
 Email-1.893
TokenI
Token ID40
Feature activation-3.17e+00

Pos logit contributions
ixel+4.089
jri+3.879
 popul+3.814
 )))+3.564
 diss+3.552
Neg logit contributions
agement-4.753
gil-4.507
 Email-4.430
arge-4.425
ODY-4.382
TokenOV
Token ID8874
Feature activation-2.35e+00

Pos logit contributions
ixel+1.419
jri+1.293
 popul+1.280
 )))+1.184
 forfe+1.173
Neg logit contributions
agement-2.946
gil-2.825
arge-2.773
 Email-2.766
TPS-2.737
Token and
Token ID290
Feature activation-2.95e-02

Pos logit contributions
ixel+1.592
 popul+1.540
jri+1.525
 forfe+1.429
 )))+1.426
Neg logit contributions
agement-1.629
gil-1.535
ODY-1.506
arge-1.500
 Email-1.499
Token the
Token ID262
Feature activation-8.25e-01

Pos logit contributions
ixel+0.020
 popul+0.019
jri+0.019
 )))+0.018
 forfe+0.018
Neg logit contributions
agement-0.021
gil-0.020
 Email-0.019
arge-0.019
ODY-0.019
Token n
Token ID299
Feature activation-2.83e+00

Pos logit contributions
ixel+0.557
 popul+0.540
jri+0.534
 )))+0.501
 forfe+0.500
Neg logit contributions
agement-0.576
gil-0.543
ODY-0.532
arge-0.531
 Email-0.529
Token
Token ID447
Feature activation+3.80e-01

Pos logit contributions
agement+0.281
gil+0.263
 Email+0.261
ODY+0.259
arge+0.257
Neg logit contributions
ixel-0.300
jri-0.289
 popul-0.288
 forfe-0.268
usra-0.268
Token
Token ID251
Feature activation+8.96e-01

Pos logit contributions
agement+0.214
gil+0.199
arge+0.194
ODY+0.194
 Email+0.194
Neg logit contributions
ixel-0.307
 popul-0.298
jri-0.297
 )))-0.281
 forfe-0.280
Token
Token ID198
Feature activation-1.54e-01

Pos logit contributions
agement+0.581
gil+0.545
 Email+0.539
ODY+0.535
arge+0.532
Neg logit contributions
ixel-0.655
 popul-0.626
jri-0.625
 forfe-0.586
usra-0.585
Token
Token ID198
Feature activation+5.52e-01

Pos logit contributions
ixel+0.109
 popul+0.106
jri+0.104
 )))+0.100
 forfe+0.099
Neg logit contributions
agement-0.102
gil-0.095
arge-0.093
ODY-0.093
 Email-0.093
TokenDespite
Token ID8332
Feature activation-2.34e+00

Pos logit contributions
agement+0.345
gil+0.324
ODY+0.317
arge+0.316
 Email+0.315
Neg logit contributions
ixel-0.414
 popul-0.401
jri-0.398
 forfe-0.375
 )))-0.374
Token assurances
Token ID35568
Feature activation-6.53e+00

Pos logit contributions
ixel+1.595
 popul+1.575
jri+1.541
 forfe+1.458
 diss+1.451
Neg logit contributions
agement-1.598
gil-1.508
ODY-1.483
arge-1.480
 Email-1.468
Token by
Token ID416
Feature activation-2.36e+00

Pos logit contributions
ixel+3.782
 popul+3.768
jri+3.656
 diss+3.427
 forfe+3.424
Neg logit contributions
agement-4.981
gil-4.766
ODY-4.666
 Email-4.630
arge-4.612
Token Japanese
Token ID4960
Feature activation-8.56e-01

Pos logit contributions
ixel+1.581
 popul+1.566
jri+1.531
 diss+1.449
 )))+1.445
Neg logit contributions
agement-1.622
gil-1.539
ODY-1.504
arge-1.497
 Email-1.485
Token authorities
Token ID4773
Feature activation-9.17e-02

Pos logit contributions
ixel+0.485
 popul+0.471
jri+0.462
 forfe+0.429
 )))+0.428
Neg logit contributions
agement-0.689
gil-0.655
ODY-0.643
arge-0.641
 Email-0.640
Token about
Token ID546
Feature activation+1.68e+00

Pos logit contributions
ixel+0.068
 popul+0.065
jri+0.065
 )))+0.061
usra+0.061
Neg logit contributions
agement-0.058
gil-0.054
 Email-0.053
ODY-0.053
arge-0.053
Token low
Token ID1877
Feature activation-5.06e-01

Pos logit contributions
agement+1.129
gil+1.059
 Email+1.043
arge+1.038
ODY+1.032
Neg logit contributions
ixel-1.188
jri-1.141
 popul-1.135
 )))-1.061
usra-1.061
Token
Token ID198
Feature activation+3.49e-01

Pos logit contributions
agement+0.858
gil+0.804
 Email+0.788
arge+0.787
ODY+0.786
Neg logit contributions
ixel-0.962
 popul-0.932
jri-0.925
 )))-0.870
 forfe-0.868
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
agement+0.227
gil+0.212
arge+0.208
ODY+0.208
 Email+0.207
Neg logit contributions
ixel-0.253
 popul-0.245
jri-0.243
 )))-0.230
 forfe-0.229
TokenR
Token ID49
Feature activation+2.76e+00

Pos logit contributions
agement+0.736
gil+0.689
ODY+0.675
 Email+0.674
arge+0.672
Neg logit contributions
ixel-0.914
 popul-0.888
jri-0.881
 forfe-0.829
 )))-0.827
Tokenobby
Token ID11369
Feature activation+1.66e+00

Pos logit contributions
agement+1.757
gil+1.642
ODY+1.611
 Email+1.608
arge+1.603
Neg logit contributions
ixel-2.039
 popul-1.992
jri-1.950
 forfe-1.852
 )))-1.848
Token M
Token ID337
Feature activation+6.22e-01

Pos logit contributions
agement+1.136
gil+1.072
 Email+1.066
arge+1.053
ODY+1.044
Neg logit contributions
ixel-1.138
jri-1.106
 popul-1.105
 forfe-1.019
 )))-1.018
Tokenook
Token ID566
Feature activation-6.52e+00

Pos logit contributions
agement+0.381
gil+0.356
arge+0.348
 Email+0.348
ODY+0.348
Neg logit contributions
ixel-0.473
 popul-0.459
jri-0.456
 )))-0.430
 forfe-0.429
Token (
Token ID357
Feature activation+8.25e-01

Pos logit contributions
ixel+4.093
 popul+4.065
jri+3.969
 diss+3.899
 )))+3.789
Neg logit contributions
agement-4.601
gil-4.378
ODY-4.305
TPS-4.264
arge-4.174
Tokenleft
Token ID9464
Feature activation+6.29e-01

Pos logit contributions
agement+0.424
gil+0.392
arge+0.381
ODY+0.380
 Email+0.379
Neg logit contributions
ixel-0.709
 popul-0.692
jri-0.684
 forfe-0.653
 )))-0.651
Token)
Token ID8
Feature activation+9.65e-02

Pos logit contributions
agement+0.370
gil+0.344
 Email+0.337
arge+0.336
ODY+0.336
Neg logit contributions
ixel-0.494
 popul-0.479
jri-0.476
 )))-0.450
 forfe-0.449
Token was
Token ID373
Feature activation-4.74e-01

Pos logit contributions
agement+0.066
gil+0.062
arge+0.061
 Email+0.061
ODY+0.061
Neg logit contributions
ixel-0.067
 popul-0.064
jri-0.064
 )))-0.060
 forfe-0.060
Token asked
Token ID1965
Feature activation-1.20e+00

Pos logit contributions
ixel+0.331
 popul+0.322
jri+0.318
 )))+0.300
 forfe+0.299
Neg logit contributions
agement-0.319
gil-0.300
ODY-0.293
arge-0.293
 Email-0.292
Token unaware
Token ID17261
Feature activation+7.35e-01

Pos logit contributions
agement+0.286
gil+0.269
 Email+0.264
arge+0.264
ODY+0.263
Neg logit contributions
ixel-0.294
jri-0.282
 popul-0.282
 )))-0.263
usra-0.262
Token of
Token ID286
Feature activation+1.43e+00

Pos logit contributions
agement+0.488
gil+0.459
ODY+0.450
 Email+0.450
arge+0.449
Neg logit contributions
ixel-0.521
 popul-0.503
jri-0.500
 )))-0.469
 forfe-0.469
Token how
Token ID703
Feature activation+7.87e-01

Pos logit contributions
agement+0.918
gil+0.862
 Email+0.847
ODY+0.843
arge+0.843
Neg logit contributions
ixel-1.041
 popul-1.004
jri-1.002
 )))-0.941
 forfe-0.938
Token we
Token ID356
Feature activation-2.55e-01

Pos logit contributions
agement+0.546
gil+0.515
 Email+0.507
arge+0.504
ODY+0.503
Neg logit contributions
ixel-0.535
 popul-0.513
jri-0.513
 )))-0.478
usra-0.477
Token got
Token ID1392
Feature activation-1.40e+00

Pos logit contributions
ixel+0.176
 popul+0.170
jri+0.169
 )))+0.158
 forfe+0.158
Neg logit contributions
agement-0.175
gil-0.164
ODY-0.161
arge-0.161
 Email-0.161
Token here
Token ID994
Feature activation-6.51e+00

Pos logit contributions
ixel+0.960
 popul+0.938
jri+0.921
 )))+0.869
 forfe+0.868
Neg logit contributions
agement-0.960
gil-0.904
arge-0.881
ODY-0.879
 Email-0.878
Token.
Token ID13
Feature activation+4.96e-01

Pos logit contributions
ixel+4.122
 popul+4.073
jri+4.037
+3.811
 diss+3.779
Neg logit contributions
agement-4.616
gil-4.308
arge-4.237
TPS-4.203
 Email-4.180
Token
Token ID198
Feature activation-1.54e+00

Pos logit contributions
agement+0.323
gil+0.303
 Email+0.299
ODY+0.297
arge+0.296
Neg logit contributions
ixel-0.360
 popul-0.347
jri-0.346
 forfe-0.324
usra-0.323
Token
Token ID198
Feature activation-1.08e+00

Pos logit contributions
ixel+1.036
 popul+1.002
jri+0.984
 )))+0.961
+0.943
Neg logit contributions
agement-1.061
gil-0.994
arge-0.977
ODY-0.975
 Email-0.961
TokenI
Token ID40
Feature activation+1.24e-01

Pos logit contributions
ixel+0.804
 popul+0.779
jri+0.776
 )))+0.733
 forfe+0.727
Neg logit contributions
agement-0.676
gil-0.629
 Email-0.617
arge-0.616
ODY-0.613
Token
Token ID447
Feature activation-6.41e-01

Pos logit contributions
agement+0.080
gil+0.075
arge+0.073
 Email+0.073
ODY+0.073
Neg logit contributions
ixel-0.090
 popul-0.087
jri-0.087
 )))-0.082
 forfe-0.081
Tokenically
Token ID1146
Feature activation-1.65e+00

Pos logit contributions
ixel+0.041
 popul+0.040
jri+0.039
 )))+0.037
usra+0.037
Neg logit contributions
agement-0.033
gil-0.031
arge-0.030
 Email-0.030
ODY-0.030
Token,
Token ID11
Feature activation-5.70e-01

Pos logit contributions
ixel+1.140
 popul+1.107
jri+1.094
 forfe+1.033
 )))+1.027
Neg logit contributions
agement-1.121
gil-1.055
 Email-1.033
arge-1.031
ODY-1.029
Token with
Token ID351
Feature activation-1.54e-01

Pos logit contributions
ixel+0.403
 popul+0.390
jri+0.388
 )))+0.364
 forfe+0.362
Neg logit contributions
agement-0.380
gil-0.356
 Email-0.349
ODY-0.349
arge-0.349
Token better
Token ID1365
Feature activation-1.04e+00

Pos logit contributions
ixel+0.111
 popul+0.107
jri+0.106
 )))+0.100
 forfe+0.100
Neg logit contributions
agement-0.101
gil-0.095
 Email-0.093
arge-0.093
ODY-0.093
Token standardized
Token ID25713
Feature activation-1.77e+00

Pos logit contributions
ixel+0.744
 popul+0.720
jri+0.715
 )))+0.672
 forfe+0.671
Neg logit contributions
agement-0.681
gil-0.639
arge-0.626
ODY-0.625
 Email-0.625
Token test
Token ID1332
Feature activation-6.50e+00

Pos logit contributions
ixel+1.050
 popul+1.008
jri+1.002
 )))+0.926
 forfe+0.923
Neg logit contributions
agement-1.383
gil-1.311
 Email-1.288
arge-1.288
ODY-1.288
Token scores
Token ID8198
Feature activation-6.47e-02

Pos logit contributions
ixel+1.528
 popul+1.430
jri+1.326
 diss+1.245
 forfe+1.197
Neg logit contributions
agement-7.281
gil-7.095
arge-6.967
 Email-6.925
ODY-6.924
Token and
Token ID290
Feature activation-7.61e-01

Pos logit contributions
ixel+0.046
 popul+0.045
jri+0.044
 )))+0.042
usra+0.042
Neg logit contributions
agement-0.043
gil-0.040
arge-0.039
ODY-0.039
 Email-0.039
Token better
Token ID1365
Feature activation-1.67e-01

Pos logit contributions
ixel+0.530
 popul+0.514
jri+0.508
 forfe+0.479
 )))+0.478
Neg logit contributions
agement-0.514
gil-0.484
ODY-0.473
arge-0.473
 Email-0.472
Token quality
Token ID3081
Feature activation-8.27e-01

Pos logit contributions
ixel+0.119
 popul+0.115
jri+0.115
 )))+0.107
usra+0.107
Neg logit contributions
agement-0.111
gil-0.104
 Email-0.102
arge-0.102
ODY-0.102
Token of
Token ID286
Feature activation-1.81e+00

Pos logit contributions
ixel+0.576
 popul+0.557
jri+0.553
 )))+0.519
 forfe+0.517
Neg logit contributions
agement-0.560
gil-0.526
 Email-0.515
arge-0.515
ODY-0.515
Token )
Token ID1267
Feature activation+1.52e+00

Pos logit contributions
ixel+0.612
 popul+0.573
jri+0.565
 )))+0.507
 forfe+0.500
Neg logit contributions
agement-1.578
gil-1.513
arge-1.494
ODY-1.493
 Email-1.492
Token,
Token ID837
Feature activation+3.06e-01

Pos logit contributions
agement+1.368
gil+1.312
 Email+1.297
ODY+1.288
arge+1.284
Neg logit contributions
ixel-0.710
 popul-0.687
jri-0.685
 condem-0.618
usra-0.602
Token 0
Token ID657
Feature activation-1.51e-01

Pos logit contributions
agement+0.212
gil+0.200
ODY+0.196
arge+0.196
 Email+0.196
Neg logit contributions
ixel-0.208
 popul-0.201
jri-0.200
 )))-0.188
 forfe-0.187
Token )
Token ID1267
Feature activation-9.73e-01

Pos logit contributions
ixel+0.079
 popul+0.076
jri+0.075
 )))+0.069
 forfe+0.069
Neg logit contributions
agement-0.128
gil-0.122
arge-0.120
ODY-0.120
 Email-0.120
Token black
Token ID2042
Feature activation+6.56e-02

Pos logit contributions
ixel+0.728
 popul+0.715
jri+0.700
 )))+0.675
 forfe+0.674
Neg logit contributions
agement-0.603
gil-0.564
ODY-0.555
arge-0.551
TPS-0.550
Tokenhat
Token ID5183
Feature activation-6.46e+00

Pos logit contributions
agement+0.043
gil+0.041
arge+0.040
 Email+0.040
ODY+0.040
Neg logit contributions
ixel-0.047
 popul-0.045
jri-0.045
 forfe-0.042
 )))-0.042
Token =
Token ID796
Feature activation+6.14e-01

Pos logit contributions
ixel+3.451
 )))+3.448
 popul+3.261
jri+3.104
 forfe+3.099
Neg logit contributions
agement-5.116
ODY-4.813
gil-4.778
arge-4.726
TPS-4.688
Token c
Token ID269
Feature activation-4.75e-01

Pos logit contributions
agement+0.368
gil+0.344
 Email+0.336
arge+0.335
ODY+0.335
Neg logit contributions
ixel-0.476
 popul-0.461
jri-0.459
 )))-0.432
 forfe-0.431
Tokenv
Token ID85
Feature activation+2.63e-02

Pos logit contributions
ixel+0.271
 popul+0.260
jri+0.257
 )))+0.238
 forfe+0.237
Neg logit contributions
agement-0.381
gil-0.362
arge-0.356
ODY-0.356
 Email-0.356
Token2
Token ID17
Feature activation-2.00e+00

Pos logit contributions
agement+0.018
gil+0.017
arge+0.017
 Email+0.017
ODY+0.017
Neg logit contributions
ixel-0.018
 popul-0.018
jri-0.017
 )))-0.016
 forfe-0.016
Token.
Token ID764
Feature activation-2.02e+00

Pos logit contributions
ixel+0.652
 popul+0.605
jri+0.581
 )))+0.546
 forfe+0.524
Neg logit contributions
agement-2.087
gil-1.997
ODY-1.982
arge-1.981
TPS-1.965
Token
Token ID250
Feature activation-1.59e+00

Pos logit contributions
ixel+1.064
 popul+1.023
jri+1.019
 )))+0.962
 forfe+0.961
Neg logit contributions
agement-0.941
gil-0.878
arge-0.871
 Email-0.861
ODY-0.861
TokenThe
Token ID464
Feature activation-5.54e-01

Pos logit contributions
ixel+1.183
 popul+1.146
jri+1.139
 )))+1.077
 forfe+1.070
Neg logit contributions
agement-0.990
gil-0.923
arge-0.906
 Email-0.902
ODY-0.901
Token one
Token ID530
Feature activation+1.96e+00

Pos logit contributions
ixel+0.384
 popul+0.372
jri+0.369
 )))+0.346
 forfe+0.345
Neg logit contributions
agement-0.376
gil-0.354
 Email-0.346
ODY-0.346
arge-0.346
Token thing
Token ID1517
Feature activation+3.12e+00

Pos logit contributions
agement+1.239
gil+1.156
 Email+1.138
ODY+1.136
arge+1.131
Neg logit contributions
ixel-1.447
 popul-1.396
jri-1.391
 )))-1.307
usra-1.307
Token that
Token ID326
Feature activation+4.37e-01

Pos logit contributions
agement+2.052
gil+1.921
 Email+1.920
ODY+1.894
TPS+1.881
Neg logit contributions
ixel-2.200
jri-2.116
 popul-2.091
 )))-1.983
 forfe-1.956
Token Bob
Token ID5811
Feature activation+5.07e+00

Pos logit contributions
agement+0.304
gil+0.287
 Email+0.282
ODY+0.280
arge+0.280
Neg logit contributions
ixel-0.295
 popul-0.284
jri-0.283
 )))-0.265
 forfe-0.264
Token told
Token ID1297
Feature activation+1.71e+00

Pos logit contributions
agement+2.697
gil+2.492
 Email+2.469
ODY+2.450
arge+2.399
Neg logit contributions
ixel-4.206
 popul-4.099
jri-4.068
 )))-3.845
 forfe-3.813
Token me
Token ID502
Feature activation+1.37e+00

Pos logit contributions
agement+1.204
gil+1.135
 Email+1.125
ODY+1.116
arge+1.113
Neg logit contributions
ixel-1.139
 popul-1.096
jri-1.096
 )))-1.019
 forfe-1.018
Token in
Token ID287
Feature activation+6.87e-01

Pos logit contributions
agement+0.939
gil+0.884
 Email+0.873
ODY+0.867
arge+0.862
Neg logit contributions
ixel-0.937
jri-0.901
 popul-0.898
 )))-0.839
usra-0.836
Token training
Token ID3047
Feature activation-1.07e+00

Pos logit contributions
agement+0.481
gil+0.452
 Email+0.448
arge+0.441
ODY+0.441
Neg logit contributions
ixel-0.466
 popul-0.446
jri-0.445
usra-0.414
 )))-0.414
Token in
Token ID287
Feature activation-1.42e-02

Pos logit contributions
ixel+0.705
 popul+0.686
jri+0.677
 forfe+0.637
 )))+0.636
Neg logit contributions
agement-0.755
gil-0.711
arge-0.698
ODY-0.696
 Email-0.693
Token le
Token ID443
Feature activation-1.52e-01

Pos logit contributions
ixel+0.486
 popul+0.470
jri+0.466
 )))+0.436
 forfe+0.435
Neg logit contributions
agement-0.515
gil-0.487
ODY-0.477
arge-0.476
 Email-0.476
Token S
Token ID311
Feature activation-1.70e-02

Pos logit contributions
ixel+0.099
 popul+0.096
jri+0.095
 )))+0.089
 forfe+0.089
Neg logit contributions
agement-0.109
gil-0.103
arge-0.101
ODY-0.101
 Email-0.100
Tokené
Token ID2634
Feature activation+3.19e+00

Pos logit contributions
ixel+0.013
 popul+0.013
jri+0.012
 )))+0.012
usra+0.012
Neg logit contributions
agement-0.010
gil-0.010
 Email-0.009
ODY-0.009
arge-0.009
Tokennat
Token ID32353
Feature activation+1.04e+00

Pos logit contributions
agement+2.380
gil+2.238
arge+2.208
 Email+2.170
ODY+2.161
Neg logit contributions
ixel-2.007
 popul-1.952
jri-1.926
 forfe-1.803
usra-1.794
Token Imp
Token ID9855
Feature activation-8.98e-01

Pos logit contributions
agement+0.715
gil+0.664
arge+0.656
 Email+0.654
ODY+0.649
Neg logit contributions
ixel-0.716
jri-0.687
 popul-0.685
 )))-0.640
 forfe-0.639
Tokené
Token ID2634
Feature activation+4.88e+00

Pos logit contributions
ixel+0.797
 popul+0.776
jri+0.771
 )))+0.733
 forfe+0.733
Neg logit contributions
agement-0.436
gil-0.402
ODY-0.390
 Email-0.389
arge-0.388
Tokenrial
Token ID4454
Feature activation-2.17e+00

Pos logit contributions
agement+4.037
gil+3.826
arge+3.741
 Email+3.665
ODY+3.663
Neg logit contributions
ixel-2.660
 popul-2.571
jri-2.529
 forfe-2.375
 )))-2.340
Token,
Token ID11
Feature activation+2.02e-01

Pos logit contributions
ixel+1.448
 popul+1.410
jri+1.386
 )))+1.305
 forfe+1.302
Neg logit contributions
agement-1.518
gil-1.440
ODY-1.408
arge-1.399
 Email-1.398
Token le
Token ID443
Feature activation-1.27e-01

Pos logit contributions
agement+0.143
gil+0.134
 Email+0.132
arge+0.132
ODY+0.132
Neg logit contributions
ixel-0.135
 popul-0.130
jri-0.129
 )))-0.121
 forfe-0.120
Token contr
Token ID3445
Feature activation+1.41e+00

Pos logit contributions
ixel+0.084
 popul+0.081
jri+0.081
 )))+0.075
 forfe+0.075
Neg logit contributions
agement-0.091
gil-0.086
arge-0.084
 Email-0.084
ODY-0.084
Tokenat
Token ID265
Feature activation+4.34e+00

Pos logit contributions
agement+0.922
gil+0.857
arge+0.845
 Email+0.834
ODY+0.833
Neg logit contributions
ixel-1.021
 popul-0.993
jri-0.988
 )))-0.925
 forfe-0.923
Token Trump
Token ID1301
Feature activation+6.95e-01

Pos logit contributions
agement+0.965
gil+0.908
 Email+0.900
arge+0.890
ODY+0.886
Neg logit contributions
ixel-0.956
jri-0.915
 popul-0.904
usra-0.849
 )))-0.848
Token could
Token ID714
Feature activation+8.83e-01

Pos logit contributions
agement+0.465
gil+0.433
 Email+0.429
arge+0.423
ODY+0.423
Neg logit contributions
ixel-0.493
jri-0.473
 popul-0.472
 )))-0.441
usra-0.440
Token learn
Token ID2193
Feature activation-8.28e-01

Pos logit contributions
agement+0.604
gil+0.565
 Email+0.562
arge+0.557
ODY+0.545
Neg logit contributions
ixel-0.608
jri-0.587
 popul-0.584
usra-0.547
 )))-0.540
Token several
Token ID1811
Feature activation-8.72e-01

Pos logit contributions
ixel+0.555
 popul+0.534
jri+0.532
 )))+0.496
 forfe+0.494
Neg logit contributions
agement-0.584
gil-0.550
 Email-0.540
arge-0.539
ODY-0.539
Token lessons
Token ID11658
Feature activation+2.04e+00

Pos logit contributions
ixel+0.589
jri+0.566
 popul+0.563
 )))+0.525
usra+0.524
Neg logit contributions
agement-0.613
gil-0.575
 Email-0.568
arge-0.564
ODY-0.562
Token from
Token ID422
Feature activation+4.43e+00

Pos logit contributions
agement+1.503
 Email+1.424
gil+1.422
arge+1.404
TPS+1.385
Neg logit contributions
ixel-1.304
jri-1.228
 popul-1.207
usra-1.162
 )))-1.123
Token former
Token ID1966
Feature activation+1.61e+00

Pos logit contributions
agement+2.814
gil+2.650
 Email+2.648
arge+2.581
TPS+2.556
Neg logit contributions
ixel-3.232
jri-3.109
 popul-2.996
usra-2.863
 )))-2.856
Token President
Token ID1992
Feature activation-1.11e+00

Pos logit contributions
agement+1.134
gil+1.069
 Email+1.056
arge+1.050
ODY+1.043
Neg logit contributions
ixel-1.085
jri-1.042
 popul-1.035
 )))-0.969
 forfe-0.962
Token Harry
Token ID5850
Feature activation+8.96e-01

Pos logit contributions
ixel+0.712
 popul+0.697
jri+0.683
 forfe+0.643
 diss+0.640
Neg logit contributions
agement-0.807
gil-0.764
ODY-0.753
arge-0.750
 Email-0.744
Token Truman
Token ID33027
Feature activation+3.80e+00

Pos logit contributions
agement+0.689
gil+0.655
ODY+0.646
arge+0.645
 Email+0.638
Neg logit contributions
ixel-0.539
 popul-0.520
jri-0.510
 )))-0.478
 forfe-0.477
Token.
Token ID13
Feature activation+7.35e-01

Pos logit contributions
agement+2.350
 Email+2.213
gil+2.201
arge+2.139
ODY+2.134
Neg logit contributions
ixel-2.851
jri-2.746
 popul-2.724
 )))-2.581
usra-2.574
Token it
Token ID340
Feature activation+7.37e-01

Pos logit contributions
ixel+1.549
 popul+1.506
jri+1.489
 )))+1.410
 forfe+1.409
Neg logit contributions
agement-1.454
gil-1.369
ODY-1.332
arge-1.327
 Email-1.315
Token was
Token ID373
Feature activation-1.43e+00

Pos logit contributions
agement+0.516
gil+0.487
arge+0.478
 Email+0.477
ODY+0.477
Neg logit contributions
ixel-0.496
 popul-0.478
jri-0.474
 )))-0.443
 forfe-0.442
Token ok
Token ID12876
Feature activation+6.05e-01

Pos logit contributions
ixel+0.967
 popul+0.941
jri+0.927
 forfe+0.876
 )))+0.874
Neg logit contributions
agement-0.991
gil-0.937
ODY-0.917
arge-0.913
 Email-0.909
Token,
Token ID11
Feature activation+9.49e-01

Pos logit contributions
agement+0.403
gil+0.379
 Email+0.371
arge+0.371
ODY+0.370
Neg logit contributions
ixel-0.428
 popul-0.414
jri-0.411
 )))-0.385
 forfe-0.385
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
agement+0.655
gil+0.615
 Email+0.608
arge+0.606
ODY+0.603
Neg logit contributions
ixel-0.648
 popul-0.623
jri-0.622
usra-0.579
 )))-0.578
Token quick
Token ID2068
Feature activation+5.22e+00

Pos logit contributions
agement+1.052
gil+0.983
 Email+0.971
arge+0.966
ODY+0.959
Neg logit contributions
ixel-1.151
 popul-1.116
jri-1.108
 )))-1.039
usra-1.037
Token search
Token ID2989
Feature activation+2.60e+00

Pos logit contributions
agement+3.345
 Email+3.215
gil+3.148
arge+3.106
ODY+3.053
Neg logit contributions
ixel-3.628
jri-3.614
 popul-3.556
usra-3.323
 forfe-3.311
Token revealed
Token ID4602
Feature activation+3.54e+00

Pos logit contributions
agement+1.759
gil+1.649
 Email+1.631
arge+1.624
ODY+1.617
Neg logit contributions
ixel-1.803
jri-1.744
 popul-1.742
 )))-1.620
 forfe-1.614
Token that
Token ID326
Feature activation+3.73e+00

Pos logit contributions
agement+2.164
 Email+2.018
gil+2.005
arge+1.985
TPS+1.977
Neg logit contributions
ixel-2.669
 popul-2.580
jri-2.576
 )))-2.416
usra-2.396
Token they
Token ID484
Feature activation+2.11e+00

Pos logit contributions
agement+2.302
 Email+2.158
gil+2.128
arge+2.120
TPS+2.117
Neg logit contributions
ixel-2.771
jri-2.682
 popul-2.663
usra-2.491
 )))-2.487
Token had
Token ID550
Feature activation+1.55e+00

Pos logit contributions
agement+1.441
gil+1.347
 Email+1.337
arge+1.331
TPS+1.317
Neg logit contributions
ixel-1.448
jri-1.399
 popul-1.394
usra-1.298
 )))-1.296
Token't
Token ID470
Feature activation+1.94e-01

Pos logit contributions
ixel+0.772
 popul+0.750
jri+0.744
 )))+0.719
 forfe+0.713
Neg logit contributions
agement-0.388
gil-0.351
ODY-0.344
arge-0.343
 Email-0.342
Token see
Token ID766
Feature activation-4.19e-01

Pos logit contributions
agement+0.133
gil+0.125
arge+0.122
 Email+0.122
ODY+0.122
Neg logit contributions
ixel-0.134
 popul-0.130
jri-0.129
 )))-0.121
 forfe-0.120
Token this
Token ID428
Feature activation+1.74e+00

Pos logit contributions
ixel+0.309
 popul+0.300
jri+0.298
 forfe+0.282
 )))+0.281
Neg logit contributions
agement-0.265
gil-0.249
ODY-0.242
arge-0.242
 Email-0.241
Token as
Token ID355
Feature activation+2.91e+00

Pos logit contributions
agement+1.122
gil+1.050
arge+1.030
 Email+1.028
ODY+1.027
Neg logit contributions
ixel-1.273
 popul-1.233
jri-1.225
 )))-1.151
usra-1.146
Token too
Token ID1165
Feature activation+2.24e+00

Pos logit contributions
agement+1.856
gil+1.730
arge+1.709
 Email+1.701
TPS+1.697
Neg logit contributions
ixel-2.144
 popul-2.067
jri-2.055
 )))-1.923
usra-1.919
Token big
Token ID1263
Feature activation+6.03e+00

Pos logit contributions
agement+1.491
gil+1.397
arge+1.389
ODY+1.376
 Email+1.372
Neg logit contributions
ixel-1.602
jri-1.529
 popul-1.512
 condem-1.426
usra-1.420
Token of
Token ID286
Feature activation+7.43e-01

Pos logit contributions
agement+3.505
 Email+3.296
gil+3.254
arge+3.223
ODY+3.180
Neg logit contributions
ixel-4.754
jri-4.600
 popul-4.553
 condem-4.417
usra-4.253
Token a
Token ID257
Feature activation+7.46e-01

Pos logit contributions
agement+0.456
gil+0.427
ODY+0.417
arge+0.417
 Email+0.416
Neg logit contributions
ixel-0.563
 popul-0.546
jri-0.542
 )))-0.512
 forfe-0.512
Token concern
Token ID2328
Feature activation+2.26e+00

Pos logit contributions
agement+0.450
gil+0.420
arge+0.411
 Email+0.410
ODY+0.410
Neg logit contributions
ixel-0.574
 popul-0.557
jri-0.553
 )))-0.522
 forfe-0.521
Token though
Token ID996
Feature activation-1.07e+00

Pos logit contributions
agement+1.376
gil+1.289
 Email+1.245
arge+1.244
ODY+1.234
Neg logit contributions
ixel-1.734
 popul-1.684
jri-1.677
 )))-1.596
 forfe-1.587
Token since
Token ID1201
Feature activation-2.62e-01

Pos logit contributions
ixel+0.613
 popul+0.597
jri+0.593
 forfe+0.564
 )))+0.562
Neg logit contributions
agement-0.843
gil-0.799
ODY-0.784
arge-0.782
TPS-0.777
Token anyone
Token ID2687
Feature activation-1.60e+00

Pos logit contributions
agement+0.613
gil+0.574
 Email+0.568
arge+0.564
ODY+0.561
Neg logit contributions
ixel-0.632
 popul-0.607
jri-0.605
 )))-0.567
usra-0.564
Token with
Token ID351
Feature activation+5.81e-01

Pos logit contributions
ixel+1.204
 popul+1.183
jri+1.165
 forfe+1.116
 )))+1.110
Neg logit contributions
agement-0.971
gil-0.922
ODY-0.897
arge-0.892
TPS-0.884
Token an
Token ID281
Feature activation+6.30e-01

Pos logit contributions
agement+0.355
gil+0.330
 Email+0.326
arge+0.324
ODY+0.322
Neg logit contributions
ixel-0.444
 popul-0.430
jri-0.428
 )))-0.404
usra-0.403
Token Amazon
Token ID6186
Feature activation-6.86e-01

Pos logit contributions
agement+0.405
 Email+0.377
gil+0.376
arge+0.372
ODY+0.369
Neg logit contributions
ixel-0.461
jri-0.449
 popul-0.448
 )))-0.418
usra-0.417
Token Prime
Token ID5537
Feature activation+7.77e-01

Pos logit contributions
ixel+0.455
 popul+0.439
jri+0.435
 )))+0.407
 forfe+0.406
Neg logit contributions
agement-0.488
gil-0.460
arge-0.451
 Email-0.451
ODY-0.451
Token subscription
Token ID14569
Feature activation+2.43e+00

Pos logit contributions
agement+0.537
gil+0.502
 Email+0.498
arge+0.495
ODY+0.493
Neg logit contributions
ixel-0.533
 popul-0.514
jri-0.512
 )))-0.478
usra-0.477
Token to
Token ID284
Feature activation-6.18e-01

Pos logit contributions
agement+1.372
gil+1.267
 Email+1.256
arge+1.249
ODY+1.230
Neg logit contributions
ixel-1.972
 popul-1.907
jri-1.906
usra-1.796
 )))-1.795
Token share
Token ID2648
Feature activation-1.39e+00

Pos logit contributions
ixel+0.429
 popul+0.416
jri+0.412
 )))+0.387
 forfe+0.386
Neg logit contributions
agement-0.420
gil-0.395
ODY-0.387
arge-0.386
 Email-0.385
Token shipping
Token ID8440
Feature activation+7.52e-01

Pos logit contributions
ixel+0.993
 popul+0.972
jri+0.959
 forfe+0.913
 diss+0.904
Neg logit contributions
agement-0.892
gil-0.846
ODY-0.828
arge-0.821
TPS-0.816
Token benefits
Token ID4034
Feature activation-1.38e-01

Pos logit contributions
agement+0.577
gil+0.543
 Email+0.537
arge+0.535
ODY+0.531
Neg logit contributions
ixel-0.460
 popul-0.437
jri-0.436
 )))-0.406
usra-0.401
Token and
Token ID290
Feature activation+1.80e-01

Pos logit contributions
ixel+0.113
 popul+0.109
jri+0.109
 )))+0.103
usra+0.103
Neg logit contributions
agement-0.076
gil-0.071
 Email-0.069
arge-0.069
ODY-0.069
Tokenrock
Token ID10823
Feature activation+5.75e-01

Pos logit contributions
agement+0.947
gil+0.890
 Email+0.870
arge+0.870
ODY+0.867
Neg logit contributions
ixel-1.037
 popul-1.001
jri-0.995
 )))-0.940
 forfe-0.936
Token in
Token ID287
Feature activation+8.88e-01

Pos logit contributions
agement+0.388
gil+0.365
arge+0.357
 Email+0.357
ODY+0.356
Neg logit contributions
ixel-0.402
 popul-0.388
jri-0.386
 )))-0.363
 forfe-0.362
Token 1979
Token ID13521
Feature activation-2.91e-01

Pos logit contributions
agement+0.574
gil+0.538
 Email+0.529
ODY+0.527
arge+0.526
Neg logit contributions
ixel-0.645
 popul-0.624
jri-0.621
 )))-0.583
 forfe-0.582
Token,
Token ID11
Feature activation+6.82e-01

Pos logit contributions
ixel+0.208
 popul+0.202
jri+0.200
 )))+0.188
 forfe+0.188
Neg logit contributions
agement-0.191
gil-0.179
arge-0.175
ODY-0.175
 Email-0.174
Token unfortunately
Token ID12716
Feature activation+6.13e-02

Pos logit contributions
agement+0.439
gil+0.411
arge+0.402
 Email+0.402
ODY+0.402
Neg logit contributions
ixel-0.498
 popul-0.482
jri-0.479
 )))-0.451
 forfe-0.450
Token about
Token ID546
Feature activation+1.24e+00

Pos logit contributions
agement+0.042
gil+0.039
 Email+0.038
arge+0.038
ODY+0.038
Neg logit contributions
ixel-0.043
 popul-0.041
jri-0.041
 )))-0.038
 forfe-0.038
Token the
Token ID262
Feature activation+7.87e-02

Pos logit contributions
agement+0.732
gil+0.682
arge+0.666
ODY+0.666
 Email+0.664
Neg logit contributions
ixel-0.973
 popul-0.945
jri-0.938
 )))-0.888
 forfe-0.886
Token same
Token ID976
Feature activation-2.44e-01

Pos logit contributions
agement+0.053
gil+0.049
ODY+0.048
arge+0.048
 Email+0.048
Neg logit contributions
ixel-0.056
 popul-0.054
jri-0.053
 )))-0.050
 forfe-0.050
Token time
Token ID640
Feature activation+7.50e-01

Pos logit contributions
ixel+0.169
 popul+0.164
jri+0.162
 )))+0.152
 forfe+0.152
Neg logit contributions
agement-0.166
gil-0.156
arge-0.152
ODY-0.152
 Email-0.152
Token Jeffrey
Token ID19627
Feature activation-1.93e+00

Pos logit contributions
agement+0.496
gil+0.466
arge+0.456
ODY+0.455
 Email+0.455
Neg logit contributions
ixel-0.533
 popul-0.518
jri-0.512
 )))-0.482
 forfe-0.481
Token M
Token ID337
Feature activation+1.27e+00

Pos logit contributions
ixel+1.303
 popul+1.261
jri+1.246
 forfe+1.170
 )))+1.169
Neg logit contributions
agement-1.338
gil-1.260
ODY-1.242
arge-1.234
 Email-1.231
Token and
Token ID290
Feature activation-7.91e-01

Pos logit contributions
ixel+2.322
 popul+2.260
jri+2.223
 diss+2.100
 forfe+2.098
Neg logit contributions
agement-2.331
gil-2.182
ODY-2.151
arge-2.151
TPS-2.138
Token her
Token ID607
Feature activation-8.72e-01

Pos logit contributions
ixel+0.514
 popul+0.495
jri+0.492
 )))+0.459
 forfe+0.458
Neg logit contributions
agement-0.573
gil-0.542
 Email-0.531
arge-0.531
ODY-0.531
Token husband
Token ID5229
Feature activation+9.15e-01

Pos logit contributions
ixel+0.539
 popul+0.518
jri+0.514
 )))+0.477
 forfe+0.476
Neg logit contributions
agement-0.660
gil-0.625
 Email-0.613
arge-0.613
ODY-0.613
Token Douglas
Token ID15796
Feature activation-4.72e-01

Pos logit contributions
agement+0.623
gil+0.589
arge+0.577
 Email+0.576
ODY+0.574
Neg logit contributions
ixel-0.634
 popul-0.611
jri-0.610
usra-0.570
 )))-0.570
Token Fair
Token ID7011
Feature activation-1.54e+00

Pos logit contributions
ixel+0.341
 popul+0.330
jri+0.328
 )))+0.308
 forfe+0.307
Neg logit contributions
agement-0.308
gil-0.290
arge-0.283
 Email-0.283
ODY-0.283
Tokenbanks
Token ID43558
Feature activation+3.92e-01

Pos logit contributions
ixel+1.364
 popul+1.330
jri+1.322
 )))+1.258
 forfe+1.254
Neg logit contributions
agement-0.754
gil-0.683
TPS-0.668
ODY-0.668
 Email-0.667
Token wanted
Token ID2227
Feature activation+2.19e+00

Pos logit contributions
agement+0.283
gil+0.268
arge+0.263
 Email+0.263
ODY+0.262
Neg logit contributions
ixel-0.254
 popul-0.245
jri-0.244
 )))-0.227
usra-0.227
Token the
Token ID262
Feature activation+1.27e+00

Pos logit contributions
agement+1.249
gil+1.167
arge+1.138
ODY+1.138
 Email+1.128
Neg logit contributions
ixel-1.759
jri-1.694
 popul-1.687
 )))-1.589
usra-1.587
Token Daniels
Token ID28162
Feature activation-7.61e-02

Pos logit contributions
agement+0.891
gil+0.837
arge+0.820
 Email+0.818
ODY+0.817
Neg logit contributions
ixel-0.856
jri-0.821
 popul-0.817
 )))-0.761
usra-0.761
Token ancestor
Token ID31836
Feature activation+1.34e+00

Pos logit contributions
ixel+0.052
 popul+0.051
jri+0.050
 )))+0.047
 forfe+0.047
Neg logit contributions
agement-0.052
gil-0.049
arge-0.048
 Email-0.048
ODY-0.048
Token to
Token ID284
Feature activation+3.70e-02

Pos logit contributions
agement+0.824
gil+0.766
arge+0.750
 Email+0.750
ODY+0.748
Neg logit contributions
ixel-1.016
jri-0.981
 popul-0.978
 )))-0.918
usra-0.916
Token.
Token ID13
Feature activation-9.94e-01

Pos logit contributions
agement+0.596
gil+0.559
arge+0.548
ODY+0.546
 Email+0.546
Neg logit contributions
ixel-0.623
 popul-0.604
jri-0.598
 )))-0.564
 forfe-0.562
TokenP
Token ID47
Feature activation-1.99e+00

Pos logit contributions
ixel+0.527
 popul+0.507
jri+0.496
 )))+0.460
 forfe+0.458
Neg logit contributions
agement-0.837
gil-0.794
arge-0.783
ODY-0.779
 Email-0.779
Token.,
Token ID1539
Feature activation-2.00e+00

Pos logit contributions
ixel+1.218
 popul+1.182
jri+1.162
 )))+1.087
 forfe+1.080
Neg logit contributions
agement-1.505
gil-1.424
ODY-1.395
arge-1.394
 Email-1.394
Token Self
Token ID12189
Feature activation+4.16e-01

Pos logit contributions
ixel+1.353
 popul+1.318
jri+1.296
 )))+1.224
 forfe+1.223
Neg logit contributions
agement-1.381
gil-1.304
arge-1.274
ODY-1.273
 Email-1.266
Tokenie
Token ID494
Feature activation+2.92e+00

Pos logit contributions
agement+0.277
gil+0.259
arge+0.255
 Email+0.254
ODY+0.252
Neg logit contributions
ixel-0.295
 popul-0.287
jri-0.287
 forfe-0.267
usra-0.266
Token),
Token ID828
Feature activation+8.29e-01

Pos logit contributions
agement+1.910
gil+1.793
 Email+1.768
arge+1.754
ODY+1.751
Neg logit contributions
ixel-2.100
jri-2.023
 popul-2.016
usra-1.888
 forfe-1.878
Token but
Token ID475
Feature activation-4.68e-01

Pos logit contributions
agement+0.477
gil+0.442
arge+0.432
ODY+0.431
 Email+0.431
Neg logit contributions
ixel-0.663
 popul-0.641
jri-0.640
 )))-0.604
usra-0.602
Token,
Token ID11
Feature activation-5.85e-02

Pos logit contributions
ixel+0.320
 popul+0.310
jri+0.307
 )))+0.288
 forfe+0.288
Neg logit contributions
agement-0.322
gil-0.303
arge-0.297
ODY-0.297
 Email-0.296
Token as
Token ID355
Feature activation-5.17e-01

Pos logit contributions
ixel+0.041
 popul+0.040
jri+0.039
 )))+0.037
 forfe+0.037
Neg logit contributions
agement-0.039
gil-0.037
arge-0.036
 Email-0.036
ODY-0.036
Token a
Token ID257
Feature activation-1.12e-01

Pos logit contributions
ixel+0.364
 popul+0.352
jri+0.349
 )))+0.328
 forfe+0.327
Neg logit contributions
agement-0.346
gil-0.325
arge-0.318
 Email-0.318
ODY-0.318
Token female
Token ID4048
Feature activation-1.75e+00

Pos logit contributions
ixel+0.070
 popul+0.067
jri+0.067
 )))+0.062
 forfe+0.062
Neg logit contributions
agement-0.085
gil-0.080
 Email-0.079
ODY-0.079
arge-0.079
Token Gr
Token ID1902
Feature activation-1.38e-01

Pos logit contributions
ixel+0.879
 popul+0.854
jri+0.840
 )))+0.793
 forfe+0.792
Neg logit contributions
agement-0.876
gil-0.824
arge-0.806
ODY-0.804
 Email-0.804
Tokenong
Token ID506
Feature activation-2.66e-01

Pos logit contributions
ixel+0.084
 popul+0.081
jri+0.080
 )))+0.075
 forfe+0.075
Neg logit contributions
agement-0.105
gil-0.100
arge-0.098
 Email-0.098
ODY-0.098
Token Gr
Token ID1902
Feature activation+8.66e-02

Pos logit contributions
ixel+0.204
 popul+0.198
jri+0.196
 forfe+0.186
 )))+0.186
Neg logit contributions
agement-0.161
gil-0.150
ODY-0.147
arge-0.146
 Email-0.146
Tokenong
Token ID506
Feature activation-1.16e-01

Pos logit contributions
agement+0.066
gil+0.062
arge+0.061
 Email+0.061
ODY+0.061
Neg logit contributions
ixel-0.053
 popul-0.051
jri-0.051
 )))-0.047
 forfe-0.047
Token was
Token ID373
Feature activation+8.52e-01

Pos logit contributions
ixel+0.075
 popul+0.072
jri+0.072
 )))+0.067
 forfe+0.066
Neg logit contributions
agement-0.085
gil-0.080
 Email-0.079
ODY-0.079
arge-0.079
Token the
Token ID262
Feature activation+2.83e-01

Pos logit contributions
agement+0.556
gil+0.521
 Email+0.512
ODY+0.512
arge+0.511
Neg logit contributions
ixel-0.616
jri-0.594
 popul-0.587
usra-0.552
 )))-0.551
Token one
Token ID530
Feature activation-4.15e-01

Pos logit contributions
agement+0.171
gil+0.159
 Email+0.156
arge+0.155
ODY+0.155
Neg logit contributions
ixel-0.219
 popul-0.211
jri-0.210
 )))-0.198
 forfe-0.198
Token that
Token ID326
Feature activation-2.78e+00

Pos logit contributions
ixel+0.295
 popul+0.286
jri+0.283
 )))+0.266
 forfe+0.266
Neg logit contributions
agement-0.275
gil-0.259
arge-0.253
 Email-0.253
ODY-0.253
Token stood
Token ID6204
Feature activation+1.86e-03

Pos logit contributions
ixel+1.790
 popul+1.757
jri+1.705
 forfe+1.628
 diss+1.619
Neg logit contributions
agement-2.009
gil-1.903
arge-1.863
ODY-1.859
 Email-1.854
Token out
Token ID503
Feature activation+1.51e+00

Pos logit contributions
agement+0.002
gil+0.001
arge+0.001
 Email+0.001
ODY+0.001
Neg logit contributions
ixel-0.001
 popul-0.001
jri-0.001
 )))-0.001
 forfe-0.001
Token over
Token ID625
Feature activation+2.00e+00

Pos logit contributions
agement+0.956
gil+0.890
arge+0.879
ODY+0.878
 Email+0.877
Neg logit contributions
ixel-1.117
jri-1.079
 popul-1.071
usra-1.018
 )))-1.006
Token it
Token ID340
Feature activation-9.34e-01

Pos logit contributions
ixel+1.682
 popul+1.650
jri+1.624
 forfe+1.532
 )))+1.529
Neg logit contributions
agement-1.605
gil-1.510
arge-1.482
ODY-1.481
 Email-1.471
Token calls
Token ID3848
Feature activation-7.20e-01

Pos logit contributions
ixel+0.643
 popul+0.624
jri+0.617
 forfe+0.580
 )))+0.579
Neg logit contributions
agement-0.639
gil-0.601
ODY-0.590
arge-0.590
 Email-0.589
Token Ro
Token ID5564
Feature activation-2.67e+00

Pos logit contributions
ixel+0.511
 popul+0.494
jri+0.491
 )))+0.461
 forfe+0.459
Neg logit contributions
agement-0.479
gil-0.450
 Email-0.440
arge-0.440
ODY-0.440
Tokenjava
Token ID12355
Feature activation-1.74e+00

Pos logit contributions
ixel+1.469
 popul+1.411
jri+1.378
 forfe+1.285
 )))+1.283
Neg logit contributions
agement-2.210
gil-2.100
ODY-2.059
arge-2.058
 Email-2.044
Token -
Token ID532
Feature activation-2.49e-01

Pos logit contributions
ixel+1.260
 popul+1.217
jri+1.211
 )))+1.139
 forfe+1.135
Neg logit contributions
agement-1.133
gil-1.063
 Email-1.041
ODY-1.039
arge-1.039
Token the
Token ID262
Feature activation-6.61e-02

Pos logit contributions
ixel+0.183
 popul+0.177
jri+0.176
 )))+0.166
 forfe+0.165
Neg logit contributions
agement-0.159
gil-0.149
 Email-0.145
ODY-0.145
arge-0.144
Token Kurdish
Token ID14845
Feature activation-6.06e-01

Pos logit contributions
ixel+0.044
 popul+0.042
jri+0.042
 )))+0.040
 forfe+0.039
Neg logit contributions
agement-0.047
gil-0.044
 Email-0.043
ODY-0.043
arge-0.043
Token areas
Token ID3006
Feature activation+1.30e+00

Pos logit contributions
ixel+0.455
 popul+0.439
jri+0.438
 )))+0.413
 forfe+0.411
Neg logit contributions
agement-0.377
gil-0.353
 Email-0.345
arge-0.345
ODY-0.345
Token of
Token ID286
Feature activation+4.64e-01

Pos logit contributions
agement+0.852
gil+0.799
 Email+0.783
arge+0.778
ODY+0.778
Neg logit contributions
ixel-0.931
jri-0.896
 popul-0.889
 )))-0.839
usra-0.832
Token Syria
Token ID4392
Feature activation+1.77e-01

Pos logit contributions
agement+0.301
gil+0.283
 Email+0.278
arge+0.276
ODY+0.276
Neg logit contributions
ixel-0.335
jri-0.324
 popul-0.323
 )))-0.304
 forfe-0.302
Token -
Token ID532
Feature activation+1.40e+00

Pos logit contributions
agement+0.114
gil+0.106
 Email+0.104
arge+0.104
ODY+0.103
Neg logit contributions
ixel-0.130
 popul-0.125
jri-0.125
 )))-0.117
 forfe-0.116
TokenAn
Token ID2025
Feature activation+9.63e-01

Pos logit contributions
ixel+0.064
 popul+0.062
jri+0.062
 )))+0.058
 forfe+0.058
Neg logit contributions
agement-0.054
gil-0.050
arge-0.049
 Email-0.049
ODY-0.049
Token historical
Token ID6754
Feature activation+1.53e-01

Pos logit contributions
agement+0.697
gil+0.656
 Email+0.648
arge+0.643
ODY+0.640
Neg logit contributions
ixel-0.630
 popul-0.611
jri-0.608
 )))-0.565
usra-0.560
Token marker
Token ID18364
Feature activation+1.12e+00

Pos logit contributions
agement+0.102
gil+0.096
arge+0.094
 Email+0.094
ODY+0.093
Neg logit contributions
ixel-0.109
jri-0.105
 popul-0.105
 )))-0.098
usra-0.097
Token in
Token ID287
Feature activation-5.19e-01

Pos logit contributions
agement+0.781
gil+0.733
 Email+0.722
arge+0.719
ODY+0.713
Neg logit contributions
ixel-0.768
jri-0.742
 popul-0.729
usra-0.683
 )))-0.682
Token May
Token ID1737
Feature activation-5.67e-01

Pos logit contributions
ixel+0.358
 popul+0.347
jri+0.343
 )))+0.323
 forfe+0.323
Neg logit contributions
agement-0.353
gil-0.332
arge-0.326
ODY-0.325
 Email-0.325
Tokenesville
Token ID19641
Feature activation-1.35e+00

Pos logit contributions
ixel+0.380
 popul+0.367
jri+0.363
 )))+0.341
 forfe+0.340
Neg logit contributions
agement-0.399
gil-0.376
arge-0.368
ODY-0.368
 Email-0.368
Token,
Token ID11
Feature activation+3.79e-01

Pos logit contributions
ixel+0.932
 popul+0.909
jri+0.894
 forfe+0.849
 )))+0.843
Neg logit contributions
agement-0.910
gil-0.855
ODY-0.839
arge-0.839
 Email-0.834
Token Sum
Token ID5060
Feature activation+2.66e-01

Pos logit contributions
agement+0.247
gil+0.232
 Email+0.228
ODY+0.227
arge+0.227
Neg logit contributions
ixel-0.274
 popul-0.264
jri-0.263
 )))-0.247
usra-0.246
Tokenter
Token ID353
Feature activation+1.55e-01

Pos logit contributions
agement+0.170
gil+0.160
arge+0.157
 Email+0.156
ODY+0.156
Neg logit contributions
ixel-0.195
 popul-0.189
jri-0.188
 )))-0.176
 forfe-0.176
Token County
Token ID3418
Feature activation+1.58e-01

Pos logit contributions
agement+0.083
gil+0.077
 Email+0.075
arge+0.075
ODY+0.075
Neg logit contributions
ixel-0.130
 popul-0.125
jri-0.125
 )))-0.119
usra-0.118
Token,
Token ID11
Feature activation+6.51e-01

Pos logit contributions
agement+0.104
gil+0.097
 Email+0.095
ODY+0.095
arge+0.095
Neg logit contributions
ixel-0.114
 popul-0.110
jri-0.109
 )))-0.103
usra-0.103
Tokenv
Token ID85
Feature activation+3.77e-02

Pos logit contributions
ixel+0.386
 popul+0.369
jri+0.366
 )))+0.339
 forfe+0.337
Neg logit contributions
agement-0.553
gil-0.524
ODY-0.517
 Email-0.516
arge-0.516
Token2
Token ID17
Feature activation-2.60e+00

Pos logit contributions
agement+0.026
gil+0.024
arge+0.024
ODY+0.024
 Email+0.024
Neg logit contributions
ixel-0.026
 popul-0.025
jri-0.025
 )))-0.023
 forfe-0.023
Token.
Token ID764
Feature activation-3.04e+00

Pos logit contributions
ixel+0.768
 popul+0.709
jri+0.668
 )))+0.649
 forfe+0.614
Neg logit contributions
agement-2.774
gil-2.649
ODY-2.636
arge-2.635
TPS-2.610
Token Ga
Token ID12822
Feature activation-1.38e+00

Pos logit contributions
ixel+2.030
 popul+1.989
jri+1.911
 diss+1.860
 forfe+1.850
Neg logit contributions
agement-2.152
TPS-1.992
ODY-1.979
gil-1.977
arge-1.950
Tokenussian
Token ID31562
Feature activation+5.88e-01

Pos logit contributions
ixel+1.003
 popul+0.968
jri+0.963
 )))+0.909
usra+0.905
Neg logit contributions
agement-0.900
gil-0.842
ODY-0.829
 Email-0.825
arge-0.822
TokenBl
Token ID3629
Feature activation-2.79e+00

Pos logit contributions
agement+0.499
gil+0.475
arge+0.467
 Email+0.467
ODY+0.467
Neg logit contributions
ixel-0.309
 popul-0.296
jri-0.293
 )))-0.269
 forfe-0.268
Tokenur
Token ID333
Feature activation+3.62e-01

Pos logit contributions
ixel+2.044
jri+1.927
 popul+1.913
usra+1.822
 )))+1.815
Neg logit contributions
agement-1.802
ODY-1.688
gil-1.670
TPS-1.661
 Email-1.656
Token (
Token ID357
Feature activation+9.14e-01

Pos logit contributions
agement+0.246
gil+0.231
arge+0.228
ODY+0.225
 Email+0.224
Neg logit contributions
ixel-0.248
 popul-0.240
jri-0.238
 )))-0.228
 forfe-0.225
Token gray
Token ID12768
Feature activation-2.08e+00

Pos logit contributions
agement+0.599
gil+0.558
ODY+0.554
TPS+0.551
arge+0.549
Neg logit contributions
ixel-0.658
 popul-0.638
jri-0.626
 )))-0.602
 forfe-0.598
Token,
Token ID837
Feature activation-3.17e+00

Pos logit contributions
ixel+0.741
 popul+0.676
jri+0.658
 )))+0.646
 forfe+0.610
Neg logit contributions
agement-2.093
gil-2.005
arge-1.993
ODY-1.989
TPS-1.976
Token (
Token ID357
Feature activation-1.48e+00

Pos logit contributions
ixel+2.019
 popul+1.968
 )))+1.899
jri+1.877
 diss+1.867
Neg logit contributions
agement-2.304
gil-2.161
ODY-2.127
arge-2.113
TPS-2.109
Token 50
Token ID2026
Feature activation-1.64e+00

Pos logit contributions
ixel+1.440
 popul+1.409
jri+1.389
 )))+1.302
 forfe+1.301
Neg logit contributions
agement-1.414
gil-1.330
ODY-1.302
arge-1.301
 Email-1.298
Token homeless
Token ID10463
Feature activation+6.02e-03

Pos logit contributions
ixel+1.050
 popul+1.016
jri+1.007
 )))+0.937
 forfe+0.936
Neg logit contributions
agement-1.202
gil-1.134
ODY-1.117
 Email-1.113
arge-1.113
Token each
Token ID1123
Feature activation-1.20e+00

Pos logit contributions
agement+0.005
gil+0.005
arge+0.005
 Email+0.005
ODY+0.005
Neg logit contributions
ixel-0.003
 popul-0.003
jri-0.003
 forfe-0.003
 )))-0.003
Token week
Token ID1285
Feature activation+1.84e+00

Pos logit contributions
ixel+0.746
 popul+0.725
jri+0.715
 )))+0.668
 forfe+0.664
Neg logit contributions
agement-0.900
gil-0.854
ODY-0.837
arge-0.834
 Email-0.833
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
agement+1.166
gil+1.096
 Email+1.078
ODY+1.071
arge+1.070
Neg logit contributions
ixel-1.355
jri-1.307
 popul-1.305
 )))-1.224
 forfe-1.222
Token
Token ID447
Feature activation-2.04e+00

Pos logit contributions
agement+0.360
gil+0.338
 Email+0.332
arge+0.331
ODY+0.331
Neg logit contributions
ixel-0.374
 popul-0.361
jri-0.360
 )))-0.337
 forfe-0.336
Token
Token ID251
Feature activation-5.84e-01

Pos logit contributions
ixel+1.578
 popul+1.519
jri+1.504
 )))+1.430
 forfe+1.426
Neg logit contributions
agement-1.227
gil-1.141
arge-1.126
ODY-1.121
 Email-1.118
Token says
Token ID1139
Feature activation-2.63e+00

Pos logit contributions
ixel+0.421
 popul+0.409
jri+0.405
 )))+0.381
 forfe+0.380
Neg logit contributions
agement-0.381
gil-0.357
arge-0.350
ODY-0.349
 Email-0.349
Token He
Token ID679
Feature activation-1.16e+00

Pos logit contributions
ixel+1.810
 popul+1.780
jri+1.751
 diss+1.646
 forfe+1.645
Neg logit contributions
agement-1.778
gil-1.665
arge-1.649
ODY-1.640
TPS-1.626
Tokenvil
Token ID2991
Feature activation+3.57e-01

Pos logit contributions
ixel+0.817
 popul+0.791
jri+0.786
 )))+0.737
 forfe+0.735
Neg logit contributions
agement-0.770
gil-0.722
ODY-0.709
arge-0.708
 Email-0.707
Tokenaine
Token ID5718
Feature activation+2.38e+00

Pos logit contributions
agement+0.242
gil+0.228
arge+0.223
 Email+0.223
ODY+0.223
Neg logit contributions
ixel-0.248
 popul-0.240
jri-0.238
 )))-0.224
 forfe-0.223
Token mad
Token ID8805
Feature activation+2.78e+00

Pos logit contributions
ixel+0.559
 popul+0.539
jri+0.535
 )))+0.499
 forfe+0.498
Neg logit contributions
agement-0.624
gil-0.589
arge-0.578
ODY-0.577
 Email-0.577
Token geek
Token ID27314
Feature activation-9.21e-02

Pos logit contributions
agement+1.956
gil+1.837
 Email+1.783
arge+1.783
ODY+1.780
Neg logit contributions
ixel-1.882
 popul-1.799
jri-1.782
 forfe-1.675
 )))-1.664
Token cred
Token ID2600
Feature activation+1.41e+00

Pos logit contributions
ixel+0.061
 popul+0.059
jri+0.059
 )))+0.055
 forfe+0.055
Neg logit contributions
agement-0.065
gil-0.062
arge-0.060
ODY-0.060
 Email-0.060
Token like
Token ID588
Feature activation+1.08e-01

Pos logit contributions
agement+1.079
gil+1.021
arge+1.003
ODY+1.002
 Email+1.001
Neg logit contributions
ixel-0.864
 popul-0.830
jri-0.823
 )))-0.767
 forfe-0.765
Token wearing
Token ID5762
Feature activation+1.81e+00

Pos logit contributions
agement+0.072
gil+0.067
 Email+0.066
arge+0.066
ODY+0.066
Neg logit contributions
ixel-0.077
 popul-0.074
jri-0.074
 )))-0.069
usra-0.069
Token a
Token ID257
Feature activation-1.98e-01

Pos logit contributions
agement+1.116
gil+1.038
 Email+1.018
ODY+1.014
arge+1.013
Neg logit contributions
ixel-1.377
 popul-1.331
jri-1.327
usra-1.249
 )))-1.248
Token TR
Token ID7579
Feature activation+1.03e+00

Pos logit contributions
ixel+0.136
 popul+0.131
jri+0.130
 )))+0.122
 forfe+0.121
Neg logit contributions
agement-0.137
gil-0.129
arge-0.126
 Email-0.126
ODY-0.126
TokenON
Token ID1340
Feature activation+1.25e+00

Pos logit contributions
agement+0.424
gil+0.379
ODY+0.374
 Email+0.371
arge+0.369
Neg logit contributions
ixel-0.988
 popul-0.967
jri-0.963
 forfe-0.915
usra-0.914
Token-
Token ID12
Feature activation-2.06e+00

Pos logit contributions
agement+0.863
gil+0.805
 Email+0.795
arge+0.794
ODY+0.792
Neg logit contributions
ixel-0.859
 popul-0.829
jri-0.822
usra-0.771
 forfe-0.770
Tokeninspired
Token ID24194
Feature activation+9.25e-01

Pos logit contributions
ixel+1.461
 popul+1.414
jri+1.404
 )))+1.322
 forfe+1.310
Neg logit contributions
agement-1.358
gil-1.279
ODY-1.257
 Email-1.253
arge-1.252
Token watch
Token ID2342
Feature activation+7.78e-01

Pos logit contributions
agement+0.592
gil+0.551
 Email+0.543
arge+0.541
ODY+0.540
Neg logit contributions
ixel-0.682
 popul-0.658
jri-0.655
usra-0.616
 )))-0.614
Token each
Token ID1123
Feature activation-1.28e+00

Pos logit contributions
agement+0.513
gil+0.474
 Email+0.467
arge+0.466
ODY+0.463
Neg logit contributions
ixel-0.708
jri-0.687
 popul-0.684
 )))-0.645
usra-0.641
Token designed
Token ID3562
Feature activation-7.42e-01

Pos logit contributions
ixel+0.932
 popul+0.899
jri+0.898
 )))+0.839
usra+0.835
Neg logit contributions
agement-0.825
gil-0.770
 Email-0.757
arge-0.757
ODY-0.753
Token for
Token ID329
Feature activation-1.06e+00

Pos logit contributions
ixel+0.530
 popul+0.512
jri+0.510
 )))+0.478
 forfe+0.475
Neg logit contributions
agement-0.490
gil-0.460
arge-0.451
 Email-0.450
ODY-0.449
Token 50
Token ID2026
Feature activation-9.42e-01

Pos logit contributions
ixel+0.810
 popul+0.787
jri+0.781
 )))+0.737
 forfe+0.736
Neg logit contributions
agement-0.646
gil-0.603
arge-0.589
ODY-0.589
 Email-0.589
Token-
Token ID12
Feature activation-2.90e-01

Pos logit contributions
ixel+0.617
 popul+0.594
jri+0.590
 )))+0.551
 forfe+0.549
Neg logit contributions
agement-0.679
gil-0.640
arge-0.630
 Email-0.628
ODY-0.627
Tokenyear
Token ID1941
Feature activation-4.45e+00

Pos logit contributions
ixel+0.237
 popul+0.230
jri+0.229
 )))+0.217
 forfe+0.217
Neg logit contributions
agement-0.160
gil-0.148
ODY-0.145
 Email-0.145
arge-0.144
Token service
Token ID2139
Feature activation+1.17e+00

Pos logit contributions
ixel+2.861
 popul+2.804
jri+2.709
 diss+2.636
 forfe+2.623
Neg logit contributions
agement-3.152
gil-3.004
ODY-2.966
 Email-2.929
TPS-2.910
Token life
Token ID1204
Feature activation-2.63e-02

Pos logit contributions
agement+0.773
gil+0.718
arge+0.710
 Email+0.708
ODY+0.705
Neg logit contributions
ixel-0.837
jri-0.807
 popul-0.806
 )))-0.755
usra-0.752
Token with
Token ID351
Feature activation+2.58e-01

Pos logit contributions
ixel+0.019
 popul+0.018
jri+0.018
 )))+0.017
usra+0.017
Neg logit contributions
agement-0.017
gil-0.016
 Email-0.016
arge-0.016
ODY-0.016
Token one
Token ID530
Feature activation-2.38e+00

Pos logit contributions
agement+0.159
gil+0.149
 Email+0.146
arge+0.145
ODY+0.145
Neg logit contributions
ixel-0.196
 popul-0.189
jri-0.189
 )))-0.178
usra-0.176
Token mid
Token ID3095
Feature activation-7.11e-01

Pos logit contributions
ixel+1.612
 popul+1.569
jri+1.549
 forfe+1.460
 )))+1.457
Neg logit contributions
agement-1.641
gil-1.552
ODY-1.520
 Email-1.512
arge-1.509
Tokenod
Token ID375
Feature activation+1.28e+00

Pos logit contributions
agement+0.011
gil+0.010
 Email+0.010
ODY+0.010
arge+0.010
Neg logit contributions
ixel-0.014
 popul-0.013
jri-0.013
 )))-0.012
 forfe-0.012
Tokenongo
Token ID25162
Feature activation-3.83e-01

Pos logit contributions
agement+0.794
gil+0.744
arge+0.726
 Email+0.726
ODY+0.726
Neg logit contributions
ixel-0.960
 popul-0.932
jri-0.926
 )))-0.872
 forfe-0.871
Token
Token ID447
Feature activation+9.38e-01

Pos logit contributions
ixel+0.288
 popul+0.279
jri+0.276
 )))+0.261
 forfe+0.261
Neg logit contributions
agement-0.239
gil-0.223
arge-0.218
ODY-0.217
 Email-0.217
Token
Token ID247
Feature activation+1.80e+00

Pos logit contributions
agement+0.538
gil+0.500
ODY+0.488
arge+0.488
 Email+0.487
Neg logit contributions
ixel-0.751
 popul-0.728
jri-0.723
 )))-0.686
 forfe-0.684
Tokens
Token ID82
Feature activation+1.11e+00

Pos logit contributions
agement+1.129
gil+1.058
ODY+1.033
 Email+1.032
arge+1.032
Neg logit contributions
ixel-1.334
 popul-1.295
jri-1.285
 )))-1.211
 forfe-1.209
Token Cavern
Token ID40230
Feature activation-4.66e+00

Pos logit contributions
agement+0.730
gil+0.687
 Email+0.672
arge+0.671
ODY+0.671
Neg logit contributions
ixel-0.795
 popul-0.769
jri-0.765
 )))-0.717
 forfe-0.716
Token
Token ID447
Feature activation+7.91e-01

Pos logit contributions
ixel+3.378
 popul+3.295
jri+3.237
 )))+3.102
+3.073
Neg logit contributions
agement-2.952
gil-2.725
TPS-2.716
ODY-2.670
 Email-2.657
Token
Token ID247
Feature activation+1.61e+00

Pos logit contributions
agement+0.451
gil+0.419
ODY+0.409
arge+0.409
 Email+0.409
Neg logit contributions
ixel-0.635
 popul-0.616
jri-0.612
 )))-0.580
 forfe-0.579
Token,
Token ID11
Feature activation-2.72e+00

Pos logit contributions
agement+0.975
gil+0.911
 Email+0.889
ODY+0.889
arge+0.888
Neg logit contributions
ixel-1.230
 popul-1.195
jri-1.187
usra-1.119
 )))-1.119
Token
Token ID564
Feature activation-2.29e+00

Pos logit contributions
ixel+1.519
 popul+1.460
jri+1.430
 )))+1.355
 forfe+1.337
Neg logit contributions
agement-2.199
gil-2.083
arge-2.054
ODY-2.043
 Email-2.040
Token
Token ID246
Feature activation+7.39e-01

Pos logit contributions
ixel+1.713
 popul+1.628
jri+1.625
 )))+1.560
 forfe+1.539
Neg logit contributions
agement-1.438
gil-1.329
 Email-1.307
ODY-1.306
arge-1.299
Tokenres
Token ID411
Feature activation+1.76e-01

Pos logit contributions
agement+0.236
gil+0.219
ODY+0.215
 Email+0.214
arge+0.213
Neg logit contributions
ixel-0.335
 popul-0.324
jri-0.322
 )))-0.306
 forfe-0.305
Tokenize
Token ID1096
Feature activation+3.62e-01

Pos logit contributions
agement+0.094
gil+0.087
ODY+0.085
 Email+0.085
arge+0.084
Neg logit contributions
ixel-0.148
 popul-0.144
jri-0.143
 )))-0.136
 forfe-0.135
Token(
Token ID7
Feature activation-1.06e+00

Pos logit contributions
agement+0.218
gil+0.202
arge+0.198
ODY+0.197
 Email+0.196
Neg logit contributions
ixel-0.278
 popul-0.268
jri-0.267
 )))-0.256
 forfe-0.254
Tokenimage
Token ID9060
Feature activation-3.04e+00

Pos logit contributions
ixel+0.759
 popul+0.717
jri+0.710
 )))+0.678
usra+0.675
Neg logit contributions
agement-0.699
gil-0.649
ODY-0.645
 Email-0.642
TPS-0.641
Token,
Token ID11
Feature activation-1.75e+00

Pos logit contributions
ixel+1.915
 popul+1.818
jri+1.765
 )))+1.765
+1.739
Neg logit contributions
agement-2.185
arge-2.040
gil-2.038
ODY-2.010
TPS-1.991
Token height
Token ID6001
Feature activation-4.36e+00

Pos logit contributions
ixel+1.247
 popul+1.212
jri+1.181
 )))+1.140
 forfe+1.139
Neg logit contributions
agement-1.142
gil-1.091
ODY-1.066
arge-1.049
TPS-1.044
Token=
Token ID28
Feature activation+1.21e+00

Pos logit contributions
ixel+2.607
jri+2.462
 popul+2.453
 )))+2.427
+2.409
Neg logit contributions
agement-3.246
ODY-3.048
gil-3.022
arge-3.021
 Email-2.962
Token600
Token ID8054
Feature activation+1.18e+00

Pos logit contributions
agement+0.653
gil+0.608
arge+0.593
TPS+0.589
 Email+0.589
Neg logit contributions
ixel-0.999
 popul-0.980
jri-0.976
 forfe-0.923
usra-0.919
Token)
Token ID8
Feature activation-2.40e-01

Pos logit contributions
agement+0.655
gil+0.608
arge+0.592
ODY+0.592
 Email+0.592
Neg logit contributions
ixel-0.962
 popul-0.935
jri-0.929
 )))-0.881
 forfe-0.879
Token gray
Token ID12768
Feature activation-5.15e-01

Pos logit contributions
ixel+0.173
 popul+0.168
jri+0.164
 )))+0.158
 forfe+0.157
Neg logit contributions
agement-0.157
gil-0.147
ODY-0.145
TPS-0.144
arge-0.144
Token =
Token ID796
Feature activation-6.48e-01

Pos logit contributions
ixel+0.344
 popul+0.330
jri+0.328
 )))+0.311
 forfe+0.307
Neg logit contributions
agement-0.363
gil-0.341
ODY-0.336
arge-0.335
 Email-0.334
Token
Token ID35705
Feature activation-9.37e-01

Pos logit contributions
ixel+0.929
 popul+0.905
jri+0.900
 )))+0.860
 forfe+0.855
Neg logit contributions
agement-0.491
gil-0.449
 Email-0.438
arge-0.435
ODY-0.433
Token
Token ID98
Feature activation+2.01e+00

Pos logit contributions
ixel+0.679
 popul+0.659
jri+0.655
 )))+0.618
 forfe+0.617
Neg logit contributions
agement-0.602
gil-0.566
arge-0.555
ODY-0.555
 Email-0.552
Token30
Token ID1270
Feature activation+5.70e-01

Pos logit contributions
agement+0.818
gil+0.739
arge+0.712
ODY+0.712
 Email+0.711
Neg logit contributions
ixel-1.951
 popul-1.903
jri-1.893
 forfe-1.806
usra-1.804
Token kg
Token ID14211
Feature activation-3.52e+00

Pos logit contributions
agement+0.404
gil+0.381
ODY+0.375
arge+0.374
 Email+0.374
Neg logit contributions
ixel-0.380
 popul-0.366
jri-0.364
 )))-0.339
 forfe-0.338
Token/
Token ID14
Feature activation-1.84e+00

Pos logit contributions
ixel+1.819
 popul+1.744
jri+1.710
 )))+1.603
 diss+1.588
Neg logit contributions
agement-2.995
gil-2.850
arge-2.795
 Email-2.779
ODY-2.778
Tokenm
Token ID76
Feature activation-4.90e+00

Pos logit contributions
ixel+1.315
 popul+1.267
jri+1.257
 )))+1.198
usra+1.186
Neg logit contributions
agement-1.197
gil-1.114
ODY-1.106
 Email-1.100
TPS-1.093
Token²
Token ID31185
Feature activation-1.44e+00

Pos logit contributions
ixel+2.646
 popul+2.464
jri+2.448
 )))+2.317
usra+2.291
Neg logit contributions
agement-4.055
gil-3.864
ODY-3.800
 Email-3.793
TPS-3.775
Token).
Token ID737
Feature activation-1.02e-01

Pos logit contributions
ixel+0.964
 popul+0.933
jri+0.924
 )))+0.866
 forfe+0.863
Neg logit contributions
agement-1.017
gil-0.959
arge-0.939
 Email-0.938
ODY-0.938
Token
Token ID198
Feature activation-8.16e-01

Pos logit contributions
ixel+0.072
 popul+0.070
jri+0.070
 )))+0.065
 forfe+0.065
Neg logit contributions
agement-0.068
gil-0.064
 Email-0.063
ODY-0.062
arge-0.062
Token
Token ID198
Feature activation-8.17e-01

Pos logit contributions
ixel+0.571
 popul+0.552
jri+0.545
 )))+0.523
 forfe+0.517
Neg logit contributions
agement-0.544
gil-0.509
ODY-0.501
arge-0.500
 Email-0.495
TokenIn
Token ID818
Feature activation+1.48e-01

Pos logit contributions
ixel+0.582
 popul+0.563
jri+0.559
 )))+0.527
 forfe+0.523
Neg logit contributions
agement-0.538
gil-0.505
arge-0.495
 Email-0.494
ODY-0.492
Token6
Token ID21
Feature activation+9.34e-01

Pos logit contributions
agement+0.553
gil+0.513
ODY+0.499
arge+0.499
 Email+0.498
Neg logit contributions
ixel-0.839
 popul-0.817
jri-0.811
 )))-0.769
 forfe-0.768
Token)
Token ID8
Feature activation-1.87e+00

Pos logit contributions
agement+0.514
gil+0.476
ODY+0.465
 Email+0.465
arge+0.465
Neg logit contributions
ixel-0.767
 popul-0.746
jri-0.742
 )))-0.702
 forfe-0.701
Token C
Token ID327
Feature activation-1.16e+00

Pos logit contributions
ixel+1.377
 popul+1.336
jri+1.317
 )))+1.257
 forfe+1.247
Neg logit contributions
agement-1.194
gil-1.117
arge-1.085
ODY-1.085
TPS-1.080
Token
Token ID1906
Feature activation-4.32e+00

Pos logit contributions
ixel+0.896
 popul+0.868
jri+0.861
 )))+0.815
 forfe+0.812
Neg logit contributions
agement-0.695
gil-0.650
 Email-0.633
arge-0.631
ODY-0.628
TokenS
Token ID50
Feature activation-6.10e-01

Pos logit contributions
ixel+2.938
 popul+2.829
jri+2.783
 )))+2.677
 diss+2.634
Neg logit contributions
agement-2.947
gil-2.768
arge-2.711
 Email-2.693
ODY-2.633
Token––
Token ID25608
Feature activation-5.71e+00

Pos logit contributions
ixel+0.406
 popul+0.391
jri+0.388
 )))+0.363
 forfe+0.362
Neg logit contributions
agement-0.433
gil-0.408
 Email-0.400
arge-0.399
ODY-0.399
Token
Token ID1906
Feature activation-2.68e+00

Pos logit contributions
ixel+3.460
 popul+3.313
jri+3.280
 )))+3.162
 forfe+3.101
Neg logit contributions
agement-4.209
gil-4.005
arge-3.934
TPS-3.906
 Email-3.858
Token One
Token ID1881
Feature activation-1.31e+00

Pos logit contributions
ixel+1.819
 popul+1.757
jri+1.736
 )))+1.647
 forfe+1.633
Neg logit contributions
agement-1.848
gil-1.747
arge-1.709
 Email-1.692
ODY-1.688
Token of
Token ID286
Feature activation-2.44e-02

Pos logit contributions
ixel+0.897
 popul+0.869
jri+0.861
 )))+0.810
 forfe+0.807
Neg logit contributions
agement-0.904
gil-0.851
arge-0.832
ODY-0.831
 Email-0.831
Token the
Token ID262
Feature activation-1.38e+00

Pos logit contributions
ixel+0.017
 popul+0.017
jri+0.017
 )))+0.016
 forfe+0.016
Neg logit contributions
agement-0.016
gil-0.015
arge-0.015
ODY-0.015
 Email-0.015
Token Mag
Token ID2944
Feature activation-1.51e+00

Pos logit contributions
ixel+0.958
 popul+0.933
jri+0.920
 )))+0.867
 forfe+0.865
Neg logit contributions
agement-0.933
gil-0.875
ODY-0.856
 Email-0.855
arge-0.855