TOP ACTIVATIONS
MAX = 2.209

; Coral Sea Islands ; Heard Island and McDonald Islands ;
transport infrastructure , such as motor ways and railways
Islands 14 . 6 8 Bots wana 14 . 9 9
June 2012 , the C FP B launched a consumer help
re going to discuss the Vend read Trap Card that will
N . " special rapport eur " for the protection of
Brennan Center filed an am icus brief calling for the Court
However he said that the general classification is a bigger goal
for comments about the mother grizz ly killing a person in
middle - class any motor vehicle , TV , computer
, but the chassis and general handling are quite similar [
rez , with an army general ; Mr . Ram í
t make the repeal crus aders feel discouraged enough
S . dollars from rem itt ances , tourism and pal
next time ! <|endoftext|> BR USS ELS ( Reuters ) -
iv loans likely also lacked collateral . Man agers
- mentioned assets a motor vehicle , a TV ,
f 4 b be 66 IEEE 802 . 11 b gn
team to share with the general public ; quotes from coaches
) Will Del inqu encies Trigger a New American

MIN ACTIVATIONS
MIN = -2.646

2012 United States 5 667 Posts # 12 I 'm glad
June 2012 Canada 16 10 Posts # 14 On September 20
2002 United States 3 472 Posts # 11 On September 20
2010 United States 6 83 Posts Last Edited : 2013 -
2011 United States 6 12 Posts # 17 Good to hear
2010 United States 21 50 Posts # 16 Glad to hear
2010 United States 10 53 Posts # 15 honestly if we
June 2009 Germany 247 34 Posts # 20 Nice , happy
2011 United States 228 65 Posts # 13 On September 20
TL AD T 235 21 Posts # 18 Good stuff !
United States 3 472 Posts # 11 On September 20 2013
United States 10 53 Posts # 15 honestly if we just
2012 Canada 16 10 Posts # 14 On September 20 2013
United States 228 65 Posts # 13 On September 20 2013
2009 Germany 247 34 Posts # 20 Nice , happy to
2012 United States 3 240 Posts # 10 Happy it worked
United States 21 50 Posts # 16 Glad to hear everything
United States 5 667 Posts # 12 I 'm glad there
United States 6 12 Posts # 17 Good to hear .
AD T 235 21 Posts # 18 Good stuff ! Great

INTERVAL 0.995 - 2.209
CONTAINS 0.804%

conflict , refugees , displacement all that stuff has an
Touch ( Use with extreme caution !) Touch can
the Elven forest of L oth l ó rien , where
I wanted the reader to feel like they were reading quickly
agreed . Similarly , the pian ist 's disagreement with

INTERVAL -0.219 - 0.995
CONTAINS 76.087%

moreover , were sk itt ish of formally align ing themselves
and making a couple of turns about an 1 - 2
33 19 21 19 09
this pr imate was uniquely adapted to its environment , Dr
court accepts a central government legal challenge related to any of

INTERVAL -1.433 - -0.219
CONTAINS 23.036%

, Bird of Pre y is a psychedelic indie rock band
ator are pointing to the lack of an imp assion ed
minded voters that accused Obama of lying about his position on
prince of comics himself : Frank Miller . How could this
him for you . <|endoftext|> Tan aka 's artwork is ,

INTERVAL -2.646 - -1.433
CONTAINS 0.073%

View all New York Times newsletters .
By comparison : Earth 's moon is already in an Earth
Windows For W AB L members , this app
that also no longer works .) The table of
3 lit ers of w ort , which is just enough
Token;
Token ID26
Feature activation-1.17e-01

Pos logit contributions
thora+1.279
icket+1.249
oli+1.229
 incent+1.210
emouth+1.210
Neg logit contributions
 contrary-0.470
 Uk-0.450
 Outbreak-0.426
 Kah-0.424
 Melt-0.420
Token Coral
Token ID41390
Feature activation+1.01e+00

Pos logit contributions
 contrary+0.168
 Uk+0.162
 Kah+0.156
 Outbreak+0.154
 Melt+0.154
Neg logit contributions
thora-0.297
icket-0.284
oli-0.279
 incent-0.275
ashtra-0.274
Token Sea
Token ID6896
Feature activation+4.65e-01

Pos logit contributions
thora+2.754
icket+2.640
oli+2.597
 incent+2.568
ashtra+2.556
Neg logit contributions
 contrary-1.279
 Uk-1.227
 Kah-1.175
 Outbreak-1.159
 Melt-1.158
Token Islands
Token ID12010
Feature activation+7.45e-01

Pos logit contributions
thora+1.341
icket+1.289
oli+1.269
 incent+1.255
ashtra+1.250
Neg logit contributions
 contrary-0.506
 Uk-0.483
 Kah-0.458
 Outbreak-0.452
 Melt-0.450
Token;
Token ID26
Feature activation-9.85e-02

Pos logit contributions
thora+2.084
icket+2.033
oli+1.997
 incent+1.969
ashtra+1.968
Neg logit contributions
 contrary-0.798
 Uk-0.770
 Outbreak-0.726
 Kah-0.724
posts-0.714
Token Heard
Token ID49417
Feature activation+2.21e+00

Pos logit contributions
 contrary+0.143
 Uk+0.138
 Kah+0.133
 Melt+0.131
 Outbreak+0.131
Neg logit contributions
thora-0.249
icket-0.238
oli-0.234
 incent-0.231
ashtra-0.230
Token Island
Token ID5451
Feature activation+4.71e-01

Pos logit contributions
thora+4.801
icket+4.620
oli+4.591
emouth+4.514
iral+4.471
Neg logit contributions
 contrary-3.795
 Uk-3.658
 Melt-3.561
 Outbreak-3.557
posts-3.523
Token and
Token ID290
Feature activation+4.80e-02

Pos logit contributions
thora+1.252
icket+1.198
oli+1.176
 incent+1.162
ashtra+1.157
Neg logit contributions
 contrary-0.629
 Uk-0.606
 Kah-0.583
 Outbreak-0.574
 Melt-0.572
Token McDonald
Token ID14115
Feature activation-3.41e-01

Pos logit contributions
thora+0.124
icket+0.118
oli+0.116
 incent+0.114
ashtra+0.114
Neg logit contributions
 contrary-0.069
 Uk-0.066
 Kah-0.064
 Melt-0.063
 Outbreak-0.062
Token Islands
Token ID12010
Feature activation+5.27e-01

Pos logit contributions
 contrary+0.649
 Uk+0.632
 Kah+0.617
 Melt+0.609
 Outbreak+0.608
Neg logit contributions
thora-0.713
icket-0.671
oli-0.657
 incent-0.648
ashtra-0.645
Token;
Token ID26
Feature activation-2.13e-01

Pos logit contributions
thora+1.503
icket+1.465
oli+1.440
 incent+1.418
emouth+1.417
Neg logit contributions
 contrary-0.554
 Uk-0.529
 Kah-0.498
 Outbreak-0.497
 Melt-0.490
Token transport
Token ID4839
Feature activation+2.53e-01

Pos logit contributions
 contrary+0.089
 Uk+0.087
 Kah+0.084
 Melt+0.083
 Outbreak+0.082
Neg logit contributions
thora-0.097
icket-0.091
oli-0.089
ashtra-0.087
 incent-0.087
Token infrastructure
Token ID6884
Feature activation+2.93e-01

Pos logit contributions
thora+0.562
icket+0.532
oli+0.520
 incent+0.512
ashtra+0.511
Neg logit contributions
 contrary-0.450
 Uk-0.436
 Kah-0.422
 Melt-0.418
 Outbreak-0.416
Token,
Token ID11
Feature activation+9.75e-02

Pos logit contributions
thora+0.749
icket+0.715
oli+0.702
 incent+0.693
ashtra+0.691
Neg logit contributions
 contrary-0.420
 Uk-0.405
 Kah-0.389
 Outbreak-0.384
 Melt-0.384
Token such
Token ID884
Feature activation-3.59e-01

Pos logit contributions
thora+0.249
icket+0.238
oli+0.234
ashtra+0.230
 incent+0.230
Neg logit contributions
 contrary-0.141
 Uk-0.135
 Kah-0.130
 Melt-0.128
 Outbreak-0.127
Token as
Token ID355
Feature activation-1.61e-01

Pos logit contributions
 contrary+0.593
 Uk+0.567
 Kah+0.546
posts+0.539
 KS+0.539
Neg logit contributions
thora-0.868
 incent-0.807
icket-0.806
oli-0.790
ashtra-0.789
Token motor
Token ID5584
Feature activation+2.13e+00

Pos logit contributions
 contrary+0.294
 Uk+0.287
 Kah+0.279
 KS+0.274
 Melt+0.274
Neg logit contributions
thora-0.350
icket-0.331
oli-0.324
ashtra-0.318
emouth-0.315
Tokenways
Token ID1322
Feature activation+4.81e-01

Pos logit contributions
thora+4.403
icket+4.218
oli+4.158
 incent+4.054
iral+4.023
Neg logit contributions
 contrary-3.892
 Uk-3.811
 Outbreak-3.786
 Kah-3.736
 Melt-3.699
Token and
Token ID290
Feature activation-4.36e-02

Pos logit contributions
thora+1.262
icket+1.210
oli+1.189
 incent+1.175
ashtra+1.168
Neg logit contributions
 contrary-0.648
 Uk-0.626
 Kah-0.599
 Outbreak-0.596
 Melt-0.592
Token railways
Token ID45247
Feature activation-1.05e-01

Pos logit contributions
 contrary+0.081
 Uk+0.079
 Kah+0.077
 KS+0.075
 Melt+0.075
Neg logit contributions
thora-0.094
icket-0.089
oli-0.087
ashtra-0.085
 incent-0.085
Token
Token ID198
Feature activation-2.15e-01

Pos logit contributions
 contrary+0.143
 Uk+0.138
 Kah+0.132
 Outbreak+0.131
 Melt+0.130
Neg logit contributions
thora-0.273
icket-0.261
oli-0.257
 incent-0.254
ashtra-0.253
Token
Token ID198
Feature activation-2.36e-01

Pos logit contributions
 contrary+0.304
 Uk+0.291
 Kah+0.281
 Melt+0.277
 Outbreak+0.276
Neg logit contributions
thora-0.560
icket-0.533
 incent-0.529
oli-0.522
ashtra-0.518
Token Islands
Token ID12010
Feature activation+1.11e+00

Pos logit contributions
thora+1.466
icket+1.397
oli+1.343
 incent+1.341
iral+1.334
Neg logit contributions
 contrary-1.268
 Uk-1.245
 Kah-1.216
 Melt-1.193
 KS-1.189
Token 14
Token ID1478
Feature activation-1.46e-01

Pos logit contributions
thora+2.852
icket+2.721
oli+2.691
 incent+2.644
ashtra+2.638
Neg logit contributions
 contrary-1.522
 Uk-1.458
 Kah-1.397
 Outbreak-1.393
 Melt-1.386
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
 contrary+0.229
 Uk+0.219
 Kah+0.211
 Outbreak+0.210
 Melt+0.209
Neg logit contributions
thora-0.353
icket-0.334
oli-0.330
 incent-0.325
ashtra-0.323
Token6
Token ID21
Feature activation-1.74e-01

Pos logit contributions
 contrary+0.145
 Uk+0.139
 Kah+0.133
 Outbreak+0.132
 Melt+0.132
Neg logit contributions
thora-0.271
icket-0.259
oli-0.255
 incent-0.251
ashtra-0.250
Token 8
Token ID807
Feature activation-1.63e-01

Pos logit contributions
 contrary+0.251
 Uk+0.243
 Kah+0.234
 Outbreak+0.231
 Melt+0.231
Neg logit contributions
thora-0.445
icket-0.426
oli-0.417
 incent-0.412
ashtra-0.411
Token Bots
Token ID40946
Feature activation+2.05e+00

Pos logit contributions
 contrary+0.269
 Uk+0.264
 Kah+0.255
 Melt+0.250
 Outbreak+0.250
Neg logit contributions
thora-0.384
icket-0.368
oli-0.359
 incent-0.356
ashtra-0.354
Tokenwana
Token ID49484
Feature activation+6.43e-01

Pos logit contributions
thora+3.441
icket+3.222
oli+3.136
ashtra+3.046
emouth+3.030
Neg logit contributions
 contrary-4.707
 Uk-4.588
 Kah-4.487
 Outbreak-4.478
 Melt-4.475
Token 14
Token ID1478
Feature activation-4.37e-02

Pos logit contributions
thora+1.707
icket+1.634
oli+1.611
 incent+1.586
ashtra+1.582
Neg logit contributions
 contrary-0.839
 Uk-0.802
 Kah-0.768
 Outbreak-0.762
 Melt-0.761
Token.
Token ID13
Feature activation-1.52e-01

Pos logit contributions
 contrary+0.069
 Uk+0.066
 Kah+0.064
 Outbreak+0.064
 Melt+0.063
Neg logit contributions
thora-0.105
icket-0.100
oli-0.098
 incent-0.097
ashtra-0.096
Token9
Token ID24
Feature activation-1.12e-01

Pos logit contributions
 contrary+0.208
 Uk+0.200
 Kah+0.192
 Outbreak+0.190
 Melt+0.190
Neg logit contributions
thora-0.396
icket-0.379
oli-0.373
 incent-0.367
ashtra-0.367
Token 9
Token ID860
Feature activation-2.40e-01

Pos logit contributions
 contrary+0.141
 Uk+0.136
 Kah+0.130
 Outbreak+0.129
 Melt+0.128
Neg logit contributions
thora-0.305
icket-0.292
oli-0.286
 incent-0.285
ashtra-0.283
Token June
Token ID2795
Feature activation-1.19e-01

Pos logit contributions
 contrary+0.287
 Uk+0.276
 Kah+0.268
 Melt+0.264
 Outbreak+0.264
Neg logit contributions
thora-0.419
icket-0.399
oli-0.390
 incent-0.387
ashtra-0.385
Token 2012
Token ID2321
Feature activation+7.29e-01

Pos logit contributions
 contrary+0.160
 Uk+0.154
 Kah+0.148
 Melt+0.146
 Outbreak+0.145
Neg logit contributions
thora-0.316
icket-0.303
oli-0.296
 incent-0.294
ashtra-0.293
Token,
Token ID11
Feature activation-9.98e-02

Pos logit contributions
thora+1.863
icket+1.784
oli+1.765
ashtra+1.727
 incent+1.725
Neg logit contributions
 contrary-1.002
 Uk-0.966
 Kah-0.928
 Outbreak-0.926
 Melt-0.916
Token the
Token ID262
Feature activation-3.28e-02

Pos logit contributions
 contrary+0.161
 Uk+0.155
 Kah+0.150
 Melt+0.148
 Outbreak+0.147
Neg logit contributions
thora-0.240
icket-0.227
oli-0.223
 incent-0.221
ashtra-0.220
Token C
Token ID327
Feature activation+2.33e-04

Pos logit contributions
 contrary+0.046
 Uk+0.045
 Kah+0.043
 Melt+0.042
 Outbreak+0.042
Neg logit contributions
thora-0.085
icket-0.081
oli-0.080
 incent-0.079
ashtra-0.079
TokenFP
Token ID5837
Feature activation+2.05e+00

Pos logit contributions
thora+0.001
icket+0.001
 incent+0.001
oli+0.001
ashtra+0.001
Neg logit contributions
 contrary-0.000
 Uk-0.000
 Melt-0.000
 Kah-0.000
 Outbreak-0.000
TokenB
Token ID33
Feature activation-8.69e-02

Pos logit contributions
thora+4.525
icket+4.304
oli+4.203
iral+4.107
ashtra+4.063
Neg logit contributions
 contrary-3.501
 Uk-3.420
 Kah-3.345
 Outbreak-3.343
posts-3.340
Token launched
Token ID5611
Feature activation+5.43e-02

Pos logit contributions
 contrary+0.147
 Uk+0.143
 Kah+0.138
 Melt+0.136
 Outbreak+0.136
Neg logit contributions
thora-0.202
icket-0.191
oli-0.187
ashtra-0.185
 incent-0.184
Token a
Token ID257
Feature activation+2.32e-01

Pos logit contributions
thora+0.124
icket+0.118
oli+0.116
 incent+0.114
ashtra+0.114
Neg logit contributions
 contrary-0.092
 Uk-0.089
 Kah-0.086
 Outbreak-0.085
 Melt-0.085
Token consumer
Token ID7172
Feature activation+1.06e+00

Pos logit contributions
thora+0.510
icket+0.483
oli+0.473
 incent+0.467
ashtra+0.466
Neg logit contributions
 contrary-0.418
 Uk-0.404
 Kah-0.392
 Melt-0.389
 Outbreak-0.388
Token help
Token ID1037
Feature activation+4.32e-01

Pos logit contributions
thora+2.320
icket+2.216
oli+2.169
 incent+2.128
ashtra+2.114
Neg logit contributions
 contrary-1.841
 Uk-1.794
 Kah-1.740
 Outbreak-1.733
 Melt-1.726
Tokenre
Token ID260
Feature activation-1.36e-01

Pos logit contributions
thora+0.447
icket+0.422
oli+0.413
 incent+0.409
ashtra+0.406
Neg logit contributions
 contrary-0.372
 Uk-0.362
 Kah-0.354
 Melt-0.349
 Outbreak-0.347
Token going
Token ID1016
Feature activation+1.64e-01

Pos logit contributions
 contrary+0.239
 Uk+0.231
 Kah+0.224
 Melt+0.223
 Outbreak+0.221
Neg logit contributions
thora-0.306
icket-0.288
oli-0.281
ashtra-0.277
iral-0.277
Token to
Token ID284
Feature activation-2.44e-01

Pos logit contributions
thora+0.452
icket+0.433
oli+0.426
 incent+0.422
ashtra+0.419
Neg logit contributions
 contrary-0.203
 Uk-0.194
 Kah-0.186
 Outbreak-0.183
 Melt-0.183
Token discuss
Token ID2112
Feature activation-6.10e-01

Pos logit contributions
 contrary+0.471
 Uk+0.457
 Kah+0.448
 Melt+0.447
 Outbreak+0.439
Neg logit contributions
thora-0.507
icket-0.475
oli-0.463
ashtra-0.458
emouth-0.456
Token the
Token ID262
Feature activation+1.02e-01

Pos logit contributions
 contrary+1.035
 Uk+1.007
 Kah+0.980
 Outbreak+0.972
 Melt+0.971
Neg logit contributions
thora-1.407
icket-1.334
oli-1.303
ashtra-1.290
 incent-1.287
Token Vend
Token ID44220
Feature activation+2.03e+00

Pos logit contributions
thora+0.234
icket+0.223
oli+0.218
 incent+0.215
ashtra+0.214
Neg logit contributions
 contrary-0.174
 Uk-0.169
 Kah-0.163
 Outbreak-0.162
 Melt-0.162
Tokenread
Token ID961
Feature activation+6.32e-02

Pos logit contributions
thora+5.288
icket+5.149
oli+4.992
iral+4.928
ashtra+4.888
Neg logit contributions
 contrary-2.701
 Uk-2.615
 Kah-2.502
 Outbreak-2.474
 Melt-2.452
Token Trap
Token ID21914
Feature activation+3.82e-01

Pos logit contributions
thora+0.137
icket+0.130
oli+0.127
 incent+0.125
ashtra+0.125
Neg logit contributions
 contrary-0.115
 Uk-0.112
 Kah-0.109
 Outbreak-0.108
 Melt-0.108
Token Card
Token ID5172
Feature activation-2.49e-01

Pos logit contributions
thora+0.914
icket+0.865
oli+0.851
 incent+0.841
ashtra+0.840
Neg logit contributions
 contrary-0.615
 Uk-0.594
 Kah-0.574
 Outbreak-0.571
 Melt-0.571
Token that
Token ID326
Feature activation+2.58e-02

Pos logit contributions
 contrary+0.376
 Uk+0.361
 Kah+0.353
 Melt+0.349
 Outbreak+0.348
Neg logit contributions
thora-0.618
icket-0.590
oli-0.578
 incent-0.570
ashtra-0.568
Token will
Token ID481
Feature activation-3.56e-02

Pos logit contributions
thora+0.060
icket+0.057
oli+0.056
 incent+0.055
ashtra+0.055
Neg logit contributions
 contrary-0.043
 Uk-0.042
 Kah-0.041
 Melt-0.041
 Outbreak-0.040
TokenN
Token ID45
Feature activation-1.94e-01

Pos logit contributions
 contrary+0.727
 Uk+0.704
 Kah+0.687
 Outbreak+0.679
 Melt+0.677
Neg logit contributions
thora-0.923
icket-0.883
oli-0.867
 incent-0.846
iral-0.846
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
 contrary+0.297
 Uk+0.287
 Kah+0.277
 Outbreak+0.274
 Melt+0.273
Neg logit contributions
thora-0.474
icket-0.452
oli-0.443
 incent-0.438
ashtra-0.436
Token "
Token ID366
Feature activation+1.93e-01

Pos logit contributions
 contrary+0.372
 Uk+0.362
 Kah+0.349
 Melt+0.348
 Outbreak+0.347
Neg logit contributions
thora-0.456
icket-0.429
oli-0.416
 incent-0.415
ashtra-0.412
Tokenspecial
Token ID20887
Feature activation+4.69e-01

Pos logit contributions
thora+0.417
icket+0.389
 incent+0.381
oli+0.379
ashtra+0.376
Neg logit contributions
 contrary-0.356
 Uk-0.346
posts-0.336
 Outbreak-0.334
 Kah-0.334
Token rapport
Token ID49498
Feature activation+1.32e+00

Pos logit contributions
thora+0.823
icket+0.766
oli+0.739
ashtra+0.731
 incent+0.730
Neg logit contributions
 contrary-1.058
 Uk-1.033
 Kah-1.004
 Melt-0.996
 Outbreak-0.994
Tokeneur
Token ID23365
Feature activation+2.03e+00

Pos logit contributions
thora+0.567
icket+0.417
oli+0.355
 incent+0.322
ashtra+0.312
Neg logit contributions
 contrary-4.677
 Uk-4.606
 Kah-4.534
 Outbreak-4.513
 Melt-4.512
Token"
Token ID1
Feature activation+5.74e-01

Pos logit contributions
thora+4.625
oli+4.447
icket+4.438
 incent+4.323
iral+4.295
Neg logit contributions
 contrary-3.128
 Uk-3.096
 Melt-3.061
 Kah-3.045
 Outbreak-3.041
Token for
Token ID329
Feature activation-1.17e-01

Pos logit contributions
thora+1.480
icket+1.417
oli+1.395
 incent+1.375
ashtra+1.370
Neg logit contributions
 contrary-0.796
 Uk-0.769
 Kah-0.740
 Outbreak-0.735
 Melt-0.733
Token the
Token ID262
Feature activation-1.10e-03

Pos logit contributions
 contrary+0.225
 Uk+0.218
 Kah+0.211
 Melt+0.209
 Outbreak+0.209
Neg logit contributions
thora-0.243
icket-0.228
oli-0.223
 incent-0.221
ashtra-0.220
Token protection
Token ID4800
Feature activation+2.91e-01

Pos logit contributions
 contrary+0.002
 Uk+0.002
 Kah+0.002
 Melt+0.002
 Outbreak+0.002
Neg logit contributions
thora-0.002
icket-0.002
oli-0.002
 incent-0.002
ashtra-0.002
Token of
Token ID286
Feature activation+5.04e-02

Pos logit contributions
thora+0.769
icket+0.742
oli+0.739
 incent+0.720
ashtra+0.718
Neg logit contributions
 contrary-0.364
 Uk-0.358
 Outbreak-0.350
 Melt-0.350
 Kah-0.343
Token Brennan
Token ID26577
Feature activation+6.44e-01

Pos logit contributions
thora+0.879
icket+0.834
oli+0.816
 incent+0.810
ashtra+0.806
Neg logit contributions
 contrary-0.578
 Uk-0.557
 Kah-0.539
 Melt-0.531
 Outbreak-0.529
Token Center
Token ID3337
Feature activation+4.74e-01

Pos logit contributions
thora+1.799
icket+1.725
oli+1.696
 incent+1.679
ashtra+1.673
Neg logit contributions
 contrary-0.766
 Uk-0.734
 Kah-0.701
 Outbreak-0.690
 Melt-0.688
Token filed
Token ID5717
Feature activation-2.25e-01

Pos logit contributions
thora+1.213
icket+1.159
oli+1.137
 incent+1.124
ashtra+1.119
Neg logit contributions
 contrary-0.678
 Uk-0.652
 Kah-0.628
 Outbreak-0.620
 Melt-0.620
Token an
Token ID281
Feature activation+4.55e-01

Pos logit contributions
 contrary+0.385
 Uk+0.367
 Kah+0.358
 Melt+0.355
 KS+0.350
Neg logit contributions
thora-0.521
icket-0.493
oli-0.480
 incent-0.480
ashtra-0.476
Token am
Token ID716
Feature activation+2.03e-01

Pos logit contributions
thora+1.114
icket+1.051
oli+1.026
ashtra+1.022
 incent+1.021
Neg logit contributions
 contrary-0.727
 Uk-0.698
 Kah-0.673
 Melt-0.663
 KS-0.662
Tokenicus
Token ID24552
Feature activation+1.99e+00

Pos logit contributions
thora+0.448
icket+0.419
 incent+0.412
ashtra+0.411
oli+0.409
Neg logit contributions
 contrary-0.364
 Uk-0.349
 Kah-0.340
 Outbreak-0.336
 Dispatch-0.335
Token brief
Token ID4506
Feature activation+1.20e+00

Pos logit contributions
thora+3.359
icket+3.157
oli+3.078
 incent+2.995
iral+2.962
Neg logit contributions
 contrary-4.512
 Uk-4.438
 Kah-4.321
 Outbreak-4.314
 Melt-4.297
Token calling
Token ID4585
Feature activation-1.88e-01

Pos logit contributions
thora+2.809
icket+2.673
oli+2.624
 incent+2.584
ashtra+2.571
Neg logit contributions
 contrary-1.971
 Uk-1.919
 Kah-1.853
 Outbreak-1.842
 Melt-1.834
Token for
Token ID329
Feature activation-9.65e-02

Pos logit contributions
 contrary+0.272
 Uk+0.261
 Kah+0.253
 Melt+0.249
 Outbreak+0.248
Neg logit contributions
thora-0.477
icket-0.456
oli-0.447
 incent-0.442
ashtra-0.440
Token the
Token ID262
Feature activation+5.96e-02

Pos logit contributions
 contrary+0.160
 Uk+0.154
 Kah+0.150
 Melt+0.148
 Outbreak+0.147
Neg logit contributions
thora-0.226
icket-0.215
oli-0.210
 incent-0.208
ashtra-0.208
Token Court
Token ID3078
Feature activation-2.07e-01

Pos logit contributions
thora+0.142
icket+0.135
oli+0.132
 incent+0.130
ashtra+0.130
Neg logit contributions
 contrary-0.098
 Uk-0.094
 Kah-0.091
 Melt-0.090
 Outbreak-0.089
TokenHowever
Token ID4864
Feature activation+3.54e-01

Pos logit contributions
thora+0.587
icket+0.554
 incent+0.542
oli+0.541
ashtra+0.535
Neg logit contributions
 contrary-0.467
 Uk-0.453
 Kah-0.441
 Melt-0.436
 Outbreak-0.434
Token he
Token ID339
Feature activation-3.44e-02

Pos logit contributions
thora+0.884
icket+0.845
oli+0.831
 incent+0.817
ashtra+0.814
Neg logit contributions
 contrary-0.520
 Uk-0.504
 Kah-0.483
 Outbreak-0.482
 Melt-0.479
Token said
Token ID531
Feature activation-2.52e-01

Pos logit contributions
 contrary+0.059
 Uk+0.056
 Kah+0.055
 Melt+0.054
posts+0.053
Neg logit contributions
thora-0.080
icket-0.075
 incent-0.073
oli-0.073
ashtra-0.072
Token that
Token ID326
Feature activation-2.05e-01

Pos logit contributions
 contrary+0.409
 Uk+0.394
 Kah+0.383
 Melt+0.377
 Outbreak+0.376
Neg logit contributions
thora-0.600
icket-0.570
oli-0.558
 incent-0.553
ashtra-0.551
Token the
Token ID262
Feature activation+3.78e-01

Pos logit contributions
 contrary+0.334
 Uk+0.322
 Kah+0.313
 Melt+0.308
 Outbreak+0.307
Neg logit contributions
thora-0.487
icket-0.462
oli-0.452
 incent-0.449
ashtra-0.447
Token general
Token ID2276
Feature activation+1.97e+00

Pos logit contributions
thora+0.822
icket+0.778
oli+0.761
 incent+0.752
ashtra+0.748
Neg logit contributions
 contrary-0.687
 Uk-0.667
 Kah-0.648
 Outbreak-0.641
 Melt-0.641
Token classification
Token ID17923
Feature activation+7.29e-02

Pos logit contributions
thora+4.345
icket+4.135
oli+4.059
 incent+3.981
ashtra+3.964
Neg logit contributions
 contrary-3.402
 Uk-3.330
 Kah-3.220
 Outbreak-3.212
 Melt-3.186
Token is
Token ID318
Feature activation+1.07e-01

Pos logit contributions
thora+0.174
icket+0.165
oli+0.162
 incent+0.159
ashtra+0.159
Neg logit contributions
 contrary-0.118
 Uk-0.114
 Kah-0.111
 Melt-0.109
 Outbreak-0.108
Token a
Token ID257
Feature activation-2.69e-01

Pos logit contributions
thora+0.243
icket+0.230
oli+0.224
 incent+0.223
iral+0.222
Neg logit contributions
 contrary-0.187
 Uk-0.180
 Kah-0.175
 Melt-0.173
 Outbreak-0.171
Token bigger
Token ID5749
Feature activation-2.13e-02

Pos logit contributions
 contrary+0.460
 Uk+0.441
 Kah+0.430
 Melt+0.423
 Outbreak+0.418
Neg logit contributions
thora-0.616
icket-0.585
 incent-0.569
oli-0.567
ashtra-0.567
Token goal
Token ID3061
Feature activation-4.34e-01

Pos logit contributions
 contrary+0.039
 Uk+0.038
 Kah+0.037
 Melt+0.037
 Outbreak+0.036
Neg logit contributions
thora-0.046
icket-0.043
oli-0.042
 incent-0.042
ashtra-0.042
Token for
Token ID329
Feature activation+5.33e-02

Pos logit contributions
 contrary+0.385
 Uk+0.370
 Kah+0.359
 Melt+0.354
 Outbreak+0.353
Neg logit contributions
thora-0.640
icket-0.612
oli-0.596
ashtra-0.592
 incent-0.591
Token comments
Token ID3651
Feature activation-1.43e-01

Pos logit contributions
thora+0.123
icket+0.116
oli+0.114
ashtra+0.112
 incent+0.112
Neg logit contributions
 contrary-0.091
 Uk-0.088
 Kah-0.085
 Outbreak-0.084
 Melt-0.084
Token about
Token ID546
Feature activation+1.44e-01

Pos logit contributions
 contrary+0.195
 Uk+0.187
 Kah+0.180
 Outbreak+0.177
 Melt+0.177
Neg logit contributions
thora-0.375
icket-0.359
oli-0.352
 incent-0.348
ashtra-0.347
Token the
Token ID262
Feature activation+8.55e-03

Pos logit contributions
thora+0.336
icket+0.319
oli+0.312
 incent+0.308
ashtra+0.307
Neg logit contributions
 contrary-0.238
 Uk-0.231
 Kah-0.224
 Outbreak-0.221
 Melt-0.221
Token mother
Token ID2802
Feature activation+2.30e-01

Pos logit contributions
thora+0.019
icket+0.018
oli+0.018
ashtra+0.017
 incent+0.017
Neg logit contributions
 contrary-0.015
 Uk-0.015
 Kah-0.014
 Outbreak-0.014
 Melt-0.014
Token grizz
Token ID46307
Feature activation+1.95e+00

Pos logit contributions
thora+0.544
icket+0.519
oli+0.509
 incent+0.502
ashtra+0.499
Neg logit contributions
 contrary-0.371
 Uk-0.359
 Kah-0.347
 Outbreak-0.344
 Melt-0.343
Tokenly
Token ID306
Feature activation+6.18e-02

Pos logit contributions
thora+4.200
icket+4.043
oli+4.042
iral+3.910
emouth+3.885
Neg logit contributions
 contrary-3.436
 Uk-3.326
 Kah-3.255
 Outbreak-3.237
posts-3.216
Token killing
Token ID5170
Feature activation+9.46e-02

Pos logit contributions
thora+0.122
icket+0.114
oli+0.111
ashtra+0.109
 incent+0.109
Neg logit contributions
 contrary-0.126
 Uk-0.123
 Kah-0.119
 Melt-0.119
 Outbreak-0.119
Token a
Token ID257
Feature activation-9.21e-02

Pos logit contributions
thora+0.236
icket+0.225
oli+0.220
ashtra+0.217
 incent+0.217
Neg logit contributions
 contrary-0.143
 Uk-0.138
 Kah-0.133
 Outbreak-0.132
 Melt-0.132
Token person
Token ID1048
Feature activation-2.81e-02

Pos logit contributions
 contrary+0.186
 Uk+0.182
 Kah+0.177
 Melt+0.175
 Outbreak+0.174
Neg logit contributions
thora-0.184
icket-0.173
oli-0.167
ashtra-0.166
 incent-0.165
Token in
Token ID287
Feature activation+4.32e-02

Pos logit contributions
 contrary+0.042
 Uk+0.040
 Kah+0.038
 Melt+0.038
 Outbreak+0.038
Neg logit contributions
thora-0.071
icket-0.068
oli-0.066
ashtra-0.066
 incent-0.066
Token middle
Token ID3504
Feature activation-1.22e-02

Pos logit contributions
thora+0.863
icket+0.815
oli+0.795
 incent+0.781
ashtra+0.776
Neg logit contributions
 contrary-0.846
 Uk-0.823
 Kah-0.800
 Melt-0.793
 Outbreak-0.792
Token-
Token ID12
Feature activation+3.08e-01

Pos logit contributions
 contrary+0.023
 Uk+0.023
 Outbreak+0.023
 Kah+0.022
 Melt+0.022
Neg logit contributions
thora-0.025
icket-0.024
oli-0.023
 incent-0.023
ashtra-0.023
Tokenclass
Token ID4871
Feature activation+2.00e-01

Pos logit contributions
thora+0.661
icket+0.640
oli+0.619
ashtra+0.610
 incent+0.608
Neg logit contributions
 contrary-0.557
 Uk-0.541
 Outbreak-0.531
 Kah-0.531
 Melt-0.523
Token
Token ID960
Feature activation+7.48e-01

Pos logit contributions
thora+0.510
icket+0.486
oli+0.477
 incent+0.472
ashtra+0.469
Neg logit contributions
 contrary-0.289
 Uk-0.278
 Kah-0.267
 Melt-0.263
 Outbreak-0.263
Tokenany
Token ID1092
Feature activation+4.89e-01

Pos logit contributions
thora+1.690
icket+1.604
oli+1.572
 incent+1.551
ashtra+1.542
Neg logit contributions
 contrary-1.290
 Uk-1.252
 Kah-1.213
 Outbreak-1.200
 Melt-1.199
Token motor
Token ID5584
Feature activation+1.91e+00

Pos logit contributions
thora+1.112
icket+1.058
oli+1.036
 incent+1.023
ashtra+1.018
Neg logit contributions
 contrary-0.836
 Uk-0.809
 Kah-0.784
 Melt-0.773
 Outbreak-0.773
Token vehicle
Token ID4038
Feature activation+6.80e-01

Pos logit contributions
thora+3.551
icket+3.364
oli+3.290
 incent+3.208
ashtra+3.182
Neg logit contributions
 contrary-3.914
 Uk-3.843
 Outbreak-3.795
 Kah-3.770
 Melt-3.744
Token,
Token ID11
Feature activation+2.24e-01

Pos logit contributions
thora+1.784
icket+1.706
oli+1.677
 incent+1.656
ashtra+1.651
Neg logit contributions
 contrary-0.929
 Uk-0.890
 Kah-0.856
 Melt-0.842
 Outbreak-0.842
Token TV
Token ID3195
Feature activation-1.77e-02

Pos logit contributions
thora+0.459
icket+0.431
oli+0.423
ashtra+0.415
 incent+0.413
Neg logit contributions
 contrary-0.441
 Uk-0.427
 Kah-0.416
 Melt-0.409
 KS-0.407
Token,
Token ID11
Feature activation+9.74e-02

Pos logit contributions
 contrary+0.023
 Uk+0.022
 Kah+0.021
 Outbreak+0.021
 Melt+0.021
Neg logit contributions
thora-0.048
icket-0.045
oli-0.045
 incent-0.044
ashtra-0.044
Token computer
Token ID3644
Feature activation-3.26e-02

Pos logit contributions
thora+0.217
icket+0.204
oli+0.200
ashtra+0.198
 incent+0.198
Neg logit contributions
 contrary-0.174
 Uk-0.168
 Kah-0.163
 Melt-0.161
 Outbreak-0.160
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
 contrary+0.097
 Uk+0.093
 Kah+0.090
 Melt+0.088
 Outbreak+0.088
Neg logit contributions
thora-0.173
icket-0.165
oli-0.162
ashtra-0.160
 incent-0.160
Token but
Token ID475
Feature activation+1.05e-01

Pos logit contributions
 contrary+0.237
 Uk+0.226
 Kah+0.220
 Melt+0.216
 Outbreak+0.214
Neg logit contributions
thora-0.369
icket-0.351
oli-0.343
ashtra-0.341
 incent-0.339
Token the
Token ID262
Feature activation+3.66e-01

Pos logit contributions
thora+0.248
icket+0.235
oli+0.230
 incent+0.228
ashtra+0.228
Neg logit contributions
 contrary-0.176
 Uk-0.169
 Kah-0.165
 Melt-0.162
 Outbreak-0.161
Token chassis
Token ID24587
Feature activation+7.03e-01

Pos logit contributions
thora+0.803
icket+0.761
oli+0.746
 incent+0.736
ashtra+0.735
Neg logit contributions
 contrary-0.666
 Uk-0.647
 Kah-0.628
 Melt-0.620
 Outbreak-0.617
Token and
Token ID290
Feature activation+1.86e-01

Pos logit contributions
thora+1.707
icket+1.625
oli+1.592
 incent+1.572
ashtra+1.566
Neg logit contributions
 contrary-1.102
 Uk-1.064
 Kah-1.030
 Melt-1.017
 Outbreak-1.014
Token general
Token ID2276
Feature activation+1.90e+00

Pos logit contributions
thora+0.376
icket+0.355
oli+0.348
ashtra+0.341
 incent+0.340
Neg logit contributions
 contrary-0.369
 Uk-0.359
 Kah-0.349
 Melt-0.345
 Outbreak-0.343
Token handling
Token ID9041
Feature activation+7.67e-02

Pos logit contributions
thora+3.975
icket+3.763
oli+3.671
 incent+3.625
ashtra+3.584
Neg logit contributions
 contrary-3.477
 Uk-3.385
 Kah-3.297
 Outbreak-3.297
 Melt-3.270
Token are
Token ID389
Feature activation+3.07e-01

Pos logit contributions
thora+0.185
icket+0.177
oli+0.172
 incent+0.171
ashtra+0.171
Neg logit contributions
 contrary-0.122
 Uk-0.117
 Kah-0.113
 Melt-0.111
posts-0.110
Token quite
Token ID2407
Feature activation-1.99e-01

Pos logit contributions
thora+0.694
icket+0.656
oli+0.642
iral+0.634
ashtra+0.634
Neg logit contributions
 contrary-0.543
 Uk-0.522
 Kah-0.507
 Melt-0.500
 Outbreak-0.494
Token similar
Token ID2092
Feature activation+3.15e-01

Pos logit contributions
 contrary+0.354
 Uk+0.338
 Kah+0.329
 Melt+0.322
 KS+0.316
Neg logit contributions
thora-0.452
icket-0.429
oli-0.418
iral-0.414
ashtra-0.413
Token [
Token ID685
Feature activation+9.85e-01

Pos logit contributions
thora+0.822
icket+0.787
oli+0.772
 incent+0.764
ashtra+0.761
Neg logit contributions
 contrary-0.436
 Uk-0.418
 Kah-0.403
 Melt-0.396
 Outbreak-0.396
Tokenrez
Token ID21107
Feature activation+4.23e-01

Pos logit contributions
thora+0.411
icket+0.366
oli+0.351
 incent+0.332
ashtra+0.332
Neg logit contributions
 contrary-1.281
 Uk-1.258
 Kah-1.234
 Outbreak-1.231
 Melt-1.226
Token,
Token ID11
Feature activation-2.77e-02

Pos logit contributions
thora+1.085
icket+1.037
oli+1.024
 incent+1.006
ashtra+1.001
Neg logit contributions
 contrary-0.596
 Uk-0.572
 Kah-0.549
 Outbreak-0.546
 Melt-0.543
Token with
Token ID351
Feature activation+1.26e-01

Pos logit contributions
 contrary+0.041
 Uk+0.039
 Kah+0.038
 Outbreak+0.037
 Melt+0.037
Neg logit contributions
thora-0.070
icket-0.067
oli-0.065
 incent-0.065
ashtra-0.064
Token an
Token ID281
Feature activation+3.76e-01

Pos logit contributions
thora+0.282
icket+0.269
oli+0.262
 incent+0.259
ashtra+0.258
Neg logit contributions
 contrary-0.223
 Uk-0.217
 Kah-0.210
 Melt-0.207
 Outbreak-0.207
Token army
Token ID5428
Feature activation+3.55e-01

Pos logit contributions
thora+0.781
icket+0.730
oli+0.705
emouth+0.702
ashtra+0.701
Neg logit contributions
 contrary-0.738
 Uk-0.721
 Kah-0.692
 Outbreak-0.690
 Melt-0.681
Token general
Token ID2276
Feature activation+1.89e+00

Pos logit contributions
thora+0.725
icket+0.685
oli+0.664
 incent+0.658
ashtra+0.657
Neg logit contributions
 contrary-0.697
 Uk-0.676
 Kah-0.657
 Melt-0.651
 Outbreak-0.651
Token;
Token ID26
Feature activation+2.18e-01

Pos logit contributions
thora+4.440
icket+4.223
oli+4.213
iral+4.094
 incent+4.064
Neg logit contributions
 contrary-2.927
 Uk-2.889
 Outbreak-2.803
 Kah-2.784
 Melt-2.772
Token Mr
Token ID1770
Feature activation+4.24e-01

Pos logit contributions
thora+0.514
icket+0.489
oli+0.478
 incent+0.473
ashtra+0.471
Neg logit contributions
 contrary-0.360
 Uk-0.347
 Kah-0.336
 Outbreak-0.331
 Melt-0.331
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
thora+1.066
icket+1.017
 incent+0.997
oli+0.996
ashtra+0.985
Neg logit contributions
 contrary-0.628
 Uk-0.607
 Kah-0.586
 Melt-0.578
 Outbreak-0.577
Token Ram
Token ID7431
Feature activation+7.73e-01

Pos logit contributions
 contrary+0.241
 Uk+0.235
 Kah+0.227
 Melt+0.224
 Outbreak+0.223
Neg logit contributions
thora-0.339
icket-0.323
oli-0.315
 incent-0.314
ashtra-0.312
Tokení
Token ID8836
Feature activation+4.83e-01

Pos logit contributions
thora+1.522
icket+1.430
 incent+1.398
oli+1.393
ashtra+1.371
Neg logit contributions
 contrary-1.559
 Uk-1.516
 Kah-1.476
 Outbreak-1.461
 Melt-1.459
Token
Token ID447
Feature activation-1.07e-01

Pos logit contributions
thora+0.071
icket+0.068
 incent+0.067
oli+0.066
ashtra+0.066
Neg logit contributions
 contrary-0.036
 Uk-0.034
 Kah-0.033
 Melt-0.033
 Outbreak-0.033
Token
Token ID247
Feature activation-2.37e-02

Pos logit contributions
 contrary+0.193
 Uk+0.187
 Kah+0.182
 Melt+0.181
 Outbreak+0.180
Neg logit contributions
thora-0.233
icket-0.221
oli-0.216
 incent-0.214
ashtra-0.213
Tokent
Token ID83
Feature activation+2.70e-01

Pos logit contributions
 contrary+0.037
 Uk+0.036
 Kah+0.035
 Melt+0.034
 Outbreak+0.034
Neg logit contributions
thora-0.058
icket-0.055
oli-0.054
 incent-0.053
ashtra-0.053
Token make
Token ID787
Feature activation-1.56e-02

Pos logit contributions
thora+0.571
icket+0.537
oli+0.522
ashtra+0.517
emouth+0.515
Neg logit contributions
 contrary-0.514
 Uk-0.495
 Kah-0.481
 Melt-0.478
 Outbreak-0.476
Token the
Token ID262
Feature activation-1.79e-01

Pos logit contributions
 contrary+0.026
 Uk+0.025
 Kah+0.025
 Outbreak+0.024
 Melt+0.024
Neg logit contributions
thora-0.036
icket-0.034
oli-0.034
 incent-0.033
ashtra-0.033
Token repeal
Token ID14634
Feature activation+1.89e+00

Pos logit contributions
 contrary+0.309
 Uk+0.298
 Kah+0.289
 Melt+0.285
 Outbreak+0.285
Neg logit contributions
thora-0.410
icket-0.388
oli-0.378
 incent-0.376
ashtra-0.376
Token crus
Token ID25164
Feature activation+1.99e-01

Pos logit contributions
thora+4.522
icket+4.314
oli+4.292
 incent+4.187
iral+4.173
Neg logit contributions
 contrary-2.882
 Uk-2.797
 Outbreak-2.699
 Kah-2.686
 Melt-2.679
Tokenaders
Token ID9972
Feature activation-2.51e-01

Pos logit contributions
thora+0.347
icket+0.325
 incent+0.313
oli+0.313
ashtra+0.310
Neg logit contributions
 contrary-0.444
 Uk-0.433
 Kah-0.425
 Outbreak-0.420
 Melt-0.420
Token feel
Token ID1254
Feature activation+1.18e+00

Pos logit contributions
 contrary+0.488
 Uk+0.473
 Kah+0.463
 Melt+0.454
 Outbreak+0.452
Neg logit contributions
thora-0.521
icket-0.487
oli-0.473
 incent-0.469
ashtra-0.466
Token discouraged
Token ID30170
Feature activation+6.88e-01

Pos logit contributions
thora+2.558
icket+2.426
oli+2.382
 incent+2.345
ashtra+2.330
Neg logit contributions
 contrary-2.127
 Uk-2.067
 Kah-2.006
 Outbreak-1.999
 Melt-1.989
Token enough
Token ID1576
Feature activation+1.46e-02

Pos logit contributions
thora+1.785
icket+1.708
oli+1.688
 incent+1.658
ashtra+1.653
Neg logit contributions
 contrary-0.937
 Uk-0.906
 Kah-0.868
 Outbreak-0.864
 Melt-0.857
TokenS
Token ID50
Feature activation-3.89e-01

Pos logit contributions
 contrary+1.338
 Uk+1.302
 Kah+1.261
 Outbreak+1.251
 Melt+1.246
Neg logit contributions
thora-1.856
icket-1.772
oli-1.735
 incent-1.705
iral-1.704
Token.
Token ID13
Feature activation-9.67e-01

Pos logit contributions
 contrary+0.646
 Uk+0.626
 Kah+0.607
 Outbreak+0.601
 Melt+0.600
Neg logit contributions
thora-0.903
icket-0.860
oli-0.844
 incent-0.831
ashtra-0.826
Token dollars
Token ID5054
Feature activation-5.40e-01

Pos logit contributions
 contrary+1.634
 Uk+1.569
 Kah+1.513
 Outbreak+1.501
 Melt+1.500
Neg logit contributions
thora-2.259
icket-2.123
oli-2.076
iral-2.041
ashtra-2.039
Token from
Token ID422
Feature activation-1.22e-01

Pos logit contributions
 contrary+0.893
 Uk+0.857
 Kah+0.830
 Melt+0.824
 Outbreak+0.810
Neg logit contributions
thora-1.272
icket-1.205
oli-1.172
ashtra-1.162
iral-1.159
Token rem
Token ID816
Feature activation+1.26e-01

Pos logit contributions
 contrary+0.202
 Uk+0.196
 Kah+0.189
 Melt+0.186
 Outbreak+0.185
Neg logit contributions
thora-0.288
icket-0.273
oli-0.267
ashtra-0.264
 incent-0.263
Tokenitt
Token ID715
Feature activation+1.88e+00

Pos logit contributions
thora+0.266
icket+0.254
oli+0.249
 incent+0.244
iral+0.243
Neg logit contributions
 contrary-0.235
 Uk-0.229
 Kah-0.222
 Outbreak-0.221
 Melt-0.220
Tokenances
Token ID1817
Feature activation+7.76e-01

Pos logit contributions
thora+1.410
icket+1.253
oli+1.190
 incent+1.088
iral+1.080
Neg logit contributions
 contrary-6.021
 Uk-5.943
 Kah-5.831
 Outbreak-5.813
 Melt-5.794
Token,
Token ID11
Feature activation-4.46e-02

Pos logit contributions
thora+1.958
icket+1.877
oli+1.848
 incent+1.824
ashtra+1.806
Neg logit contributions
 contrary-1.103
 Uk-1.072
 Kah-1.030
 Outbreak-1.029
 Melt-1.017
Token tourism
Token ID19277
Feature activation+1.93e-01

Pos logit contributions
 contrary+0.076
 Uk+0.073
 Kah+0.071
 Outbreak+0.070
 Melt+0.070
Neg logit contributions
thora-0.103
icket-0.098
oli-0.096
ashtra-0.094
 incent-0.094
Token and
Token ID290
Feature activation-9.92e-02

Pos logit contributions
thora+0.502
icket+0.481
oli+0.472
 incent+0.467
ashtra+0.465
Neg logit contributions
 contrary-0.265
 Uk-0.256
 Kah-0.246
 Outbreak-0.244
 Melt-0.243
Token pal
Token ID6340
Feature activation+1.45e-02

Pos logit contributions
 contrary+0.167
 Uk+0.161
 Kah+0.156
 Outbreak+0.153
 Melt+0.153
Neg logit contributions
thora-0.230
icket-0.219
oli-0.215
ashtra-0.211
 incent-0.211
Token next
Token ID1306
Feature activation+1.44e-01

Pos logit contributions
thora+0.930
icket+0.887
oli+0.869
 incent+0.861
ashtra+0.855
Neg logit contributions
 contrary-0.598
 Uk-0.576
 Kah-0.560
 Melt-0.551
 Outbreak-0.551
Token time
Token ID640
Feature activation-5.68e-01

Pos logit contributions
thora+0.351
icket+0.334
oli+0.326
ashtra+0.324
 incent+0.322
Neg logit contributions
 contrary-0.226
 Uk-0.219
 Kah-0.213
 Outbreak-0.210
 Melt-0.210
Token!
Token ID0
Feature activation-7.63e-01

Pos logit contributions
 contrary+0.872
 Uk+0.849
 Kah+0.820
 Melt+0.813
 Outbreak+0.809
Neg logit contributions
thora-1.397
icket-1.323
oli-1.298
ashtra-1.289
 incent-1.289
Token<|endoftext|>
Token ID50256
Feature activation+4.33e-01

Pos logit contributions
 contrary+1.163
 Uk+1.130
 Kah+1.097
 Outbreak+1.080
 Melt+1.080
Neg logit contributions
thora-1.878
icket-1.785
oli-1.749
 incent-1.733
ashtra-1.728
TokenBR
Token ID11473
Feature activation+1.14e-01

Pos logit contributions
thora+0.944
icket+0.893
oli+0.873
 incent+0.865
ashtra+0.858
Neg logit contributions
 contrary-0.782
 Uk-0.760
 Kah-0.738
 Outbreak-0.731
 Melt-0.730
TokenUSS
Token ID32835
Feature activation+1.87e+00

Pos logit contributions
thora+0.220
icket+0.207
oli+0.202
 incent+0.200
iral+0.198
Neg logit contributions
 contrary-0.236
 Uk-0.231
 Kah-0.225
 Outbreak-0.223
 Melt-0.222
TokenELS
Token ID37142
Feature activation+4.17e-01

Pos logit contributions
thora+3.309
icket+3.106
oli+3.034
 incent+2.967
ashtra+2.953
Neg logit contributions
 contrary-4.115
 Uk-4.019
 Kah-3.909
 Outbreak-3.895
 Melt-3.885
Token (
Token ID357
Feature activation-3.88e-01

Pos logit contributions
thora+0.994
icket+0.951
oli+0.934
 incent+0.921
ashtra+0.917
Neg logit contributions
 contrary-0.657
 Uk-0.635
 Kah-0.612
 Outbreak-0.608
 Melt-0.606
TokenReuters
Token ID12637
Feature activation+2.43e-01

Pos logit contributions
 contrary+0.590
 Uk+0.571
 Kah+0.548
 Outbreak+0.541
 Melt+0.539
Neg logit contributions
thora-0.965
icket-0.910
oli-0.892
 incent-0.892
emouth-0.883
Token)
Token ID8
Feature activation-4.88e-01

Pos logit contributions
thora+0.592
icket+0.565
oli+0.554
 incent+0.546
ashtra+0.544
Neg logit contributions
 contrary-0.375
 Uk-0.363
 Kah-0.350
 Outbreak-0.346
 Melt-0.346
Token -
Token ID532
Feature activation-2.75e-01

Pos logit contributions
 contrary+0.605
 Uk+0.596
 Kah+0.570
 Outbreak+0.556
 Melt+0.556
Neg logit contributions
thora-1.372
 incent-1.305
icket-1.299
oli-1.268
emouth-1.259
Tokeniv
Token ID452
Feature activation-1.08e-01

Pos logit contributions
thora+0.148
icket+0.139
 incent+0.137
oli+0.135
ashtra+0.134
Neg logit contributions
 contrary-0.139
 Uk-0.136
 Kah-0.132
 Melt-0.131
 Outbreak-0.129
Token loans
Token ID10021
Feature activation-9.43e-02

Pos logit contributions
 contrary+0.180
 Uk+0.175
 Kah+0.169
 Melt+0.167
 Outbreak+0.167
Neg logit contributions
thora-0.251
icket-0.239
oli-0.233
 incent-0.231
ashtra-0.230
Token likely
Token ID1884
Feature activation-1.32e-02

Pos logit contributions
 contrary+0.154
 Uk+0.150
 Kah+0.144
 Melt+0.143
 Outbreak+0.143
Neg logit contributions
thora-0.222
icket-0.211
oli-0.207
 incent-0.204
ashtra-0.203
Token also
Token ID635
Feature activation-7.49e-01

Pos logit contributions
 contrary+0.021
 Uk+0.020
 Kah+0.019
 Melt+0.019
 Outbreak+0.019
Neg logit contributions
thora-0.032
icket-0.030
oli-0.030
 incent-0.029
ashtra-0.029
Token lacked
Token ID19989
Feature activation-5.91e-01

Pos logit contributions
 contrary+1.183
 Uk+1.136
 Kah+1.098
 Melt+1.079
posts+1.066
Neg logit contributions
thora-1.812
icket-1.724
oli-1.683
ashtra-1.662
 incent-1.657
Token collateral
Token ID27907
Feature activation+1.86e+00

Pos logit contributions
 contrary+1.006
 Uk+0.977
 Kah+0.943
 Melt+0.926
 Outbreak+0.922
Neg logit contributions
thora-1.361
icket-1.293
oli-1.264
ashtra-1.245
iral-1.239
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
thora+4.280
icket+4.134
oli+4.106
 incent+4.053
 diarr+3.963
Neg logit contributions
 Outbreak-2.787
 contrary-2.784
 Uk-2.779
 Kah-2.685
 Melt-2.673
Token
Token ID198
Feature activation-3.10e-01

Pos logit contributions
 contrary+0.148
 Uk+0.143
 Kah+0.137
 Outbreak+0.135
 Melt+0.135
Neg logit contributions
thora-0.282
icket-0.269
oli-0.264
 incent-0.261
ashtra-0.260
Token
Token ID198
Feature activation-2.91e-01

Pos logit contributions
 contrary+0.434
 Uk+0.417
 Kah+0.401
 Melt+0.395
posts+0.393
Neg logit contributions
thora-0.812
icket-0.773
 incent-0.770
oli-0.754
ashtra-0.751
TokenMan
Token ID5124
Feature activation-1.91e-01

Pos logit contributions
 contrary+0.485
 Uk+0.473
 Kah+0.455
posts+0.453
 Melt+0.449
Neg logit contributions
thora-0.677
icket-0.636
 incent-0.627
oli-0.620
emouth-0.617
Tokenagers
Token ID10321
Feature activation-6.27e-01

Pos logit contributions
 contrary+0.390
 Uk+0.382
 Kah+0.371
 Melt+0.368
 Outbreak+0.367
Neg logit contributions
thora-0.368
icket-0.348
oli-0.334
 incent-0.334
ashtra-0.330
Token-
Token ID12
Feature activation+1.09e-02

Pos logit contributions
 contrary+0.151
 Uk+0.146
 Kah+0.140
 Melt+0.137
posts+0.137
Neg logit contributions
thora-0.257
icket-0.245
oli-0.241
ashtra-0.237
 incent-0.237
Tokenmentioned
Token ID17181
Feature activation+2.18e-01

Pos logit contributions
thora+0.022
icket+0.021
oli+0.020
 incent+0.020
ashtra+0.020
Neg logit contributions
 contrary-0.021
 Uk-0.021
 Kah-0.020
 Outbreak-0.020
 Melt-0.020
Token assets
Token ID6798
Feature activation-4.40e-01

Pos logit contributions
thora+0.623
icket+0.598
oli+0.588
 incent+0.580
ashtra+0.580
Neg logit contributions
 contrary-0.250
 Uk-0.238
 Kah-0.225
 Melt-0.222
 Outbreak-0.221
Token
Token ID960
Feature activation+6.41e-01

Pos logit contributions
 contrary+0.592
 Uk+0.564
 Kah+0.542
 Melt+0.533
 Outbreak+0.532
Neg logit contributions
thora-1.166
icket-1.115
oli-1.093
 incent-1.082
ashtra-1.077
Tokena
Token ID64
Feature activation-7.67e-02

Pos logit contributions
thora+1.467
icket+1.393
oli+1.365
 incent+1.347
ashtra+1.341
Neg logit contributions
 contrary-1.087
 Uk-1.053
 Kah-1.020
 Outbreak-1.010
 Melt-1.008
Token motor
Token ID5584
Feature activation+1.86e+00

Pos logit contributions
 contrary+0.139
 Uk+0.134
 Kah+0.131
 KS+0.128
 Melt+0.128
Neg logit contributions
thora-0.169
icket-0.160
oli-0.156
ashtra-0.154
 incent-0.154
Token vehicle
Token ID4038
Feature activation+7.87e-01

Pos logit contributions
thora+2.763
icket+2.592
oli+2.522
 incent+2.423
iral+2.420
Neg logit contributions
 contrary-4.481
 Uk-4.408
 Outbreak-4.397
 Kah-4.355
 Melt-4.339
Token,
Token ID11
Feature activation+2.84e-01

Pos logit contributions
thora+2.047
icket+1.969
oli+1.930
 incent+1.909
ashtra+1.892
Neg logit contributions
 contrary-1.052
 Uk-1.021
 Outbreak-0.987
 Kah-0.985
 Melt-0.973
Token a
Token ID257
Feature activation-1.80e-01

Pos logit contributions
thora+0.654
icket+0.621
oli+0.609
 incent+0.600
ashtra+0.599
Neg logit contributions
 contrary-0.478
 Uk-0.463
 Kah-0.448
 Melt-0.443
 Outbreak-0.443
Token TV
Token ID3195
Feature activation+7.42e-02

Pos logit contributions
 contrary+0.320
 Uk+0.310
 Kah+0.300
 Melt+0.295
 KS+0.295
Neg logit contributions
thora-0.402
icket-0.379
oli-0.372
ashtra-0.367
 incent-0.366
Token,
Token ID11
Feature activation+4.23e-02

Pos logit contributions
thora+0.184
icket+0.176
oli+0.172
 incent+0.170
ashtra+0.169
Neg logit contributions
 contrary-0.111
 Uk-0.107
 Kah-0.104
 Outbreak-0.103
 Melt-0.102
Tokenf
Token ID69
Feature activation-6.51e-01

Pos logit contributions
thora+0.624
icket+0.590
oli+0.580
 incent+0.576
ashtra+0.574
Neg logit contributions
 contrary-0.409
 Uk-0.397
 Kah-0.382
 Melt-0.377
 Outbreak-0.377
Token4
Token ID19
Feature activation+1.67e-01

Pos logit contributions
 contrary+1.017
 Uk+0.989
 Kah+0.954
 Melt+0.937
 Outbreak+0.934
Neg logit contributions
thora-1.582
icket-1.485
ashtra-1.454
 incent-1.452
emouth-1.450
Tokenb
Token ID65
Feature activation-7.15e-01

Pos logit contributions
thora+0.394
icket+0.374
oli+0.366
ashtra+0.363
 incent+0.363
Neg logit contributions
 contrary-0.273
 Uk-0.265
 Kah-0.256
 Melt-0.253
 Outbreak-0.253
Tokenbe
Token ID1350
Feature activation-4.08e-01

Pos logit contributions
 contrary+1.134
 Uk+1.103
 Kah+1.064
 Melt+1.042
posts+1.041
Neg logit contributions
thora-1.709
icket-1.614
ashtra-1.571
oli-1.570
emouth-1.568
Token66
Token ID2791
Feature activation+6.47e-02

Pos logit contributions
 contrary+0.652
 Uk+0.631
 Kah+0.610
 Melt+0.602
 Outbreak+0.601
Neg logit contributions
thora-0.973
icket-0.923
oli-0.900
ashtra-0.899
 incent-0.893
Token IEEE
Token ID40552
Feature activation+1.86e+00

Pos logit contributions
thora+0.144
icket+0.136
oli+0.133
ashtra+0.132
 incent+0.132
Neg logit contributions
 contrary-0.114
 Uk-0.111
 Kah-0.108
 Melt-0.106
 Outbreak-0.106
Token 802
Token ID33121
Feature activation+1.40e+00

Pos logit contributions
thora+3.391
icket+3.205
oli+3.127
 incent+3.061
ashtra+3.036
Neg logit contributions
 contrary-3.943
 Uk-3.813
 Kah-3.742
 Outbreak-3.698
posts-3.687
Token.
Token ID13
Feature activation-3.18e-01

Pos logit contributions
thora+3.412
icket+3.261
oli+3.198
 incent+3.149
ashtra+3.141
Neg logit contributions
 contrary-2.147
 Uk-2.075
 Kah-2.003
 Outbreak-1.982
 Melt-1.976
Token11
Token ID1157
Feature activation-2.84e-02

Pos logit contributions
 contrary+0.381
 Uk+0.367
 Kah+0.347
 Outbreak+0.345
 Melt+0.338
Neg logit contributions
thora-0.892
icket-0.846
oli-0.838
 incent-0.833
emouth-0.823
Tokenb
Token ID65
Feature activation-7.52e-01

Pos logit contributions
 contrary+0.048
 Uk+0.047
 Kah+0.045
 Melt+0.045
 Outbreak+0.045
Neg logit contributions
thora-0.065
icket-0.062
oli-0.061
 incent-0.060
ashtra-0.060
Tokengn
Token ID4593
Feature activation-3.45e-01

Pos logit contributions
 contrary+1.230
 Uk+1.203
 Kah+1.164
 Dispatch+1.150
 Melt+1.147
Neg logit contributions
thora-1.763
icket-1.657
oli-1.631
emouth-1.618
ashtra-1.616
Token team
Token ID1074
Feature activation-2.95e-01

Pos logit contributions
 contrary+0.118
 Uk+0.114
 Kah+0.111
 Melt+0.109
 Outbreak+0.109
Neg logit contributions
thora-0.139
icket-0.132
oli-0.129
 incent-0.127
ashtra-0.126
Token to
Token ID284
Feature activation-5.66e-02

Pos logit contributions
 contrary+0.444
 Uk+0.422
 Kah+0.408
 Melt+0.404
 Outbreak+0.404
Neg logit contributions
thora-0.737
icket-0.703
oli-0.687
 incent-0.682
ashtra-0.681
Token share
Token ID2648
Feature activation+4.10e-01

Pos logit contributions
 contrary+0.098
 Uk+0.094
 Kah+0.091
 Melt+0.090
 KS+0.089
Neg logit contributions
thora-0.130
icket-0.122
oli-0.120
ashtra-0.118
emouth-0.118
Token with
Token ID351
Feature activation-1.90e-01

Pos logit contributions
thora+1.047
icket+1.001
oli+0.983
 incent+0.971
ashtra+0.967
Neg logit contributions
 contrary-0.586
 Uk-0.564
 Kah-0.543
 Outbreak-0.537
 Melt-0.536
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
 contrary+0.324
 Uk+0.311
 Kah+0.301
 Melt+0.299
 Outbreak+0.296
Neg logit contributions
thora-0.441
icket-0.417
oli-0.408
ashtra-0.406
 incent-0.401
Token general
Token ID2276
Feature activation+1.85e+00

Pos logit contributions
thora+0.301
icket+0.283
oli+0.276
ashtra+0.273
 incent+0.272
Neg logit contributions
 contrary-0.266
 Uk-0.256
 Kah-0.249
 Melt-0.246
 Outbreak-0.246
Token public
Token ID1171
Feature activation+2.93e-01

Pos logit contributions
thora+3.309
icket+3.129
oli+3.045
 incent+2.983
iral+2.954
Neg logit contributions
 contrary-3.982
 Uk-3.898
 Kah-3.814
 Outbreak-3.794
 Melt-3.769
Token;
Token ID26
Feature activation+3.03e-02

Pos logit contributions
thora+0.730
icket+0.697
oli+0.684
 incent+0.675
ashtra+0.672
Neg logit contributions
 contrary-0.436
 Uk-0.421
 Kah-0.406
 Outbreak-0.401
 Melt-0.400
Token quotes
Token ID13386
Feature activation+1.10e+00

Pos logit contributions
thora+0.073
icket+0.069
oli+0.068
ashtra+0.067
 incent+0.067
Neg logit contributions
 contrary-0.049
 Uk-0.046
 Kah-0.045
 Melt-0.044
posts-0.044
Token from
Token ID422
Feature activation+2.52e-01

Pos logit contributions
thora+2.797
icket+2.683
oli+2.645
 incent+2.611
ashtra+2.583
Neg logit contributions
 contrary-1.513
 Uk-1.482
 Kah-1.423
 Outbreak-1.421
 Melt-1.400
Token coaches
Token ID11070
Feature activation-9.35e-02

Pos logit contributions
thora+0.570
icket+0.540
oli+0.528
 incent+0.521
ashtra+0.521
Neg logit contributions
 contrary-0.439
 Uk-0.425
 Kah-0.413
 Melt-0.408
 Outbreak-0.407
Token)
Token ID8
Feature activation-7.24e-02

Pos logit contributions
thora+0.773
icket+0.739
oli+0.726
 incent+0.714
iral+0.708
Neg logit contributions
 contrary-0.522
 Uk-0.506
 Kah-0.489
 Outbreak-0.483
 Melt-0.482
Token
Token ID198
Feature activation-2.71e-02

Pos logit contributions
 contrary+0.102
 Uk+0.098
 Kah+0.095
 Melt+0.094
 Outbreak+0.094
Neg logit contributions
thora-0.188
icket-0.180
oli-0.176
ashtra-0.174
 incent-0.174
Token
Token ID198
Feature activation-7.79e-02

Pos logit contributions
 contrary+0.039
 Uk+0.038
 Kah+0.036
 Melt+0.036
 Outbreak+0.036
Neg logit contributions
thora-0.069
icket-0.066
 incent-0.065
oli-0.065
ashtra-0.064
TokenWill
Token ID8743
Feature activation+1.91e-01

Pos logit contributions
 contrary+0.135
 Uk+0.131
posts+0.127
 Kah+0.127
 Melt+0.126
Neg logit contributions
thora-0.176
icket-0.166
 incent-0.163
oli-0.161
ashtra-0.161
Token Del
Token ID4216
Feature activation-7.63e-01

Pos logit contributions
thora+0.458
icket+0.436
oli+0.427
 incent+0.422
ashtra+0.422
Neg logit contributions
 contrary-0.306
 Uk-0.296
 Kah-0.285
 Melt-0.283
 Outbreak-0.282
Tokeninqu
Token ID18934
Feature activation+1.85e+00

Pos logit contributions
 contrary+1.522
 Uk+1.487
 Kah+1.453
 Melt+1.437
 Outbreak+1.429
Neg logit contributions
thora-1.498
icket-1.423
oli-1.369
ashtra-1.369
 incent-1.361
Tokenencies
Token ID3976
Feature activation-3.02e-03

Pos logit contributions
thora+2.602
icket+2.410
oli+2.340
iral+2.281
ashtra+2.243
Neg logit contributions
 contrary-4.714
 Uk-4.607
 Kah-4.521
 Outbreak-4.477
 Melt-4.476
Token Trigger
Token ID24593
Feature activation-4.40e-01

Pos logit contributions
 contrary+0.004
 Uk+0.004
 Kah+0.004
 Melt+0.004
 Outbreak+0.004
Neg logit contributions
thora-0.008
icket-0.007
oli-0.007
 incent-0.007
ashtra-0.007
Token a
Token ID257
Feature activation-1.61e-02

Pos logit contributions
 contrary+0.662
 Uk+0.640
 Melt+0.617
 Kah+0.615
 Outbreak+0.615
Neg logit contributions
thora-1.097
icket-1.044
oli-1.022
 incent-1.015
ashtra-1.012
Token New
Token ID968
Feature activation-2.01e-01

Pos logit contributions
 contrary+0.022
 Uk+0.021
 Kah+0.020
 Melt+0.020
 Outbreak+0.020
Neg logit contributions
thora-0.042
icket-0.041
oli-0.040
 incent-0.040
ashtra-0.039
Token American
Token ID1605
Feature activation+2.02e-02

Pos logit contributions
 contrary+0.261
 Uk+0.253
 Melt+0.242
 Kah+0.241
 Outbreak+0.241
Neg logit contributions
thora-0.542
icket-0.518
oli-0.509
 incent-0.506
ashtra-0.505
Token 2012
Token ID2321
Feature activation+2.01e-01

Pos logit contributions
 contrary+1.357
 Uk+1.314
 Kah+1.277
 Melt+1.249
 Outbreak+1.240
Neg logit contributions
thora-2.478
icket-2.373
 incent-2.321
oli-2.311
ashtra-2.292
Token United
Token ID1578
Feature activation-4.19e-01

Pos logit contributions
thora+0.519
icket+0.496
oli+0.488
 incent+0.481
ashtra+0.480
Neg logit contributions
 contrary-0.280
 Uk-0.268
 Kah-0.257
 Outbreak-0.256
 Melt-0.254
Token States
Token ID1829
Feature activation+1.16e-01

Pos logit contributions
 contrary+0.612
 Uk+0.582
 Outbreak+0.560
 Kah+0.558
posts+0.556
Neg logit contributions
thora-1.056
icket-1.001
oli-0.990
 incent-0.975
iral-0.974
Token 5
Token ID642
Feature activation-4.16e-01

Pos logit contributions
thora+0.304
icket+0.291
oli+0.285
 incent+0.282
ashtra+0.281
Neg logit contributions
 contrary-0.160
 Uk-0.154
 Kah-0.148
 Outbreak-0.146
 Melt-0.146
Token667
Token ID28933
Feature activation+2.60e-01

Pos logit contributions
 contrary+0.771
 Uk+0.752
 Kah+0.731
 Outbreak+0.726
 Melt+0.724
Neg logit contributions
thora-0.891
icket-0.847
oli-0.819
ashtra-0.816
 incent-0.814
Token Posts
Token ID12043
Feature activation-2.65e+00

Pos logit contributions
thora+0.913
icket+0.884
oli+0.872
 incent+0.865
ashtra+0.862
Neg logit contributions
 contrary-0.122
 Uk-0.109
 Kah-0.095
 Outbreak-0.091
 Melt-0.091
Token #
Token ID1303
Feature activation-2.25e+00

Pos logit contributions
 contrary+4.082
 Uk+3.944
 Kah+3.874
 Outbreak+3.847
 Melt+3.801
Neg logit contributions
thora-6.290
icket-6.062
oli-5.916
ashtra-5.854
 incent-5.819
Token12
Token ID1065
Feature activation+1.39e-01

Pos logit contributions
 contrary+2.711
posts+2.620
 Uk+2.467
 Kah+2.423
+2.325
Neg logit contributions
thora-6.443
 incent-6.319
 distingu-6.092
ashtra-6.069
emouth-6.057
Token I
Token ID314
Feature activation-4.64e-01

Pos logit contributions
thora+0.365
icket+0.349
oli+0.343
 incent+0.338
ashtra+0.338
Neg logit contributions
 contrary-0.188
 Uk-0.182
 Kah-0.174
 Outbreak-0.172
 Melt-0.172
Token'm
Token ID1101
Feature activation+9.22e-02

Pos logit contributions
 contrary+0.796
 Uk+0.769
 Kah+0.752
 Melt+0.741
 KS+0.732
Neg logit contributions
thora-1.066
icket-1.002
oli-0.980
ashtra-0.978
emouth-0.978
Token glad
Token ID9675
Feature activation-1.37e-01

Pos logit contributions
thora+0.193
icket+0.182
oli+0.177
 incent+0.175
ashtra+0.174
Neg logit contributions
 contrary-0.176
 Uk-0.171
 Kah-0.166
 Outbreak-0.165
 Melt-0.164
Token June
Token ID2795
Feature activation-9.33e-01

Pos logit contributions
 contrary+0.640
 Uk+0.617
 Kah+0.596
 Melt+0.586
 Outbreak+0.586
Neg logit contributions
thora-1.237
icket-1.185
oli-1.160
 incent-1.151
ashtra-1.150
Token 2012
Token ID2321
Feature activation+1.13e-01

Pos logit contributions
 contrary+1.345
 Uk+1.309
 Kah+1.267
 Melt+1.240
 Outbreak+1.236
Neg logit contributions
thora-2.413
icket-2.312
oli-2.252
 incent-2.249
ashtra-2.232
Token Canada
Token ID3340
Feature activation-5.49e-01

Pos logit contributions
thora+0.293
icket+0.280
oli+0.275
 incent+0.271
ashtra+0.271
Neg logit contributions
 contrary-0.156
 Uk-0.149
 Kah-0.143
 Outbreak-0.142
 Melt-0.141
Token 16
Token ID1467
Feature activation-5.22e-01

Pos logit contributions
 contrary+0.724
 Uk+0.707
 Kah+0.682
 Melt+0.671
 Outbreak+0.665
Neg logit contributions
thora-1.480
icket-1.420
oli-1.385
 incent-1.380
ashtra-1.375
Token10
Token ID940
Feature activation-2.21e-01

Pos logit contributions
 contrary+0.937
 Uk+0.914
 Kah+0.888
 Outbreak+0.880
 Melt+0.879
Neg logit contributions
thora-1.152
icket-1.093
oli-1.057
ashtra-1.054
 incent-1.052
Token Posts
Token ID12043
Feature activation-2.57e+00

Pos logit contributions
 contrary+0.103
 Uk+0.093
 Kah+0.082
 Outbreak+0.078
 Melt+0.078
Neg logit contributions
thora-0.778
icket-0.753
oli-0.742
 incent-0.738
ashtra-0.736
Token #
Token ID1303
Feature activation-2.32e+00

Pos logit contributions
 contrary+3.982
 Uk+3.848
 Kah+3.756
 Outbreak+3.736
 Melt+3.692
Neg logit contributions
thora-6.097
icket-5.871
oli-5.733
ashtra-5.665
 incent-5.651
Token14
Token ID1415
Feature activation-1.53e-01

Pos logit contributions
 contrary+2.797
posts+2.685
 Uk+2.661
 Outbreak+2.501
 KS+2.455
Neg logit contributions
thora-6.502
 incent-6.463
 distingu-6.257
emouth-6.183
ashtra-6.084
Token On
Token ID1550
Feature activation-9.56e-01

Pos logit contributions
 contrary+0.204
 Uk+0.197
 Kah+0.189
 Outbreak+0.187
 Melt+0.186
Neg logit contributions
thora-0.406
icket-0.388
oli-0.381
ashtra-0.377
 incent-0.376
Token September
Token ID2693
Feature activation-7.13e-01

Pos logit contributions
 contrary+1.102
 Uk+1.025
 Kah+0.993
 Outbreak+0.982
 Melt+0.970
Neg logit contributions
thora-2.737
icket-2.615
 incent-2.600
ashtra-2.575
oli-2.560
Token 20
Token ID1160
Feature activation-1.63e-01

Pos logit contributions
 contrary+0.908
 Uk+0.869
 Kah+0.835
 Outbreak+0.821
 Melt+0.819
Neg logit contributions
thora-1.927
icket-1.847
oli-1.818
 incent-1.788
iral-1.783
Token 2002
Token ID6244
Feature activation+1.71e-02

Pos logit contributions
 contrary+1.181
 Uk+1.144
 Kah+1.112
 Outbreak+1.089
 Melt+1.084
Neg logit contributions
thora-2.150
icket-2.062
oli-2.007
 incent-2.006
ashtra-1.987
Token United
Token ID1578
Feature activation-3.44e-01

Pos logit contributions
thora+0.044
icket+0.042
oli+0.042
 incent+0.041
ashtra+0.041
Neg logit contributions
 contrary-0.024
 Uk-0.023
 Kah-0.022
 Outbreak-0.022
 Melt-0.021
Token States
Token ID1829
Feature activation+1.10e-01

Pos logit contributions
 contrary+0.496
 Uk+0.473
 Kah+0.453
 Outbreak+0.453
 Melt+0.449
Neg logit contributions
thora-0.873
icket-0.828
oli-0.819
 incent-0.807
iral-0.805
Token 3
Token ID513
Feature activation-6.01e-01

Pos logit contributions
thora+0.287
icket+0.274
oli+0.269
 incent+0.266
ashtra+0.265
Neg logit contributions
 contrary-0.152
 Uk-0.146
 Kah-0.141
 Outbreak-0.139
 Melt-0.139
Token472
Token ID37856
Feature activation+2.45e-01

Pos logit contributions
 contrary+1.175
 Uk+1.151
 Kah+1.119
 Outbreak+1.113
 Melt+1.110
Neg logit contributions
thora-1.216
icket-1.159
oli-1.116
ashtra-1.112
 incent-1.112
Token Posts
Token ID12043
Feature activation-2.56e+00

Pos logit contributions
thora+0.857
icket+0.829
oli+0.819
 incent+0.810
ashtra+0.808
Neg logit contributions
 contrary-0.118
 Uk-0.105
 Kah-0.092
 Outbreak-0.088
 Melt-0.087
Token #
Token ID1303
Feature activation-2.37e+00

Pos logit contributions
 contrary+3.827
 Uk+3.694
 Kah+3.625
 Outbreak+3.610
 Melt+3.565
Neg logit contributions
thora-6.212
icket-5.993
oli-5.832
ashtra-5.795
emouth-5.721
Token11
Token ID1157
Feature activation-2.79e-01

Pos logit contributions
 contrary+2.189
posts+1.937
 Kah+1.818
 Uk+1.770
 Dispatch+1.707
Neg logit contributions
 incent-7.383
thora-7.361
 distingu-7.125
emouth-7.072
ashtra-6.988
Token On
Token ID1550
Feature activation-9.52e-01

Pos logit contributions
 contrary+0.398
 Uk+0.387
 Kah+0.373
 Outbreak+0.368
 Melt+0.367
Neg logit contributions
thora-0.714
icket-0.683
oli-0.668
ashtra-0.661
 incent-0.658
Token September
Token ID2693
Feature activation-7.44e-01

Pos logit contributions
 contrary+1.294
 Uk+1.228
 Kah+1.201
 Melt+1.175
 Outbreak+1.168
Neg logit contributions
thora-2.551
icket-2.414
ashtra-2.343
oli-2.336
 incent-2.335
Token 20
Token ID1160
Feature activation-1.51e-01

Pos logit contributions
 contrary+0.899
 Uk+0.870
 Kah+0.833
 Melt+0.820
 Outbreak+0.815
Neg logit contributions
thora-2.089
icket-2.003
 incent-1.963
oli-1.957
ashtra-1.947
Token 2010
Token ID3050
Feature activation+1.61e-01

Pos logit contributions
 contrary+0.490
 Uk+0.475
 Kah+0.460
 Melt+0.452
 Outbreak+0.450
Neg logit contributions
thora-0.868
icket-0.830
oli-0.812
 incent-0.808
ashtra-0.802
Token United
Token ID1578
Feature activation-3.13e-01

Pos logit contributions
thora+0.418
icket+0.399
oli+0.393
 incent+0.387
ashtra+0.386
Neg logit contributions
 contrary-0.222
 Uk-0.212
 Kah-0.204
 Outbreak-0.203
 Melt-0.202
Token States
Token ID1829
Feature activation+1.05e-01

Pos logit contributions
 contrary+0.424
 Uk+0.413
 Kah+0.398
 Melt+0.391
 Outbreak+0.389
Neg logit contributions
thora-0.824
icket-0.793
oli-0.772
 incent-0.772
emouth-0.763
Token 6
Token ID718
Feature activation-4.22e-01

Pos logit contributions
thora+0.273
icket+0.262
oli+0.257
 incent+0.254
ashtra+0.253
Neg logit contributions
 contrary-0.144
 Uk-0.139
 Kah-0.134
 Outbreak-0.132
 Melt-0.132
Token83
Token ID5999
Feature activation-2.24e-01

Pos logit contributions
 contrary+0.737
 Uk+0.719
 Kah+0.697
 Outbreak+0.694
 Melt+0.692
Neg logit contributions
thora-0.948
icket-0.901
ashtra-0.871
oli-0.871
 incent-0.869
Token Posts
Token ID12043
Feature activation-2.54e+00

Pos logit contributions
 contrary+0.097
 Uk+0.087
 Kah+0.076
 Outbreak+0.072
 Melt+0.072
Neg logit contributions
thora-0.797
icket-0.771
oli-0.760
 incent-0.758
ashtra-0.755
Token Last
Token ID4586
Feature activation-8.75e-01

Pos logit contributions
 contrary+3.907
 Uk+3.781
 Kah+3.684
 Outbreak+3.655
 Melt+3.620
Neg logit contributions
thora-6.088
icket-5.854
oli-5.711
ashtra-5.649
 incent-5.638
Token Edited
Token ID34212
Feature activation-7.11e-01

Pos logit contributions
 contrary+1.494
 Uk+1.444
 Outbreak+1.410
 Kah+1.407
 KS+1.399
Neg logit contributions
thora-2.016
 incent-1.905
icket-1.900
emouth-1.866
ashtra-1.854
Token:
Token ID25
Feature activation-9.70e-01

Pos logit contributions
 contrary+0.892
 Uk+0.849
 Kah+0.815
 Melt+0.809
 Outbreak+0.808
Neg logit contributions
thora-1.959
icket-1.870
 incent-1.845
oli-1.830
ashtra-1.822
Token 2013
Token ID2211
Feature activation+1.88e-01

Pos logit contributions
 contrary+1.219
 Uk+1.183
 Kah+1.117
 Outbreak+1.106
 Melt+1.105
Neg logit contributions
thora-2.676
icket-2.547
 incent-2.537
oli-2.514
ashtra-2.503
Token-
Token ID12
Feature activation-2.84e-01

Pos logit contributions
thora+0.485
icket+0.464
oli+0.455
 incent+0.450
ashtra+0.448
Neg logit contributions
 contrary-0.263
 Uk-0.254
 Kah-0.244
 Outbreak-0.241
 Melt-0.240
Token 2011
Token ID2813
Feature activation+2.56e-02

Pos logit contributions
 contrary+1.311
 Uk+1.272
 Kah+1.235
 Melt+1.206
 Outbreak+1.199
Neg logit contributions
thora-2.447
icket-2.343
 incent-2.286
oli-2.286
ashtra-2.259
Token United
Token ID1578
Feature activation-3.73e-01

Pos logit contributions
thora+0.067
icket+0.064
oli+0.063
 incent+0.062
ashtra+0.062
Neg logit contributions
 contrary-0.035
 Uk-0.033
 Kah-0.032
 Outbreak-0.032
 Melt-0.032
Token States
Token ID1829
Feature activation+1.66e-01

Pos logit contributions
 contrary+0.560
 Uk+0.534
 Outbreak+0.515
 Kah+0.512
posts+0.510
Neg logit contributions
thora-0.924
icket-0.873
oli-0.866
iral-0.851
 incent-0.850
Token 6
Token ID718
Feature activation-3.96e-01

Pos logit contributions
thora+0.432
icket+0.414
oli+0.406
 incent+0.401
ashtra+0.400
Neg logit contributions
 contrary-0.226
 Uk-0.218
 Kah-0.209
 Outbreak-0.207
 Melt-0.206
Token12
Token ID1065
Feature activation-1.07e-01

Pos logit contributions
 contrary+0.738
 Uk+0.721
 Kah+0.700
 Outbreak+0.698
 Melt+0.695
Neg logit contributions
thora-0.846
icket-0.803
ashtra-0.776
oli-0.774
 incent-0.772
Token Posts
Token ID12043
Feature activation-2.54e+00

Pos logit contributions
 contrary+0.060
 Uk+0.055
 Kah+0.050
 Outbreak+0.048
 Melt+0.048
Neg logit contributions
thora-0.367
icket-0.355
oli-0.350
 incent-0.348
ashtra-0.347
Token #
Token ID1303
Feature activation-2.25e+00

Pos logit contributions
 contrary+3.832
 Uk+3.706
 Kah+3.607
 Outbreak+3.584
 Melt+3.546
Neg logit contributions
thora-6.143
icket-5.914
oli-5.776
ashtra-5.712
 incent-5.691
Token17
Token ID1558
Feature activation-2.10e-01

Pos logit contributions
 contrary+2.533
 Uk+2.287
posts+2.197
 Outbreak+2.153
 Kah+2.131
Neg logit contributions
thora-6.529
 incent-6.367
ashtra-6.219
 distingu-6.140
emouth-6.114
Token Good
Token ID4599
Feature activation+8.59e-02

Pos logit contributions
 contrary+0.290
 Uk+0.281
 Kah+0.270
 Outbreak+0.267
 Melt+0.266
Neg logit contributions
thora-0.549
icket-0.526
oli-0.514
ashtra-0.509
 incent-0.507
Token to
Token ID284
Feature activation+2.67e-02

Pos logit contributions
thora+0.205
icket+0.195
oli+0.191
 incent+0.189
ashtra+0.188
Neg logit contributions
 contrary-0.138
 Uk-0.133
 Kah-0.129
 Outbreak-0.127
 Melt-0.127
Token hear
Token ID3285
Feature activation+8.84e-01

Pos logit contributions
thora+0.047
icket+0.043
oli+0.042
ashtra+0.042
emouth+0.041
Neg logit contributions
 contrary-0.060
 Uk-0.059
 Kah-0.058
 Melt-0.057
 Outbreak-0.057
Token 2010
Token ID3050
Feature activation-5.83e-02

Pos logit contributions
 contrary+0.446
 Uk+0.431
 Kah+0.417
 Outbreak+0.410
 Melt+0.410
Neg logit contributions
thora-0.803
icket-0.768
oli-0.752
 incent-0.746
ashtra-0.742
Token United
Token ID1578
Feature activation-3.76e-01

Pos logit contributions
 contrary+0.081
 Uk+0.077
 Kah+0.074
 Outbreak+0.074
 Melt+0.073
Neg logit contributions
thora-0.151
icket-0.144
oli-0.142
 incent-0.140
ashtra-0.140
Token States
Token ID1829
Feature activation+1.34e-02

Pos logit contributions
 contrary+0.556
 Uk+0.530
 Kah+0.509
 Outbreak+0.509
posts+0.506
Neg logit contributions
thora-0.943
icket-0.893
oli-0.883
 incent-0.870
iral-0.868
Token 21
Token ID2310
Feature activation-5.54e-01

Pos logit contributions
thora+0.035
icket+0.034
oli+0.033
 incent+0.033
ashtra+0.033
Neg logit contributions
 contrary-0.018
 Uk-0.018
 Kah-0.017
 Outbreak-0.017
 Melt-0.017
Token50
Token ID1120
Feature activation+1.76e-02

Pos logit contributions
 contrary+0.773
 Uk+0.756
 Kah+0.721
 Outbreak+0.719
posts+0.713
Neg logit contributions
thora-1.444
icket-1.379
 incent-1.349
oli-1.347
ashtra-1.344
Token Posts
Token ID12043
Feature activation-2.51e+00

Pos logit contributions
thora+0.061
icket+0.059
oli+0.059
 incent+0.058
ashtra+0.058
Neg logit contributions
 contrary-0.008
 Uk-0.008
 Kah-0.007
 Outbreak-0.006
 Melt-0.006
Token #
Token ID1303
Feature activation-2.27e+00

Pos logit contributions
 contrary+3.897
 Uk+3.760
 Kah+3.669
 Outbreak+3.641
 Melt+3.602
Neg logit contributions
thora-6.005
icket-5.776
oli-5.636
ashtra-5.572
 incent-5.551
Token16
Token ID1433
Feature activation-2.46e-01

Pos logit contributions
 contrary+3.631
posts+3.326
 Uk+3.281
 Kah+3.242
 Outbreak+3.164
Neg logit contributions
thora-5.628
 incent-5.481
 distingu-5.221
icket-5.195
emouth-5.158
Token Glad
Token ID28442
Feature activation+2.81e-01

Pos logit contributions
 contrary+0.327
 Uk+0.316
 Kah+0.303
 Outbreak+0.300
 Melt+0.298
Neg logit contributions
thora-0.655
icket-0.628
oli-0.615
ashtra-0.609
 incent-0.606
Token to
Token ID284
Feature activation+1.08e-01

Pos logit contributions
thora+0.670
icket+0.638
oli+0.624
 incent+0.618
ashtra+0.616
Neg logit contributions
 contrary-0.450
 Uk-0.437
 Kah-0.422
 Melt-0.417
 Outbreak-0.416
Token hear
Token ID3285
Feature activation+8.48e-01

Pos logit contributions
thora+0.203
icket+0.188
oli+0.185
ashtra+0.182
emouth+0.181
Neg logit contributions
 contrary-0.232
 Uk-0.226
 Kah-0.220
 Melt-0.219
 Outbreak-0.218
Token 2010
Token ID3050
Feature activation-9.75e-03

Pos logit contributions
 contrary+1.272
 Uk+1.241
 Kah+1.206
 Melt+1.177
 Outbreak+1.173
Neg logit contributions
thora-2.345
icket-2.248
 incent-2.195
oli-2.188
ashtra-2.166
Token United
Token ID1578
Feature activation-4.34e-01

Pos logit contributions
 contrary+0.013
 Uk+0.013
 Kah+0.012
 Outbreak+0.012
 Melt+0.012
Neg logit contributions
thora-0.025
icket-0.024
oli-0.024
 incent-0.024
ashtra-0.024
Token States
Token ID1829
Feature activation+7.65e-02

Pos logit contributions
 contrary+0.644
 Uk+0.614
 Outbreak+0.590
 Kah+0.589
posts+0.585
Neg logit contributions
thora-1.088
icket-1.030
oli-1.020
 incent-1.004
iral-1.002
Token 10
Token ID838
Feature activation-3.88e-01

Pos logit contributions
thora+0.200
icket+0.191
oli+0.188
 incent+0.186
ashtra+0.185
Neg logit contributions
 contrary-0.105
 Uk-0.101
 Kah-0.097
 Outbreak-0.096
 Melt-0.096
Token53
Token ID4310
Feature activation-3.69e-02

Pos logit contributions
 contrary+0.680
 Uk+0.666
 Kah+0.647
 Outbreak+0.641
 Melt+0.640
Neg logit contributions
thora-0.870
icket-0.829
oli-0.802
 incent-0.797
ashtra-0.797
Token Posts
Token ID12043
Feature activation-2.47e+00

Pos logit contributions
 contrary+0.017
 Uk+0.015
 Kah+0.013
 Outbreak+0.012
 Melt+0.012
Neg logit contributions
thora-0.131
icket-0.126
oli-0.125
 incent-0.124
ashtra-0.124
Token #
Token ID1303
Feature activation-2.33e+00

Pos logit contributions
 contrary+3.935
 Uk+3.799
 Kah+3.714
 Outbreak+3.682
 Melt+3.646
Neg logit contributions
thora-5.799
icket-5.576
oli-5.439
ashtra-5.377
 incent-5.367
Token15
Token ID1314
Feature activation-1.92e-01

Pos logit contributions
 contrary+2.742
 Uk+2.561
posts+2.554
 Kah+2.447
 Outbreak+2.386
Neg logit contributions
thora-6.592
 incent-6.440
 distingu-6.242
emouth-6.088
icket-6.085
Token honestly
Token ID12698
Feature activation-4.09e-01

Pos logit contributions
 contrary+0.258
 Uk+0.249
 Kah+0.239
 Outbreak+0.236
 Melt+0.235
Neg logit contributions
thora-0.508
icket-0.487
oli-0.477
ashtra-0.472
 incent-0.471
Token if
Token ID611
Feature activation+6.49e-02

Pos logit contributions
 contrary+0.655
 Uk+0.632
 Kah+0.610
 Melt+0.603
 Outbreak+0.600
Neg logit contributions
thora-0.986
icket-0.939
oli-0.916
ashtra-0.909
 incent-0.901
Token we
Token ID356
Feature activation-1.56e-01

Pos logit contributions
thora+0.148
icket+0.141
oli+0.138
 incent+0.136
ashtra+0.136
Neg logit contributions
 contrary-0.111
 Uk-0.108
 Kah-0.104
 Melt-0.103
 Outbreak-0.103
Token June
Token ID2795
Feature activation-7.43e-01

Pos logit contributions
 contrary+0.503
 Uk+0.485
 Kah+0.468
 Melt+0.460
 Outbreak+0.459
Neg logit contributions
thora-0.973
icket-0.932
oli-0.913
 incent-0.905
ashtra-0.904
Token 2009
Token ID3717
Feature activation+1.15e-03

Pos logit contributions
 contrary+1.077
 Uk+1.047
 Kah+1.010
 Melt+0.992
 Outbreak+0.989
Neg logit contributions
thora-1.915
icket-1.833
oli-1.785
 incent-1.782
ashtra-1.770
Token Germany
Token ID4486
Feature activation-6.55e-01

Pos logit contributions
thora+0.003
icket+0.003
oli+0.003
 incent+0.003
ashtra+0.003
Neg logit contributions
 contrary-0.002
 Uk-0.002
 Kah-0.001
 Outbreak-0.001
 Melt-0.001
Token 247
Token ID30179
Feature activation-3.09e-01

Pos logit contributions
 contrary+0.864
 Uk+0.843
 Kah+0.810
 Melt+0.796
 Outbreak+0.793
Neg logit contributions
thora-1.762
icket-1.693
oli-1.651
 incent-1.646
ashtra-1.637
Token34
Token ID2682
Feature activation+6.83e-02

Pos logit contributions
 contrary+0.365
 Uk+0.352
 Kah+0.338
 Outbreak+0.332
 Melt+0.331
Neg logit contributions
thora-0.870
icket-0.836
oli-0.815
ashtra-0.812
 incent-0.811
Token Posts
Token ID12043
Feature activation-2.42e+00

Pos logit contributions
thora+0.240
icket+0.232
oli+0.229
 incent+0.227
ashtra+0.227
Neg logit contributions
 contrary-0.032
 Uk-0.028
 Kah-0.025
 Outbreak-0.024
 Melt-0.024
Token #
Token ID1303
Feature activation-2.31e+00

Pos logit contributions
 contrary+3.709
 Uk+3.585
 Kah+3.499
 Outbreak+3.472
 Melt+3.431
Neg logit contributions
thora-5.820
icket-5.604
oli-5.464
ashtra-5.409
 incent-5.387
Token20
Token ID1238
Feature activation-1.83e-01

Pos logit contributions
 contrary+2.987
 Uk+2.806
posts+2.789
 Kah+2.707
 KS+2.691
Neg logit contributions
thora-6.284
 incent-6.079
 distingu-5.858
ashtra-5.824
 diarr-5.747
Token Nice
Token ID18460
Feature activation-3.74e-01

Pos logit contributions
 contrary+0.268
 Uk+0.260
 Kah+0.251
 Outbreak+0.248
 Melt+0.247
Neg logit contributions
thora-0.462
icket-0.441
oli-0.431
ashtra-0.427
 incent-0.425
Token,
Token ID11
Feature activation+3.19e-02

Pos logit contributions
 contrary+0.596
 Uk+0.580
 Kah+0.561
 Melt+0.552
 Outbreak+0.552
Neg logit contributions
thora-0.897
icket-0.849
oli-0.829
ashtra-0.827
 incent-0.822
Token happy
Token ID3772
Feature activation-4.27e-01

Pos logit contributions
thora+0.072
icket+0.068
oli+0.066
ashtra+0.066
 incent+0.065
Neg logit contributions
 contrary-0.056
 Uk-0.055
 Kah-0.053
 Melt-0.052
 Outbreak-0.052
Token 2011
Token ID2813
Feature activation+1.71e-01

Pos logit contributions
 contrary+0.535
 Uk+0.518
 Kah+0.501
 Melt+0.493
 Outbreak+0.493
Neg logit contributions
thora-0.911
icket-0.870
oli-0.851
 incent-0.845
ashtra-0.839
Token United
Token ID1578
Feature activation-4.75e-01

Pos logit contributions
thora+0.443
icket+0.424
oli+0.417
 incent+0.411
ashtra+0.410
Neg logit contributions
 contrary-0.236
 Uk-0.226
 Kah-0.217
 Outbreak-0.215
 Melt-0.214
Token States
Token ID1829
Feature activation+1.97e-02

Pos logit contributions
 contrary+0.703
 Uk+0.670
 Kah+0.644
 Outbreak+0.644
posts+0.638
Neg logit contributions
thora-1.191
icket-1.128
oli-1.116
 incent-1.098
iral-1.096
Token 228
Token ID29041
Feature activation-5.52e-01

Pos logit contributions
thora+0.051
icket+0.049
oli+0.048
 incent+0.048
ashtra+0.048
Neg logit contributions
 contrary-0.027
 Uk-0.026
 Kah-0.025
 Outbreak-0.025
 Melt-0.025
Token65
Token ID2996
Feature activation+3.01e-01

Pos logit contributions
 contrary+0.828
 Uk+0.807
 Kah+0.780
 Outbreak+0.771
 Melt+0.768
Neg logit contributions
thora-1.379
icket-1.321
oli-1.282
 incent-1.276
ashtra-1.274
Token Posts
Token ID12043
Feature activation-2.40e+00

Pos logit contributions
thora+1.049
icket+1.015
oli+1.003
 incent+0.991
ashtra+0.989
Neg logit contributions
 contrary-0.146
 Uk-0.130
 Kah-0.114
 Outbreak-0.110
 Melt-0.109
Token #
Token ID1303
Feature activation-2.31e+00

Pos logit contributions
 contrary+3.721
 Uk+3.599
 Kah+3.517
 Outbreak+3.493
 Melt+3.454
Neg logit contributions
thora-5.729
icket-5.503
oli-5.372
ashtra-5.312
 incent-5.288
Token13
Token ID1485
Feature activation+8.28e-02

Pos logit contributions
 contrary+3.254
posts+3.023
 Uk+3.005
 Dispatch+2.834
 Kah+2.833
Neg logit contributions
thora-6.054
 incent-5.990
 distingu-5.806
emouth-5.751
ashtra-5.687
Token On
Token ID1550
Feature activation-9.12e-01

Pos logit contributions
thora+0.217
icket+0.208
oli+0.204
 incent+0.201
ashtra+0.201
Neg logit contributions
 contrary-0.113
 Uk-0.109
 Kah-0.105
 Outbreak-0.103
 Melt-0.103
Token September
Token ID2693
Feature activation-6.71e-01

Pos logit contributions
 contrary+1.052
 Uk+0.984
 Kah+0.952
 Outbreak+0.937
 Melt+0.931
Neg logit contributions
thora-2.613
icket-2.498
 incent-2.470
ashtra-2.457
oli-2.444
Token 20
Token ID1160
Feature activation-9.85e-02

Pos logit contributions
 contrary+0.823
 Uk+0.788
 Kah+0.754
 Outbreak+0.744
 Melt+0.742
Neg logit contributions
thora-1.847
icket-1.772
oli-1.743
 incent-1.721
ashtra-1.715
Token TL
Token ID24811
Feature activation-6.80e-02

Pos logit contributions
thora+0.257
icket+0.246
oli+0.242
 incent+0.238
ashtra+0.238
Neg logit contributions
 contrary-0.140
 Uk-0.133
 Kah-0.128
 Outbreak-0.128
 Melt-0.127
TokenAD
Token ID2885
Feature activation-4.54e-01

Pos logit contributions
 contrary+0.101
 Uk+0.098
 Kah+0.094
 Outbreak+0.093
 Melt+0.093
Neg logit contributions
thora-0.169
icket-0.162
oli-0.159
 incent-0.157
ashtra-0.156
TokenT
Token ID51
Feature activation-3.04e-01

Pos logit contributions
 contrary+0.791
 Uk+0.767
 Kah+0.747
 Outbreak+0.736
 Melt+0.735
Neg logit contributions
thora-1.021
icket-0.967
oli-0.948
 incent-0.942
ashtra-0.934
Token 235
Token ID28878
Feature activation-7.93e-02

Pos logit contributions
 contrary+0.438
 Uk+0.426
 Kah+0.409
 Outbreak+0.404
 Melt+0.402
Neg logit contributions
thora-0.776
icket-0.737
oli-0.725
 incent-0.719
ashtra-0.716
Token21
Token ID2481
Feature activation+1.88e-02

Pos logit contributions
 contrary+0.129
 Uk+0.126
 Kah+0.121
 Outbreak+0.120
 Melt+0.120
Neg logit contributions
thora-0.187
icket-0.179
oli-0.174
 incent-0.172
ashtra-0.172
Token Posts
Token ID12043
Feature activation-2.37e+00

Pos logit contributions
thora+0.066
icket+0.064
oli+0.063
 incent+0.063
ashtra+0.062
Neg logit contributions
 contrary-0.009
 Uk-0.008
 Kah-0.007
 Outbreak-0.007
 Melt-0.007
Token #
Token ID1303
Feature activation-2.24e+00

Pos logit contributions
 contrary+3.604
 Uk+3.481
 Kah+3.392
 Outbreak+3.368
 Melt+3.330
Neg logit contributions
thora-5.729
icket-5.510
oli-5.381
ashtra-5.318
 incent-5.300
Token18
Token ID1507
Feature activation-3.65e-02

Pos logit contributions
 contrary+3.536
posts+3.350
 Uk+3.303
 Kah+3.206
 Outbreak+3.202
Neg logit contributions
thora-5.459
 incent-5.334
 distingu-5.120
ashtra-5.109
emouth-5.002
Token Good
Token ID4599
Feature activation+4.00e-02

Pos logit contributions
 contrary+0.051
 Uk+0.049
 Kah+0.047
 Outbreak+0.047
 Melt+0.047
Neg logit contributions
thora-0.095
icket-0.091
oli-0.089
ashtra-0.088
 incent-0.088
Token stuff
Token ID3404
Feature activation-1.35e-01

Pos logit contributions
thora+0.096
icket+0.092
oli+0.090
 incent+0.089
ashtra+0.088
Neg logit contributions
 contrary-0.063
 Uk-0.061
 Kah-0.059
 Outbreak-0.059
 Melt-0.059
Token!
Token ID0
Feature activation-3.74e-01

Pos logit contributions
 contrary+0.196
 Uk+0.190
 Kah+0.184
 Melt+0.181
 Outbreak+0.181
Neg logit contributions
thora-0.343
icket-0.326
oli-0.319
ashtra-0.317
 incent-0.317
Token United
Token ID1578
Feature activation-3.44e-01

Pos logit contributions
thora+0.044
icket+0.042
oli+0.042
 incent+0.041
ashtra+0.041
Neg logit contributions
 contrary-0.024
 Uk-0.023
 Kah-0.022
 Outbreak-0.022
 Melt-0.021
Token States
Token ID1829
Feature activation+1.10e-01

Pos logit contributions
 contrary+0.496
 Uk+0.473
 Kah+0.453
 Outbreak+0.453
 Melt+0.449
Neg logit contributions
thora-0.873
icket-0.828
oli-0.819
 incent-0.807
iral-0.805
Token 3
Token ID513
Feature activation-6.01e-01

Pos logit contributions
thora+0.287
icket+0.274
oli+0.269
 incent+0.266
ashtra+0.265
Neg logit contributions
 contrary-0.152
 Uk-0.146
 Kah-0.141
 Outbreak-0.139
 Melt-0.139
Token472
Token ID37856
Feature activation+2.45e-01

Pos logit contributions
 contrary+1.175
 Uk+1.151
 Kah+1.119
 Outbreak+1.113
 Melt+1.110
Neg logit contributions
thora-1.216
icket-1.159
oli-1.116
ashtra-1.112
 incent-1.112
Token Posts
Token ID12043
Feature activation-2.56e+00

Pos logit contributions
thora+0.857
icket+0.829
oli+0.819
 incent+0.810
ashtra+0.808
Neg logit contributions
 contrary-0.118
 Uk-0.105
 Kah-0.092
 Outbreak-0.088
 Melt-0.087
Token #
Token ID1303
Feature activation-2.37e+00

Pos logit contributions
 contrary+3.827
 Uk+3.694
 Kah+3.625
 Outbreak+3.610
 Melt+3.565
Neg logit contributions
thora-6.212
icket-5.993
oli-5.832
ashtra-5.795
emouth-5.721
Token11
Token ID1157
Feature activation-2.79e-01

Pos logit contributions
 contrary+2.189
posts+1.937
 Kah+1.818
 Uk+1.770
 Dispatch+1.707
Neg logit contributions
 incent-7.383
thora-7.361
 distingu-7.125
emouth-7.072
ashtra-6.988
Token On
Token ID1550
Feature activation-9.52e-01

Pos logit contributions
 contrary+0.398
 Uk+0.387
 Kah+0.373
 Outbreak+0.368
 Melt+0.367
Neg logit contributions
thora-0.714
icket-0.683
oli-0.668
ashtra-0.661
 incent-0.658
Token September
Token ID2693
Feature activation-7.44e-01

Pos logit contributions
 contrary+1.294
 Uk+1.228
 Kah+1.201
 Melt+1.175
 Outbreak+1.168
Neg logit contributions
thora-2.551
icket-2.414
ashtra-2.343
oli-2.336
 incent-2.335
Token 20
Token ID1160
Feature activation-1.51e-01

Pos logit contributions
 contrary+0.899
 Uk+0.870
 Kah+0.833
 Melt+0.820
 Outbreak+0.815
Neg logit contributions
thora-2.089
icket-2.003
 incent-1.963
oli-1.957
ashtra-1.947
Token 2013
Token ID2211
Feature activation+5.78e-01

Pos logit contributions
 contrary+0.220
 Uk+0.213
 Kah+0.205
 Outbreak+0.203
 Melt+0.203
Neg logit contributions
thora-0.385
icket-0.368
oli-0.359
 incent-0.357
ashtra-0.357
Token United
Token ID1578
Feature activation-4.34e-01

Pos logit contributions
 contrary+0.013
 Uk+0.013
 Kah+0.012
 Outbreak+0.012
 Melt+0.012
Neg logit contributions
thora-0.025
icket-0.024
oli-0.024
 incent-0.024
ashtra-0.024
Token States
Token ID1829
Feature activation+7.65e-02

Pos logit contributions
 contrary+0.644
 Uk+0.614
 Outbreak+0.590
 Kah+0.589
posts+0.585
Neg logit contributions
thora-1.088
icket-1.030
oli-1.020
 incent-1.004
iral-1.002
Token 10
Token ID838
Feature activation-3.88e-01

Pos logit contributions
thora+0.200
icket+0.191
oli+0.188
 incent+0.186
ashtra+0.185
Neg logit contributions
 contrary-0.105
 Uk-0.101
 Kah-0.097
 Outbreak-0.096
 Melt-0.096
Token53
Token ID4310
Feature activation-3.69e-02

Pos logit contributions
 contrary+0.680
 Uk+0.666
 Kah+0.647
 Outbreak+0.641
 Melt+0.640
Neg logit contributions
thora-0.870
icket-0.829
oli-0.802
 incent-0.797
ashtra-0.797
Token Posts
Token ID12043
Feature activation-2.47e+00

Pos logit contributions
 contrary+0.017
 Uk+0.015
 Kah+0.013
 Outbreak+0.012
 Melt+0.012
Neg logit contributions
thora-0.131
icket-0.126
oli-0.125
 incent-0.124
ashtra-0.124
Token #
Token ID1303
Feature activation-2.33e+00

Pos logit contributions
 contrary+3.935
 Uk+3.799
 Kah+3.714
 Outbreak+3.682
 Melt+3.646
Neg logit contributions
thora-5.799
icket-5.576
oli-5.439
ashtra-5.377
 incent-5.367
Token15
Token ID1314
Feature activation-1.92e-01

Pos logit contributions
 contrary+2.742
 Uk+2.561
posts+2.554
 Kah+2.447
 Outbreak+2.386
Neg logit contributions
thora-6.592
 incent-6.440
 distingu-6.242
emouth-6.088
icket-6.085
Token honestly
Token ID12698
Feature activation-4.09e-01

Pos logit contributions
 contrary+0.258
 Uk+0.249
 Kah+0.239
 Outbreak+0.236
 Melt+0.235
Neg logit contributions
thora-0.508
icket-0.487
oli-0.477
ashtra-0.472
 incent-0.471
Token if
Token ID611
Feature activation+6.49e-02

Pos logit contributions
 contrary+0.655
 Uk+0.632
 Kah+0.610
 Melt+0.603
 Outbreak+0.600
Neg logit contributions
thora-0.986
icket-0.939
oli-0.916
ashtra-0.909
 incent-0.901
Token we
Token ID356
Feature activation-1.56e-01

Pos logit contributions
thora+0.148
icket+0.141
oli+0.138
 incent+0.136
ashtra+0.136
Neg logit contributions
 contrary-0.111
 Uk-0.108
 Kah-0.104
 Melt-0.103
 Outbreak-0.103
Token just
Token ID655
Feature activation-8.67e-01

Pos logit contributions
 contrary+0.272
 Uk+0.264
 Kah+0.257
 Melt+0.254
 KS+0.254
Neg logit contributions
thora-0.351
icket-0.330
oli-0.325
ashtra-0.321
emouth-0.317
Token 2012
Token ID2321
Feature activation+1.13e-01

Pos logit contributions
 contrary+1.345
 Uk+1.309
 Kah+1.267
 Melt+1.240
 Outbreak+1.236
Neg logit contributions
thora-2.413
icket-2.312
oli-2.252
 incent-2.249
ashtra-2.232
Token Canada
Token ID3340
Feature activation-5.49e-01

Pos logit contributions
thora+0.293
icket+0.280
oli+0.275
 incent+0.271
ashtra+0.271
Neg logit contributions
 contrary-0.156
 Uk-0.149
 Kah-0.143
 Outbreak-0.142
 Melt-0.141
Token 16
Token ID1467
Feature activation-5.22e-01

Pos logit contributions
 contrary+0.724
 Uk+0.707
 Kah+0.682
 Melt+0.671
 Outbreak+0.665
Neg logit contributions
thora-1.480
icket-1.420
oli-1.385
 incent-1.380
ashtra-1.375
Token10
Token ID940
Feature activation-2.21e-01

Pos logit contributions
 contrary+0.937
 Uk+0.914
 Kah+0.888
 Outbreak+0.880
 Melt+0.879
Neg logit contributions
thora-1.152
icket-1.093
oli-1.057
ashtra-1.054
 incent-1.052
Token Posts
Token ID12043
Feature activation-2.57e+00

Pos logit contributions
 contrary+0.103
 Uk+0.093
 Kah+0.082
 Outbreak+0.078
 Melt+0.078
Neg logit contributions
thora-0.778
icket-0.753
oli-0.742
 incent-0.738
ashtra-0.736
Token #
Token ID1303
Feature activation-2.32e+00

Pos logit contributions
 contrary+3.982
 Uk+3.848
 Kah+3.756
 Outbreak+3.736
 Melt+3.692
Neg logit contributions
thora-6.097
icket-5.871
oli-5.733
ashtra-5.665
 incent-5.651
Token14
Token ID1415
Feature activation-1.53e-01

Pos logit contributions
 contrary+2.797
posts+2.685
 Uk+2.661
 Outbreak+2.501
 KS+2.455
Neg logit contributions
thora-6.502
 incent-6.463
 distingu-6.257
emouth-6.183
ashtra-6.084
Token On
Token ID1550
Feature activation-9.56e-01

Pos logit contributions
 contrary+0.204
 Uk+0.197
 Kah+0.189
 Outbreak+0.187
 Melt+0.186
Neg logit contributions
thora-0.406
icket-0.388
oli-0.381
ashtra-0.377
 incent-0.376
Token September
Token ID2693
Feature activation-7.13e-01

Pos logit contributions
 contrary+1.102
 Uk+1.025
 Kah+0.993
 Outbreak+0.982
 Melt+0.970
Neg logit contributions
thora-2.737
icket-2.615
 incent-2.600
ashtra-2.575
oli-2.560
Token 20
Token ID1160
Feature activation-1.63e-01

Pos logit contributions
 contrary+0.908
 Uk+0.869
 Kah+0.835
 Outbreak+0.821
 Melt+0.819
Neg logit contributions
thora-1.927
icket-1.847
oli-1.818
 incent-1.788
iral-1.783
Token 2013
Token ID2211
Feature activation+6.84e-01

Pos logit contributions
 contrary+0.214
 Uk+0.204
 Kah+0.195
 Melt+0.193
 Outbreak+0.192
Neg logit contributions
thora-0.442
icket-0.422
 incent-0.416
oli-0.415
ashtra-0.410
Token United
Token ID1578
Feature activation-4.75e-01

Pos logit contributions
thora+0.443
icket+0.424
oli+0.417
 incent+0.411
ashtra+0.410
Neg logit contributions
 contrary-0.236
 Uk-0.226
 Kah-0.217
 Outbreak-0.215
 Melt-0.214
Token States
Token ID1829
Feature activation+1.97e-02

Pos logit contributions
 contrary+0.703
 Uk+0.670
 Kah+0.644
 Outbreak+0.644
posts+0.638
Neg logit contributions
thora-1.191
icket-1.128
oli-1.116
 incent-1.098
iral-1.096
Token 228
Token ID29041
Feature activation-5.52e-01

Pos logit contributions
thora+0.051
icket+0.049
oli+0.048
 incent+0.048
ashtra+0.048
Neg logit contributions
 contrary-0.027
 Uk-0.026
 Kah-0.025
 Outbreak-0.025
 Melt-0.025
Token65
Token ID2996
Feature activation+3.01e-01

Pos logit contributions
 contrary+0.828
 Uk+0.807
 Kah+0.780
 Outbreak+0.771
 Melt+0.768
Neg logit contributions
thora-1.379
icket-1.321
oli-1.282
 incent-1.276
ashtra-1.274
Token Posts
Token ID12043
Feature activation-2.40e+00

Pos logit contributions
thora+1.049
icket+1.015
oli+1.003
 incent+0.991
ashtra+0.989
Neg logit contributions
 contrary-0.146
 Uk-0.130
 Kah-0.114
 Outbreak-0.110
 Melt-0.109
Token #
Token ID1303
Feature activation-2.31e+00

Pos logit contributions
 contrary+3.721
 Uk+3.599
 Kah+3.517
 Outbreak+3.493
 Melt+3.454
Neg logit contributions
thora-5.729
icket-5.503
oli-5.372
ashtra-5.312
 incent-5.288
Token13
Token ID1485
Feature activation+8.28e-02

Pos logit contributions
 contrary+3.254
posts+3.023
 Uk+3.005
 Dispatch+2.834
 Kah+2.833
Neg logit contributions
thora-6.054
 incent-5.990
 distingu-5.806
emouth-5.751
ashtra-5.687
Token On
Token ID1550
Feature activation-9.12e-01

Pos logit contributions
thora+0.217
icket+0.208
oli+0.204
 incent+0.201
ashtra+0.201
Neg logit contributions
 contrary-0.113
 Uk-0.109
 Kah-0.105
 Outbreak-0.103
 Melt-0.103
Token September
Token ID2693
Feature activation-6.71e-01

Pos logit contributions
 contrary+1.052
 Uk+0.984
 Kah+0.952
 Outbreak+0.937
 Melt+0.931
Neg logit contributions
thora-2.613
icket-2.498
 incent-2.470
ashtra-2.457
oli-2.444
Token 20
Token ID1160
Feature activation-9.85e-02

Pos logit contributions
 contrary+0.823
 Uk+0.788
 Kah+0.754
 Outbreak+0.744
 Melt+0.742
Neg logit contributions
thora-1.847
icket-1.772
oli-1.743
 incent-1.721
ashtra-1.715
Token 2013
Token ID2211
Feature activation+6.82e-01

Pos logit contributions
 contrary+0.129
 Uk+0.123
 Kah+0.118
 Melt+0.116
 Outbreak+0.116
Neg logit contributions
thora-0.266
icket-0.254
 incent-0.250
oli-0.250
ashtra-0.247
Token 2009
Token ID3717
Feature activation+1.15e-03

Pos logit contributions
 contrary+1.077
 Uk+1.047
 Kah+1.010
 Melt+0.992
 Outbreak+0.989
Neg logit contributions
thora-1.915
icket-1.833
oli-1.785
 incent-1.782
ashtra-1.770
Token Germany
Token ID4486
Feature activation-6.55e-01

Pos logit contributions
thora+0.003
icket+0.003
oli+0.003
 incent+0.003
ashtra+0.003
Neg logit contributions
 contrary-0.002
 Uk-0.002
 Kah-0.001
 Outbreak-0.001
 Melt-0.001
Token 247
Token ID30179
Feature activation-3.09e-01

Pos logit contributions
 contrary+0.864
 Uk+0.843
 Kah+0.810
 Melt+0.796
 Outbreak+0.793
Neg logit contributions
thora-1.762
icket-1.693
oli-1.651
 incent-1.646
ashtra-1.637
Token34
Token ID2682
Feature activation+6.83e-02

Pos logit contributions
 contrary+0.365
 Uk+0.352
 Kah+0.338
 Outbreak+0.332
 Melt+0.331
Neg logit contributions
thora-0.870
icket-0.836
oli-0.815
ashtra-0.812
 incent-0.811
Token Posts
Token ID12043
Feature activation-2.42e+00

Pos logit contributions
thora+0.240
icket+0.232
oli+0.229
 incent+0.227
ashtra+0.227
Neg logit contributions
 contrary-0.032
 Uk-0.028
 Kah-0.025
 Outbreak-0.024
 Melt-0.024
Token #
Token ID1303
Feature activation-2.31e+00

Pos logit contributions
 contrary+3.709
 Uk+3.585
 Kah+3.499
 Outbreak+3.472
 Melt+3.431
Neg logit contributions
thora-5.820
icket-5.604
oli-5.464
ashtra-5.409
 incent-5.387
Token20
Token ID1238
Feature activation-1.83e-01

Pos logit contributions
 contrary+2.987
 Uk+2.806
posts+2.789
 Kah+2.707
 KS+2.691
Neg logit contributions
thora-6.284
 incent-6.079
 distingu-5.858
ashtra-5.824
 diarr-5.747
Token Nice
Token ID18460
Feature activation-3.74e-01

Pos logit contributions
 contrary+0.268
 Uk+0.260
 Kah+0.251
 Outbreak+0.248
 Melt+0.247
Neg logit contributions
thora-0.462
icket-0.441
oli-0.431
ashtra-0.427
 incent-0.425
Token,
Token ID11
Feature activation+3.19e-02

Pos logit contributions
 contrary+0.596
 Uk+0.580
 Kah+0.561
 Melt+0.552
 Outbreak+0.552
Neg logit contributions
thora-0.897
icket-0.849
oli-0.829
ashtra-0.827
 incent-0.822
Token happy
Token ID3772
Feature activation-4.27e-01

Pos logit contributions
thora+0.072
icket+0.068
oli+0.066
ashtra+0.066
 incent+0.065
Neg logit contributions
 contrary-0.056
 Uk-0.055
 Kah-0.053
 Melt-0.052
 Outbreak-0.052
Token to
Token ID284
Feature activation+1.62e-01

Pos logit contributions
 contrary+0.618
 Uk+0.600
 Kah+0.581
 Melt+0.570
 Outbreak+0.570
Neg logit contributions
thora-1.082
icket-1.032
oli-1.012
 incent-1.002
ashtra-1.001
Token 2012
Token ID2321
Feature activation+1.68e-01

Pos logit contributions
 contrary+1.276
 Uk+1.249
 Kah+1.206
 Melt+1.182
 Outbreak+1.177
Neg logit contributions
thora-2.328
icket-2.232
oli-2.174
 incent-2.171
ashtra-2.146
Token United
Token ID1578
Feature activation-4.13e-01

Pos logit contributions
thora+0.436
icket+0.417
oli+0.410
 incent+0.404
ashtra+0.403
Neg logit contributions
 contrary-0.230
 Uk-0.220
 Kah-0.212
 Outbreak-0.210
 Melt-0.209
Token States
Token ID1829
Feature activation+1.62e-01

Pos logit contributions
 contrary+0.566
 Uk+0.547
 Kah+0.526
 Melt+0.518
 Outbreak+0.518
Neg logit contributions
thora-1.079
icket-1.035
oli-1.014
 incent-1.003
ashtra-0.998
Token 3
Token ID513
Feature activation-6.01e-01

Pos logit contributions
thora+0.420
icket+0.402
oli+0.395
 incent+0.390
ashtra+0.388
Neg logit contributions
 contrary-0.223
 Uk-0.214
 Kah-0.205
 Outbreak-0.203
 Melt-0.203
Token240
Token ID16102
Feature activation+1.58e-03

Pos logit contributions
 contrary+0.741
 Uk+0.717
 Kah+0.688
posts+0.677
 Melt+0.676
Neg logit contributions
thora-1.651
icket-1.588
ashtra-1.552
oli-1.544
 incent-1.540
Token Posts
Token ID12043
Feature activation-2.30e+00

Pos logit contributions
thora+0.006
icket+0.005
oli+0.005
 incent+0.005
ashtra+0.005
Neg logit contributions
 contrary-0.001
 Uk-0.001
 Kah-0.001
 Outbreak-0.001
 Melt-0.001
Token #
Token ID1303
Feature activation-2.21e+00

Pos logit contributions
 contrary+3.561
 Uk+3.446
 Kah+3.390
 Outbreak+3.378
 Melt+3.328
Neg logit contributions
thora-5.472
icket-5.253
oli-5.101
ashtra-5.075
 incent-5.020
Token10
Token ID940
Feature activation-3.36e-01

Pos logit contributions
 contrary+3.018
 Uk+2.908
posts+2.872
 Kah+2.823
 Outbreak+2.781
Neg logit contributions
thora-5.694
 incent-5.480
icket-5.389
emouth-5.343
ashtra-5.318
Token Happy
Token ID14628
Feature activation-5.15e-01

Pos logit contributions
 contrary+0.466
 Uk+0.452
 Kah+0.435
 Outbreak+0.429
 Melt+0.428
Neg logit contributions
thora-0.874
icket-0.837
oli-0.816
ashtra-0.812
 incent-0.807
Token it
Token ID340
Feature activation-1.16e-01

Pos logit contributions
 contrary+0.752
 Uk+0.731
 Kah+0.710
 Melt+0.697
 Outbreak+0.697
Neg logit contributions
thora-1.303
icket-1.241
oli-1.217
ashtra-1.204
 incent-1.203
Token worked
Token ID3111
Feature activation+1.10e-02

Pos logit contributions
 contrary+0.185
 Uk+0.181
 Kah+0.176
 Melt+0.173
 Outbreak+0.172
Neg logit contributions
thora-0.277
icket-0.262
oli-0.256
 incent-0.255
ashtra-0.254
Token United
Token ID1578
Feature activation-3.76e-01

Pos logit contributions
 contrary+0.081
 Uk+0.077
 Kah+0.074
 Outbreak+0.074
 Melt+0.073
Neg logit contributions
thora-0.151
icket-0.144
oli-0.142
 incent-0.140
ashtra-0.140
Token States
Token ID1829
Feature activation+1.34e-02

Pos logit contributions
 contrary+0.556
 Uk+0.530
 Kah+0.509
 Outbreak+0.509
posts+0.506
Neg logit contributions
thora-0.943
icket-0.893
oli-0.883
 incent-0.870
iral-0.868
Token 21
Token ID2310
Feature activation-5.54e-01

Pos logit contributions
thora+0.035
icket+0.034
oli+0.033
 incent+0.033
ashtra+0.033
Neg logit contributions
 contrary-0.018
 Uk-0.018
 Kah-0.017
 Outbreak-0.017
 Melt-0.017
Token50
Token ID1120
Feature activation+1.76e-02

Pos logit contributions
 contrary+0.773
 Uk+0.756
 Kah+0.721
 Outbreak+0.719
posts+0.713
Neg logit contributions
thora-1.444
icket-1.379
 incent-1.349
oli-1.347
ashtra-1.344
Token Posts
Token ID12043
Feature activation-2.51e+00

Pos logit contributions
thora+0.061
icket+0.059
oli+0.059
 incent+0.058
ashtra+0.058
Neg logit contributions
 contrary-0.008
 Uk-0.008
 Kah-0.007
 Outbreak-0.006
 Melt-0.006
Token #
Token ID1303
Feature activation-2.27e+00

Pos logit contributions
 contrary+3.897
 Uk+3.760
 Kah+3.669
 Outbreak+3.641
 Melt+3.602
Neg logit contributions
thora-6.005
icket-5.776
oli-5.636
ashtra-5.572
 incent-5.551
Token16
Token ID1433
Feature activation-2.46e-01

Pos logit contributions
 contrary+3.631
posts+3.326
 Uk+3.281
 Kah+3.242
 Outbreak+3.164
Neg logit contributions
thora-5.628
 incent-5.481
 distingu-5.221
icket-5.195
emouth-5.158
Token Glad
Token ID28442
Feature activation+2.81e-01

Pos logit contributions
 contrary+0.327
 Uk+0.316
 Kah+0.303
 Outbreak+0.300
 Melt+0.298
Neg logit contributions
thora-0.655
icket-0.628
oli-0.615
ashtra-0.609
 incent-0.606
Token to
Token ID284
Feature activation+1.08e-01

Pos logit contributions
thora+0.670
icket+0.638
oli+0.624
 incent+0.618
ashtra+0.616
Neg logit contributions
 contrary-0.450
 Uk-0.437
 Kah-0.422
 Melt-0.417
 Outbreak-0.416
Token hear
Token ID3285
Feature activation+8.48e-01

Pos logit contributions
thora+0.203
icket+0.188
oli+0.185
ashtra+0.182
emouth+0.181
Neg logit contributions
 contrary-0.232
 Uk-0.226
 Kah-0.220
 Melt-0.219
 Outbreak-0.218
Token everything
Token ID2279
Feature activation-1.99e-01

Pos logit contributions
thora+1.926
icket+1.832
oli+1.796
 incent+1.773
ashtra+1.757
Neg logit contributions
 contrary-1.439
 Uk-1.392
 Kah-1.349
 Outbreak-1.338
 Melt-1.336
Token United
Token ID1578
Feature activation-4.19e-01

Pos logit contributions
thora+0.519
icket+0.496
oli+0.488
 incent+0.481
ashtra+0.480
Neg logit contributions
 contrary-0.280
 Uk-0.268
 Kah-0.257
 Outbreak-0.256
 Melt-0.254
Token States
Token ID1829
Feature activation+1.16e-01

Pos logit contributions
 contrary+0.612
 Uk+0.582
 Outbreak+0.560
 Kah+0.558
posts+0.556
Neg logit contributions
thora-1.056
icket-1.001
oli-0.990
 incent-0.975
iral-0.974
Token 5
Token ID642
Feature activation-4.16e-01

Pos logit contributions
thora+0.304
icket+0.291
oli+0.285
 incent+0.282
ashtra+0.281
Neg logit contributions
 contrary-0.160
 Uk-0.154
 Kah-0.148
 Outbreak-0.146
 Melt-0.146
Token667
Token ID28933
Feature activation+2.60e-01

Pos logit contributions
 contrary+0.771
 Uk+0.752
 Kah+0.731
 Outbreak+0.726
 Melt+0.724
Neg logit contributions
thora-0.891
icket-0.847
oli-0.819
ashtra-0.816
 incent-0.814
Token Posts
Token ID12043
Feature activation-2.65e+00

Pos logit contributions
thora+0.913
icket+0.884
oli+0.872
 incent+0.865
ashtra+0.862
Neg logit contributions
 contrary-0.122
 Uk-0.109
 Kah-0.095
 Outbreak-0.091
 Melt-0.091
Token #
Token ID1303
Feature activation-2.25e+00

Pos logit contributions
 contrary+4.082
 Uk+3.944
 Kah+3.874
 Outbreak+3.847
 Melt+3.801
Neg logit contributions
thora-6.290
icket-6.062
oli-5.916
ashtra-5.854
 incent-5.819
Token12
Token ID1065
Feature activation+1.39e-01

Pos logit contributions
 contrary+2.711
posts+2.620
 Uk+2.467
 Kah+2.423
+2.325
Neg logit contributions
thora-6.443
 incent-6.319
 distingu-6.092
ashtra-6.069
emouth-6.057
Token I
Token ID314
Feature activation-4.64e-01

Pos logit contributions
thora+0.365
icket+0.349
oli+0.343
 incent+0.338
ashtra+0.338
Neg logit contributions
 contrary-0.188
 Uk-0.182
 Kah-0.174
 Outbreak-0.172
 Melt-0.172
Token'm
Token ID1101
Feature activation+9.22e-02

Pos logit contributions
 contrary+0.796
 Uk+0.769
 Kah+0.752
 Melt+0.741
 KS+0.732
Neg logit contributions
thora-1.066
icket-1.002
oli-0.980
ashtra-0.978
emouth-0.978
Token glad
Token ID9675
Feature activation-1.37e-01

Pos logit contributions
thora+0.193
icket+0.182
oli+0.177
 incent+0.175
ashtra+0.174
Neg logit contributions
 contrary-0.176
 Uk-0.171
 Kah-0.166
 Outbreak-0.165
 Melt-0.164
Token there
Token ID612
Feature activation+1.32e-02

Pos logit contributions
 contrary+0.234
 Uk+0.228
 Kah+0.220
 Melt+0.218
 Outbreak+0.217
Neg logit contributions
thora-0.313
icket-0.297
oli-0.290
ashtra-0.287
 incent-0.287
Token United
Token ID1578
Feature activation-3.73e-01

Pos logit contributions
thora+0.067
icket+0.064
oli+0.063
 incent+0.062
ashtra+0.062
Neg logit contributions
 contrary-0.035
 Uk-0.033
 Kah-0.032
 Outbreak-0.032
 Melt-0.032
Token States
Token ID1829
Feature activation+1.66e-01

Pos logit contributions
 contrary+0.560
 Uk+0.534
 Outbreak+0.515
 Kah+0.512
posts+0.510
Neg logit contributions
thora-0.924
icket-0.873
oli-0.866
iral-0.851
 incent-0.850
Token 6
Token ID718
Feature activation-3.96e-01

Pos logit contributions
thora+0.432
icket+0.414
oli+0.406
 incent+0.401
ashtra+0.400
Neg logit contributions
 contrary-0.226
 Uk-0.218
 Kah-0.209
 Outbreak-0.207
 Melt-0.206
Token12
Token ID1065
Feature activation-1.07e-01

Pos logit contributions
 contrary+0.738
 Uk+0.721
 Kah+0.700
 Outbreak+0.698
 Melt+0.695
Neg logit contributions
thora-0.846
icket-0.803
ashtra-0.776
oli-0.774
 incent-0.772
Token Posts
Token ID12043
Feature activation-2.54e+00

Pos logit contributions
 contrary+0.060
 Uk+0.055
 Kah+0.050
 Outbreak+0.048
 Melt+0.048
Neg logit contributions
thora-0.367
icket-0.355
oli-0.350
 incent-0.348
ashtra-0.347
Token #
Token ID1303
Feature activation-2.25e+00

Pos logit contributions
 contrary+3.832
 Uk+3.706
 Kah+3.607
 Outbreak+3.584
 Melt+3.546
Neg logit contributions
thora-6.143
icket-5.914
oli-5.776
ashtra-5.712
 incent-5.691
Token17
Token ID1558
Feature activation-2.10e-01

Pos logit contributions
 contrary+2.533
 Uk+2.287
posts+2.197
 Outbreak+2.153
 Kah+2.131
Neg logit contributions
thora-6.529
 incent-6.367
ashtra-6.219
 distingu-6.140
emouth-6.114
Token Good
Token ID4599
Feature activation+8.59e-02

Pos logit contributions
 contrary+0.290
 Uk+0.281
 Kah+0.270
 Outbreak+0.267
 Melt+0.266
Neg logit contributions
thora-0.549
icket-0.526
oli-0.514
ashtra-0.509
 incent-0.507
Token to
Token ID284
Feature activation+2.67e-02

Pos logit contributions
thora+0.205
icket+0.195
oli+0.191
 incent+0.189
ashtra+0.188
Neg logit contributions
 contrary-0.138
 Uk-0.133
 Kah-0.129
 Outbreak-0.127
 Melt-0.127
Token hear
Token ID3285
Feature activation+8.84e-01

Pos logit contributions
thora+0.047
icket+0.043
oli+0.042
ashtra+0.042
emouth+0.041
Neg logit contributions
 contrary-0.060
 Uk-0.059
 Kah-0.058
 Melt-0.057
 Outbreak-0.057
Token.
Token ID13
Feature activation+2.10e-01

Pos logit contributions
thora+2.000
icket+1.901
oli+1.866
 incent+1.841
iral+1.823
Neg logit contributions
 contrary-1.504
 Uk-1.453
 Kah-1.407
 Outbreak-1.397
 Melt-1.395
TokenAD
Token ID2885
Feature activation-4.54e-01

Pos logit contributions
 contrary+0.101
 Uk+0.098
 Kah+0.094
 Outbreak+0.093
 Melt+0.093
Neg logit contributions
thora-0.169
icket-0.162
oli-0.159
 incent-0.157
ashtra-0.156
TokenT
Token ID51
Feature activation-3.04e-01

Pos logit contributions
 contrary+0.791
 Uk+0.767
 Kah+0.747
 Outbreak+0.736
 Melt+0.735
Neg logit contributions
thora-1.021
icket-0.967
oli-0.948
 incent-0.942
ashtra-0.934
Token 235
Token ID28878
Feature activation-7.93e-02

Pos logit contributions
 contrary+0.438
 Uk+0.426
 Kah+0.409
 Outbreak+0.404
 Melt+0.402
Neg logit contributions
thora-0.776
icket-0.737
oli-0.725
 incent-0.719
ashtra-0.716
Token21
Token ID2481
Feature activation+1.88e-02

Pos logit contributions
 contrary+0.129
 Uk+0.126
 Kah+0.121
 Outbreak+0.120
 Melt+0.120
Neg logit contributions
thora-0.187
icket-0.179
oli-0.174
 incent-0.172
ashtra-0.172
Token Posts
Token ID12043
Feature activation-2.37e+00

Pos logit contributions
thora+0.066
icket+0.064
oli+0.063
 incent+0.063
ashtra+0.062
Neg logit contributions
 contrary-0.009
 Uk-0.008
 Kah-0.007
 Outbreak-0.007
 Melt-0.007
Token #
Token ID1303
Feature activation-2.24e+00

Pos logit contributions
 contrary+3.604
 Uk+3.481
 Kah+3.392
 Outbreak+3.368
 Melt+3.330
Neg logit contributions
thora-5.729
icket-5.510
oli-5.381
ashtra-5.318
 incent-5.300
Token18
Token ID1507
Feature activation-3.65e-02

Pos logit contributions
 contrary+3.536
posts+3.350
 Uk+3.303
 Kah+3.206
 Outbreak+3.202
Neg logit contributions
thora-5.459
 incent-5.334
 distingu-5.120
ashtra-5.109
emouth-5.002
Token Good
Token ID4599
Feature activation+4.00e-02

Pos logit contributions
 contrary+0.051
 Uk+0.049
 Kah+0.047
 Outbreak+0.047
 Melt+0.047
Neg logit contributions
thora-0.095
icket-0.091
oli-0.089
ashtra-0.088
 incent-0.088
Token stuff
Token ID3404
Feature activation-1.35e-01

Pos logit contributions
thora+0.096
icket+0.092
oli+0.090
 incent+0.089
ashtra+0.088
Neg logit contributions
 contrary-0.063
 Uk-0.061
 Kah-0.059
 Outbreak-0.059
 Melt-0.059
Token!
Token ID0
Feature activation-3.74e-01

Pos logit contributions
 contrary+0.196
 Uk+0.190
 Kah+0.184
 Melt+0.181
 Outbreak+0.181
Neg logit contributions
thora-0.343
icket-0.326
oli-0.319
ashtra-0.317
 incent-0.317
Token Great
Token ID3878
Feature activation-9.60e-02

Pos logit contributions
 contrary+0.557
 Uk+0.544
 Kah+0.527
 Outbreak+0.519
 Melt+0.517
Neg logit contributions
thora-0.940
icket-0.894
oli-0.874
ashtra-0.867
 incent-0.862
Token conflict
Token ID5358
Feature activation+1.76e-01

Pos logit contributions
thora+0.455
icket+0.434
oli+0.423
ashtra+0.418
iral+0.416
Neg logit contributions
 contrary-0.320
 Uk-0.304
 Kah-0.295
 Melt-0.291
 Outbreak-0.289
Token,
Token ID11
Feature activation+4.21e-01

Pos logit contributions
thora+0.429
icket+0.409
oli+0.398
 incent+0.394
ashtra+0.394
Neg logit contributions
 contrary-0.275
 Uk-0.264
 Kah-0.256
 Outbreak-0.253
 Melt-0.253
Token refugees
Token ID8015
Feature activation+1.16e-01

Pos logit contributions
thora+0.946
icket+0.899
oli+0.880
ashtra+0.866
 incent+0.864
Neg logit contributions
 contrary-0.739
 Uk-0.713
 Kah-0.692
 Melt-0.685
 Outbreak-0.684
Token,
Token ID11
Feature activation+3.59e-01

Pos logit contributions
thora+0.301
icket+0.288
oli+0.284
 incent+0.280
ashtra+0.278
Neg logit contributions
 contrary-0.158
 Uk-0.153
 Kah-0.147
 Outbreak-0.145
 Melt-0.145
Token displacement
Token ID29358
Feature activation+9.25e-01

Pos logit contributions
thora+0.807
icket+0.767
oli+0.751
ashtra+0.740
 incent+0.738
Neg logit contributions
 contrary-0.629
 Uk-0.608
 Kah-0.589
 Melt-0.583
 Outbreak-0.582
Token
Token ID960
Feature activation+1.27e+00

Pos logit contributions
thora+2.363
icket+2.269
oli+2.244
 incent+2.207
iral+2.193
Neg logit contributions
 contrary-1.273
 Uk-1.240
 Kah-1.190
 Melt-1.180
 Outbreak-1.178
Tokenall
Token ID439
Feature activation+2.93e-01

Pos logit contributions
thora+2.780
icket+2.639
oli+2.586
 incent+2.544
ashtra+2.529
Neg logit contributions
 contrary-2.275
 Uk-2.212
 Kah-2.145
 Outbreak-2.128
 Melt-2.123
Token that
Token ID326
Feature activation+2.64e-01

Pos logit contributions
thora+0.734
icket+0.702
oli+0.687
 incent+0.678
ashtra+0.678
Neg logit contributions
 contrary-0.438
 Uk-0.418
 Kah-0.404
 Melt-0.398
 Outbreak-0.397
Token stuff
Token ID3404
Feature activation+2.58e-01

Pos logit contributions
thora+0.611
icket+0.581
oli+0.569
ashtra+0.562
 incent+0.560
Neg logit contributions
 contrary-0.445
 Uk-0.429
 Kah-0.416
 Melt-0.411
 Outbreak-0.410
Token has
Token ID468
Feature activation+4.21e-01

Pos logit contributions
thora+0.624
icket+0.593
oli+0.580
ashtra+0.574
 incent+0.573
Neg logit contributions
 contrary-0.410
 Uk-0.394
 Kah-0.380
 Melt-0.375
 Outbreak-0.373
Token an
Token ID281
Feature activation+4.30e-01

Pos logit contributions
thora+0.935
icket+0.888
oli+0.867
 incent+0.855
ashtra+0.854
Neg logit contributions
 contrary-0.746
 Uk-0.721
 Kah-0.700
 Melt-0.691
 Outbreak-0.689
TokenTouch
Token ID35211
Feature activation-8.33e-01

Pos logit contributions
 contrary+0.050
 Uk+0.049
 Kah+0.047
posts+0.047
 Melt+0.047
Neg logit contributions
thora-0.064
icket-0.061
 incent-0.059
oli-0.059
ashtra-0.059
Token (
Token ID357
Feature activation-1.15e-01

Pos logit contributions
 contrary+1.132
 Uk+1.096
 Kah+1.056
 Melt+1.051
 Outbreak+1.045
Neg logit contributions
thora-2.167
icket-2.056
oli-2.044
 incent-2.026
ashtra-2.024
TokenUse
Token ID11041
Feature activation+6.60e-01

Pos logit contributions
 contrary+0.204
 Uk+0.198
 Kah+0.192
 Melt+0.191
 Outbreak+0.190
Neg logit contributions
thora-0.255
icket-0.240
oli-0.236
 incent-0.233
ashtra-0.232
Token with
Token ID351
Feature activation-2.46e-01

Pos logit contributions
thora+1.545
icket+1.471
oli+1.442
 incent+1.423
ashtra+1.416
Neg logit contributions
 contrary-1.082
 Uk-1.047
 Kah-1.013
 Outbreak-1.003
 Melt-1.002
Token extreme
Token ID3257
Feature activation-2.56e-01

Pos logit contributions
 contrary+0.424
 Uk+0.410
 Kah+0.399
 Melt+0.394
 Outbreak+0.393
Neg logit contributions
thora-0.559
icket-0.532
oli-0.522
ashtra-0.513
 incent-0.511
Token caution
Token ID13041
Feature activation+1.03e+00

Pos logit contributions
 contrary+0.455
 Uk+0.438
 Kah+0.426
 Melt+0.420
 Outbreak+0.420
Neg logit contributions
thora-0.574
icket-0.546
oli-0.530
ashtra-0.523
 incent-0.522
Token!)
Token ID8133
Feature activation-8.61e-01

Pos logit contributions
thora+2.424
icket+2.316
oli+2.275
 incent+2.236
ashtra+2.235
Neg logit contributions
 contrary-1.631
 Uk-1.605
 Kah-1.546
 Outbreak-1.540
 Melt-1.526
Token
Token ID198
Feature activation-3.30e-01

Pos logit contributions
 contrary+1.290
 Uk+1.240
 Kah+1.209
 Melt+1.195
 Outbreak+1.185
Neg logit contributions
thora-2.145
icket-2.040
oli-2.001
 incent-1.985
iral-1.977
Token
Token ID198
Feature activation-3.16e-01

Pos logit contributions
 contrary+0.459
 Uk+0.439
 Kah+0.423
 Melt+0.418
posts+0.414
Neg logit contributions
thora-0.868
icket-0.827
 incent-0.825
oli-0.806
ashtra-0.805
TokenTouch
Token ID35211
Feature activation-7.78e-01

Pos logit contributions
 contrary+0.544
 Uk+0.527
 Kah+0.513
posts+0.511
 Melt+0.508
Neg logit contributions
thora-0.713
icket-0.671
 incent-0.662
oli-0.656
ashtra-0.651
Token can
Token ID460
Feature activation-1.35e-01

Pos logit contributions
 contrary+1.217
 Uk+1.172
 Kah+1.135
 Melt+1.129
 Outbreak+1.123
Neg logit contributions
thora-1.885
icket-1.782
oli-1.763
ashtra-1.740
 incent-1.738
Token the
Token ID262
Feature activation+1.33e-01

Pos logit contributions
thora+0.681
icket+0.648
oli+0.636
 incent+0.628
ashtra+0.624
Neg logit contributions
 contrary-0.471
 Uk-0.457
 Kah-0.442
 Outbreak-0.437
 Melt-0.437
Token Elven
Token ID40748
Feature activation-4.94e-02

Pos logit contributions
thora+0.303
icket+0.288
oli+0.282
 incent+0.279
ashtra+0.277
Neg logit contributions
 contrary-0.228
 Uk-0.221
 Kah-0.214
 Melt-0.212
 Outbreak-0.211
Token forest
Token ID8222
Feature activation+8.12e-01

Pos logit contributions
 contrary+0.079
 Uk+0.077
 Kah+0.074
 Melt+0.073
 Outbreak+0.073
Neg logit contributions
thora-0.118
icket-0.112
oli-0.110
 incent-0.108
ashtra-0.108
Token of
Token ID286
Feature activation+2.17e-01

Pos logit contributions
thora+1.993
icket+1.906
oli+1.878
ashtra+1.847
 incent+1.834
Neg logit contributions
 contrary-1.206
 Uk-1.171
 Outbreak-1.136
 Kah-1.129
 Melt-1.119
Token L
Token ID406
Feature activation+5.94e-01

Pos logit contributions
thora+0.526
icket+0.502
oli+0.493
 incent+0.486
ashtra+0.484
Neg logit contributions
 contrary-0.338
 Uk-0.327
 Kah-0.316
 Outbreak-0.313
 Melt-0.312
Tokenoth
Token ID849
Feature activation+1.11e+00

Pos logit contributions
thora+1.116
icket+1.049
oli+1.020
 incent+1.013
ashtra+1.000
Neg logit contributions
 contrary-1.244
 Uk-1.217
 Kah-1.186
 Melt-1.174
 Outbreak-1.171
Tokenl
Token ID75
Feature activation+1.49e-01

Pos logit contributions
thora+2.234
icket+2.107
oli+2.056
 incent+2.030
ashtra+2.018
Neg logit contributions
 contrary-2.183
 Uk-2.127
 Kah-2.071
 Outbreak-2.049
 Melt-2.049
Tokenó
Token ID10205
Feature activation+6.88e-01

Pos logit contributions
thora+0.256
 incent+0.244
icket+0.237
ashtra+0.229
 diarr+0.228
Neg logit contributions
 contrary-0.331
 Uk-0.325
 Kah-0.321
 Milk-0.313
posts-0.312
Tokenrien
Token ID15355
Feature activation+8.79e-01

Pos logit contributions
thora+0.966
icket+0.881
oli+0.852
 incent+0.842
ashtra+0.829
Neg logit contributions
 contrary-1.778
 Uk-1.754
 Kah-1.714
 Melt-1.698
 Outbreak-1.692
Token,
Token ID11
Feature activation+7.02e-02

Pos logit contributions
thora+2.155
icket+2.067
oli+2.030
emouth+1.986
ashtra+1.985
Neg logit contributions
 contrary-1.314
 Uk-1.267
 Outbreak-1.233
 Kah-1.223
 Melt-1.215
Token where
Token ID810
Feature activation+3.89e-01

Pos logit contributions
thora+0.169
icket+0.161
oli+0.158
 incent+0.156
ashtra+0.155
Neg logit contributions
 contrary-0.111
 Uk-0.108
 Kah-0.104
 Melt-0.103
 Outbreak-0.102
Token I
Token ID314
Feature activation-2.36e-02

Pos logit contributions
thora+0.178
icket+0.168
oli+0.163
ashtra+0.162
iral+0.161
Neg logit contributions
 contrary-0.140
 Uk-0.134
 Kah-0.131
 Melt-0.130
 KS-0.127
Token wanted
Token ID2227
Feature activation+3.07e-01

Pos logit contributions
 contrary+0.042
 Uk+0.040
 Kah+0.040
 Melt+0.039
 KS+0.039
Neg logit contributions
thora-0.053
icket-0.050
ashtra-0.048
oli-0.048
emouth-0.048
Token the
Token ID262
Feature activation+8.30e-02

Pos logit contributions
thora+0.755
icket+0.721
oli+0.708
 incent+0.698
ashtra+0.695
Neg logit contributions
 contrary-0.466
 Uk-0.450
 Kah-0.434
 Outbreak-0.430
 Melt-0.429
Token reader
Token ID9173
Feature activation+1.12e+00

Pos logit contributions
thora+0.168
icket+0.158
oli+0.154
ashtra+0.153
 incent+0.152
Neg logit contributions
 contrary-0.166
 Uk-0.161
 Kah-0.157
 Melt-0.155
 KS-0.155
Token to
Token ID284
Feature activation+2.12e-01

Pos logit contributions
thora+2.962
icket+2.844
oli+2.810
 incent+2.759
ashtra+2.747
Neg logit contributions
 contrary-1.464
 Uk-1.416
 Outbreak-1.356
 Kah-1.350
 Melt-1.333
Token feel
Token ID1254
Feature activation+1.06e+00

Pos logit contributions
thora+0.461
icket+0.435
oli+0.422
ashtra+0.422
emouth+0.420
Neg logit contributions
 contrary-0.390
 Uk-0.375
 Kah-0.368
 Melt-0.365
 KS-0.361
Token like
Token ID588
Feature activation+3.00e-01

Pos logit contributions
thora+2.381
icket+2.261
oli+2.203
ashtra+2.175
 incent+2.170
Neg logit contributions
 contrary-1.869
 Uk-1.805
 Kah-1.755
 Melt-1.734
 Outbreak-1.720
Token they
Token ID484
Feature activation-6.15e-02

Pos logit contributions
thora+0.717
icket+0.682
oli+0.667
 incent+0.661
ashtra+0.660
Neg logit contributions
 contrary-0.481
 Uk-0.464
 Kah-0.452
 Melt-0.446
 Outbreak-0.442
Token were
Token ID547
Feature activation-5.32e-02

Pos logit contributions
 contrary+0.101
 Uk+0.097
 Kah+0.095
 Melt+0.094
 KS+0.093
Neg logit contributions
thora-0.144
icket-0.136
ashtra-0.133
oli-0.133
 incent-0.133
Token reading
Token ID3555
Feature activation+3.23e-01

Pos logit contributions
 contrary+0.092
 Uk+0.088
 Kah+0.087
 Melt+0.086
 KS+0.084
Neg logit contributions
thora-0.123
icket-0.115
ashtra-0.113
oli-0.112
iral-0.112
Token quickly
Token ID2952
Feature activation-3.52e-01

Pos logit contributions
thora+0.770
icket+0.733
oli+0.718
 incent+0.710
ashtra+0.707
Neg logit contributions
 contrary-0.521
 Uk-0.502
 Kah-0.487
 Melt-0.482
 Outbreak-0.481
Token agreed
Token ID4987
Feature activation-3.05e-01

Pos logit contributions
 contrary+0.218
 Uk+0.210
 Kah+0.203
 Melt+0.201
 Outbreak+0.199
Neg logit contributions
thora-0.257
icket-0.243
oli-0.237
ashtra-0.233
 incent-0.233
Token.
Token ID13
Feature activation+5.77e-02

Pos logit contributions
 contrary+0.447
 Uk+0.431
 Kah+0.415
 Outbreak+0.410
 Melt+0.409
Neg logit contributions
thora-0.769
icket-0.734
oli-0.721
 incent-0.712
ashtra-0.709
Token Similarly
Token ID15298
Feature activation-4.67e-01

Pos logit contributions
thora+0.156
icket+0.149
oli+0.146
 incent+0.145
ashtra+0.145
Neg logit contributions
 contrary-0.075
 Uk-0.072
 Kah-0.070
 Melt-0.069
 Outbreak-0.068
Token,
Token ID11
Feature activation-3.13e-03

Pos logit contributions
 contrary+0.612
 Uk+0.588
 Kah+0.564
 Outbreak+0.556
 Melt+0.555
Neg logit contributions
thora-1.248
icket-1.195
oli-1.174
 incent-1.161
ashtra-1.156
Token the
Token ID262
Feature activation-5.17e-04

Pos logit contributions
 contrary+0.005
 Uk+0.005
 Kah+0.005
 Melt+0.005
 Outbreak+0.005
Neg logit contributions
thora-0.007
icket-0.007
oli-0.007
 incent-0.007
ashtra-0.007
Token pian
Token ID43923
Feature activation+1.04e+00

Pos logit contributions
 contrary+0.001
 Uk+0.001
 Kah+0.001
 Melt+0.001
 Outbreak+0.001
Neg logit contributions
thora-0.001
icket-0.001
oli-0.001
ashtra-0.001
 incent-0.001
Tokenist
Token ID396
Feature activation+9.49e-01

Pos logit contributions
thora+2.545
icket+2.423
oli+2.367
 incent+2.361
ashtra+2.357
Neg logit contributions
 contrary-1.607
 Uk-1.559
 Kah-1.492
 Outbreak-1.478
 Melt-1.477
Token's
Token ID338
Feature activation+1.30e-01

Pos logit contributions
thora+2.155
icket+2.053
oli+2.017
 incent+1.982
ashtra+1.972
Neg logit contributions
 contrary-1.603
 Uk-1.557
 Kah-1.507
 Outbreak-1.499
 Melt-1.492
Token disagreement
Token ID25800
Feature activation-5.54e-01

Pos logit contributions
thora+0.289
icket+0.273
oli+0.267
ashtra+0.263
 incent+0.262
Neg logit contributions
 contrary-0.235
 Uk-0.226
 Kah-0.219
 Melt-0.218
 Outbreak-0.216
Token with
Token ID351
Feature activation-6.85e-01

Pos logit contributions
 contrary+0.773
 Uk+0.738
 Kah+0.711
 Melt+0.702
 Outbreak+0.700
Neg logit contributions
thora-1.442
icket-1.377
oli-1.349
 incent-1.335
ashtra-1.335
Token
Token ID564
Feature activation-1.18e-01

Pos logit contributions
 contrary+1.204
 Uk+1.155
 Kah+1.126
 Melt+1.118
 Outbreak+1.111
Neg logit contributions
thora-1.527
icket-1.453
oli-1.424
ashtra-1.407
iral-1.397
Token moreover
Token ID33695
Feature activation-1.19e-01

Pos logit contributions
 contrary+0.066
 Uk+0.062
 Kah+0.060
 Melt+0.059
 Outbreak+0.058
Neg logit contributions
thora-0.127
icket-0.121
oli-0.118
ashtra-0.118
 incent-0.117
Token,
Token ID11
Feature activation+1.92e-01

Pos logit contributions
 contrary+0.145
 Uk+0.135
 Kah+0.130
 Melt+0.128
 Outbreak+0.127
Neg logit contributions
thora-0.334
icket-0.319
 incent-0.314
oli-0.312
ashtra-0.310
Token were
Token ID547
Feature activation-6.32e-03

Pos logit contributions
thora+0.435
icket+0.412
oli+0.402
 incent+0.397
ashtra+0.396
Neg logit contributions
 contrary-0.335
 Uk-0.321
 Kah-0.312
 Melt-0.307
 Outbreak-0.307
Token sk
Token ID1341
Feature activation-1.80e-01

Pos logit contributions
 contrary+0.012
 Uk+0.012
 Kah+0.012
 Melt+0.012
posts+0.011
Neg logit contributions
thora-0.013
icket-0.012
oli-0.012
 incent-0.012
iral-0.012
Tokenitt
Token ID715
Feature activation+2.31e-01

Pos logit contributions
 contrary+0.323
 Uk+0.314
 Kah+0.305
 Melt+0.301
 Outbreak+0.301
Neg logit contributions
thora-0.392
icket-0.367
 incent-0.363
oli-0.360
ashtra-0.358
Tokenish
Token ID680
Feature activation+5.89e-01

Pos logit contributions
thora+0.426
icket+0.396
oli+0.387
 incent+0.383
ashtra+0.381
Neg logit contributions
 contrary-0.494
 Uk-0.482
 Kah-0.471
 Outbreak-0.467
 Melt-0.466
Token of
Token ID286
Feature activation+4.96e-02

Pos logit contributions
thora+1.356
icket+1.299
oli+1.281
 incent+1.254
ashtra+1.245
Neg logit contributions
 contrary-0.966
 Uk-0.942
 Outbreak-0.908
 Kah-0.908
 Melt-0.899
Token formally
Token ID15832
Feature activation+2.97e-01

Pos logit contributions
thora+0.105
icket+0.098
oli+0.096
ashtra+0.095
 incent+0.095
Neg logit contributions
 contrary-0.094
 Uk-0.091
 Kah-0.089
 Outbreak-0.088
 Melt-0.088
Token align
Token ID10548
Feature activation+2.61e-02

Pos logit contributions
thora+0.675
icket+0.641
oli+0.625
ashtra+0.617
 incent+0.615
Neg logit contributions
 contrary-0.513
 Uk-0.496
 Kah-0.481
 Outbreak-0.474
 Melt-0.473
Tokening
Token ID278
Feature activation+1.36e-01

Pos logit contributions
thora+0.082
icket+0.079
oli+0.078
 incent+0.077
ashtra+0.077
Neg logit contributions
 contrary-0.022
 Uk-0.021
 Kah-0.020
 Outbreak-0.019
 Melt-0.019
Token themselves
Token ID2405
Feature activation+6.93e-01

Pos logit contributions
thora+0.278
icket+0.262
oli+0.255
 incent+0.252
ashtra+0.250
Neg logit contributions
 contrary-0.267
 Uk-0.259
 Kah-0.253
 Melt-0.250
 Outbreak-0.250
Token and
Token ID290
Feature activation-9.93e-03

Pos logit contributions
 contrary+0.236
 Uk+0.226
 Kah+0.217
 Melt+0.214
 Outbreak+0.213
Neg logit contributions
thora-0.440
icket-0.420
oli-0.413
ashtra-0.408
 incent-0.408
Token making
Token ID1642
Feature activation-1.13e-01

Pos logit contributions
 contrary+0.018
 Uk+0.017
 Kah+0.017
 Melt+0.017
 Outbreak+0.016
Neg logit contributions
thora-0.022
icket-0.021
oli-0.020
ashtra-0.020
 incent-0.020
Token a
Token ID257
Feature activation-2.87e-02

Pos logit contributions
 contrary+0.198
 Uk+0.191
 Kah+0.186
 Melt+0.183
 Outbreak+0.182
Neg logit contributions
thora-0.254
icket-0.241
oli-0.236
ashtra-0.234
 incent-0.233
Token couple
Token ID3155
Feature activation-3.91e-01

Pos logit contributions
 contrary+0.056
 Uk+0.054
 Kah+0.053
 Melt+0.052
 Outbreak+0.052
Neg logit contributions
thora-0.059
icket-0.056
oli-0.054
ashtra-0.054
 incent-0.054
Token of
Token ID286
Feature activation+1.26e-01

Pos logit contributions
 contrary+0.569
 Uk+0.549
 Kah+0.527
 Melt+0.520
 Outbreak+0.520
Neg logit contributions
thora-0.991
icket-0.948
oli-0.927
 incent-0.919
ashtra-0.918
Token turns
Token ID4962
Feature activation+5.71e-02

Pos logit contributions
thora+0.257
icket+0.242
oli+0.235
ashtra+0.234
 incent+0.233
Neg logit contributions
 contrary-0.248
 Uk-0.241
 Kah-0.234
 Melt-0.231
 Outbreak-0.230
Token about
Token ID546
Feature activation+5.57e-01

Pos logit contributions
thora+0.146
icket+0.139
oli+0.137
ashtra+0.135
 incent+0.135
Neg logit contributions
 contrary-0.083
 Uk-0.080
 Kah-0.076
 Melt-0.075
 Outbreak-0.075
Token an
Token ID281
Feature activation+5.52e-01

Pos logit contributions
thora+1.342
icket+1.278
oli+1.258
ashtra+1.238
 incent+1.235
Neg logit contributions
 contrary-0.882
 Uk-0.852
 Kah-0.825
 Melt-0.813
 Outbreak-0.810
Token 1
Token ID352
Feature activation-2.96e-01

Pos logit contributions
thora+1.078
icket+1.006
oli+0.988
ashtra+0.980
 incent+0.969
Neg logit contributions
 contrary-1.140
 Uk-1.115
 Kah-1.080
 Outbreak-1.067
 KS-1.065
Token-
Token ID12
Feature activation-2.88e-01

Pos logit contributions
 contrary+0.426
 Uk+0.413
 Kah+0.396
 Melt+0.391
 Outbreak+0.387
Neg logit contributions
thora-0.754
icket-0.717
ashtra-0.708
oli-0.707
 incent-0.699
Token2
Token ID17
Feature activation+5.28e-02

Pos logit contributions
 contrary+0.434
 Uk+0.419
 Kah+0.403
 Melt+0.400
 Outbreak+0.397
Neg logit contributions
thora-0.715
icket-0.675
oli-0.666
ashtra-0.665
 incent-0.663
Token33
Token ID2091
Feature activation-5.38e-03

Pos logit contributions
 contrary+0.260
 Uk+0.252
 Kah+0.244
posts+0.241
 Outbreak+0.241
Neg logit contributions
thora-0.360
icket-0.340
 incent-0.333
oli-0.331
ashtra-0.331
Token
Token ID198
Feature activation+1.68e-01

Pos logit contributions
 contrary+0.009
 Uk+0.008
 Kah+0.008
 Outbreak+0.008
 Melt+0.008
Neg logit contributions
thora-0.013
icket-0.012
oli-0.012
 incent-0.012
ashtra-0.012
Token
Token ID198
Feature activation+6.80e-02

Pos logit contributions
thora+0.426
icket+0.406
oli+0.398
 incent+0.397
ashtra+0.393
Neg logit contributions
 contrary-0.246
 Uk-0.237
 Kah-0.228
 Melt-0.225
 Outbreak-0.225
Token19
Token ID1129
Feature activation-1.21e-01

Pos logit contributions
thora+0.153
icket+0.145
 incent+0.142
oli+0.141
ashtra+0.140
Neg logit contributions
 contrary-0.118
 Uk-0.115
 Kah-0.112
 Outbreak-0.110
posts-0.110
Token21
Token ID2481
Feature activation-2.03e-01

Pos logit contributions
 contrary+0.195
 Uk+0.189
 Kah+0.182
 Melt+0.180
posts+0.180
Neg logit contributions
thora-0.292
icket-0.276
 incent-0.271
ashtra-0.269
oli-0.269
Token
Token ID198
Feature activation+1.10e-01

Pos logit contributions
 contrary+0.323
 Uk+0.311
 Kah+0.299
 Outbreak+0.298
 Melt+0.297
Neg logit contributions
thora-0.484
icket-0.461
oli-0.454
 incent-0.446
ashtra-0.444
Token
Token ID198
Feature activation+1.39e-02

Pos logit contributions
thora+0.279
icket+0.266
oli+0.261
 incent+0.261
ashtra+0.258
Neg logit contributions
 contrary-0.159
 Uk-0.153
 Kah-0.148
 Melt-0.146
 Outbreak-0.146
Token19
Token ID1129
Feature activation-5.51e-02

Pos logit contributions
thora+0.031
icket+0.030
 incent+0.029
oli+0.029
ashtra+0.029
Neg logit contributions
 contrary-0.024
 Uk-0.024
 Kah-0.023
posts-0.023
 Outbreak-0.023
Token09
Token ID2931
Feature activation-1.80e-02

Pos logit contributions
 contrary+0.090
 Uk+0.087
 Kah+0.084
posts+0.083
 Melt+0.083
Neg logit contributions
thora-0.132
icket-0.124
 incent-0.122
oli-0.121
ashtra-0.121
Token
Token ID198
Feature activation-1.95e-02

Pos logit contributions
 contrary+0.029
 Uk+0.028
 Outbreak+0.027
 Kah+0.027
 Melt+0.027
Neg logit contributions
thora-0.042
icket-0.040
oli-0.040
 incent-0.039
ashtra-0.039
Token
Token ID198
Feature activation-8.21e-02

Pos logit contributions
 contrary+0.028
 Uk+0.027
 Kah+0.026
 Melt+0.026
 Outbreak+0.026
Neg logit contributions
thora-0.050
icket-0.048
 incent-0.047
oli-0.047
ashtra-0.046
Token this
Token ID428
Feature activation+3.67e-01

Pos logit contributions
 contrary+1.005
 Uk+0.966
 Kah+0.938
 Melt+0.925
 Outbreak+0.924
Neg logit contributions
thora-1.353
icket-1.284
oli-1.250
ashtra-1.239
 incent-1.236
Token pr
Token ID778
Feature activation-6.32e-01

Pos logit contributions
thora+0.743
icket+0.700
oli+0.682
 incent+0.672
ashtra+0.671
Neg logit contributions
 contrary-0.723
 Uk-0.703
 Kah-0.684
 Outbreak-0.678
 Melt-0.676
Tokenimate
Token ID1920
Feature activation+9.04e-01

Pos logit contributions
 contrary+0.936
 Uk+0.912
 Outbreak+0.866
 Kah+0.859
posts+0.852
Neg logit contributions
thora-1.553
 incent-1.489
icket-1.472
oli-1.446
 diarr-1.445
Token was
Token ID373
Feature activation+1.34e-01

Pos logit contributions
thora+1.991
icket+1.890
oli+1.866
 incent+1.835
iral+1.818
Neg logit contributions
 contrary-1.577
 Uk-1.539
 Kah-1.489
 Outbreak-1.476
 Melt-1.476
Token uniquely
Token ID24139
Feature activation-4.26e-03

Pos logit contributions
thora+0.281
icket+0.265
oli+0.258
 incent+0.254
ashtra+0.254
Neg logit contributions
 contrary-0.257
 Uk-0.248
 Kah-0.241
 Melt-0.239
 Outbreak-0.239
Token adapted
Token ID16573
Feature activation+2.17e-01

Pos logit contributions
 contrary+0.009
 Uk+0.009
 Kah+0.008
 Melt+0.008
 Outbreak+0.008
Neg logit contributions
thora-0.008
icket-0.008
oli-0.007
ashtra-0.007
emouth-0.007
Token to
Token ID284
Feature activation+1.64e-01

Pos logit contributions
thora+0.573
icket+0.550
oli+0.544
 incent+0.533
ashtra+0.533
Neg logit contributions
 contrary-0.283
 Uk-0.274
 Kah-0.265
 Outbreak-0.262
 Melt-0.260
Token its
Token ID663
Feature activation+3.61e-01

Pos logit contributions
thora+0.369
icket+0.350
oli+0.342
 incent+0.337
ashtra+0.336
Neg logit contributions
 contrary-0.289
 Uk-0.280
 Kah-0.271
 Melt-0.269
 Outbreak-0.269
Token environment
Token ID2858
Feature activation-7.50e-02

Pos logit contributions
thora+0.694
icket+0.651
oli+0.632
ashtra+0.623
 incent+0.622
Neg logit contributions
 contrary-0.761
 Uk-0.740
 Kah-0.719
 Melt-0.712
 Outbreak-0.711
Token,
Token ID11
Feature activation-9.04e-02

Pos logit contributions
 contrary+0.103
 Uk+0.099
 Kah+0.095
 Outbreak+0.094
 Melt+0.094
Neg logit contributions
thora-0.195
icket-0.187
oli-0.184
 incent-0.181
ashtra-0.181
Token Dr
Token ID1583
Feature activation-7.08e-02

Pos logit contributions
 contrary+0.136
 Uk+0.131
 Kah+0.126
 Outbreak+0.125
 Melt+0.125
Neg logit contributions
thora-0.224
icket-0.214
oli-0.210
 incent-0.208
ashtra-0.207
Token court
Token ID2184
Feature activation-4.12e-02

Pos logit contributions
thora+0.555
icket+0.531
oli+0.519
ashtra+0.515
 incent+0.515
Neg logit contributions
 contrary-0.320
 Uk-0.307
 Kah-0.296
 Melt-0.290
 Outbreak-0.289
Token accepts
Token ID18178
Feature activation-2.17e-01

Pos logit contributions
 contrary+0.074
 Uk+0.071
 Kah+0.069
 Melt+0.068
 Outbreak+0.067
Neg logit contributions
thora-0.092
icket-0.087
oli-0.085
ashtra-0.084
 incent-0.083
Token a
Token ID257
Feature activation+7.02e-02

Pos logit contributions
 contrary+0.359
 Uk+0.347
 Kah+0.336
 Melt+0.331
 Outbreak+0.330
Neg logit contributions
thora-0.509
icket-0.484
oli-0.474
 incent-0.469
ashtra-0.468
Token central
Token ID4318
Feature activation-4.50e-01

Pos logit contributions
thora+0.155
icket+0.147
oli+0.144
 incent+0.143
ashtra+0.142
Neg logit contributions
 contrary-0.126
 Uk-0.121
 Kah-0.118
 Melt-0.116
 Outbreak-0.116
Token government
Token ID1230
Feature activation-7.40e-01

Pos logit contributions
 contrary+0.685
 Uk+0.659
 Kah+0.636
 Melt+0.625
 Outbreak+0.619
Neg logit contributions
thora-1.114
icket-1.059
oli-1.040
 incent-1.032
ashtra-1.028
Token legal
Token ID2742
Feature activation+8.80e-01

Pos logit contributions
 contrary+1.285
 Uk+1.232
 Kah+1.197
 Melt+1.178
 Outbreak+1.163
Neg logit contributions
thora-1.701
icket-1.601
ashtra-1.568
oli-1.562
 incent-1.544
Token challenge
Token ID4427
Feature activation+4.48e-01

Pos logit contributions
thora+1.734
icket+1.635
oli+1.599
 incent+1.570
ashtra+1.557
Neg logit contributions
 contrary-1.757
 Uk-1.717
 Kah-1.671
 Outbreak-1.666
 Melt-1.656
Token related
Token ID3519
Feature activation+4.28e-01

Pos logit contributions
thora+1.152
icket+1.103
oli+1.087
 incent+1.069
ashtra+1.062
Neg logit contributions
 contrary-0.618
 Uk-0.602
 Kah-0.578
 Outbreak-0.576
 Melt-0.571
Token to
Token ID284
Feature activation-6.88e-01

Pos logit contributions
thora+1.174
icket+1.126
oli+1.107
 incent+1.094
ashtra+1.090
Neg logit contributions
 contrary-0.528
 Uk-0.506
 Kah-0.484
 Outbreak-0.478
 Melt-0.476
Token any
Token ID597
Feature activation-1.71e-01

Pos logit contributions
 contrary+1.121
 Uk+1.082
 Kah+1.049
 Melt+1.031
 Outbreak+1.023
Neg logit contributions
thora-1.628
icket-1.550
oli-1.515
 incent-1.504
ashtra-1.500
Token of
Token ID286
Feature activation-4.33e-01

Pos logit contributions
 contrary+0.282
 Uk+0.273
 Kah+0.264
 Outbreak+0.261
 Melt+0.260
Neg logit contributions
thora-0.400
icket-0.381
oli-0.373
 incent-0.368
ashtra-0.367
Token,
Token ID11
Feature activation-3.75e-01

Pos logit contributions
 contrary+0.852
 Uk+0.817
 Kah+0.785
 Outbreak+0.780
 Melt+0.777
Neg logit contributions
thora-1.579
icket-1.513
oli-1.488
 incent-1.467
ashtra-1.458
Token Bird
Token ID14506
Feature activation-9.13e-01

Pos logit contributions
 contrary+0.540
 Uk+0.521
posts+0.498
 Outbreak+0.498
 Kah+0.498
Neg logit contributions
thora-0.945
icket-0.909
oli-0.888
 incent-0.876
ashtra-0.873
Token of
Token ID286
Feature activation-6.73e-01

Pos logit contributions
 contrary+1.092
 Uk+1.034
 Kah+0.993
 Outbreak+0.986
 Melt+0.981
Neg logit contributions
thora-2.538
icket-2.444
oli-2.412
 incent-2.372
emouth-2.361
Token Pre
Token ID3771
Feature activation-9.39e-01

Pos logit contributions
 contrary+0.918
 Uk+0.870
 Kah+0.857
 Outbreak+0.854
 Melt+0.848
Neg logit contributions
thora-1.785
 incent-1.693
icket-1.691
oli-1.662
ashtra-1.655
Tokeny
Token ID88
Feature activation-3.77e-01

Pos logit contributions
 contrary+1.626
 Uk+1.564
posts+1.528
 Kah+1.515
 Melt+1.510
Neg logit contributions
thora-2.103
 incent-2.054
icket-1.963
ashtra-1.943
 distingu-1.942
Token is
Token ID318
Feature activation-2.97e-01

Pos logit contributions
 contrary+0.595
 Uk+0.579
 Kah+0.557
 Melt+0.552
 Outbreak+0.550
Neg logit contributions
thora-0.911
icket-0.864
oli-0.845
 incent-0.836
ashtra-0.834
Token a
Token ID257
Feature activation-4.57e-01

Pos logit contributions
 contrary+0.509
 Uk+0.492
 Kah+0.477
 Melt+0.472
 Outbreak+0.471
Neg logit contributions
thora-0.680
icket-0.643
oli-0.630
 incent-0.624
ashtra-0.620
Token psychedelic
Token ID34490
Feature activation-1.59e+00

Pos logit contributions
 contrary+0.869
 Uk+0.846
 Kah+0.821
 Melt+0.813
 Outbreak+0.811
Neg logit contributions
thora-0.966
icket-0.910
oli-0.881
 incent-0.875
ashtra-0.874
Token indie
Token ID19907
Feature activation-8.63e-01

Pos logit contributions
 contrary+3.056
 Uk+3.001
 Kah+2.892
 Outbreak+2.863
 Melt+2.849
Neg logit contributions
thora-3.352
icket-3.134
emouth-3.014
 incent-3.003
ashtra-2.999
Token rock
Token ID3881
Feature activation-5.21e-01

Pos logit contributions
 contrary+1.784
 Uk+1.754
 Kah+1.695
 Melt+1.686
 Outbreak+1.679
Neg logit contributions
thora-1.670
icket-1.562
oli-1.500
 incent-1.494
ashtra-1.490
Token band
Token ID4097
Feature activation-6.96e-01

Pos logit contributions
 contrary+0.949
 Uk+0.924
 Kah+0.892
 Melt+0.887
 Outbreak+0.886
Neg logit contributions
thora-1.133
icket-1.071
oli-1.039
ashtra-1.027
 incent-1.027
Tokenator
Token ID1352
Feature activation+2.50e-01

Pos logit contributions
thora+0.229
icket+0.213
oli+0.206
 incent+0.204
ashtra+0.202
Neg logit contributions
 contrary-0.309
 Uk-0.302
 Kah-0.295
 Outbreak-0.292
 Melt-0.292
Token are
Token ID389
Feature activation-5.36e-02

Pos logit contributions
thora+0.579
icket+0.550
oli+0.539
 incent+0.533
ashtra+0.529
Neg logit contributions
 contrary-0.420
 Uk-0.406
 Kah-0.394
 Melt-0.390
 Outbreak-0.389
Token pointing
Token ID10609
Feature activation+2.27e-01

Pos logit contributions
 contrary+0.099
 Uk+0.096
 Kah+0.093
 Melt+0.092
posts+0.091
Neg logit contributions
thora-0.116
icket-0.109
oli-0.106
 incent-0.105
iral-0.105
Token to
Token ID284
Feature activation-1.17e-01

Pos logit contributions
thora+0.562
icket+0.536
oli+0.526
 incent+0.520
ashtra+0.517
Neg logit contributions
 contrary-0.341
 Uk-0.329
 Kah-0.317
 Outbreak-0.313
 Melt-0.313
Token the
Token ID262
Feature activation+1.63e-01

Pos logit contributions
 contrary+0.191
 Uk+0.185
 Kah+0.179
 Melt+0.177
 Outbreak+0.176
Neg logit contributions
thora-0.279
icket-0.265
oli-0.259
 incent-0.257
ashtra-0.255
Token lack
Token ID3092
Feature activation-2.57e-01

Pos logit contributions
thora+0.368
icket+0.350
oli+0.342
 incent+0.338
ashtra+0.337
Neg logit contributions
 contrary-0.285
 Uk-0.275
 Kah-0.266
 Melt-0.264
 Outbreak-0.262
Token of
Token ID286
Feature activation+3.20e-01

Pos logit contributions
 contrary+0.301
 Uk+0.291
 Kah+0.276
 Outbreak+0.276
 Melt+0.274
Neg logit contributions
thora-0.718
icket-0.690
oli-0.682
 incent-0.669
ashtra-0.668
Token an
Token ID281
Feature activation+4.80e-01

Pos logit contributions
thora+0.716
icket+0.680
oli+0.665
 incent+0.655
ashtra+0.654
Neg logit contributions
 contrary-0.561
 Uk-0.544
 Kah-0.527
 Melt-0.520
 Outbreak-0.520
Token imp
Token ID848
Feature activation-2.65e-01

Pos logit contributions
thora+1.064
icket+1.003
oli+0.977
 incent+0.967
ashtra+0.967
Neg logit contributions
 contrary-0.861
 Uk-0.841
 Kah-0.810
 Outbreak-0.802
 Melt-0.800
Tokenassion
Token ID11857
Feature activation+1.25e+00

Pos logit contributions
 contrary+0.538
 Uk+0.524
 Kah+0.510
 Melt+0.505
 Outbreak+0.503
Neg logit contributions
thora-0.514
icket-0.483
oli-0.470
 incent-0.468
ashtra-0.465
Tokened
Token ID276
Feature activation+3.67e-01

Pos logit contributions
thora+2.754
icket+2.616
oli+2.562
 incent+2.520
ashtra+2.510
Neg logit contributions
 contrary-2.206
 Uk-2.144
 Kah-2.079
 Outbreak-2.060
 Melt-2.057
Tokenminded
Token ID14543
Feature activation+3.25e-01

Pos logit contributions
thora+0.479
icket+0.436
oli+0.424
 incent+0.419
ashtra+0.417
Neg logit contributions
 contrary-0.789
 Uk-0.774
 Kah-0.757
posts-0.751
 Outbreak-0.749
Token voters
Token ID4446
Feature activation+2.41e-01

Pos logit contributions
thora+0.722
icket+0.675
oli+0.653
iral+0.650
ashtra+0.649
Neg logit contributions
 contrary-0.595
 Uk-0.576
 Kah-0.559
 Melt-0.547
 KS-0.547
Token that
Token ID326
Feature activation+5.50e-02

Pos logit contributions
thora+0.571
icket+0.542
oli+0.530
 incent+0.525
ashtra+0.522
Neg logit contributions
 contrary-0.394
 Uk-0.379
 Kah-0.366
 Melt-0.362
 Outbreak-0.361
Token accused
Token ID5371
Feature activation+5.61e-01

Pos logit contributions
thora+0.117
icket+0.109
oli+0.107
ashtra+0.105
 incent+0.105
Neg logit contributions
 contrary-0.104
 Uk-0.101
 Kah-0.098
 Melt-0.097
 Outbreak-0.096
Token Obama
Token ID2486
Feature activation+1.29e-01

Pos logit contributions
thora+1.071
icket+1.011
oli+0.986
 incent+0.967
ashtra+0.962
Neg logit contributions
 contrary-1.156
 Uk-1.130
 Kah-1.100
 Outbreak-1.094
 Melt-1.091
Token of
Token ID286
Feature activation-3.50e-01

Pos logit contributions
thora+0.343
icket+0.331
oli+0.327
 incent+0.321
iral+0.319
Neg logit contributions
 contrary-0.161
 Uk-0.157
 Outbreak-0.153
 Melt-0.151
 Kah-0.149
Token lying
Token ID9105
Feature activation-6.77e-01

Pos logit contributions
 contrary+0.623
 Uk+0.594
 Kah+0.573
 KS+0.561
posts+0.559
Neg logit contributions
thora-0.797
icket-0.750
oli-0.734
ashtra-0.730
iral-0.722
Token about
Token ID546
Feature activation+3.67e-01

Pos logit contributions
 contrary+1.023
 Uk+0.970
 Kah+0.934
 Melt+0.916
 Outbreak+0.916
Neg logit contributions
thora-1.699
icket-1.618
oli-1.587
 incent-1.574
iral-1.570
Token his
Token ID465
Feature activation+1.73e-02

Pos logit contributions
thora+0.792
icket+0.746
oli+0.731
 incent+0.719
ashtra+0.719
Neg logit contributions
 contrary-0.685
 Uk-0.660
 Kah-0.643
 Melt-0.632
 Outbreak-0.631
Token position
Token ID2292
Feature activation-2.95e-01

Pos logit contributions
thora+0.040
icket+0.038
oli+0.037
ashtra+0.036
 incent+0.036
Neg logit contributions
 contrary-0.030
 Uk-0.029
 Kah-0.028
 Melt-0.028
 Outbreak-0.027
Token on
Token ID319
Feature activation+6.60e-02

Pos logit contributions
 contrary+0.403
 Uk+0.390
 Kah+0.374
 Outbreak+0.371
 Melt+0.370
Neg logit contributions
thora-0.767
icket-0.734
oli-0.722
 incent-0.713
ashtra-0.708
Token prince
Token ID19716
Feature activation+7.77e-01

Pos logit contributions
 contrary+0.536
 Uk+0.517
 Kah+0.504
 Melt+0.497
posts+0.492
Neg logit contributions
thora-0.751
icket-0.711
 incent-0.694
oli-0.693
ashtra-0.685
Token of
Token ID286
Feature activation-3.18e-02

Pos logit contributions
thora+2.001
icket+1.915
oli+1.883
 incent+1.856
ashtra+1.849
Neg logit contributions
 contrary-1.085
 Uk-1.045
 Kah-1.002
 Outbreak-0.994
 Melt-0.989
Token comics
Token ID12770
Feature activation-7.57e-01

Pos logit contributions
 contrary+0.053
 Uk+0.052
 Kah+0.050
 Melt+0.050
 Outbreak+0.049
Neg logit contributions
thora-0.074
icket-0.070
oli-0.068
 incent-0.068
ashtra-0.067
Token himself
Token ID2241
Feature activation+4.01e-01

Pos logit contributions
 contrary+1.128
 Uk+1.086
 Kah+1.055
 Melt+1.043
 Outbreak+1.025
Neg logit contributions
thora-1.904
icket-1.811
oli-1.763
 incent-1.757
ashtra-1.741
Token:
Token ID25
Feature activation+4.12e-02

Pos logit contributions
thora+0.993
icket+0.948
oli+0.931
 incent+0.919
ashtra+0.915
Neg logit contributions
 contrary-0.604
 Uk-0.583
 Kah-0.562
 Outbreak-0.556
 Melt-0.555
Token Frank
Token ID5278
Feature activation-7.37e-01

Pos logit contributions
thora+0.095
icket+0.090
oli+0.088
 incent+0.087
ashtra+0.087
Neg logit contributions
 contrary-0.070
 Uk-0.068
 Kah-0.066
 Melt-0.065
 Outbreak-0.065
Token Miller
Token ID7920
Feature activation-2.46e-01

Pos logit contributions
 contrary+0.682
 Uk+0.653
 Melt+0.619
 Kah+0.618
 Outbreak+0.591
Neg logit contributions
thora-2.251
icket-2.166
 incent-2.118
ashtra-2.116
oli-2.113
Token.
Token ID13
Feature activation+2.85e-01

Pos logit contributions
 contrary+0.373
 Uk+0.360
 Kah+0.348
 Melt+0.343
 Outbreak+0.343
Neg logit contributions
thora-0.609
icket-0.581
oli-0.569
 incent-0.563
ashtra-0.561
Token How
Token ID1374
Feature activation-1.47e-01

Pos logit contributions
thora+0.733
icket+0.701
oli+0.688
 incent+0.681
ashtra+0.678
Neg logit contributions
 contrary-0.403
 Uk-0.389
 Kah-0.374
 Outbreak-0.370
 Melt-0.370
Token could
Token ID714
Feature activation+4.03e-01

Pos logit contributions
 contrary+0.252
 Uk+0.243
 Kah+0.236
 Melt+0.234
 Outbreak+0.232
Neg logit contributions
thora-0.338
icket-0.322
oli-0.312
 incent-0.310
ashtra-0.310
Token this
Token ID428
Feature activation+5.17e-01

Pos logit contributions
thora+0.930
icket+0.886
oli+0.869
 incent+0.856
ashtra+0.852
Neg logit contributions
 contrary-0.668
 Uk-0.648
 Kah-0.627
 Outbreak-0.622
 Melt-0.619
Token him
Token ID683
Feature activation+3.72e-01

Pos logit contributions
thora+0.227
icket+0.216
oli+0.211
 incent+0.209
ashtra+0.208
Neg logit contributions
 contrary-0.172
 Uk-0.166
 Kah-0.162
 Melt-0.160
 Outbreak-0.158
Token for
Token ID329
Feature activation+2.17e-01

Pos logit contributions
thora+0.947
icket+0.905
oli+0.888
 incent+0.878
ashtra+0.874
Neg logit contributions
 contrary-0.534
 Uk-0.514
 Kah-0.495
 Outbreak-0.489
 Melt-0.489
Token you
Token ID345
Feature activation+5.58e-02

Pos logit contributions
thora+0.481
icket+0.457
oli+0.445
ashtra+0.440
 incent+0.440
Neg logit contributions
 contrary-0.386
 Uk-0.374
 Kah-0.364
 Melt-0.358
 Outbreak-0.356
Token.
Token ID13
Feature activation+6.67e-02

Pos logit contributions
thora+0.141
icket+0.135
oli+0.132
 incent+0.131
ashtra+0.130
Neg logit contributions
 contrary-0.082
 Uk-0.078
 Kah-0.076
 Melt-0.075
 Outbreak-0.074
Token<|endoftext|>
Token ID50256
Feature activation+4.93e-01

Pos logit contributions
thora+0.167
icket+0.160
oli+0.156
 incent+0.155
ashtra+0.154
Neg logit contributions
 contrary-0.099
 Uk-0.097
 Kah-0.094
 Melt-0.092
 Outbreak-0.092
TokenTan
Token ID45557
Feature activation-4.06e-01

Pos logit contributions
thora+1.073
icket+1.016
oli+0.993
 incent+0.983
ashtra+0.975
Neg logit contributions
 contrary-0.890
 Uk-0.865
 Kah-0.839
 Outbreak-0.832
 Melt-0.830
Tokenaka
Token ID8130
Feature activation+3.25e-01

Pos logit contributions
 contrary+0.688
 Uk+0.676
 Kah+0.655
 Outbreak+0.643
 Melt+0.643
Neg logit contributions
thora-0.928
icket-0.878
 incent-0.855
oli-0.846
ashtra-0.843
Token's
Token ID338
Feature activation+2.31e-01

Pos logit contributions
thora+0.796
icket+0.759
oli+0.744
 incent+0.735
ashtra+0.732
Neg logit contributions
 contrary-0.497
 Uk-0.482
 Kah-0.465
 Outbreak-0.460
 Melt-0.459
Token artwork
Token ID16257
Feature activation-1.17e-01

Pos logit contributions
thora+0.532
icket+0.506
oli+0.495
 incent+0.489
ashtra+0.486
Neg logit contributions
 contrary-0.388
 Uk-0.377
 Kah-0.365
 Outbreak-0.361
 Melt-0.361
Token is
Token ID318
Feature activation-2.18e-01

Pos logit contributions
 contrary+0.182
 Uk+0.175
 Kah+0.169
 Melt+0.168
 Outbreak+0.168
Neg logit contributions
thora-0.287
icket-0.273
oli-0.267
 incent-0.263
ashtra-0.263
Token,
Token ID11
Feature activation-3.17e-01

Pos logit contributions
 contrary+0.386
 Uk+0.372
 Kah+0.361
 Melt+0.357
 Outbreak+0.356
Neg logit contributions
thora-0.486
icket-0.461
oli-0.450
 incent-0.445
ashtra-0.442
Token View
Token ID3582
Feature activation-1.03e+00

Pos logit contributions
thora+0.086
icket+0.082
oli+0.081
 incent+0.080
ashtra+0.080
Neg logit contributions
 contrary-0.044
 Uk-0.043
 Kah-0.041
 Outbreak-0.040
 Melt-0.040
Token all
Token ID477
Feature activation-1.33e+00

Pos logit contributions
 contrary+1.848
 Uk+1.760
 Kah+1.711
 Outbreak+1.698
 Melt+1.693
Neg logit contributions
thora-2.315
 incent-2.215
icket-2.176
oli-2.131
ashtra-2.110
Token New
Token ID968
Feature activation-1.67e-01

Pos logit contributions
 contrary+1.713
 Uk+1.663
 Kah+1.592
 Dispatch+1.571
 Melt+1.569
Neg logit contributions
thora-3.655
 incent-3.530
icket-3.474
ashtra-3.388
iral-3.375
Token York
Token ID1971
Feature activation+2.40e-01

Pos logit contributions
 contrary+0.286
 Uk+0.278
 Kah+0.268
 Outbreak+0.267
 Melt+0.266
Neg logit contributions
thora-0.382
icket-0.362
oli-0.354
 incent-0.353
ashtra-0.349
Token Times
Token ID3782
Feature activation-2.74e-01

Pos logit contributions
thora+0.598
icket+0.570
oli+0.557
 incent+0.556
ashtra+0.549
Neg logit contributions
 contrary-0.361
 Uk-0.348
 Kah-0.335
 Melt-0.333
 Outbreak-0.332
Token newsletters
Token ID16983
Feature activation-1.67e+00

Pos logit contributions
 contrary+0.350
 Uk+0.333
 Kah+0.315
 Melt+0.315
posts+0.314
Neg logit contributions
thora-0.754
icket-0.713
 incent-0.711
oli-0.704
ashtra-0.693
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
 contrary+2.144
 Uk+2.019
 Kah+1.956
 Melt+1.952
 KS+1.913
Neg logit contributions
thora-4.504
 incent-4.452
icket-4.327
 distingu-4.253
emouth-4.213
Token
Token ID198
Feature activation-1.05e-01

Pos logit contributions
 contrary+0.334
 Uk+0.323
 Kah+0.311
 Melt+0.306
 Outbreak+0.305
Neg logit contributions
thora-0.608
icket-0.580
 incent-0.573
oli-0.568
ashtra-0.563
Token
Token ID198
Feature activation-5.17e-02

Pos logit contributions
 contrary+0.148
 Uk+0.142
 Kah+0.137
 Melt+0.135
 Outbreak+0.134
Neg logit contributions
thora-0.272
icket-0.259
 incent-0.257
oli-0.253
ashtra-0.252
Token
Token ID447
Feature activation-2.04e-01

Pos logit contributions
 contrary+0.089
 Uk+0.087
 Kah+0.084
posts+0.084
 Melt+0.083
Neg logit contributions
thora-0.117
icket-0.110
 incent-0.109
oli-0.107
ashtra-0.107
Token
Token ID250
Feature activation-4.15e-03

Pos logit contributions
 contrary+0.363
 Uk+0.345
 Kah+0.342
 Melt+0.338
 Outbreak+0.334
Neg logit contributions
thora-0.447
icket-0.432
 incent-0.423
oli-0.416
emouth-0.414
Token By
Token ID2750
Feature activation-5.33e-01

Pos logit contributions
thora+0.161
icket+0.154
oli+0.151
 incent+0.149
ashtra+0.148
Neg logit contributions
 contrary-0.090
 Uk-0.087
 Kah-0.084
 Outbreak-0.083
 Melt-0.082
Token comparison
Token ID7208
Feature activation-4.32e-01

Pos logit contributions
 contrary+0.890
 Uk+0.851
 Kah+0.823
 Melt+0.812
 KS+0.804
Neg logit contributions
thora-1.264
icket-1.195
oli-1.157
emouth-1.156
ashtra-1.155
Token:
Token ID25
Feature activation+1.38e-01

Pos logit contributions
 contrary+0.590
 Uk+0.576
 Outbreak+0.558
 Kah+0.554
 Melt+0.548
Neg logit contributions
thora-1.117
icket-1.071
oli-1.060
 incent-1.035
iral-1.035
Token Earth
Token ID3668
Feature activation-1.42e-02

Pos logit contributions
thora+0.351
icket+0.335
oli+0.328
 incent+0.325
ashtra+0.324
Neg logit contributions
 contrary-0.200
 Uk-0.192
 Kah-0.185
 Melt-0.183
 Outbreak-0.182
Token's
Token ID338
Feature activation+8.26e-02

Pos logit contributions
 contrary+0.024
 Uk+0.023
 Kah+0.022
 Melt+0.022
 Outbreak+0.022
Neg logit contributions
thora-0.033
icket-0.032
oli-0.031
 incent-0.031
ashtra-0.031
Token moon
Token ID8824
Feature activation-1.47e+00

Pos logit contributions
thora+0.170
icket+0.160
oli+0.157
ashtra+0.154
 incent+0.154
Neg logit contributions
 contrary-0.161
 Uk-0.156
 Kah-0.152
 Melt-0.150
 Outbreak-0.149
Token is
Token ID318
Feature activation-2.16e-01

Pos logit contributions
 contrary+2.371
 Uk+2.275
 Kah+2.223
 Melt+2.187
posts+2.172
Neg logit contributions
thora-3.508
icket-3.299
ashtra-3.247
emouth-3.189
 incent-3.181
Token already
Token ID1541
Feature activation+1.83e-02

Pos logit contributions
 contrary+0.376
 Uk+0.360
 Kah+0.349
 Melt+0.344
 KS+0.340
Neg logit contributions
thora-0.498
icket-0.471
oli-0.459
ashtra-0.455
emouth-0.452
Token in
Token ID287
Feature activation-2.77e-01

Pos logit contributions
thora+0.041
icket+0.039
oli+0.037
ashtra+0.037
emouth+0.037
Neg logit contributions
 contrary-0.033
 Uk-0.032
 Kah-0.031
 Melt-0.030
 Outbreak-0.030
Token an
Token ID281
Feature activation+3.21e-01

Pos logit contributions
 contrary+0.492
 Uk+0.476
 Kah+0.462
 Melt+0.458
 Outbreak+0.452
Neg logit contributions
thora-0.617
icket-0.583
oli-0.568
 incent-0.563
ashtra-0.563
Token Earth
Token ID3668
Feature activation-6.29e-02

Pos logit contributions
thora+0.671
icket+0.626
emouth+0.608
oli+0.603
ashtra+0.602
Neg logit contributions
 contrary-0.625
 Uk-0.611
 Kah-0.588
 Outbreak-0.583
 Melt-0.582
Token Windows
Token ID3964
Feature activation-1.78e-01

Pos logit contributions
 contrary+0.194
 Uk+0.186
 Kah+0.180
 Melt+0.177
 Outbreak+0.177
Neg logit contributions
thora-0.303
icket-0.289
oli-0.283
ashtra-0.281
 incent-0.278
Token
Token ID198
Feature activation+1.46e-01

Pos logit contributions
 contrary+0.249
 Uk+0.240
 Kah+0.231
 Outbreak+0.228
 Melt+0.228
Neg logit contributions
thora-0.460
icket-0.440
oli-0.432
 incent-0.427
ashtra-0.425
Token
Token ID198
Feature activation-2.06e-02

Pos logit contributions
thora+0.373
icket+0.356
 incent+0.351
oli+0.348
ashtra+0.345
Neg logit contributions
 contrary-0.210
 Uk-0.202
 Kah-0.194
 Melt-0.191
 Outbreak-0.191
TokenFor
Token ID1890
Feature activation-5.42e-02

Pos logit contributions
 contrary+0.037
 Uk+0.036
 Kah+0.035
posts+0.034
 Melt+0.034
Neg logit contributions
thora-0.045
icket-0.043
oli-0.042
 incent-0.042
ashtra-0.041
Token W
Token ID370
Feature activation+2.66e-01

Pos logit contributions
 contrary+0.092
 Uk+0.089
 Kah+0.087
 Melt+0.085
 Outbreak+0.085
Neg logit contributions
thora-0.124
icket-0.118
oli-0.115
ashtra-0.115
 incent-0.114
TokenAB
Token ID6242
Feature activation-1.52e+00

Pos logit contributions
thora+0.584
icket+0.551
oli+0.538
 incent+0.536
ashtra+0.533
Neg logit contributions
 contrary-0.473
 Uk-0.461
 Kah-0.446
 Melt-0.443
 Outbreak-0.441
TokenL
Token ID43
Feature activation-1.67e-01

Pos logit contributions
 contrary+2.535
 Uk+2.471
 Kah+2.378
 Melt+2.368
 Dispatch+2.347
Neg logit contributions
thora-3.550
icket-3.310
iral-3.196
 incent-3.192
ashtra-3.185
Token members
Token ID1866
Feature activation+3.75e-01

Pos logit contributions
 contrary+0.256
 Uk+0.249
 Kah+0.241
 Melt+0.237
 Outbreak+0.236
Neg logit contributions
thora-0.413
icket-0.388
oli-0.379
 incent-0.377
ashtra-0.377
Token,
Token ID11
Feature activation+7.07e-02

Pos logit contributions
thora+0.960
icket+0.917
oli+0.901
 incent+0.889
ashtra+0.886
Neg logit contributions
 contrary-0.535
 Uk-0.516
 Kah-0.497
 Outbreak-0.490
 Melt-0.490
Token this
Token ID428
Feature activation-2.04e-01

Pos logit contributions
thora+0.161
icket+0.153
oli+0.150
 incent+0.148
ashtra+0.147
Neg logit contributions
 contrary-0.121
 Uk-0.117
 Kah-0.114
 Melt-0.112
 Outbreak-0.112
Token app
Token ID598
Feature activation-1.20e-01

Pos logit contributions
 contrary+0.284
 Uk+0.275
 Kah+0.264
 Melt+0.261
 Outbreak+0.260
Neg logit contributions
thora-0.530
icket-0.506
oli-0.497
ashtra-0.491
 incent-0.491
Token that
Token ID326
Feature activation+1.86e-01

Pos logit contributions
thora+0.026
icket+0.025
oli+0.024
 incent+0.024
ashtra+0.024
Neg logit contributions
 contrary-0.019
 Uk-0.018
 Kah-0.018
 Melt-0.017
 Outbreak-0.017
Token also
Token ID635
Feature activation-4.22e-01

Pos logit contributions
thora+0.434
icket+0.410
oli+0.402
ashtra+0.398
emouth+0.396
Neg logit contributions
 contrary-0.311
 Uk-0.300
 Kah-0.292
 Melt-0.287
 Outbreak-0.287
Token no
Token ID645
Feature activation-2.06e-01

Pos logit contributions
 contrary+0.737
 Uk+0.705
 Kah+0.683
 Melt+0.671
posts+0.671
Neg logit contributions
thora-0.972
icket-0.913
ashtra-0.887
oli-0.887
iral-0.874
Token longer
Token ID2392
Feature activation+9.36e-01

Pos logit contributions
 contrary+0.269
 Uk+0.258
 Kah+0.247
 Melt+0.243
 Outbreak+0.243
Neg logit contributions
thora-0.555
icket-0.528
oli-0.517
ashtra-0.515
 incent-0.513
Token works
Token ID2499
Feature activation-1.58e-01

Pos logit contributions
thora+2.156
icket+2.051
oli+2.014
 incent+1.990
ashtra+1.971
Neg logit contributions
 contrary-1.558
 Uk-1.514
 Kah-1.465
 Outbreak-1.455
 Melt-1.450
Token.)
Token ID2014
Feature activation-1.47e+00

Pos logit contributions
 contrary+0.280
 Uk+0.271
 Kah+0.263
 Outbreak+0.260
 Melt+0.260
Neg logit contributions
thora-0.350
icket-0.331
oli-0.324
 incent-0.320
ashtra-0.318
Token
Token ID198
Feature activation-1.34e-01

Pos logit contributions
 contrary+1.854
 Uk+1.814
 Kah+1.772
 Melt+1.744
 Dispatch+1.721
Neg logit contributions
thora-4.036
icket-3.836
ashtra-3.729
oli-3.728
 incent-3.724
Token
Token ID198
Feature activation-1.90e-01

Pos logit contributions
 contrary+0.191
 Uk+0.183
 Kah+0.177
 Melt+0.174
 Outbreak+0.173
Neg logit contributions
thora-0.348
icket-0.331
 incent-0.328
oli-0.324
ashtra-0.322
TokenThe
Token ID464
Feature activation+2.45e-01

Pos logit contributions
 contrary+0.323
 Uk+0.311
posts+0.307
 Kah+0.305
 Melt+0.300
Neg logit contributions
thora-0.437
icket-0.409
 incent-0.407
emouth-0.401
ashtra-0.399
Token table
Token ID3084
Feature activation+4.21e-01

Pos logit contributions
thora+0.548
icket+0.521
oli+0.510
ashtra+0.502
 incent+0.501
Neg logit contributions
 contrary-0.430
 Uk-0.416
 Kah-0.405
 Outbreak-0.400
 Melt-0.400
Token of
Token ID286
Feature activation+6.48e-01

Pos logit contributions
thora+1.027
icket+0.980
oli+0.961
 incent+0.948
ashtra+0.944
Neg logit contributions
 contrary-0.651
 Uk-0.629
 Kah-0.607
 Outbreak-0.601
 Melt-0.600
Token3
Token ID18
Feature activation+6.69e-02

Pos logit contributions
 contrary+0.089
 Uk+0.086
 Kah+0.083
 Melt+0.082
 Outbreak+0.081
Neg logit contributions
thora-0.147
icket-0.140
oli-0.137
 incent-0.136
ashtra-0.136
Token lit
Token ID6578
Feature activation+2.61e-01

Pos logit contributions
thora+0.146
icket+0.138
oli+0.135
 incent+0.133
ashtra+0.133
Neg logit contributions
 contrary-0.121
 Uk-0.117
 Kah-0.114
 Melt-0.113
 Outbreak-0.113
Tokeners
Token ID364
Feature activation+2.40e-01

Pos logit contributions
thora+0.642
icket+0.611
oli+0.597
 incent+0.594
ashtra+0.594
Neg logit contributions
 contrary-0.397
 Uk-0.384
 Kah-0.370
 Melt-0.366
posts-0.365
Token of
Token ID286
Feature activation-1.19e-01

Pos logit contributions
thora+0.650
icket+0.622
oli+0.613
 incent+0.605
ashtra+0.602
Neg logit contributions
 contrary-0.306
 Uk-0.293
 Kah-0.280
 Outbreak-0.278
 Melt-0.276
Token w
Token ID266
Feature activation+3.75e-01

Pos logit contributions
 contrary+0.234
 Uk+0.229
 Kah+0.226
 Melt+0.222
 Milk+0.219
Neg logit contributions
thora-0.239
icket-0.227
oli-0.219
iral-0.218
ashtra-0.217
Tokenort
Token ID419
Feature activation-1.75e+00

Pos logit contributions
thora+0.876
icket+0.825
 incent+0.816
ashtra+0.810
oli+0.809
Neg logit contributions
 contrary-0.611
 Uk-0.593
 Kah-0.575
 Outbreak-0.564
 Melt-0.561
Token,
Token ID11
Feature activation+2.04e-01

Pos logit contributions
 contrary+2.362
 Uk+2.227
 Kah+2.172
 Melt+2.160
 Milk+2.112
Neg logit contributions
thora-4.591
icket-4.401
 incent-4.290
oli-4.262
ashtra-4.252
Token which
Token ID543
Feature activation+2.04e-01

Pos logit contributions
thora+0.480
icket+0.456
oli+0.447
 incent+0.441
ashtra+0.439
Neg logit contributions
 contrary-0.334
 Uk-0.324
 Kah-0.313
 Melt-0.310
 Outbreak-0.309
Token is
Token ID318
Feature activation+3.87e-01

Pos logit contributions
thora+0.480
icket+0.457
oli+0.447
 incent+0.442
ashtra+0.441
Neg logit contributions
 contrary-0.332
 Uk-0.321
 Kah-0.311
 Melt-0.308
 Outbreak-0.307
Token just
Token ID655
Feature activation-3.39e-01

Pos logit contributions
thora+0.886
icket+0.843
oli+0.824
 incent+0.814
ashtra+0.810
Neg logit contributions
 contrary-0.658
 Uk-0.638
 Kah-0.618
 Melt-0.611
 Outbreak-0.610
Token enough
Token ID1576
Feature activation-2.79e-01

Pos logit contributions
 contrary+0.563
 Uk+0.538
 Kah+0.526
 Melt+0.519
 KS+0.510
Neg logit contributions
thora-0.798
icket-0.764
oli-0.745
iral-0.740
 incent-0.738