TOP ACTIVATIONS
MAX = 1.446

now published in the Proceedings of the National Academy of Sciences
last month in the Proceedings of the National Academy of Sciences
1 ). He ikes fl es hes out this commitment to
: set dir = c \ :\ tmp : */ This
images directory ") args = v ars ( ap . parse
See also References <|endoftext|> Police have also seized
search Space ][ [ i ]] ], Keys [ search Space
and should now decide once and for all whether it wanted
ren owned perf umer Fr é d é ric Mal le
published in the Proceedings of the National Academy of Sciences .
end T ear V oid Ray J KS Le en ock
character . CV : So what made you guys
References [ edit ] the ten dollar
- v ick ) The results support some surprising recent measurements
[ edit ] References [ edit ]
: What in hell was the point of kick
References [ edit ] <|endoftext|> Z ombie Deep State
first two in words unf et tered by reality . This
mission manager Dr Rune Fl ober gh agen .
: " Why in the hell do journalists insist on coming

MIN ACTIVATIONS
MIN = -1.330

s , standing 5 ' 7 " with dark hair ,
; and behold , all is vanity and vex
infusion of provincial funds and , in the case of Vancouver
Est eb an [ O con , his new teammate ]
A ole i emo
initiative is feasible constitutionally and , although it would find opposition
AP ) Story continues There were a
He was a major trade asset . And the Padres should
Steam the broccoli for a few minutes until fork - t
stage . Story continues below advertisement He
oppression , humanity was at last free from the cruel tyranny
cameras were turned on and , in the relatively dense atmosphere
{{ 9 , 15 }, {{ 8 , 14 }, {
policy . US AT OD AY OP IN ION
next - generation BF Good rich All - Ter rain KO
energy goals . <|endoftext|> Media playback is unsupported on your device
dollars . Story continues below advertisement Story
of women , men and , increasingly , LGBT people ,
). Cl ueless or powerful ? Ident ifying sub types of
in Sweden , Hungary and , according to several European sources

INTERVAL 0.752 - 1.446
CONTAINS 0.394%

2 Podcast ! pur ak ushi Profile Joined August
positions . But what does that even mean ?
enson as Row lf .[ 4 ][ 9 ] Although released
the city is violating her rights . H ilton
phan ultimately ends up doing nothing . But now

INTERVAL 0.058 - 0.752
CONTAINS 41.083%

the wheel of a Ford Fiesta R 5 , but the
city . It is up to them how to rule it
Dr . Melissa Tall man and her student Andrea Morrow ,
In 2011 , we celebrated our 1 million th beer check
episode at length . He feels very , very

INTERVAL -0.636 - 0.058
CONTAINS 57.487%

G aylor . <|endoftext|> Chief Special Prosecutor , K atic a
held rates at 0 . 5 % since March 2009 in
a foe of President Assad . Recent reports say
to where you are now . And the instant you start
Red Transit L anes on Church Have Made Mun i Faster

INTERVAL -1.330 - -0.636
CONTAINS 1.036%

For Bob , it would come in the form of the
it takes about 15 - 20 minutes for it to be
, behold , all was vanity and vex ation of spirit
lol m 2 k [ 10 : 47 ] < kw
films Seven Samurai and Y oj im bo , among others
Token now
Token ID783
Feature activation+2.89e-01

Pos logit contributions
Hour+1.179
GAN+1.176
table+1.173
Band+1.156
chin+1.146
Neg logit contributions
-1.322
ITNESS-1.274
 Edmund-1.158
 defences-1.105
 insure-1.091
Token published
Token ID3199
Feature activation+1.25e-01

Pos logit contributions
Hour+1.704
GAN+1.700
table+1.696
Band+1.676
chin+1.665
Neg logit contributions
-1.198
ITNESS-1.141
 Edmund-1.007
 defences-0.944
 insure-0.928
Token in
Token ID287
Feature activation+4.25e-01

Pos logit contributions
Hour+0.547
GAN+0.547
table+0.545
Band+0.537
chin+0.531
Neg logit contributions
-0.700
ITNESS-0.675
 Edmund-0.619
 defences-0.591
 insure-0.584
Token the
Token ID262
Feature activation+2.55e-01

Pos logit contributions
Hour+1.954
GAN+1.950
table+1.946
Band+1.914
chin+1.901
Neg logit contributions
-2.297
ITNESS-2.219
 Edmund-2.013
 defences-1.927
 insure-1.905
Token Proceedings
Token ID30641
Feature activation+9.02e-01

Pos logit contributions
Hour+1.102
GAN+1.100
table+1.097
Band+1.079
chin+1.070
Neg logit contributions
-1.454
ITNESS-1.406
 Edmund-1.286
 defences-1.231
 insure-1.218
Token of
Token ID286
Feature activation+1.45e+00

Pos logit contributions
Hour+3.745
GAN+3.726
table+3.702
Band+3.604
chin+3.595
Neg logit contributions
-5.291
ITNESS-5.090
 Edmund-4.620
 defences-4.517
��-4.478
Token the
Token ID262
Feature activation+1.30e+00

Pos logit contributions
GAN+5.840
Hour+5.771
Panel+5.651
table+5.647
Band+5.619
Neg logit contributions
-8.197
ITNESS-7.805
 Edmund-7.052
 defences-6.891
 insure-6.801
Token National
Token ID2351
Feature activation+1.04e+00

Pos logit contributions
Hour+4.121
GAN+4.075
Panel+3.970
table+3.956
chin+3.930
Neg logit contributions
-8.689
ITNESS-8.371
 Edmund-7.417
��-7.402
 defences-7.297
Token Academy
Token ID8581
Feature activation+6.13e-01

Pos logit contributions
Hour+3.568
GAN+3.513
table+3.500
Band+3.487
Panel+3.367
Neg logit contributions
-6.878
ITNESS-6.640
 Edmund-6.082
 defences-5.866
��-5.833
Token of
Token ID286
Feature activation+2.42e-01

Pos logit contributions
Hour+2.672
GAN+2.668
table+2.665
Band+2.614
chin+2.599
Neg logit contributions
-3.471
ITNESS-3.350
 Edmund-3.056
 defences-2.934
 insure-2.905
Token Sciences
Token ID13473
Feature activation+9.84e-01

Pos logit contributions
Hour+0.947
table+0.943
GAN+0.937
Band+0.927
chin+0.913
Neg logit contributions
-1.476
ITNESS-1.430
 Edmund-1.325
 defences-1.269
 insure-1.255
Token last
Token ID938
Feature activation-1.40e-01

Pos logit contributions
Hour+1.606
table+1.600
GAN+1.594
chin+1.564
Band+1.563
Neg logit contributions
-2.130
ITNESS-2.053
 Edmund-1.866
 defences-1.792
 insure-1.780
Token month
Token ID1227
Feature activation-1.94e-01

Pos logit contributions
+0.675
ITNESS+0.650
 Edmund+0.593
 defences+0.560
 insure+0.551
Neg logit contributions
GAN-0.722
Hour-0.721
table-0.713
Band-0.709
chin-0.698
Token in
Token ID287
Feature activation+4.49e-01

Pos logit contributions
+1.099
ITNESS+1.061
 Edmund+0.979
 defences+0.933
 insure+0.919
Neg logit contributions
Hour-0.843
GAN-0.840
table-0.837
Band-0.828
chin-0.816
Token the
Token ID262
Feature activation+1.32e-01

Pos logit contributions
Hour+2.010
GAN+2.009
table+2.000
Band+1.964
chin+1.957
Neg logit contributions
-2.481
ITNESS-2.384
 Edmund-2.160
 defences-2.074
 insure-2.056
Token Proceedings
Token ID30641
Feature activation+9.85e-01

Pos logit contributions
Hour+0.566
GAN+0.565
table+0.564
Band+0.555
chin+0.549
Neg logit contributions
-0.760
ITNESS-0.735
 Edmund-0.675
 defences-0.645
 insure-0.638
Token of
Token ID286
Feature activation+1.41e+00

Pos logit contributions
Hour+4.018
GAN+3.994
table+3.960
chin+3.848
Band+3.836
Neg logit contributions
-5.875
ITNESS-5.613
 Edmund-5.078
��-5.012
 defences-4.993
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
GAN+5.750
Hour+5.635
table+5.530
Panel+5.523
Band+5.467
Neg logit contributions
-8.123
ITNESS-7.601
 Edmund-6.873
 insure-6.694
 defences-6.660
Token National
Token ID2351
Feature activation+9.76e-01

Pos logit contributions
GAN+3.917
Hour+3.915
chin+3.785
table+3.773
Panel+3.745
Neg logit contributions
-7.847
ITNESS-7.512
 Edmund-6.661
��-6.564
 insure-6.535
Token Academy
Token ID8581
Feature activation+5.95e-01

Pos logit contributions
Hour+3.396
GAN+3.352
table+3.331
Band+3.299
chin+3.228
Neg logit contributions
-6.390
ITNESS-6.135
 Edmund-5.624
 defences-5.410
 insure-5.380
Token of
Token ID286
Feature activation+1.92e-01

Pos logit contributions
Hour+2.587
GAN+2.584
table+2.581
Band+2.529
chin+2.516
Neg logit contributions
-3.382
ITNESS-3.258
 Edmund-2.971
 defences-2.852
 insure-2.824
Token Sciences
Token ID13473
Feature activation+9.43e-01

Pos logit contributions
Hour+0.769
table+0.767
GAN+0.757
Band+0.753
chin+0.741
Neg logit contributions
-1.153
ITNESS-1.118
 Edmund-1.038
 defences-0.992
 insure-0.979
Token1
Token ID16
Feature activation-6.81e-02

Pos logit contributions
Hour+2.236
table+2.231
GAN+2.230
Band+2.192
chin+2.173
Neg logit contributions
-2.705
ITNESS-2.607
 Edmund-2.376
 defences-2.272
 insure-2.249
Token).
Token ID737
Feature activation+2.89e-01

Pos logit contributions
+0.340
ITNESS+0.327
 Edmund+0.298
 defences+0.283
 insure+0.279
Neg logit contributions
Hour-0.340
GAN-0.339
table-0.336
Band-0.334
chin-0.330
Token He
Token ID679
Feature activation+2.18e-01

Pos logit contributions
Hour+1.258
GAN+1.255
table+1.249
Band+1.231
chin+1.220
Neg logit contributions
-1.634
ITNESS-1.583
 Edmund-1.461
 defences-1.388
 insure-1.371
Tokenikes
Token ID7938
Feature activation+3.26e-01

Pos logit contributions
Hour+1.290
GAN+1.288
table+1.284
Band+1.270
chin+1.260
Neg logit contributions
-0.896
ITNESS-0.855
 Edmund-0.755
 defences-0.707
 insure-0.696
Token fl
Token ID781
Feature activation-2.87e-02

Pos logit contributions
GAN+1.594
Hour+1.593
table+1.585
Band+1.563
chin+1.546
Neg logit contributions
-1.669
ITNESS-1.603
 Edmund-1.468
 defences-1.401
 insure-1.379
Tokenes
Token ID274
Feature activation+1.38e+00

Pos logit contributions
+0.171
ITNESS+0.166
 Edmund+0.153
 defences+0.147
 insure+0.145
Neg logit contributions
GAN-0.116
Hour-0.115
table-0.115
Band-0.113
chin-0.112
Tokenhes
Token ID956
Feature activation-1.38e-01

Pos logit contributions
chin+0.949
table+0.926
Hour+0.834
GAN+0.764
Band+0.704
Neg logit contributions
-12.878
ITNESS-12.531
 Edmund-11.642
��-11.582
 defences-11.488
Token out
Token ID503
Feature activation-3.36e-01

Pos logit contributions
+0.613
ITNESS+0.586
 Edmund+0.525
 defences+0.494
 insure+0.487
Neg logit contributions
Hour-0.765
GAN-0.764
table-0.761
Band-0.752
chin-0.746
Token this
Token ID428
Feature activation+2.15e-02

Pos logit contributions
+1.742
ITNESS+1.684
 Edmund+1.572
 defences+1.489
 insure+1.456
Neg logit contributions
GAN-1.602
Hour-1.601
table-1.582
Band-1.572
chin-1.560
Token commitment
Token ID7901
Feature activation+3.08e-01

Pos logit contributions
Hour+0.116
GAN+0.115
table+0.115
Band+0.114
chin+0.113
Neg logit contributions
-0.099
ITNESS-0.095
 Edmund-0.087
 defences-0.082
 insure-0.081
Token to
Token ID284
Feature activation+9.58e-02

Pos logit contributions
Hour+1.417
table+1.414
GAN+1.414
Band+1.389
chin+1.377
Neg logit contributions
-1.667
ITNESS-1.607
 Edmund-1.458
 defences-1.389
 insure-1.375
Token:
Token ID25
Feature activation+4.45e-01

Pos logit contributions
table+1.166
Hour+1.151
GAN+1.144
Band+1.140
chin+1.103
Neg logit contributions
-1.465
ITNESS-1.416
 Edmund-1.297
 defences-1.252
 insure-1.216
Tokenset
Token ID2617
Feature activation+5.28e-01

Pos logit contributions
table+1.661
Hour+1.625
GAN+1.624
Band+1.593
chin+1.587
Neg logit contributions
-2.810
ITNESS-2.731
 Edmund-2.524
 defences-2.428
 insure-2.390
Token dir
Token ID26672
Feature activation+5.14e-01

Pos logit contributions
table+2.476
Hour+2.449
GAN+2.448
chin+2.406
Band+2.405
Neg logit contributions
-2.819
ITNESS-2.709
 Edmund-2.459
 defences-2.350
 insure-2.307
Token=
Token ID28
Feature activation+5.10e-01

Pos logit contributions
table+2.061
Hour+2.055
GAN+2.039
Band+2.014
chin+1.992
Neg logit contributions
-3.084
ITNESS-2.984
 Edmund-2.746
 defences-2.646
 insure-2.607
Tokenc
Token ID66
Feature activation+3.12e-01

Pos logit contributions
table+2.215
Hour+2.197
GAN+2.190
Band+2.154
chin+2.135
Neg logit contributions
-2.898
ITNESS-2.803
 Edmund-2.564
 defences-2.465
 insure-2.426
Token\
Token ID59
Feature activation+1.37e+00

Pos logit contributions
table+1.580
Hour+1.579
GAN+1.573
Band+1.551
chin+1.540
Neg logit contributions
-1.537
ITNESS-1.476
 Edmund-1.331
 defences-1.271
 insure-1.246
Token:\
Token ID7479
Feature activation+6.10e-01

Pos logit contributions
table+6.174
Hour+5.967
GAN+5.866
Band+5.827
chin+5.781
Neg logit contributions
-7.259
ITNESS-7.140
 Edmund-6.446
 defences-6.256
 insure-6.038
Tokentmp
Token ID22065
Feature activation+3.55e-01

Pos logit contributions
table+2.834
Hour+2.803
GAN+2.776
Band+2.741
chin+2.721
Neg logit contributions
-3.277
ITNESS-3.178
 Edmund-2.879
 defences-2.767
 insure-2.702
Token:
Token ID25
Feature activation+1.71e-01

Pos logit contributions
table+1.804
Hour+1.797
GAN+1.786
Band+1.767
chin+1.749
Neg logit contributions
-1.751
ITNESS-1.681
 Edmund-1.514
 defences-1.444
 insure-1.417
Token */
Token ID9466
Feature activation-3.09e-01

Pos logit contributions
Hour+0.659
table+0.657
GAN+0.656
Band+0.644
chin+0.636
Neg logit contributions
-1.054
ITNESS-1.020
 Edmund-0.941
 defences-0.905
 insure-0.894
Token This
Token ID770
Feature activation-2.08e-01

Pos logit contributions
+1.652
ITNESS+1.589
 Edmund+1.471
 defences+1.391
 insure+1.384
Neg logit contributions
Hour-1.445
GAN-1.439
table-1.411
Band-1.405
chin-1.393
Token images
Token ID4263
Feature activation+2.83e-02

Pos logit contributions
Hour+0.524
table+0.521
GAN+0.520
Band+0.513
chin+0.510
Neg logit contributions
-0.510
ITNESS-0.492
 Edmund-0.446
 defences-0.424
 insure-0.417
Token directory
Token ID8619
Feature activation+4.58e-01

Pos logit contributions
Hour+0.171
GAN+0.170
table+0.170
Band+0.168
chin+0.167
Neg logit contributions
-0.113
ITNESS-0.108
 Edmund-0.095
 defences-0.088
 insure-0.087
Token")
Token ID4943
Feature activation+2.75e-01

Pos logit contributions
table+2.634
Hour+2.633
GAN+2.625
Band+2.598
chin+2.567
Neg logit contributions
-1.966
ITNESS-1.867
 Edmund-1.627
 defences-1.550
 insure-1.536
Token args
Token ID26498
Feature activation+4.78e-01

Pos logit contributions
GAN+1.710
table+1.696
Hour+1.693
Band+1.680
chin+1.665
Neg logit contributions
-1.065
ITNESS-0.999
 Edmund-0.870
 defences-0.792
 insure-0.783
Token =
Token ID796
Feature activation+2.65e-01

Pos logit contributions
Hour+1.958
table+1.956
GAN+1.955
Band+1.914
chin+1.891
Neg logit contributions
-2.837
ITNESS-2.739
 Edmund-2.515
 defences-2.426
 insure-2.397
Token v
Token ID410
Feature activation+1.37e+00

Pos logit contributions
table+1.433
GAN+1.432
Hour+1.428
Band+1.409
chin+1.403
Neg logit contributions
-1.230
ITNESS-1.175
 Edmund-1.045
 defences-0.984
 insure-0.973
Tokenars
Token ID945
Feature activation+3.30e-01

Pos logit contributions
table+4.948
GAN+4.753
Hour+4.700
chin+4.679
Band+4.612
Neg logit contributions
-8.845
ITNESS-8.523
��-7.890
 Edmund-7.716
-7.455
Token(
Token ID7
Feature activation+3.28e-01

Pos logit contributions
table+1.563
Hour+1.562
GAN+1.556
Band+1.531
chin+1.520
Neg logit contributions
-1.749
ITNESS-1.680
 Edmund-1.528
 defences-1.461
 insure-1.447
Tokenap
Token ID499
Feature activation+3.03e-01

Pos logit contributions
GAN+1.793
Hour+1.793
table+1.783
Band+1.765
chin+1.747
Neg logit contributions
-1.480
ITNESS-1.417
 Edmund-1.269
 defences-1.198
 insure-1.182
Token.
Token ID13
Feature activation-1.83e-02

Pos logit contributions
Hour+1.378
GAN+1.375
table+1.371
Band+1.351
chin+1.339
Neg logit contributions
-1.661
ITNESS-1.603
 Edmund-1.463
 defences-1.396
 insure-1.380
Tokenparse
Token ID29572
Feature activation+1.51e-01

Pos logit contributions
+0.090
ITNESS+0.086
 Edmund+0.078
 defences+0.074
 insure+0.073
Neg logit contributions
GAN-0.093
Hour-0.093
table-0.092
Band-0.092
chin-0.091
Token
Token ID198
Feature activation+4.79e-01

Pos logit contributions
Hour+1.935
table+1.931
GAN+1.926
Band+1.888
chin+1.863
Neg logit contributions
-3.538
ITNESS-3.425
 Edmund-3.158
 defences-3.058
 insure-3.030
TokenSee
Token ID6214
Feature activation+8.72e-01

Pos logit contributions
Hour+1.794
GAN+1.786
table+1.786
Band+1.753
chin+1.732
Neg logit contributions
-3.008
ITNESS-2.915
 Edmund-2.693
 defences-2.592
 insure-2.566
Token also
Token ID635
Feature activation+5.48e-01

Pos logit contributions
table+2.538
GAN+2.496
Hour+2.488
chin+2.436
Band+2.415
Neg logit contributions
-6.186
ITNESS-6.014
 Edmund-5.540
 defences-5.415
 insure-5.349
Token
Token ID198
Feature activation+8.74e-01

Pos logit contributions
table+1.618
GAN+1.596
Hour+1.573
chin+1.553
Band+1.540
Neg logit contributions
-3.862
ITNESS-3.746
 Edmund-3.453
 defences-3.370
 insure-3.342
Token
Token ID198
Feature activation+5.32e-01

Pos logit contributions
table+2.892
Hour+2.861
GAN+2.843
Band+2.800
chin+2.728
Neg logit contributions
-5.945
ITNESS-5.722
 Edmund-5.220
��-5.195
 defences-5.165
TokenReferences
Token ID19927
Feature activation+1.37e+00

Pos logit contributions
Hour+2.090
table+2.083
GAN+2.081
Band+2.047
chin+2.022
Neg logit contributions
-3.241
ITNESS-3.140
 Edmund-2.894
 defences-2.782
 insure-2.755
Token<|endoftext|>
Token ID50256
Feature activation+6.23e-03

Pos logit contributions
table+3.192
chin+2.993
GAN+2.959
Hour+2.925
Band+2.907
Neg logit contributions
-10.203
ITNESS-9.892
 Edmund-8.933
 defences-8.907
 insure-8.828
TokenPolice
Token ID9039
Feature activation-3.01e-01

Pos logit contributions
GAN+0.025
Hour+0.025
table+0.024
Band+0.024
chin+0.024
Neg logit contributions
-0.038
ITNESS-0.036
 Edmund-0.034
 defences-0.032
 insure-0.032
Token have
Token ID423
Feature activation-2.23e-01

Pos logit contributions
+1.534
ITNESS+1.491
 Edmund+1.377
 defences+1.319
 insure+1.301
Neg logit contributions
GAN-1.443
table-1.442
Hour-1.437
Band-1.406
chin-1.395
Token also
Token ID635
Feature activation-1.40e-01

Pos logit contributions
+1.110
ITNESS+1.083
 Edmund+1.006
 defences+0.964
 insure+0.938
Neg logit contributions
GAN-1.095
Hour-1.091
table-1.087
risome-1.061
Band-1.056
Token seized
Token ID12000
Feature activation+1.07e-01

Pos logit contributions
+0.706
ITNESS+0.689
 Edmund+0.639
 defences+0.613
 insure+0.602
Neg logit contributions
Hour-0.679
GAN-0.679
table-0.672
risome-0.656
Band-0.656
Tokensearch
Token ID12947
Feature activation+2.68e-01

Pos logit contributions
Hour+0.050
table+0.049
GAN+0.049
Band+0.048
chin+0.048
Neg logit contributions
-0.053
ITNESS-0.051
 Edmund-0.047
 defences-0.044
 insure-0.044
TokenSpace
Token ID14106
Feature activation+7.38e-02

Pos logit contributions
GAN+0.740
table+0.738
Hour+0.734
risome+0.718
chin+0.713
Neg logit contributions
-1.942
ITNESS-1.883
 Edmund-1.759
 defences-1.684
 insure-1.671
Token][
Token ID7131
Feature activation+3.96e-01

Pos logit contributions
Hour+0.242
GAN+0.242
table+0.241
Band+0.236
chin+0.233
Neg logit contributions
-0.496
ITNESS-0.482
 Edmund-0.448
 defences-0.432
 insure-0.428
Token[
Token ID58
Feature activation+1.54e-01

Pos logit contributions
table+1.488
Hour+1.485
GAN+1.475
Band+1.433
chin+1.432
Neg logit contributions
-2.482
ITNESS-2.415
 Edmund-2.227
 defences-2.157
 insure-2.127
Tokeni
Token ID72
Feature activation+9.11e-01

Pos logit contributions
Hour+0.536
GAN+0.534
table+0.534
Band+0.522
chin+0.516
Neg logit contributions
-1.006
ITNESS-0.977
 Edmund-0.906
 defences-0.873
 insure-0.865
Token]]
Token ID11907
Feature activation+1.32e+00

Pos logit contributions
Hour+1.698
table+1.686
GAN+1.650
Band+1.579
chin+1.516
Neg logit contributions
-7.290
ITNESS-7.171
 Edmund-6.751
 defences-6.591
 insure-6.466
Token],
Token ID4357
Feature activation+3.68e-01

Pos logit contributions
table+4.254
Hour+4.211
GAN+4.181
Band+4.044
chin+3.928
Neg logit contributions
-8.685
ITNESS-8.441
 Edmund-7.787
 defences-7.690
��-7.402
Token Keys
Token ID26363
Feature activation+6.19e-01

Pos logit contributions
Hour+1.584
GAN+1.578
table+1.578
Band+1.548
chin+1.534
Neg logit contributions
-2.102
ITNESS-2.034
 Edmund-1.859
 defences-1.785
 insure-1.762
Token[
Token ID58
Feature activation+1.75e-01

Pos logit contributions
Hour+2.179
table+2.150
GAN+2.122
Band+2.090
Panel+2.046
Neg logit contributions
-4.002
ITNESS-3.902
 Edmund-3.657
 defences-3.547
 insure-3.476
Tokensearch
Token ID12947
Feature activation+2.38e-01

Pos logit contributions
table+0.690
Hour+0.689
GAN+0.685
Band+0.668
chin+0.663
Neg logit contributions
-1.065
ITNESS-1.033
 Edmund-0.952
 defences-0.921
 insure-0.906
TokenSpace
Token ID14106
Feature activation-8.63e-02

Pos logit contributions
table+0.645
GAN+0.645
Hour+0.640
risome+0.627
chin+0.623
Neg logit contributions
-1.733
ITNESS-1.683
 Edmund-1.570
 defences-1.503
 insure-1.491
Token and
Token ID290
Feature activation-6.40e-01

Pos logit contributions
+0.460
ITNESS+0.443
 Edmund+0.410
 defences+0.391
 insure+0.385
Neg logit contributions
GAN-0.428
Hour-0.427
table-0.422
Band-0.418
chin-0.414
Token should
Token ID815
Feature activation-5.85e-01

Pos logit contributions
+3.184
ITNESS+3.151
 Edmund+2.993
 defences+2.835
 insure+2.749
Neg logit contributions
GAN-3.019
Hour-3.004
table-3.000
Band-2.976
chin-2.924
Token now
Token ID783
Feature activation-7.54e-01

Pos logit contributions
+2.564
ITNESS+2.485
 Edmund+2.276
 insure+2.210
 defences+2.151
Neg logit contributions
Hour-3.184
GAN-3.158
table-3.122
Band-3.115
chin-3.096
Token decide
Token ID5409
Feature activation-1.85e-02

Pos logit contributions
+3.075
ITNESS+3.026
 Edmund+2.764
 insure+2.716
 defences+2.621
Neg logit contributions
GAN-4.214
Hour-4.214
Band-4.164
table-4.145
chin-4.117
Token once
Token ID1752
Feature activation+4.47e-01

Pos logit contributions
+0.093
ITNESS+0.090
 Edmund+0.082
 defences+0.078
 insure+0.077
Neg logit contributions
Hour-0.092
GAN-0.092
table-0.091
Band-0.090
chin-0.090
Token and
Token ID290
Feature activation+1.32e+00

Pos logit contributions
table+1.910
Hour+1.906
GAN+1.890
Band+1.861
chin+1.832
Neg logit contributions
-2.588
ITNESS-2.474
 Edmund-2.226
 insure-2.132
 defences-2.126
Token for
Token ID329
Feature activation+7.89e-02

Pos logit contributions
table+4.844
GAN+4.657
Hour+4.562
Band+4.543
chin+4.450
Neg logit contributions
-8.246
ITNESS-7.880
 Edmund-7.181
��-7.065
-6.947
Token all
Token ID477
Feature activation-8.68e-02

Pos logit contributions
GAN+0.434
Hour+0.434
table+0.426
Band+0.426
chin+0.424
Neg logit contributions
-0.350
ITNESS-0.341
 Edmund-0.307
 defences-0.287
 insure-0.284
Token whether
Token ID1771
Feature activation-7.54e-01

Pos logit contributions
+0.409
ITNESS+0.394
 Edmund+0.358
 defences+0.338
 insure+0.333
Neg logit contributions
Hour-0.458
GAN-0.456
table-0.454
Band-0.449
chin-0.446
Token it
Token ID340
Feature activation+4.73e-02

Pos logit contributions
+3.728
ITNESS+3.624
 Edmund+3.369
 defences+3.221
 insure+3.077
Neg logit contributions
Hour-3.704
GAN-3.680
chin-3.661
table-3.646
Band-3.615
Token wanted
Token ID2227
Feature activation+1.85e-03

Pos logit contributions
Hour+0.229
GAN+0.228
table+0.226
Band+0.224
chin+0.222
Neg logit contributions
-0.244
ITNESS-0.236
 Edmund-0.218
 defences-0.207
 insure-0.204
Tokenren
Token ID918
Feature activation-4.87e-01

Pos logit contributions
+1.191
ITNESS+1.145
 Edmund+1.048
 defences+0.992
 insure+0.975
Neg logit contributions
GAN-1.147
Hour-1.141
table-1.120
Band-1.114
chin-1.098
Tokenowned
Token ID11990
Feature activation-9.02e-02

Pos logit contributions
+2.454
ITNESS+2.380
 Edmund+2.218
 defences+2.064
 insure+2.022
Neg logit contributions
GAN-2.354
Hour-2.349
table-2.313
Band-2.308
chin-2.265
Token perf
Token ID23035
Feature activation-2.93e-01

Pos logit contributions
+0.436
ITNESS+0.419
 Edmund+0.387
 defences+0.365
 insure+0.357
Neg logit contributions
GAN-0.465
Hour-0.462
table-0.458
Band-0.454
chin-0.449
Tokenumer
Token ID6975
Feature activation+1.68e-01

Pos logit contributions
+1.622
ITNESS+1.604
 Edmund+1.502
 defences+1.406
 insure+1.404
Neg logit contributions
GAN-1.261
Hour-1.230
Band-1.212
table-1.208
risome-1.188
Token Fr
Token ID1305
Feature activation+1.39e-01

Pos logit contributions
Hour+0.769
table+0.766
GAN+0.766
Band+0.754
chin+0.747
Neg logit contributions
-0.920
ITNESS-0.888
 Edmund-0.804
 defences-0.772
 insure-0.764
Tokené
Token ID2634
Feature activation+1.31e+00

Pos logit contributions
Hour+0.643
GAN+0.641
table+0.638
Band+0.630
chin+0.624
Neg logit contributions
-0.740
ITNESS-0.718
 Edmund-0.661
 defences-0.625
 insure-0.616
Tokend
Token ID67
Feature activation-3.38e-01

Pos logit contributions
table+4.730
chin+4.603
Band+4.597
GAN+4.535
Hour+4.528
Neg logit contributions
-8.339
ITNESS-8.139
 Edmund-7.249
 defences-7.218
��-7.203
Tokené
Token ID2634
Feature activation+3.80e-01

Pos logit contributions
+1.452
ITNESS+1.413
 Edmund+1.307
 defences+1.174
 insure+1.171
Neg logit contributions
GAN-1.878
Hour-1.878
table-1.858
Band-1.844
chin-1.834
Tokenric
Token ID1173
Feature activation-5.81e-02

Pos logit contributions
Hour+1.952
GAN+1.947
table+1.944
Band+1.918
chin+1.903
Neg logit contributions
-1.856
ITNESS-1.784
 Edmund-1.607
 defences-1.525
 insure-1.505
Token Mal
Token ID4434
Feature activation-8.26e-02

Pos logit contributions
+0.333
ITNESS+0.320
 Edmund+0.303
 defences+0.285
 insure+0.282
Neg logit contributions
GAN-0.245
Hour-0.244
table-0.241
Band-0.239
risome-0.236
Tokenle
Token ID293
Feature activation+2.79e-01

Pos logit contributions
+0.456
ITNESS+0.444
 Edmund+0.412
 defences+0.391
 insure+0.389
Neg logit contributions
GAN-0.359
Hour-0.358
Band-0.352
table-0.351
risome-0.341
Token published
Token ID3199
Feature activation+1.25e-01

Pos logit contributions
Hour+1.704
GAN+1.700
table+1.696
Band+1.676
chin+1.665
Neg logit contributions
-1.198
ITNESS-1.141
 Edmund-1.007
 defences-0.944
 insure-0.928
Token in
Token ID287
Feature activation+4.25e-01

Pos logit contributions
Hour+0.547
GAN+0.547
table+0.545
Band+0.537
chin+0.531
Neg logit contributions
-0.700
ITNESS-0.675
 Edmund-0.619
 defences-0.591
 insure-0.584
Token the
Token ID262
Feature activation+2.55e-01

Pos logit contributions
Hour+1.954
GAN+1.950
table+1.946
Band+1.914
chin+1.901
Neg logit contributions
-2.297
ITNESS-2.219
 Edmund-2.013
 defences-1.927
 insure-1.905
Token Proceedings
Token ID30641
Feature activation+9.02e-01

Pos logit contributions
Hour+1.102
GAN+1.100
table+1.097
Band+1.079
chin+1.070
Neg logit contributions
-1.454
ITNESS-1.406
 Edmund-1.286
 defences-1.231
 insure-1.218
Token of
Token ID286
Feature activation+1.45e+00

Pos logit contributions
Hour+3.745
GAN+3.726
table+3.702
Band+3.604
chin+3.595
Neg logit contributions
-5.291
ITNESS-5.090
 Edmund-4.620
 defences-4.517
��-4.478
Token the
Token ID262
Feature activation+1.30e+00

Pos logit contributions
GAN+5.840
Hour+5.771
Panel+5.651
table+5.647
Band+5.619
Neg logit contributions
-8.197
ITNESS-7.805
 Edmund-7.052
 defences-6.891
 insure-6.801
Token National
Token ID2351
Feature activation+1.04e+00

Pos logit contributions
Hour+4.121
GAN+4.075
Panel+3.970
table+3.956
chin+3.930
Neg logit contributions
-8.689
ITNESS-8.371
 Edmund-7.417
��-7.402
 defences-7.297
Token Academy
Token ID8581
Feature activation+6.13e-01

Pos logit contributions
Hour+3.568
GAN+3.513
table+3.500
Band+3.487
Panel+3.367
Neg logit contributions
-6.878
ITNESS-6.640
 Edmund-6.082
 defences-5.866
��-5.833
Token of
Token ID286
Feature activation+2.42e-01

Pos logit contributions
Hour+2.672
GAN+2.668
table+2.665
Band+2.614
chin+2.599
Neg logit contributions
-3.471
ITNESS-3.350
 Edmund-3.056
 defences-2.934
 insure-2.905
Token Sciences
Token ID13473
Feature activation+9.84e-01

Pos logit contributions
Hour+0.947
table+0.943
GAN+0.937
Band+0.927
chin+0.913
Neg logit contributions
-1.476
ITNESS-1.430
 Edmund-1.325
 defences-1.269
 insure-1.255
Token.
Token ID13
Feature activation+1.72e-01

Pos logit contributions
Hour+4.011
GAN+4.004
table+3.975
chin+3.933
Band+3.838
Neg logit contributions
-5.683
ITNESS-5.511
 Edmund-5.029
 defences-4.910
 insure-4.823
Tokenend
Token ID437
Feature activation+4.90e-01

Pos logit contributions
Hour+0.884
table+0.878
GAN+0.877
Band+0.850
chin+0.838
Neg logit contributions
-2.765
ITNESS-2.694
 Edmund-2.525
 defences-2.447
 insure-2.428
TokenT
Token ID51
Feature activation+4.81e-01

Pos logit contributions
Hour+1.740
GAN+1.718
table+1.717
Band+1.705
chin+1.653
Neg logit contributions
-3.188
ITNESS-3.098
 Edmund-2.866
 defences-2.759
 insure-2.738
Tokenear
Token ID451
Feature activation+7.30e-01

Pos logit contributions
Hour+2.044
GAN+2.029
table+2.027
Band+1.999
chin+1.979
Neg logit contributions
-2.787
ITNESS-2.684
 Edmund-2.465
 defences-2.363
 insure-2.340
TokenV
Token ID53
Feature activation+8.00e-01

Pos logit contributions
Hour+2.677
GAN+2.654
table+2.644
Band+2.640
chin+2.546
Neg logit contributions
-4.611
ITNESS-4.485
 Edmund-4.124
 defences-3.977
 insure-3.968
Tokenoid
Token ID1868
Feature activation+9.30e-01

Pos logit contributions
table+3.621
GAN+3.610
Hour+3.605
Band+3.557
chin+3.493
Neg logit contributions
-4.404
ITNESS-4.201
 Edmund-3.852
 defences-3.701
 insure-3.633
TokenRay
Token ID19591
Feature activation+1.30e+00

Pos logit contributions
Hour+2.805
GAN+2.744
table+2.730
Band+2.696
Panel+2.574
Neg logit contributions
-6.512
ITNESS-6.299
 Edmund-5.919
 defences-5.722
 insure-5.669
TokenJ
Token ID41
Feature activation+5.27e-01

Pos logit contributions
Hour+4.849
Band+4.756
GAN+4.685
table+4.636
Role+4.489
Neg logit contributions
-8.066
ITNESS-7.725
 Edmund-7.143
 defences-6.972
 insure-6.914
TokenKS
Token ID27015
Feature activation+5.31e-01

Pos logit contributions
GAN+2.901
Hour+2.886
Band+2.860
table+2.843
chin+2.824
Neg logit contributions
-2.393
ITNESS-2.241
 Edmund-2.060
 insure-1.915
 defences-1.906
TokenLe
Token ID3123
Feature activation+7.01e-01

Pos logit contributions
Hour+2.424
GAN+2.396
Band+2.383
table+2.383
chin+2.321
Neg logit contributions
-2.917
ITNESS-2.812
 Edmund-2.559
 defences-2.458
 insure-2.433
Tokenen
Token ID268
Feature activation+4.50e-01

Pos logit contributions
GAN+2.361
Hour+2.360
table+2.345
chin+2.308
Band+2.276
Neg logit contributions
-4.663
ITNESS-4.524
 Edmund-4.195
 defences-4.059
 insure-4.029
Tokenock
Token ID735
Feature activation+2.94e-01

Pos logit contributions
Hour+2.030
GAN+2.023
table+2.019
Band+1.990
chin+1.974
Neg logit contributions
-2.477
ITNESS-2.387
 Edmund-2.186
 defences-2.085
 insure-2.061
Token character
Token ID2095
Feature activation+3.06e-01

Pos logit contributions
Hour+1.428
GAN+1.424
table+1.417
Band+1.403
chin+1.387
Neg logit contributions
-1.441
ITNESS-1.393
 Edmund-1.273
 defences-1.206
 insure-1.190
Token.
Token ID13
Feature activation+5.40e-01

Pos logit contributions
Hour+1.400
GAN+1.397
table+1.392
Band+1.372
chin+1.360
Neg logit contributions
-1.665
ITNESS-1.607
 Edmund-1.469
 defences-1.401
 insure-1.385
Token
Token ID198
Feature activation+5.97e-01

Pos logit contributions
Hour+2.227
GAN+2.214
table+2.214
Band+2.186
chin+2.160
Neg logit contributions
-3.153
ITNESS-3.080
 Edmund-2.878
 defences-2.696
 insure-2.666
Token
Token ID198
Feature activation+5.66e-01

Pos logit contributions
Hour+2.121
table+2.115
GAN+2.115
Band+2.068
chin+2.040
Neg logit contributions
-3.872
ITNESS-3.745
 Edmund-3.459
 defences-3.348
 insure-3.317
TokenCV
Token ID33538
Feature activation+1.82e-01

Pos logit contributions
GAN+2.010
Hour+2.009
table+1.996
Band+1.960
chin+1.933
Neg logit contributions
-3.659
ITNESS-3.542
 Edmund-3.287
 defences-3.172
 insure-3.143
Token:
Token ID25
Feature activation+1.29e+00

Pos logit contributions
Hour+0.866
GAN+0.864
table+0.862
Band+0.850
chin+0.846
Neg logit contributions
-0.949
ITNESS-0.916
 Edmund-0.833
 defences-0.789
 insure-0.778
Token So
Token ID1406
Feature activation+4.69e-01

Pos logit contributions
Hour+5.179
GAN+5.155
table+5.148
Band+5.034
chin+4.980
Neg logit contributions
-7.747
ITNESS-7.408
 Edmund-6.570
 defences-6.569
 insure-6.511
Token what
Token ID644
Feature activation+7.94e-01

Pos logit contributions
Hour+2.203
GAN+2.197
table+2.186
Band+2.166
chin+2.136
Neg logit contributions
-2.477
ITNESS-2.398
 Edmund-2.211
 defences-2.100
 insure-2.074
Token made
Token ID925
Feature activation+7.35e-01

Pos logit contributions
table+3.714
Hour+3.713
GAN+3.709
Band+3.639
chin+3.622
Neg logit contributions
-4.257
ITNESS-4.084
 Edmund-3.653
 defences-3.487
 insure-3.453
Token you
Token ID345
Feature activation+7.14e-01

Pos logit contributions
Hour+3.190
table+3.180
GAN+3.178
Band+3.117
chin+3.093
Neg logit contributions
-4.180
ITNESS-4.024
 Edmund-3.658
 defences-3.509
 insure-3.477
Token guys
Token ID3730
Feature activation+4.07e-01

Pos logit contributions
table+3.558
GAN+3.553
Hour+3.551
Band+3.493
chin+3.469
Neg logit contributions
-3.599
ITNESS-3.453
 Edmund-3.083
 defences-2.941
 insure-2.877
Token
Token ID198
Feature activation+6.82e-01

Pos logit contributions
table+2.350
Hour+2.286
Band+2.268
GAN+2.233
chin+2.179
Neg logit contributions
-5.296
ITNESS-5.189
 Edmund-4.776
 defences-4.692
 insure-4.639
Token
Token ID198
Feature activation+3.81e-01

Pos logit contributions
table+2.357
Hour+2.355
GAN+2.343
Band+2.299
chin+2.252
Neg logit contributions
-4.507
ITNESS-4.359
 Edmund-4.004
 defences-3.914
 insure-3.884
TokenReferences
Token ID19927
Feature activation+1.27e+00

Pos logit contributions
Hour+1.545
GAN+1.540
table+1.538
Band+1.512
chin+1.495
Neg logit contributions
-2.267
ITNESS-2.195
 Edmund-2.020
 defences-1.937
 insure-1.917
Token [
Token ID685
Feature activation+1.06e+00

Pos logit contributions
table+2.893
Hour+2.648
Band+2.617
GAN+2.614
chin+2.587
Neg logit contributions
-9.561
ITNESS-9.428
 defences-8.445
 Edmund-8.441
��-8.361
Token edit
Token ID4370
Feature activation+4.36e-01

Pos logit contributions
table+5.487
Hour+5.367
GAN+5.354
chin+5.300
Band+5.247
Neg logit contributions
-5.285
ITNESS-5.058
 Edmund-4.485
��-4.375
 defences-4.305
Token ]
Token ID2361
Feature activation+1.28e+00

Pos logit contributions
Hour+1.785
GAN+1.781
table+1.771
Band+1.746
chin+1.730
Neg logit contributions
-2.573
ITNESS-2.493
 Edmund-2.300
 defences-2.193
 insure-2.170
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
table+3.256
Hour+3.078
Band+3.076
GAN+2.979
chin+2.954
Neg logit contributions
-9.165
ITNESS-8.988
 Edmund-8.242
 defences-8.206
 insure-8.088
Token
Token ID198
Feature activation+7.52e-01

Pos logit contributions
table+3.351
Hour+3.279
GAN+3.264
Band+3.221
chin+3.134
Neg logit contributions
-7.112
ITNESS-6.839
��-6.292
 Edmund-6.212
 defences-6.191
Tokenthe
Token ID1169
Feature activation+4.23e-01

Pos logit contributions
table+2.944
Hour+2.914
Band+2.893
GAN+2.887
chin+2.825
Neg logit contributions
-4.566
ITNESS-4.465
 Edmund-4.114
 defences-3.963
 insure-3.945
Token ten
Token ID3478
Feature activation-3.28e-01

Pos logit contributions
table+2.159
Hour+2.157
GAN+2.149
Band+2.122
chin+2.105
Neg logit contributions
-2.068
ITNESS-1.995
 Edmund-1.788
 defences-1.707
 insure-1.687
Token dollar
Token ID8872
Feature activation+2.70e-01

Pos logit contributions
+1.659
ITNESS+1.597
 Edmund+1.462
 defences+1.384
 insure+1.355
Neg logit contributions
GAN-1.622
Hour-1.609
table-1.591
Band-1.583
chin-1.574
Token-
Token ID12
Feature activation+3.23e-01

Pos logit contributions
+2.102
ITNESS+2.043
 Edmund+1.870
 insure+1.754
 defences+1.753
Neg logit contributions
Hour-1.818
GAN-1.791
Band-1.777
table-1.774
risome-1.740
Tokenv
Token ID85
Feature activation-1.68e-04

Pos logit contributions
Hour+1.313
GAN+1.309
table+1.306
Band+1.284
chin+1.271
Neg logit contributions
-1.921
ITNESS-1.859
 Edmund-1.710
 defences-1.639
 insure-1.622
Tokenick
Token ID624
Feature activation+1.89e-01

Pos logit contributions
+0.001
ITNESS+0.001
 Edmund+0.001
 defences+0.001
 insure+0.001
Neg logit contributions
Hour-0.001
GAN-0.001
table-0.001
Band-0.001
chin-0.001
Token)
Token ID8
Feature activation+3.24e-01

Pos logit contributions
Hour+0.883
GAN+0.880
table+0.878
Band+0.865
chin+0.857
Neg logit contributions
-1.014
ITNESS-0.978
 Edmund-0.891
 defences-0.849
 insure-0.839
Token The
Token ID383
Feature activation+2.19e-01

Pos logit contributions
Hour+1.427
GAN+1.423
table+1.419
Band+1.397
chin+1.385
Neg logit contributions
-1.821
ITNESS-1.759
 Edmund-1.613
 defences-1.540
 insure-1.523
Token results
Token ID2482
Feature activation+1.28e+00

Pos logit contributions
Hour+1.098
GAN+1.095
table+1.091
Band+1.078
chin+1.068
Neg logit contributions
-1.099
ITNESS-1.060
 Edmund-0.962
 defences-0.912
 insure-0.899
Token support
Token ID1104
Feature activation+9.99e-01

Pos logit contributions
Hour+5.551
table+5.533
chin+5.394
GAN+5.387
Band+5.350
Neg logit contributions
-7.164
ITNESS-6.879
 Edmund-6.072
 defences-5.889
��-5.824
Token some
Token ID617
Feature activation+4.20e-01

Pos logit contributions
table+4.707
Hour+4.655
GAN+4.642
Band+4.560
chin+4.466
Neg logit contributions
-5.261
ITNESS-4.959
 Edmund-4.358
 insure-4.229
��-4.220
Token surprising
Token ID6452
Feature activation+2.59e-01

Pos logit contributions
Hour+2.021
table+2.017
GAN+2.016
Band+1.978
chin+1.970
Neg logit contributions
-2.192
ITNESS-2.088
 Edmund-1.883
 defences-1.784
 insure-1.784
Token recent
Token ID2274
Feature activation+3.00e-01

Pos logit contributions
Hour+1.493
GAN+1.490
table+1.485
Band+1.471
chin+1.460
Neg logit contributions
-1.097
ITNESS-1.050
 Edmund-0.934
 defences-0.879
 insure-0.862
Token measurements
Token ID13871
Feature activation+5.29e-01

Pos logit contributions
Hour+1.797
GAN+1.794
table+1.787
Band+1.771
chin+1.758
Neg logit contributions
-1.202
ITNESS-1.149
 Edmund-1.016
 defences-0.952
 insure-0.930
Token [
Token ID685
Feature activation+8.31e-01

Pos logit contributions
table+1.081
GAN+1.056
Hour+1.053
Band+1.029
chin+1.015
Neg logit contributions
-2.811
ITNESS-2.763
 Edmund-2.548
 defences-2.477
 insure-2.466
Token edit
Token ID4370
Feature activation+2.20e-01

Pos logit contributions
table+4.410
Hour+4.351
GAN+4.343
chin+4.266
Band+4.262
Neg logit contributions
-3.956
ITNESS-3.815
 Edmund-3.386
 defences-3.237
��-3.215
Token ]
Token ID2361
Feature activation+7.71e-01

Pos logit contributions
Hour+0.930
GAN+0.930
table+0.916
Band+0.910
chin+0.903
Neg logit contributions
-1.262
ITNESS-1.227
 Edmund-1.134
 defences-1.073
 insure-1.062
Token
Token ID198
Feature activation+6.82e-01

Pos logit contributions
table+2.350
Hour+2.286
Band+2.268
GAN+2.233
chin+2.179
Neg logit contributions
-5.296
ITNESS-5.189
 Edmund-4.776
 defences-4.692
 insure-4.639
Token
Token ID198
Feature activation+3.81e-01

Pos logit contributions
table+2.357
Hour+2.355
GAN+2.343
Band+2.299
chin+2.252
Neg logit contributions
-4.507
ITNESS-4.359
 Edmund-4.004
 defences-3.914
 insure-3.884
TokenReferences
Token ID19927
Feature activation+1.27e+00

Pos logit contributions
Hour+1.545
GAN+1.540
table+1.538
Band+1.512
chin+1.495
Neg logit contributions
-2.267
ITNESS-2.195
 Edmund-2.020
 defences-1.937
 insure-1.917
Token [
Token ID685
Feature activation+1.06e+00

Pos logit contributions
table+2.893
Hour+2.648
Band+2.617
GAN+2.614
chin+2.587
Neg logit contributions
-9.561
ITNESS-9.428
 defences-8.445
 Edmund-8.441
��-8.361
Token edit
Token ID4370
Feature activation+4.36e-01

Pos logit contributions
table+5.487
Hour+5.367
GAN+5.354
chin+5.300
Band+5.247
Neg logit contributions
-5.285
ITNESS-5.058
 Edmund-4.485
��-4.375
 defences-4.305
Token ]
Token ID2361
Feature activation+1.28e+00

Pos logit contributions
Hour+1.785
GAN+1.781
table+1.771
Band+1.746
chin+1.730
Neg logit contributions
-2.573
ITNESS-2.493
 Edmund-2.300
 defences-2.193
 insure-2.170
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
table+3.256
Hour+3.078
Band+3.076
GAN+2.979
chin+2.954
Neg logit contributions
-9.165
ITNESS-8.988
 Edmund-8.242
 defences-8.206
 insure-8.088
Token
Token ID198
Feature activation+7.52e-01

Pos logit contributions
table+3.351
Hour+3.279
GAN+3.264
Band+3.221
chin+3.134
Neg logit contributions
-7.112
ITNESS-6.839
��-6.292
 Edmund-6.212
 defences-6.191
Token :
Token ID1058
Feature activation+1.40e-01

Pos logit contributions
+0.443
ITNESS+0.426
 Edmund+0.391
 defences+0.372
 insure+0.367
Neg logit contributions
Hour-0.375
GAN-0.373
table-0.371
Band-0.366
chin-0.363
Token
Token ID198
Feature activation-1.65e-02

Pos logit contributions
Hour+0.519
GAN+0.518
table+0.517
Band+0.507
chin+0.502
Neg logit contributions
-0.888
ITNESS-0.861
 Edmund-0.796
 defences-0.765
 insure-0.758
Token
Token ID198
Feature activation+8.78e-02

Pos logit contributions
+0.098
ITNESS+0.096
 Edmund+0.090
 defences+0.084
 insure+0.083
Neg logit contributions
GAN-0.065
Hour-0.065
table-0.064
risome-0.064
Band-0.064
TokenWhat
Token ID2061
Feature activation+4.79e-01

Pos logit contributions
Hour+0.348
GAN+0.347
table+0.346
Band+0.338
chin+0.337
Neg logit contributions
-0.530
ITNESS-0.514
 Edmund-0.472
 defences-0.451
 insure-0.446
Token in
Token ID287
Feature activation+6.41e-01

Pos logit contributions
Hour+2.179
GAN+2.174
table+2.173
Band+2.140
chin+2.122
Neg logit contributions
-2.618
ITNESS-2.526
 Edmund-2.299
 defences-2.196
 insure-2.173
Token hell
Token ID5968
Feature activation+1.27e+00

Pos logit contributions
table+3.086
Hour+3.084
GAN+3.083
Band+3.041
chin+3.009
Neg logit contributions
-3.278
ITNESS-3.138
 Edmund-2.830
 defences-2.708
 insure-2.702
Token was
Token ID373
Feature activation+6.98e-01

Pos logit contributions
GAN+4.937
table+4.888
Hour+4.848
chin+4.827
Band+4.813
Neg logit contributions
-7.437
ITNESS-7.298
 Edmund-6.613
 defences-6.392
 insure-6.355
Token the
Token ID262
Feature activation+2.49e-01

Pos logit contributions
GAN+3.011
table+3.009
Hour+3.008
Band+2.961
chin+2.938
Neg logit contributions
-3.948
ITNESS-3.802
 Edmund-3.458
 defences-3.340
 insure-3.312
Token point
Token ID966
Feature activation+2.12e-01

Pos logit contributions
Hour+1.376
GAN+1.371
table+1.368
Band+1.352
chin+1.341
Neg logit contributions
-1.123
ITNESS-1.078
 Edmund-0.965
 defences-0.909
 insure-0.895
Token of
Token ID286
Feature activation+5.83e-01

Pos logit contributions
table+0.910
Hour+0.910
GAN+0.909
Band+0.893
chin+0.884
Neg logit contributions
-1.213
ITNESS-1.171
 Edmund-1.073
 defences-1.028
 insure-1.020
Token kick
Token ID4829
Feature activation+2.47e-01

Pos logit contributions
GAN+2.949
Hour+2.947
table+2.944
Band+2.909
chin+2.880
Neg logit contributions
-2.883
ITNESS-2.753
 Edmund-2.477
 defences-2.356
 insure-2.326
Token
Token ID198
Feature activation+5.37e-01

Pos logit contributions
table+2.383
Hour+2.289
Band+2.284
GAN+2.242
chin+2.209
Neg logit contributions
-5.629
ITNESS-5.521
 Edmund-5.075
 defences-4.974
 insure-4.917
Token
Token ID198
Feature activation+2.34e-01

Pos logit contributions
Hour+1.909
table+1.904
GAN+1.901
Band+1.862
chin+1.835
Neg logit contributions
-3.486
ITNESS-3.375
 Edmund-3.114
 defences-3.017
 insure-2.989
TokenReferences
Token ID19927
Feature activation+1.18e+00

Pos logit contributions
Hour+0.940
GAN+0.939
table+0.931
Band+0.915
chin+0.910
Neg logit contributions
-1.401
ITNESS-1.352
 Edmund-1.245
 defences-1.192
 insure-1.178
Token [
Token ID685
Feature activation+9.36e-01

Pos logit contributions
table+3.002
GAN+2.839
Hour+2.804
chin+2.786
Band+2.759
Neg logit contributions
-8.625
ITNESS-8.428
 Edmund-7.578
 defences-7.494
��-7.486
Token edit
Token ID4370
Feature activation+3.86e-01

Pos logit contributions
table+4.864
Hour+4.778
GAN+4.774
chin+4.713
Band+4.661
Neg logit contributions
-4.587
ITNESS-4.398
 Edmund-3.904
��-3.785
 defences-3.757
Token ]
Token ID2361
Feature activation+1.25e+00

Pos logit contributions
Hour+1.578
GAN+1.574
table+1.566
Band+1.544
chin+1.529
Neg logit contributions
-2.285
ITNESS-2.214
 Edmund-2.041
 defences-1.949
 insure-1.928
Token<|endoftext|>
Token ID50256
Feature activation-2.75e-02

Pos logit contributions
table+3.194
Band+2.997
Hour+2.988
GAN+2.917
chin+2.915
Neg logit contributions
-9.018
ITNESS-8.859
 Edmund-8.083
 defences-8.051
 insure-7.943
TokenZ
Token ID57
Feature activation+1.61e-01

Pos logit contributions
+0.167
ITNESS+0.161
 Edmund+0.148
 defences+0.141
 insure+0.140
Neg logit contributions
GAN-0.108
Hour-0.108
table-0.107
chin-0.105
Band-0.105
Tokenombie
Token ID9081
Feature activation-2.74e-02

Pos logit contributions
Hour+0.693
GAN+0.691
table+0.690
Band+0.679
chin+0.672
Neg logit contributions
-0.919
ITNESS-0.888
 Edmund-0.814
 defences-0.778
 insure-0.770
Token Deep
Token ID10766
Feature activation+3.06e-02

Pos logit contributions
+0.152
ITNESS+0.147
 Edmund+0.135
 defences+0.129
 insure+0.127
Neg logit contributions
Hour-0.121
GAN-0.121
table-0.121
Band-0.119
chin-0.118
Token State
Token ID1812
Feature activation-1.75e-03

Pos logit contributions
GAN+0.129
Hour+0.128
table+0.127
Band+0.125
chin+0.124
Neg logit contributions
-0.177
ITNESS-0.171
 Edmund-0.159
 defences-0.152
 insure-0.149
Token first
Token ID717
Feature activation+2.02e-01

Pos logit contributions
Hour+0.898
GAN+0.895
table+0.893
Band+0.882
chin+0.876
Neg logit contributions
-0.778
ITNESS-0.745
 Edmund-0.669
 defences-0.633
 insure-0.623
Token two
Token ID734
Feature activation-7.39e-02

Pos logit contributions
Hour+0.950
GAN+0.948
table+0.946
Band+0.932
chin+0.924
Neg logit contributions
-1.077
ITNESS-1.038
 Edmund-0.945
 defences-0.901
 insure-0.890
Token in
Token ID287
Feature activation+2.11e-01

Pos logit contributions
+0.353
ITNESS+0.339
 Edmund+0.308
 defences+0.293
 insure+0.286
Neg logit contributions
Hour-0.386
GAN-0.386
table-0.384
Band-0.379
chin-0.378
Token words
Token ID2456
Feature activation+3.52e-01

Pos logit contributions
Hour+1.061
table+1.059
GAN+1.059
Band+1.044
chin+1.035
Neg logit contributions
-1.048
ITNESS-1.008
 Edmund-0.909
 defences-0.862
 insure-0.853
Token unf
Token ID3684
Feature activation-3.03e-01

Pos logit contributions
table+1.546
Hour+1.535
GAN+1.527
Band+1.510
chin+1.500
Neg logit contributions
-1.981
ITNESS-1.933
 Edmund-1.743
 defences-1.665
 insure-1.654
Tokenet
Token ID316
Feature activation+1.24e+00

Pos logit contributions
+1.675
ITNESS+1.662
 Edmund+1.506
 defences+1.420
 insure+1.420
Neg logit contributions
Hour-1.315
GAN-1.310
table-1.297
Band-1.286
chin-1.269
Tokentered
Token ID4400
Feature activation+1.65e-02

Pos logit contributions
table+6.444
Hour+6.246
chin+6.143
GAN+6.129
ction+6.049
Neg logit contributions
-6.150
ITNESS-5.762
 Edmund-5.086
 defences-4.966
��-4.890
Token by
Token ID416
Feature activation-3.79e-01

Pos logit contributions
Hour+0.078
GAN+0.078
table+0.078
Band+0.076
chin+0.076
Neg logit contributions
-0.088
ITNESS-0.085
 Edmund-0.077
 defences-0.073
 insure-0.073
Token reality
Token ID3950
Feature activation+1.25e-01

Pos logit contributions
+1.715
ITNESS+1.646
 Edmund+1.504
 defences+1.423
 insure+1.387
Neg logit contributions
Hour-2.075
GAN-2.063
table-2.039
Band-2.027
chin-2.019
Token.
Token ID13
Feature activation-6.59e-02

Pos logit contributions
table+0.552
Hour+0.551
GAN+0.547
Band+0.539
chin+0.534
Neg logit contributions
-0.703
ITNESS-0.682
 Edmund-0.619
 defences-0.589
 insure-0.586
Token This
Token ID770
Feature activation+1.70e-03

Pos logit contributions
+0.358
ITNESS+0.345
 Edmund+0.322
 defences+0.296
 insure+0.293
Neg logit contributions
Hour-0.299
GAN-0.299
table-0.297
chin-0.293
Band-0.292
Token mission
Token ID4365
Feature activation+6.38e-02

Pos logit contributions
+0.748
ITNESS+0.729
 Edmund+0.669
 defences+0.632
 insure+0.619
Neg logit contributions
Hour-0.725
table-0.721
GAN-0.720
Band-0.706
chin-0.703
Token manager
Token ID4706
Feature activation+2.85e-01

Pos logit contributions
Hour+0.331
GAN+0.330
table+0.330
Band+0.325
chin+0.323
Neg logit contributions
-0.307
ITNESS-0.296
 Edmund-0.267
 defences-0.253
 insure-0.249
Token Dr
Token ID1583
Feature activation+1.05e-01

Pos logit contributions
Hour+1.341
GAN+1.339
table+1.338
Band+1.318
chin+1.307
Neg logit contributions
-1.522
ITNESS-1.464
 Edmund-1.315
 defences-1.263
 insure-1.251
Token Rune
Token ID28228
Feature activation-4.30e-02

Pos logit contributions
Hour+0.503
GAN+0.501
table+0.500
Band+0.493
chin+0.489
Neg logit contributions
-0.551
ITNESS-0.531
 Edmund-0.483
 defences-0.459
 insure-0.454
Token Fl
Token ID1610
Feature activation+2.90e-01

Pos logit contributions
+0.233
ITNESS+0.225
 Edmund+0.206
 defences+0.196
 insure+0.193
Neg logit contributions
Hour-0.198
GAN-0.197
table-0.197
Band-0.194
chin-0.192
Tokenober
Token ID2023
Feature activation+1.24e+00

Pos logit contributions
Hour+1.316
GAN+1.313
table+1.309
Band+1.290
chin+1.275
Neg logit contributions
-1.582
ITNESS-1.528
 Edmund-1.398
 defences-1.332
 insure-1.316
Tokengh
Token ID456
Feature activation-4.06e-02

Pos logit contributions
table+4.905
GAN+4.750
chin+4.707
Band+4.641
Hour+4.639
Neg logit contributions
-7.649
ITNESS-7.304
 Edmund-6.592
 defences-6.461
 insure-6.403
Tokenagen
Token ID11286
Feature activation+1.15e-01

Pos logit contributions
+0.219
ITNESS+0.213
 Edmund+0.196
 defences+0.186
 insure+0.183
Neg logit contributions
Hour-0.186
GAN-0.184
table-0.183
Band-0.181
chin-0.178
Token.
Token ID13
Feature activation-3.24e-02

Pos logit contributions
Hour+0.518
GAN+0.516
table+0.515
Band+0.507
chin+0.502
Neg logit contributions
-0.636
ITNESS-0.614
 Edmund-0.562
 defences-0.536
 insure-0.530
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
+0.192
ITNESS+0.188
 Edmund+0.178
 defences+0.164
 insure+0.162
Neg logit contributions
Hour-0.128
GAN-0.127
table-0.127
risome-0.125
Band-0.125
Token
Token ID198
Feature activation-7.33e-02

Pos logit contributions
Hour+0.492
GAN+0.492
table+0.486
risome+0.482
Band+0.481
Neg logit contributions
-0.765
ITNESS-0.746
 Edmund-0.701
 defences-0.657
 insure-0.649
Token:
Token ID25
Feature activation+8.47e-02

Pos logit contributions
Hour+0.724
GAN+0.723
table+0.719
Band+0.709
chin+0.702
Neg logit contributions
-0.867
ITNESS-0.839
 Edmund-0.771
 defences-0.736
 insure-0.725
Token "
Token ID366
Feature activation+1.22e-01

Pos logit contributions
table+0.369
GAN+0.369
Hour+0.368
Band+0.364
chin+0.359
Neg logit contributions
-0.480
ITNESS-0.462
 Edmund-0.420
 defences-0.404
 insure-0.400
TokenWhy
Token ID5195
Feature activation-1.59e-01

Pos logit contributions
Hour+0.489
GAN+0.489
table+0.487
chin+0.475
risome+0.475
Neg logit contributions
-0.724
ITNESS-0.700
 Edmund-0.647
 defences-0.617
 insure-0.609
Token in
Token ID287
Feature activation+4.11e-01

Pos logit contributions
+0.809
ITNESS+0.778
 Edmund+0.728
 defences+0.688
 insure+0.686
Neg logit contributions
Hour-0.774
GAN-0.768
table-0.756
Band-0.746
Panel-0.741
Token the
Token ID262
Feature activation+5.78e-01

Pos logit contributions
Hour+1.954
GAN+1.951
table+1.950
Band+1.930
chin+1.905
Neg logit contributions
-2.147
ITNESS-2.069
 Edmund-1.866
 defences-1.779
 insure-1.771
Token hell
Token ID5968
Feature activation+1.24e+00

Pos logit contributions
GAN+3.125
Hour+3.122
table+3.119
Band+3.070
chin+3.041
Neg logit contributions
-2.654
ITNESS-2.533
 Edmund-2.247
 defences-2.114
 insure-2.103
Token do
Token ID466
Feature activation+3.12e-01

Pos logit contributions
table+5.332
GAN+5.304
Band+5.261
Hour+5.224
chin+5.221
Neg logit contributions
-6.914
ITNESS-6.674
 Edmund-5.907
��-5.688
 defences-5.639
Token journalists
Token ID9046
Feature activation-2.40e-01

Pos logit contributions
Hour+1.362
GAN+1.359
table+1.356
Band+1.335
chin+1.323
Neg logit contributions
-1.765
ITNESS-1.704
 Edmund-1.559
 defences-1.491
 insure-1.475
Token insist
Token ID16361
Feature activation+2.99e-01

Pos logit contributions
+1.051
ITNESS+1.024
 Edmund+0.938
 insure+0.907
 defences+0.890
Neg logit contributions
Hour-1.314
GAN-1.313
table-1.294
Band-1.288
risome-1.274
Token on
Token ID319
Feature activation+1.54e-01

Pos logit contributions
Hour+1.347
GAN+1.344
table+1.343
Band+1.323
chin+1.310
Neg logit contributions
-1.644
ITNESS-1.584
 Edmund-1.443
 defences-1.380
 insure-1.365
Token coming
Token ID2406
Feature activation+1.61e-01

Pos logit contributions
Hour+0.797
GAN+0.797
table+0.792
Band+0.781
chin+0.778
Neg logit contributions
-0.745
ITNESS-0.718
 Edmund-0.655
 defences-0.619
 insure-0.609
Tokens
Token ID82
Feature activation+1.17e-01

Pos logit contributions
table+0.142
Hour+0.142
GAN+0.140
Band+0.139
chin+0.136
Neg logit contributions
-0.188
ITNESS-0.182
 Edmund-0.166
 defences-0.159
 insure-0.158
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
Hour+0.507
table+0.505
GAN+0.503
Band+0.497
chin+0.493
Neg logit contributions
-0.670
ITNESS-0.647
 Edmund-0.589
 defences-0.565
 insure-0.560
Token standing
Token ID5055
Feature activation+9.19e-02

Pos logit contributions
+0.983
ITNESS+0.945
 Edmund+0.874
 defences+0.826
 insure+0.812
Neg logit contributions
GAN-1.029
Hour-1.021
table-1.014
Band-1.000
risome-0.989
Token 5
Token ID642
Feature activation+5.48e-02

Pos logit contributions
GAN+0.434
Hour+0.432
table+0.430
Band+0.424
chin+0.420
Neg logit contributions
-0.486
ITNESS-0.468
 Edmund-0.431
 defences-0.410
 insure-0.403
Token'
Token ID6
Feature activation-4.96e-01

Pos logit contributions
GAN+0.261
Hour+0.261
table+0.260
Band+0.256
chin+0.254
Neg logit contributions
-0.288
ITNESS-0.277
 Edmund-0.252
 defences-0.240
 insure-0.238
Token7
Token ID22
Feature activation-1.33e+00

Pos logit contributions
+2.475
ITNESS+2.361
 Edmund+2.213
 defences+2.056
 insure+2.054
Neg logit contributions
GAN-2.434
Hour-2.386
Band-2.385
table-2.371
risome-2.340
Token"
Token ID1
Feature activation-5.11e-01

Pos logit contributions
+6.467
ITNESS+6.181
 Edmund+5.835
 defences+5.466
 insure+5.438
Neg logit contributions
GAN-6.396
Hour-6.322
table-6.290
Band-6.235
risome-6.146
Token with
Token ID351
Feature activation-3.87e-01

Pos logit contributions
+2.552
ITNESS+2.436
 Edmund+2.264
 defences+2.116
 insure+2.111
Neg logit contributions
GAN-2.530
table-2.509
Hour-2.503
Band-2.470
risome-2.448
Token dark
Token ID3223
Feature activation+1.30e-01

Pos logit contributions
+1.918
ITNESS+1.866
 Edmund+1.756
 defences+1.657
 insure+1.626
Neg logit contributions
GAN-1.901
Hour-1.892
table-1.853
Band-1.838
andestine-1.828
Token hair
Token ID4190
Feature activation-3.46e-01

Pos logit contributions
Hour+0.667
GAN+0.666
table+0.658
Band+0.652
chin+0.645
Neg logit contributions
-0.632
ITNESS-0.607
 Edmund-0.556
 defences-0.528
 insure-0.518
Token,
Token ID11
Feature activation-2.06e-01

Pos logit contributions
+1.797
ITNESS+1.723
 Edmund+1.608
 defences+1.521
 insure+1.497
Neg logit contributions
GAN-1.645
Hour-1.632
table-1.606
Band-1.586
andestine-1.573
Token;
Token ID26
Feature activation-1.85e-01

Pos logit contributions
Hour+1.474
table+1.473
GAN+1.462
Band+1.442
chin+1.431
Neg logit contributions
-1.961
ITNESS-1.887
 Edmund-1.717
 defences-1.647
 insure-1.621
Token
Token ID198
Feature activation-1.81e-02

Pos logit contributions
+1.155
ITNESS+1.119
 Edmund+1.034
 defences+0.993
 insure+0.984
Neg logit contributions
Hour-0.703
GAN-0.701
table-0.699
Band-0.686
chin-0.679
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
+0.105
ITNESS+0.103
 Edmund+0.098
 defences+0.091
 insure+0.090
Neg logit contributions
GAN-0.072
Hour-0.072
risome-0.071
table-0.071
chin-0.070
Tokenand
Token ID392
Feature activation-2.13e-01

Pos logit contributions
+0.690
ITNESS+0.667
 Edmund+0.619
 defences+0.594
 insure+0.587
Neg logit contributions
GAN-0.415
Hour-0.412
table-0.408
Band-0.401
risome-0.397
Token behold
Token ID23700
Feature activation-5.17e-01

Pos logit contributions
+1.089
ITNESS+1.055
 Edmund+0.978
 defences+0.923
 insure+0.916
Neg logit contributions
GAN-1.025
Hour-1.023
table-1.018
Band-1.006
chin-0.994
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
+2.697
ITNESS+2.629
 Edmund+2.447
 defences+2.290
 insure+2.276
Neg logit contributions
Hour-2.403
GAN-2.399
Band-2.373
table-2.350
chin-2.322
Token all
Token ID477
Feature activation-4.33e-01

Pos logit contributions
ITNESS+5.966
+5.964
 Edmund+5.838
 defences+5.396
 insure+5.335
Neg logit contributions
GAN-6.385
Hour-6.312
Band-6.261
table-6.200
risome-6.158
Token is
Token ID318
Feature activation-5.26e-01

Pos logit contributions
+2.138
ITNESS+2.066
 Edmund+1.894
 defences+1.807
 insure+1.789
Neg logit contributions
GAN-2.155
Hour-2.153
table-2.137
Band-2.118
chin-2.088
Token vanity
Token ID39609
Feature activation-7.67e-01

Pos logit contributions
+2.387
ITNESS+2.305
 Edmund+2.154
 defences+2.019
 insure+2.018
Neg logit contributions
GAN-2.794
Hour-2.780
table-2.725
Band-2.716
chin-2.705
Token and
Token ID290
Feature activation-4.90e-01

Pos logit contributions
+3.819
ITNESS+3.673
 Edmund+3.496
 defences+3.306
 insure+3.295
Neg logit contributions
GAN-3.725
Hour-3.664
Band-3.552
table-3.542
risome-3.515
Token vex
Token ID41548
Feature activation-3.03e-01

Pos logit contributions
+2.107
ITNESS+2.052
 Edmund+1.900
 defences+1.793
 insure+1.772
Neg logit contributions
GAN-2.710
Hour-2.703
table-2.669
Band-2.645
chin-2.632
Token infusion
Token ID35547
Feature activation-4.56e-02

Pos logit contributions
+0.683
ITNESS+0.649
 Edmund+0.599
 defences+0.572
 insure+0.559
Neg logit contributions
GAN-0.779
Hour-0.778
table-0.770
Band-0.760
chin-0.754
Token of
Token ID286
Feature activation-5.04e-01

Pos logit contributions
+0.259
ITNESS+0.251
 Edmund+0.231
 defences+0.219
 insure+0.217
Neg logit contributions
Hour-0.198
table-0.197
GAN-0.197
Band-0.193
chin-0.191
Token provincial
Token ID15818
Feature activation+2.05e-02

Pos logit contributions
+2.098
ITNESS+2.000
 Edmund+1.809
 defences+1.752
 insure+1.683
Neg logit contributions
GAN-2.920
Hour-2.905
table-2.873
Band-2.835
chin-2.819
Token funds
Token ID5153
Feature activation+2.68e-01

Pos logit contributions
Hour+0.120
GAN+0.119
table+0.119
Band+0.117
chin+0.117
Neg logit contributions
-0.086
ITNESS-0.082
 Edmund-0.073
 defences-0.069
 insure-0.067
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
Hour+1.161
table+1.158
GAN+1.153
Band+1.138
chin+1.127
Neg logit contributions
-1.523
ITNESS-1.478
 Edmund-1.346
 defences-1.285
 insure-1.271
Token,
Token ID11
Feature activation-1.31e+00

Pos logit contributions
+0.628
ITNESS+0.604
 Edmund+0.552
 defences+0.529
 insure+0.517
Neg logit contributions
GAN-0.623
Hour-0.622
table-0.618
Band-0.608
chin-0.606
Token in
Token ID287
Feature activation-7.55e-01

Pos logit contributions
+5.703
ITNESS+5.524
 Edmund+5.515
 defences+5.100
 insurer+5.011
Neg logit contributions
GAN-6.571
Hour-6.466
table-6.360
rats-6.345
risome-6.338
Token the
Token ID262
Feature activation-4.74e-01

Pos logit contributions
+3.510
ITNESS+3.367
 Edmund+3.229
 defences+2.977
 insure+2.898
Neg logit contributions
GAN-3.925
Hour-3.898
table-3.842
chin-3.818
Band-3.758
Token case
Token ID1339
Feature activation-2.50e-01

Pos logit contributions
+2.065
ITNESS+1.974
 Edmund+1.841
 defences+1.713
 insure+1.669
Neg logit contributions
GAN-2.654
Hour-2.626
table-2.608
chin-2.592
Band-2.587
Token of
Token ID286
Feature activation+2.17e-01

Pos logit contributions
+1.280
ITNESS+1.237
 Edmund+1.151
 defences+1.084
 insure+1.055
Neg logit contributions
GAN-1.189
Hour-1.186
table-1.170
Band-1.170
chin-1.160
Token Vancouver
Token ID10930
Feature activation-4.26e-02

Pos logit contributions
Hour+1.069
GAN+1.066
table+1.066
Band+1.052
chin+1.041
Neg logit contributions
-1.107
ITNESS-1.065
 Edmund-0.962
 defences-0.916
 insure-0.905
Token Est
Token ID10062
Feature activation-4.60e-01

Pos logit contributions
+0.449
ITNESS+0.434
 Edmund+0.397
 defences+0.378
 insure+0.371
Neg logit contributions
GAN-0.428
Hour-0.428
table-0.425
Band-0.418
chin-0.417
Tokeneb
Token ID1765
Feature activation+1.56e-01

Pos logit contributions
+2.584
ITNESS+2.521
 Edmund+2.356
 defences+2.250
 insure+2.179
Neg logit contributions
Hour-1.934
Band-1.909
GAN-1.905
risome-1.896
table-1.891
Tokenan
Token ID272
Feature activation+3.91e-01

Pos logit contributions
Hour+0.452
GAN+0.447
table+0.447
Band+0.435
chin+0.426
Neg logit contributions
-1.098
ITNESS-1.072
 Edmund-1.011
 defences-0.972
 insure-0.962
Token [
Token ID685
Feature activation+1.83e-01

Pos logit contributions
Hour+1.632
GAN+1.629
table+1.625
Band+1.599
chin+1.583
Neg logit contributions
-2.283
ITNESS-2.208
 Edmund-2.033
 defences-1.946
 insure-1.924
TokenO
Token ID46
Feature activation-3.60e-01

Pos logit contributions
GAN+0.691
Hour+0.690
table+0.686
Band+0.672
chin+0.670
Neg logit contributions
-1.138
ITNESS-1.100
 Edmund-1.018
 defences-0.976
 insure-0.967
Tokencon
Token ID1102
Feature activation-1.29e+00

Pos logit contributions
+2.172
ITNESS+2.101
 Edmund+1.964
 defences+1.877
 insure+1.858
Neg logit contributions
Hour-1.391
GAN-1.384
Band-1.352
table-1.338
risome-1.316
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
+7.098
ITNESS+6.672
 Edmund+6.617
 defences+6.226
 insurer+6.031
Neg logit contributions
Hour-5.285
GAN-5.265
risome-5.125
table-4.971
andestine-4.951
Token his
Token ID465
Feature activation-1.44e-01

Pos logit contributions
Hour+0.623
GAN+0.621
table+0.619
Band+0.611
chin+0.606
Neg logit contributions
-0.680
ITNESS-0.655
 Edmund-0.598
 defences-0.568
 insure-0.561
Token new
Token ID649
Feature activation-2.84e-01

Pos logit contributions
+0.668
ITNESS+0.648
 Edmund+0.592
 defences+0.562
 insure+0.549
Neg logit contributions
GAN-0.766
Hour-0.765
table-0.756
Band-0.749
chin-0.747
Token teammate
Token ID20052
Feature activation-5.49e-01

Pos logit contributions
+1.289
ITNESS+1.264
 Edmund+1.161
 defences+1.104
 insure+1.075
Neg logit contributions
GAN-1.516
Hour-1.515
table-1.496
chin-1.479
Band-1.477
Token]
Token ID60
Feature activation+2.27e-01

Pos logit contributions
+3.198
ITNESS+3.064
 Edmund+2.942
 defences+2.780
 insure+2.722
Neg logit contributions
GAN-2.223
Hour-2.218
table-2.185
chin-2.162
risome-2.162
Token
Token ID134
Feature activation+3.51e-03

Pos logit contributions
Hour+0.691
GAN+0.689
table+0.687
Band+0.676
chin+0.670
Neg logit contributions
-0.914
ITNESS-0.885
 Edmund-0.811
 defences-0.776
 insure-0.767
Token
Token ID119
Feature activation-2.76e-01

Pos logit contributions
Hour+0.023
table+0.023
GAN+0.023
Band+0.023
chin+0.023
Neg logit contributions
-0.012
ITNESS-0.011
 Edmund-0.009
 defences-0.009
 insure-0.009
TokenA
Token ID32
Feature activation-3.88e-01

Pos logit contributions
+1.506
ITNESS+1.446
 Edmund+1.320
 insure+1.244
 defences+1.244
Neg logit contributions
GAN-1.245
Hour-1.241
table-1.233
Band-1.217
chin-1.199
Token
Token ID134
Feature activation-8.54e-02

Pos logit contributions
+2.132
ITNESS+2.040
 Edmund+1.866
 defences+1.769
 insure+1.764
Neg logit contributions
Hour-1.754
GAN-1.744
table-1.733
Band-1.713
risome-1.680
Token
Token ID119
Feature activation-5.00e-01

Pos logit contributions
+0.275
ITNESS+0.258
 Edmund+0.219
 defences+0.200
 insure+0.196
Neg logit contributions
Hour-0.581
GAN-0.580
table-0.579
Band-0.573
chin-0.570
Tokenole
Token ID2305
Feature activation-1.28e+00

Pos logit contributions
+2.846
ITNESS+2.745
 Edmund+2.512
 defences+2.370
 insure+2.370
Neg logit contributions
GAN-2.105
Hour-2.104
table-2.086
Band-2.057
risome-2.032
Token i
Token ID1312
Feature activation-1.38e-01

Pos logit contributions
+6.133
ITNESS+5.769
 Edmund+5.232
 insure+4.945
 defences+4.891
Neg logit contributions
Hour-6.323
GAN-6.195
table-6.137
Band-6.134
risome-6.115
Token
Token ID220
Feature activation-1.45e-01

Pos logit contributions
+0.740
ITNESS+0.707
 Edmund+0.648
 defences+0.614
 insure+0.611
Neg logit contributions
Hour-0.643
table-0.640
GAN-0.639
Band-0.629
chin-0.620
Token
Token ID134
Feature activation-5.28e-01

Pos logit contributions
+0.835
ITNESS+0.808
 Edmund+0.741
 defences+0.710
 insure+0.702
Neg logit contributions
Hour-0.615
GAN-0.613
table-0.612
Band-0.602
chin-0.596
Token
Token ID119
Feature activation-5.75e-01

Pos logit contributions
+1.515
ITNESS+1.354
 Edmund+1.156
 defences+1.015
 insure+0.981
Neg logit contributions
GAN-3.785
Hour-3.761
table-3.728
risome-3.728
Band-3.708
Tokenemo
Token ID41903
Feature activation-2.94e-01

Pos logit contributions
+3.244
ITNESS+3.102
 Edmund+2.838
 insure+2.675
 defences+2.653
Neg logit contributions
GAN-2.433
table-2.421
Hour-2.420
Band-2.377
risome-2.364
Token initiative
Token ID10219
Feature activation+1.06e-01

Pos logit contributions
+0.326
ITNESS+0.312
 Edmund+0.286
 defences+0.270
 insure+0.265
Neg logit contributions
GAN-0.341
Hour-0.340
table-0.337
Band-0.336
chin-0.331
Token is
Token ID318
Feature activation-7.33e-02

Pos logit contributions
Hour+0.486
GAN+0.485
table+0.483
Band+0.476
chin+0.471
Neg logit contributions
-0.578
ITNESS-0.557
 Edmund-0.510
 defences-0.487
 insure-0.481
Token feasible
Token ID23498
Feature activation-1.32e-01

Pos logit contributions
+0.375
ITNESS+0.361
 Edmund+0.332
 defences+0.315
 insure+0.312
Neg logit contributions
GAN-0.356
Hour-0.356
table-0.355
Band-0.349
chin-0.347
Token constitutionally
Token ID40800
Feature activation+7.37e-02

Pos logit contributions
+0.715
ITNESS+0.687
 Edmund+0.634
 defences+0.607
 insure+0.599
Neg logit contributions
Hour-0.602
GAN-0.601
table-0.597
Band-0.591
chin-0.586
Token and
Token ID290
Feature activation+1.54e-01

Pos logit contributions
Hour+0.320
table+0.319
GAN+0.319
Band+0.313
chin+0.311
Neg logit contributions
-0.418
ITNESS-0.405
 Edmund-0.369
 defences-0.353
 insure-0.349
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
Hour+0.720
GAN+0.717
table+0.716
Band+0.706
chin+0.700
Neg logit contributions
-0.826
ITNESS-0.795
 Edmund-0.729
 defences-0.696
 insure-0.689
Token although
Token ID3584
Feature activation-3.87e-01

Pos logit contributions
+5.597
ITNESS+5.496
 Edmund+5.348
 defences+5.132
 insure+5.055
Neg logit contributions
GAN-6.178
Hour-6.173
Band-6.039
risome-6.034
chin-6.033
Token it
Token ID340
Feature activation-3.11e-01

Pos logit contributions
+1.931
ITNESS+1.866
 Edmund+1.736
 defences+1.644
 insure+1.598
Neg logit contributions
Hour-1.918
GAN-1.902
table-1.884
Band-1.865
chin-1.856
Token would
Token ID561
Feature activation-5.27e-01

Pos logit contributions
+1.653
ITNESS+1.583
 Edmund+1.483
 defences+1.424
 insure+1.398
Neg logit contributions
Hour-1.439
GAN-1.437
table-1.415
Band-1.405
chin-1.377
Token find
Token ID1064
Feature activation-1.38e-01

Pos logit contributions
+2.342
ITNESS+2.260
 Edmund+2.093
 insure+2.082
 defences+2.011
Neg logit contributions
GAN-2.817
Hour-2.814
Band-2.778
table-2.772
chin-2.731
Token opposition
Token ID5471
Feature activation-1.70e-02

Pos logit contributions
+0.665
ITNESS+0.638
 Edmund+0.575
 defences+0.545
 insure+0.537
Neg logit contributions
Hour-0.718
GAN-0.716
table-0.715
Band-0.705
chin-0.700
TokenAP
Token ID2969
Feature activation-3.40e-01

Pos logit contributions
+2.295
ITNESS+2.239
 Edmund+2.073
 defences+1.944
 insure+1.919
Neg logit contributions
GAN-1.533
Hour-1.521
table-1.520
chin-1.515
risome-1.498
Token)
Token ID8
Feature activation+2.49e-02

Pos logit contributions
+1.837
ITNESS+1.771
 Edmund+1.615
 defences+1.540
 insure+1.522
Neg logit contributions
Hour-1.568
GAN-1.564
table-1.561
Band-1.537
chin-1.524
Token
Token ID198
Feature activation+1.22e-01

Pos logit contributions
Hour+0.102
GAN+0.101
table+0.101
Band+0.099
chin+0.099
Neg logit contributions
-0.148
ITNESS-0.143
 Edmund-0.132
 defences-0.126
 insure-0.125
Token
Token ID198
Feature activation+7.88e-02

Pos logit contributions
GAN+0.470
Hour+0.469
table+0.463
Band+0.458
chin+0.458
Neg logit contributions
-0.734
ITNESS-0.717
 Edmund-0.670
 defences-0.631
 insure-0.624
TokenStory
Token ID11605
Feature activation-2.62e-01

Pos logit contributions
GAN+0.309
table+0.308
Hour+0.308
chin+0.303
risome+0.303
Neg logit contributions
-0.476
ITNESS-0.462
 Edmund-0.426
 defences-0.403
 insure-0.398
Token continues
Token ID4477
Feature activation-1.26e+00

Pos logit contributions
+1.513
ITNESS+1.455
 Edmund+1.347
 defences+1.289
 insure+1.274
Neg logit contributions
GAN-1.111
Hour-1.109
table-1.094
Band-1.085
chin-1.067
Token
Token ID198
Feature activation+7.54e-01

Pos logit contributions
+3.977
ITNESS+3.863
 Edmund+3.740
 defences+3.404
 insure+3.182
Neg logit contributions
GAN-8.087
Hour-7.905
Band-7.898
chin-7.801
table-7.787
Token
Token ID198
Feature activation+6.22e-01

Pos logit contributions
table+2.580
Hour+2.565
GAN+2.547
Band+2.507
chin+2.451
Neg logit contributions
-5.029
ITNESS-4.848
 Edmund-4.452
 defences-4.353
 insure-4.324
TokenThere
Token ID1858
Feature activation+5.14e-02

Pos logit contributions
Hour+1.397
table+1.378
GAN+1.377
Band+1.345
chin+1.296
Neg logit contributions
-4.845
ITNESS-4.726
 Edmund-4.441
 defences-4.320
 insure-4.295
Token were
Token ID547
Feature activation-9.73e-02

Pos logit contributions
Hour+0.222
GAN+0.221
table+0.220
Band+0.217
chin+0.215
Neg logit contributions
-0.294
ITNESS-0.284
 Edmund-0.261
 defences-0.249
 insure-0.247
Token a
Token ID257
Feature activation-3.84e-01

Pos logit contributions
+0.439
ITNESS+0.423
 Edmund+0.389
 defences+0.370
 insure+0.358
Neg logit contributions
Hour-0.530
GAN-0.530
table-0.524
chin-0.520
Band-0.519
Token He
Token ID679
Feature activation-5.66e-02

Pos logit contributions
+1.249
ITNESS+1.226
 Edmund+1.163
 defences+1.051
 insure+1.038
Neg logit contributions
GAN-0.979
Hour-0.978
table-0.971
risome-0.963
Band-0.958
Token was
Token ID373
Feature activation+1.09e-02

Pos logit contributions
+0.315
ITNESS+0.301
 Edmund+0.277
 defences+0.265
 insure+0.262
Neg logit contributions
Hour-0.252
GAN-0.252
table-0.250
Band-0.247
chin-0.244
Token a
Token ID257
Feature activation-3.26e-01

Pos logit contributions
Hour+0.052
GAN+0.052
table+0.052
Band+0.051
chin+0.051
Neg logit contributions
-0.057
ITNESS-0.055
 Edmund-0.050
 defences-0.048
 insure-0.047
Token major
Token ID1688
Feature activation-5.89e-01

Pos logit contributions
+1.480
ITNESS+1.423
 Edmund+1.300
 defences+1.220
 insure+1.213
Neg logit contributions
Hour-1.765
GAN-1.754
Band-1.731
table-1.727
chin-1.710
Token trade
Token ID3292
Feature activation-2.53e-01

Pos logit contributions
+2.569
ITNESS+2.458
 Edmund+2.240
 defences+2.086
 insure+2.072
Neg logit contributions
GAN-3.268
Hour-3.259
Band-3.218
chin-3.203
table-3.196
Token asset
Token ID11171
Feature activation-1.25e+00

Pos logit contributions
+1.198
ITNESS+1.150
 Edmund+1.041
 defences+0.981
 insure+0.980
Neg logit contributions
Hour-1.327
GAN-1.323
Band-1.306
table-1.305
chin-1.286
Token.
Token ID13
Feature activation-9.02e-03

Pos logit contributions
+5.669
ITNESS+5.352
 Edmund+5.135
 defences+4.859
 insurer+4.770
Neg logit contributions
GAN-6.318
Hour-6.167
Band-6.081
chin-6.057
risome-6.041
Token And
Token ID843
Feature activation-1.67e-01

Pos logit contributions
+0.052
ITNESS+0.050
 Edmund+0.047
 defences+0.044
 insure+0.043
Neg logit contributions
Hour-0.038
GAN-0.038
table-0.037
Band-0.037
chin-0.037
Token the
Token ID262
Feature activation-2.64e-01

Pos logit contributions
+0.865
ITNESS+0.846
 Edmund+0.787
 defences+0.744
 insure+0.733
Neg logit contributions
Hour-0.782
GAN-0.780
table-0.772
Band-0.764
chin-0.756
Token Padres
Token ID40188
Feature activation-3.50e-01

Pos logit contributions
+1.393
ITNESS+1.362
 Edmund+1.262
 defences+1.203
 insure+1.183
Neg logit contributions
GAN-1.221
Hour-1.218
table-1.194
Band-1.191
chin-1.185
Token should
Token ID815
Feature activation+1.36e-02

Pos logit contributions
+1.784
ITNESS+1.736
 Edmund+1.606
 defences+1.543
 insure+1.537
Neg logit contributions
Hour-1.669
GAN-1.657
table-1.651
Band-1.620
chin-1.614
Token Steam
Token ID9094
Feature activation-4.53e-01

Pos logit contributions
+2.059
ITNESS+1.974
 Edmund+1.815
 insure+1.695
 defences+1.693
Neg logit contributions
table-1.787
Hour-1.781
GAN-1.778
chin-1.754
Band-1.752
Token the
Token ID262
Feature activation-3.18e-01

Pos logit contributions
+2.107
ITNESS+2.020
 Edmund+1.853
 defences+1.738
 insure+1.705
Neg logit contributions
Hour-2.396
GAN-2.376
table-2.365
Band-2.357
chin-2.344
Token broccoli
Token ID44653
Feature activation-6.09e-01

Pos logit contributions
+1.171
ITNESS+1.104
 Edmund+0.997
 defences+0.916
 insure+0.902
Neg logit contributions
Hour-2.004
GAN-1.978
table-1.969
Band-1.966
chin-1.937
Token for
Token ID329
Feature activation-2.61e-01

Pos logit contributions
+2.908
ITNESS+2.791
 Edmund+2.623
 defences+2.513
 insure+2.434
Neg logit contributions
GAN-3.035
Hour-3.032
chin-2.993
risome-2.985
table-2.985
Token a
Token ID257
Feature activation-6.83e-01

Pos logit contributions
+1.471
ITNESS+1.438
 Edmund+1.340
 defences+1.269
 insure+1.254
Neg logit contributions
GAN-1.110
Hour-1.107
Band-1.083
table-1.080
chin-1.077
Token few
Token ID1178
Feature activation-1.24e+00

Pos logit contributions
+0.752
ITNESS+0.602
 Edmund+0.378
 defences+0.226
 insure+0.219
Neg logit contributions
GAN-5.940
Hour-5.937
Band-5.901
table-5.868
andestine-5.858
Token minutes
Token ID2431
Feature activation-5.70e-01

Pos logit contributions
+1.740
ITNESS+1.649
 Edmund+1.088
 defences+0.978
 insure+0.739
Neg logit contributions
table-10.261
GAN-10.235
Band-10.200
risome-10.167
Hour-10.122
Token until
Token ID1566
Feature activation-7.99e-01

Pos logit contributions
+2.381
ITNESS+2.237
 Edmund+2.044
 insure+1.913
 defences+1.909
Neg logit contributions
GAN-3.292
Band-3.271
Hour-3.257
table-3.246
chin-3.225
Token fork
Token ID15563
Feature activation+3.83e-01

Pos logit contributions
+2.406
ITNESS+2.336
 Edmund+2.075
 insure+1.876
 rehe+1.818
Neg logit contributions
Hour-5.371
table-5.266
GAN-5.263
risome-5.240
Band-5.235
Token-
Token ID12
Feature activation-1.35e-02

Pos logit contributions
table+1.435
Hour+1.431
GAN+1.422
Band+1.397
chin+1.386
Neg logit contributions
-2.411
ITNESS-2.341
 Edmund-2.150
 defences-2.072
 insure-2.052
Tokent
Token ID83
Feature activation-5.36e-01

Pos logit contributions
+0.073
ITNESS+0.070
 Edmund+0.065
 defences+0.062
 insure+0.061
Neg logit contributions
GAN-0.061
Hour-0.061
Band-0.060
risome-0.059
table-0.059
Token stage
Token ID3800
Feature activation+3.50e-02

Pos logit contributions
+2.288
ITNESS+2.231
 Edmund+2.086
 defences+1.949
 insure+1.898
Neg logit contributions
Hour-2.865
GAN-2.865
chin-2.813
Band-2.784
table-2.767
Token.
Token ID13
Feature activation-8.21e-02

Pos logit contributions
Hour+0.160
GAN+0.159
table+0.159
Band+0.156
chin+0.155
Neg logit contributions
-0.190
ITNESS-0.184
 Edmund-0.168
 defences-0.161
 insure-0.159
Token
Token ID198
Feature activation-1.80e-01

Pos logit contributions
+0.478
ITNESS+0.466
 Edmund+0.436
 defences+0.406
 insure+0.401
Neg logit contributions
Hour-0.336
GAN-0.334
table-0.333
chin-0.331
Band-0.328
Token
Token ID198
Feature activation-2.53e-01

Pos logit contributions
+1.020
ITNESS+1.005
 Edmund+0.949
 defences+0.876
 insure+0.863
Neg logit contributions
GAN-0.733
risome-0.730
Hour-0.730
andestine-0.720
chin-0.716
TokenStory
Token ID11605
Feature activation-4.55e-01

Pos logit contributions
+1.454
ITNESS+1.412
 Edmund+1.282
 defences+1.201
 insure+1.194
Neg logit contributions
risome-1.060
GAN-1.037
table-1.036
chin-1.036
Hour-1.032
Token continues
Token ID4477
Feature activation-1.23e+00

Pos logit contributions
+2.023
ITNESS+1.909
 Edmund+1.763
 defences+1.642
 insure+1.626
Neg logit contributions
GAN-2.499
Hour-2.495
Band-2.444
table-2.442
chin-2.405
Token below
Token ID2174
Feature activation-3.88e-01

Pos logit contributions
+3.616
ITNESS+3.613
 Edmund+3.552
 defences+3.181
 insurer+2.945
Neg logit contributions
GAN-8.021
Band-7.872
Hour-7.858
chin-7.755
Role-7.694
Token advertisement
Token ID15422
Feature activation+4.48e-01

Pos logit contributions
+1.482
ITNESS+1.444
 Edmund+1.365
 defences+1.241
 insure+1.213
Neg logit contributions
GAN-2.324
Hour-2.298
Band-2.267
chin-2.249
risome-2.241
Token
Token ID198
Feature activation+6.15e-01

Pos logit contributions
Hour+1.628
GAN+1.622
table+1.618
Band+1.587
chin+1.570
Neg logit contributions
-2.859
ITNESS-2.773
 Edmund-2.567
 defences-2.469
 insure-2.445
Token
Token ID198
Feature activation+2.80e-01

Pos logit contributions
Hour+2.170
table+2.163
GAN+2.157
Band+2.116
chin+2.080
Neg logit contributions
-4.012
ITNESS-3.879
 Edmund-3.576
 defences-3.468
 insure-3.439
TokenHe
Token ID1544
Feature activation-3.16e-01

Pos logit contributions
Hour+1.131
GAN+1.129
table+1.126
Band+1.105
chin+1.098
Neg logit contributions
-1.674
ITNESS-1.621
 Edmund-1.490
 defences-1.427
 insure-1.412
Token oppression
Token ID18345
Feature activation-3.36e-01

Pos logit contributions
+0.411
ITNESS+0.398
 Edmund+0.362
 defences+0.341
 insure+0.334
Neg logit contributions
GAN-0.480
Hour-0.478
table-0.476
Band-0.469
chin-0.467
Token,
Token ID11
Feature activation-3.35e-01

Pos logit contributions
+1.835
ITNESS+1.760
 Edmund+1.632
 defences+1.555
 insure+1.530
Neg logit contributions
Hour-1.525
GAN-1.524
table-1.513
Band-1.490
chin-1.479
Token humanity
Token ID9265
Feature activation-1.83e-01

Pos logit contributions
+1.640
ITNESS+1.576
 Edmund+1.457
 defences+1.375
 insure+1.349
Neg logit contributions
GAN-1.700
Hour-1.696
table-1.684
Band-1.657
chin-1.652
Token was
Token ID373
Feature activation-5.00e-01

Pos logit contributions
+0.997
ITNESS+0.961
 Edmund+0.884
 defences+0.844
 insure+0.832
Neg logit contributions
GAN-0.838
Hour-0.838
table-0.831
Band-0.818
chin-0.812
Token at
Token ID379
Feature activation-4.87e-01

Pos logit contributions
+2.440
ITNESS+2.360
 Edmund+2.188
 defences+2.066
 insure+2.034
Neg logit contributions
GAN-2.508
Hour-2.502
table-2.476
risome-2.442
Band-2.437
Token last
Token ID938
Feature activation-1.23e+00

Pos logit contributions
+2.281
ITNESS+2.178
 Edmund+1.977
 defences+1.871
 insure+1.838
Neg logit contributions
GAN-2.579
Hour-2.577
table-2.554
Band-2.524
chin-2.513
Token free
Token ID1479
Feature activation-6.06e-01

Pos logit contributions
+5.258
ITNESS+5.108
 Edmund+4.795
 defences+4.544
 insure+4.488
Neg logit contributions
risome-6.557
GAN-6.438
andestine-6.418
Hour-6.384
table-6.317
Token from
Token ID422
Feature activation-4.27e-01

Pos logit contributions
+3.145
ITNESS+3.016
 Edmund+2.796
 defences+2.655
 insure+2.621
Neg logit contributions
GAN-2.876
Hour-2.855
Band-2.830
table-2.803
risome-2.785
Token the
Token ID262
Feature activation-7.82e-02

Pos logit contributions
+1.957
ITNESS+1.887
 Edmund+1.742
 defences+1.656
 insure+1.605
Neg logit contributions
GAN-2.281
Hour-2.279
table-2.265
chin-2.237
Band-2.225
Token cruel
Token ID12177
Feature activation-2.96e-01

Pos logit contributions
+0.349
ITNESS+0.334
 Edmund+0.303
 defences+0.287
 insure+0.280
Neg logit contributions
GAN-0.433
Hour-0.432
table-0.430
Band-0.424
chin-0.422
Token tyranny
Token ID29977
Feature activation-5.35e-01

Pos logit contributions
+1.357
ITNESS+1.293
 Edmund+1.186
 defences+1.132
 insure+1.099
Neg logit contributions
GAN-1.590
Hour-1.589
table-1.573
Band-1.562
chin-1.547
Token cameras
Token ID9073
Feature activation-8.03e-02

Pos logit contributions
+0.681
ITNESS+0.668
 Edmund+0.607
 defences+0.574
 insure+0.567
Neg logit contributions
Hour-0.751
GAN-0.750
table-0.745
Band-0.734
chin-0.731
Token were
Token ID547
Feature activation+3.06e-02

Pos logit contributions
+0.416
ITNESS+0.406
 Edmund+0.375
 defences+0.356
 insure+0.356
Neg logit contributions
GAN-0.382
Hour-0.379
table-0.378
Band-0.370
chin-0.368
Token turned
Token ID2900
Feature activation-1.22e-01

Pos logit contributions
GAN+0.152
Hour+0.152
table+0.151
Band+0.148
chin+0.147
Neg logit contributions
-0.154
ITNESS-0.149
 Edmund-0.137
 defences-0.129
 insure-0.129
Token on
Token ID319
Feature activation+3.18e-02

Pos logit contributions
+0.600
ITNESS+0.574
 Edmund+0.528
 defences+0.499
 insure+0.494
Neg logit contributions
Hour-0.621
GAN-0.621
table-0.616
Band-0.610
chin-0.604
Token and
Token ID290
Feature activation-6.20e-01

Pos logit contributions
GAN+0.152
Hour+0.152
table+0.151
Band+0.149
chin+0.149
Neg logit contributions
-0.165
ITNESS-0.159
 Edmund-0.147
 defences-0.139
 insure-0.138
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
+2.862
ITNESS+2.788
 Edmund+2.607
 defences+2.448
 insure+2.427
Neg logit contributions
Hour-3.201
table-3.193
GAN-3.184
chin-3.149
risome-3.112
Token in
Token ID287
Feature activation-6.47e-01

Pos logit contributions
ITNESS+5.337
+5.245
 Edmund+5.160
 defences+4.826
 insure+4.754
Neg logit contributions
GAN-6.148
Hour-6.028
chin-6.012
risome-6.012
table-5.968
Token the
Token ID262
Feature activation-4.03e-01

Pos logit contributions
+3.056
ITNESS+3.029
 Edmund+2.769
 defences+2.585
 insure+2.582
Neg logit contributions
GAN-3.313
Hour-3.274
table-3.272
chin-3.247
Band-3.205
Token relatively
Token ID5365
Feature activation-7.83e-02

Pos logit contributions
+1.751
ITNESS+1.720
 Edmund+1.558
 defences+1.456
 insure+1.442
Neg logit contributions
GAN-2.254
Hour-2.233
table-2.220
chin-2.209
Band-2.203
Token dense
Token ID15715
Feature activation+3.73e-01

Pos logit contributions
+0.306
ITNESS+0.290
 Edmund+0.257
 defences+0.239
 insure+0.237
Neg logit contributions
GAN-0.478
Hour-0.477
table-0.475
Band-0.470
chin-0.468
Token atmosphere
Token ID8137
Feature activation+1.01e-01

Pos logit contributions
Hour+1.855
table+1.850
GAN+1.847
Band+1.826
chin+1.806
Neg logit contributions
-1.886
ITNESS-1.808
 Edmund-1.623
 defences-1.540
 insure-1.527
Token {{
Token ID22935
Feature activation-6.60e-01

Pos logit contributions
Hour+0.121
GAN+0.121
table+0.120
Band+0.119
chin+0.118
Neg logit contributions
-0.186
ITNESS-0.181
 Edmund-0.167
 defences-0.160
 insure-0.158
Token9
Token ID24
Feature activation-3.60e-01

Pos logit contributions
+3.175
ITNESS+3.061
 Edmund+2.794
 defences+2.572
 insure+2.541
Neg logit contributions
GAN-3.239
risome-3.221
Hour-3.210
andestine-3.203
table-3.178
Token,
Token ID11
Feature activation-5.21e-01

Pos logit contributions
+1.854
ITNESS+1.799
 Edmund+1.671
 defences+1.523
 insure+1.512
Neg logit contributions
Hour-1.663
risome-1.661
GAN-1.652
Band-1.639
andestine-1.631
Token 15
Token ID1315
Feature activation-3.86e-01

Pos logit contributions
+2.272
ITNESS+2.212
 Edmund+2.021
 defences+1.813
 insure+1.809
Neg logit contributions
GAN-2.841
Hour-2.841
table-2.786
Band-2.785
risome-2.777
Token},
Token ID5512
Feature activation-4.48e-01

Pos logit contributions
+2.370
ITNESS+2.296
 Edmund+2.115
 defences+2.027
 insure+2.012
Neg logit contributions
Hour-1.491
GAN-1.486
table-1.478
Band-1.458
chin-1.443
Token {{
Token ID22935
Feature activation-1.22e+00

Pos logit contributions
+2.231
ITNESS+2.149
 Edmund+1.946
 defences+1.829
 insure+1.819
Neg logit contributions
Hour-2.245
GAN-2.243
table-2.229
Band-2.213
chin-2.194
Token8
Token ID23
Feature activation-5.77e-01

Pos logit contributions
+5.204
ITNESS+5.051
 Edmund+4.603
 defences+4.033
 insure+4.002
Neg logit contributions
risome-6.365
GAN-6.236
andestine-6.228
Hour-6.106
chin-6.099
Token,
Token ID11
Feature activation-8.79e-01

Pos logit contributions
+2.774
ITNESS+2.714
 Edmund+2.523
 defences+2.242
 insure+2.219
Neg logit contributions
risome-2.819
andestine-2.769
GAN-2.762
Hour-2.734
Band-2.709
Token 14
Token ID1478
Feature activation-3.54e-01

Pos logit contributions
+3.480
ITNESS+3.440
 Edmund+3.173
 defences+2.744
 insure+2.735
Neg logit contributions
Hour-4.971
GAN-4.941
risome-4.911
Band-4.867
chin-4.846
Token},
Token ID5512
Feature activation-5.82e-01

Pos logit contributions
+2.022
ITNESS+1.963
 Edmund+1.795
 insure+1.715
 defences+1.703
Neg logit contributions
GAN-1.466
Hour-1.466
Band-1.455
table-1.449
chin-1.426
Token {
Token ID1391
Feature activation-6.38e-01

Pos logit contributions
+2.805
ITNESS+2.758
 Edmund+2.596
 insure+2.307
 defences+2.289
Neg logit contributions
Hour-2.854
Band-2.845
risome-2.843
GAN-2.842
chin-2.813
Token policy
Token ID2450
Feature activation+1.20e-01

Pos logit contributions
+1.118
ITNESS+1.080
 Edmund+0.990
 defences+0.951
 insure+0.929
Neg logit contributions
Hour-1.423
GAN-1.417
table-1.403
Band-1.396
chin-1.377
Token.
Token ID13
Feature activation+4.49e-02

Pos logit contributions
Hour+0.543
GAN+0.542
table+0.541
Band+0.533
chin+0.528
Neg logit contributions
-0.654
ITNESS-0.631
 Edmund-0.576
 defences-0.550
 insure-0.544
Token
Token ID198
Feature activation-8.39e-02

Pos logit contributions
Hour+0.180
GAN+0.180
table+0.178
chin+0.176
Band+0.174
Neg logit contributions
-0.265
ITNESS-0.259
 Edmund-0.245
 defences-0.226
 insure-0.224
Token
Token ID198
Feature activation-5.67e-02

Pos logit contributions
+0.487
ITNESS+0.479
 Edmund+0.451
 defences+0.418
 insure+0.413
Neg logit contributions
GAN-0.336
Hour-0.334
risome-0.331
table-0.328
Band-0.327
TokenUS
Token ID2937
Feature activation+7.16e-02

Pos logit contributions
+0.338
ITNESS+0.326
 Edmund+0.300
 defences+0.281
 insure+0.278
Neg logit contributions
risome-0.226
GAN-0.225
table-0.223
chin-0.222
Hour-0.221
TokenAT
Token ID1404
Feature activation-1.22e+00

Pos logit contributions
Hour+0.335
table+0.333
GAN+0.332
Band+0.328
chin+0.325
Neg logit contributions
-0.382
ITNESS-0.370
 Edmund-0.337
 defences-0.322
 insure-0.318
TokenOD
Token ID3727
Feature activation-4.48e-01

Pos logit contributions
ITNESS+4.996
+4.935
 Edmund+4.543
 defences+4.120
 insure+4.104
Neg logit contributions
Hour-6.681
table-6.647
chin-6.645
Band-6.536
risome-6.532
TokenAY
Token ID4792
Feature activation+2.63e-01

Pos logit contributions
+2.340
ITNESS+2.326
 Edmund+2.125
 defences+1.993
 insure+1.957
Neg logit contributions
Hour-2.051
andestine-2.010
risome-2.006
table-1.995
GAN-1.993
Token OP
Token ID13349
Feature activation+9.57e-02

Pos logit contributions
Hour+1.194
GAN+1.190
table+1.189
Band+1.173
chin+1.161
Neg logit contributions
-1.441
ITNESS-1.392
 Edmund-1.274
 defences-1.212
 insure-1.202
TokenIN
Token ID1268
Feature activation-2.20e-01

Pos logit contributions
Hour+0.536
table+0.535
GAN+0.532
Band+0.530
risome+0.526
Neg logit contributions
-0.418
ITNESS-0.407
 Edmund-0.361
 defences-0.335
 insure-0.332
TokenION
Token ID2849
Feature activation-6.89e-02

Pos logit contributions
+1.177
ITNESS+1.150
 Edmund+1.045
 defences+0.992
 insure+0.980
Neg logit contributions
Hour-1.012
table-1.008
GAN-0.997
chin-0.991
Band-0.990
Token next
Token ID1306
Feature activation+3.34e-02

Pos logit contributions
+1.780
ITNESS+1.692
 Edmund+1.608
 defences+1.497
 insure+1.483
Neg logit contributions
GAN-1.934
Hour-1.926
table-1.909
Band-1.873
andestine-1.869
Token-
Token ID12
Feature activation-4.26e-01

Pos logit contributions
Hour+0.140
table+0.139
GAN+0.139
Band+0.137
chin+0.136
Neg logit contributions
-0.196
ITNESS-0.189
 Edmund-0.173
 defences-0.167
 insure-0.164
Tokengeneration
Token ID20158
Feature activation-5.05e-02

Pos logit contributions
+2.236
ITNESS+2.159
 Edmund+1.994
 insure+1.883
 defences+1.871
Neg logit contributions
GAN-1.946
Hour-1.933
risome-1.891
andestine-1.872
table-1.865
Token BF
Token ID41646
Feature activation-5.34e-01

Pos logit contributions
+0.248
ITNESS+0.237
 Edmund+0.221
 defences+0.208
 insure+0.205
Neg logit contributions
Hour-0.256
GAN-0.255
table-0.254
Band-0.249
chin-0.247
TokenGood
Token ID10248
Feature activation-2.67e-01

Pos logit contributions
+3.196
ITNESS+3.111
 Edmund+2.922
 defences+2.768
 insure+2.763
Neg logit contributions
table-2.118
Hour-2.111
GAN-2.088
chin-2.061
andestine-2.033
Tokenrich
Token ID7527
Feature activation-1.22e+00

Pos logit contributions
+1.497
ITNESS+1.474
 Edmund+1.363
 insure+1.305
 defences+1.279
Neg logit contributions
GAN-1.143
risome-1.115
Hour-1.111
chin-1.109
table-1.103
Token All
Token ID1439
Feature activation-3.30e-02

Pos logit contributions
+5.908
 Edmund+5.600
ITNESS+5.556
 defences+5.150
 insure+5.082
Neg logit contributions
Hour-5.685
GAN-5.677
table-5.660
andestine-5.576
risome-5.558
Token-
Token ID12
Feature activation-3.59e-01

Pos logit contributions
+0.196
ITNESS+0.189
 Edmund+0.178
 defences+0.168
 insure+0.167
Neg logit contributions
GAN-0.133
table-0.132
Hour-0.132
risome-0.128
chin-0.127
TokenTer
Token ID15156
Feature activation-3.30e-01

Pos logit contributions
+2.252
ITNESS+2.122
 Edmund+2.027
 insure+1.922
 defences+1.903
Neg logit contributions
andestine-1.300
GAN-1.295
risome-1.285
table-1.277
Hour-1.241
Tokenrain
Token ID3201
Feature activation-1.48e-01

Pos logit contributions
+2.103
ITNESS+2.026
 Edmund+1.944
 insure+1.837
 defences+1.831
Neg logit contributions
GAN-1.163
Hour-1.152
andestine-1.120
table-1.117
Band-1.093
Token KO
Token ID21054
Feature activation-7.74e-01

Pos logit contributions
+0.735
ITNESS+0.700
 Edmund+0.660
 defences+0.616
 insure+0.613
Neg logit contributions
Hour-0.743
table-0.741
GAN-0.738
chin-0.721
andestine-0.717
Token energy
Token ID2568
Feature activation+3.46e-01

Pos logit contributions
Hour+0.752
table+0.748
GAN+0.747
Band+0.738
chin+0.732
Neg logit contributions
-0.681
ITNESS-0.659
 Edmund-0.593
 defences-0.564
 insure-0.556
Token goals
Token ID4661
Feature activation+6.19e-01

Pos logit contributions
Hour+1.846
GAN+1.842
table+1.840
Band+1.815
chin+1.801
Neg logit contributions
-1.617
ITNESS-1.549
 Edmund-1.388
 defences-1.311
 insure-1.294
Token.
Token ID13
Feature activation+9.63e-02

Pos logit contributions
table+2.695
Hour+2.691
GAN+2.671
Band+2.634
chin+2.598
Neg logit contributions
-3.532
ITNESS-3.392
 Edmund-3.065
 defences-2.943
 insure-2.911
Token<|endoftext|>
Token ID50256
Feature activation+3.42e-02

Pos logit contributions
Hour+0.366
GAN+0.365
table+0.364
Band+0.357
chin+0.354
Neg logit contributions
-0.598
ITNESS-0.580
 Edmund-0.536
 defences-0.513
 insure-0.509
TokenMedia
Token ID13152
Feature activation-4.24e-01

Pos logit contributions
GAN+0.135
Hour+0.135
table+0.134
Band+0.131
chin+0.131
Neg logit contributions
-0.207
ITNESS-0.200
 Edmund-0.183
 defences-0.175
 insure-0.173
Token playback
Token ID16388
Feature activation-1.21e+00

Pos logit contributions
+1.481
ITNESS+1.412
 Edmund+1.190
 defences+1.145
 insure+1.100
Neg logit contributions
table-2.722
Hour-2.718
GAN-2.708
Band-2.680
chin-2.661
Token is
Token ID318
Feature activation-3.85e-01

Pos logit contributions
+5.520
ITNESS+5.476
 Edmund+5.433
 defences+5.106
 insure+4.880
Neg logit contributions
risome-5.601
Band-5.574
andestine-5.531
rats-5.530
chin-5.516
Token unsupported
Token ID24222
Feature activation-4.21e-01

Pos logit contributions
+1.022
ITNESS+0.960
 Edmund+0.804
 defences+0.719
 insure+0.691
Neg logit contributions
GAN-2.801
Hour-2.797
table-2.787
Band-2.766
chin-2.744
Token on
Token ID319
Feature activation-1.91e-01

Pos logit contributions
+2.044
ITNESS+1.997
 Edmund+1.862
 defences+1.733
 insure+1.692
Neg logit contributions
Hour-2.078
GAN-2.070
risome-2.055
table-2.051
Band-2.042
Token your
Token ID534
Feature activation-6.58e-01

Pos logit contributions
+0.781
ITNESS+0.758
 Edmund+0.678
 defences+0.643
 insure+0.619
Neg logit contributions
Hour-1.109
GAN-1.103
table-1.093
Band-1.085
chin-1.082
Token device
Token ID3335
Feature activation-9.73e-03

Pos logit contributions
ITNESS+2.097
+1.989
 Edmund+1.886
 defences+1.784
 insurer+1.709
Neg logit contributions
Hour-4.332
GAN-4.322
chin-4.258
Band-4.248
risome-4.185
Token dollars
Token ID5054
Feature activation+4.42e-01

Pos logit contributions
Hour+0.293
GAN+0.293
table+0.291
chin+0.288
Band+0.288
Neg logit contributions
-0.224
ITNESS-0.214
 Edmund-0.193
 defences-0.182
 insure-0.179
Token.
Token ID13
Feature activation+8.17e-02

Pos logit contributions
Hour+1.980
GAN+1.973
table+1.973
Band+1.940
chin+1.922
Neg logit contributions
-2.456
ITNESS-2.371
 Edmund-2.161
 defences-2.065
 insure-2.041
Token
Token ID198
Feature activation+1.63e-03

Pos logit contributions
Hour+0.328
GAN+0.327
table+0.325
Band+0.320
chin+0.319
Neg logit contributions
-0.487
ITNESS-0.474
 Edmund-0.443
 defences-0.415
 insure-0.412
Token
Token ID198
Feature activation+8.44e-02

Pos logit contributions
GAN+0.006
Hour+0.006
table+0.006
risome+0.006
chin+0.006
Neg logit contributions
-0.010
ITNESS-0.010
 Edmund-0.009
 defences-0.008
 insure-0.008
TokenStory
Token ID11605
Feature activation-6.22e-02

Pos logit contributions
GAN+0.336
Hour+0.336
table+0.335
Band+0.328
chin+0.327
Neg logit contributions
-0.507
ITNESS-0.490
 Edmund-0.451
 defences-0.430
 insure-0.426
Token continues
Token ID4477
Feature activation-1.21e+00

Pos logit contributions
+0.273
ITNESS+0.262
 Edmund+0.240
 defences+0.224
 insure+0.218
Neg logit contributions
Hour-0.348
GAN-0.347
table-0.341
Band-0.339
chin-0.334
Token below
Token ID2174
Feature activation-4.36e-01

Pos logit contributions
 Edmund+3.336
ITNESS+3.323
+3.317
 defences+2.932
 insure+2.748
Neg logit contributions
GAN-8.059
Hour-7.910
Band-7.873
Role-7.719
risome-7.700
Token advertisement
Token ID15422
Feature activation+4.45e-01

Pos logit contributions
+1.612
ITNESS+1.557
 Edmund+1.489
 defences+1.350
 insure+1.316
Neg logit contributions
GAN-2.643
Hour-2.625
Band-2.567
andestine-2.561
risome-2.554
Token
Token ID198
Feature activation+7.36e-01

Pos logit contributions
Hour+1.625
GAN+1.620
table+1.615
Band+1.585
chin+1.568
Neg logit contributions
-2.835
ITNESS-2.751
 Edmund-2.548
 defences-2.447
 insure-2.424
Token
Token ID198
Feature activation+5.73e-01

Pos logit contributions
Hour+2.531
table+2.527
GAN+2.504
Band+2.469
chin+2.408
Neg logit contributions
-4.886
ITNESS-4.712
 Edmund-4.330
 defences-4.231
 insure-4.197
TokenStory
Token ID11605
Feature activation-1.98e-01

Pos logit contributions
Hour+1.193
GAN+1.172
table+1.171
Band+1.144
chin+1.100
Neg logit contributions
-4.569
ITNESS-4.458
 Edmund-4.195
 defences-4.082
 insure-4.052
Token of
Token ID286
Feature activation-3.03e-01

Pos logit contributions
+1.782
ITNESS+1.720
 Edmund+1.572
 defences+1.502
 insure+1.485
Neg logit contributions
Hour-1.432
GAN-1.428
table-1.425
Band-1.403
chin-1.390
Token women
Token ID1466
Feature activation-6.32e-02

Pos logit contributions
+1.424
ITNESS+1.372
 Edmund+1.266
 defences+1.201
 insure+1.171
Neg logit contributions
GAN-1.579
Hour-1.573
table-1.569
Band-1.551
chin-1.527
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
+0.319
ITNESS+0.306
 Edmund+0.281
 defences+0.269
 insure+0.263
Neg logit contributions
Hour-0.310
GAN-0.309
table-0.309
Band-0.304
chin-0.302
Token men
Token ID1450
Feature activation-1.11e-01

Pos logit contributions
+0.710
ITNESS+0.681
 Edmund+0.633
 defences+0.599
 insure+0.587
Neg logit contributions
Hour-0.707
GAN-0.707
table-0.699
chin-0.689
Band-0.688
Token and
Token ID290
Feature activation+1.28e-01

Pos logit contributions
+0.616
ITNESS+0.597
 Edmund+0.543
 defences+0.520
 insure+0.514
Neg logit contributions
Hour-0.497
table-0.495
GAN-0.493
Band-0.488
chin-0.481
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
Hour+0.674
GAN+0.667
table+0.664
Band+0.658
chin+0.652
Neg logit contributions
-0.607
ITNESS-0.585
 Edmund-0.532
 defences-0.507
 insure-0.499
Token increasingly
Token ID6481
Feature activation-4.73e-01

Pos logit contributions
+5.002
ITNESS+4.899
 Edmund+4.872
 defences+4.646
 insurer+4.502
Neg logit contributions
GAN-6.137
Hour-6.074
risome-5.996
Band-5.920
Panel-5.911
Token,
Token ID11
Feature activation-4.76e-02

Pos logit contributions
+2.691
ITNESS+2.587
 Edmund+2.379
 defences+2.282
 insure+2.253
Neg logit contributions
Hour-2.039
GAN-2.039
table-2.023
Band-1.993
chin-1.980
Token LGBT
Token ID9916
Feature activation-7.28e-02

Pos logit contributions
+0.221
ITNESS+0.213
 Edmund+0.197
 defences+0.186
 insure+0.184
Neg logit contributions
Hour-0.252
GAN-0.249
table-0.247
Band-0.246
risome-0.244
Token people
Token ID661
Feature activation+5.36e-02

Pos logit contributions
+0.363
ITNESS+0.350
 Edmund+0.316
 defences+0.300
 insure+0.296
Neg logit contributions
Hour-0.364
table-0.363
GAN-0.362
Band-0.359
chin-0.354
Token,
Token ID11
Feature activation-4.82e-02

Pos logit contributions
Hour+0.252
GAN+0.251
table+0.250
Band+0.246
chin+0.245
Neg logit contributions
-0.285
ITNESS-0.274
 Edmund-0.251
 defences-0.239
 insure-0.236
Token ).
Token ID6739
Feature activation-3.81e-01

Pos logit contributions
Hour+0.301
GAN+0.301
table+0.301
Band+0.296
chin+0.293
Neg logit contributions
-0.346
ITNESS-0.333
 Edmund-0.302
 defences-0.288
 insure-0.285
Token Cl
Token ID1012
Feature activation+2.15e-01

Pos logit contributions
+2.153
ITNESS+2.094
 Edmund+1.998
 defences+1.847
 insure+1.809
Neg logit contributions
Hour-1.593
GAN-1.581
table-1.574
chin-1.555
Band-1.550
Tokenueless
Token ID38835
Feature activation+3.74e-01

Pos logit contributions
Hour+1.027
GAN+1.024
table+1.017
Band+1.005
chin+0.995
Neg logit contributions
-1.119
ITNESS-1.082
 Edmund-0.986
 defences-0.938
 insure-0.925
Token or
Token ID393
Feature activation+2.11e-02

Pos logit contributions
Hour+1.805
GAN+1.801
table+1.801
Band+1.773
chin+1.761
Neg logit contributions
-1.936
ITNESS-1.861
 Edmund-1.686
 defences-1.601
 insure-1.585
Token powerful
Token ID3665
Feature activation+1.60e-01

Pos logit contributions
Hour+0.120
table+0.119
GAN+0.119
Band+0.117
chin+0.116
Neg logit contributions
-0.092
ITNESS-0.088
 Edmund-0.079
 defences-0.074
 insure-0.073
Token?
Token ID30
Feature activation-1.21e+00

Pos logit contributions
Hour+0.746
GAN+0.742
table+0.738
Band+0.730
chin+0.721
Neg logit contributions
-0.860
ITNESS-0.832
 Edmund-0.763
 defences-0.733
 insure-0.721
Token Ident
Token ID11440
Feature activation-1.63e-01

Pos logit contributions
+6.546
ITNESS+6.369
 Edmund+6.307
 defences+5.758
 insure+5.572
Neg logit contributions
Hour-4.794
GAN-4.731
risome-4.701
Band-4.689
table-4.684
Tokenifying
Token ID4035
Feature activation-2.98e-01

Pos logit contributions
+0.859
ITNESS+0.835
 Edmund+0.758
 defences+0.730
 insure+0.722
Neg logit contributions
Hour-0.758
GAN-0.756
table-0.748
Band-0.736
chin-0.727
Token sub
Token ID850
Feature activation+1.86e-01

Pos logit contributions
+1.402
ITNESS+1.351
 Edmund+1.243
 defences+1.195
 insure+1.158
Neg logit contributions
Hour-1.566
GAN-1.555
table-1.543
Band-1.524
chin-1.510
Tokentypes
Token ID19199
Feature activation+4.92e-01

Pos logit contributions
Hour+0.712
GAN+0.709
table+0.704
Band+0.694
chin+0.685
Neg logit contributions
-1.150
ITNESS-1.116
 Edmund-1.032
 defences-0.992
 insure-0.981
Token of
Token ID286
Feature activation+2.00e-01

Pos logit contributions
table+2.110
Hour+2.109
GAN+2.106
Band+2.069
chin+2.059
Neg logit contributions
-2.816
ITNESS-2.726
 Edmund-2.472
 defences-2.355
 insure-2.330
Token in
Token ID287
Feature activation-2.56e-01

Pos logit contributions
+0.022
ITNESS+0.021
 Edmund+0.019
 defences+0.018
 insure+0.018
Neg logit contributions
GAN-0.019
Hour-0.019
table-0.019
Band-0.018
chin-0.018
Token Sweden
Token ID10710
Feature activation-3.33e-01

Pos logit contributions
+1.228
ITNESS+1.186
 Edmund+1.103
 defences+1.052
 insure+1.020
Neg logit contributions
Hour-1.313
GAN-1.304
table-1.289
Band-1.280
chin-1.275
Token,
Token ID11
Feature activation-3.19e-01

Pos logit contributions
+1.821
ITNESS+1.755
 Edmund+1.623
 defences+1.542
 insure+1.524
Neg logit contributions
Hour-1.501
GAN-1.498
table-1.488
Band-1.469
chin-1.455
Token Hungary
Token ID19562
Feature activation+3.02e-01

Pos logit contributions
+1.608
ITNESS+1.572
 Edmund+1.480
 defences+1.396
 insure+1.373
Neg logit contributions
Hour-1.535
GAN-1.514
table-1.510
Band-1.492
chin-1.491
Token and
Token ID290
Feature activation-1.05e-02

Pos logit contributions
table+1.241
Hour+1.241
GAN+1.234
Band+1.223
chin+1.204
Neg logit contributions
-1.774
ITNESS-1.724
 Edmund-1.557
 defences-1.490
 insure-1.477
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
+0.055
ITNESS+0.053
 Edmund+0.049
 defences+0.047
 insure+0.046
Neg logit contributions
Hour-0.050
GAN-0.049
table-0.049
Band-0.049
chin-0.048
Token according
Token ID1864
Feature activation-2.54e-01

Pos logit contributions
+5.093
ITNESS+5.058
 Edmund+5.038
 defences+4.845
 insurer+4.651
Neg logit contributions
GAN-6.095
Hour-5.997
risome-5.884
Band-5.882
Panel-5.840
Token to
Token ID284
Feature activation-1.63e-01

Pos logit contributions
+1.338
ITNESS+1.288
 Edmund+1.198
 defences+1.128
 insure+1.106
Neg logit contributions
Hour-1.176
GAN-1.175
table-1.163
Band-1.156
risome-1.156
Token several
Token ID1811
Feature activation-2.33e-01

Pos logit contributions
+0.841
ITNESS+0.812
 Edmund+0.751
 defences+0.715
 insure+0.696
Neg logit contributions
Hour-0.791
GAN-0.786
table-0.775
Band-0.770
chin-0.761
Token European
Token ID3427
Feature activation+5.13e-01

Pos logit contributions
+1.059
ITNESS+1.028
 Edmund+0.945
 defences+0.904
 insure+0.871
Neg logit contributions
GAN-1.247
Hour-1.247
Band-1.227
table-1.225
risome-1.205
Token sources
Token ID4237
Feature activation-1.59e-01

Pos logit contributions
table+2.536
Hour+2.534
GAN+2.528
Band+2.490
chin+2.471
Neg logit contributions
-2.607
ITNESS-2.501
 Edmund-2.254
 defences-2.132
 insure-2.116
Token2
Token ID17
Feature activation-5.73e-02

Pos logit contributions
+0.260
ITNESS+0.252
 Edmund+0.231
 defences+0.221
 insure+0.219
Neg logit contributions
Hour-0.192
GAN-0.191
table-0.191
Band-0.188
chin-0.186
Token Podcast
Token ID16036
Feature activation-1.74e-01

Pos logit contributions
+0.337
ITNESS+0.326
 Edmund+0.302
 defences+0.289
 insure+0.285
Neg logit contributions
GAN-0.234
Hour-0.233
table-0.232
chin-0.229
Band-0.229
Token!
Token ID0
Feature activation-4.42e-02

Pos logit contributions
+0.963
ITNESS+0.929
 Edmund+0.855
 defences+0.814
 insure+0.804
Neg logit contributions
GAN-0.771
Hour-0.769
table-0.768
Band-0.755
chin-0.752
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
+0.259
ITNESS+0.251
 Edmund+0.232
 defences+0.221
 insure+0.219
Neg logit contributions
GAN-0.181
Hour-0.181
table-0.181
chin-0.177
Band-0.177
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
Hour+0.839
GAN+0.837
table+0.835
chin+0.820
risome+0.820
Neg logit contributions
-1.383
ITNESS-1.343
 Edmund-1.241
 defences-1.176
 insure-1.161
Tokenpur
Token ID14225
Feature activation+8.47e-01

Pos logit contributions
Hour+0.868
GAN+0.867
table+0.864
Band+0.847
chin+0.841
Neg logit contributions
-1.347
ITNESS-1.304
 Edmund-1.201
 defences-1.152
 insure-1.141
Tokenak
Token ID461
Feature activation+1.17e-01

Pos logit contributions
table+3.857
Hour+3.827
GAN+3.811
chin+3.807
Band+3.755
Neg logit contributions
-4.555
ITNESS-4.416
 Edmund-4.037
 defences-3.857
 insure-3.810
Tokenushi
Token ID17731
Feature activation-8.17e-02

Pos logit contributions
Hour+0.503
GAN+0.502
table+0.500
Band+0.492
chin+0.487
Neg logit contributions
-0.665
ITNESS-0.643
 Edmund-0.589
 defences-0.564
 insure-0.558
Token Profile
Token ID13118
Feature activation-9.14e-02

Pos logit contributions
+0.476
ITNESS+0.458
 Edmund+0.422
 defences+0.403
 insure+0.400
Neg logit contributions
GAN-0.340
table-0.339
Hour-0.339
Band-0.331
chin-0.331
Token Joined
Token ID16709
Feature activation-2.39e-01

Pos logit contributions
+0.549
ITNESS+0.529
 Edmund+0.494
 defences+0.472
 insure+0.467
Neg logit contributions
Hour-0.364
GAN-0.363
table-0.362
Band-0.354
chin-0.351
Token August
Token ID2932
Feature activation+3.79e-01

Pos logit contributions
+1.171
ITNESS+1.130
 Edmund+1.071
 defences+0.994
 insure+0.970
Neg logit contributions
GAN-1.176
Hour-1.171
table-1.168
chin-1.155
risome-1.152
Token positions
Token ID6116
Feature activation+2.16e-01

Pos logit contributions
Hour+0.379
table+0.379
GAN+0.378
Band+0.374
chin+0.367
Neg logit contributions
-0.367
ITNESS-0.359
 Edmund-0.326
 defences-0.310
 insure-0.303
Token.
Token ID13
Feature activation+4.48e-01

Pos logit contributions
Hour+0.959
GAN+0.956
table+0.954
Band+0.939
chin+0.931
Neg logit contributions
-1.202
ITNESS-1.161
 Edmund-1.061
 defences-1.014
 insure-1.002
Token
Token ID564
Feature activation+1.87e-01

Pos logit contributions
Hour+1.937
GAN+1.932
table+1.930
Band+1.895
chin+1.879
Neg logit contributions
-2.551
ITNESS-2.468
 Edmund-2.267
 defences-2.152
 insure-2.129
Token
Token ID250
Feature activation+5.68e-01

Pos logit contributions
Hour+1.222
GAN+1.219
table+1.217
Band+1.205
chin+1.197
Neg logit contributions
-0.656
ITNESS-0.620
 Edmund-0.534
 defences-0.494
 insure-0.483
TokenBut
Token ID1537
Feature activation+3.65e-01

Pos logit contributions
Hour+2.113
GAN+2.112
table+2.111
Band+2.059
chin+2.049
Neg logit contributions
-3.576
ITNESS-3.468
 Edmund-3.205
 defences-3.062
 insure-3.033
Token what
Token ID644
Feature activation+8.65e-01

Pos logit contributions
Hour+1.653
GAN+1.649
table+1.646
Band+1.620
chin+1.598
Neg logit contributions
-1.996
ITNESS-1.942
 Edmund-1.787
 defences-1.689
 insure-1.675
Token does
Token ID857
Feature activation+5.33e-01

Pos logit contributions
Hour+3.783
table+3.772
GAN+3.766
Band+3.714
chin+3.694
Neg logit contributions
-4.858
ITNESS-4.675
 Edmund-4.236
 defences-4.054
 insure-4.016
Token that
Token ID326
Feature activation+6.45e-01

Pos logit contributions
Hour+2.211
GAN+2.205
table+2.201
Band+2.161
chin+2.140
Neg logit contributions
-3.134
ITNESS-3.035
 Edmund-2.791
 defences-2.668
 insure-2.642
Token even
Token ID772
Feature activation+3.36e-01

Pos logit contributions
GAN+3.136
Hour+3.135
table+3.129
Band+3.087
chin+3.066
Neg logit contributions
-3.307
ITNESS-3.177
 Edmund-2.873
 defences-2.746
 insure-2.691
Token mean
Token ID1612
Feature activation+4.93e-01

Pos logit contributions
Hour+1.621
GAN+1.617
table+1.614
Band+1.591
chin+1.577
Neg logit contributions
-1.747
ITNESS-1.682
 Edmund-1.528
 defences-1.454
 insure-1.438
Token?
Token ID30
Feature activation+2.34e-01

Pos logit contributions
GAN+1.705
table+1.702
Hour+1.701
Band+1.667
chin+1.642
Neg logit contributions
-3.222
ITNESS-3.129
 Edmund-2.900
 defences-2.800
 insure-2.772
Tokenenson
Token ID19069
Feature activation+1.44e-01

Pos logit contributions
Hour+0.427
table+0.422
GAN+0.422
Band+0.418
risome+0.412
Neg logit contributions
-0.452
ITNESS-0.446
 Edmund-0.404
 defences-0.379
 insure-0.379
Token as
Token ID355
Feature activation+5.22e-02

Pos logit contributions
Hour+0.655
GAN+0.653
table+0.653
Band+0.643
chin+0.636
Neg logit contributions
-0.791
ITNESS-0.762
 Edmund-0.695
 defences-0.664
 insure-0.656
Token Row
Token ID11314
Feature activation-6.69e-02

Pos logit contributions
Hour+0.245
GAN+0.244
table+0.243
Band+0.240
chin+0.240
Neg logit contributions
-0.276
ITNESS-0.267
 Edmund-0.245
 defences-0.232
 insure-0.229
Tokenlf
Token ID1652
Feature activation-2.09e-01

Pos logit contributions
+0.243
ITNESS+0.231
 Edmund+0.204
 defences+0.187
 insure+0.183
Neg logit contributions
Hour-0.423
GAN-0.422
table-0.420
Band-0.417
chin-0.413
Token.[
Token ID3693
Feature activation+3.30e-01

Pos logit contributions
+1.133
ITNESS+1.092
 Edmund+1.004
 defences+0.953
 insure+0.942
Neg logit contributions
Hour-0.955
GAN-0.953
table-0.949
Band-0.934
chin-0.927
Token4
Token ID19
Feature activation+8.55e-01

Pos logit contributions
Hour+1.572
table+1.568
GAN+1.566
Band+1.544
chin+1.528
Neg logit contributions
-1.743
ITNESS-1.679
 Edmund-1.523
 defences-1.454
 insure-1.436
Token][
Token ID7131
Feature activation+7.03e-01

Pos logit contributions
table+2.607
Hour+2.606
GAN+2.573
Band+2.555
chin+2.486
Neg logit contributions
-5.805
ITNESS-5.652
 Edmund-5.267
 defences-5.101
 insure-5.040
Token9
Token ID24
Feature activation+4.94e-01

Pos logit contributions
table+3.236
Hour+3.207
GAN+3.177
Band+3.161
chin+3.115
Neg logit contributions
-3.848
ITNESS-3.722
 Edmund-3.351
 defences-3.225
 insure-3.184
Token]
Token ID60
Feature activation+3.24e-01

Pos logit contributions
table+1.583
Hour+1.579
GAN+1.564
Band+1.548
chin+1.502
Neg logit contributions
-3.356
ITNESS-3.257
 Edmund-3.008
 defences-2.920
 insure-2.895
Token Although
Token ID4900
Feature activation+3.49e-01

Pos logit contributions
Hour+1.404
GAN+1.400
table+1.398
Band+1.376
chin+1.362
Neg logit contributions
-1.847
ITNESS-1.784
 Edmund-1.632
 defences-1.565
 insure-1.547
Token released
Token ID2716
Feature activation+4.29e-02

Pos logit contributions
table+1.695
Hour+1.694
GAN+1.692
Band+1.670
chin+1.647
Neg logit contributions
-1.803
ITNESS-1.732
 Edmund-1.559
 defences-1.495
 insure-1.472
Token the
Token ID262
Feature activation-2.25e-02

Pos logit contributions
+1.248
ITNESS+1.211
 Edmund+1.121
 defences+1.066
 insure+1.046
Neg logit contributions
Hour-1.056
GAN-1.055
table-1.054
Band-1.031
chin-1.026
Token city
Token ID1748
Feature activation+1.91e-01

Pos logit contributions
+0.108
ITNESS+0.104
 Edmund+0.095
 defences+0.090
 insure+0.088
Neg logit contributions
Hour-0.117
GAN-0.116
table-0.116
Band-0.115
chin-0.113
Token is
Token ID318
Feature activation-1.98e-01

Pos logit contributions
Hour+0.840
GAN+0.838
table+0.838
Band+0.820
chin+0.813
Neg logit contributions
-1.072
ITNESS-1.031
 Edmund-0.949
 defences-0.909
 insure-0.899
Token violating
Token ID18134
Feature activation+3.47e-01

Pos logit contributions
+0.908
ITNESS+0.852
 Edmund+0.785
 defences+0.739
 insure+0.736
Neg logit contributions
Hour-1.067
GAN-1.063
table-1.056
risome-1.031
Band-1.030
Token her
Token ID607
Feature activation-1.08e-01

Pos logit contributions
Hour+1.671
table+1.668
GAN+1.666
Band+1.642
chin+1.629
Neg logit contributions
-1.803
ITNESS-1.731
 Edmund-1.557
 defences-1.488
 insure-1.473
Token rights
Token ID2489
Feature activation+9.69e-01

Pos logit contributions
+0.496
ITNESS+0.474
 Edmund+0.430
 defences+0.405
 insure+0.400
Neg logit contributions
GAN-0.589
Hour-0.587
table-0.584
Band-0.577
chin-0.572
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
table+4.155
Hour+4.135
GAN+4.063
Band+4.041
chin+3.996
Neg logit contributions
-5.496
ITNESS-5.346
 Edmund-4.790
 defences-4.559
 insure-4.531
Token
Token ID198
Feature activation-3.38e-02

Pos logit contributions
+1.776
ITNESS+1.737
 Edmund+1.631
 defences+1.495
 insure+1.492
Neg logit contributions
GAN-1.230
Hour-1.223
table-1.220
Band-1.204
chin-1.200
Token
Token ID198
Feature activation-6.06e-02

Pos logit contributions
+0.200
ITNESS+0.195
 Edmund+0.183
 defences+0.170
 insure+0.168
Neg logit contributions
GAN-0.134
Hour-0.133
risome-0.132
table-0.131
chin-0.130
TokenH
Token ID39
Feature activation-1.63e-01

Pos logit contributions
+0.370
ITNESS+0.358
 Edmund+0.329
 defences+0.311
 insure+0.309
Neg logit contributions
GAN-0.233
table-0.233
Hour-0.231
risome-0.229
chin-0.227
Tokenilton
Token ID9044
Feature activation+1.83e-01

Pos logit contributions
+0.763
ITNESS+0.739
 Edmund+0.658
 defences+0.621
 insure+0.613
Neg logit contributions
GAN-0.868
Hour-0.862
table-0.861
Band-0.854
chin-0.843
Tokenphan
Token ID19080
Feature activation-1.02e-01

Pos logit contributions
Hour+1.164
table+1.158
GAN+1.152
Band+1.135
chin+1.120
Neg logit contributions
-1.744
ITNESS-1.702
 Edmund-1.565
 defences-1.497
 insure-1.486
Token ultimately
Token ID6165
Feature activation-1.04e-01

Pos logit contributions
+0.560
ITNESS+0.542
 Edmund+0.500
 defences+0.474
 insure+0.470
Neg logit contributions
Hour-0.457
GAN-0.455
table-0.453
Band-0.445
chin-0.442
Token ends
Token ID5645
Feature activation-6.39e-02

Pos logit contributions
+0.519
ITNESS+0.510
 Edmund+0.476
 defences+0.451
 insure+0.450
Neg logit contributions
Hour-0.509
GAN-0.503
table-0.499
Band-0.493
risome-0.491
Token up
Token ID510
Feature activation-1.57e-01

Pos logit contributions
+0.372
ITNESS+0.363
 Edmund+0.339
 defences+0.322
 insure+0.317
Neg logit contributions
Hour-0.261
GAN-0.261
table-0.258
Band-0.254
chin-0.253
Token doing
Token ID1804
Feature activation+2.87e-01

Pos logit contributions
+0.745
ITNESS+0.731
 Edmund+0.678
 defences+0.637
 insure+0.624
Neg logit contributions
Hour-0.800
GAN-0.792
table-0.786
risome-0.777
Band-0.776
Token nothing
Token ID2147
Feature activation+8.57e-01

Pos logit contributions
Hour+1.370
GAN+1.363
table+1.359
Band+1.342
chin+1.330
Neg logit contributions
-1.498
ITNESS-1.449
 Edmund-1.323
 defences-1.258
 insure-1.242
Token.
Token ID13
Feature activation+4.69e-02

Pos logit contributions
table+3.644
Hour+3.618
GAN+3.594
Band+3.540
chin+3.485
Neg logit contributions
-4.935
ITNESS-4.766
 Edmund-4.260
 defences-4.060
 insure-4.058
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
Hour+0.198
GAN+0.195
table+0.195
risome+0.192
chin+0.192
Neg logit contributions
-0.268
ITNESS-0.262
 Edmund-0.246
 defences-0.227
 insure-0.225
Token
Token ID198
Feature activation+1.93e-01

Pos logit contributions
Hour+0.571
GAN+0.571
table+0.563
Band+0.556
chin+0.555
Neg logit contributions
-0.905
ITNESS-0.884
 Edmund-0.824
 defences-0.777
 insure-0.769
TokenBut
Token ID1537
Feature activation+5.23e-02

Pos logit contributions
Hour+0.752
GAN+0.751
table+0.748
risome+0.732
Band+0.731
Neg logit contributions
-1.178
ITNESS-1.142
 Edmund-1.050
 defences-1.002
 insure-0.992
Token now
Token ID783
Feature activation-1.25e-01

Pos logit contributions
Hour+0.242
GAN+0.239
table+0.238
Band+0.236
chin+0.232
Neg logit contributions
-0.279
ITNESS-0.271
 Edmund-0.251
 defences-0.238
 insure-0.235
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
Hour+0.205
table+0.205
GAN+0.205
Band+0.201
chin+0.200
Neg logit contributions
-0.233
ITNESS-0.225
 Edmund-0.203
 defences-0.194
 insure-0.192
Token wheel
Token ID7825
Feature activation-9.46e-02

Pos logit contributions
Hour+0.903
GAN+0.901
table+0.900
Band+0.890
chin+0.885
Neg logit contributions
-0.507
ITNESS-0.480
 Edmund-0.416
 defences-0.385
 insure-0.377
Token of
Token ID286
Feature activation+2.40e-02

Pos logit contributions
+0.523
ITNESS+0.505
 Edmund+0.462
 defences+0.441
 insure+0.436
Neg logit contributions
Hour-0.424
GAN-0.423
table-0.422
Band-0.416
chin-0.412
Token a
Token ID257
Feature activation+3.76e-02

Pos logit contributions
Hour+0.104
GAN+0.103
table+0.103
Band+0.101
chin+0.101
Neg logit contributions
-0.137
ITNESS-0.132
 Edmund-0.121
 defences-0.116
 insure-0.115
Token Ford
Token ID8092
Feature activation-1.99e-01

Pos logit contributions
Hour+0.208
GAN+0.208
table+0.207
Band+0.205
chin+0.203
Neg logit contributions
-0.168
ITNESS-0.161
 Edmund-0.145
 defences-0.136
 insure-0.134
Token Fiesta
Token ID49839
Feature activation+2.25e-01

Pos logit contributions
+1.040
ITNESS+1.009
 Edmund+0.926
 defences+0.875
 insure+0.870
Neg logit contributions
Hour-0.941
GAN-0.941
table-0.934
Band-0.922
chin-0.912
Token R
Token ID371
Feature activation-7.05e-02

Pos logit contributions
Hour+0.867
GAN+0.865
table+0.863
Band+0.848
chin+0.839
Neg logit contributions
-1.390
ITNESS-1.346
 Edmund-1.237
 defences-1.190
 insure-1.177
Token5
Token ID20
Feature activation-1.23e-01

Pos logit contributions
+0.378
ITNESS+0.364
 Edmund+0.330
 defences+0.316
 insure+0.311
Neg logit contributions
Hour-0.331
GAN-0.329
table-0.329
Band-0.325
chin-0.321
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
+0.661
ITNESS+0.639
 Edmund+0.589
 defences+0.560
 insure+0.553
Neg logit contributions
Hour-0.561
GAN-0.560
table-0.555
Band-0.550
chin-0.544
Token but
Token ID475
Feature activation+2.15e-02

Pos logit contributions
Hour+0.512
GAN+0.511
table+0.510
Band+0.502
chin+0.497
Neg logit contributions
-0.638
ITNESS-0.616
 Edmund-0.564
 defences-0.538
 insure-0.532
Token the
Token ID262
Feature activation-1.51e-01

Pos logit contributions
Hour+0.101
GAN+0.101
table+0.101
Band+0.100
chin+0.099
Neg logit contributions
-0.114
ITNESS-0.110
 Edmund-0.101
 defences-0.096
 insure-0.094
Token city
Token ID1748
Feature activation+9.39e-02

Pos logit contributions
Hour+0.144
GAN+0.143
table+0.143
Band+0.141
chin+0.141
Neg logit contributions
-0.114
ITNESS-0.110
 Edmund-0.099
 defences-0.095
 insure-0.092
Token.
Token ID13
Feature activation-2.26e-01

Pos logit contributions
Hour+0.405
GAN+0.404
table+0.403
Band+0.397
chin+0.393
Neg logit contributions
-0.535
ITNESS-0.517
 Edmund-0.474
 defences-0.454
 insure-0.449
Token It
Token ID632
Feature activation-8.36e-02

Pos logit contributions
+1.239
ITNESS+1.205
 Edmund+1.139
 defences+1.036
 insure+1.026
Neg logit contributions
Hour-0.995
table-0.993
GAN-0.992
chin-0.981
risome-0.979
Token is
Token ID318
Feature activation-2.12e-01

Pos logit contributions
+0.475
ITNESS+0.458
 Edmund+0.422
 defences+0.405
 insure+0.401
Neg logit contributions
Hour-0.362
GAN-0.361
table-0.358
Band-0.353
chin-0.348
Token up
Token ID510
Feature activation+2.95e-01

Pos logit contributions
+1.028
ITNESS+0.986
 Edmund+0.931
 defences+0.874
 insure+0.859
Neg logit contributions
GAN-1.077
Hour-1.058
table-1.057
chin-1.043
Band-1.040
Token to
Token ID284
Feature activation+3.31e-01

Pos logit contributions
table+1.222
Hour+1.208
GAN+1.201
Band+1.183
chin+1.163
Neg logit contributions
-1.738
ITNESS-1.687
 Edmund-1.545
 defences-1.476
 insure-1.462
Token them
Token ID606
Feature activation+8.43e-02

Pos logit contributions
Hour+1.493
GAN+1.489
table+1.488
Band+1.465
chin+1.450
Neg logit contributions
-1.820
ITNESS-1.754
 Edmund-1.595
 defences-1.525
 insure-1.507
Token how
Token ID703
Feature activation-1.32e-01

Pos logit contributions
Hour+0.369
GAN+0.368
table+0.368
Band+0.362
chin+0.359
Neg logit contributions
-0.475
ITNESS-0.459
 Edmund-0.420
 defences-0.402
 insure-0.397
Token to
Token ID284
Feature activation+7.81e-02

Pos logit contributions
+0.761
ITNESS+0.732
 Edmund+0.685
 defences+0.653
 insure+0.643
Neg logit contributions
Hour-0.563
GAN-0.560
table-0.555
Band-0.548
chin-0.544
Token rule
Token ID3896
Feature activation+3.08e-01

Pos logit contributions
Hour+0.475
GAN+0.473
table+0.471
Band+0.468
chin+0.464
Neg logit contributions
-0.308
ITNESS-0.294
 Edmund-0.260
 insure-0.245
 defences-0.243
Token it
Token ID340
Feature activation+1.31e-01

Pos logit contributions
Hour+1.392
GAN+1.388
table+1.387
Band+1.364
chin+1.351
Neg logit contributions
-1.698
ITNESS-1.640
 Edmund-1.492
 defences-1.425
 insure-1.410
Token Dr
Token ID1583
Feature activation+2.71e-01

Pos logit contributions
Hour+0.927
GAN+0.925
table+0.923
Band+0.908
chin+0.900
Neg logit contributions
-1.174
ITNESS-1.134
 Edmund-1.036
 defences-0.990
 insure-0.980
Token.
Token ID13
Feature activation+5.86e-01

Pos logit contributions
GAN+1.330
table+1.323
Hour+1.322
chin+1.299
Band+1.295
Neg logit contributions
-1.398
ITNESS-1.338
 Edmund-1.181
 defences-1.141
 insure-1.137
Token Melissa
Token ID24164
Feature activation-2.22e-01

Pos logit contributions
GAN+2.841
table+2.823
Hour+2.817
chin+2.770
Band+2.770
Neg logit contributions
-3.033
ITNESS-2.923
 Edmund-2.542
 defences-2.490
 insure-2.482
Token Tall
Token ID22676
Feature activation-1.76e-01

Pos logit contributions
+1.176
ITNESS+1.136
 Edmund+1.063
 defences+0.996
 insure+0.978
Neg logit contributions
Hour-1.026
GAN-1.023
table-1.018
Band-1.003
chin-0.989
Tokenman
Token ID805
Feature activation+2.23e-01

Pos logit contributions
+1.002
ITNESS+0.974
 Edmund+0.910
 defences+0.865
 insure+0.858
Neg logit contributions
GAN-0.738
Hour-0.736
table-0.727
Band-0.721
risome-0.705
Token and
Token ID290
Feature activation+4.38e-01

Pos logit contributions
Hour+0.979
GAN+0.977
table+0.975
Band+0.959
chin+0.950
Neg logit contributions
-1.254
ITNESS-1.211
 Edmund-1.109
 defences-1.060
 insure-1.048
Token her
Token ID607
Feature activation-5.84e-02

Pos logit contributions
table+1.870
GAN+1.868
Hour+1.867
Band+1.828
chin+1.815
Neg logit contributions
-2.522
ITNESS-2.426
 Edmund-2.198
 defences-2.121
 insure-2.097
Token student
Token ID3710
Feature activation-7.13e-02

Pos logit contributions
+0.270
ITNESS+0.260
 Edmund+0.238
 defences+0.224
 insure+0.220
Neg logit contributions
Hour-0.314
GAN-0.311
table-0.309
Band-0.308
chin-0.305
Token Andrea
Token ID23174
Feature activation-2.30e-01

Pos logit contributions
+0.397
ITNESS+0.385
 Edmund+0.359
 defences+0.340
 insure+0.335
Neg logit contributions
Hour-0.314
GAN-0.312
table-0.311
Band-0.308
chin-0.304
Token Morrow
Token ID34693
Feature activation+3.23e-01

Pos logit contributions
+1.253
ITNESS+1.210
 Edmund+1.137
 defences+1.063
 insure+1.046
Neg logit contributions
Hour-1.037
GAN-1.032
table-1.030
Band-1.014
chin-1.004
Token,
Token ID11
Feature activation+1.79e-01

Pos logit contributions
Hour+1.455
GAN+1.449
table+1.446
Band+1.425
chin+1.410
Neg logit contributions
-1.777
ITNESS-1.717
 Edmund-1.571
 defences-1.497
 insure-1.482
Token In
Token ID554
Feature activation-5.34e-01

Pos logit contributions
+1.730
ITNESS+1.675
 Edmund+1.565
 defences+1.420
 insure+1.416
Neg logit contributions
GAN-1.444
table-1.443
Hour-1.442
chin-1.428
risome-1.427
Token 2011
Token ID2813
Feature activation-2.12e-01

Pos logit contributions
+2.743
ITNESS+2.624
 Edmund+2.441
 defences+2.272
 insure+2.251
Neg logit contributions
GAN-2.560
table-2.534
Hour-2.532
chin-2.494
Band-2.478
Token,
Token ID11
Feature activation-2.12e-01

Pos logit contributions
+1.203
ITNESS+1.157
 Edmund+1.066
 defences+1.010
 insure+1.000
Neg logit contributions
Hour-0.923
GAN-0.923
table-0.919
Band-0.905
chin-0.900
Token we
Token ID356
Feature activation+2.31e-01

Pos logit contributions
+1.156
ITNESS+1.115
 Edmund+1.026
 defences+0.968
 insure+0.960
Neg logit contributions
GAN-0.964
Hour-0.962
table-0.960
Band-0.944
chin-0.944
Token celebrated
Token ID13943
Feature activation+1.23e-01

Pos logit contributions
Hour+1.124
GAN+1.119
table+1.119
Band+1.103
chin+1.091
Neg logit contributions
-1.187
ITNESS-1.146
 Edmund-1.043
 defences-0.986
 insure-0.980
Token our
Token ID674
Feature activation+2.14e-01

Pos logit contributions
Hour+0.604
GAN+0.603
table+0.602
Band+0.593
chin+0.589
Neg logit contributions
-0.624
ITNESS-0.601
 Edmund-0.546
 defences-0.517
 insure-0.511
Token 1
Token ID352
Feature activation+7.74e-02

Pos logit contributions
Hour+1.103
GAN+1.101
table+1.099
Band+1.082
chin+1.073
Neg logit contributions
-1.040
ITNESS-1.000
 Edmund-0.907
 defences-0.857
 insure-0.845
Token million
Token ID1510
Feature activation-2.23e-01

Pos logit contributions
Hour+0.333
GAN+0.332
table+0.332
Band+0.326
chin+0.323
Neg logit contributions
-0.442
ITNESS-0.427
 Edmund-0.392
 defences-0.374
 insure-0.370
Tokenth
Token ID400
Feature activation+3.02e-03

Pos logit contributions
+1.524
ITNESS+1.477
 Edmund+1.382
 defences+1.326
 insure+1.319
Neg logit contributions
GAN-0.716
Hour-0.698
table-0.694
Band-0.681
risome-0.679
Token beer
Token ID6099
Feature activation-2.00e-01

Pos logit contributions
GAN+0.016
Hour+0.016
table+0.015
Band+0.015
chin+0.015
Neg logit contributions
-0.015
ITNESS-0.014
 Edmund-0.013
 defences-0.012
 insure-0.012
Token check
Token ID2198
Feature activation-1.49e-01

Pos logit contributions
+0.938
ITNESS+0.897
 Edmund+0.811
 defences+0.760
 insure+0.756
Neg logit contributions
GAN-1.065
Hour-1.061
table-1.052
Band-1.042
chin-1.036
Token episode
Token ID4471
Feature activation-2.24e-01

Pos logit contributions
+1.164
ITNESS+1.113
 Edmund+1.020
 defences+0.955
 insure+0.942
Neg logit contributions
Hour-1.342
GAN-1.341
table-1.327
Band-1.310
chin-1.307
Token at
Token ID379
Feature activation-8.39e-02

Pos logit contributions
+1.178
ITNESS+1.134
 Edmund+1.044
 defences+0.993
 insure+0.977
Neg logit contributions
GAN-1.058
Hour-1.054
table-1.043
Band-1.036
chin-1.030
Token length
Token ID4129
Feature activation+8.83e-02

Pos logit contributions
+0.424
ITNESS+0.408
 Edmund+0.370
 defences+0.351
 insure+0.347
Neg logit contributions
Hour-0.417
GAN-0.416
table-0.415
Band-0.409
chin-0.406
Token.
Token ID13
Feature activation-4.03e-02

Pos logit contributions
Hour+0.411
GAN+0.410
table+0.410
Band+0.404
chin+0.400
Neg logit contributions
-0.473
ITNESS-0.456
 Edmund-0.416
 defences-0.396
 insure-0.392
Token
Token ID564
Feature activation-7.44e-02

Pos logit contributions
+0.231
ITNESS+0.224
 Edmund+0.208
 defences+0.194
 insure+0.193
Neg logit contributions
Hour-0.171
GAN-0.171
table-0.170
Band-0.168
chin-0.166
Token
Token ID250
Feature activation+2.81e-01

Pos logit contributions
+0.268
ITNESS+0.254
 Edmund+0.220
 defences+0.203
 insure+0.200
Neg logit contributions
Hour-0.477
GAN-0.476
table-0.476
Band-0.471
chin-0.468
TokenHe
Token ID1544
Feature activation+1.50e-01

Pos logit contributions
GAN+1.088
Hour+1.087
table+1.084
Band+1.062
chin+1.059
Neg logit contributions
-1.727
ITNESS-1.669
 Edmund-1.544
 defences-1.469
 insure-1.458
Token feels
Token ID5300
Feature activation+4.01e-01

Pos logit contributions
GAN+0.684
Hour+0.684
table+0.678
Band+0.669
chin+0.661
Neg logit contributions
-0.820
ITNESS-0.793
 Edmund-0.727
 defences-0.691
 insure-0.688
Token very
Token ID845
Feature activation-2.90e-02

Pos logit contributions
Hour+2.052
GAN+2.046
table+2.043
Band+2.016
chin+2.000
Neg logit contributions
-1.961
ITNESS-1.884
 Edmund-1.698
 defences-1.611
 insure-1.590
Token,
Token ID11
Feature activation+4.89e-01

Pos logit contributions
+0.122
ITNESS+0.117
 Edmund+0.105
 defences+0.098
 insure+0.097
Neg logit contributions
GAN-0.168
Hour-0.168
table-0.166
Band-0.165
chin-0.164
Token very
Token ID845
Feature activation-3.35e-02

Pos logit contributions
Hour+2.225
table+2.223
GAN+2.217
Band+2.183
chin+2.165
Neg logit contributions
-2.672
ITNESS-2.581
 Edmund-2.342
 defences-2.249
 insure-2.214
Token G
Token ID402
Feature activation+4.18e-01

Pos logit contributions
+2.209
ITNESS+2.142
 Edmund+2.032
 defences+1.876
 insure+1.853
Neg logit contributions
Hour-2.012
table-1.982
GAN-1.980
Band-1.964
chin-1.949
Tokenaylor
Token ID7167
Feature activation+3.89e-01

Pos logit contributions
Hour+1.936
GAN+1.931
table+1.926
Band+1.898
chin+1.880
Neg logit contributions
-2.252
ITNESS-2.174
 Edmund-1.980
 defences-1.889
 insure-1.867
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
Hour+1.596
GAN+1.592
table+1.591
chin+1.569
Band+1.560
Neg logit contributions
-2.291
ITNESS-2.218
 Edmund-2.016
 defences-1.961
 insure-1.930
Token<|endoftext|>
Token ID50256
Feature activation-3.63e-02

Pos logit contributions
+1.591
ITNESS+1.539
 Edmund+1.418
 defences+1.352
 insure+1.339
Neg logit contributions
Hour-1.143
GAN-1.138
table-1.136
Band-1.118
chin-1.107
TokenChief
Token ID23675
Feature activation+2.16e-02

Pos logit contributions
+0.219
ITNESS+0.212
 Edmund+0.195
 defences+0.186
 insure+0.184
Neg logit contributions
GAN-0.143
Hour-0.142
table-0.142
chin-0.139
Band-0.139
Token Special
Token ID6093
Feature activation-3.00e-01

Pos logit contributions
Hour+0.097
GAN+0.097
table+0.097
Band+0.095
chin+0.094
Neg logit contributions
-0.119
ITNESS-0.115
 Edmund-0.105
 defences-0.101
 insure-0.099
Token Prosecutor
Token ID37478
Feature activation+1.16e-01

Pos logit contributions
+1.745
ITNESS+1.707
 Edmund+1.590
 defences+1.529
 insure+1.501
Neg logit contributions
Hour-1.217
GAN-1.212
table-1.210
Band-1.187
chin-1.167
Token,
Token ID11
Feature activation-1.37e-01

Pos logit contributions
Hour+0.535
GAN+0.533
table+0.532
Band+0.524
chin+0.520
Neg logit contributions
-0.624
ITNESS-0.601
 Edmund-0.547
 defences-0.522
 insure-0.516
Token K
Token ID509
Feature activation+2.49e-01

Pos logit contributions
+0.744
ITNESS+0.721
 Edmund+0.665
 defences+0.628
 insure+0.621
Neg logit contributions
Hour-0.626
table-0.625
GAN-0.624
Band-0.612
chin-0.609
Tokenatic
Token ID1512
Feature activation-7.59e-03

Pos logit contributions
Hour+1.059
GAN+1.055
table+1.054
Band+1.036
chin+1.026
Neg logit contributions
-1.433
ITNESS-1.387
 Edmund-1.272
 defences-1.217
 insure-1.203
Tokena
Token ID64
Feature activation-6.81e-02

Pos logit contributions
+0.043
ITNESS+0.042
 Edmund+0.038
 defences+0.037
 insure+0.036
Neg logit contributions
Hour-0.033
GAN-0.033
table-0.033
Band-0.032
chin-0.032
Token held
Token ID2714
Feature activation+4.74e-01

Pos logit contributions
Hour+1.527
GAN+1.524
table+1.517
Band+1.495
chin+1.481
Neg logit contributions
-1.935
ITNESS-1.874
 Edmund-1.723
 defences-1.646
 insure-1.626
Token rates
Token ID3965
Feature activation+6.67e-01

Pos logit contributions
Hour+2.369
table+2.364
GAN+2.353
Band+2.320
chin+2.301
Neg logit contributions
-2.397
ITNESS-2.291
 Edmund-2.036
 defences-1.940
 insure-1.929
Token at
Token ID379
Feature activation+5.31e-01

Pos logit contributions
Hour+3.346
table+3.338
GAN+3.319
Band+3.279
chin+3.242
Neg logit contributions
-3.380
ITNESS-3.233
 Edmund-2.874
 defences-2.714
 insure-2.700
Token 0
Token ID657
Feature activation+2.98e-01

Pos logit contributions
Hour+2.447
table+2.447
GAN+2.426
Band+2.400
chin+2.371
Neg logit contributions
-2.898
ITNESS-2.771
 Edmund-2.500
 defences-2.382
 insure-2.365
Token.
Token ID13
Feature activation+1.32e-03

Pos logit contributions
Hour+1.340
GAN+1.337
table+1.334
Band+1.313
chin+1.302
Neg logit contributions
-1.651
ITNESS-1.593
 Edmund-1.454
 defences-1.390
 insure-1.375
Token5
Token ID20
Feature activation-1.19e-01

Pos logit contributions
Hour+0.006
GAN+0.006
table+0.006
chin+0.006
Band+0.006
Neg logit contributions
-0.007
ITNESS-0.007
 Edmund-0.006
 defences-0.006
 insure-0.006
Token%
Token ID4
Feature activation+4.16e-01

Pos logit contributions
+0.850
ITNESS+0.826
 Edmund+0.775
 defences+0.749
 insure+0.743
Neg logit contributions
Hour-0.339
GAN-0.338
table-0.337
Band-0.328
chin-0.325
Token since
Token ID1201
Feature activation+4.94e-02

Pos logit contributions
Hour+1.914
GAN+1.906
table+1.904
Band+1.875
chin+1.858
Neg logit contributions
-2.260
ITNESS-2.178
 Edmund-1.981
 defences-1.891
 insure-1.869
Token March
Token ID2805
Feature activation+5.27e-01

Pos logit contributions
Hour+0.239
GAN+0.239
table+0.238
Band+0.235
chin+0.233
Neg logit contributions
-0.254
ITNESS-0.245
 Edmund-0.225
 defences-0.213
 insure-0.210
Token 2009
Token ID3717
Feature activation+1.28e-01

Pos logit contributions
table+2.317
Hour+2.310
GAN+2.293
Band+2.255
chin+2.242
Neg logit contributions
-2.994
ITNESS-2.883
 Edmund-2.591
 defences-2.483
 insure-2.466
Token in
Token ID287
Feature activation+9.10e-02

Pos logit contributions
Hour+0.569
GAN+0.567
table+0.566
Band+0.557
chin+0.552
Neg logit contributions
-0.719
ITNESS-0.694
 Edmund-0.633
 defences-0.605
 insure-0.599
Token a
Token ID257
Feature activation+9.45e-03

Pos logit contributions
Hour+0.784
GAN+0.782
table+0.781
Band+0.766
chin+0.765
Neg logit contributions
-0.877
ITNESS-0.847
 Edmund-0.778
 defences-0.742
 insure-0.729
Token foe
Token ID22156
Feature activation-9.68e-03

Pos logit contributions
Hour+0.052
GAN+0.051
table+0.051
Band+0.051
chin+0.050
Neg logit contributions
-0.043
ITNESS-0.041
 Edmund-0.037
 defences-0.035
 insure-0.034
Token of
Token ID286
Feature activation-3.01e-01

Pos logit contributions
+0.053
ITNESS+0.052
 Edmund+0.047
 defences+0.045
 insure+0.045
Neg logit contributions
Hour-0.044
GAN-0.043
table-0.043
Band-0.043
chin-0.042
Token President
Token ID1992
Feature activation-3.76e-02

Pos logit contributions
+1.561
ITNESS+1.502
 Edmund+1.371
 defences+1.313
 insure+1.286
Neg logit contributions
Hour-1.452
GAN-1.442
table-1.440
Band-1.417
chin-1.413
Token Assad
Token ID13032
Feature activation+1.05e-01

Pos logit contributions
+0.219
ITNESS+0.212
 Edmund+0.197
 defences+0.188
 insure+0.185
Neg logit contributions
Hour-0.157
GAN-0.156
table-0.156
Band-0.153
chin-0.151
Token.
Token ID13
Feature activation+4.41e-02

Pos logit contributions
Hour+0.466
GAN+0.464
table+0.464
Band+0.458
chin+0.453
Neg logit contributions
-0.584
ITNESS-0.564
 Edmund-0.512
 defences-0.489
 insure-0.485
Token
Token ID198
Feature activation+2.57e-01

Pos logit contributions
Hour+0.172
GAN+0.171
table+0.171
Band+0.168
chin+0.167
Neg logit contributions
-0.269
ITNESS-0.261
 Edmund-0.242
 defences-0.230
 insure-0.227
Token
Token ID198
Feature activation+1.00e-01

Pos logit contributions
Hour+0.963
GAN+0.962
table+0.954
Band+0.939
chin+0.935
Neg logit contributions
-1.598
ITNESS-1.553
 Edmund-1.447
 defences-1.375
 insure-1.359
TokenRecent
Token ID26446
Feature activation-3.42e-02

Pos logit contributions
table+0.399
Hour+0.396
GAN+0.396
chin+0.391
risome+0.390
Neg logit contributions
-0.605
ITNESS-0.585
 Edmund-0.536
 defences-0.510
 insure-0.503
Token reports
Token ID3136
Feature activation+2.12e-01

Pos logit contributions
+0.156
ITNESS+0.149
 Edmund+0.134
 defences+0.129
 insure+0.124
Neg logit contributions
Hour-0.187
GAN-0.187
table-0.186
Band-0.184
chin-0.183
Token say
Token ID910
Feature activation+1.03e-01

Pos logit contributions
GAN+1.074
Hour+1.071
table+1.067
Band+1.054
chin+1.042
Neg logit contributions
-1.048
ITNESS-1.009
 Edmund-0.919
 defences-0.873
 insure-0.864
Token to
Token ID284
Feature activation+1.73e-01

Pos logit contributions
Hour+0.014
GAN+0.014
table+0.014
Band+0.014
chin+0.014
Neg logit contributions
-0.017
ITNESS-0.017
 Edmund-0.015
 defences-0.015
 insure-0.015
Token where
Token ID810
Feature activation+4.96e-02

Pos logit contributions
Hour+0.856
GAN+0.854
table+0.853
Band+0.841
chin+0.834
Neg logit contributions
-0.872
ITNESS-0.839
 Edmund-0.759
 defences-0.722
 insure-0.712
Token you
Token ID345
Feature activation+1.93e-02

Pos logit contributions
Hour+0.216
GAN+0.215
table+0.215
Band+0.211
chin+0.209
Neg logit contributions
-0.281
ITNESS-0.271
 Edmund-0.249
 defences-0.238
 insure-0.235
Token are
Token ID389
Feature activation-3.15e-01

Pos logit contributions
Hour+0.081
GAN+0.080
table+0.080
Band+0.079
chin+0.078
Neg logit contributions
-0.111
ITNESS-0.109
 Edmund-0.100
 defences-0.096
 insure-0.095
Token now
Token ID783
Feature activation-2.13e-01

Pos logit contributions
+1.618
ITNESS+1.571
 Edmund+1.447
 defences+1.381
 insure+1.347
Neg logit contributions
Hour-1.507
GAN-1.500
table-1.486
Band-1.472
chin-1.464
Token.
Token ID13
Feature activation-3.53e-01

Pos logit contributions
+1.158
ITNESS+1.123
 Edmund+1.026
 defences+0.980
 insure+0.967
Neg logit contributions
Hour-0.969
GAN-0.965
table-0.961
Band-0.950
chin-0.941
Token And
Token ID843
Feature activation-5.52e-01

Pos logit contributions
+1.922
ITNESS+1.913
 Edmund+1.809
 defences+1.653
 insure+1.616
Neg logit contributions
Hour-1.524
risome-1.520
table-1.515
GAN-1.510
andestine-1.492
Token the
Token ID262
Feature activation-5.28e-01

Pos logit contributions
+2.674
ITNESS+2.664
 Edmund+2.458
 defences+2.382
 insure+2.338
Neg logit contributions
Hour-2.707
table-2.681
GAN-2.664
Band-2.642
risome-2.628
Token instant
Token ID9113
Feature activation-9.23e-02

Pos logit contributions
+2.331
ITNESS+2.303
 Edmund+2.078
 defences+2.022
 insure+1.940
Neg logit contributions
Hour-2.867
table-2.850
GAN-2.838
Band-2.808
chin-2.786
Token you
Token ID345
Feature activation-3.01e-02

Pos logit contributions
+0.501
ITNESS+0.492
 Edmund+0.448
 defences+0.431
 insure+0.424
Neg logit contributions
Hour-0.418
GAN-0.417
table-0.414
Band-0.409
chin-0.405
Token start
Token ID923
Feature activation-1.17e-01

Pos logit contributions
+0.132
ITNESS+0.131
 Edmund+0.117
 insure+0.112
 defences+0.111
Neg logit contributions
Hour-0.166
GAN-0.164
table-0.164
Band-0.163
chin-0.160
TokenRed
Token ID7738
Feature activation-2.48e-01

Pos logit contributions
+0.173
ITNESS+0.167
 Edmund+0.153
 defences+0.146
 insure+0.145
Neg logit contributions
GAN-0.112
Hour-0.112
table-0.112
chin-0.109
Band-0.109
Token Transit
Token ID22325
Feature activation-2.97e-01

Pos logit contributions
+1.423
ITNESS+1.369
 Edmund+1.278
 defences+1.219
 insure+1.198
Neg logit contributions
GAN-1.046
Hour-1.032
table-1.022
chin-1.006
Band-1.005
Token L
Token ID406
Feature activation-3.04e-01

Pos logit contributions
+1.644
ITNESS+1.592
 Edmund+1.478
 defences+1.403
 insure+1.390
Neg logit contributions
Hour-1.301
table-1.300
GAN-1.299
Band-1.263
chin-1.261
Tokenanes
Token ID7305
Feature activation-1.23e-01

Pos logit contributions
+1.937
ITNESS+1.894
 Edmund+1.756
 defences+1.677
 insure+1.665
Neg logit contributions
GAN-1.089
table-1.087
Hour-1.086
Band-1.050
chin-1.048
Token on
Token ID319
Feature activation-2.68e-01

Pos logit contributions
+0.692
ITNESS+0.669
 Edmund+0.621
 defences+0.593
 insure+0.589
Neg logit contributions
GAN-0.534
Hour-0.532
table-0.531
Band-0.521
chin-0.515
Token Church
Token ID4564
Feature activation-3.22e-01

Pos logit contributions
+1.370
ITNESS+1.318
 Edmund+1.219
 defences+1.147
 insure+1.131
Neg logit contributions
Hour-1.304
GAN-1.300
table-1.293
Band-1.275
chin-1.266
Token Have
Token ID8192
Feature activation-4.43e-02

Pos logit contributions
+1.580
ITNESS+1.513
 Edmund+1.410
 defences+1.317
 insure+1.291
Neg logit contributions
Hour-1.620
GAN-1.609
table-1.599
Band-1.575
chin-1.552
Token Made
Token ID14446
Feature activation-4.12e-02

Pos logit contributions
+0.260
ITNESS+0.253
 Edmund+0.237
 defences+0.226
 insure+0.222
Neg logit contributions
Hour-0.180
GAN-0.178
table-0.178
Band-0.175
chin-0.173
Token Mun
Token ID12107
Feature activation-5.23e-02

Pos logit contributions
+0.229
ITNESS+0.221
 Edmund+0.205
 defences+0.194
 insure+0.191
Neg logit contributions
GAN-0.182
Hour-0.182
table-0.181
Band-0.179
chin-0.176
Tokeni
Token ID72
Feature activation+2.14e-01

Pos logit contributions
+0.338
ITNESS+0.329
 Edmund+0.307
 defences+0.295
 insure+0.291
Neg logit contributions
GAN-0.184
Hour-0.183
table-0.182
Band-0.179
chin-0.176
Token Faster
Token ID38996
Feature activation+2.96e-01

Pos logit contributions
Hour+0.975
GAN+0.974
table+0.973
Band+0.956
chin+0.949
Neg logit contributions
-1.164
ITNESS-1.125
 Edmund-1.034
 defences-0.983
 insure-0.971
Token For
Token ID1114
Feature activation+2.30e-01

Pos logit contributions
+0.518
ITNESS+0.505
 Edmund+0.472
 defences+0.434
 insure+0.431
Neg logit contributions
Hour-0.403
GAN-0.403
table-0.400
chin-0.396
Band-0.395
Token Bob
Token ID5811
Feature activation-3.69e-01

Pos logit contributions
Hour+1.145
GAN+1.142
table+1.140
Band+1.124
chin+1.115
Neg logit contributions
-1.158
ITNESS-1.114
 Edmund-1.009
 defences-0.958
 insure-0.946
Token,
Token ID11
Feature activation+7.01e-02

Pos logit contributions
+2.081
ITNESS+2.013
 Edmund+1.851
 defences+1.760
 insure+1.743
Neg logit contributions
Hour-1.604
GAN-1.602
table-1.596
Band-1.573
chin-1.559
Token it
Token ID340
Feature activation-1.63e-01

Pos logit contributions
Hour+0.336
GAN+0.335
table+0.334
Band+0.330
chin+0.327
Neg logit contributions
-0.366
ITNESS-0.353
 Edmund-0.321
 defences-0.305
 insure-0.302
Token would
Token ID561
Feature activation-1.81e-01

Pos logit contributions
+0.888
ITNESS+0.857
 Edmund+0.793
 defences+0.752
 insure+0.749
Neg logit contributions
GAN-0.739
Hour-0.734
table-0.725
Band-0.718
chin-0.708
Token come
Token ID1282
Feature activation-7.04e-01

Pos logit contributions
+0.875
ITNESS+0.842
 Edmund+0.771
 insure+0.747
 defences+0.725
Neg logit contributions
GAN-0.930
Hour-0.923
table-0.909
Band-0.908
chin-0.897
Token in
Token ID287
Feature activation-2.67e-01

Pos logit contributions
+3.375
ITNESS+3.248
 Edmund+3.101
 defences+2.892
 insure+2.830
Neg logit contributions
GAN-3.549
Hour-3.482
Band-3.460
chin-3.439
table-3.419
Token the
Token ID262
Feature activation-3.94e-02

Pos logit contributions
+1.309
ITNESS+1.261
 Edmund+1.156
 defences+1.087
 insure+1.072
Neg logit contributions
GAN-1.354
Hour-1.353
table-1.342
Band-1.328
chin-1.327
Token form
Token ID1296
Feature activation-1.67e-03

Pos logit contributions
+0.163
ITNESS+0.156
 Edmund+0.139
 defences+0.130
 insure+0.128
Neg logit contributions
Hour-0.231
GAN-0.231
table-0.229
Band-0.227
chin-0.226
Token of
Token ID286
Feature activation+3.30e-01

Pos logit contributions
+0.009
ITNESS+0.009
 Edmund+0.008
 defences+0.008
 insure+0.007
Neg logit contributions
Hour-0.008
GAN-0.008
table-0.008
Band-0.008
chin-0.007
Token the
Token ID262
Feature activation+3.57e-01

Pos logit contributions
table+1.495
Hour+1.494
GAN+1.487
Band+1.468
chin+1.447
Neg logit contributions
-1.804
ITNESS-1.734
 Edmund-1.572
 defences-1.510
 insure-1.491
Token it
Token ID340
Feature activation-2.65e-01

Pos logit contributions
+2.101
ITNESS+2.003
 Edmund+1.874
 defences+1.771
 insure+1.770
Neg logit contributions
Hour-1.883
GAN-1.857
table-1.857
Band-1.835
chin-1.818
Token takes
Token ID2753
Feature activation-9.56e-01

Pos logit contributions
+1.405
ITNESS+1.327
 Edmund+1.247
 defences+1.194
 insure+1.182
Neg logit contributions
Hour-1.243
GAN-1.223
Band-1.208
table-1.208
risome-1.178
Token about
Token ID546
Feature activation-3.48e-01

Pos logit contributions
+4.176
ITNESS+4.083
 Edmund+4.039
 defences+3.806
 rehe+3.745
Neg logit contributions
Hour-4.800
GAN-4.793
andestine-4.769
Band-4.748
table-4.721
Token 15
Token ID1315
Feature activation-2.80e-01

Pos logit contributions
+1.563
ITNESS+1.525
 Edmund+1.445
 defences+1.345
 insure+1.330
Neg logit contributions
GAN-1.836
Hour-1.814
table-1.806
Band-1.793
andestine-1.776
Token-
Token ID12
Feature activation-1.18e-01

Pos logit contributions
+1.036
ITNESS+0.983
 Edmund+0.857
 defences+0.794
 insure+0.779
Neg logit contributions
Hour-1.767
GAN-1.764
table-1.759
Band-1.740
chin-1.731
Token20
Token ID1238
Feature activation-6.43e-01

Pos logit contributions
+0.581
ITNESS+0.562
 Edmund+0.515
 defences+0.490
 insure+0.482
Neg logit contributions
GAN-0.591
Hour-0.581
table-0.574
risome-0.571
Band-0.570
Token minutes
Token ID2431
Feature activation-5.34e-01

Pos logit contributions
+1.296
ITNESS+1.212
 Edmund+0.942
 defences+0.780
 insure+0.741
Neg logit contributions
Hour-5.039
GAN-5.039
table-5.005
Band-4.986
chin-4.961
Token for
Token ID329
Feature activation-1.46e-01

Pos logit contributions
+2.685
ITNESS+2.597
 Edmund+2.426
 defences+2.289
 insure+2.266
Neg logit contributions
GAN-2.553
Hour-2.539
table-2.512
chin-2.503
Band-2.502
Token it
Token ID340
Feature activation-1.58e-01

Pos logit contributions
+0.722
ITNESS+0.695
 Edmund+0.634
 defences+0.600
 insure+0.592
Neg logit contributions
Hour-0.735
GAN-0.731
table-0.727
Band-0.720
chin-0.716
Token to
Token ID284
Feature activation-2.90e-01

Pos logit contributions
+0.893
ITNESS+0.862
 Edmund+0.789
 defences+0.755
 insure+0.746
Neg logit contributions
Hour-0.690
GAN-0.688
table-0.686
Band-0.675
chin-0.669
Token be
Token ID307
Feature activation-2.31e-01

Pos logit contributions
+1.248
ITNESS+1.165
 Edmund+1.079
 insure+1.035
 defences+1.010
Neg logit contributions
Hour-1.645
GAN-1.626
Band-1.618
table-1.616
chin-1.597
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
+1.146
ITNESS+1.114
 Edmund+1.043
 defences+0.978
 insure+0.968
Neg logit contributions
GAN-1.105
Hour-1.099
table-1.091
Band-1.081
chin-1.068
Token behold
Token ID23700
Feature activation-7.73e-01

Pos logit contributions
+4.419
ITNESS+4.390
 Edmund+4.364
 insure+3.944
 defences+3.939
Neg logit contributions
GAN-5.280
Hour-5.237
Band-5.158
table-5.130
risome-5.124
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
+4.183
ITNESS+4.075
 Edmund+3.766
 defences+3.532
 insure+3.528
Neg logit contributions
Hour-3.454
GAN-3.454
Band-3.414
table-3.391
chin-3.350
Token all
Token ID477
Feature activation-4.02e-01

Pos logit contributions
+4.596
ITNESS+4.566
 Edmund+4.417
 defences+4.073
 insure+4.045
Neg logit contributions
GAN-5.495
Hour-5.418
Band-5.366
risome-5.337
table-5.333
Token was
Token ID373
Feature activation-5.54e-01

Pos logit contributions
+2.131
ITNESS+2.076
 Edmund+1.928
 defences+1.842
 insure+1.822
Neg logit contributions
GAN-1.848
Hour-1.845
table-1.818
Band-1.809
risome-1.781
Token vanity
Token ID39609
Feature activation-7.78e-01

Pos logit contributions
+2.451
ITNESS+2.388
 Edmund+2.257
 defences+2.107
 insure+2.088
Neg logit contributions
GAN-2.969
Hour-2.959
Band-2.890
table-2.885
chin-2.875
Token and
Token ID290
Feature activation-5.68e-01

Pos logit contributions
+3.756
ITNESS+3.616
 Edmund+3.455
 defences+3.233
 insure+3.229
Neg logit contributions
GAN-3.887
Hour-3.804
Band-3.710
table-3.700
risome-3.687
Token vex
Token ID41548
Feature activation-1.33e-01

Pos logit contributions
+2.271
ITNESS+2.202
 Edmund+2.019
 defences+1.891
 insure+1.858
Neg logit contributions
GAN-3.317
Hour-3.309
table-3.273
Band-3.255
chin-3.237
Tokenation
Token ID341
Feature activation-3.61e-01

Pos logit contributions
+0.878
ITNESS+0.867
 Edmund+0.831
 defences+0.790
 insure+0.780
Neg logit contributions
GAN-0.422
Hour-0.417
table-0.396
Band-0.395
chin-0.393
Token of
Token ID286
Feature activation-3.78e-01

Pos logit contributions
+1.991
ITNESS+1.923
 Edmund+1.764
 defences+1.681
 insure+1.665
Neg logit contributions
Hour-1.616
GAN-1.615
table-1.605
Band-1.584
chin-1.568
Token spirit
Token ID4437
Feature activation-1.80e-01

Pos logit contributions
+1.834
ITNESS+1.768
 Edmund+1.618
 defences+1.529
 insure+1.509
Neg logit contributions
Hour-1.935
GAN-1.932
table-1.920
Band-1.896
chin-1.880
Token lol
Token ID19462
Feature activation+5.48e-01

Pos logit contributions
Hour+0.323
table+0.320
GAN+0.320
Band+0.315
chin+0.311
Neg logit contributions
-0.392
ITNESS-0.380
 Edmund-0.351
 defences-0.334
 insure-0.331
Token m
Token ID285
Feature activation+4.87e-01

Pos logit contributions
GAN+2.448
table+2.432
Hour+2.430
Band+2.382
chin+2.379
Neg logit contributions
-3.026
ITNESS-2.925
 Edmund-2.648
 defences-2.539
 insure-2.512
Token2
Token ID17
Feature activation+4.79e-02

Pos logit contributions
table+1.907
GAN+1.881
Hour+1.870
Band+1.855
chin+1.837
Neg logit contributions
-2.985
ITNESS-2.908
 Edmund-2.647
 defences-2.565
 insure-2.530
Tokenk
Token ID74
Feature activation+4.52e-02

Pos logit contributions
Hour+0.245
GAN+0.244
table+0.242
risome+0.240
Band+0.240
Neg logit contributions
-0.228
ITNESS-0.220
 Edmund-0.202
 defences-0.188
 insure-0.185
Token [
Token ID685
Feature activation-5.67e-01

Pos logit contributions
Hour+0.154
table+0.152
GAN+0.151
Band+0.150
risome+0.147
Neg logit contributions
-0.296
ITNESS-0.289
 Edmund-0.272
 defences-0.259
 insure-0.256
Token10
Token ID940
Feature activation-7.74e-01

Pos logit contributions
+2.400
ITNESS+2.315
 Edmund+2.105
 defences+1.867
 rehe+1.823
Neg logit contributions
risome-3.129
andestine-3.118
GAN-3.062
Hour-3.005
table-2.984
Token:
Token ID25
Feature activation-5.93e-01

Pos logit contributions
+3.143
ITNESS+3.130
 Edmund+2.944
 defences+2.535
 rehe+2.478
Neg logit contributions
risome-4.283
GAN-4.181
andestine-4.161
chin-4.125
Hour-4.118
Token47
Token ID2857
Feature activation-4.77e-02

Pos logit contributions
+3.254
ITNESS+3.247
 Edmund+2.982
 defences+2.712
 insure+2.639
Neg logit contributions
risome-2.468
GAN-2.457
andestine-2.440
chin-2.406
Hour-2.398
Token]
Token ID60
Feature activation+3.78e-01

Pos logit contributions
+0.277
ITNESS+0.271
 Edmund+0.255
 defences+0.236
 insure+0.233
Neg logit contributions
Hour-0.192
GAN-0.192
risome-0.189
table-0.189
Band-0.188
Token <
Token ID1279
Feature activation+2.65e-01

Pos logit contributions
GAN+1.198
table+1.171
Hour+1.164
Band+1.155
chin+1.127
Neg logit contributions
-2.577
ITNESS-2.459
 Edmund-2.313
 defences-2.258
 insure-2.227
Tokenkw
Token ID46265
Feature activation+5.99e-01

Pos logit contributions
Hour+1.070
GAN+1.067
table+1.064
Band+1.047
chin+1.035
Neg logit contributions
-1.583
ITNESS-1.531
 Edmund-1.410
 defences-1.351
 insure-1.337
Token films
Token ID7328
Feature activation-1.89e-01

Pos logit contributions
Hour+0.611
GAN+0.609
table+0.605
Band+0.597
chin+0.590
Neg logit contributions
-0.712
ITNESS-0.691
 Edmund-0.637
 defences-0.602
 insure-0.596
Token Seven
Token ID13723
Feature activation+2.99e-01

Pos logit contributions
+1.021
ITNESS+0.994
 Edmund+0.917
 defences+0.866
 insure+0.859
Neg logit contributions
Hour-0.855
GAN-0.855
table-0.854
Band-0.836
chin-0.826
Token Samurai
Token ID23882
Feature activation+3.27e-01

Pos logit contributions
Hour+0.704
table+0.699
GAN+0.693
Band+0.677
chin+0.665
Neg logit contributions
-2.302
ITNESS-2.232
 Edmund-2.094
 defences-2.031
 insure-2.014
Token and
Token ID290
Feature activation+5.80e-02

Pos logit contributions
Hour+1.395
GAN+1.389
table+1.387
Band+1.375
chin+1.352
Neg logit contributions
-1.885
ITNESS-1.819
 Edmund-1.647
 defences-1.592
 insure-1.568
Token Y
Token ID575
Feature activation+4.84e-01

Pos logit contributions
Hour+0.252
GAN+0.251
table+0.251
Band+0.246
chin+0.244
Neg logit contributions
-0.327
ITNESS-0.318
 Edmund-0.292
 defences-0.278
 insure-0.275
Tokenoj
Token ID13210
Feature activation-7.21e-01

Pos logit contributions
GAN+2.167
Hour+2.159
table+2.150
Band+2.126
chin+2.110
Neg logit contributions
-2.690
ITNESS-2.574
 Edmund-2.360
 defences-2.266
 insure-2.233
Tokenim
Token ID320
Feature activation-6.18e-01

Pos logit contributions
+2.914
ITNESS+2.812
 Edmund+2.571
 defences+2.387
 insure+2.311
Neg logit contributions
Hour-4.191
GAN-4.122
table-4.055
Band-4.030
risome-4.003
Tokenbo
Token ID2127
Feature activation-2.14e-01

Pos logit contributions
+2.519
ITNESS+2.462
 Edmund+2.098
 defences+1.990
 insure+1.966
Neg logit contributions
Hour-3.680
table-3.629
Band-3.542
GAN-3.540
risome-3.518
Token,
Token ID11
Feature activation+3.75e-01

Pos logit contributions
+1.163
ITNESS+1.129
 Edmund+1.036
 defences+0.983
 insure+0.977
Neg logit contributions
Hour-0.969
GAN-0.968
table-0.966
Band-0.946
chin-0.938
Token among
Token ID1871
Feature activation+7.37e-02

Pos logit contributions
Hour+1.742
GAN+1.735
table+1.734
Band+1.712
chin+1.692
Neg logit contributions
-2.018
ITNESS-1.941
 Edmund-1.754
 defences-1.680
 insure-1.657
Token others
Token ID1854
Feature activation+4.83e-01

Pos logit contributions
Hour+0.403
GAN+0.402
table+0.402
Band+0.397
chin+0.394
Neg logit contributions
-0.336
ITNESS-0.321
 Edmund-0.287
 defences-0.271
 insure-0.267