TOP ACTIVATIONS
MAX = 1.136

er ." R ever se side of flyer (
( © Copyright 2010 The Associated Press .
Ethereum for its long - term vision of establishing an ecosystem
( © Copyright 2010 The Associated Press
screen grab ] <|endoftext|> ( Updated below ) The
com / watch ? v = 2 g 7 p k
[ ng gallery id = 159 ] The
youtube . com / channel / UC _ K k BY
Q ARS ). Atlanta , GA : US Department of Health
Tiff in , Iowa . Hide Caption 5 of 15 Photos
previously , the " R eth inking Mathematics : Teaching Social
prepare for cases . ( Photo by Kathleen Flynn , N
coming / article 35 79 23 12 . e ce /
in Pasadena , California . Hide Caption 12 of 15 Photos
. <|endoftext|> AD EL A IDE captain Taylor Walker says the
make overcl ocking easier than ever . With the industry first
. Statistics used courtesy of Basketball - Reference .
with the same short - term disability proposal the Conservatives made
- out - the - vote efforts , and populist Sen
in the business long - term , could you see yourself

MIN ACTIVATIONS
MIN = -0.968

R c `, or ` & ` ) at some point
to Rep . Will H urd , R - Texas ,
in connection with the fl ot illa incident and will prevent
Try { for ($ i ; $ i - le $
want to kick anyone , lets just move them around in
Own ed < T > ; } impl < T >
missions but Honour Among Thieves is a h oot from
Valid ) ) return space ; // 4 . Rec urs
the families of the fl ot illa victims .
( args [" images "] ): # load the image ,
3 Next All <|endoftext|> This article is over 3 years old
. - February 9 , 2010 R ipe for M ST
== 0 . 0 f ); RAW _ ASS ERT (
' a , T >> ; } In order
. - December 13 , 2010 boys beware Reviewer
-> Atomic < T > ; } We
2010 United States 10 53 Posts # 15 honestly if we
2011 United States 228 65 Posts # 13 On September 20
at nickel ( or vice versa ), then the following alignment
South Korea . <|endoftext|> This article is about the payment medium

INTERVAL 0.610 to 1.136
CONTAINS 0.146%

by suicide . Cambridge , MA : Harvard University Press ;
+ medium kick , Laura moves in the corresponding direction ,
, a former five - term member of Congress who made
with a new short - term disability plan .
os Hills , California . Hide Caption 14 of 15 Photos

INTERVAL 0.084 to 0.610
CONTAINS 30.927%

budget . Cance ling it would allow you to put back
I believe that [ there ] is this desire
OCK : Connect it to the watch , without
е к с е р о
to broadcast their experiences and provide commentary in real - time

INTERVAL -0.442 to 0.084
CONTAINS 67.872%

guests , had enjoyed their Easter brunch , when Tor i
I washed the broken st on ework with Val le jo
s World Ocean Day will sadly be remembered for the heart
up some got cha 's I encountered . Assuming you already
The concert venue was filmed in , a wonderful

INTERVAL -0.968 to -0.442
CONTAINS 1.055%

out hed client ( Frank Al ber ts on .) In
Sh and obil , right , after a debate on the
The S AC met to hear local transportation and safety experts
. If this does not describe you , don 't dose
HR P require all people conducting or overseeing human subjects research

TOKEN DETAILS

Tokener
Token ID263
Feature activation+2.17e-02

Pos logit contributions
IDS+0.043
 Norn+0.041
 juggling+0.041
thumbnails+0.039
Thor+0.038
Neg logit contributions
theless-0.024
zens-0.023
 indo-0.022
negie-0.022
 Regulatory-0.022
Token."
Token ID526
Feature activation-2.97e-02

Pos logit contributions
IDS+0.082
 Norn+0.079
 juggling+0.077
thumbnails+0.073
Thor+0.070
Neg logit contributions
theless-0.093
zens-0.091
 indo-0.088
negie-0.088
 Regulatory-0.087
Token\n
Token ID198
Feature activation+3.80e-02

Pos logit contributions
theless+0.114
zens+0.112
 Regulatory+0.107
 indo+0.106
negie+0.104
Neg logit contributions
IDS-0.128
 juggling-0.121
 Norn-0.121
thumbnails-0.116
Thor-0.110
Token\n
Token ID198
Feature activation+2.82e-02

Pos logit contributions
IDS+0.137
 Norn+0.131
 juggling+0.127
thumbnails+0.120
Thor+0.119
Neg logit contributions
theless-0.175
 indo-0.173
zens-0.166
negie-0.162
 Regulatory-0.158
TokenR
Token ID49
Feature activation+7.56e-02

Pos logit contributions
IDS+0.116
 Norn+0.112
 juggling+0.110
thumbnails+0.105
Thor+0.101
Neg logit contributions
theless-0.111
zens-0.109
 indo-0.105
negie-0.104
 Regulatory-0.103
Tokenever
Token ID964
Feature activation+1.14e+00

Pos logit contributions
IDS+0.271
 Norn+0.260
 juggling+0.256
thumbnails+0.241
Thor+0.231
Neg logit contributions
theless-0.338
zens-0.333
 indo-0.323
negie-0.320
 Regulatory-0.317
Tokense
Token ID325
Feature activation-6.18e-02

Pos logit contributions
IDS+3.897
 Norn+3.775
 juggling+3.653
thumbnails+3.523
Thor+3.371
Neg logit contributions
theless-5.034
 indo-4.880
zens-4.761
negie-4.745
 Regulatory-4.708
Token side
Token ID1735
Feature activation-2.52e-01

Pos logit contributions
theless+0.255
zens+0.250
 indo+0.242
negie+0.238
 Regulatory+0.237
Neg logit contributions
IDS-0.244
 Norn-0.234
 juggling-0.231
thumbnails-0.219
Thor-0.211
Token of
Token ID286
Feature activation+2.36e-02

Pos logit contributions
theless+1.016
zens+0.999
 indo+0.967
 Regulatory+0.945
negie+0.945
Neg logit contributions
IDS-1.013
 Norn-0.973
 juggling-0.954
thumbnails-0.912
Thor-0.877
Token flyer
Token ID42464
Feature activation-1.07e-01

Pos logit contributions
IDS+0.087
 Norn+0.083
 juggling+0.082
thumbnails+0.078
Thor+0.074
Neg logit contributions
theless-0.103
zens-0.102
 indo-0.099
 Regulatory-0.097
negie-0.097
Token (
Token ID357
Feature activation-3.03e-02

Pos logit contributions
theless+0.447
zens+0.440
 indo+0.426
negie+0.421
 Regulatory+0.417
Neg logit contributions
IDS-0.414
 Norn-0.397
 juggling-0.392
thumbnails-0.371
Thor-0.357
Token
Token ID251
Feature activation+1.38e-01

Pos logit contributions
zens+0.302
theless+0.301
negie+0.285
 indo+0.285
 Regulatory+0.284
Neg logit contributions
IDS-0.491
 Norn-0.474
 juggling-0.468
thumbnails-0.454
Thor-0.433
Token\n
Token ID198
Feature activation-1.02e-02

Pos logit contributions
IDS+0.567
 juggling+0.542
 Norn+0.542
thumbnails+0.515
Thor+0.490
Neg logit contributions
theless-0.548
zens-0.539
 Regulatory-0.515
 indo-0.514
negie-0.513
Token\n
Token ID198
Feature activation+2.90e-02

Pos logit contributions
theless+0.047
 indo+0.046
zens+0.044
negie+0.043
 Regulatory+0.042
Neg logit contributions
IDS-0.037
 Norn-0.035
 juggling-0.034
thumbnails-0.033
Thor-0.032
Token(
Token ID7
Feature activation+2.83e-01

Pos logit contributions
IDS+0.115
 Norn+0.111
 juggling+0.110
thumbnails+0.103
Thor+0.099
Neg logit contributions
theless-0.117
zens-0.117
 indo-0.111
negie-0.111
 Regulatory-0.110
Token©
Token ID16224
Feature activation+1.01e+00

Pos logit contributions
IDS+1.064
 Norn+1.018
 juggling+1.001
thumbnails+0.949
Thor+0.915
Neg logit contributions
theless-1.224
zens-1.196
 indo-1.162
negie-1.150
 Regulatory-1.133
Token Copyright
Token ID15069
Feature activation+1.05e+00

Pos logit contributions
IDS+3.150
 Norn+2.950
 juggling+2.865
thumbnails+2.690
Thor+2.620
Neg logit contributions
theless-5.034
zens-4.890
 indo-4.787
negie-4.748
enko-4.638
Token 2010
Token ID3050
Feature activation+2.04e-01

Pos logit contributions
IDS+3.117
 Norn+2.964
 juggling+2.887
thumbnails+2.676
Thor+2.561
Neg logit contributions
theless-5.331
zens-5.225
 indo-5.058
negie-5.050
enko-4.906
Token The
Token ID383
Feature activation+3.69e-01

Pos logit contributions
IDS+0.828
 Norn+0.797
 juggling+0.786
thumbnails+0.747
Thor+0.720
Neg logit contributions
theless-0.816
zens-0.803
 indo-0.777
negie-0.768
 Regulatory-0.763
Token Associated
Token ID10575
Feature activation+1.36e-01

Pos logit contributions
IDS+1.443
 Norn+1.391
 juggling+1.364
thumbnails+1.288
Thor+1.246
Neg logit contributions
theless-1.537
zens-1.500
 indo-1.459
negie-1.438
 Regulatory-1.410
Token Press
Token ID4332
Feature activation+3.62e-01

Pos logit contributions
IDS+0.367
 Norn+0.362
 juggling+0.335
thumbnails+0.311
Thor+0.309
Neg logit contributions
theless-0.749
 indo-0.735
zens-0.715
terday-0.690
negie-0.688
Token.
Token ID13
Feature activation+4.36e-01

Pos logit contributions
IDS+1.334
 Norn+1.278
 juggling+1.261
thumbnails+1.190
Thor+1.142
Neg logit contributions
theless-1.606
zens-1.545
 indo-1.514
negie-1.481
 Regulatory-1.450
Token Ethereum
Token ID20313
Feature activation-2.16e-01

Pos logit contributions
theless+0.101
zens+0.099
 indo+0.096
negie+0.095
 Regulatory+0.094
Neg logit contributions
IDS-0.092
 Norn-0.088
 juggling-0.087
thumbnails-0.082
Thor-0.079
Token for
Token ID329
Feature activation-2.60e-02

Pos logit contributions
theless+0.896
zens+0.880
 indo+0.851
negie+0.843
 Regulatory+0.833
Neg logit contributions
IDS-0.849
 Norn-0.815
 juggling-0.804
thumbnails-0.762
Thor-0.734
Token its
Token ID663
Feature activation-3.72e-02

Pos logit contributions
theless+0.109
zens+0.107
 indo+0.104
negie+0.103
 Regulatory+0.102
Neg logit contributions
IDS-0.101
 Norn-0.097
 juggling-0.095
thumbnails-0.090
Thor-0.087
Token long
Token ID890
Feature activation+1.35e-01

Pos logit contributions
theless+0.152
zens+0.149
 indo+0.145
negie+0.143
 Regulatory+0.141
Neg logit contributions
IDS-0.147
 Norn-0.142
 juggling-0.140
thumbnails-0.132
Thor-0.127
Token-
Token ID12
Feature activation+1.57e-01

Pos logit contributions
IDS+0.523
 Norn+0.502
 juggling+0.495
thumbnails+0.469
Thor+0.451
Neg logit contributions
theless-0.570
zens-0.560
 indo-0.542
negie-0.536
 Regulatory-0.531
Tokenterm
Token ID4354
Feature activation+1.02e+00

Pos logit contributions
IDS+0.529
 Norn+0.506
 juggling+0.497
thumbnails+0.464
Thor+0.444
Neg logit contributions
theless-0.738
zens-0.728
 indo-0.702
negie-0.698
 Regulatory-0.692
Token vision
Token ID5761
Feature activation+2.38e-01

Pos logit contributions
IDS+3.542
 juggling+3.505
 Norn+3.461
 blink+3.177
thumbnails+3.153
Neg logit contributions
theless-4.468
zens-4.371
negie-4.233
 indo-4.221
enko-4.152
Token of
Token ID286
Feature activation+2.38e-01

Pos logit contributions
IDS+0.928
 Norn+0.890
 juggling+0.876
thumbnails+0.833
Thor+0.800
Neg logit contributions
theless-0.988
zens-0.973
 indo-0.941
negie-0.932
 Regulatory-0.923
Token establishing
Token ID15010
Feature activation+6.39e-02

Pos logit contributions
IDS+0.928
 juggling+0.894
 Norn+0.891
thumbnails+0.834
Thor+0.799
Neg logit contributions
theless-0.986
zens-0.972
 indo-0.931
negie-0.926
 Regulatory-0.906
Token an
Token ID281
Feature activation-1.41e-01

Pos logit contributions
IDS+0.253
 Norn+0.245
 juggling+0.242
thumbnails+0.227
Thor+0.219
Neg logit contributions
theless-0.262
zens-0.256
 indo-0.248
negie-0.244
 Regulatory-0.240
Token ecosystem
Token ID13187
Feature activation-1.97e-01

Pos logit contributions
theless+0.614
zens+0.603
 indo+0.583
negie+0.578
 Regulatory+0.572
Neg logit contributions
IDS-0.522
 Norn-0.500
 juggling-0.492
thumbnails-0.465
Thor-0.446
Token
Token ID447
Feature activation-9.88e-02

Pos logit contributions
IDS+0.389
 Norn+0.372
 juggling+0.370
thumbnails+0.350
Thor+0.335
Neg logit contributions
theless-0.410
zens-0.403
 indo-0.388
negie-0.386
 Regulatory-0.384
Token
Token ID251
Feature activation+1.38e-01

Pos logit contributions
zens+0.302
theless+0.301
negie+0.285
 indo+0.285
 Regulatory+0.284
Neg logit contributions
IDS-0.491
 Norn-0.474
 juggling-0.468
thumbnails-0.454
Thor-0.433
Token\n
Token ID198
Feature activation-1.02e-02

Pos logit contributions
IDS+0.567
 juggling+0.542
 Norn+0.542
thumbnails+0.515
Thor+0.490
Neg logit contributions
theless-0.548
zens-0.539
 Regulatory-0.515
 indo-0.514
negie-0.513
Token\n
Token ID198
Feature activation+2.90e-02

Pos logit contributions
theless+0.047
 indo+0.046
zens+0.044
negie+0.043
 Regulatory+0.042
Neg logit contributions
IDS-0.037
 Norn-0.035
 juggling-0.034
thumbnails-0.033
Thor-0.032
Token(
Token ID7
Feature activation+2.83e-01

Pos logit contributions
IDS+0.115
 Norn+0.111
 juggling+0.110
thumbnails+0.103
Thor+0.099
Neg logit contributions
theless-0.117
zens-0.117
 indo-0.111
negie-0.111
 Regulatory-0.110
Token©
Token ID16224
Feature activation+1.01e+00

Pos logit contributions
IDS+1.064
 Norn+1.018
 juggling+1.001
thumbnails+0.949
Thor+0.915
Neg logit contributions
theless-1.224
zens-1.196
 indo-1.162
negie-1.150
 Regulatory-1.133
Token Copyright
Token ID15069
Feature activation+1.05e+00

Pos logit contributions
IDS+3.150
 Norn+2.950
 juggling+2.865
thumbnails+2.690
Thor+2.620
Neg logit contributions
theless-5.034
zens-4.890
 indo-4.787
negie-4.748
enko-4.638
Token 2010
Token ID3050
Feature activation+2.04e-01

Pos logit contributions
IDS+3.117
 Norn+2.964
 juggling+2.887
thumbnails+2.676
Thor+2.561
Neg logit contributions
theless-5.331
zens-5.225
 indo-5.058
negie-5.050
enko-4.906
Token The
Token ID383
Feature activation+3.69e-01

Pos logit contributions
IDS+0.828
 Norn+0.797
 juggling+0.786
thumbnails+0.747
Thor+0.720
Neg logit contributions
theless-0.816
zens-0.803
 indo-0.777
negie-0.768
 Regulatory-0.763
Token Associated
Token ID10575
Feature activation+1.36e-01

Pos logit contributions
IDS+1.443
 Norn+1.391
 juggling+1.364
thumbnails+1.288
Thor+1.246
Neg logit contributions
theless-1.537
zens-1.500
 indo-1.459
negie-1.438
 Regulatory-1.410
Token Press
Token ID4332
Feature activation+3.62e-01

Pos logit contributions
IDS+0.367
 Norn+0.362
 juggling+0.335
thumbnails+0.311
Thor+0.309
Neg logit contributions
theless-0.749
 indo-0.735
zens-0.715
terday-0.690
negie-0.688
Token screen
Token ID3159
Feature activation+2.44e-01

Pos logit contributions
IDS+1.797
 Norn+1.723
 juggling+1.696
thumbnails+1.613
Thor+1.543
Neg logit contributions
theless-1.941
zens-1.888
 indo-1.821
negie-1.817
 Regulatory-1.766
Token grab
Token ID5552
Feature activation+4.66e-01

Pos logit contributions
IDS+0.883
 Norn+0.845
 juggling+0.834
thumbnails+0.786
Thor+0.754
Neg logit contributions
theless-1.086
zens-1.065
 indo-1.034
negie-1.025
 Regulatory-1.014
Token]
Token ID60
Feature activation+8.26e-02

Pos logit contributions
IDS+1.911
 Norn+1.835
 juggling+1.815
thumbnails+1.724
Thor+1.661
Neg logit contributions
theless-1.850
zens-1.812
 indo-1.751
negie-1.735
 Regulatory-1.709
Token<|endoftext|>
Token ID50256
Feature activation+5.35e-02

Pos logit contributions
IDS+0.335
 Norn+0.327
 juggling+0.323
thumbnails+0.303
Thor+0.290
Neg logit contributions
theless-0.330
zens-0.329
 Regulatory-0.310
negie-0.308
 indo-0.307
Token(
Token ID7
Feature activation+5.95e-01

Pos logit contributions
IDS+0.218
 Norn+0.211
 juggling+0.209
thumbnails+0.197
Thor+0.189
Neg logit contributions
theless-0.211
zens-0.209
 indo-0.200
negie-0.199
 Regulatory-0.196
TokenUpdated
Token ID17354
Feature activation+1.01e+00

Pos logit contributions
IDS+2.288
 Norn+2.186
 juggling+2.143
thumbnails+2.048
Thor+1.988
Neg logit contributions
theless-2.530
zens-2.473
 indo-2.396
negie-2.378
 Regulatory-2.323
Token below
Token ID2174
Feature activation+2.06e-01

Pos logit contributions
IDS+3.678
 Norn+3.587
 juggling+3.526
thumbnails+3.222
Thor+3.167
Neg logit contributions
theless-4.364
zens-4.195
negie-4.089
 indo-4.060
enko-3.905
Token)
Token ID8
Feature activation-1.81e-01

Pos logit contributions
IDS+0.738
 Norn+0.706
 juggling+0.696
thumbnails+0.655
Thor+0.629
Neg logit contributions
theless-0.923
zens-0.908
 indo-0.880
negie-0.872
 Regulatory-0.863
Token\n
Token ID198
Feature activation-1.61e-01

Pos logit contributions
theless+0.710
zens+0.700
 indo+0.673
 Regulatory+0.666
negie+0.665
Neg logit contributions
IDS-0.743
 Norn-0.713
 juggling-0.709
thumbnails-0.674
Thor-0.646
Token\n
Token ID198
Feature activation-1.16e-01

Pos logit contributions
theless+0.732
 indo+0.719
zens+0.695
negie+0.680
 Regulatory+0.657
Neg logit contributions
IDS-0.601
 Norn-0.570
 juggling-0.552
thumbnails-0.527
Thor-0.523
TokenThe
Token ID464
Feature activation-1.73e-02

Pos logit contributions
theless+0.450
zens+0.446
 indo+0.425
negie+0.422
 Regulatory+0.419
Neg logit contributions
IDS-0.476
 Norn-0.461
 juggling-0.456
thumbnails-0.431
Thor-0.413
Tokencom
Token ID785
Feature activation+2.60e-01

Pos logit contributions
IDS+1.434
 Norn+1.315
thumbnails+1.303
 juggling+1.278
Thor+1.216
Neg logit contributions
theless-2.636
 indo-2.566
zens-2.431
negie-2.383
terday-2.357
Token/
Token ID14
Feature activation+1.49e-01

Pos logit contributions
IDS+0.787
 Norn+0.735
 juggling+0.726
thumbnails+0.699
 blink+0.650
Neg logit contributions
theless-1.332
 indo-1.290
zens-1.279
negie-1.245
orf-1.217
Tokenwatch
Token ID8340
Feature activation+9.71e-02

Pos logit contributions
IDS+0.483
 Norn+0.475
 juggling+0.465
thumbnails+0.420
Thor+0.402
Neg logit contributions
theless-0.711
zens-0.708
 indo-0.684
negie-0.681
 Regulatory-0.674
Token?
Token ID30
Feature activation+6.39e-01

Pos logit contributions
IDS+0.378
 Norn+0.357
 juggling+0.353
thumbnails+0.339
Thor+0.328
Neg logit contributions
theless-0.422
 indo-0.410
zens-0.397
negie-0.388
 Regulatory-0.378
Tokenv
Token ID85
Feature activation+4.08e-01

Pos logit contributions
IDS+1.480
 Norn+1.336
 juggling+1.330
thumbnails+1.237
Thor+1.128
Neg logit contributions
theless-3.693
zens-3.594
 indo-3.529
negie-3.519
 Regulatory-3.475
Token=
Token ID28
Feature activation+9.80e-01

Pos logit contributions
IDS+1.203
 Norn+1.086
 juggling+1.039
thumbnails+1.028
ids+0.967
Neg logit contributions
theless-2.096
 indo-2.053
zens-2.014
negie-2.005
 Regulatory-1.969
Token2
Token ID17
Feature activation+3.81e-01

Pos logit contributions
IDS+3.788
 Norn+3.477
 juggling+3.406
thumbnails+3.313
Thor+3.262
Neg logit contributions
theless-4.199
 indo-3.964
zens-3.942
negie-3.878
 Regulatory-3.874
Tokeng
Token ID70
Feature activation+5.97e-01

Pos logit contributions
IDS+1.452
 Norn+1.360
 juggling+1.334
thumbnails+1.280
Thor+1.247
Neg logit contributions
theless-1.650
zens-1.589
 indo-1.571
negie-1.546
 Regulatory-1.539
Token7
Token ID22
Feature activation+2.13e-01

Pos logit contributions
IDS+2.270
 Norn+2.100
 juggling+2.056
thumbnails+1.983
Thor+1.942
Neg logit contributions
theless-2.616
zens-2.481
 indo-2.480
negie-2.429
 Regulatory-2.420
Tokenp
Token ID79
Feature activation+2.03e-01

Pos logit contributions
IDS+0.815
 Norn+0.768
 juggling+0.755
thumbnails+0.720
Thor+0.700
Neg logit contributions
theless-0.921
zens-0.892
 indo-0.876
negie-0.863
 Regulatory-0.857
Tokenk
Token ID74
Feature activation+3.53e-01

Pos logit contributions
IDS+0.763
 Norn+0.719
 juggling+0.705
thumbnails+0.673
Thor+0.653
Neg logit contributions
theless-0.887
zens-0.857
 indo-0.844
negie-0.831
 Regulatory-0.825
Token\n
Token ID198
Feature activation+3.19e-02

Pos logit contributions
IDS+0.126
 Norn+0.120
 juggling+0.116
thumbnails+0.111
Thor+0.110
Neg logit contributions
theless-0.163
 indo-0.161
zens-0.154
negie-0.152
 Regulatory-0.147
Token[
Token ID58
Feature activation+4.22e-02

Pos logit contributions
IDS+0.131
 Norn+0.126
 juggling+0.124
thumbnails+0.118
Thor+0.115
Neg logit contributions
theless-0.127
zens-0.124
 indo-0.121
negie-0.119
 Regulatory-0.117
Tokenng
Token ID782
Feature activation+1.98e-01

Pos logit contributions
IDS+0.157
 Norn+0.150
 juggling+0.148
thumbnails+0.140
Thor+0.134
Neg logit contributions
theless-0.184
zens-0.181
 indo-0.176
negie-0.174
 Regulatory-0.172
Tokengallery
Token ID24460
Feature activation+3.71e-01

Pos logit contributions
IDS+0.947
 Norn+0.915
 juggling+0.902
thumbnails+0.869
Thor+0.843
Neg logit contributions
theless-0.646
zens-0.631
 indo-0.608
negie-0.600
 Regulatory-0.587
Token id
Token ID4686
Feature activation+4.75e-01

Pos logit contributions
IDS+1.250
 Norn+1.180
 juggling+1.172
thumbnails+1.114
Thor+1.058
Neg logit contributions
theless-1.754
zens-1.720
negie-1.671
 indo-1.661
 Regulatory-1.641
Token=
Token ID28
Feature activation+9.58e-01

Pos logit contributions
IDS+1.438
 Norn+1.360
 juggling+1.334
thumbnails+1.250
Thor+1.180
Neg logit contributions
theless-2.418
zens-2.358
 indo-2.305
negie-2.281
 Regulatory-2.254
Token159
Token ID19707
Feature activation+2.59e-01

Pos logit contributions
IDS+3.749
 Norn+3.511
 juggling+3.418
thumbnails+3.345
Thor+3.232
Neg logit contributions
theless-3.994
zens-3.864
 indo-3.828
negie-3.729
 Regulatory-3.655
Token]
Token ID60
Feature activation+7.78e-02

Pos logit contributions
IDS+1.072
 Norn+1.030
 juggling+1.019
thumbnails+0.969
Thor+0.933
Neg logit contributions
theless-1.014
zens-0.995
 indo-0.960
negie-0.953
 Regulatory-0.939
Token\n
Token ID198
Feature activation-3.21e-01

Pos logit contributions
IDS+0.321
 Norn+0.309
 juggling+0.307
thumbnails+0.290
Thor+0.277
Neg logit contributions
theless-0.306
zens-0.302
 indo-0.288
 Regulatory-0.288
negie-0.286
Token\n
Token ID198
Feature activation-3.70e-01

Pos logit contributions
theless+1.383
 indo+1.336
zens+1.336
negie+1.293
 Regulatory+1.266
Neg logit contributions
IDS-1.237
 Norn-1.183
 juggling-1.161
thumbnails-1.103
Thor-1.073
TokenThe
Token ID464
Feature activation-1.63e-01

Pos logit contributions
theless+1.436
zens+1.424
 indo+1.347
negie+1.343
 Regulatory+1.339
Neg logit contributions
IDS-1.518
 Norn-1.476
 juggling-1.461
thumbnails-1.376
Thor-1.309
Tokenyoutube
Token ID11604
Feature activation+7.15e-03

Pos logit contributions
IDS+0.594
 Norn+0.566
 juggling+0.558
thumbnails+0.513
Thor+0.490
Neg logit contributions
theless-0.948
zens-0.936
 indo-0.906
negie-0.901
 Regulatory-0.889
Token.
Token ID13
Feature activation+5.21e-01

Pos logit contributions
IDS+0.025
 Norn+0.024
 juggling+0.023
thumbnails+0.022
Thor+0.021
Neg logit contributions
theless-0.034
 indo-0.033
zens-0.032
negie-0.031
 Regulatory-0.031
Tokencom
Token ID785
Feature activation+3.08e-01

Pos logit contributions
IDS+1.494
 Norn+1.370
thumbnails+1.368
 juggling+1.343
Thor+1.280
Neg logit contributions
theless-2.782
 indo-2.753
zens-2.589
negie-2.552
 Regulatory-2.508
Token/
Token ID14
Feature activation+2.14e-01

Pos logit contributions
IDS+1.081
 Norn+1.033
 juggling+1.018
thumbnails+0.956
Thor+0.917
Neg logit contributions
theless-1.401
zens-1.379
 indo-1.337
negie-1.325
 Regulatory-1.313
Tokenchannel
Token ID17620
Feature activation+1.78e-01

Pos logit contributions
IDS+0.691
 Norn+0.659
 juggling+0.648
thumbnails+0.604
Thor+0.576
Neg logit contributions
theless-1.039
zens-1.024
 indo-0.995
negie-0.986
 Regulatory-0.977
Token/
Token ID14
Feature activation+9.50e-01

Pos logit contributions
IDS+0.558
 Norn+0.518
 juggling+0.507
thumbnails+0.488
Thor+0.465
Neg logit contributions
theless-0.910
 indo-0.885
zens-0.862
negie-0.851
 Regulatory-0.830
TokenUC
Token ID9598
Feature activation+6.89e-01

Pos logit contributions
IDS+1.986
 Norn+1.709
 juggling+1.686
thumbnails+1.617
Thor+1.453
Neg logit contributions
theless-5.704
zens-5.589
 indo-5.510
negie-5.488
 Regulatory-5.413
Token_
Token ID62
Feature activation+3.80e-01

Pos logit contributions
IDS+2.695
 Norn+2.486
 juggling+2.446
thumbnails+2.368
Thor+2.313
Neg logit contributions
theless-2.932
 indo-2.778
zens-2.775
negie-2.723
 Regulatory-2.715
TokenK
Token ID42
Feature activation+1.78e-01

Pos logit contributions
IDS+1.454
 Norn+1.368
 juggling+1.344
thumbnails+1.286
Thor+1.247
Neg logit contributions
theless-1.640
zens-1.584
 indo-1.559
negie-1.537
 Regulatory-1.524
Tokenk
Token ID74
Feature activation+8.99e-02

Pos logit contributions
IDS+0.673
 Norn+0.637
 juggling+0.627
thumbnails+0.595
Thor+0.576
Neg logit contributions
theless-0.776
zens-0.755
 indo-0.738
negie-0.729
 Regulatory-0.723
TokenBY
Token ID17513
Feature activation+4.82e-01

Pos logit contributions
IDS+0.338
 Norn+0.320
 juggling+0.315
thumbnails+0.299
Thor+0.289
Neg logit contributions
theless-0.393
zens-0.383
 indo-0.374
negie-0.369
 Regulatory-0.366
TokenQ
Token ID48
Feature activation-3.75e-01

Pos logit contributions
IDS+1.729
 Norn+1.636
 juggling+1.609
thumbnails+1.533
Thor+1.474
Neg logit contributions
theless-1.773
zens-1.735
 indo-1.676
negie-1.660
 Regulatory-1.626
TokenARS
Token ID27415
Feature activation-6.31e-02

Pos logit contributions
theless+1.135
zens+1.120
 Regulatory+1.070
 indo+1.067
negie+1.057
Neg logit contributions
IDS-1.812
 juggling-1.786
 Norn-1.786
thumbnails-1.711
Thor-1.649
Token).
Token ID737
Feature activation-1.04e-01

Pos logit contributions
theless+0.252
zens+0.250
 indo+0.243
 Regulatory+0.241
negie+0.239
Neg logit contributions
IDS-0.247
 Norn-0.241
 juggling-0.240
thumbnails-0.228
Thor-0.218
Token Atlanta
Token ID9371
Feature activation-7.00e-02

Pos logit contributions
theless+0.375
zens+0.367
 indo+0.354
negie+0.350
 Regulatory+0.346
Neg logit contributions
IDS-0.464
 Norn-0.448
 juggling-0.443
thumbnails-0.423
Thor-0.409
Token,
Token ID11
Feature activation+3.13e-01

Pos logit contributions
theless+0.295
zens+0.290
 Regulatory+0.287
 indo+0.285
negie+0.281
Neg logit contributions
IDS-0.258
 juggling-0.249
 Norn-0.249
thumbnails-0.239
qqa-0.226
Token GA
Token ID14545
Feature activation+9.46e-01

Pos logit contributions
IDS+1.414
 Norn+1.372
 juggling+1.358
thumbnails+1.294
Thor+1.250
Neg logit contributions
theless-1.097
zens-1.081
 indo-1.038
 Regulatory-1.019
negie-1.018
Token:
Token ID25
Feature activation+9.90e-02

Pos logit contributions
IDS+2.164
 Norn+2.008
 juggling+1.962
thumbnails+1.773
Thor+1.664
Neg logit contributions
theless-5.467
zens-5.395
 indo-5.272
negie-5.223
 Regulatory-5.170
Token US
Token ID1294
Feature activation+1.95e-01

Pos logit contributions
IDS+0.419
 Norn+0.404
 juggling+0.400
thumbnails+0.381
Thor+0.367
Neg logit contributions
theless-0.378
zens-0.370
 indo-0.357
negie-0.354
 Regulatory-0.351
Token Department
Token ID2732
Feature activation+1.61e-01

Pos logit contributions
IDS+0.746
 Norn+0.710
 juggling+0.697
thumbnails+0.658
Thor+0.639
Neg logit contributions
theless-0.835
zens-0.821
 indo-0.796
negie-0.785
 Regulatory-0.774
Token of
Token ID286
Feature activation+1.74e-02

Pos logit contributions
IDS+0.586
 Norn+0.566
 juggling+0.544
thumbnails+0.519
Thor+0.509
Neg logit contributions
theless-0.752
 indo-0.730
zens-0.705
negie-0.681
terday-0.670
Token Health
Token ID3893
Feature activation+3.59e-01

Pos logit contributions
IDS+0.084
 Norn+0.082
 juggling+0.081
thumbnails+0.079
Thor+0.075
Neg logit contributions
theless-0.055
zens-0.053
 Regulatory-0.052
 indo-0.051
negie-0.050
Token Tiff
Token ID35775
Feature activation-1.04e-01

Pos logit contributions
theless+0.671
zens+0.662
 indo+0.637
negie+0.630
 Regulatory+0.628
Neg logit contributions
IDS-0.748
 Norn-0.715
 juggling-0.708
thumbnails-0.679
Thor-0.651
Tokenin
Token ID259
Feature activation+2.95e-02

Pos logit contributions
zens+0.445
theless+0.435
negie+0.423
 indo+0.421
orf+0.419
Neg logit contributions
IDS-0.391
 Norn-0.376
 juggling-0.369
thumbnails-0.344
Thor-0.327
Token,
Token ID11
Feature activation+2.26e-01

Pos logit contributions
IDS+0.114
 Norn+0.109
 juggling+0.107
thumbnails+0.103
Thor+0.099
Neg logit contributions
zens-0.123
theless-0.121
 indo-0.116
 Regulatory-0.116
negie-0.115
Token Iowa
Token ID9406
Feature activation+2.57e-01

Pos logit contributions
IDS+1.030
 Norn+0.995
 juggling+0.984
thumbnails+0.938
Thor+0.908
Neg logit contributions
theless-0.798
zens-0.779
 indo-0.749
negie-0.743
 Regulatory-0.730
Token.
Token ID13
Feature activation+2.97e-02

Pos logit contributions
IDS+1.064
 Norn+1.020
 juggling+1.006
thumbnails+0.968
Thor+0.926
Neg logit contributions
zens-1.003
theless-0.998
 indo-0.951
negie-0.950
 Regulatory-0.947
Token Hide
Token ID10415
Feature activation+9.42e-01

Pos logit contributions
IDS+0.141
 Norn+0.136
 juggling+0.135
thumbnails+0.129
Thor+0.125
Neg logit contributions
theless-0.098
zens-0.096
 indo-0.092
negie-0.091
 Regulatory-0.090
Token Caption
Token ID11260
Feature activation-9.98e-02

Pos logit contributions
IDS+2.641
 Norn+2.535
 juggling+2.381
thumbnails+2.268
Thor+2.265
Neg logit contributions
 indo-5.263
theless-5.156
zens-4.683
negie-4.648
terday-4.627
Token 5
Token ID642
Feature activation-2.05e-02

Pos logit contributions
theless+0.416
 indo+0.404
zens+0.392
negie+0.381
terday+0.374
Neg logit contributions
IDS-0.402
 Norn-0.392
 juggling-0.381
thumbnails-0.371
Thor-0.354
Token of
Token ID286
Feature activation+1.61e-01

Pos logit contributions
theless+0.090
 indo+0.087
zens+0.085
negie+0.083
 Regulatory+0.080
Neg logit contributions
IDS-0.078
 Norn-0.075
 juggling-0.074
thumbnails-0.070
Thor-0.068
Token 15
Token ID1315
Feature activation+1.85e-02

Pos logit contributions
IDS+0.586
 Norn+0.567
 juggling+0.552
thumbnails+0.521
Thor+0.505
Neg logit contributions
theless-0.745
 indo-0.718
zens-0.699
negie-0.688
 Regulatory-0.671
Token Photos
Token ID9434
Feature activation-5.66e-02

Pos logit contributions
IDS+0.058
 Norn+0.054
 juggling+0.053
thumbnails+0.051
Thor+0.048
Neg logit contributions
theless-0.095
 indo-0.092
zens-0.090
negie-0.088
terday-0.086
Token previously
Token ID4271
Feature activation+2.02e-01

Pos logit contributions
theless+0.687
zens+0.679
 indo+0.657
negie+0.646
 Regulatory+0.637
Neg logit contributions
IDS-0.744
 Norn-0.715
 juggling-0.697
thumbnails-0.677
Thor-0.649
Token,
Token ID11
Feature activation+2.57e-01

Pos logit contributions
IDS+0.791
 Norn+0.763
 juggling+0.729
thumbnails+0.715
Thor+0.681
Neg logit contributions
theless-0.843
zens-0.842
 indo-0.818
negie-0.798
 Regulatory-0.790
Token the
Token ID262
Feature activation+8.40e-02

Pos logit contributions
IDS+0.987
 juggling+0.949
 Norn+0.947
thumbnails+0.879
Thor+0.845
Neg logit contributions
theless-1.086
zens-1.059
negie-1.017
 indo-1.013
 Regulatory-1.003
Token "
Token ID366
Feature activation-4.84e-02

Pos logit contributions
IDS+0.344
 Norn+0.331
 juggling+0.329
thumbnails+0.310
Thor+0.298
Neg logit contributions
theless-0.335
zens-0.328
 indo-0.314
negie-0.312
 Regulatory-0.308
TokenR
Token ID49
Feature activation+2.72e-01

Pos logit contributions
theless+0.200
zens+0.198
 indo+0.191
negie+0.189
 Regulatory+0.187
Neg logit contributions
IDS-0.189
 Norn-0.183
 juggling-0.180
thumbnails-0.170
Thor-0.163
Tokeneth
Token ID2788
Feature activation+9.30e-01

Pos logit contributions
IDS+1.162
 Norn+1.103
 juggling+1.095
thumbnails+1.039
Thor+1.013
Neg logit contributions
theless-1.045
zens-1.017
 indo-0.983
negie-0.968
 Regulatory-0.961
Tokeninking
Token ID8040
Feature activation+3.57e-01

Pos logit contributions
IDS+3.657
 Norn+3.506
 juggling+3.434
thumbnails+3.258
 blink+3.151
Neg logit contributions
theless-3.765
 indo-3.658
zens-3.570
negie-3.465
 Regulatory-3.450
Token Mathematics
Token ID39448
Feature activation-1.63e-01

Pos logit contributions
IDS+1.451
 Norn+1.401
 juggling+1.381
thumbnails+1.305
Thor+1.260
Neg logit contributions
theless-1.435
zens-1.394
 indo-1.332
negie-1.327
 Regulatory-1.291
Token:
Token ID25
Feature activation+2.94e-01

Pos logit contributions
theless+0.674
zens+0.664
 indo+0.646
negie+0.636
 Regulatory+0.633
Neg logit contributions
IDS-0.639
 Norn-0.613
 juggling-0.605
thumbnails-0.575
Thor-0.553
Token Teaching
Token ID38094
Feature activation+5.31e-01

Pos logit contributions
IDS+1.179
 Norn+1.128
 juggling+1.118
thumbnails+1.047
Thor+1.010
Neg logit contributions
theless-1.209
zens-1.182
negie-1.128
 indo-1.128
orf-1.092
Token Social
Token ID5483
Feature activation+3.06e-01

Pos logit contributions
IDS+2.053
 Norn+1.979
 juggling+1.947
thumbnails+1.830
Thor+1.753
Neg logit contributions
theless-2.227
zens-2.166
negie-2.086
 indo-2.077
orf-2.026
Token prepare
Token ID8335
Feature activation+5.08e-02

Pos logit contributions
theless+0.012
zens+0.011
 indo+0.011
negie+0.011
 Regulatory+0.011
Neg logit contributions
IDS-0.012
 juggling-0.011
 Norn-0.011
thumbnails-0.011
 blink-0.010
Token for
Token ID329
Feature activation+2.11e-01

Pos logit contributions
IDS+0.201
 Norn+0.193
 juggling+0.190
thumbnails+0.181
Thor+0.174
Neg logit contributions
theless-0.209
zens-0.205
 indo-0.198
negie-0.196
 Regulatory-0.194
Token cases
Token ID2663
Feature activation+2.23e-01

Pos logit contributions
IDS+0.835
 Norn+0.801
 juggling+0.799
thumbnails+0.748
Thor+0.721
Neg logit contributions
theless-0.870
zens-0.848
 indo-0.825
negie-0.818
 Regulatory-0.801
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
IDS+0.860
 Norn+0.826
 juggling+0.818
thumbnails+0.767
Thor+0.742
Neg logit contributions
theless-0.942
zens-0.923
 indo-0.897
negie-0.889
 Regulatory-0.875
Token (
Token ID357
Feature activation+3.58e-01

Pos logit contributions
IDS+1.114
 Norn+1.070
 juggling+1.052
thumbnails+0.997
Thor+0.958
Neg logit contributions
theless-1.259
zens-1.228
 indo-1.198
negie-1.185
 Regulatory-1.152
TokenPhoto
Token ID6191
Feature activation+9.22e-01

Pos logit contributions
IDS+1.296
 Norn+1.238
 juggling+1.220
thumbnails+1.152
Thor+1.105
Neg logit contributions
theless-1.597
zens-1.569
 indo-1.524
negie-1.508
 Regulatory-1.491
Token by
Token ID416
Feature activation+3.59e-01

Pos logit contributions
IDS+1.878
 Norn+1.739
 juggling+1.729
thumbnails+1.510
Thor+1.401
Neg logit contributions
theless-5.546
zens-5.435
 indo-5.321
negie-5.298
 Regulatory-5.188
Token Kathleen
Token ID31275
Feature activation+4.54e-02

Pos logit contributions
IDS+1.477
 Norn+1.421
 juggling+1.402
thumbnails+1.331
Thor+1.285
Neg logit contributions
theless-1.425
zens-1.398
 indo-1.351
negie-1.336
 Regulatory-1.318
Token Flynn
Token ID14654
Feature activation-2.89e-02

Pos logit contributions
IDS+0.186
 Norn+0.178
 juggling+0.177
thumbnails+0.168
Thor+0.161
Neg logit contributions
theless-0.180
zens-0.177
 indo-0.171
negie-0.169
 Regulatory-0.168
Token,
Token ID11
Feature activation+3.02e-01

Pos logit contributions
theless+0.125
zens+0.125
 indo+0.120
negie+0.119
 Regulatory+0.118
Neg logit contributions
IDS-0.107
 Norn-0.103
 juggling-0.102
thumbnails-0.096
Thor-0.092
Token N
Token ID399
Feature activation-1.93e-02

Pos logit contributions
IDS+1.317
 Norn+1.260
 juggling+1.249
thumbnails+1.196
Thor+1.151
Neg logit contributions
theless-1.127
zens-1.089
 indo-1.053
negie-1.042
 Regulatory-1.015
Tokencoming
Token ID4976
Feature activation+5.13e-02

Pos logit contributions
IDS+0.746
 Norn+0.682
 juggling+0.675
thumbnails+0.634
Thor+0.583
Neg logit contributions
theless-1.507
zens-1.465
 indo-1.450
negie-1.426
 Regulatory-1.417
Token/
Token ID14
Feature activation+1.68e-01

Pos logit contributions
IDS+0.178
 Norn+0.170
 juggling+0.167
thumbnails+0.157
Thor+0.150
Neg logit contributions
theless-0.236
zens-0.233
 indo-0.226
negie-0.224
 Regulatory-0.222
Tokenarticle
Token ID20205
Feature activation+6.14e-01

Pos logit contributions
IDS+0.304
 Norn+0.278
 juggling+0.270
thumbnails+0.236
Thor+0.215
Neg logit contributions
theless-1.049
zens-1.038
 indo-1.016
negie-1.008
 Regulatory-1.001
Token35
Token ID2327
Feature activation+6.67e-01

Pos logit contributions
IDS+2.675
 Norn+2.574
 juggling+2.523
thumbnails+2.406
Thor+2.333
Neg logit contributions
theless-2.271
zens-2.236
 indo-2.178
negie-2.138
 Regulatory-2.104
Token79
Token ID3720
Feature activation+7.55e-01

Pos logit contributions
IDS+2.931
 Norn+2.816
 juggling+2.719
thumbnails+2.666
Thor+2.581
Neg logit contributions
theless-2.398
zens-2.340
 indo-2.329
 Regulatory-2.248
negie-2.239
Token23
Token ID1954
Feature activation+9.20e-01

Pos logit contributions
IDS+2.946
 Norn+2.761
 juggling+2.688
thumbnails+2.606
Thor+2.535
Neg logit contributions
theless-3.162
 indo-3.090
zens-3.071
negie-2.981
 Regulatory-2.946
Token12
Token ID1065
Feature activation+5.40e-01

Pos logit contributions
IDS+3.442
 Norn+3.208
 juggling+3.096
thumbnails+3.027
Thor+2.925
Neg logit contributions
theless-3.997
 indo-3.929
zens-3.817
negie-3.781
 Regulatory-3.687
Token.
Token ID13
Feature activation+6.10e-01

Pos logit contributions
IDS+1.647
 Norn+1.575
 juggling+1.485
thumbnails+1.437
Thor+1.396
Neg logit contributions
theless-2.851
 indo-2.762
zens-2.566
negie-2.537
 enthusi-2.479
Tokene
Token ID68
Feature activation-1.03e-01

Pos logit contributions
IDS+1.678
 Norn+1.486
thumbnails+1.425
 juggling+1.377
Thor+1.365
Neg logit contributions
theless-3.303
 indo-3.232
zens-3.106
negie-3.088
 Regulatory-3.014
Tokence
Token ID344
Feature activation+4.08e-01

Pos logit contributions
theless+0.453
zens+0.447
 indo+0.432
negie+0.428
 Regulatory+0.424
Neg logit contributions
IDS-0.377
 Norn-0.362
 juggling-0.356
thumbnails-0.336
Thor-0.323
Token/
Token ID14
Feature activation+2.69e-02

Pos logit contributions
 Norn+1.252
IDS+1.250
 juggling+1.181
thumbnails+1.087
 blink+1.039
Neg logit contributions
theless-2.067
zens-1.991
 indo-1.987
negie-1.963
 Regulatory-1.892
Token in
Token ID287
Feature activation+3.27e-02

Pos logit contributions
theless+0.428
zens+0.419
 indo+0.405
negie+0.401
 Regulatory+0.396
Neg logit contributions
IDS-0.415
 Norn-0.398
 juggling-0.394
thumbnails-0.372
Thor-0.359
Token Pasadena
Token ID41010
Feature activation+1.57e-02

Pos logit contributions
IDS+0.164
 Norn+0.159
 juggling+0.157
thumbnails+0.151
Thor+0.147
Neg logit contributions
theless-0.100
zens-0.097
 indo-0.093
negie-0.092
 Regulatory-0.090
Token,
Token ID11
Feature activation-6.51e-02

Pos logit contributions
IDS+0.061
 Norn+0.059
 juggling+0.058
thumbnails+0.055
Thor+0.054
Neg logit contributions
zens-0.064
theless-0.062
 indo-0.061
negie-0.060
 Regulatory-0.060
Token California
Token ID3442
Feature activation+6.74e-02

Pos logit contributions
zens+0.215
theless+0.213
 indo+0.206
negie+0.202
 Regulatory+0.200
Neg logit contributions
IDS-0.308
 Norn-0.299
 juggling-0.288
thumbnails-0.280
Thor-0.274
Token.
Token ID13
Feature activation+4.99e-02

Pos logit contributions
IDS+0.281
 Norn+0.273
 juggling+0.267
thumbnails+0.254
Thor+0.247
Neg logit contributions
zens-0.259
theless-0.253
negie-0.246
 indo-0.245
 Regulatory-0.245
Token Hide
Token ID10415
Feature activation+9.20e-01

Pos logit contributions
IDS+0.237
 Norn+0.229
 juggling+0.227
thumbnails+0.217
Thor+0.210
Neg logit contributions
theless-0.166
zens-0.162
 indo-0.156
negie-0.153
 Regulatory-0.151
Token Caption
Token ID11260
Feature activation+1.12e-01

Pos logit contributions
IDS+2.594
 Norn+2.531
 juggling+2.320
thumbnails+2.250
Thor+2.175
Neg logit contributions
 indo-5.087
theless-5.059
zens-4.593
terday-4.541
negie-4.532
Token 12
Token ID1105
Feature activation-2.64e-01

Pos logit contributions
IDS+0.434
 Norn+0.410
 juggling+0.403
thumbnails+0.402
Thor+0.378
Neg logit contributions
theless-0.501
 indo-0.484
zens-0.457
negie-0.450
terday-0.446
Token of
Token ID286
Feature activation-1.75e-02

Pos logit contributions
theless+1.113
 indo+1.069
zens+1.064
negie+1.027
 Regulatory+1.009
Neg logit contributions
IDS-1.044
 Norn-1.004
 juggling-0.990
thumbnails-0.939
Thor-0.910
Token 15
Token ID1315
Feature activation-1.42e-01

Pos logit contributions
theless+0.080
 indo+0.077
zens+0.075
negie+0.073
 Regulatory+0.072
Neg logit contributions
IDS-0.065
 Norn-0.062
 juggling-0.062
thumbnails-0.057
Thor-0.056
Token Photos
Token ID9434
Feature activation-1.43e-01

Pos logit contributions
theless+0.713
 indo+0.691
zens+0.678
negie+0.660
terday+0.647
Neg logit contributions
IDS-0.461
 Norn-0.438
 juggling-0.424
thumbnails-0.406
Thor-0.382
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
IDS+1.004
 Norn+0.963
 juggling+0.949
thumbnails+0.896
Thor+0.863
Neg logit contributions
theless-1.127
zens-1.107
 indo-1.075
negie-1.063
 Regulatory-1.050
Token<|endoftext|>
Token ID50256
Feature activation+4.36e-02

Pos logit contributions
IDS+0.114
 Norn+0.109
 juggling+0.108
thumbnails+0.103
Thor+0.098
Neg logit contributions
theless-0.113
zens-0.111
 indo-0.107
negie-0.106
 Regulatory-0.106
TokenAD
Token ID2885
Feature activation+1.40e-01

Pos logit contributions
IDS+0.178
 Norn+0.172
 juggling+0.170
thumbnails+0.161
Thor+0.154
Neg logit contributions
theless-0.172
zens-0.170
 indo-0.163
negie-0.162
 Regulatory-0.160
TokenEL
Token ID3698
Feature activation-3.12e-01

Pos logit contributions
IDS+0.566
 Norn+0.543
 juggling+0.536
thumbnails+0.508
Thor+0.490
Neg logit contributions
theless-0.567
zens-0.557
 indo-0.539
negie-0.533
 Regulatory-0.526
TokenA
Token ID32
Feature activation+2.32e-01

Pos logit contributions
theless+1.238
zens+1.209
 indo+1.166
negie+1.157
 Regulatory+1.151
Neg logit contributions
IDS-1.266
 Norn-1.222
 juggling-1.207
thumbnails-1.148
Thor-1.099
TokenIDE
Token ID14114
Feature activation+9.11e-01

Pos logit contributions
IDS+0.370
 Norn+0.355
 juggling+0.346
thumbnails+0.301
Thor+0.262
Neg logit contributions
theless-1.471
zens-1.453
 Regulatory-1.423
 indo-1.413
negie-1.406
Token captain
Token ID10654
Feature activation-2.92e-02

Pos logit contributions
IDS+3.467
 Norn+3.295
 juggling+3.250
thumbnails+2.985
Thor+2.926
Neg logit contributions
theless-3.840
zens-3.765
negie-3.705
 indo-3.662
 Regulatory-3.534
Token Taylor
Token ID8121
Feature activation+1.94e-01

Pos logit contributions
theless+0.118
zens+0.116
 indo+0.112
 Regulatory+0.110
negie+0.110
Neg logit contributions
IDS-0.117
 Norn-0.112
 juggling-0.111
thumbnails-0.106
Thor-0.102
Token Walker
Token ID10120
Feature activation+1.14e-01

Pos logit contributions
IDS+0.714
 Norn+0.684
 juggling+0.672
thumbnails+0.632
Thor+0.611
Neg logit contributions
theless-0.852
zens-0.835
 indo-0.811
negie-0.804
 Regulatory-0.791
Token says
Token ID1139
Feature activation+1.91e-01

Pos logit contributions
IDS+0.449
 Norn+0.433
 juggling+0.428
thumbnails+0.402
Thor+0.389
Neg logit contributions
theless-0.473
zens-0.464
 indo-0.449
negie-0.446
 Regulatory-0.440
Token the
Token ID262
Feature activation+9.03e-02

Pos logit contributions
IDS+0.717
 Norn+0.687
 juggling+0.677
thumbnails+0.640
Thor+0.615
Neg logit contributions
theless-0.825
zens-0.812
 indo-0.786
negie-0.778
 Regulatory-0.770
Token make
Token ID787
Feature activation-9.32e-02

Pos logit contributions
theless+0.298
zens+0.293
 indo+0.283
negie+0.280
 Regulatory+0.277
Neg logit contributions
IDS-0.268
 Norn-0.257
 juggling-0.254
thumbnails-0.240
Thor-0.231
Token overcl
Token ID40397
Feature activation-1.70e-02

Pos logit contributions
theless+0.375
zens+0.368
 indo+0.356
negie+0.352
 Regulatory+0.348
Neg logit contributions
IDS-0.378
 Norn-0.363
 juggling-0.359
thumbnails-0.339
Thor-0.327
Tokenocking
Token ID8629
Feature activation+8.25e-02

Pos logit contributions
theless+0.051
zens+0.050
 indo+0.047
negie+0.047
 Regulatory+0.046
Neg logit contributions
IDS-0.086
 Norn-0.083
 juggling-0.083
thumbnails-0.079
Thor-0.077
Token easier
Token ID4577
Feature activation-1.52e-01

Pos logit contributions
IDS+0.319
 Norn+0.306
 juggling+0.304
thumbnails+0.285
Thor+0.276
Neg logit contributions
theless-0.346
zens-0.339
 indo-0.329
negie-0.327
 Regulatory-0.322
Token than
Token ID621
Feature activation+5.26e-01

Pos logit contributions
theless+0.636
zens+0.626
 indo+0.603
negie+0.599
 Regulatory+0.591
Neg logit contributions
IDS-0.594
 Norn-0.570
 juggling-0.560
thumbnails-0.534
Thor-0.513
Token ever
Token ID1683
Feature activation+9.10e-01

Pos logit contributions
IDS+1.590
 juggling+1.518
 Norn+1.511
thumbnails+1.331
 blink+1.310
Neg logit contributions
theless-2.656
zens-2.581
 indo-2.543
negie-2.523
orf-2.470
Token.
Token ID13
Feature activation+2.82e-01

Pos logit contributions
IDS+3.411
 Norn+3.266
 juggling+3.266
thumbnails+2.951
Thor+2.920
Neg logit contributions
theless-3.790
zens-3.751
 indo-3.671
negie-3.607
 Regulatory-3.551
Token With
Token ID2080
Feature activation+4.63e-01

Pos logit contributions
IDS+1.143
 Norn+1.105
 juggling+1.071
thumbnails+1.010
Thor+1.001
Neg logit contributions
theless-1.129
zens-1.108
 indo-1.085
negie-1.067
orf-1.035
Token the
Token ID262
Feature activation+1.70e-01

Pos logit contributions
IDS+1.828
 Norn+1.764
 juggling+1.727
thumbnails+1.602
Thor+1.573
Neg logit contributions
theless-1.887
zens-1.838
 indo-1.810
negie-1.787
 Regulatory-1.749
Token industry
Token ID2831
Feature activation-1.50e-01

Pos logit contributions
IDS+0.688
 Norn+0.667
 juggling+0.656
thumbnails+0.612
Thor+0.598
Neg logit contributions
theless-0.681
zens-0.668
 indo-0.646
negie-0.642
 Regulatory-0.630
Token first
Token ID717
Feature activation-2.13e-02

Pos logit contributions
theless+0.565
zens+0.554
 indo+0.534
negie+0.528
 Regulatory+0.521
Neg logit contributions
IDS-0.646
 Norn-0.622
 juggling-0.615
thumbnails-0.586
Thor-0.566
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
IDS+0.433
 Norn+0.416
 juggling+0.410
thumbnails+0.388
Thor+0.375
Neg logit contributions
theless-0.449
zens-0.441
 indo-0.427
negie-0.422
 Regulatory-0.418
Token\n
Token ID198
Feature activation+2.23e-01

Pos logit contributions
IDS+0.464
 Norn+0.448
 juggling+0.439
thumbnails+0.417
Thor+0.404
Neg logit contributions
theless-0.482
zens-0.472
 indo-0.458
negie-0.452
 Regulatory-0.442
Token\n
Token ID198
Feature activation+2.11e-01

Pos logit contributions
IDS+0.786
 Norn+0.742
 juggling+0.722
thumbnails+0.687
Thor+0.681
Neg logit contributions
theless-1.057
 indo-1.044
zens-0.990
negie-0.974
 Regulatory-0.944
TokenStatistics
Token ID48346
Feature activation+1.29e-02

Pos logit contributions
IDS+0.684
 Norn+0.668
 juggling+0.656
thumbnails+0.599
 blink+0.563
Neg logit contributions
zens-0.997
theless-0.994
negie-0.956
 indo-0.946
 Regulatory-0.944
Token used
Token ID973
Feature activation+1.42e-01

Pos logit contributions
IDS+0.058
 Norn+0.056
 juggling+0.056
thumbnails+0.053
Thor+0.051
Neg logit contributions
theless-0.046
zens-0.045
 indo-0.043
negie-0.043
 Regulatory-0.042
Token courtesy
Token ID12537
Feature activation+8.99e-01

Pos logit contributions
IDS+0.591
 Norn+0.568
 juggling+0.558
thumbnails+0.532
Thor+0.515
Neg logit contributions
theless-0.556
zens-0.547
 indo-0.528
negie-0.521
 Regulatory-0.518
Token of
Token ID286
Feature activation+3.63e-01

Pos logit contributions
IDS+3.857
 Norn+3.708
 juggling+3.656
thumbnails+3.492
Thor+3.364
Neg logit contributions
theless-3.418
zens-3.346
 indo-3.248
negie-3.200
 Regulatory-3.182
Token Basketball
Token ID25911
Feature activation-1.40e-01

Pos logit contributions
IDS+1.028
 Norn+0.971
 juggling+0.953
thumbnails+0.881
Thor+0.834
Neg logit contributions
theless-1.903
zens-1.878
 indo-1.829
negie-1.815
 Regulatory-1.798
Token-
Token ID12
Feature activation+2.64e-01

Pos logit contributions
theless+0.610
zens+0.602
 indo+0.583
negie+0.579
 Regulatory+0.574
Neg logit contributions
IDS-0.520
 Norn-0.498
 juggling-0.491
thumbnails-0.464
Thor-0.447
TokenReference
Token ID26687
Feature activation+1.61e-01

Pos logit contributions
IDS+0.881
 Norn+0.839
 juggling+0.826
thumbnails+0.774
Thor+0.740
Neg logit contributions
theless-1.246
zens-1.227
 indo-1.192
negie-1.182
 Regulatory-1.170
Token.
Token ID13
Feature activation+3.70e-01

Pos logit contributions
IDS+0.617
 Norn+0.591
 juggling+0.584
thumbnails+0.551
Thor+0.531
Neg logit contributions
theless-0.687
zens-0.674
 indo-0.653
negie-0.646
 Regulatory-0.637
Token with
Token ID351
Feature activation-2.49e-01

Pos logit contributions
IDS+0.055
 Norn+0.052
 juggling+0.050
thumbnails+0.048
Thor+0.045
Neg logit contributions
zens-0.074
theless-0.073
 indo-0.071
negie-0.070
 Regulatory-0.070
Token the
Token ID262
Feature activation-1.71e-01

Pos logit contributions
theless+1.026
zens+1.019
 indo+0.975
 Regulatory+0.970
negie+0.969
Neg logit contributions
IDS-0.966
 Norn-0.929
 juggling-0.912
thumbnails-0.870
Thor-0.837
Token same
Token ID976
Feature activation-2.81e-01

Pos logit contributions
theless+0.714
zens+0.701
 indo+0.679
negie+0.672
 Regulatory+0.665
Neg logit contributions
IDS-0.666
 Norn-0.639
 juggling-0.631
thumbnails-0.597
Thor-0.575
Token short
Token ID1790
Feature activation+1.65e-01

Pos logit contributions
theless+1.127
zens+1.108
 indo+1.071
negie+1.057
 Regulatory+1.051
Neg logit contributions
IDS-1.138
 Norn-1.092
 juggling-1.074
thumbnails-1.023
Thor-0.986
Token-
Token ID12
Feature activation+2.35e-01

Pos logit contributions
IDS+0.648
 Norn+0.620
 juggling+0.605
thumbnails+0.576
Thor+0.555
Neg logit contributions
zens-0.680
theless-0.679
 indo-0.648
negie-0.638
 Regulatory-0.638
Tokenterm
Token ID4354
Feature activation+8.98e-01

Pos logit contributions
IDS+0.739
 Norn+0.703
 juggling+0.692
thumbnails+0.648
Thor+0.617
Neg logit contributions
theless-1.156
zens-1.138
 indo-1.114
negie-1.100
 Regulatory-1.088
Token disability
Token ID14013
Feature activation-4.64e-02

Pos logit contributions
IDS+3.412
 Norn+3.319
 juggling+3.311
thumbnails+3.038
 blink+2.980
Neg logit contributions
theless-3.737
zens-3.656
 indo-3.542
negie-3.538
 Regulatory-3.443
Token proposal
Token ID6961
Feature activation+1.23e-01

Pos logit contributions
theless+0.205
zens+0.202
 indo+0.196
negie+0.194
 Regulatory+0.192
Neg logit contributions
IDS-0.169
 Norn-0.161
 juggling-0.158
thumbnails-0.151
Thor-0.144
Token the
Token ID262
Feature activation+5.72e-02

Pos logit contributions
IDS+0.475
 Norn+0.456
 juggling+0.449
thumbnails+0.426
Thor+0.409
Neg logit contributions
theless-0.519
zens-0.511
 indo-0.494
negie-0.489
 Regulatory-0.484
Token Conservatives
Token ID17897
Feature activation+2.71e-01

Pos logit contributions
IDS+0.216
 Norn+0.207
 juggling+0.204
thumbnails+0.193
Thor+0.186
Neg logit contributions
theless-0.245
zens-0.241
 indo-0.233
negie-0.231
 Regulatory-0.229
Token made
Token ID925
Feature activation-6.44e-02

Pos logit contributions
IDS+1.068
 Norn+1.028
 juggling+1.025
thumbnails+0.956
Thor+0.927
Neg logit contributions
theless-1.103
zens-1.085
 indo-1.047
negie-1.042
 Regulatory-1.027
Token-
Token ID12
Feature activation-2.55e-01

Pos logit contributions
theless+0.888
zens+0.874
 indo+0.847
negie+0.838
 Regulatory+0.829
Neg logit contributions
IDS-0.768
 Norn-0.736
 juggling-0.726
thumbnails-0.686
Thor-0.659
Tokenout
Token ID448
Feature activation+2.30e-01

Pos logit contributions
theless+0.960
zens+0.943
 indo+0.901
negie+0.894
 Regulatory+0.879
Neg logit contributions
IDS-1.093
 Norn-1.058
 juggling-1.042
thumbnails-0.987
Thor-0.955
Token-
Token ID12
Feature activation+2.44e-01

Pos logit contributions
IDS+0.915
 Norn+0.877
 juggling+0.857
thumbnails+0.825
Thor+0.782
Neg logit contributions
zens-0.947
theless-0.940
 indo-0.906
negie-0.892
orf-0.869
Tokenthe
Token ID1169
Feature activation+1.30e-01

Pos logit contributions
IDS+0.987
 Norn+0.951
 juggling+0.938
thumbnails+0.887
Thor+0.853
Neg logit contributions
theless-0.977
zens-0.964
 indo-0.928
negie-0.920
 Regulatory-0.906
Token-
Token ID12
Feature activation+4.27e-01

Pos logit contributions
IDS+0.426
 Norn+0.407
 juggling+0.403
thumbnails+0.381
Thor+0.363
Neg logit contributions
theless-0.628
 indo-0.603
zens-0.603
negie-0.586
 Regulatory-0.581
Tokenvote
Token ID27257
Feature activation+8.91e-01

Pos logit contributions
IDS+1.380
 Norn+1.313
 juggling+1.291
thumbnails+1.206
Thor+1.151
Neg logit contributions
theless-2.069
zens-2.040
 indo-1.982
negie-1.965
 Regulatory-1.946
Token efforts
Token ID4040
Feature activation+5.09e-01

Pos logit contributions
IDS+3.669
 juggling+3.624
 Norn+3.504
thumbnails+3.300
 blink+3.273
Neg logit contributions
theless-3.411
negie-3.186
zens-3.172
 Regulatory-3.106
orf-3.027
Token,
Token ID11
Feature activation+2.17e-01

Pos logit contributions
IDS+1.846
 juggling+1.793
 Norn+1.781
thumbnails+1.629
Thor+1.594
Neg logit contributions
theless-2.233
zens-2.140
 indo-2.106
negie-2.106
 Regulatory-2.070
Token and
Token ID290
Feature activation+1.83e-01

Pos logit contributions
IDS+0.911
 Norn+0.881
 juggling+0.875
thumbnails+0.822
Thor+0.800
Neg logit contributions
theless-0.842
zens-0.819
 indo-0.794
negie-0.787
 Regulatory-0.774
Token populist
Token ID26815
Feature activation-3.93e-01

Pos logit contributions
IDS+0.719
 Norn+0.694
 juggling+0.689
thumbnails+0.643
Thor+0.623
Neg logit contributions
theless-0.753
zens-0.734
 indo-0.714
negie-0.711
 Regulatory-0.698
Token Sen
Token ID2311
Feature activation-1.44e-01

Pos logit contributions
theless+1.578
zens+1.548
 indo+1.496
negie+1.483
 Regulatory+1.463
Neg logit contributions
IDS-1.593
 Norn-1.534
 juggling-1.515
thumbnails-1.436
Thor-1.386
Token in
Token ID287
Feature activation+2.37e-02

Pos logit contributions
IDS+0.661
 Norn+0.636
 juggling+0.634
thumbnails+0.594
Thor+0.574
Neg logit contributions
theless-0.704
zens-0.692
 indo-0.668
negie-0.663
 Regulatory-0.654
Token the
Token ID262
Feature activation+1.69e-02

Pos logit contributions
IDS+0.098
 Norn+0.094
 juggling+0.094
thumbnails+0.089
Thor+0.086
Neg logit contributions
theless-0.093
zens-0.092
 indo-0.088
negie-0.087
 Regulatory-0.086
Token business
Token ID1597
Feature activation+1.24e-01

Pos logit contributions
IDS+0.068
 Norn+0.066
 juggling+0.066
thumbnails+0.062
Thor+0.060
Neg logit contributions
theless-0.067
zens-0.066
 indo-0.064
negie-0.063
 Regulatory-0.062
Token long
Token ID890
Feature activation+7.51e-02

Pos logit contributions
IDS+0.468
 Norn+0.448
 juggling+0.442
thumbnails+0.417
Thor+0.401
Neg logit contributions
theless-0.535
zens-0.527
 indo-0.510
negie-0.505
 Regulatory-0.500
Token-
Token ID12
Feature activation+2.29e-02

Pos logit contributions
IDS+0.339
 Norn+0.325
 juggling+0.320
thumbnails+0.306
Thor+0.297
Neg logit contributions
theless-0.269
zens-0.263
 indo-0.252
negie-0.248
 Regulatory-0.247
Tokenterm
Token ID4354
Feature activation+8.88e-01

Pos logit contributions
IDS+0.078
 Norn+0.074
 juggling+0.073
thumbnails+0.068
Thor+0.065
Neg logit contributions
theless-0.107
zens-0.105
 indo-0.101
negie-0.101
 Regulatory-0.100
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
IDS+3.115
 Norn+3.001
 juggling+2.996
thumbnails+2.760
Thor+2.674
Neg logit contributions
theless-3.991
zens-3.936
 indo-3.808
negie-3.777
 Regulatory-3.734
Token could
Token ID714
Feature activation-6.44e-03

Pos logit contributions
IDS+0.767
 juggling+0.744
 Norn+0.736
thumbnails+0.687
Thor+0.664
Neg logit contributions
theless-0.850
zens-0.833
 indo-0.804
negie-0.798
 Regulatory-0.781
Token you
Token ID345
Feature activation-8.94e-03

Pos logit contributions
theless+0.027
zens+0.027
 indo+0.026
negie+0.026
 Regulatory+0.026
Neg logit contributions
IDS-0.024
 Norn-0.023
 juggling-0.023
thumbnails-0.022
Thor-0.021
Token see
Token ID766
Feature activation-1.67e-02

Pos logit contributions
theless+0.036
zens+0.035
 indo+0.034
negie+0.034
 Regulatory+0.033
Neg logit contributions
IDS-0.036
 Norn-0.035
 juggling-0.035
thumbnails-0.032
 blink-0.032
Token yourself
Token ID3511
Feature activation-2.61e-01

Pos logit contributions
theless+0.073
zens+0.072
 indo+0.069
negie+0.069
 Regulatory+0.068
Neg logit contributions
IDS-0.062
 Norn-0.059
 juggling-0.059
thumbnails-0.055
Thor-0.053
TokenR
Token ID49
Feature activation-3.00e-02

Pos logit contributions
zens+1.240
theless+1.237
 indo+1.184
negie+1.171
 Regulatory+1.167
Neg logit contributions
IDS-1.086
 Norn-1.056
 juggling-1.044
thumbnails-0.971
Thor-0.932
Tokenc
Token ID66
Feature activation+9.29e-02

Pos logit contributions
theless+0.112
zens+0.110
 indo+0.106
negie+0.105
 Regulatory+0.104
Neg logit contributions
IDS-0.129
 Norn-0.125
 juggling-0.123
thumbnails-0.118
Thor-0.113
Token`,
Token ID47671
Feature activation+1.01e-03

Pos logit contributions
IDS+0.329
 Norn+0.315
 juggling+0.310
thumbnails+0.292
Thor+0.280
Neg logit contributions
theless-0.420
zens-0.414
 indo-0.401
negie-0.397
 Regulatory-0.393
Token or
Token ID393
Feature activation-2.65e-01

Pos logit contributions
IDS+0.004
 Norn+0.003
 juggling+0.003
thumbnails+0.003
Thor+0.003
Neg logit contributions
theless-0.004
zens-0.004
 indo-0.004
 Regulatory-0.004
negie-0.004
Token `
Token ID4600
Feature activation-1.52e-01

Pos logit contributions
theless+1.084
zens+1.077
 indo+1.037
 Regulatory+1.025
negie+1.014
Neg logit contributions
IDS-1.037
 Norn-0.998
 juggling-0.976
thumbnails-0.945
Thor-0.904
Token&
Token ID5
Feature activation-9.68e-01

Pos logit contributions
theless+0.648
zens+0.642
 indo+0.618
negie+0.613
 Regulatory+0.606
Neg logit contributions
IDS-0.569
 Norn-0.548
 juggling-0.542
thumbnails-0.508
Thor-0.488
Token`
Token ID63
Feature activation-1.92e-01

Pos logit contributions
zens+3.979
theless+3.924
 Regulatory+3.759
 indo+3.755
negie+3.716
Neg logit contributions
IDS-3.600
 Norn-3.527
 juggling-3.497
thumbnails-3.268
qqa-3.124
Token)
Token ID8
Feature activation-4.97e-01

Pos logit contributions
theless+0.808
zens+0.798
 indo+0.773
negie+0.762
 Regulatory+0.757
Neg logit contributions
IDS-0.736
 Norn-0.707
 juggling-0.697
thumbnails-0.661
Thor-0.636
Token at
Token ID379
Feature activation-1.84e-01

Pos logit contributions
theless+2.345
zens+2.334
 indo+2.290
 Regulatory+2.230
negie+2.228
Neg logit contributions
IDS-1.605
 Norn-1.528
 juggling-1.487
thumbnails-1.422
Thor-1.359
Token some
Token ID617
Feature activation-1.46e-01

Pos logit contributions
zens+0.765
theless+0.759
 indo+0.743
 Regulatory+0.727
negie+0.724
Neg logit contributions
IDS-0.699
 Norn-0.675
 juggling-0.664
thumbnails-0.640
Thor-0.613
Token point
Token ID966
Feature activation+5.86e-01

Pos logit contributions
theless+0.552
zens+0.544
 indo+0.526
negie+0.517
 Regulatory+0.514
Neg logit contributions
IDS-0.624
 Norn-0.601
 juggling-0.592
thumbnails-0.567
Thor-0.547
Token to
Token ID284
Feature activation-3.10e-03

Pos logit contributions
IDS+0.082
 Norn+0.078
 juggling+0.075
thumbnails+0.071
Thor+0.068
Neg logit contributions
theless-0.131
zens-0.127
 indo-0.127
negie-0.124
 Regulatory-0.121
Token Rep
Token ID1432
Feature activation+3.16e-02

Pos logit contributions
theless+0.012
zens+0.012
 indo+0.012
negie+0.012
 Regulatory+0.011
Neg logit contributions
IDS-0.013
 Norn-0.012
 juggling-0.012
thumbnails-0.012
Thor-0.011
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
IDS+0.134
 juggling+0.130
 Norn+0.129
thumbnails+0.123
Thor+0.116
Neg logit contributions
zens-0.119
theless-0.114
negie-0.114
 indo-0.113
 Regulatory-0.112
Token Will
Token ID2561
Feature activation+1.36e-02

Pos logit contributions
IDS+1.160
 Norn+1.121
 juggling+1.097
thumbnails+1.030
Thor+1.011
Neg logit contributions
theless-1.210
zens-1.178
 indo-1.144
negie-1.136
 Regulatory-1.102
Token H
Token ID367
Feature activation+2.60e-02

Pos logit contributions
IDS+0.059
 Norn+0.058
 juggling+0.057
thumbnails+0.053
Thor+0.053
Neg logit contributions
theless-0.051
zens-0.049
 indo-0.047
negie-0.047
 Regulatory-0.046
Tokenurd
Token ID2799
Feature activation-9.55e-01

Pos logit contributions
IDS+0.145
 Norn+0.141
 juggling+0.140
thumbnails+0.135
Thor+0.131
Neg logit contributions
theless-0.065
zens-0.063
 indo-0.059
negie-0.058
 Regulatory-0.057
Token,
Token ID11
Feature activation+1.12e-01

Pos logit contributions
zens+3.755
theless+3.671
orf+3.596
 Regulatory+3.516
 indo+3.515
Neg logit contributions
IDS-3.819
 Norn-3.654
 juggling-3.599
thumbnails-3.564
qqa-3.206
Token R
Token ID371
Feature activation+8.71e-02

Pos logit contributions
IDS+0.513
 Norn+0.496
 juggling+0.491
thumbnails+0.467
Thor+0.455
Neg logit contributions
theless-0.391
zens-0.382
 indo-0.367
negie-0.365
 Regulatory-0.356
Token-
Token ID12
Feature activation+1.64e-01

Pos logit contributions
IDS+0.293
 Norn+0.272
 juggling+0.264
thumbnails+0.253
Thor+0.245
Neg logit contributions
theless-0.424
zens-0.408
 indo-0.407
negie-0.393
 Regulatory-0.386
TokenTexas
Token ID21607
Feature activation+9.59e-03

Pos logit contributions
IDS+0.882
 Norn+0.855
 juggling+0.844
thumbnails+0.814
Thor+0.795
Neg logit contributions
theless-0.447
zens-0.433
 indo-0.414
negie-0.407
 Regulatory-0.397
Token,
Token ID11
Feature activation+8.53e-03

Pos logit contributions
IDS+0.037
 Norn+0.036
 juggling+0.035
thumbnails+0.034
Thor+0.032
Neg logit contributions
zens-0.039
theless-0.039
 indo-0.038
negie-0.038
 Regulatory-0.037
Token in
Token ID287
Feature activation-1.13e-01

Pos logit contributions
theless+0.118
zens+0.116
 indo+0.112
negie+0.111
 Regulatory+0.110
Neg logit contributions
IDS-0.115
 Norn-0.110
 juggling-0.109
thumbnails-0.103
Thor-0.099
Token connection
Token ID4637
Feature activation+2.92e-01

Pos logit contributions
theless+0.431
zens+0.424
 indo+0.408
negie+0.403
 Regulatory+0.399
Neg logit contributions
IDS-0.483
 Norn-0.465
 juggling-0.459
thumbnails-0.438
Thor-0.423
Token with
Token ID351
Feature activation+7.13e-04

Pos logit contributions
IDS+0.959
 Norn+0.910
 juggling+0.897
thumbnails+0.841
Thor+0.781
Neg logit contributions
theless-1.413
zens-1.407
negie-1.357
 indo-1.349
 Regulatory-1.343
Token the
Token ID262
Feature activation-7.48e-02

Pos logit contributions
IDS+0.003
 Norn+0.002
 juggling+0.002
thumbnails+0.002
Thor+0.002
Neg logit contributions
theless-0.003
zens-0.003
 indo-0.003
negie-0.003
 Regulatory-0.003
Token fl
Token ID781
Feature activation-9.75e-02

Pos logit contributions
theless+0.303
zens+0.296
 indo+0.287
negie+0.284
 Regulatory+0.280
Neg logit contributions
IDS-0.302
 Norn-0.290
 juggling-0.287
thumbnails-0.271
Thor-0.261
Tokenot
Token ID313
Feature activation-9.32e-01

Pos logit contributions
theless+0.399
 indo+0.389
zens+0.363
 Regulatory+0.363
negie+0.356
Neg logit contributions
IDS-0.399
 Norn-0.379
 juggling-0.374
thumbnails-0.366
Thor-0.353
Tokenilla
Token ID5049
Feature activation-1.25e-01

Pos logit contributions
theless+2.980
zens+2.941
negie+2.772
 indo+2.771
 Regulatory+2.718
Neg logit contributions
IDS-4.518
 Norn-4.365
 juggling-4.325
thumbnails-4.130
Thor-4.026
Token incident
Token ID4519
Feature activation-4.76e-02

Pos logit contributions
theless+0.498
zens+0.490
 indo+0.473
negie+0.467
 Regulatory+0.462
Neg logit contributions
IDS-0.508
 Norn-0.489
 juggling-0.482
thumbnails-0.458
Thor-0.442
Token and
Token ID290
Feature activation-1.54e-01

Pos logit contributions
theless+0.198
zens+0.195
 indo+0.188
negie+0.186
 Regulatory+0.184
Neg logit contributions
IDS-0.186
 Norn-0.179
 juggling-0.176
thumbnails-0.167
Thor-0.161
Token will
Token ID481
Feature activation-1.10e-01

Pos logit contributions
theless+0.638
zens+0.626
 indo+0.606
negie+0.600
 Regulatory+0.593
Neg logit contributions
IDS-0.608
 Norn-0.585
 juggling-0.577
thumbnails-0.547
Thor-0.527
Token prevent
Token ID2948
Feature activation-4.74e-02

Pos logit contributions
theless+0.440
zens+0.427
 indo+0.414
negie+0.413
 Regulatory+0.403
Neg logit contributions
IDS-0.443
 Norn-0.432
 juggling-0.426
thumbnails-0.400
 blink-0.388
Token Try
Token ID9993
Feature activation+6.14e-02

Pos logit contributions
IDS+1.315
 Norn+1.264
 juggling+1.254
thumbnails+1.188
Thor+1.148
Neg logit contributions
theless-1.134
zens-1.114
 indo-1.072
negie-1.060
 Regulatory-1.042
Token {
Token ID1391
Feature activation-1.45e-01

Pos logit contributions
IDS+0.129
 Norn+0.122
 juggling+0.117
thumbnails+0.106
Thor+0.097
Neg logit contributions
zens-0.361
theless-0.360
 indo-0.350
negie-0.348
 Regulatory-0.347
Token for
Token ID329
Feature activation-2.72e-01

Pos logit contributions
theless+0.578
zens+0.568
 indo+0.552
 Regulatory+0.543
negie+0.542
Neg logit contributions
IDS-0.586
 Norn-0.566
 juggling-0.555
thumbnails-0.530
Thor-0.511
Token ($
Token ID7198
Feature activation-1.87e-01

Pos logit contributions
theless+1.101
zens+1.088
 indo+1.047
 Regulatory+1.044
negie+1.035
Neg logit contributions
IDS-1.076
 Norn-1.036
 juggling-1.022
thumbnails-0.981
Thor-0.934
Tokeni
Token ID72
Feature activation-4.82e-01

Pos logit contributions
theless+0.726
zens+0.716
 indo+0.684
negie+0.681
 Regulatory+0.673
Neg logit contributions
IDS-0.774
 Norn-0.750
 juggling-0.740
thumbnails-0.699
Thor-0.675
Token ;
Token ID2162
Feature activation-9.29e-01

Pos logit contributions
zens+2.081
theless+2.074
negie+2.002
 indo+1.996
 Regulatory+1.962
Neg logit contributions
IDS-1.751
 Norn-1.678
 juggling-1.656
thumbnails-1.572
Thor-1.513
Token $
Token ID720
Feature activation-3.79e-01

Pos logit contributions
theless+2.498
zens+2.452
negie+2.297
orf+2.273
 indo+2.252
Neg logit contributions
IDS-4.761
 Norn-4.661
 juggling-4.573
thumbnails-4.464
Thor-4.370
Tokeni
Token ID72
Feature activation-4.02e-01

Pos logit contributions
theless+1.378
zens+1.364
negie+1.288
 indo+1.281
orf+1.277
Neg logit contributions
IDS-1.639
 Norn-1.611
 juggling-1.600
thumbnails-1.484
Thor-1.463
Token -
Token ID532
Feature activation-5.62e-01

Pos logit contributions
theless+1.527
zens+1.518
 indo+1.444
negie+1.442
 Regulatory+1.423
Neg logit contributions
IDS-1.681
 Norn-1.637
 juggling-1.611
thumbnails-1.543
Thor-1.481
Tokenle
Token ID293
Feature activation-2.97e-01

Pos logit contributions
theless+2.164
zens+2.144
 indo+2.053
negie+2.026
 Regulatory+2.010
Neg logit contributions
IDS-2.345
 Norn-2.253
 juggling-2.240
thumbnails-2.120
Thor-2.032
Token $
Token ID720
Feature activation+1.98e-01

Pos logit contributions
theless+1.052
zens+1.043
 indo+0.995
negie+0.988
 Regulatory+0.975
Neg logit contributions
IDS-1.335
 Norn-1.287
 juggling-1.270
thumbnails-1.221
Thor-1.176
Token want
Token ID765
Feature activation+2.34e-02

Pos logit contributions
theless+0.383
zens+0.375
 indo+0.362
negie+0.362
 Regulatory+0.353
Neg logit contributions
IDS-0.362
 Norn-0.350
 juggling-0.347
thumbnails-0.321
 blink-0.313
Token to
Token ID284
Feature activation+7.20e-02

Pos logit contributions
IDS+0.083
 Norn+0.080
 juggling+0.078
thumbnails+0.074
Thor+0.071
Neg logit contributions
theless-0.106
zens-0.104
 indo-0.101
negie-0.100
 Regulatory-0.099
Token kick
Token ID4829
Feature activation+1.19e-01

Pos logit contributions
IDS+0.282
 juggling+0.274
 Norn+0.274
 blink+0.249
thumbnails+0.249
Neg logit contributions
theless-0.300
zens-0.292
negie-0.283
 indo-0.278
 Regulatory-0.273
Token anyone
Token ID2687
Feature activation+7.83e-02

Pos logit contributions
IDS+0.434
 Norn+0.416
 juggling+0.410
thumbnails+0.386
Thor+0.372
Neg logit contributions
theless-0.527
zens-0.516
 indo-0.501
negie-0.498
 Regulatory-0.492
Token,
Token ID11
Feature activation-1.27e-01

Pos logit contributions
IDS+0.305
 Norn+0.296
 juggling+0.293
thumbnails+0.271
Thor+0.265
Neg logit contributions
theless-0.325
zens-0.319
negie-0.311
 indo-0.308
 Regulatory-0.303
Token lets
Token ID8781
Feature activation-9.20e-01

Pos logit contributions
theless+0.524
zens+0.514
 indo+0.497
negie+0.494
 Regulatory+0.486
Neg logit contributions
IDS-0.505
 Norn-0.486
 juggling-0.481
thumbnails-0.453
Thor-0.438
Token just
Token ID655
Feature activation-2.27e-01

Pos logit contributions
theless+3.452
zens+3.398
 indo+3.277
negie+3.230
 Regulatory+3.201
Neg logit contributions
IDS-3.942
 Norn-3.796
 juggling-3.738
thumbnails-3.584
Thor-3.451
Token move
Token ID1445
Feature activation-6.39e-02

Pos logit contributions
theless+0.924
zens+0.901
 indo+0.868
negie+0.867
 Regulatory+0.849
Neg logit contributions
IDS-0.917
 Norn-0.888
 juggling-0.881
thumbnails-0.820
Thor-0.797
Token them
Token ID606
Feature activation+1.45e-02

Pos logit contributions
theless+0.262
zens+0.255
negie+0.247
 indo+0.246
 Regulatory+0.241
Neg logit contributions
IDS-0.255
 Norn-0.247
 juggling-0.244
thumbnails-0.229
Thor-0.222
Token around
Token ID1088
Feature activation-4.12e-01

Pos logit contributions
IDS+0.055
 Norn+0.053
 juggling+0.053
thumbnails+0.048
Thor+0.047
Neg logit contributions
theless-0.062
zens-0.061
negie-0.060
 indo-0.059
 Regulatory-0.058
Token in
Token ID287
Feature activation+1.55e-01

Pos logit contributions
theless+1.692
zens+1.662
 indo+1.609
negie+1.585
 Regulatory+1.571
Neg logit contributions
IDS-1.631
 Norn-1.557
 juggling-1.538
thumbnails-1.469
Thor-1.407
Token Own
Token ID11744
Feature activation+1.40e-01

Pos logit contributions
IDS+0.812
 Norn+0.794
 juggling+0.775
thumbnails+0.723
Thor+0.699
Neg logit contributions
theless-0.924
zens-0.901
 indo-0.876
negie-0.873
orf-0.844
Tokened
Token ID276
Feature activation-1.56e-02

Pos logit contributions
IDS+0.395
 Norn+0.377
 juggling+0.357
thumbnails+0.329
Thor+0.321
Neg logit contributions
theless-0.743
 indo-0.730
zens-0.712
negie-0.702
 Regulatory-0.687
Token <
Token ID1279
Feature activation+8.61e-03

Pos logit contributions
theless+0.069
zens+0.069
 indo+0.066
 Regulatory+0.066
negie+0.065
Neg logit contributions
IDS-0.055
 Norn-0.053
 juggling-0.053
thumbnails-0.050
Thor-0.047
Token T
Token ID309
Feature activation-1.63e-01

Pos logit contributions
IDS+0.034
 Norn+0.033
 juggling+0.032
thumbnails+0.031
Thor+0.030
Neg logit contributions
theless-0.035
zens-0.035
 indo-0.033
negie-0.033
 Regulatory-0.033
Token >
Token ID1875
Feature activation-2.38e-01

Pos logit contributions
theless+0.463
zens+0.455
 indo+0.428
 Regulatory+0.419
negie+0.414
Neg logit contributions
IDS-0.848
 Norn-0.826
 juggling-0.816
thumbnails-0.795
Thor-0.770
Token ;
Token ID2162
Feature activation-9.16e-01

Pos logit contributions
theless+1.389
zens+1.371
 indo+1.335
 Regulatory+1.330
negie+1.319
Neg logit contributions
IDS-0.528
 Norn-0.490
 juggling-0.476
thumbnails-0.445
Thor-0.407
Token }
Token ID1782
Feature activation-1.10e-01

Pos logit contributions
theless+3.385
zens+3.269
 Regulatory+3.177
 indo+3.172
enko+3.131
Neg logit contributions
IDS-3.950
 Norn-3.671
 juggling-3.604
thumbnails-3.553
qqa-3.402
Token impl
Token ID4114
Feature activation-5.85e-01

Pos logit contributions
theless+0.425
zens+0.416
 indo+0.403
negie+0.396
 Regulatory+0.394
Neg logit contributions
IDS-0.468
 Norn-0.449
 juggling-0.443
thumbnails-0.423
Thor-0.408
Token <
Token ID1279
Feature activation+1.27e-01

Pos logit contributions
theless+2.516
zens+2.515
 Regulatory+2.434
 indo+2.408
negie+2.391
Neg logit contributions
IDS-2.154
 juggling-2.024
 Norn-2.022
thumbnails-1.971
Thor-1.853
Token T
Token ID309
Feature activation-1.39e-01

Pos logit contributions
IDS+0.487
 Norn+0.468
 juggling+0.461
thumbnails+0.436
Thor+0.419
Neg logit contributions
theless-0.536
zens-0.526
 indo-0.511
negie-0.506
 Regulatory-0.497
Token >
Token ID1875
Feature activation-1.99e-01

Pos logit contributions
theless+0.445
zens+0.441
 indo+0.418
 Regulatory+0.414
negie+0.410
Neg logit contributions
IDS-0.670
 Norn-0.650
 juggling-0.640
thumbnails-0.624
Thor-0.603
Token missions
Token ID10566
Feature activation+1.66e-01

Pos logit contributions
theless+0.037
zens+0.036
 indo+0.035
negie+0.035
 Regulatory+0.034
Neg logit contributions
IDS-0.049
 Norn-0.048
 juggling-0.047
thumbnails-0.045
Thor-0.043
Token
Token ID784
Feature activation+6.85e-02

Pos logit contributions
IDS+0.626
 Norn+0.601
 juggling+0.596
thumbnails+0.558
Thor+0.536
Neg logit contributions
theless-0.711
zens-0.698
 indo-0.675
negie-0.671
 Regulatory-0.663
Token but
Token ID475
Feature activation-1.20e-01

Pos logit contributions
IDS+0.276
 Norn+0.265
 juggling+0.262
thumbnails+0.248
Thor+0.239
Neg logit contributions
theless-0.277
zens-0.272
 indo-0.263
negie-0.260
 Regulatory-0.257
Token Honour
Token ID45129
Feature activation+8.10e-02

Pos logit contributions
theless+0.506
zens+0.497
 indo+0.482
negie+0.477
 Regulatory+0.472
Neg logit contributions
IDS-0.460
 Norn-0.441
 juggling-0.435
thumbnails-0.412
Thor-0.396
Token Among
Token ID9754
Feature activation-2.60e-01

Pos logit contributions
IDS+0.390
 Norn+0.379
 juggling+0.374
thumbnails+0.357
Thor+0.348
Neg logit contributions
theless-0.263
zens-0.256
 indo-0.246
negie-0.243
 Regulatory-0.239
Token Thieves
Token ID40392
Feature activation-9.14e-01

Pos logit contributions
theless+1.397
zens+1.367
 indo+1.356
negie+1.336
 Regulatory+1.313
Neg logit contributions
IDS-0.714
 Norn-0.676
 juggling-0.654
thumbnails-0.608
Thor-0.576
Token is
Token ID318
Feature activation-2.57e-01

Pos logit contributions
theless+3.512
zens+3.418
 indo+3.344
negie+3.232
 Regulatory+3.225
Neg logit contributions
IDS-3.823
 Norn-3.670
 juggling-3.585
thumbnails-3.451
Thor-3.324
Token a
Token ID257
Feature activation-2.34e-01

Pos logit contributions
theless+1.036
zens+1.017
 indo+0.986
negie+0.981
 Regulatory+0.965
Neg logit contributions
IDS-1.038
 Norn-0.998
 juggling-0.993
thumbnails-0.932
Thor-0.900
Token h
Token ID289
Feature activation-9.06e-02

Pos logit contributions
theless+0.960
zens+0.946
negie+0.917
 indo+0.916
 Regulatory+0.899
Neg logit contributions
IDS-0.915
 juggling-0.884
 Norn-0.884
thumbnails-0.821
Thor-0.796
Tokenoot
Token ID1025
Feature activation-2.26e-01

Pos logit contributions
theless+0.304
zens+0.298
 indo+0.285
negie+0.282
 Regulatory+0.278
Neg logit contributions
IDS-0.426
 Norn-0.414
 juggling-0.408
thumbnails-0.390
Thor-0.378
Token from
Token ID422
Feature activation-2.86e-01

Pos logit contributions
theless+0.956
zens+0.940
 indo+0.909
negie+0.900
 Regulatory+0.888
Neg logit contributions
IDS-0.871
 Norn-0.835
 juggling-0.821
thumbnails-0.780
Thor-0.750
TokenValid
Token ID47139
Feature activation-1.30e-01

Pos logit contributions
theless+1.279
zens+1.264
 indo+1.221
negie+1.210
 Regulatory+1.203
Neg logit contributions
IDS-1.031
 Norn-0.984
 juggling-0.972
thumbnails-0.915
Thor-0.870
Token )
Token ID1267
Feature activation-7.10e-02

Pos logit contributions
theless+0.583
zens+0.574
 indo+0.554
negie+0.549
 Regulatory+0.548
Neg logit contributions
IDS-0.458
 Norn-0.438
 juggling-0.434
thumbnails-0.410
Thor-0.384
Token )
Token ID1267
Feature activation-3.79e-01

Pos logit contributions
theless+0.310
zens+0.305
 indo+0.296
negie+0.292
 Regulatory+0.289
Neg logit contributions
IDS-0.261
 Norn-0.251
 juggling-0.247
thumbnails-0.235
Thor-0.223
Token return
Token ID1441
Feature activation-3.12e-01

Pos logit contributions
theless+2.114
zens+2.089
 indo+2.048
negie+2.023
 Regulatory+2.001
Neg logit contributions
IDS-0.927
 Norn-0.863
 juggling-0.845
thumbnails-0.775
Thor-0.726
Token space
Token ID2272
Feature activation-1.50e-01

Pos logit contributions
theless+1.226
zens+1.203
 indo+1.165
negie+1.146
 Regulatory+1.138
Neg logit contributions
IDS-1.290
 Norn-1.237
 juggling-1.222
thumbnails-1.168
Thor-1.125
Token;
Token ID26
Feature activation-9.10e-01

Pos logit contributions
theless+0.572
zens+0.569
 indo+0.542
negie+0.538
 Regulatory+0.532
Neg logit contributions
IDS-0.629
 Norn-0.608
 juggling-0.603
thumbnails-0.577
Thor-0.552
Token //
Token ID3373
Feature activation-3.47e-01

Pos logit contributions
theless+3.285
zens+3.173
 indo+3.117
negie+3.001
 Regulatory+2.997
Neg logit contributions
IDS-3.976
 Norn-3.803
 juggling-3.762
thumbnails-3.627
Thor-3.499
Token 4
Token ID604
Feature activation-4.94e-02

Pos logit contributions
theless+1.203
zens+1.168
 indo+1.131
negie+1.107
 Regulatory+1.106
Neg logit contributions
IDS-1.577
 Norn-1.531
 juggling-1.517
thumbnails-1.456
Thor-1.408
Token.
Token ID13
Feature activation+3.43e-01

Pos logit contributions
theless+0.229
 indo+0.221
zens+0.219
negie+0.212
 Regulatory+0.207
Neg logit contributions
IDS-0.176
 Norn-0.170
 juggling-0.165
thumbnails-0.155
Thor-0.151
Token Rec
Token ID3311
Feature activation+8.40e-02

Pos logit contributions
IDS+1.315
 Norn+1.277
 juggling+1.244
thumbnails+1.167
Thor+1.137
Neg logit contributions
theless-1.456
zens-1.418
 indo-1.371
negie-1.364
orf-1.321
Tokenurs
Token ID1834
Feature activation-3.05e-01

Pos logit contributions
IDS+0.315
 Norn+0.303
 juggling+0.300
thumbnails+0.280
Thor+0.269
Neg logit contributions
theless-0.359
zens-0.357
 indo-0.343
negie-0.342
 Regulatory-0.336
Token the
Token ID262
Feature activation+2.04e-01

Pos logit contributions
IDS+1.194
 Norn+1.146
 juggling+1.132
thumbnails+1.070
Thor+1.031
Neg logit contributions
theless-1.296
zens-1.267
 indo-1.238
negie-1.224
 Regulatory-1.206
Token families
Token ID4172
Feature activation+3.99e-01

Pos logit contributions
IDS+0.903
 Norn+0.872
 juggling+0.865
thumbnails+0.817
Thor+0.791
Neg logit contributions
theless-0.744
zens-0.715
 indo-0.706
negie-0.695
 Regulatory-0.678
Token of
Token ID286
Feature activation+1.29e-01

Pos logit contributions
IDS+1.564
 Norn+1.501
 juggling+1.487
thumbnails+1.394
Thor+1.353
Neg logit contributions
theless-1.650
zens-1.609
 indo-1.560
negie-1.549
 Regulatory-1.532
Token the
Token ID262
Feature activation+1.37e-01

Pos logit contributions
IDS+0.462
 Norn+0.442
 juggling+0.435
thumbnails+0.411
Thor+0.394
Neg logit contributions
theless-0.579
zens-0.571
 indo-0.552
negie-0.547
 Regulatory-0.542
Token fl
Token ID781
Feature activation-1.09e-01

Pos logit contributions
IDS+0.519
 Norn+0.495
 juggling+0.491
thumbnails+0.457
Thor+0.442
Neg logit contributions
theless-0.591
zens-0.575
 indo-0.564
negie-0.560
 Regulatory-0.551
Tokenot
Token ID313
Feature activation-8.98e-01

Pos logit contributions
theless+0.437
 indo+0.421
zens+0.398
 Regulatory+0.394
negie+0.389
Neg logit contributions
IDS-0.460
 Norn-0.437
 juggling-0.428
thumbnails-0.420
Thor-0.405
Tokenilla
Token ID5049
Feature activation-3.64e-02

Pos logit contributions
theless+2.843
zens+2.817
 indo+2.646
negie+2.640
 Regulatory+2.587
Neg logit contributions
IDS-4.376
 Norn-4.235
 juggling-4.201
thumbnails-4.007
Thor-3.910
Token victims
Token ID4970
Feature activation-2.01e-02

Pos logit contributions
theless+0.153
zens+0.150
 indo+0.145
negie+0.144
 Regulatory+0.142
Neg logit contributions
IDS-0.141
 Norn-0.136
 juggling-0.134
thumbnails-0.127
Thor-0.122
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
theless+0.084
zens+0.083
 indo+0.080
negie+0.079
 Regulatory+0.079
Neg logit contributions
IDS-0.078
 Norn-0.075
 juggling-0.074
thumbnails-0.070
Thor-0.067
Token
Token ID198
Feature activation-6.49e-02

Pos logit contributions
theless+0.585
zens+0.575
 indo+0.554
negie+0.549
 Regulatory+0.545
Neg logit contributions
IDS-0.599
 Norn-0.575
 juggling-0.570
thumbnails-0.540
Thor-0.519
Token
Token ID198
Feature activation-3.19e-02

Pos logit contributions
theless+0.295
 indo+0.291
zens+0.281
negie+0.275
 Regulatory+0.267
Neg logit contributions
IDS-0.238
 Norn-0.226
 juggling-0.220
thumbnails-0.210
Thor-0.207
Token(
Token ID7
Feature activation-1.04e-01

Pos logit contributions
IDS+0.034
 Norn+0.032
 juggling+0.031
thumbnails+0.029
Thor+0.027
Neg logit contributions
theless-0.069
zens-0.069
 indo-0.067
negie-0.066
 Regulatory-0.066
Tokenargs
Token ID22046
Feature activation-4.64e-02

Pos logit contributions
theless+0.467
zens+0.464
 indo+0.449
negie+0.445
 Regulatory+0.440
Neg logit contributions
IDS-0.364
 Norn-0.353
 juggling-0.346
thumbnails-0.318
Thor-0.311
Token["
Token ID14692
Feature activation-1.04e-01

Pos logit contributions
zens+0.221
theless+0.217
 indo+0.211
negie+0.209
 Regulatory+0.209
Neg logit contributions
IDS-0.149
 Norn-0.145
 juggling-0.140
thumbnails-0.132
Thor-0.127
Tokenimages
Token ID17566
Feature activation-5.37e-02

Pos logit contributions
theless+0.465
zens+0.463
 indo+0.447
negie+0.444
 Regulatory+0.438
Neg logit contributions
IDS-0.365
 Norn-0.353
 juggling-0.349
thumbnails-0.320
Thor-0.310
Token"]
Token ID8973
Feature activation+1.13e-01

Pos logit contributions
zens+0.228
theless+0.224
 indo+0.218
negie+0.217
 Regulatory+0.214
Neg logit contributions
IDS-0.203
 Norn-0.194
 juggling-0.194
thumbnails-0.178
Thor-0.174
Token):
Token ID2599
Feature activation-8.91e-01

Pos logit contributions
IDS+0.343
 Norn+0.332
 juggling+0.322
thumbnails+0.306
Thor+0.289
Neg logit contributions
zens-0.559
theless-0.557
 indo-0.541
negie-0.533
 Regulatory-0.532
Token #
Token ID1303
Feature activation-2.38e-01

Pos logit contributions
zens+3.253
theless+3.249
 indo+3.183
negie+3.073
 Regulatory+3.062
Neg logit contributions
IDS-3.838
 Norn-3.661
 juggling-3.600
thumbnails-3.400
Thor-3.371
Token load
Token ID3440
Feature activation+1.58e-01

Pos logit contributions
theless+0.956
zens+0.938
 indo+0.907
negie+0.897
 Regulatory+0.887
Neg logit contributions
IDS-0.965
 Norn-0.927
 juggling-0.915
thumbnails-0.869
Thor-0.837
Token the
Token ID262
Feature activation-1.47e-01

Pos logit contributions
IDS+0.571
 Norn+0.547
 juggling+0.539
thumbnails+0.508
Thor+0.487
Neg logit contributions
theless-0.707
zens-0.695
 indo-0.674
negie-0.668
 Regulatory-0.661
Token image
Token ID2939
Feature activation-1.11e-01

Pos logit contributions
theless+0.638
zens+0.626
 indo+0.606
negie+0.602
 Regulatory+0.593
Neg logit contributions
IDS-0.548
 Norn-0.526
 juggling-0.519
thumbnails-0.490
Thor-0.470
Token,
Token ID11
Feature activation+6.78e-02

Pos logit contributions
theless+0.465
zens+0.453
 indo+0.438
negie+0.436
 Regulatory+0.429
Neg logit contributions
IDS-0.434
 Norn-0.418
 juggling-0.412
thumbnails-0.392
Thor-0.375
Token 3
Token ID513
Feature activation-5.21e-02

Pos logit contributions
IDS+0.221
 juggling+0.211
 Norn+0.209
thumbnails+0.201
Thor+0.192
Neg logit contributions
theless-0.165
zens-0.160
 indo-0.155
negie-0.152
 Regulatory-0.152
Token Next
Token ID7406
Feature activation+8.96e-02

Pos logit contributions
theless+0.166
zens+0.162
 indo+0.157
 Regulatory+0.155
negie+0.154
Neg logit contributions
IDS-0.251
 juggling-0.242
 Norn-0.241
thumbnails-0.230
Thor-0.224
Token All
Token ID1439
Feature activation+1.82e-01

Pos logit contributions
IDS+0.330
 Norn+0.317
 juggling+0.311
thumbnails+0.294
Thor+0.282
Neg logit contributions
theless-0.394
zens-0.387
 indo-0.375
negie-0.372
 Regulatory-0.367
Token<|endoftext|>
Token ID50256
Feature activation+4.15e-02

Pos logit contributions
IDS+0.622
 Norn+0.600
 juggling+0.588
thumbnails+0.553
Thor+0.528
Neg logit contributions
theless-0.850
zens-0.835
 indo-0.814
negie-0.807
 Regulatory-0.791
TokenThis
Token ID1212
Feature activation+2.30e-01

Pos logit contributions
IDS+0.170
 Norn+0.165
 juggling+0.163
thumbnails+0.153
Thor+0.147
Neg logit contributions
theless-0.163
zens-0.162
 indo-0.154
negie-0.153
 Regulatory-0.152
Token article
Token ID2708
Feature activation-8.91e-01

Pos logit contributions
IDS+0.923
 Norn+0.891
 juggling+0.880
thumbnails+0.830
Thor+0.804
Neg logit contributions
theless-0.929
zens-0.913
 indo-0.879
negie-0.875
 Regulatory-0.860
Token is
Token ID318
Feature activation-1.91e-01

Pos logit contributions
theless+3.331
zens+3.254
 indo+3.193
 Regulatory+3.093
enko+3.036
Neg logit contributions
IDS-3.831
 Norn-3.659
 juggling-3.549
thumbnails-3.449
Thor-3.328
Token over
Token ID625
Feature activation-7.93e-02

Pos logit contributions
theless+0.702
zens+0.689
 indo+0.664
negie+0.657
 Regulatory+0.646
Neg logit contributions
IDS-0.840
 Norn-0.811
 juggling-0.803
thumbnails-0.762
Thor-0.739
Token 3
Token ID513
Feature activation-1.31e-01

Pos logit contributions
theless+0.290
zens+0.284
 indo+0.274
negie+0.271
 Regulatory+0.267
Neg logit contributions
IDS-0.350
 Norn-0.337
 juggling-0.333
thumbnails-0.318
Thor-0.307
Token years
Token ID812
Feature activation-1.13e-01

Pos logit contributions
theless+0.543
zens+0.535
 indo+0.515
negie+0.510
 Regulatory+0.507
Neg logit contributions
IDS-0.510
 Norn-0.490
 juggling-0.484
thumbnails-0.459
Thor-0.440
Token old
Token ID1468
Feature activation-1.39e-01

Pos logit contributions
theless+0.345
zens+0.340
 indo+0.324
negie+0.317
 Regulatory+0.316
Neg logit contributions
IDS-0.564
 Norn-0.548
 juggling-0.539
thumbnails-0.522
Thor-0.504
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
theless+0.375
zens+0.368
 indo+0.356
negie+0.352
 Regulatory+0.348
Neg logit contributions
IDS-0.360
 Norn-0.346
 juggling-0.341
thumbnails-0.323
Thor-0.312
Token -
Token ID532
Feature activation-5.15e-01

Pos logit contributions
theless+0.046
zens+0.046
 indo+0.044
negie+0.044
orf+0.042
Neg logit contributions
IDS-0.044
 Norn-0.043
 juggling-0.042
thumbnails-0.039
Thor-0.038
Token February
Token ID3945
Feature activation+1.21e-01

Pos logit contributions
theless+1.793
zens+1.782
negie+1.690
 indo+1.680
 Regulatory+1.664
Neg logit contributions
IDS-2.316
 Norn-2.247
 juggling-2.225
thumbnails-2.117
Thor-2.054
Token 9
Token ID860
Feature activation+3.80e-01

Pos logit contributions
IDS+0.446
 Norn+0.430
 juggling+0.418
thumbnails+0.385
Thor+0.384
Neg logit contributions
theless-0.548
 indo-0.535
zens-0.518
negie-0.504
terday-0.488
Token,
Token ID11
Feature activation-2.82e-01

Pos logit contributions
IDS+1.179
 Norn+1.091
 juggling+1.058
thumbnails+0.995
Thor+0.991
Neg logit contributions
theless-1.967
 indo-1.953
zens-1.866
negie-1.822
 Regulatory-1.778
Token 2010
Token ID3050
Feature activation-8.86e-01

Pos logit contributions
theless+1.336
 indo+1.306
zens+1.287
negie+1.248
enko+1.215
Neg logit contributions
IDS-0.990
 Norn-0.959
 juggling-0.915
thumbnails-0.844
 blink-0.823
TokenR
Token ID49
Feature activation-2.32e-01

Pos logit contributions
zens+2.564
theless+2.534
negie+2.400
 Regulatory+2.394
orf+2.337
Neg logit contributions
IDS-4.405
 Norn-4.357
 juggling-4.344
thumbnails-4.070
 blink-3.933
Tokenipe
Token ID3757
Feature activation+3.88e-03

Pos logit contributions
theless+0.923
zens+0.889
 indo+0.889
 Regulatory+0.846
negie+0.844
Neg logit contributions
IDS-0.970
 Norn-0.915
 juggling-0.903
thumbnails-0.864
Thor-0.839
Token for
Token ID329
Feature activation-3.97e-02

Pos logit contributions
IDS+0.016
 Norn+0.016
 juggling+0.015
thumbnails+0.015
Thor+0.014
Neg logit contributions
theless-0.015
zens-0.015
 indo-0.015
negie-0.014
 Regulatory-0.014
Token M
Token ID337
Feature activation+2.17e-01

Pos logit contributions
theless+0.160
zens+0.156
 indo+0.153
negie+0.149
 Regulatory+0.148
Neg logit contributions
IDS-0.158
 Norn-0.155
 juggling-0.150
thumbnails-0.145
Thor-0.141
TokenST
Token ID2257
Feature activation+4.82e-01

Pos logit contributions
IDS+0.731
 Norn+0.639
 juggling+0.631
thumbnails+0.618
Thor+0.602
Neg logit contributions
theless-1.086
 indo-1.075
zens-1.018
 Regulatory-1.010
negie-0.985
Token ==
Token ID6624
Feature activation-1.37e-01

Pos logit contributions
theless+0.303
zens+0.300
 indo+0.290
negie+0.286
 Regulatory+0.283
Neg logit contributions
IDS-0.276
 Norn-0.265
 juggling-0.262
thumbnails-0.248
Thor-0.239
Token 0
Token ID657
Feature activation+1.24e-01

Pos logit contributions
theless+0.553
zens+0.544
 indo+0.528
negie+0.518
 Regulatory+0.516
Neg logit contributions
IDS-0.552
 Norn-0.529
 juggling-0.521
thumbnails-0.498
Thor-0.479
Token.
Token ID13
Feature activation+1.75e-01

Pos logit contributions
IDS+0.499
 Norn+0.482
 juggling+0.474
thumbnails+0.454
Thor+0.438
Neg logit contributions
theless-0.498
zens-0.491
 indo-0.472
negie-0.467
 Regulatory-0.464
Token0
Token ID15
Feature activation+5.71e-02

Pos logit contributions
IDS+0.603
 Norn+0.574
 juggling+0.565
thumbnails+0.531
Thor+0.508
Neg logit contributions
theless-0.813
zens-0.800
 indo-0.778
negie-0.771
 Regulatory-0.762
Tokenf
Token ID69
Feature activation+1.97e-02

Pos logit contributions
IDS+0.222
 Norn+0.215
 juggling+0.212
thumbnails+0.203
Thor+0.194
Neg logit contributions
theless-0.237
zens-0.232
 indo-0.224
negie-0.220
 Regulatory-0.220
Token);
Token ID1776
Feature activation-8.82e-01

Pos logit contributions
IDS+0.078
 Norn+0.075
 juggling+0.074
thumbnails+0.070
Thor+0.068
Neg logit contributions
theless-0.081
zens-0.080
 indo-0.077
negie-0.076
 Regulatory-0.076
Token RAW
Token ID33782
Feature activation-2.92e-01

Pos logit contributions
theless+3.293
zens+3.213
 indo+3.146
 Regulatory+3.062
negie+3.003
Neg logit contributions
IDS-3.687
 Norn-3.505
 juggling-3.459
thumbnails-3.359
Thor-3.261
Token_
Token ID62
Feature activation-3.16e-01

Pos logit contributions
theless+1.310
zens+1.290
 indo+1.242
negie+1.233
 Regulatory+1.228
Neg logit contributions
IDS-1.018
 Norn-0.990
 juggling-0.976
thumbnails-0.909
Thor-0.872
TokenASS
Token ID10705
Feature activation-2.41e-01

Pos logit contributions
theless+1.335
zens+1.326
 indo+1.250
 Regulatory+1.244
negie+1.243
Neg logit contributions
IDS-1.147
 Norn-1.146
 juggling-1.131
thumbnails-1.037
Thor-1.005
TokenERT
Token ID17395
Feature activation-5.30e-01

Pos logit contributions
theless+0.612
zens+0.605
 indo+0.565
negie+0.564
 Regulatory+0.563
Neg logit contributions
IDS-1.290
 Norn-1.270
 juggling-1.254
thumbnails-1.216
Thor-1.171
Token(
Token ID7
Feature activation-3.45e-01

Pos logit contributions
theless+2.236
zens+2.168
 Regulatory+2.132
 indo+2.120
orf+2.081
Neg logit contributions
IDS-1.876
 Norn-1.845
 juggling-1.811
thumbnails-1.763
Thor-1.652
Token '
Token ID705
Feature activation-4.24e-01

Pos logit contributions
IDS+0.387
 Norn+0.371
 juggling+0.364
thumbnails+0.342
Thor+0.326
Neg logit contributions
theless-0.524
zens-0.514
 indo-0.500
negie-0.496
 Regulatory-0.488
Tokena
Token ID64
Feature activation-1.90e-01

Pos logit contributions
theless+1.707
zens+1.646
 indo+1.634
negie+1.592
 Regulatory+1.566
Neg logit contributions
IDS-1.745
 Norn-1.673
 juggling-1.651
thumbnails-1.585
Thor-1.521
Token,
Token ID837
Feature activation+1.22e-01

Pos logit contributions
zens+0.806
theless+0.802
 Regulatory+0.768
 indo+0.766
terday+0.746
Neg logit contributions
IDS-0.717
 Norn-0.687
 juggling-0.669
thumbnails-0.657
Thor-0.628
Token T
Token ID309
Feature activation-1.15e-01

Pos logit contributions
IDS+0.509
 Norn+0.484
 juggling+0.480
thumbnails+0.460
Thor+0.445
Neg logit contributions
theless-0.473
zens-0.469
 indo-0.448
 Regulatory-0.446
negie-0.442
Token >>
Token ID9609
Feature activation-4.89e-01

Pos logit contributions
theless+0.337
zens+0.336
 indo+0.315
negie+0.309
 Regulatory+0.309
Neg logit contributions
IDS-0.579
 Norn-0.565
 juggling-0.557
thumbnails-0.540
Thor-0.527
Token ;
Token ID2162
Feature activation-8.79e-01

Pos logit contributions
theless+2.714
zens+2.705
 indo+2.631
 Regulatory+2.629
negie+2.575
Neg logit contributions
IDS-1.167
 Norn-1.108
 juggling-1.055
thumbnails-1.019
Thor-0.942
Token }
Token ID1782
Feature activation-1.41e-01

Pos logit contributions
theless+3.206
zens+3.152
 indo+3.083
negie+3.042
 Regulatory+3.031
Neg logit contributions
IDS-3.790
 Norn-3.587
 juggling-3.426
thumbnails-3.398
Thor-3.293
Token
Token ID198
Feature activation-2.83e-02

Pos logit contributions
theless+0.522
zens+0.512
 indo+0.496
negie+0.486
 Regulatory+0.483
Neg logit contributions
IDS-0.619
 Norn-0.595
 juggling-0.585
thumbnails-0.562
Thor-0.543
Token
Token ID198
Feature activation-5.20e-02

Pos logit contributions
theless+0.130
 indo+0.128
zens+0.124
negie+0.122
 Regulatory+0.118
Neg logit contributions
IDS-0.102
 Norn-0.097
 juggling-0.095
thumbnails-0.090
Thor-0.089
TokenIn
Token ID818
Feature activation+1.20e-01

Pos logit contributions
theless+0.212
zens+0.207
 indo+0.201
negie+0.199
 Regulatory+0.197
Neg logit contributions
IDS-0.208
 Norn-0.200
 juggling-0.197
thumbnails-0.187
Thor-0.181
Token order
Token ID1502
Feature activation-3.09e-01

Pos logit contributions
IDS+0.447
 Norn+0.428
 juggling+0.424
thumbnails+0.398
Thor+0.382
Neg logit contributions
theless-0.527
zens-0.515
 indo-0.500
negie-0.496
 Regulatory-0.488
Token.
Token ID13
Feature activation-8.59e-02

Pos logit contributions
IDS+0.154
 Norn+0.147
 juggling+0.145
thumbnails+0.137
Thor+0.132
Neg logit contributions
theless-0.185
zens-0.181
 indo-0.176
negie-0.174
 Regulatory-0.172
Token -
Token ID532
Feature activation-5.71e-01

Pos logit contributions
theless+0.349
zens+0.343
 indo+0.331
negie+0.329
 Regulatory+0.322
Neg logit contributions
IDS-0.346
 Norn-0.332
 juggling-0.328
thumbnails-0.309
Thor-0.299
Token December
Token ID3426
Feature activation-5.13e-02

Pos logit contributions
theless+1.962
zens+1.921
 indo+1.845
negie+1.822
 Regulatory+1.797
Neg logit contributions
IDS-2.642
 Norn-2.553
 juggling-2.524
thumbnails-2.412
Thor-2.338
Token 13
Token ID1511
Feature activation+1.72e-01

Pos logit contributions
theless+0.198
 indo+0.192
zens+0.187
negie+0.184
terday+0.179
Neg logit contributions
IDS-0.220
 Norn-0.212
 juggling-0.207
thumbnails-0.197
Thor-0.195
Token,
Token ID11
Feature activation-3.18e-01

Pos logit contributions
IDS+0.555
 Norn+0.520
 juggling+0.508
thumbnails+0.476
Thor+0.466
Neg logit contributions
theless-0.860
 indo-0.844
zens-0.827
negie-0.803
 Regulatory-0.790
Token 2010
Token ID3050
Feature activation-8.70e-01

Pos logit contributions
theless+1.508
 indo+1.478
zens+1.456
negie+1.415
enko+1.377
Neg logit contributions
IDS-1.103
 Norn-1.070
 juggling-1.023
thumbnails-0.945
 blink-0.919
Tokenboys
Token ID13202
Feature activation-3.93e-01

Pos logit contributions
zens+3.900
theless+3.796
yards+3.625
 indo+3.611
negie+3.604
Neg logit contributions
 Norn-3.113
IDS-2.959
 juggling-2.958
thumbnails-2.608
 blink-2.607
Token beware
Token ID40600
Feature activation+1.77e-01

Pos logit contributions
theless+1.466
zens+1.418
 indo+1.376
negie+1.341
 Regulatory+1.321
Neg logit contributions
IDS-1.731
 Norn-1.652
 juggling-1.622
thumbnails-1.572
Thor-1.515
Token
Token ID198
Feature activation-2.84e-01

Pos logit contributions
IDS+0.717
 Norn+0.692
 juggling+0.679
thumbnails+0.649
Thor+0.624
Neg logit contributions
theless-0.709
zens-0.694
 indo-0.674
negie-0.661
 Regulatory-0.659
Token
Token ID198
Feature activation-2.13e-01

Pos logit contributions
theless+1.249
 indo+1.212
zens+1.200
negie+1.164
 Regulatory+1.137
Neg logit contributions
IDS-1.074
 Norn-1.024
 juggling-1.007
thumbnails-0.956
Thor-0.930
TokenReviewer
Token ID35407
Feature activation+5.34e-01

Pos logit contributions
theless+1.222
zens+1.217
 indo+1.171
negie+1.170
 Regulatory+1.161
Neg logit contributions
IDS-0.478
 Norn-0.466
 juggling-0.444
thumbnails-0.392
Thor-0.366
Token ->
Token ID4613
Feature activation+5.12e-02

Pos logit contributions
theless+1.947
zens+1.937
 indo+1.891
negie+1.858
 Regulatory+1.840
Neg logit contributions
IDS-1.807
 Norn-1.719
 juggling-1.700
thumbnails-1.617
Thor-1.561
Token Atomic
Token ID28976
Feature activation-1.62e-01

Pos logit contributions
IDS+0.219
 Norn+0.213
 juggling+0.210
thumbnails+0.198
Thor+0.192
Neg logit contributions
theless-0.194
zens-0.189
 indo-0.183
negie-0.181
 Regulatory-0.177
Token <
Token ID1279
Feature activation+9.89e-02

Pos logit contributions
theless+0.726
zens+0.721
 indo+0.694
negie+0.691
 Regulatory+0.690
Neg logit contributions
IDS-0.573
 Norn-0.547
 juggling-0.541
thumbnails-0.515
Thor-0.487
Token T
Token ID309
Feature activation-1.46e-01

Pos logit contributions
IDS+0.375
 Norn+0.360
 juggling+0.354
thumbnails+0.334
Thor+0.322
Neg logit contributions
theless-0.424
zens-0.416
 indo-0.404
negie-0.400
 Regulatory-0.394
Token >
Token ID1875
Feature activation-1.20e-01

Pos logit contributions
theless+0.411
zens+0.407
 indo+0.381
 Regulatory+0.373
negie+0.370
Neg logit contributions
IDS-0.760
 Norn-0.740
 juggling-0.732
thumbnails-0.712
Thor-0.692
Token ;
Token ID2162
Feature activation-8.64e-01

Pos logit contributions
theless+0.670
zens+0.664
 indo+0.646
 Regulatory+0.642
negie+0.638
Neg logit contributions
IDS-0.292
 Norn-0.272
 juggling-0.266
thumbnails-0.250
Thor-0.230
Token }
Token ID1782
Feature activation-1.75e-03

Pos logit contributions
theless+2.961
zens+2.845
 indo+2.775
 Regulatory+2.754
negie+2.743
Neg logit contributions
IDS-3.967
 Norn-3.747
 juggling-3.677
thumbnails-3.599
Thor-3.486
Token
Token ID198
Feature activation-3.71e-02

Pos logit contributions
theless+0.006
zens+0.006
 indo+0.006
negie+0.006
 Regulatory+0.006
Neg logit contributions
IDS-0.008
 Norn-0.008
 juggling-0.007
thumbnails-0.007
Thor-0.007
Token
Token ID198
Feature activation-5.53e-02

Pos logit contributions
theless+0.171
 indo+0.169
zens+0.163
negie+0.160
 Regulatory+0.154
Neg logit contributions
IDS-0.134
 Norn-0.127
 juggling-0.124
thumbnails-0.117
Thor-0.116
TokenWe
Token ID1135
Feature activation+2.14e-01

Pos logit contributions
theless+0.219
zens+0.215
 indo+0.208
negie+0.205
 Regulatory+0.203
Neg logit contributions
IDS-0.227
 Norn-0.218
 juggling-0.216
thumbnails-0.205
Thor-0.198
Token
Token ID447
Feature activation-4.22e-02

Pos logit contributions
IDS+0.882
 Norn+0.850
 juggling+0.834
thumbnails+0.793
Thor+0.768
Neg logit contributions
theless-0.844
zens-0.826
 indo-0.803
negie-0.791
 Regulatory-0.783
Token 2010
Token ID3050
Feature activation-5.21e-02

Pos logit contributions
IDS+0.106
 Norn+0.104
 juggling+0.099
thumbnails+0.093
Thor+0.090
Neg logit contributions
theless-0.166
zens-0.163
 indo-0.161
negie-0.159
enko-0.153
Token United
Token ID1578
Feature activation-3.94e-01

Pos logit contributions
theless+0.190
zens+0.186
 indo+0.180
negie+0.178
 Regulatory+0.173
Neg logit contributions
IDS-0.231
 Norn-0.224
 juggling-0.220
thumbnails-0.209
Thor-0.204
Token States
Token ID1829
Feature activation-1.06e-01

Pos logit contributions
theless+1.255
zens+1.228
 Regulatory+1.187
negie+1.174
 indo+1.161
Neg logit contributions
IDS-1.892
 juggling-1.805
 Norn-1.780
thumbnails-1.743
Thor-1.652
Token 10
Token ID838
Feature activation-1.15e-01

Pos logit contributions
theless+0.401
zens+0.394
 indo+0.385
negie+0.380
 Regulatory+0.365
Neg logit contributions
IDS-0.454
 Norn-0.442
 juggling-0.434
thumbnails-0.408
Thor-0.400
Token53
Token ID4310
Feature activation+2.97e-01

Pos logit contributions
theless+0.467
zens+0.459
 indo+0.446
negie+0.440
 Regulatory+0.432
Neg logit contributions
IDS-0.458
 Norn-0.440
 juggling-0.433
thumbnails-0.411
Thor-0.397
Token Posts
Token ID12043
Feature activation-8.45e-01

Pos logit contributions
IDS+1.112
 Norn+1.072
 juggling+1.024
thumbnails+0.980
Thor+0.949
Neg logit contributions
 indo-1.350
theless-1.331
zens-1.285
negie-1.243
terday-1.214
Token #
Token ID1303
Feature activation-5.47e-01

Pos logit contributions
theless+2.203
zens+2.199
 Regulatory+2.179
 indo+2.098
enko+2.029
Neg logit contributions
IDS-4.385
 Norn-4.212
 juggling-4.099
thumbnails-4.018
qqa-3.951
Token15
Token ID1314
Feature activation-1.39e-01

Pos logit contributions
zens+2.163
theless+2.139
negie+2.038
enko+2.024
orf+2.022
Neg logit contributions
IDS-2.161
 Norn-2.124
 juggling-2.100
thumbnails-1.937
qqa-1.862
Token honestly
Token ID12698
Feature activation-3.64e-01

Pos logit contributions
theless+0.501
zens+0.493
 indo+0.474
negie+0.470
 Regulatory+0.460
Neg logit contributions
IDS-0.620
 Norn-0.601
 juggling-0.592
thumbnails-0.562
Thor-0.547
Token if
Token ID611
Feature activation-5.11e-02

Pos logit contributions
theless+1.521
zens+1.496
 indo+1.444
negie+1.421
 Regulatory+1.417
Neg logit contributions
IDS-1.412
 Norn-1.346
 juggling-1.325
thumbnails-1.272
Thor-1.211
Token we
Token ID356
Feature activation+1.14e-01

Pos logit contributions
theless+0.216
zens+0.212
 indo+0.205
negie+0.203
 Regulatory+0.201
Neg logit contributions
IDS-0.197
 Norn-0.189
 juggling-0.186
thumbnails-0.176
Thor-0.170
Token 2011
Token ID2813
Feature activation+1.16e-01

Pos logit contributions
IDS+0.218
 Norn+0.216
 juggling+0.200
thumbnails+0.188
Thor+0.183
Neg logit contributions
theless-0.353
zens-0.351
 indo-0.345
negie-0.343
terday-0.328
Token United
Token ID1578
Feature activation-2.77e-01

Pos logit contributions
IDS+0.511
 Norn+0.498
 juggling+0.489
thumbnails+0.463
Thor+0.453
Neg logit contributions
theless-0.427
zens-0.421
 indo-0.407
negie-0.402
 Regulatory-0.387
Token States
Token ID1829
Feature activation-2.25e-02

Pos logit contributions
theless+0.918
zens+0.882
 Regulatory+0.862
 indo+0.846
negie+0.842
Neg logit contributions
IDS-1.317
 juggling-1.247
 Norn-1.237
thumbnails-1.205
Thor-1.145
Token 228
Token ID29041
Feature activation+1.79e-01

Pos logit contributions
theless+0.086
zens+0.084
 indo+0.082
negie+0.081
 Regulatory+0.078
Neg logit contributions
IDS-0.096
 Norn-0.093
 juggling-0.092
thumbnails-0.086
Thor-0.084
Token65
Token ID2996
Feature activation+3.18e-01

Pos logit contributions
IDS+0.703
 Norn+0.679
 juggling+0.661
thumbnails+0.624
Thor+0.605
Neg logit contributions
theless-0.749
zens-0.740
 indo-0.725
negie-0.709
 Regulatory-0.685
Token Posts
Token ID12043
Feature activation-8.42e-01

Pos logit contributions
IDS+1.154
 Norn+1.122
 juggling+1.057
thumbnails+1.010
Thor+0.982
Neg logit contributions
 indo-1.502
theless-1.492
zens-1.419
negie-1.386
terday-1.328
Token #
Token ID1303
Feature activation-5.66e-01

Pos logit contributions
zens+2.164
theless+2.154
 Regulatory+2.130
 indo+2.061
enko+1.988
Neg logit contributions
IDS-4.409
 Norn-4.250
 juggling-4.143
thumbnails-4.060
qqa-3.956
Token13
Token ID1485
Feature activation-1.30e-01

Pos logit contributions
theless+2.115
zens+2.096
 indo+1.991
negie+1.986
 Regulatory+1.965
Neg logit contributions
IDS-2.431
 Norn-2.349
 juggling-2.323
thumbnails-2.198
Thor-2.120
Token On
Token ID1550
Feature activation-3.35e-01

Pos logit contributions
theless+0.478
zens+0.471
 indo+0.453
negie+0.450
 Regulatory+0.440
Neg logit contributions
IDS-0.566
 Norn-0.550
 juggling-0.541
thumbnails-0.513
Thor-0.500
Token September
Token ID2693
Feature activation+1.17e-01

Pos logit contributions
theless+1.204
zens+1.161
 indo+1.138
negie+1.116
 Regulatory+1.069
Neg logit contributions
IDS-1.523
 Norn-1.474
 juggling-1.463
thumbnails-1.388
Thor-1.349
Token 20
Token ID1160
Feature activation+1.97e-01

Pos logit contributions
IDS+0.527
 juggling+0.506
 Norn+0.506
thumbnails+0.487
Thor+0.463
Neg logit contributions
theless-0.416
zens-0.413
 indo-0.391
 Regulatory-0.390
negie-0.390
Token at
Token ID379
Feature activation+2.62e-01

Pos logit contributions
theless+0.004
zens+0.004
 indo+0.003
negie+0.003
 Regulatory+0.003
Neg logit contributions
IDS-0.004
 Norn-0.004
 juggling-0.004
thumbnails-0.003
Thor-0.003
Token nickel
Token ID30666
Feature activation+6.03e-02

Pos logit contributions
IDS+0.918
 Norn+0.880
 juggling+0.874
thumbnails+0.808
Thor+0.784
Neg logit contributions
theless-1.190
zens-1.168
 indo-1.135
negie-1.132
 Regulatory-1.113
Token (
Token ID357
Feature activation-7.93e-02

Pos logit contributions
IDS+0.229
 Norn+0.220
 juggling+0.216
thumbnails+0.206
Thor+0.196
Neg logit contributions
theless-0.257
zens-0.254
 indo-0.245
negie-0.242
 Regulatory-0.239
Tokenor
Token ID273
Feature activation-2.74e-01

Pos logit contributions
theless+0.316
zens+0.311
 indo+0.299
negie+0.296
 Regulatory+0.293
Neg logit contributions
IDS-0.320
 Norn-0.311
 juggling-0.306
thumbnails-0.289
Thor-0.276
Token vice
Token ID7927
Feature activation-5.64e-01

Pos logit contributions
theless+1.147
zens+1.126
 indo+1.089
negie+1.067
 Regulatory+1.065
Neg logit contributions
IDS-1.062
 Norn-1.020
 juggling-0.998
thumbnails-0.955
Thor-0.913
Token versa
Token ID25470
Feature activation-8.39e-01

Pos logit contributions
zens+2.348
theless+2.314
 indo+2.246
negie+2.203
orf+2.155
Neg logit contributions
IDS-2.198
 Norn-2.059
 juggling-2.006
thumbnails-1.867
Thor-1.830
Token),
Token ID828
Feature activation-2.38e-01

Pos logit contributions
theless+3.088
zens+3.018
 indo+2.916
orf+2.853
enko+2.833
Neg logit contributions
IDS-3.535
 Norn-3.440
 juggling-3.320
thumbnails-3.291
Thor-3.098
Token then
Token ID788
Feature activation-4.80e-01

Pos logit contributions
theless+0.997
zens+0.980
 indo+0.949
negie+0.934
 Regulatory+0.930
Neg logit contributions
IDS-0.922
 Norn-0.884
 juggling-0.866
thumbnails-0.827
Thor-0.792
Token the
Token ID262
Feature activation-3.44e-01

Pos logit contributions
theless+1.999
zens+1.963
 indo+1.901
 Regulatory+1.862
negie+1.855
Neg logit contributions
IDS-1.868
 Norn-1.791
 juggling-1.752
thumbnails-1.687
Thor-1.603
Token following
Token ID1708
Feature activation-4.26e-01

Pos logit contributions
theless+1.572
zens+1.551
 indo+1.510
 Regulatory+1.484
negie+1.479
Neg logit contributions
IDS-1.195
 Norn-1.135
 juggling-1.108
thumbnails-1.054
Thor-1.004
Token alignment
Token ID19114
Feature activation-8.68e-02

Pos logit contributions
theless+1.699
zens+1.679
 indo+1.648
 Regulatory+1.605
negie+1.583
Neg logit contributions
IDS-1.706
 Norn-1.652
 juggling-1.595
thumbnails-1.535
Thor-1.459
Token South
Token ID2520
Feature activation-1.23e-01

Pos logit contributions
IDS+0.146
 Norn+0.141
 juggling+0.139
thumbnails+0.133
Thor+0.129
Neg logit contributions
theless-0.107
zens-0.105
 indo-0.101
negie-0.099
 Regulatory-0.098
Token Korea
Token ID4969
Feature activation-4.10e-02

Pos logit contributions
theless+0.318
zens+0.317
 indo+0.300
negie+0.295
 Regulatory+0.287
Neg logit contributions
IDS-0.671
 juggling-0.647
 Norn-0.645
thumbnails-0.624
Thor-0.601
Token.
Token ID13
Feature activation+5.38e-02

Pos logit contributions
theless+0.175
zens+0.173
 indo+0.168
negie+0.166
 Regulatory+0.165
Neg logit contributions
IDS-0.155
 Norn-0.148
 juggling-0.147
thumbnails-0.139
Thor-0.133
Token<|endoftext|>
Token ID50256
Feature activation+6.76e-02

Pos logit contributions
IDS+0.201
 Norn+0.192
 juggling+0.189
thumbnails+0.179
Thor+0.172
Neg logit contributions
theless-0.233
zens-0.229
 indo-0.222
negie-0.220
 Regulatory-0.218
TokenThis
Token ID1212
Feature activation+2.55e-01

Pos logit contributions
IDS+0.276
 Norn+0.267
 juggling+0.264
thumbnails+0.249
Thor+0.239
Neg logit contributions
theless-0.267
zens-0.264
 indo-0.253
negie-0.251
 Regulatory-0.248
Token article
Token ID2708
Feature activation-8.34e-01

Pos logit contributions
IDS+1.016
 Norn+0.982
 juggling+0.971
thumbnails+0.911
Thor+0.886
Neg logit contributions
theless-1.035
zens-1.018
 indo-0.978
negie-0.977
 Regulatory-0.957
Token is
Token ID318
Feature activation-1.85e-01

Pos logit contributions
theless+3.158
zens+3.085
 indo+3.025
 Regulatory+2.934
negie+2.886
Neg logit contributions
IDS-3.558
 Norn-3.397
 juggling-3.298
thumbnails-3.202
Thor-3.094
Token about
Token ID546
Feature activation-2.65e-02

Pos logit contributions
theless+0.686
zens+0.674
 indo+0.650
negie+0.643
 Regulatory+0.632
Neg logit contributions
IDS-0.807
 Norn-0.780
 juggling-0.774
thumbnails-0.732
Thor-0.711
Token the
Token ID262
Feature activation-1.28e-01

Pos logit contributions
theless+0.111
zens+0.109
 indo+0.106
 Regulatory+0.104
negie+0.104
Neg logit contributions
IDS-0.102
 Norn-0.096
 juggling-0.096
thumbnails-0.093
Thor-0.087
Token payment
Token ID6074
Feature activation+2.60e-02

Pos logit contributions
theless+0.505
zens+0.496
 indo+0.481
negie+0.473
 Regulatory+0.470
Neg logit contributions
IDS-0.529
 Norn-0.505
 juggling-0.500
thumbnails-0.480
Thor-0.459
Token medium
Token ID7090
Feature activation+1.69e-01

Pos logit contributions
IDS+0.111
 Norn+0.107
 juggling+0.106
thumbnails+0.100
Thor+0.097
Neg logit contributions
theless-0.099
zens-0.097
 indo-0.093
negie-0.092
 Regulatory-0.090
Token by
Token ID416
Feature activation-1.43e-01

Pos logit contributions
theless+1.109
zens+1.091
 indo+1.058
negie+1.047
 Regulatory+1.036
Neg logit contributions
IDS-0.925
 Norn-0.886
 juggling-0.873
thumbnails-0.825
Thor-0.791
Token suicide
Token ID7341
Feature activation-3.24e-01

Pos logit contributions
theless+0.599
zens+0.588
 indo+0.569
negie+0.565
 Regulatory+0.557
Neg logit contributions
IDS-0.556
 Norn-0.533
 juggling-0.529
thumbnails-0.497
Thor-0.479
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
theless+1.452
zens+1.435
 indo+1.392
negie+1.377
 Regulatory+1.371
Neg logit contributions
IDS-1.138
 Norn-1.091
 juggling-1.075
thumbnails-1.025
Thor-0.967
Token Cambridge
Token ID14457
Feature activation+1.95e-01

Pos logit contributions
IDS+0.654
 Norn+0.631
 juggling+0.619
thumbnails+0.586
Thor+0.573
Neg logit contributions
theless-0.568
zens-0.555
 indo-0.536
negie-0.529
 Regulatory-0.514
Token,
Token ID11
Feature activation+1.85e-01

Pos logit contributions
IDS+0.720
 juggling+0.683
 Norn+0.682
thumbnails+0.649
Thor+0.616
Neg logit contributions
theless-0.849
zens-0.834
 indo-0.819
 Regulatory-0.812
negie-0.806
Token MA
Token ID8779
Feature activation+8.26e-01

Pos logit contributions
IDS+0.841
 Norn+0.814
 juggling+0.804
thumbnails+0.766
Thor+0.744
Neg logit contributions
theless-0.649
zens-0.636
 indo-0.610
negie-0.605
 Regulatory-0.590
Token:
Token ID25
Feature activation-7.36e-02

Pos logit contributions
IDS+2.116
 Norn+2.024
 juggling+1.945
thumbnails+1.802
Thor+1.673
Neg logit contributions
theless-4.533
zens-4.471
negie-4.400
 Regulatory-4.396
 indo-4.389
Token Harvard
Token ID11131
Feature activation+1.52e-01

Pos logit contributions
theless+0.194
zens+0.189
 indo+0.180
negie+0.178
 Regulatory+0.178
Neg logit contributions
IDS-0.397
 Norn-0.385
 juggling-0.381
thumbnails-0.369
Thor-0.356
Token University
Token ID2059
Feature activation-1.50e-01

Pos logit contributions
IDS+0.668
 Norn+0.644
 juggling+0.637
thumbnails+0.607
Thor+0.587
Neg logit contributions
theless-0.562
zens-0.551
 indo-0.531
negie-0.524
 Regulatory-0.518
Token Press
Token ID4332
Feature activation-1.00e-01

Pos logit contributions
theless+0.666
zens+0.660
 Regulatory+0.652
 indo+0.651
negie+0.641
Neg logit contributions
IDS-0.526
 Norn-0.503
 juggling-0.495
thumbnails-0.475
Thor-0.445
Token;
Token ID26
Feature activation-5.67e-01

Pos logit contributions
zens+0.345
theless+0.344
 Regulatory+0.336
 indo+0.334
negie+0.328
Neg logit contributions
IDS-0.451
 Norn-0.436
 juggling-0.428
thumbnails-0.411
qqa-0.394
Token +
Token ID1343
Feature activation+2.66e-01

Pos logit contributions
IDS+0.626
 Norn+0.600
 juggling+0.595
thumbnails+0.559
Thor+0.537
Neg logit contributions
theless-0.727
zens-0.712
 indo-0.693
negie-0.686
 Regulatory-0.679
Token medium
Token ID7090
Feature activation+3.49e-01

Pos logit contributions
IDS+1.077
 Norn+1.039
 juggling+1.036
thumbnails+0.971
Thor+0.943
Neg logit contributions
theless-1.062
zens-1.044
 indo-1.009
negie-1.001
 Regulatory-0.986
Token kick
Token ID4829
Feature activation+2.22e-01

Pos logit contributions
IDS+1.401
 juggling+1.366
 Norn+1.359
thumbnails+1.266
Thor+1.228
Neg logit contributions
theless-1.385
zens-1.363
 indo-1.315
negie-1.313
 Regulatory-1.289
Token,
Token ID11
Feature activation+1.43e-01

Pos logit contributions
IDS+0.819
 Norn+0.784
 juggling+0.778
thumbnails+0.729
Thor+0.702
Neg logit contributions
theless-0.972
zens-0.954
 indo-0.927
negie-0.919
 Regulatory-0.908
Token Laura
Token ID16753
Feature activation+2.39e-01

Pos logit contributions
IDS+0.629
 Norn+0.607
 juggling+0.599
thumbnails+0.571
Thor+0.553
Neg logit contributions
theless-0.526
zens-0.516
 indo-0.497
negie-0.491
 Regulatory-0.485
Token moves
Token ID6100
Feature activation+6.92e-01

Pos logit contributions
IDS+1.000
 Norn+0.968
 juggling+0.967
thumbnails+0.908
Thor+0.883
Neg logit contributions
theless-0.918
zens-0.899
negie-0.862
 indo-0.861
 Regulatory-0.847
Token in
Token ID287
Feature activation+7.73e-02

Pos logit contributions
 juggling+2.554
IDS+2.534
 Norn+2.501
thumbnails+2.295
 blink+2.266
Neg logit contributions
theless-2.934
zens-2.875
negie-2.777
 indo-2.770
 Regulatory-2.740
Token the
Token ID262
Feature activation+1.00e-01

Pos logit contributions
IDS+0.322
 Norn+0.311
 juggling+0.305
thumbnails+0.291
Thor+0.281
Neg logit contributions
theless-0.300
zens-0.296
 indo-0.285
negie-0.282
 Regulatory-0.279
Token corresponding
Token ID11188
Feature activation-1.19e-01

Pos logit contributions
IDS+0.355
 Norn+0.340
 juggling+0.336
thumbnails+0.316
Thor+0.302
Neg logit contributions
theless-0.454
zens-0.446
 indo-0.433
negie-0.430
 Regulatory-0.425
Token direction
Token ID4571
Feature activation+1.37e-01

Pos logit contributions
theless+0.566
zens+0.559
 indo+0.543
negie+0.537
 Regulatory+0.533
Neg logit contributions
IDS-0.396
 Norn-0.377
 juggling-0.369
thumbnails-0.348
Thor-0.332
Token,
Token ID11
Feature activation-9.24e-02

Pos logit contributions
IDS+0.528
 Norn+0.508
 juggling+0.503
thumbnails+0.472
Thor+0.455
Neg logit contributions
theless-0.580
zens-0.569
 indo-0.551
negie-0.546
 Regulatory-0.539
Token,
Token ID11
Feature activation+2.98e-01

Pos logit contributions
IDS+0.420
 Norn+0.403
 juggling+0.400
thumbnails+0.374
Thor+0.362
Neg logit contributions
theless-0.478
zens-0.470
 indo-0.454
negie-0.453
 Regulatory-0.446
Token a
Token ID257
Feature activation+8.64e-02

Pos logit contributions
IDS+1.215
 juggling+1.191
 Norn+1.164
thumbnails+1.088
Thor+1.063
Neg logit contributions
theless-1.179
zens-1.156
 indo-1.117
negie-1.113
 Regulatory-1.076
Token former
Token ID1966
Feature activation+4.98e-02

Pos logit contributions
IDS+0.337
 juggling+0.324
 Norn+0.324
thumbnails+0.300
Thor+0.293
Neg logit contributions
theless-0.357
zens-0.353
 indo-0.340
negie-0.339
 Regulatory-0.330
Token five
Token ID1936
Feature activation-6.34e-03

Pos logit contributions
IDS+0.207
 Norn+0.199
 juggling+0.198
thumbnails+0.185
Thor+0.181
Neg logit contributions
theless-0.194
zens-0.191
 indo-0.184
negie-0.183
 Regulatory-0.178
Token-
Token ID12
Feature activation-2.39e-01

Pos logit contributions
zens+0.026
theless+0.025
negie+0.024
 indo+0.024
 Regulatory+0.024
Neg logit contributions
IDS-0.026
 Norn-0.025
 juggling-0.024
thumbnails-0.023
Thor-0.022
Tokenterm
Token ID4354
Feature activation+7.34e-01

Pos logit contributions
theless+1.053
zens+1.049
 indo+0.993
 Regulatory+0.988
negie+0.986
Neg logit contributions
IDS-0.868
 Norn-0.835
 juggling-0.828
thumbnails-0.763
Thor-0.732
Token member
Token ID2888
Feature activation+6.40e-02

Pos logit contributions
IDS+3.180
 Norn+3.100
 juggling+3.081
thumbnails+2.849
 blink+2.803
Neg logit contributions
theless-2.645
zens-2.593
negie-2.514
 indo-2.509
enko-2.411
Token of
Token ID286
Feature activation-1.80e-01

Pos logit contributions
IDS+0.294
 Norn+0.288
 juggling+0.276
thumbnails+0.273
Thor+0.262
Neg logit contributions
theless-0.217
zens-0.215
 indo-0.208
 Regulatory-0.206
negie-0.205
Token Congress
Token ID3162
Feature activation+1.11e-01

Pos logit contributions
theless+0.747
zens+0.739
 Regulatory+0.713
 indo+0.712
negie+0.710
Neg logit contributions
IDS-0.695
 Norn-0.676
 juggling-0.659
thumbnails-0.635
Thor-0.608
Token who
Token ID508
Feature activation+4.23e-02

Pos logit contributions
IDS+0.472
 Norn+0.458
 juggling+0.446
thumbnails+0.430
Thor+0.413
Neg logit contributions
theless-0.424
zens-0.417
 indo-0.401
negie-0.396
 Regulatory-0.393
Token made
Token ID925
Feature activation-3.78e-02

Pos logit contributions
IDS+0.163
 juggling+0.157
 Norn+0.156
thumbnails+0.146
Thor+0.141
Neg logit contributions
theless-0.177
zens-0.175
negie-0.168
 indo-0.168
 Regulatory-0.165
Token with
Token ID351
Feature activation+1.13e-02

Pos logit contributions
IDS+0.756
 Norn+0.717
 juggling+0.705
thumbnails+0.665
Thor+0.632
Neg logit contributions
theless-1.091
zens-1.083
 indo-1.049
negie-1.040
 Regulatory-1.035
Token a
Token ID257
Feature activation+9.20e-02

Pos logit contributions
IDS+0.047
 Norn+0.045
 juggling+0.045
thumbnails+0.043
Thor+0.041
Neg logit contributions
theless-0.044
zens-0.043
 indo-0.042
negie-0.041
 Regulatory-0.041
Token new
Token ID649
Feature activation+7.53e-02

Pos logit contributions
IDS+0.352
 Norn+0.338
 juggling+0.335
thumbnails+0.313
Thor+0.303
Neg logit contributions
theless-0.392
zens-0.383
 indo-0.371
negie-0.369
 Regulatory-0.360
Token short
Token ID1790
Feature activation+1.42e-01

Pos logit contributions
IDS+0.310
 Norn+0.299
 juggling+0.296
thumbnails+0.279
Thor+0.270
Neg logit contributions
theless-0.298
zens-0.292
 indo-0.282
negie-0.280
 Regulatory-0.274
Token-
Token ID12
Feature activation+3.24e-01

Pos logit contributions
IDS+0.583
 Norn+0.556
 juggling+0.538
thumbnails+0.522
Thor+0.500
Neg logit contributions
zens-0.568
theless-0.562
 indo-0.535
 Regulatory-0.528
negie-0.526
Tokenterm
Token ID4354
Feature activation+7.56e-01

Pos logit contributions
IDS+0.983
 Norn+0.931
 juggling+0.920
thumbnails+0.861
Thor+0.815
Neg logit contributions
theless-1.624
zens-1.595
 indo-1.580
negie-1.550
 Regulatory-1.529
Token disability
Token ID14013
Feature activation+5.45e-03

Pos logit contributions
IDS+3.052
 Norn+2.951
 juggling+2.933
thumbnails+2.703
Thor+2.647
Neg logit contributions
theless-3.033
zens-2.948
 indo-2.848
negie-2.842
 Regulatory-2.769
Token plan
Token ID1410
Feature activation-3.32e-03

Pos logit contributions
IDS+0.020
 Norn+0.019
 juggling+0.018
thumbnails+0.018
Thor+0.017
Neg logit contributions
theless-0.024
zens-0.024
 indo-0.023
negie-0.023
 Regulatory-0.023
Token.
Token ID13
Feature activation+3.20e-02

Pos logit contributions
theless+0.013
zens+0.013
 indo+0.013
negie+0.013
 Regulatory+0.013
Neg logit contributions
IDS-0.013
 Norn-0.013
 juggling-0.013
thumbnails-0.012
Thor-0.012
Token\n
Token ID198
Feature activation+6.52e-02

Pos logit contributions
IDS+0.133
 Norn+0.128
 juggling+0.127
thumbnails+0.121
Thor+0.115
Neg logit contributions
theless-0.125
zens-0.123
 Regulatory-0.118
 indo-0.118
negie-0.117
Token\n
Token ID198
Feature activation+8.58e-02

Pos logit contributions
IDS+0.234
 Norn+0.222
 juggling+0.215
thumbnails+0.204
Thor+0.202
Neg logit contributions
theless-0.304
 indo-0.300
zens-0.286
negie-0.281
 Regulatory-0.273
Tokenos
Token ID418
Feature activation-1.20e-01

Pos logit contributions
theless+0.028
zens+0.026
 indo+0.026
negie+0.025
 Regulatory+0.025
Neg logit contributions
IDS-0.037
 Norn-0.035
 juggling-0.035
thumbnails-0.034
Thor-0.033
Token Hills
Token ID14379
Feature activation-1.39e-01

Pos logit contributions
theless+0.512
zens+0.509
 indo+0.492
negie+0.489
 Regulatory+0.487
Neg logit contributions
IDS-0.450
 Norn-0.432
 juggling-0.426
thumbnails-0.404
Thor-0.388
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
zens+0.603
theless+0.602
 indo+0.582
negie+0.577
 Regulatory+0.573
Neg logit contributions
IDS-0.509
 Norn-0.488
 juggling-0.480
thumbnails-0.456
Thor-0.436
Token California
Token ID3442
Feature activation-1.09e-01

Pos logit contributions
theless+0.040
zens+0.039
 indo+0.038
negie+0.038
 Regulatory+0.037
Neg logit contributions
IDS-0.047
 Norn-0.046
 juggling-0.045
thumbnails-0.043
Thor-0.042
Token.
Token ID13
Feature activation-7.02e-02

Pos logit contributions
zens+0.404
theless+0.387
 Regulatory+0.381
negie+0.380
 indo+0.378
Neg logit contributions
IDS-0.466
 Norn-0.454
 juggling-0.443
thumbnails-0.425
Thor-0.415
Token Hide
Token ID10415
Feature activation+7.88e-01

Pos logit contributions
theless+0.235
zens+0.230
 indo+0.220
negie+0.217
 Regulatory+0.214
Neg logit contributions
IDS-0.331
 Norn-0.320
 juggling-0.317
thumbnails-0.303
Thor-0.294
Token Caption
Token ID11260
Feature activation+5.68e-02

Pos logit contributions
IDS+2.282
 Norn+2.238
 juggling+2.070
thumbnails+1.986
Thor+1.916
Neg logit contributions
theless-4.273
 indo-4.266
zens-3.895
negie-3.840
terday-3.832
Token 14
Token ID1478
Feature activation-2.39e-01

Pos logit contributions
IDS+0.236
 Norn+0.227
 juggling+0.221
thumbnails+0.214
Thor+0.203
Neg logit contributions
theless-0.237
 indo-0.229
zens-0.216
negie-0.211
terday-0.209
Token of
Token ID286
Feature activation+1.30e-02

Pos logit contributions
theless+1.021
 indo+0.983
zens+0.970
negie+0.938
 Regulatory+0.921
Neg logit contributions
IDS-0.935
 Norn-0.902
 juggling-0.887
thumbnails-0.842
Thor-0.815
Token 15
Token ID1315
Feature activation-1.25e-01

Pos logit contributions
IDS+0.048
 Norn+0.046
 juggling+0.045
thumbnails+0.042
Thor+0.041
Neg logit contributions
theless-0.059
 indo-0.058
zens-0.056
negie-0.055
 Regulatory-0.053
Token Photos
Token ID9434
Feature activation-9.99e-02

Pos logit contributions
theless+0.630
 indo+0.610
zens+0.598
negie+0.582
terday+0.571
Neg logit contributions
IDS-0.404
 Norn-0.384
 juggling-0.370
thumbnails-0.356
Thor-0.334
Token budget
Token ID4466
Feature activation-1.79e-01

Pos logit contributions
theless+0.287
zens+0.284
 indo+0.273
negie+0.269
 Regulatory+0.268
Neg logit contributions
IDS-0.376
 Norn-0.363
 juggling-0.359
thumbnails-0.344
Thor-0.331
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
theless+0.726
zens+0.713
 indo+0.688
negie+0.682
 Regulatory+0.672
Neg logit contributions
IDS-0.720
 Norn-0.691
 juggling-0.675
thumbnails-0.654
Thor-0.624
Token Cance
Token ID43780
Feature activation+1.47e-01

Pos logit contributions
IDS+0.488
 Norn+0.472
 juggling+0.458
thumbnails+0.433
Thor+0.424
Neg logit contributions
theless-0.513
zens-0.501
 indo-0.489
negie-0.480
orf-0.466
Tokenling
Token ID1359
Feature activation+2.25e-01

Pos logit contributions
IDS+0.633
 Norn+0.608
 juggling+0.603
thumbnails+0.573
Thor+0.556
Neg logit contributions
theless-0.556
zens-0.555
 indo-0.520
negie-0.519
 Regulatory-0.513
Token it
Token ID340
Feature activation+1.05e-01

Pos logit contributions
IDS+0.814
 Norn+0.780
 juggling+0.768
thumbnails+0.721
Thor+0.695
Neg logit contributions
theless-1.006
zens-0.987
 indo-0.957
negie-0.949
 Regulatory-0.935
Token would
Token ID561
Feature activation+3.49e-01

Pos logit contributions
IDS+0.382
 Norn+0.367
 juggling+0.364
thumbnails+0.338
Thor+0.326
Neg logit contributions
theless-0.464
zens-0.456
negie-0.440
 indo-0.440
 Regulatory-0.434
Token allow
Token ID1249
Feature activation+1.68e-02

Pos logit contributions
IDS+1.344
 Norn+1.316
 juggling+1.305
thumbnails+1.208
 blink+1.184
Neg logit contributions
theless-1.463
zens-1.419
 indo-1.373
negie-1.367
 Regulatory-1.348
Token you
Token ID345
Feature activation+3.57e-01

Pos logit contributions
IDS+0.066
 Norn+0.064
 juggling+0.063
thumbnails+0.060
Thor+0.057
Neg logit contributions
theless-0.069
zens-0.068
 indo-0.066
negie-0.065
 Regulatory-0.064
Token to
Token ID284
Feature activation+1.51e-01

Pos logit contributions
IDS+1.271
 Norn+1.221
 juggling+1.196
thumbnails+1.143
Thor+1.086
Neg logit contributions
theless-1.602
zens-1.596
 indo-1.543
negie-1.522
 Regulatory-1.512
Token put
Token ID1234
Feature activation+8.24e-02

Pos logit contributions
IDS+0.620
 Norn+0.603
 juggling+0.598
thumbnails+0.553
 blink+0.546
Neg logit contributions
theless-0.600
zens-0.583
negie-0.560
 indo-0.560
 Regulatory-0.547
Token back
Token ID736
Feature activation+2.52e-01

Pos logit contributions
IDS+0.323
 Norn+0.311
 juggling+0.308
thumbnails+0.288
Thor+0.278
Neg logit contributions
theless-0.344
zens-0.334
 indo-0.324
negie-0.323
 Regulatory-0.316
Token
Token ID564
Feature activation-1.31e-02

Pos logit contributions
theless+0.451
zens+0.444
 indo+0.428
negie+0.424
 Regulatory+0.423
Neg logit contributions
IDS-0.415
 Norn-0.397
 juggling-0.394
thumbnails-0.372
Thor-0.357
Token
Token ID250
Feature activation-3.14e-02

Pos logit contributions
zens+0.052
theless+0.052
 indo+0.050
negie+0.050
 Regulatory+0.049
Neg logit contributions
IDS-0.053
 Norn-0.051
 juggling-0.050
thumbnails-0.049
Thor-0.046
TokenI
Token ID40
Feature activation+3.85e-02

Pos logit contributions
theless+0.127
zens+0.124
 indo+0.121
negie+0.119
 Regulatory+0.118
Neg logit contributions
IDS-0.127
 Norn-0.121
 juggling-0.120
thumbnails-0.114
Thor-0.110
Token believe
Token ID1975
Feature activation+4.58e-02

Pos logit contributions
IDS+0.145
 Norn+0.139
 juggling+0.138
thumbnails+0.129
Thor+0.125
Neg logit contributions
theless-0.165
zens-0.162
 indo-0.157
negie-0.155
 Regulatory-0.154
Token that
Token ID326
Feature activation+7.87e-02

Pos logit contributions
IDS+0.175
 Norn+0.168
 juggling+0.165
thumbnails+0.156
Thor+0.150
Neg logit contributions
theless-0.196
zens-0.192
 indo-0.186
negie-0.185
 Regulatory-0.182
Token [
Token ID685
Feature activation+1.21e-01

Pos logit contributions
IDS+0.303
 Norn+0.291
 juggling+0.288
thumbnails+0.270
Thor+0.261
Neg logit contributions
theless-0.332
zens-0.327
 indo-0.316
negie-0.314
 Regulatory-0.309
Tokenthere
Token ID8117
Feature activation+8.30e-02

Pos logit contributions
IDS+0.481
 Norn+0.460
 juggling+0.454
thumbnails+0.432
Thor+0.417
Neg logit contributions
theless-0.499
zens-0.490
 indo-0.475
negie-0.470
 Regulatory-0.465
Token]
Token ID60
Feature activation+7.24e-02

Pos logit contributions
IDS+0.345
 Norn+0.334
 juggling+0.330
thumbnails+0.312
Thor+0.300
Neg logit contributions
theless-0.327
zens-0.318
 indo-0.306
negie-0.303
 Regulatory-0.300
Token is
Token ID318
Feature activation+1.94e-02

Pos logit contributions
IDS+0.293
 Norn+0.282
 juggling+0.278
thumbnails+0.263
Thor+0.254
Neg logit contributions
theless-0.291
zens-0.286
 indo-0.276
negie-0.273
 Regulatory-0.270
Token this
Token ID428
Feature activation+9.17e-02

Pos logit contributions
IDS+0.080
 Norn+0.077
 juggling+0.076
thumbnails+0.072
Thor+0.070
Neg logit contributions
theless-0.077
zens-0.075
 indo-0.073
negie-0.072
 Regulatory-0.071
Token desire
Token ID6227
Feature activation+9.07e-02

Pos logit contributions
IDS+0.354
 Norn+0.341
 juggling+0.339
thumbnails+0.318
Thor+0.308
Neg logit contributions
theless-0.385
zens-0.378
 indo-0.364
negie-0.362
 Regulatory-0.357
TokenOCK
Token ID11290
Feature activation+3.44e-02

Pos logit contributions
theless+0.623
zens+0.617
 indo+0.568
 Regulatory+0.565
orf+0.560
Neg logit contributions
IDS-1.503
 Norn-1.486
 juggling-1.483
thumbnails-1.418
Thor-1.378
Token:
Token ID25
Feature activation-1.88e-01

Pos logit contributions
IDS+0.133
 Norn+0.128
 juggling+0.126
thumbnails+0.120
Thor+0.115
Neg logit contributions
theless-0.144
zens-0.142
 indo-0.137
negie-0.136
 Regulatory-0.135
Token\n
Token ID198
Feature activation-2.37e-01

Pos logit contributions
theless+0.779
zens+0.766
 indo+0.739
negie+0.735
 Regulatory+0.721
Neg logit contributions
IDS-0.738
 Norn-0.710
 juggling-0.700
thumbnails-0.660
Thor-0.637
Token\n
Token ID198
Feature activation-2.66e-01

Pos logit contributions
theless+1.038
 indo+1.006
zens+1.000
negie+0.971
 Regulatory+0.946
Neg logit contributions
IDS-0.901
 Norn-0.860
 juggling-0.842
thumbnails-0.801
Thor-0.780
TokenConnect
Token ID13313
Feature activation-6.68e-02

Pos logit contributions
theless+1.071
zens+1.055
 indo+1.017
negie+1.007
 Regulatory+0.996
Neg logit contributions
IDS-1.070
 Norn-1.031
 juggling-1.018
thumbnails-0.962
Thor-0.927
Token it
Token ID340
Feature activation+2.90e-01

Pos logit contributions
theless+0.288
zens+0.284
 indo+0.275
negie+0.271
 Regulatory+0.270
Neg logit contributions
IDS-0.250
 Norn-0.239
 juggling-0.236
thumbnails-0.223
Thor-0.214
Token to
Token ID284
Feature activation+1.62e-01

Pos logit contributions
IDS+0.973
 Norn+0.929
 juggling+0.915
thumbnails+0.857
Thor+0.819
Neg logit contributions
theless-1.361
zens-1.340
 indo-1.302
negie-1.291
 Regulatory-1.279
Token the
Token ID262
Feature activation+1.78e-02

Pos logit contributions
IDS+0.609
 Norn+0.583
 juggling+0.575
thumbnails+0.544
Thor+0.522
Neg logit contributions
theless-0.698
zens-0.687
 indo-0.666
negie-0.657
 Regulatory-0.652
Token watch
Token ID2342
Feature activation-2.05e-01

Pos logit contributions
IDS+0.076
 Norn+0.074
 juggling+0.073
thumbnails+0.069
Thor+0.067
Neg logit contributions
theless-0.068
zens-0.066
 indo-0.064
negie-0.064
 Regulatory-0.062
Token,
Token ID11
Feature activation+1.50e-01

Pos logit contributions
theless+0.854
zens+0.841
 indo+0.814
negie+0.803
 Regulatory+0.796
Neg logit contributions
IDS-0.799
 Norn-0.765
 juggling-0.755
thumbnails-0.717
Thor-0.689
Token without
Token ID1231
Feature activation+2.55e-01

Pos logit contributions
IDS+0.577
 juggling+0.557
 Norn+0.555
thumbnails+0.522
 blink+0.507
Neg logit contributions
theless-0.629
zens-0.613
negie-0.595
 indo-0.592
 Regulatory-0.578
Tokenе
Token ID16843
Feature activation+1.98e-01

Pos logit contributions
theless+0.610
zens+0.601
 indo+0.582
negie+0.577
 Regulatory+0.571
Neg logit contributions
IDS-0.499
 Norn-0.478
 juggling-0.472
thumbnails-0.445
Thor-0.425
Tokenк
Token ID31583
Feature activation+7.99e-02

Pos logit contributions
IDS+0.699
 Norn+0.669
 juggling+0.660
thumbnails+0.622
Thor+0.594
Neg logit contributions
theless-0.903
zens-0.888
 indo-0.861
negie-0.851
 Regulatory-0.843
Tokenс
Token ID21727
Feature activation+3.00e-01

Pos logit contributions
IDS+0.337
 Norn+0.325
 juggling+0.321
thumbnails+0.305
Thor+0.295
Neg logit contributions
theless-0.307
zens-0.301
 indo-0.290
negie-0.287
 Regulatory-0.284
Tokenе
Token ID16843
Feature activation+1.37e-01

Pos logit contributions
IDS+1.034
 Norn+0.988
 juggling+0.973
thumbnails+0.914
Thor+0.875
Neg logit contributions
theless-1.382
zens-1.361
 indo-1.321
negie-1.309
 Regulatory-1.296
Token
Token ID140
Feature activation-1.46e-01

Pos logit contributions
IDS+0.502
 Norn+0.480
 juggling+0.475
thumbnails+0.448
Thor+0.429
Neg logit contributions
theless-0.604
zens-0.594
 indo-0.576
negie-0.570
 Regulatory-0.565
Token
Token ID117
Feature activation+1.15e-01

Pos logit contributions
theless+0.524
zens+0.517
 indo+0.496
negie+0.490
 Regulatory+0.484
Neg logit contributions
IDS-0.653
 Norn-0.631
 juggling-0.624
thumbnails-0.596
Thor-0.576
Token
Token ID12466
Feature activation-9.79e-03

Pos logit contributions
IDS+0.408
 Norn+0.391
 juggling+0.386
thumbnails+0.364
Thor+0.347
Neg logit contributions
theless-0.523
zens-0.514
 indo-0.499
negie-0.493
 Regulatory-0.490
Token
Token ID248
Feature activation-9.12e-02

Pos logit contributions
theless+0.041
zens+0.040
 indo+0.039
negie+0.038
 Regulatory+0.038
Neg logit contributions
IDS-0.038
 Norn-0.037
 juggling-0.036
thumbnails-0.034
Thor-0.033
Tokenр
Token ID21169
Feature activation+5.98e-01

Pos logit contributions
theless+0.417
zens+0.410
 indo+0.398
negie+0.394
 Regulatory+0.390
Neg logit contributions
IDS-0.319
 Norn-0.305
 juggling-0.300
thumbnails-0.282
Thor-0.270
Tokenо
Token ID15166
Feature activation+3.41e-01

Pos logit contributions
IDS+2.018
 Norn+1.907
 juggling+1.874
thumbnails+1.760
Thor+1.706
Neg logit contributions
theless-2.787
zens-2.741
 indo-2.675
negie-2.646
 Regulatory-2.614
Token
Token ID141
Feature activation-1.06e-01

Pos logit contributions
IDS+1.405
 Norn+1.348
 juggling+1.322
thumbnails+1.256
Thor+1.222
Neg logit contributions
theless-1.352
zens-1.320
 indo-1.284
negie-1.269
 Regulatory-1.245
Token to
Token ID284
Feature activation+1.99e-02

Pos logit contributions
IDS+0.188
 Norn+0.181
 juggling+0.173
thumbnails+0.168
Thor+0.161
Neg logit contributions
zens-0.220
theless-0.217
 indo-0.211
negie-0.209
 Regulatory-0.206
Token broadcast
Token ID7025
Feature activation-2.76e-02

Pos logit contributions
IDS+0.079
 Norn+0.076
 juggling+0.076
thumbnails+0.071
 blink+0.069
Neg logit contributions
theless-0.082
zens-0.079
 indo-0.077
negie-0.076
 Regulatory-0.075
Token their
Token ID511
Feature activation-5.89e-02

Pos logit contributions
theless+0.109
zens+0.106
 indo+0.103
negie+0.102
 Regulatory+0.101
Neg logit contributions
IDS-0.114
 Norn-0.109
 juggling-0.109
thumbnails-0.103
Thor-0.099
Token experiences
Token ID6461
Feature activation+1.42e-02

Pos logit contributions
theless+0.236
zens+0.230
 indo+0.223
negie+0.221
 Regulatory+0.218
Neg logit contributions
IDS-0.241
 Norn-0.232
 juggling-0.230
thumbnails-0.217
Thor-0.210
Token and
Token ID290
Feature activation+8.55e-02

Pos logit contributions
IDS+0.056
 Norn+0.054
 juggling+0.054
thumbnails+0.050
Thor+0.048
Neg logit contributions
theless-0.058
zens-0.057
 indo-0.055
negie-0.055
 Regulatory-0.054
Token provide
Token ID2148
Feature activation+1.68e-01

Pos logit contributions
IDS+0.340
 Norn+0.327
 juggling+0.326
thumbnails+0.307
Thor+0.295
Neg logit contributions
theless-0.350
zens-0.342
 indo-0.329
negie-0.329
 Regulatory-0.323
Token commentary
Token ID14604
Feature activation-1.36e-01

Pos logit contributions
IDS+0.633
 juggling+0.608
 Norn+0.605
thumbnails+0.568
Thor+0.541
Neg logit contributions
theless-0.721
zens-0.698
 indo-0.682
negie-0.678
 Regulatory-0.664
Token in
Token ID287
Feature activation+1.60e-01

Pos logit contributions
theless+0.556
zens+0.546
 indo+0.528
negie+0.522
 Regulatory+0.516
Neg logit contributions
IDS-0.542
 Norn-0.521
 juggling-0.514
thumbnails-0.487
Thor-0.469
Token real
Token ID1103
Feature activation-1.19e-01

Pos logit contributions
IDS+0.608
 Norn+0.587
 juggling+0.586
thumbnails+0.547
 blink+0.524
Neg logit contributions
theless-0.681
zens-0.660
 indo-0.641
negie-0.639
 Regulatory-0.626
Token-
Token ID12
Feature activation+1.16e-02

Pos logit contributions
zens+0.513
theless+0.508
 indo+0.492
negie+0.489
 Regulatory+0.480
Neg logit contributions
IDS-0.448
 Norn-0.429
 juggling-0.418
thumbnails-0.393
Thor-0.387
Tokentime
Token ID2435
Feature activation+6.27e-02

Pos logit contributions
IDS+0.045
 Norn+0.043
 juggling+0.043
thumbnails+0.039
Thor+0.039
Neg logit contributions
theless-0.048
zens-0.048
 indo-0.046
negie-0.046
 Regulatory-0.045
Token guests
Token ID10650
Feature activation-2.12e-02

Pos logit contributions
theless+0.493
zens+0.485
 indo+0.471
negie+0.467
 Regulatory+0.461
Neg logit contributions
IDS-0.388
 Norn-0.371
 juggling-0.366
thumbnails-0.344
Thor-0.330
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
theless+0.087
zens+0.086
 indo+0.083
negie+0.082
 Regulatory+0.081
Neg logit contributions
IDS-0.084
 Norn-0.080
 juggling-0.080
thumbnails-0.075
Thor-0.072
Token had
Token ID550
Feature activation-3.51e-01

Pos logit contributions
theless+0.435
zens+0.427
 indo+0.414
negie+0.412
 Regulatory+0.404
Neg logit contributions
IDS-0.425
 Norn-0.409
 juggling-0.405
thumbnails-0.381
Thor-0.369
Token enjoyed
Token ID8359
Feature activation+8.61e-03

Pos logit contributions
theless+1.429
zens+1.404
 indo+1.358
negie+1.336
 Regulatory+1.326
Neg logit contributions
IDS-1.403
 Norn-1.348
 juggling-1.329
thumbnails-1.264
Thor-1.216
Token their
Token ID511
Feature activation-3.95e-02

Pos logit contributions
IDS+0.035
 Norn+0.034
 juggling+0.033
thumbnails+0.031
Thor+0.031
Neg logit contributions
theless-0.034
zens-0.034
 indo-0.033
negie-0.032
 Regulatory-0.032
Token Easter
Token ID20111
Feature activation-1.99e-02

Pos logit contributions
theless+0.148
zens+0.145
 indo+0.139
negie+0.139
 Regulatory+0.137
Neg logit contributions
IDS-0.170
 Norn-0.165
 juggling-0.163
thumbnails-0.153
Thor-0.149
Token brunch
Token ID50219
Feature activation-2.93e-01

Pos logit contributions
theless+0.063
zens+0.062
 indo+0.059
negie+0.058
 Regulatory+0.057
Neg logit contributions
IDS-0.098
 Norn-0.094
 juggling-0.093
thumbnails-0.090
Thor-0.087
Token,
Token ID11
Feature activation-7.52e-02

Pos logit contributions
theless+1.229
zens+1.208
 indo+1.171
negie+1.155
 Regulatory+1.142
Neg logit contributions
IDS-1.135
 Norn-1.087
 juggling-1.070
thumbnails-1.019
Thor-0.979
Token when
Token ID618
Feature activation+9.97e-02

Pos logit contributions
theless+0.309
zens+0.304
 indo+0.294
negie+0.293
 Regulatory+0.287
Neg logit contributions
IDS-0.297
 Norn-0.287
 juggling-0.286
thumbnails-0.267
Thor-0.258
Token Tor
Token ID4022
Feature activation-9.47e-02

Pos logit contributions
IDS+0.395
 Norn+0.381
 juggling+0.376
thumbnails+0.352
Thor+0.341
Neg logit contributions
theless-0.410
zens-0.402
 indo-0.390
negie-0.389
 Regulatory-0.381
Tokeni
Token ID72
Feature activation+9.87e-02

Pos logit contributions
theless+0.401
zens+0.389
 indo+0.387
negie+0.373
 Regulatory+0.369
Neg logit contributions
IDS-0.369
 Norn-0.353
 juggling-0.347
thumbnails-0.329
Thor-0.319
TokenI
Token ID40
Feature activation+1.36e-01

Pos logit contributions
IDS+0.072
 Norn+0.069
 juggling+0.068
thumbnails+0.065
Thor+0.062
Neg logit contributions
theless-0.074
zens-0.072
 indo-0.070
negie-0.069
 Regulatory-0.069
Token washed
Token ID18989
Feature activation+2.86e-01

Pos logit contributions
IDS+0.518
 Norn+0.505
 juggling+0.491
thumbnails+0.464
 blink+0.450
Neg logit contributions
theless-0.569
zens-0.564
 indo-0.545
negie-0.544
 Regulatory-0.537
Token the
Token ID262
Feature activation+1.00e-01

Pos logit contributions
IDS+1.066
 Norn+1.030
 juggling+1.010
thumbnails+0.947
Thor+0.908
Neg logit contributions
theless-1.239
zens-1.221
negie-1.180
 indo-1.176
 Regulatory-1.164
Token broken
Token ID5445
Feature activation+2.98e-01

Pos logit contributions
IDS+0.399
 Norn+0.387
 juggling+0.380
thumbnails+0.358
Thor+0.346
Neg logit contributions
theless-0.408
zens-0.402
negie-0.387
 indo-0.386
 Regulatory-0.382
Token st
Token ID336
Feature activation-2.77e-04

Pos logit contributions
IDS+1.229
 Norn+1.201
 juggling+1.178
thumbnails+1.109
Thor+1.072
Neg logit contributions
theless-1.167
zens-1.147
negie-1.105
 indo-1.092
 Regulatory-1.089
Tokenon
Token ID261
Feature activation+6.49e-02

Pos logit contributions
theless+0.001
zens+0.001
 indo+0.001
negie+0.001
orf+0.001
Neg logit contributions
IDS-0.001
 juggling-0.001
 Norn-0.001
thumbnails-0.001
Thor-0.001
Tokenework
Token ID6433
Feature activation+3.25e-02

Pos logit contributions
IDS+0.252
 Norn+0.244
 juggling+0.241
thumbnails+0.226
Thor+0.220
Neg logit contributions
theless-0.271
zens-0.269
 indo-0.257
negie-0.254
 Regulatory-0.250
Token with
Token ID351
Feature activation+3.72e-02

Pos logit contributions
IDS+0.115
 Norn+0.111
 juggling+0.109
thumbnails+0.102
Thor+0.098
Neg logit contributions
theless-0.147
zens-0.145
 indo-0.140
negie-0.139
 Regulatory-0.138
Token Val
Token ID3254
Feature activation+1.41e-01

Pos logit contributions
IDS+0.151
 Norn+0.146
 juggling+0.144
thumbnails+0.136
Thor+0.131
Neg logit contributions
theless-0.148
zens-0.146
negie-0.140
 indo-0.140
 Regulatory-0.138
Tokenle
Token ID293
Feature activation-1.20e-01

Pos logit contributions
IDS+0.576
 Norn+0.554
 juggling+0.547
thumbnails+0.519
Thor+0.501
Neg logit contributions
theless-0.565
zens-0.554
 indo-0.536
negie-0.530
 Regulatory-0.524
Tokenjo
Token ID7639
Feature activation-1.24e-01

Pos logit contributions
theless+0.402
zens+0.394
 indo+0.378
negie+0.373
 Regulatory+0.368
Neg logit contributions
IDS-0.565
 Norn-0.546
 juggling-0.540
thumbnails-0.516
Thor-0.501
Tokens
Token ID82
Feature activation+1.21e-01

Pos logit contributions
theless+0.386
zens+0.379
 indo+0.368
negie+0.364
 Regulatory+0.360
Neg logit contributions
IDS-0.333
 Norn-0.319
 juggling-0.314
thumbnails-0.297
Thor-0.285
Token World
Token ID2159
Feature activation-7.97e-02

Pos logit contributions
IDS+0.492
 Norn+0.475
 juggling+0.468
thumbnails+0.438
Thor+0.429
Neg logit contributions
theless-0.484
zens-0.473
negie-0.458
 indo-0.457
 Regulatory-0.449
Token Ocean
Token ID10692
Feature activation+2.89e-01

Pos logit contributions
theless+0.268
zens+0.266
 indo+0.253
 Regulatory+0.251
negie+0.250
Neg logit contributions
IDS-0.373
 juggling-0.358
 Norn-0.356
thumbnails-0.344
Thor-0.329
Token Day
Token ID3596
Feature activation+2.51e-02

Pos logit contributions
IDS+0.957
 juggling+0.910
 Norn+0.909
thumbnails+0.868
Thor+0.820
Neg logit contributions
theless-1.359
zens-1.351
 indo-1.311
 Regulatory-1.298
negie-1.290
Token will
Token ID481
Feature activation-1.04e-01

Pos logit contributions
IDS+0.102
 Norn+0.098
 juggling+0.097
thumbnails+0.091
Thor+0.088
Neg logit contributions
theless-0.101
zens-0.099
 indo-0.095
negie-0.095
 Regulatory-0.093
Token sadly
Token ID21098
Feature activation-1.25e-01

Pos logit contributions
theless+0.403
zens+0.394
 indo+0.380
negie+0.378
 Regulatory+0.372
Neg logit contributions
IDS-0.433
 Norn-0.420
 juggling-0.414
thumbnails-0.391
Thor-0.379
Token be
Token ID307
Feature activation-3.81e-01

Pos logit contributions
theless+0.490
zens+0.481
 indo+0.464
negie+0.460
 Regulatory+0.454
Neg logit contributions
IDS-0.521
 Norn-0.502
 juggling-0.496
thumbnails-0.470
Thor-0.454
Token remembered
Token ID12086
Feature activation-1.04e-01

Pos logit contributions
theless+1.486
zens+1.459
 indo+1.409
negie+1.394
 Regulatory+1.377
Neg logit contributions
IDS-1.586
 Norn-1.526
 juggling-1.508
thumbnails-1.432
Thor-1.383
Token for
Token ID329
Feature activation-2.36e-01

Pos logit contributions
theless+0.426
zens+0.421
 indo+0.406
negie+0.401
 Regulatory+0.395
Neg logit contributions
IDS-0.412
 Norn-0.394
 juggling-0.388
thumbnails-0.372
Thor-0.355
Token the
Token ID262
Feature activation+1.10e-02

Pos logit contributions
theless+0.973
zens+0.958
 indo+0.925
negie+0.912
 Regulatory+0.907
Neg logit contributions
IDS-0.927
 Norn-0.887
 juggling-0.873
thumbnails-0.835
Thor-0.801
Token heart
Token ID2612
Feature activation-3.49e-01

Pos logit contributions
IDS+0.044
 Norn+0.042
 juggling+0.042
thumbnails+0.040
Thor+0.038
Neg logit contributions
theless-0.045
zens-0.044
 indo-0.043
negie-0.042
 Regulatory-0.042
Token up
Token ID510
Feature activation-5.24e-01

Pos logit contributions
theless+0.130
 indo+0.125
zens+0.124
negie+0.122
 Regulatory+0.120
Neg logit contributions
IDS-0.106
 juggling-0.102
 Norn-0.102
thumbnails-0.095
Thor-0.091
Token some
Token ID617
Feature activation+2.19e-01

Pos logit contributions
theless+2.166
zens+2.080
 indo+2.075
negie+2.023
 Regulatory+1.980
Neg logit contributions
IDS-2.108
 Norn-2.036
 juggling-2.026
thumbnails-1.901
Thor-1.830
Token got
Token ID1392
Feature activation-3.18e-01

Pos logit contributions
IDS+0.814
 juggling+0.789
 Norn+0.781
thumbnails+0.730
Thor+0.699
Neg logit contributions
theless-0.942
zens-0.918
 indo-0.909
negie-0.898
 Regulatory-0.871
Tokencha
Token ID11693
Feature activation+6.68e-02

Pos logit contributions
theless+0.745
 indo+0.669
zens+0.668
 Regulatory+0.658
negie+0.645
Neg logit contributions
IDS-1.835
 Norn-1.785
 juggling-1.780
thumbnails-1.742
Thor-1.675
Token's
Token ID338
Feature activation+4.08e-01

Pos logit contributions
IDS+0.270
 Norn+0.253
 juggling+0.252
thumbnails+0.243
 blink+0.232
Neg logit contributions
theless-0.281
 indo-0.272
zens-0.264
negie-0.262
terday-0.254
Token I
Token ID314
Feature activation-1.52e-01

Pos logit contributions
IDS+1.455
 Norn+1.423
 juggling+1.387
thumbnails+1.268
Thor+1.259
Neg logit contributions
theless-1.877
 indo-1.823
zens-1.778
negie-1.748
terday-1.680
Token encountered
Token ID12956
Feature activation-3.16e-01

Pos logit contributions
theless+0.692
zens+0.679
 indo+0.676
negie+0.669
 Regulatory+0.651
Neg logit contributions
IDS-0.533
 Norn-0.509
 juggling-0.500
thumbnails-0.464
Thor-0.449
Token.
Token ID13
Feature activation-8.57e-02

Pos logit contributions
theless+1.476
 indo+1.436
zens+1.408
negie+1.368
 Regulatory+1.330
Neg logit contributions
IDS-1.127
 Norn-1.084
 juggling-1.054
thumbnails-0.991
Thor-0.967
Token Assuming
Token ID33238
Feature activation-2.41e-01

Pos logit contributions
theless+0.388
zens+0.384
 indo+0.370
negie+0.368
 Regulatory+0.356
Neg logit contributions
IDS-0.309
 Norn-0.299
 juggling-0.291
thumbnails-0.272
Thor-0.262
Token you
Token ID345
Feature activation+1.76e-01

Pos logit contributions
theless+1.099
zens+1.058
 indo+1.054
negie+1.030
 Regulatory+1.014
Neg logit contributions
IDS-0.866
 Norn-0.826
 juggling-0.819
thumbnails-0.761
Thor-0.740
Token already
Token ID1541
Feature activation+8.54e-02

Pos logit contributions
IDS+0.677
 Norn+0.650
 juggling+0.645
 blink+0.594
thumbnails+0.593
Neg logit contributions
theless-0.747
 indo-0.726
zens-0.712
negie-0.702
 Regulatory-0.690
Token\n
Token ID198
Feature activation+5.34e-02

Pos logit contributions
IDS+0.285
 Norn+0.274
 juggling+0.269
thumbnails+0.254
Thor+0.246
Neg logit contributions
theless-0.307
zens-0.299
 indo-0.290
negie-0.286
 Regulatory-0.283
Token\n
Token ID198
Feature activation-1.09e-01

Pos logit contributions
IDS+0.195
 Norn+0.186
 juggling+0.180
thumbnails+0.171
Thor+0.169
Neg logit contributions
theless-0.244
 indo-0.241
zens-0.232
negie-0.226
 Regulatory-0.220
TokenThe
Token ID464
Feature activation-3.21e-01

Pos logit contributions
theless+0.430
zens+0.427
 indo+0.405
negie+0.405
 Regulatory+0.401
Neg logit contributions
IDS-0.441
 Norn-0.428
 juggling-0.424
thumbnails-0.398
Thor-0.379
Token concert
Token ID10010
Feature activation+4.94e-02

Pos logit contributions
theless+1.289
zens+1.270
 indo+1.220
negie+1.213
 Regulatory+1.202
Neg logit contributions
IDS-1.296
 Norn-1.229
 juggling-1.217
thumbnails-1.175
Thor-1.121
Token venue
Token ID14359
Feature activation-4.61e-02

Pos logit contributions
IDS+0.201
 Norn+0.193
 juggling+0.191
thumbnails+0.181
Thor+0.175
Neg logit contributions
theless-0.198
zens-0.194
 indo-0.188
negie-0.186
 Regulatory-0.184
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
theless+0.190
zens+0.187
 indo+0.181
negie+0.179
 Regulatory+0.177
Neg logit contributions
IDS-0.182
 Norn-0.175
 juggling-0.172
thumbnails-0.163
Thor-0.157
Token filmed
Token ID18976
Feature activation-5.82e-02

Pos logit contributions
IDS+0.194
 Norn+0.188
 juggling+0.186
thumbnails+0.174
Thor+0.169
Neg logit contributions
theless-0.184
zens-0.181
 indo-0.174
negie-0.173
 Regulatory-0.170
Token in
Token ID287
Feature activation+9.06e-02

Pos logit contributions
theless+0.242
zens+0.237
 indo+0.229
negie+0.228
 Regulatory+0.225
Neg logit contributions
IDS-0.228
 Norn-0.219
 juggling-0.218
thumbnails-0.204
Thor-0.197
Token,
Token ID11
Feature activation-5.75e-02

Pos logit contributions
IDS+0.381
 Norn+0.368
 juggling+0.363
thumbnails+0.344
Thor+0.332
Neg logit contributions
theless-0.350
zens-0.343
 indo-0.331
negie-0.328
 Regulatory-0.323
Token a
Token ID257
Feature activation+3.36e-02

Pos logit contributions
theless+0.232
zens+0.227
 indo+0.220
negie+0.217
 Regulatory+0.214
Neg logit contributions
IDS-0.233
 Norn-0.224
 juggling-0.222
thumbnails-0.210
Thor-0.203
Token wonderful
Token ID7932
Feature activation+2.56e-01

Pos logit contributions
IDS+0.135
 Norn+0.131
 juggling+0.129
thumbnails+0.122
Thor+0.118
Neg logit contributions
theless-0.135
zens-0.132
 indo-0.128
negie-0.127
 Regulatory-0.125
Tokenout
Token ID448
Feature activation+3.41e-01

Pos logit contributions
theless+0.573
zens+0.563
 indo+0.543
negie+0.537
 Regulatory+0.530
Neg logit contributions
IDS-0.631
 Norn-0.608
 juggling-0.600
thumbnails-0.571
Thor-0.552
Tokenhed
Token ID704
Feature activation-3.20e-01

Pos logit contributions
IDS+1.426
 Norn+1.369
 juggling+1.350
thumbnails+1.285
Thor+1.242
Neg logit contributions
theless-1.324
zens-1.289
 indo-1.254
negie-1.243
 Regulatory-1.228
Token client
Token ID5456
Feature activation+2.52e-01

Pos logit contributions
theless+1.250
zens+1.224
 indo+1.182
negie+1.168
 Regulatory+1.151
Neg logit contributions
IDS-1.328
 Norn-1.281
 juggling-1.259
thumbnails-1.208
Thor-1.161
Token (
Token ID357
Feature activation+5.07e-02

Pos logit contributions
IDS+0.964
 Norn+0.919
 juggling+0.910
thumbnails+0.858
Thor+0.829
Neg logit contributions
theless-1.080
zens-1.052
 indo-1.024
negie-1.014
 Regulatory-1.001
TokenFrank
Token ID17439
Feature activation+8.97e-03

Pos logit contributions
IDS+0.207
 Norn+0.199
 juggling+0.196
thumbnails+0.186
Thor+0.180
Neg logit contributions
theless-0.203
zens-0.199
 indo-0.193
negie-0.191
 Regulatory-0.188
Token Al
Token ID978
Feature activation-4.77e-01

Pos logit contributions
IDS+0.037
 Norn+0.035
 juggling+0.035
thumbnails+0.033
Thor+0.032
Neg logit contributions
theless-0.036
zens-0.035
 indo-0.034
negie-0.034
 Regulatory-0.033
Tokenber
Token ID527
Feature activation+1.84e-01

Pos logit contributions
zens+1.673
theless+1.659
 indo+1.564
negie+1.558
enko+1.554
Neg logit contributions
IDS-2.185
 Norn-2.099
 juggling-2.091
thumbnails-1.960
Thor-1.891
Tokents
Token ID912
Feature activation-1.69e-01

Pos logit contributions
IDS+0.808
 Norn+0.777
 juggling+0.768
thumbnails+0.734
Thor+0.712
Neg logit contributions
theless-0.675
zens-0.656
 indo-0.638
negie-0.628
 Regulatory-0.623
Tokenon
Token ID261
Feature activation+2.14e-01

Pos logit contributions
theless+0.726
zens+0.716
 indo+0.692
negie+0.685
 Regulatory+0.679
Neg logit contributions
IDS-0.640
 Norn-0.613
 juggling-0.604
thumbnails-0.572
Thor-0.548
Token.)
Token ID2014
Feature activation+9.20e-02

Pos logit contributions
IDS+0.829
 Norn+0.794
 juggling+0.785
thumbnails+0.742
Thor+0.715
Neg logit contributions
theless-0.897
zens-0.880
 indo-0.853
negie-0.846
 Regulatory-0.835
Token In
Token ID554
Feature activation-1.33e-02

Pos logit contributions
IDS+0.360
 Norn+0.347
 juggling+0.341
thumbnails+0.324
Thor+0.315
Neg logit contributions
theless-0.379
zens-0.376
 indo-0.365
negie-0.360
 Regulatory-0.354
Token Sh
Token ID911
Feature activation-2.52e-01

Pos logit contributions
theless+0.742
zens+0.733
 indo+0.703
negie+0.697
 Regulatory+0.696
Neg logit contributions
IDS-0.736
 Norn-0.701
 juggling-0.699
thumbnails-0.666
Thor-0.635
Tokenand
Token ID392
Feature activation-5.69e-01

Pos logit contributions
theless+1.089
zens+1.072
 indo+1.037
negie+1.028
 Regulatory+1.016
Neg logit contributions
IDS-0.948
 Norn-0.908
 juggling-0.896
thumbnails-0.846
Thor-0.813
Tokenobil
Token ID25898
Feature activation-4.55e-01

Pos logit contributions
zens+2.546
theless+2.539
negie+2.447
orf+2.442
 Regulatory+2.421
Neg logit contributions
IDS-2.017
 Norn-1.932
 juggling-1.874
thumbnails-1.816
qqa-1.678
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
theless+1.913
zens+1.890
 indo+1.821
negie+1.805
 Regulatory+1.788
Neg logit contributions
IDS-1.747
 Norn-1.680
 juggling-1.651
thumbnails-1.565
Thor-1.508
Token right
Token ID826
Feature activation-5.09e-01

Pos logit contributions
theless+0.502
zens+0.489
 indo+0.475
negie+0.473
 Regulatory+0.462
Neg logit contributions
IDS-0.437
 juggling-0.424
 Norn-0.420
thumbnails-0.391
Thor-0.376
Token,
Token ID11
Feature activation-5.68e-01

Pos logit contributions
zens+1.995
theless+1.919
 indo+1.902
 Regulatory+1.899
orf+1.863
Neg logit contributions
IDS-2.059
 Norn-2.007
 juggling-1.876
thumbnails-1.859
Thor-1.796
Token after
Token ID706
Feature activation-2.37e-01

Pos logit contributions
theless+2.354
zens+2.306
 indo+2.237
negie+2.214
 Regulatory+2.183
Neg logit contributions
IDS-2.237
 Norn-2.148
 juggling-2.128
thumbnails-2.009
Thor-1.934
Token a
Token ID257
Feature activation-2.41e-01

Pos logit contributions
theless+0.937
zens+0.919
 indo+0.889
negie+0.879
 Regulatory+0.867
Neg logit contributions
IDS-0.977
 Norn-0.939
 juggling-0.929
thumbnails-0.881
Thor-0.850
Token debate
Token ID4384
Feature activation+4.54e-02

Pos logit contributions
theless+0.935
zens+0.905
 indo+0.883
negie+0.875
 Regulatory+0.849
Neg logit contributions
IDS-1.018
 juggling-0.982
 Norn-0.974
thumbnails-0.913
Thor-0.886
Token on
Token ID319
Feature activation-2.98e-01

Pos logit contributions
IDS+0.174
 juggling+0.168
 Norn+0.167
thumbnails+0.155
Thor+0.150
Neg logit contributions
theless-0.194
zens-0.189
 indo-0.184
negie-0.183
 Regulatory-0.177
Token the
Token ID262
Feature activation-3.07e-01

Pos logit contributions
theless+1.088
zens+1.068
 indo+1.030
negie+1.016
 Regulatory+1.010
Neg logit contributions
IDS-1.311
 Norn-1.263
 juggling-1.245
thumbnails-1.190
Thor-1.151
Token The
Token ID383
Feature activation-1.35e-01

Pos logit contributions
theless+1.402
zens+1.379
 indo+1.334
negie+1.320
 Regulatory+1.306
Neg logit contributions
IDS-1.319
 Norn-1.266
 juggling-1.249
thumbnails-1.183
Thor-1.139
Token S
Token ID311
Feature activation+1.16e-01

Pos logit contributions
theless+0.526
zens+0.512
 indo+0.499
negie+0.493
 Regulatory+0.483
Neg logit contributions
IDS-0.569
 Norn-0.548
 juggling-0.544
thumbnails-0.511
Thor-0.496
TokenAC
Token ID2246
Feature activation+2.00e-01

Pos logit contributions
IDS+0.400
 Norn+0.380
 juggling+0.374
thumbnails+0.351
Thor+0.336
Neg logit contributions
theless-0.539
zens-0.529
 indo-0.515
negie-0.509
 Regulatory-0.504
Token met
Token ID1138
Feature activation-3.04e-01

Pos logit contributions
IDS+0.798
 Norn+0.760
 juggling+0.748
thumbnails+0.707
Thor+0.685
Neg logit contributions
theless-0.826
zens-0.807
 indo-0.781
negie-0.772
 Regulatory-0.760
Token to
Token ID284
Feature activation-2.89e-01

Pos logit contributions
theless+1.281
zens+1.267
 indo+1.220
negie+1.208
 Regulatory+1.197
Neg logit contributions
IDS-1.167
 Norn-1.122
 juggling-1.103
thumbnails-1.048
Thor-1.007
Token hear
Token ID3285
Feature activation-5.11e-01

Pos logit contributions
theless+1.120
zens+1.092
 indo+1.056
negie+1.047
 Regulatory+1.031
Neg logit contributions
IDS-1.217
 Norn-1.174
 juggling-1.162
thumbnails-1.097
Thor-1.064
Token local
Token ID1957
Feature activation-3.98e-01

Pos logit contributions
theless+2.168
zens+2.154
 indo+2.078
 Regulatory+2.046
negie+2.034
Neg logit contributions
IDS-1.927
 Norn-1.853
 juggling-1.796
thumbnails-1.735
Thor-1.642
Token transportation
Token ID9358
Feature activation-7.01e-02

Pos logit contributions
theless+1.730
zens+1.704
 indo+1.650
negie+1.633
 Regulatory+1.619
Neg logit contributions
IDS-1.476
 Norn-1.414
 juggling-1.392
thumbnails-1.317
Thor-1.263
Token and
Token ID290
Feature activation+3.08e-03

Pos logit contributions
theless+0.282
zens+0.277
 indo+0.268
negie+0.265
 Regulatory+0.262
Neg logit contributions
IDS-0.283
 Norn-0.272
 juggling-0.269
thumbnails-0.255
Thor-0.246
Token safety
Token ID3747
Feature activation+1.16e-01

Pos logit contributions
IDS+0.011
 Norn+0.010
 juggling+0.010
thumbnails+0.010
Thor+0.009
Neg logit contributions
theless-0.014
zens-0.014
 indo-0.013
negie-0.013
 Regulatory-0.013
Token experts
Token ID6154
Feature activation+1.40e-01

Pos logit contributions
IDS+0.485
 Norn+0.467
 juggling+0.466
thumbnails+0.433
Thor+0.424
Neg logit contributions
theless-0.455
zens-0.443
negie-0.425
 indo-0.425
 Regulatory-0.417
Token.
Token ID13
Feature activation-7.53e-02

Pos logit contributions
IDS+0.490
 Norn+0.473
 juggling+0.471
thumbnails+0.435
Thor+0.422
Neg logit contributions
theless-0.558
zens-0.553
 indo-0.534
negie-0.530
 Regulatory-0.524
Token If
Token ID1002
Feature activation-1.34e-01

Pos logit contributions
theless+0.317
zens+0.311
 indo+0.303
negie+0.297
orf+0.288
Neg logit contributions
IDS-0.298
 Norn-0.287
 juggling-0.277
thumbnails-0.263
Thor-0.258
Token this
Token ID428
Feature activation+1.84e-01

Pos logit contributions
theless+0.564
zens+0.554
 indo+0.536
negie+0.529
 Regulatory+0.526
Neg logit contributions
IDS-0.519
 Norn-0.497
 juggling-0.489
thumbnails-0.466
Thor-0.448
Token does
Token ID857
Feature activation-2.18e-01

Pos logit contributions
IDS+0.723
 Norn+0.699
 juggling+0.692
thumbnails+0.645
Thor+0.626
Neg logit contributions
theless-0.762
zens-0.754
negie-0.722
 indo-0.717
 Regulatory-0.707
Token not
Token ID407
Feature activation-3.65e-01

Pos logit contributions
theless+0.791
zens+0.787
 indo+0.758
negie+0.736
 Regulatory+0.729
Neg logit contributions
IDS-0.960
 Norn-0.919
 juggling-0.906
thumbnails-0.883
Thor-0.843
Token describe
Token ID6901
Feature activation-4.51e-01

Pos logit contributions
theless+1.435
zens+1.411
 indo+1.372
negie+1.341
 Regulatory+1.327
Neg logit contributions
IDS-1.514
 Norn-1.439
 juggling-1.420
thumbnails-1.371
Thor-1.313
Token you
Token ID345
Feature activation+8.29e-02

Pos logit contributions
theless+1.790
zens+1.726
 indo+1.698
 Regulatory+1.662
negie+1.656
Neg logit contributions
IDS-1.833
 Norn-1.725
 juggling-1.707
thumbnails-1.670
Thor-1.582
Token,
Token ID11
Feature activation-1.10e-01

Pos logit contributions
IDS+0.326
 Norn+0.312
 juggling+0.307
thumbnails+0.295
Thor+0.282
Neg logit contributions
theless-0.344
zens-0.336
 indo-0.328
negie-0.321
 Regulatory-0.318
Token don
Token ID836
Feature activation-5.55e-01

Pos logit contributions
theless+0.467
zens+0.459
 indo+0.442
negie+0.440
 Regulatory+0.434
Neg logit contributions
IDS-0.419
 Norn-0.403
 juggling-0.399
thumbnails-0.373
Thor-0.359
Token't
Token ID470
Feature activation-3.97e-01

Pos logit contributions
theless+1.777
zens+1.713
 indo+1.680
negie+1.628
 Regulatory+1.599
Neg logit contributions
IDS-2.730
 Norn-2.638
 juggling-2.604
thumbnails-2.510
Thor-2.438
Token dose
Token ID10742
Feature activation+3.28e-01

Pos logit contributions
theless+1.527
zens+1.499
 indo+1.440
negie+1.434
 Regulatory+1.408
Neg logit contributions
IDS-1.674
 Norn-1.619
 juggling-1.604
thumbnails-1.505
Thor-1.461
TokenHR
Token ID17184
Feature activation+2.41e-01

Pos logit contributions
theless+0.609
zens+0.607
 indo+0.583
negie+0.579
 Regulatory+0.575
Neg logit contributions
IDS-0.555
 Norn-0.542
 juggling-0.537
thumbnails-0.506
Thor-0.483
TokenP
Token ID47
Feature activation-1.65e-01

Pos logit contributions
IDS+1.017
 Norn+0.966
 juggling+0.948
thumbnails+0.885
Thor+0.874
Neg logit contributions
theless-0.954
zens-0.923
 indo-0.894
negie-0.884
terday-0.859
Token require
Token ID2421
Feature activation-4.06e-01

Pos logit contributions
theless+0.645
zens+0.629
 indo+0.608
negie+0.601
 Regulatory+0.590
Neg logit contributions
IDS-0.694
 Norn-0.666
 juggling-0.657
thumbnails-0.619
Thor-0.603
Token all
Token ID477
Feature activation-1.15e-01

Pos logit contributions
theless+1.676
zens+1.650
 indo+1.597
negie+1.578
 Regulatory+1.568
Neg logit contributions
IDS-1.594
 Norn-1.530
 juggling-1.508
thumbnails-1.437
Thor-1.379
Token people
Token ID661
Feature activation-2.19e-01

Pos logit contributions
theless+0.453
zens+0.445
 indo+0.426
negie+0.424
 Regulatory+0.414
Neg logit contributions
IDS-0.479
 Norn-0.462
 juggling-0.458
thumbnails-0.433
Thor-0.419
Token conducting
Token ID14523
Feature activation-5.15e-01

Pos logit contributions
theless+0.945
zens+0.930
 indo+0.901
negie+0.892
 Regulatory+0.883
Neg logit contributions
IDS-0.819
 Norn-0.784
 juggling-0.772
thumbnails-0.731
Thor-0.702
Token or
Token ID393
Feature activation-1.95e-01

Pos logit contributions
theless+2.014
zens+1.974
 indo+1.904
negie+1.886
 Regulatory+1.859
Neg logit contributions
IDS-2.144
 Norn-2.064
 juggling-2.039
thumbnails-1.934
Thor-1.869
Token overseeing
Token ID29852
Feature activation-1.20e-01

Pos logit contributions
theless+0.746
zens+0.729
 indo+0.699
negie+0.697
 Regulatory+0.678
Neg logit contributions
IDS-0.828
 juggling-0.806
 Norn-0.803
thumbnails-0.749
Thor-0.726
Token human
Token ID1692
Feature activation-4.48e-01

Pos logit contributions
theless+0.490
zens+0.477
 indo+0.459
negie+0.458
 Regulatory+0.445
Neg logit contributions
IDS-0.480
 Norn-0.462
 juggling-0.458
thumbnails-0.428
Thor-0.416
Token subjects
Token ID7481
Feature activation-3.06e-01

Pos logit contributions
theless+1.709
zens+1.676
 indo+1.612
negie+1.598
 Regulatory+1.571
Neg logit contributions
IDS-1.912
 Norn-1.842
 juggling-1.823
thumbnails-1.728
Thor-1.675
Token research
Token ID2267
Feature activation-2.66e-01

Pos logit contributions
theless+1.200
zens+1.178
 indo+1.136
negie+1.125
 Regulatory+1.110
Neg logit contributions
IDS-1.270
 Norn-1.222
 juggling-1.208
thumbnails-1.146
Thor-1.107