TOP ACTIVATIONS
MAX = 2.918

common problem , with the aim of taking back public space
H ike : Black Forest Trail - complete Date
participate do so with the aim of ensuring that it is
premiered ( along with the Instagram video of him going nuts
missions but Honour Among Thieves is a h oot from
sequels themselves . Band of Thieves feels a little dragged out
unit , had illegally collected personal location data without permission ,
criticism of their actions is anti - democratic .
born . TIME TRA VEL ER +
is merely a cover for anti - Muslim hatred . But
policy Americans for Prosper ity : Dist orting climate
sells all varieties with the intention of increased health and wellness
with machine guns , 221 anti - tank missiles and $
tract ors carried the Catalan pro - independence flag - the
a link back to your Instagram , Fac be ook Page
, is a small ash tr ay filled with kn itted
to be nothing more than personal dislike . When
which were released by the anti - abortion group Center for
Fore ster 's classic Horn bl ower saga , he began
campaign Americans for Prosper ity president Tim Phillips :

MIN ACTIVATIONS
MIN = -2.774

Dan Mer ica (@ dan meric a CNN ) October
r Gol ov ko , Ser hi y Kaz arov ,
Ze ke Miller (@ Ze ke J Miller ) May
s refinery business . Ser hi y Kaz arov is
ool , 49 Brend a McC ool , 49
s neo - Ott oman an vil fall hardest
Greater Cincinnati . Please try again soon , or contact Customer
today ruled one section of DOM A unconstitutional even though Obama
z us on Twitter @ dan han z us and check
Windows , Core Audio on OS X , and AL SA
- American push to overthrow Gaddafi and essentially shove Gaz prom
year , their relations with Gaddafi had s oured . The
stack frame | void * arg | + -------------------- --+ |
currency contracts . Ger athy , 60 , of
State Department cable described how Gaddafi had threat ened
the user . When the DOM is ready and Angular calculates
and demanding that their puppet regime be allowed to murder east
has occurred . Please try again later . View all New
has occurred . Please try again later . View all New
- shaped ep oxy resin scaff old that measures 0 .

INTERVAL 1.495 - 2.918
CONTAINS 0.232%

both emotions and perceptions as reasons , they are more sympathetic
:// . nice hash . com / api is deprecated and
in the territory . The head of the WHO 's global
would be the largest since vouchers were introduced in Milwaukee in
a brand by protecting your own brand start inhib iting consumer

INTERVAL 0.072 - 1.495
CONTAINS 42.553%

the emphasis will be on " hel ping students develop the
Campaign Committee W ants Some Of That Sweet , Sweet Koch
still largely defined by books she wrote 40 years ago ,
coaches are . He said [ ass istant coach ] Ju
be strong enough to offset the revenue losses .

INTERVAL -1.351 - 0.072
CONTAINS 56.842%

, Yuri Kh udi , and his sons found the remains
't mean back to learning . Under the reign of social
Miami Dolphins : DE Cameron Wake Contract :
{ i } " } and the do ..
the most useful things to know how to do on a

INTERVAL -2.774 - -1.351
CONTAINS 0.373%

from previous missions . The rock also resembles some unusual rocks
al s latest press tour , too , with
cz ynski , Marc R z ep cy ns ki ,
do X , Y and Z . Through this practice ,
just two minutes earlier when Danny Kane hit the home post
Token common
Token ID2219
Feature activation+8.77e-01

Pos logit contributions
brid+1.026
Honest+0.945
aught+0.921
oping+0.911
ール+0.898
Neg logit contributions
 Cipher-0.903
 dyn-0.853
 Axis-0.782
anke-0.774
SHIP-0.760
Token problem
Token ID1917
Feature activation-2.19e-01

Pos logit contributions
brid+3.305
Honest+3.052
aught+3.001
oping+2.971
ール+2.919
Neg logit contributions
 Cipher-2.500
 dyn-2.335
anke-2.153
 Axis-2.116
SHIP-2.090
Token,
Token ID11
Feature activation-3.80e-01

Pos logit contributions
 Cipher+0.612
 dyn+0.572
 Axis+0.526
anke+0.516
SHIP+0.509
Neg logit contributions
brid-0.843
Honest-0.787
aught-0.764
oping-0.755
ール-0.750
Token with
Token ID351
Feature activation-2.54e-01

Pos logit contributions
 Cipher+1.158
 dyn+1.093
 Axis+1.006
anke+0.983
SHIP+0.976
Neg logit contributions
brid-1.361
Honest-1.255
aught-1.224
oping-1.212
ール-1.203
Token the
Token ID262
Feature activation+1.23e-01

Pos logit contributions
 Cipher+0.806
 dyn+0.767
 Axis+0.709
anke+0.685
SHIP+0.677
Neg logit contributions
brid-0.863
Honest-0.799
aught-0.779
oping-0.774
ール-0.767
Token aim
Token ID4031
Feature activation+2.92e+00

Pos logit contributions
brid+0.419
Honest+0.386
aught+0.376
oping+0.373
ール+0.369
Neg logit contributions
 Cipher-0.401
 dyn-0.380
 Axis-0.351
anke-0.342
SHIP-0.337
Token of
Token ID286
Feature activation+1.26e+00

Pos logit contributions
brid+8.206
ingly+7.417
oping+7.409
aught+7.329
oted+7.008
Neg logit contributions
 Cipher-9.679
 dyn-8.922
anke-8.775
SHIP-8.469
 Axis-8.234
Token taking
Token ID2263
Feature activation+1.03e+00

Pos logit contributions
brid+4.237
Honest+3.873
aught+3.791
oping+3.768
oped+3.665
Neg logit contributions
 Cipher-4.060
 dyn-3.802
anke-3.572
 Axis-3.500
SHIP-3.486
Token back
Token ID736
Feature activation-3.82e-01

Pos logit contributions
brid+3.579
Honest+3.280
aught+3.204
oping+3.172
oped+3.105
Neg logit contributions
 Cipher-3.284
 dyn-3.093
anke-2.873
 Axis-2.837
SHIP-2.807
Token public
Token ID1171
Feature activation+3.24e-01

Pos logit contributions
 Cipher+1.167
 dyn+1.115
 Axis+1.014
anke+1.001
SHIP+0.992
Neg logit contributions
brid-1.320
Honest-1.246
aught-1.198
oping-1.194
ール-1.180
Token space
Token ID2272
Feature activation+4.52e-01

Pos logit contributions
brid+1.249
Honest+1.160
aught+1.131
oping+1.121
ール+1.105
Neg logit contributions
 Cipher-0.903
 dyn-0.849
anke-0.769
 Axis-0.769
SHIP-0.751
Token H
Token ID367
Feature activation+5.79e-01

Pos logit contributions
brid+0.511
Honest+0.467
aught+0.453
oping+0.448
ール+0.439
Neg logit contributions
 Cipher-0.574
 dyn-0.544
anke-0.506
 Axis-0.505
SHIP-0.496
Tokenike
Token ID522
Feature activation+6.58e-01

Pos logit contributions
brid+1.588
Honest+1.439
ール+1.363
aught+1.360
oping+1.335
Neg logit contributions
 Cipher-2.236
 dyn-2.129
anke-2.036
 Axis-1.994
SHIP-1.990
Token:
Token ID25
Feature activation+1.75e-01

Pos logit contributions
brid+2.301
Honest+2.119
aught+2.070
oping+2.045
ール+2.012
Neg logit contributions
 Cipher-2.084
 dyn-1.962
 Axis-1.809
anke-1.802
SHIP-1.768
Token Black
Token ID2619
Feature activation+1.98e-01

Pos logit contributions
brid+0.572
Honest+0.523
aught+0.509
oping+0.503
ール+0.494
Neg logit contributions
 Cipher-0.590
 dyn-0.558
anke-0.519
 Axis-0.515
SHIP-0.508
Token Forest
Token ID9115
Feature activation+3.44e-01

Pos logit contributions
brid+0.624
Honest+0.566
aught+0.552
oping+0.545
uts+0.534
Neg logit contributions
 Cipher-0.691
 dyn-0.657
anke-0.611
 Axis-0.607
SHIP-0.600
Token Trail
Token ID13069
Feature activation+2.69e+00

Pos logit contributions
brid+1.048
Honest+0.950
aught+0.923
oping+0.913
uts+0.894
Neg logit contributions
 Cipher-1.234
 dyn-1.171
anke-1.093
 Axis-1.086
SHIP-1.074
Token -
Token ID532
Feature activation+3.31e-01

Pos logit contributions
brid+7.495
oping+6.461
Honest+6.404
ingly+6.337
uts+6.331
Neg logit contributions
 Cipher-9.369
 dyn-8.914
anke-8.552
SHIP-8.271
 worsen-8.243
Token complete
Token ID1844
Feature activation+7.11e-01

Pos logit contributions
brid+1.069
Honest+0.977
aught+0.951
oping+0.940
ール+0.921
Neg logit contributions
 Cipher-1.134
 dyn-1.073
anke-0.996
 Axis-0.993
SHIP-0.977
Token\n
Token ID198
Feature activation+1.96e-02

Pos logit contributions
brid+2.460
Honest+2.241
aught+2.192
oping+2.184
uts+2.137
Neg logit contributions
 Cipher-2.252
 dyn-2.096
anke-1.974
 Axis-1.928
SHIP-1.923
Token\n
Token ID198
Feature activation-2.68e-01

Pos logit contributions
brid+0.069
Honest+0.064
aught+0.063
oping+0.062
ール+0.061
Neg logit contributions
 Cipher-0.060
 dyn-0.057
 Axis-0.053
anke-0.052
SHIP-0.051
TokenDate
Token ID10430
Feature activation-1.69e+00

Pos logit contributions
 Cipher+0.409
 dyn+0.361
anke+0.299
 Axis+0.296
SHIP+0.284
Neg logit contributions
brid-1.373
Honest-1.298
aught-1.278
oping-1.269
ール-1.254
Token participate
Token ID8277
Feature activation+3.44e-01

Pos logit contributions
 Cipher+0.036
 dyn+0.034
 Axis+0.031
anke+0.030
SHIP+0.030
Neg logit contributions
brid-0.051
Honest-0.048
aught-0.046
ール-0.046
uts-0.045
Token do
Token ID466
Feature activation-3.11e-01

Pos logit contributions
brid+1.242
Honest+1.145
aught+1.119
oping+1.111
oped+1.089
Neg logit contributions
 Cipher-1.045
 dyn-0.978
anke-0.902
 Axis-0.896
SHIP-0.884
Token so
Token ID523
Feature activation-2.98e-01

Pos logit contributions
 Cipher+0.920
 dyn+0.867
 Axis+0.793
anke+0.792
SHIP+0.777
Neg logit contributions
brid-1.141
Honest-1.059
aught-1.029
oping-1.018
ール-1.006
Token with
Token ID351
Feature activation-1.12e-01

Pos logit contributions
 Cipher+0.775
 dyn+0.736
 Axis+0.656
anke+0.650
SHIP+0.636
Neg logit contributions
brid-1.191
Honest-1.119
aught-1.088
ール-1.072
oping-1.070
Token the
Token ID262
Feature activation+2.99e-01

Pos logit contributions
 Cipher+0.360
 dyn+0.345
 Axis+0.317
anke+0.310
SHIP+0.307
Neg logit contributions
brid-0.378
Honest-0.348
aught-0.339
oping-0.335
ール-0.335
Token aim
Token ID4031
Feature activation+2.55e+00

Pos logit contributions
brid+0.924
Honest+0.840
aught+0.823
oping+0.808
ール+0.801
Neg logit contributions
 Cipher-1.062
 dyn-1.015
 Axis-0.940
anke-0.927
SHIP-0.915
Token of
Token ID286
Feature activation+1.20e+00

Pos logit contributions
brid+7.546
ingly+6.839
oping+6.676
aught+6.642
oted+6.480
Neg logit contributions
 Cipher-8.714
 dyn-7.796
anke-7.529
SHIP-7.492
 Axis-7.391
Token ensuring
Token ID13359
Feature activation+5.84e-01

Pos logit contributions
brid+4.174
Honest+3.817
oping+3.727
aught+3.722
ingly+3.621
Neg logit contributions
 Cipher-3.773
 dyn-3.508
anke-3.302
 Axis-3.247
SHIP-3.223
Token that
Token ID326
Feature activation-6.92e-02

Pos logit contributions
brid+1.879
Honest+1.719
aught+1.669
oping+1.650
ingly+1.620
Neg logit contributions
 Cipher-1.995
 dyn-1.880
anke-1.762
 Axis-1.744
SHIP-1.732
Token it
Token ID340
Feature activation+8.41e-02

Pos logit contributions
 Cipher+0.225
 dyn+0.214
 Axis+0.196
anke+0.195
SHIP+0.191
Neg logit contributions
brid-0.235
Honest-0.215
aught-0.211
oping-0.207
ール-0.205
Token is
Token ID318
Feature activation-8.97e-04

Pos logit contributions
brid+0.316
Honest+0.293
aught+0.285
oping+0.282
ール+0.279
Neg logit contributions
 Cipher-0.242
 dyn-0.228
anke-0.208
 Axis-0.207
SHIP-0.203
Token premiered
Token ID44119
Feature activation-2.23e-01

Pos logit contributions
brid+2.685
Honest+2.462
aught+2.425
oping+2.401
ingly+2.363
Neg logit contributions
 Cipher-2.316
 dyn-2.162
anke-2.021
 Axis-1.996
SHIP-1.971
Token (
Token ID357
Feature activation+5.71e-01

Pos logit contributions
 Cipher+0.661
 dyn+0.626
 Axis+0.569
anke+0.569
SHIP+0.555
Neg logit contributions
brid-0.819
Honest-0.759
aught-0.739
oping-0.731
ール-0.723
Tokenalong
Token ID24176
Feature activation+4.15e-01

Pos logit contributions
brid+1.692
Honest+1.533
aught+1.489
oping+1.470
ール+1.438
Neg logit contributions
 Cipher-2.103
 dyn-1.999
anke-1.867
 Axis-1.860
SHIP-1.835
Token with
Token ID351
Feature activation+2.45e-01

Pos logit contributions
brid+1.701
aught+1.574
Honest+1.573
oping+1.548
ingly+1.531
Neg logit contributions
 Cipher-1.037
 dyn-0.960
anke-0.883
SHIP-0.854
 Axis-0.846
Token the
Token ID262
Feature activation+4.79e-01

Pos logit contributions
brid+0.830
Honest+0.763
aught+0.743
oping+0.735
ール+0.722
Neg logit contributions
 Cipher-0.797
 dyn-0.753
anke-0.695
 Axis-0.693
SHIP-0.682
Token Instagram
Token ID10767
Feature activation+2.54e+00

Pos logit contributions
brid+1.606
Honest+1.470
aught+1.434
oping+1.418
ール+1.391
Neg logit contributions
 Cipher-1.577
 dyn-1.488
anke-1.382
 Axis-1.373
SHIP-1.354
Token video
Token ID2008
Feature activation+3.38e-01

Pos logit contributions
brid+8.603
Honest+7.630
aught+7.621
oping+7.547
ingly+7.545
Neg logit contributions
 Cipher-7.587
 dyn-6.984
anke-6.717
 Axis-6.642
SHIP-6.454
Token of
Token ID286
Feature activation+2.58e-02

Pos logit contributions
brid+1.227
Honest+1.136
aught+1.108
oping+1.097
ール+1.080
Neg logit contributions
 Cipher-1.020
 dyn-0.959
anke-0.880
 Axis-0.876
SHIP-0.860
Token him
Token ID683
Feature activation+5.52e-02

Pos logit contributions
brid+0.086
Honest+0.080
aught+0.077
oping+0.077
ール+0.075
Neg logit contributions
 Cipher-0.085
 dyn-0.080
 Axis-0.074
anke-0.074
SHIP-0.072
Token going
Token ID1016
Feature activation-1.23e-02

Pos logit contributions
brid+0.199
Honest+0.184
aught+0.179
oping+0.176
ール+0.175
Neg logit contributions
 Cipher-0.168
 dyn-0.159
anke-0.145
 Axis-0.145
SHIP-0.141
Token nuts
Token ID14380
Feature activation-1.02e+00

Pos logit contributions
 Cipher+0.039
 dyn+0.036
anke+0.033
 Axis+0.033
SHIP+0.033
Neg logit contributions
brid-0.044
Honest-0.040
aught-0.039
oping-0.039
ール-0.038
Token missions
Token ID10566
Feature activation+6.27e-01

Pos logit contributions
 Cipher+0.485
 dyn+0.450
 Axis+0.403
anke+0.401
SHIP+0.386
Neg logit contributions
brid-0.785
Honest-0.736
aught-0.718
oping-0.712
ール-0.705
Token
Token ID784
Feature activation+8.95e-02

Pos logit contributions
brid+2.132
Honest+1.932
aught+1.899
oping+1.882
uts+1.849
Neg logit contributions
 Cipher-2.036
 dyn-1.912
 Axis-1.771
anke-1.770
SHIP-1.751
Token but
Token ID475
Feature activation+6.97e-01

Pos logit contributions
brid+0.312
Honest+0.287
aught+0.280
oping+0.277
ール+0.272
Neg logit contributions
 Cipher-0.282
 dyn-0.266
anke-0.245
 Axis-0.244
SHIP-0.240
Token Honour
Token ID45129
Feature activation+5.83e-01

Pos logit contributions
brid+2.377
Honest+2.181
aught+2.119
oping+2.111
ingly+2.068
Neg logit contributions
 Cipher-2.237
 dyn-2.115
anke-1.965
 Axis-1.940
SHIP-1.933
Token Among
Token ID9754
Feature activation+2.00e+00

Pos logit contributions
brid+1.786
Honest+1.604
aught+1.571
oping+1.562
ingly+1.530
Neg logit contributions
 Cipher-2.066
 dyn-1.979
anke-1.848
SHIP-1.825
 Axis-1.824
Token Thieves
Token ID40392
Feature activation+2.51e+00

Pos logit contributions
brid+3.791
Honest+3.273
aught+3.082
oping+2.965
uts+2.958
Neg logit contributions
 Cipher-9.224
 dyn-8.941
anke-8.516
 Axis-8.437
SHIP-8.412
Token is
Token ID318
Feature activation+7.20e-01

Pos logit contributions
brid+8.421
oping+7.589
aught+7.550
Honest+7.542
ingly+7.453
Neg logit contributions
 Cipher-7.495
 dyn-6.906
anke-6.437
 Axis-6.412
SHIP-6.387
Token a
Token ID257
Feature activation+6.05e-01

Pos logit contributions
brid+2.457
Honest+2.254
aught+2.202
oping+2.182
ingly+2.141
Neg logit contributions
 Cipher-2.315
 dyn-2.164
anke-2.018
 Axis-2.000
SHIP-1.983
Token h
Token ID289
Feature activation+1.04e+00

Pos logit contributions
brid+2.071
Honest+1.894
aught+1.852
oping+1.836
oped+1.796
Neg logit contributions
 Cipher-1.952
 dyn-1.830
anke-1.707
 Axis-1.693
SHIP-1.665
Tokenoot
Token ID1025
Feature activation+6.41e-01

Pos logit contributions
brid+3.027
Honest+2.722
aught+2.699
oping+2.676
uts+2.618
Neg logit contributions
 Cipher-3.912
 dyn-3.694
 Axis-3.439
anke-3.403
SHIP-3.396
Token from
Token ID422
Feature activation+3.12e-01

Pos logit contributions
brid+2.256
Honest+2.058
aught+2.021
oping+2.001
ingly+1.976
Neg logit contributions
 Cipher-2.012
 dyn-1.887
anke-1.738
 Axis-1.734
SHIP-1.712
Token sequels
Token ID45176
Feature activation+9.64e-01

Pos logit contributions
brid+1.521
Honest+1.400
aught+1.366
oping+1.353
uts+1.329
Neg logit contributions
 Cipher-1.319
 dyn-1.240
anke-1.145
 Axis-1.137
SHIP-1.121
Token themselves
Token ID2405
Feature activation+9.05e-01

Pos logit contributions
brid+3.340
Honest+3.064
aught+2.978
oping+2.965
ingly+2.924
Neg logit contributions
 Cipher-3.015
 dyn-2.849
anke-2.672
 Axis-2.589
SHIP-2.583
Token.
Token ID13
Feature activation-7.21e-02

Pos logit contributions
brid+2.947
Honest+2.665
aught+2.620
oping+2.584
ingly+2.559
Neg logit contributions
 Cipher-3.047
 dyn-2.846
anke-2.668
 Axis-2.633
SHIP-2.628
Token Band
Token ID10243
Feature activation-3.96e-01

Pos logit contributions
 Cipher+0.231
 dyn+0.216
 Axis+0.201
anke+0.197
SHIP+0.194
Neg logit contributions
brid-0.250
Honest-0.229
aught-0.225
oping-0.223
oped-0.219
Token of
Token ID286
Feature activation+1.34e-01

Pos logit contributions
 Cipher+1.158
 dyn+1.102
anke+1.013
 Axis+0.998
SHIP+0.988
Neg logit contributions
brid-1.475
Honest-1.339
aught-1.320
oping-1.320
ingly-1.305
Token Thieves
Token ID40392
Feature activation+2.50e+00

Pos logit contributions
brid+0.375
Honest+0.333
aught+0.322
oping+0.321
ール+0.317
Neg logit contributions
 Cipher-0.528
 dyn-0.498
 Axis-0.468
anke-0.463
SHIP-0.455
Token feels
Token ID5300
Feature activation+1.23e+00

Pos logit contributions
brid+8.186
Honest+7.334
aught+7.316
oping+7.256
ingly+7.148
Neg logit contributions
 Cipher-7.609
 dyn-7.047
anke-6.731
 Axis-6.509
SHIP-6.489
Token a
Token ID257
Feature activation-1.23e-01

Pos logit contributions
brid+4.186
Honest+3.806
aught+3.723
ingly+3.694
oping+3.687
Neg logit contributions
 Cipher-3.913
 dyn-3.605
anke-3.415
SHIP-3.364
 Axis-3.359
Token little
Token ID1310
Feature activation-4.58e-01

Pos logit contributions
 Cipher+0.246
 dyn+0.229
 Axis+0.196
anke+0.195
SHIP+0.191
Neg logit contributions
brid-0.567
Honest-0.537
aught-0.526
oping-0.520
ール-0.515
Token dragged
Token ID17901
Feature activation-5.75e-01

Pos logit contributions
 Cipher+1.352
 dyn+1.288
 Axis+1.166
anke+1.159
SHIP+1.133
Neg logit contributions
brid-1.680
Honest-1.567
aught-1.533
oping-1.509
ール-1.495
Token out
Token ID503
Feature activation-6.95e-01

Pos logit contributions
 Cipher+1.689
 dyn+1.604
anke+1.473
 Axis+1.465
SHIP+1.434
Neg logit contributions
brid-2.097
Honest-1.963
aught-1.901
oping-1.876
ール-1.848
Token unit
Token ID4326
Feature activation-4.79e-02

Pos logit contributions
 Cipher+1.832
 dyn+1.747
 Axis+1.593
anke+1.547
SHIP+1.522
Neg logit contributions
brid-2.149
Honest-1.994
aught-1.934
ール-1.913
oping-1.911
Token,
Token ID11
Feature activation-5.14e-01

Pos logit contributions
 Cipher+0.160
 dyn+0.151
anke+0.141
 Axis+0.139
SHIP+0.138
Neg logit contributions
brid-0.158
Honest-0.144
aught-0.141
oping-0.140
ingly-0.137
Token had
Token ID550
Feature activation+5.97e-02

Pos logit contributions
 Cipher+1.421
 dyn+1.345
 Axis+1.214
anke+1.193
SHIP+1.168
Neg logit contributions
brid-1.988
Honest-1.850
aught-1.805
ール-1.775
oping-1.766
Token illegally
Token ID15572
Feature activation+8.68e-01

Pos logit contributions
brid+0.217
Honest+0.201
aught+0.196
oping+0.194
ール+0.191
Neg logit contributions
 Cipher-0.180
 dyn-0.169
anke-0.155
 Axis-0.155
SHIP-0.151
Token collected
Token ID7723
Feature activation+9.67e-01

Pos logit contributions
brid+3.108
Honest+2.845
aught+2.788
oping+2.773
oped+2.728
Neg logit contributions
 Cipher-2.656
 dyn-2.473
anke-2.307
 Axis-2.278
SHIP-2.256
Token personal
Token ID2614
Feature activation+2.47e+00

Pos logit contributions
brid+3.207
Honest+2.921
aught+2.851
oping+2.840
ingly+2.796
Neg logit contributions
 Cipher-3.170
 dyn-2.980
anke-2.839
SHIP-2.746
 Axis-2.740
Token location
Token ID4067
Feature activation+1.04e+00

Pos logit contributions
brid+6.245
ingly+5.494
oping+5.456
Honest+5.394
oted+5.364
Neg logit contributions
 Cipher-9.360
anke-8.883
 dyn-8.721
SHIP-8.508
 worsen-8.326
Token data
Token ID1366
Feature activation+4.72e-01

Pos logit contributions
brid+3.209
Honest+2.877
aught+2.818
oping+2.813
ingly+2.761
Neg logit contributions
 Cipher-3.689
 dyn-3.463
anke-3.305
SHIP-3.231
 Axis-3.218
Token without
Token ID1231
Feature activation+8.12e-01

Pos logit contributions
brid+1.715
Honest+1.582
aught+1.546
oping+1.531
ール+1.504
Neg logit contributions
 Cipher-1.422
 dyn-1.334
anke-1.228
 Axis-1.221
SHIP-1.200
Token permission
Token ID7170
Feature activation+9.59e-01

Pos logit contributions
brid+2.589
Honest+2.361
aught+2.296
oping+2.279
oped+2.224
Neg logit contributions
 Cipher-2.792
 dyn-2.645
anke-2.466
 Axis-2.446
SHIP-2.418
Token,
Token ID11
Feature activation+3.62e-01

Pos logit contributions
brid+3.266
Honest+2.960
oping+2.920
aught+2.901
ingly+2.873
Neg logit contributions
 Cipher-3.056
 dyn-2.855
anke-2.684
 Axis-2.647
SHIP-2.621
Token criticism
Token ID7734
Feature activation+3.05e-01

Pos logit contributions
 Cipher+2.107
 dyn+2.000
 Axis+1.830
anke+1.799
SHIP+1.781
Neg logit contributions
brid-2.347
Honest-2.151
aught-2.113
ール-2.071
oping-2.064
Token of
Token ID286
Feature activation+2.93e-01

Pos logit contributions
brid+1.135
Honest+1.043
aught+1.027
oping+1.019
ingly+1.003
Neg logit contributions
 Cipher-0.895
 dyn-0.829
anke-0.765
 Axis-0.757
SHIP-0.745
Token their
Token ID511
Feature activation+1.08e-01

Pos logit contributions
brid+1.006
Honest+0.924
aught+0.902
oping+0.892
ール+0.876
Neg logit contributions
 Cipher-0.943
 dyn-0.890
anke-0.821
 Axis-0.819
SHIP-0.805
Token actions
Token ID4028
Feature activation+2.26e-01

Pos logit contributions
brid+0.394
Honest+0.364
aught+0.356
oping+0.351
ール+0.346
Neg logit contributions
 Cipher-0.323
 dyn-0.305
 Axis-0.278
anke-0.277
SHIP-0.273
Token is
Token ID318
Feature activation+2.08e-01

Pos logit contributions
brid+0.834
Honest+0.769
aught+0.755
oping+0.748
uts+0.733
Neg logit contributions
 Cipher-0.666
 dyn-0.622
anke-0.571
 Axis-0.568
SHIP-0.559
Token anti
Token ID3098
Feature activation+2.47e+00

Pos logit contributions
brid+0.713
Honest+0.655
aught+0.639
oping+0.632
ール+0.620
Neg logit contributions
 Cipher-0.669
 dyn-0.631
anke-0.583
 Axis-0.580
SHIP-0.571
Token-
Token ID12
Feature activation+9.16e-01

Pos logit contributions
brid+5.827
ingly+5.165
oted+5.149
oping+5.148
oped+5.015
Neg logit contributions
 Cipher-9.725
 dyn-9.108
anke-8.936
 worsen-8.424
SHIP-8.380
Tokendemocratic
Token ID41232
Feature activation+1.95e-01

Pos logit contributions
brid+2.558
Honest+2.303
aught+2.214
oping+2.182
uts+2.139
Neg logit contributions
 Cipher-3.552
 dyn-3.373
anke-3.170
 Axis-3.152
SHIP-3.091
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
brid+0.684
Honest+0.630
aught+0.614
oping+0.609
ingly+0.597
Neg logit contributions
 Cipher-0.611
 dyn-0.574
anke-0.531
 Axis-0.527
SHIP-0.520
Token\n
Token ID198
Feature activation-1.45e-01

Pos logit contributions
brid+0.376
Honest+0.342
aught+0.337
oping+0.334
oped+0.327
Neg logit contributions
 Cipher-0.362
 dyn-0.337
 Axis-0.315
anke-0.312
SHIP-0.308
Token\n
Token ID198
Feature activation-1.49e-02

Pos logit contributions
 Cipher+0.430
 dyn+0.406
 Axis+0.372
anke+0.367
SHIP+0.366
Neg logit contributions
brid-0.525
Honest-0.480
aught-0.475
oping-0.473
oppers-0.465
Token born
Token ID4642
Feature activation-1.83e+00

Pos logit contributions
brid+0.080
Honest+0.074
aught+0.072
oping+0.071
ール+0.070
Neg logit contributions
 Cipher-0.070
 dyn-0.066
anke-0.061
 Axis-0.061
SHIP-0.060
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
 Cipher+4.813
 dyn+4.541
 Axis+4.206
SHIP+3.949
anke+3.913
Neg logit contributions
brid-7.037
Honest-6.495
aught-6.285
oping-6.247
ール-6.204
Token\n
Token ID198
Feature activation+7.00e-02

Pos logit contributions
brid+0.366
Honest+0.334
aught+0.324
ール+0.315
oping+0.314
Neg logit contributions
 Cipher-0.440
 dyn-0.425
anke-0.399
SHIP-0.388
 Axis-0.386
Token\n
Token ID198
Feature activation+1.36e-01

Pos logit contributions
brid+0.249
Honest+0.228
aught+0.224
oping+0.223
ール+0.219
Neg logit contributions
 Cipher-0.215
 dyn-0.204
 Axis-0.187
SHIP-0.185
anke-0.185
TokenTIME
Token ID34694
Feature activation-4.66e-01

Pos logit contributions
brid+0.449
Honest+0.411
aught+0.400
oping+0.396
ール+0.388
Neg logit contributions
 Cipher-0.456
 dyn-0.432
anke-0.400
 Axis-0.399
SHIP-0.393
Token TRA
Token ID29125
Feature activation+2.46e+00

Pos logit contributions
 Cipher+1.597
 dyn+1.501
 Axis+1.404
SHIP+1.395
anke+1.380
Neg logit contributions
brid-1.505
Honest-1.379
aught-1.340
oping-1.324
ール-1.301
TokenVEL
Token ID18697
Feature activation+1.03e+00

Pos logit contributions
brid+6.899
Honest+6.230
uts+5.969
oping+5.828
aught+5.780
Neg logit contributions
 Cipher-8.911
 dyn-8.407
anke-7.942
 Axis-7.597
 worsen-7.541
TokenER
Token ID1137
Feature activation+4.39e-01

Pos logit contributions
brid+3.312
Honest+3.038
aught+2.952
oping+2.928
ingly+2.915
Neg logit contributions
 Cipher-3.422
 dyn-3.264
anke-3.063
 Axis-2.968
 worsen-2.923
Token\n
Token ID198
Feature activation+3.56e-02

Pos logit contributions
brid+1.261
Honest+1.151
aught+1.111
oping+1.096
uts+1.086
Neg logit contributions
 Cipher-1.613
 dyn-1.549
anke-1.463
 Axis-1.424
 worsen-1.403
Token\n
Token ID198
Feature activation-3.24e-01

Pos logit contributions
brid+0.127
Honest+0.116
aught+0.114
oping+0.114
ール+0.112
Neg logit contributions
 Cipher-0.108
 dyn-0.103
 Axis-0.094
anke-0.093
SHIP-0.093
Token+
Token ID10
Feature activation-2.06e-01

Pos logit contributions
 Cipher+1.032
 dyn+0.975
anke+0.899
 Axis+0.891
SHIP+0.866
Neg logit contributions
brid-1.130
Honest-1.047
aught-1.014
oping-1.000
uts-0.983
Token is
Token ID318
Feature activation+3.02e-01

Pos logit contributions
brid+0.313
Honest+0.290
aught+0.283
oping+0.280
ール+0.275
Neg logit contributions
 Cipher-0.259
 dyn-0.244
anke-0.223
 Axis-0.222
SHIP-0.218
Token merely
Token ID6974
Feature activation-2.63e-01

Pos logit contributions
brid+1.053
Honest+0.969
aught+0.946
oping+0.935
uts+0.918
Neg logit contributions
 Cipher-0.954
 dyn-0.895
anke-0.829
 Axis-0.824
SHIP-0.813
Token a
Token ID257
Feature activation+4.84e-01

Pos logit contributions
 Cipher+0.824
 dyn+0.786
anke+0.719
 Axis+0.717
SHIP+0.699
Neg logit contributions
brid-0.920
Honest-0.847
aught-0.827
oping-0.817
ール-0.806
Token cover
Token ID3002
Feature activation+6.47e-01

Pos logit contributions
brid+1.788
Honest+1.647
aught+1.608
oping+1.600
oped+1.569
Neg logit contributions
 Cipher-1.431
 dyn-1.333
anke-1.231
 Axis-1.223
SHIP-1.204
Token for
Token ID329
Feature activation+8.57e-01

Pos logit contributions
brid+2.229
Honest+2.029
aught+1.997
oping+1.976
ingly+1.951
Neg logit contributions
 Cipher-2.067
 dyn-1.937
anke-1.794
 Axis-1.785
SHIP-1.752
Token anti
Token ID3098
Feature activation+2.44e+00

Pos logit contributions
brid+2.947
Honest+2.699
aught+2.616
oping+2.606
uts+2.566
Neg logit contributions
 Cipher-2.701
 dyn-2.495
anke-2.358
 Axis-2.312
SHIP-2.309
Token-
Token ID12
Feature activation+6.25e-01

Pos logit contributions
brid+6.138
oping+5.379
ingly+5.270
oted+5.241
oped+5.204
Neg logit contributions
 Cipher-9.430
 dyn-8.702
anke-8.469
SHIP-8.233
 Axis-8.095
TokenMuslim
Token ID17067
Feature activation+5.55e-02

Pos logit contributions
brid+1.816
Honest+1.634
aught+1.583
oping+1.565
uts+1.531
Neg logit contributions
 Cipher-2.346
 dyn-2.223
anke-2.084
 Axis-2.075
SHIP-2.044
Token hatred
Token ID13763
Feature activation+6.02e-01

Pos logit contributions
brid+0.194
Honest+0.179
aught+0.175
oping+0.172
ール+0.170
Neg logit contributions
 Cipher-0.174
 dyn-0.166
 Axis-0.152
anke-0.151
SHIP-0.148
Token.
Token ID13
Feature activation+1.38e-01

Pos logit contributions
brid+2.058
Honest+1.876
aught+1.833
oping+1.817
ingly+1.785
Neg logit contributions
 Cipher-1.952
 dyn-1.831
anke-1.699
 Axis-1.686
SHIP-1.659
Token But
Token ID887
Feature activation+6.49e-01

Pos logit contributions
brid+0.459
Honest+0.417
aught+0.410
oping+0.408
oped+0.399
Neg logit contributions
 Cipher-0.458
 dyn-0.431
 Axis-0.402
anke-0.398
SHIP-0.392
Token policy
Token ID2450
Feature activation-1.04e+00

Pos logit contributions
 Cipher+2.300
 dyn+2.190
 Axis+1.991
anke+1.986
SHIP+1.939
Neg logit contributions
brid-2.779
Honest-2.551
aught-2.502
ール-2.436
oping-2.429
Token\n
Token ID198
Feature activation-4.56e-01

Pos logit contributions
 Cipher+3.230
 dyn+3.081
anke+2.828
 Axis+2.808
SHIP+2.716
Neg logit contributions
brid-3.640
Honest-3.344
aught-3.228
oping-3.207
ール-3.182
Token\n
Token ID198
Feature activation+4.12e-03

Pos logit contributions
 Cipher+1.298
 dyn+1.230
 Axis+1.119
anke+1.110
SHIP+1.103
Neg logit contributions
brid-1.695
Honest-1.548
aught-1.539
oping-1.532
oppers-1.514
TokenAmericans
Token ID17636
Feature activation+5.89e-01

Pos logit contributions
brid+0.015
Honest+0.014
aught+0.014
oping+0.014
ール+0.014
Neg logit contributions
 Cipher-0.012
 dyn-0.011
anke-0.010
 Axis-0.010
SHIP-0.010
Token for
Token ID329
Feature activation+1.02e+00

Pos logit contributions
brid+2.077
Honest+1.910
aught+1.870
oping+1.860
oped+1.821
Neg logit contributions
 Cipher-1.825
 dyn-1.716
anke-1.585
 Axis-1.568
SHIP-1.560
Token Prosper
Token ID41342
Feature activation+2.44e+00

Pos logit contributions
brid+3.460
Honest+3.211
aught+3.136
oping+3.072
uts+3.047
Neg logit contributions
 Cipher-3.219
 dyn-3.036
anke-2.839
 Axis-2.804
SHIP-2.778
Tokenity
Token ID414
Feature activation+1.18e+00

Pos logit contributions
brid+7.998
Honest+7.321
aught+7.127
oping+7.095
uts+6.980
Neg logit contributions
 Cipher-8.037
 dyn-7.612
anke-7.056
 Axis-6.955
SHIP-6.935
Token:
Token ID25
Feature activation+5.70e-01

Pos logit contributions
brid+3.963
Honest+3.646
oping+3.583
aught+3.574
uts+3.501
Neg logit contributions
 Cipher-3.709
 dyn-3.515
anke-3.270
 Axis-3.196
SHIP-3.184
Token Dist
Token ID4307
Feature activation-1.59e-02

Pos logit contributions
brid+1.978
Honest+1.825
aught+1.778
oping+1.769
uts+1.740
Neg logit contributions
 Cipher-1.769
 dyn-1.673
anke-1.550
 Axis-1.529
SHIP-1.519
Tokenorting
Token ID24707
Feature activation-1.64e-01

Pos logit contributions
 Cipher+0.061
 dyn+0.059
anke+0.055
 Axis+0.055
SHIP+0.054
Neg logit contributions
brid-0.044
Honest-0.040
aught-0.039
oping-0.038
uts-0.037
Token climate
Token ID4258
Feature activation-7.16e-01

Pos logit contributions
 Cipher+0.534
 dyn+0.506
 Axis+0.466
anke+0.464
SHIP+0.455
Neg logit contributions
brid-0.554
Honest-0.508
aught-0.495
oping-0.489
ール-0.482
Token sells
Token ID16015
Feature activation+5.79e-01

Pos logit contributions
brid+1.388
Honest+1.282
aught+1.251
oping+1.233
ール+1.219
Neg logit contributions
 Cipher-1.130
 dyn-1.064
anke-0.974
 Axis-0.971
SHIP-0.950
Token all
Token ID477
Feature activation+1.53e+00

Pos logit contributions
brid+2.011
Honest+1.849
aught+1.804
oping+1.784
uts+1.751
Neg logit contributions
 Cipher-1.836
 dyn-1.730
anke-1.599
 Axis-1.590
SHIP-1.567
Token varieties
Token ID20328
Feature activation+2.03e-01

Pos logit contributions
brid+5.281
Honest+4.841
aught+4.724
oping+4.696
oped+4.630
Neg logit contributions
 Cipher-4.723
 dyn-4.428
anke-4.157
SHIP-4.098
 Axis-4.022
Token with
Token ID351
Feature activation+5.31e-01

Pos logit contributions
brid+0.724
Honest+0.668
aught+0.651
oping+0.645
ール+0.634
Neg logit contributions
 Cipher-0.627
 dyn-0.590
anke-0.543
 Axis-0.541
SHIP-0.531
Token the
Token ID262
Feature activation+8.03e-01

Pos logit contributions
brid+1.790
Honest+1.643
aught+1.601
oping+1.583
ール+1.554
Neg logit contributions
 Cipher-1.738
 dyn-1.641
anke-1.518
 Axis-1.512
SHIP-1.488
Token intention
Token ID6778
Feature activation+2.43e+00

Pos logit contributions
brid+2.524
Honest+2.303
aught+2.239
oping+2.207
uts+2.163
Neg logit contributions
 Cipher-2.804
 dyn-2.655
anke-2.482
 Axis-2.460
SHIP-2.436
Token of
Token ID286
Feature activation+1.49e+00

Pos logit contributions
brid+7.241
ingly+6.562
aught+6.547
oping+6.533
Honest+6.448
Neg logit contributions
 Cipher-7.994
 dyn-7.329
anke-7.075
 Axis-6.783
SHIP-6.694
Token increased
Token ID3220
Feature activation+7.30e-01

Pos logit contributions
brid+5.081
Honest+4.667
aught+4.569
oping+4.554
ingly+4.424
Neg logit contributions
 Cipher-4.670
 dyn-4.335
anke-4.070
 Axis-4.027
SHIP-4.005
Token health
Token ID1535
Feature activation+1.24e+00

Pos logit contributions
brid+2.513
Honest+2.309
aught+2.249
oping+2.229
uts+2.188
Neg logit contributions
 Cipher-2.328
 dyn-2.189
anke-2.033
 Axis-2.016
SHIP-1.991
Token and
Token ID290
Feature activation+5.93e-01

Pos logit contributions
brid+4.043
Honest+3.657
aught+3.619
oping+3.598
ingly+3.517
Neg logit contributions
 Cipher-4.202
 dyn-3.871
SHIP-3.632
anke-3.621
 Axis-3.598
Token wellness
Token ID42506
Feature activation+1.29e+00

Pos logit contributions
brid+2.056
Honest+1.889
aught+1.842
oping+1.826
oped+1.788
Neg logit contributions
 Cipher-1.885
 dyn-1.761
anke-1.638
 Axis-1.626
SHIP-1.609
Token with
Token ID351
Feature activation+2.98e-01

Pos logit contributions
brid+2.387
Honest+2.172
aught+2.136
oping+2.119
oped+2.075
Neg logit contributions
 Cipher-2.284
 dyn-2.140
anke-2.004
 Axis-1.972
SHIP-1.955
Token machine
Token ID4572
Feature activation+6.68e-01

Pos logit contributions
brid+1.026
Honest+0.942
aught+0.920
oping+0.910
uts+0.893
Neg logit contributions
 Cipher-0.951
 dyn-0.896
anke-0.830
 Axis-0.823
SHIP-0.812
Token guns
Token ID6541
Feature activation+4.65e-01

Pos logit contributions
brid+1.858
Honest+1.639
aught+1.600
oping+1.581
ール+1.550
Neg logit contributions
 Cipher-2.602
 dyn-2.456
anke-2.327
 Axis-2.291
SHIP-2.285
Token,
Token ID11
Feature activation+4.23e-01

Pos logit contributions
brid+1.590
Honest+1.453
aught+1.426
oping+1.411
uts+1.384
Neg logit contributions
 Cipher-1.500
 dyn-1.407
anke-1.309
 Axis-1.297
SHIP-1.278
Token 221
Token ID31566
Feature activation+1.29e-02

Pos logit contributions
brid+1.458
Honest+1.339
aught+1.312
oping+1.299
uts+1.276
Neg logit contributions
 Cipher-1.338
 dyn-1.261
anke-1.175
 Axis-1.154
SHIP-1.146
Token anti
Token ID3098
Feature activation+2.43e+00

Pos logit contributions
brid+0.046
Honest+0.042
aught+0.041
oping+0.041
ール+0.040
Neg logit contributions
 Cipher-0.040
 dyn-0.038
anke-0.035
 Axis-0.035
SHIP-0.034
Token-
Token ID12
Feature activation+3.51e-01

Pos logit contributions
brid+5.760
uts+5.109
ingly+5.008
oping+4.993
oped+4.982
Neg logit contributions
 Cipher-9.299
 dyn-8.875
anke-8.538
SHIP-8.248
 worsen-8.007
Tokentank
Token ID28451
Feature activation+4.75e-01

Pos logit contributions
brid+1.067
Honest+0.969
aught+0.942
oping+0.930
ール+0.911
Neg logit contributions
 Cipher-1.262
 dyn-1.198
anke-1.117
 Axis-1.113
SHIP-1.097
Token missiles
Token ID12798
Feature activation+5.28e-01

Pos logit contributions
brid+1.484
Honest+1.353
aught+1.314
oping+1.298
ール+1.273
Neg logit contributions
 Cipher-1.671
 dyn-1.585
anke-1.473
 Axis-1.470
SHIP-1.447
Token and
Token ID290
Feature activation-1.76e-01

Pos logit contributions
brid+1.730
Honest+1.572
aught+1.547
oping+1.530
uts+1.505
Neg logit contributions
 Cipher-1.770
 dyn-1.653
anke-1.550
 Axis-1.530
SHIP-1.523
Token $
Token ID720
Feature activation-5.52e-01

Pos logit contributions
 Cipher+0.573
 dyn+0.541
 Axis+0.501
anke+0.495
SHIP+0.489
Neg logit contributions
brid-0.599
Honest-0.552
aught-0.536
oping-0.529
ール-0.523
Token tract
Token ID14998
Feature activation+8.40e-01

Pos logit contributions
brid+0.742
Honest+0.689
aught+0.670
oping+0.663
ール+0.656
Neg logit contributions
 Cipher-0.591
 dyn-0.562
 Axis-0.512
anke-0.507
SHIP-0.497
Tokenors
Token ID669
Feature activation+2.70e-01

Pos logit contributions
brid+2.373
Honest+2.145
aught+2.073
oping+2.041
ール+2.003
Neg logit contributions
 Cipher-3.204
 dyn-3.058
anke-2.859
 Axis-2.854
SHIP-2.812
Token carried
Token ID5281
Feature activation+1.42e+00

Pos logit contributions
brid+0.630
Honest+0.541
oping+0.540
aught+0.537
oped+0.526
Neg logit contributions
 Cipher-1.166
 dyn-1.108
anke-1.056
 Axis-1.037
SHIP-1.032
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
brid+3.925
oping+3.433
ingly+3.433
aught+3.422
Honest+3.401
Neg logit contributions
 Cipher-5.211
 dyn-4.994
anke-4.852
SHIP-4.631
 Axis-4.576
Token Catalan
Token ID31066
Feature activation+1.27e+00

Pos logit contributions
brid+3.479
Honest+3.024
aught+3.010
oping+2.990
oted+2.969
Neg logit contributions
 Cipher-4.476
 dyn-4.226
anke-4.096
SHIP-4.016
 Axis-3.925
Token pro
Token ID386
Feature activation+2.42e+00

Pos logit contributions
brid+3.558
oping+3.126
Honest+3.123
aught+3.075
oped+3.038
Neg logit contributions
 Cipher-4.721
 dyn-4.390
anke-4.285
SHIP-4.176
 Axis-4.127
Token-
Token ID12
Feature activation+4.93e-01

Pos logit contributions
brid+6.673
Honest+5.941
aught+5.699
oping+5.664
ingly+5.654
Neg logit contributions
 Cipher-9.163
 dyn-8.554
anke-8.134
 Axis-7.965
SHIP-7.895
Tokenindependence
Token ID39894
Feature activation+2.15e-01

Pos logit contributions
brid+1.604
Honest+1.472
aught+1.428
oping+1.425
ール+1.419
Neg logit contributions
 Cipher-1.644
 dyn-1.571
anke-1.461
SHIP-1.449
 Axis-1.449
Token flag
Token ID6056
Feature activation+4.51e-01

Pos logit contributions
brid+0.854
Honest+0.794
aught+0.773
oping+0.762
ール+0.756
Neg logit contributions
 Cipher-0.581
 dyn-0.551
 Axis-0.497
anke-0.486
SHIP-0.480
Token -
Token ID532
Feature activation+3.41e-01

Pos logit contributions
brid+1.430
Honest+1.286
aught+1.266
oping+1.264
oped+1.236
Neg logit contributions
 Cipher-1.559
 dyn-1.456
anke-1.392
 Axis-1.349
SHIP-1.343
Tokenthe
Token ID1169
Feature activation+3.47e-01

Pos logit contributions
brid+1.295
Honest+1.198
aught+1.184
oping+1.176
oped+1.163
Neg logit contributions
 Cipher-0.963
 dyn-0.873
anke-0.843
SHIP-0.816
 Axis-0.790
Token a
Token ID257
Feature activation+5.23e-01

Pos logit contributions
 Cipher+0.544
 dyn+0.507
 Axis+0.455
anke+0.454
SHIP+0.442
Neg logit contributions
brid-0.861
Honest-0.806
aught-0.786
oping-0.779
ール-0.769
Token link
Token ID2792
Feature activation+7.50e-01

Pos logit contributions
brid+1.897
Honest+1.751
aught+1.710
oping+1.693
ール+1.664
Neg logit contributions
 Cipher-1.575
 dyn-1.480
anke-1.360
 Axis-1.353
SHIP-1.330
Token back
Token ID736
Feature activation+4.79e-01

Pos logit contributions
brid+2.539
Honest+2.311
aught+2.272
oping+2.256
oped+2.206
Neg logit contributions
 Cipher-2.438
 dyn-2.285
anke-2.127
 Axis-2.114
SHIP-2.079
Token to
Token ID284
Feature activation+4.53e-01

Pos logit contributions
brid+1.641
Honest+1.505
aught+1.470
oping+1.454
ール+1.426
Neg logit contributions
 Cipher-1.539
 dyn-1.451
anke-1.341
 Axis-1.335
SHIP-1.315
Token your
Token ID534
Feature activation+2.99e-01

Pos logit contributions
brid+1.436
Honest+1.306
aught+1.269
oping+1.257
oped+1.233
Neg logit contributions
 Cipher-1.563
 dyn-1.481
anke-1.385
 Axis-1.368
SHIP-1.355
Token Instagram
Token ID10767
Feature activation+2.40e+00

Pos logit contributions
brid+1.173
Honest+1.092
aught+1.067
oping+1.057
ール+1.042
Neg logit contributions
 Cipher-0.815
 dyn-0.762
anke-0.690
 Axis-0.689
SHIP-0.674
Token,
Token ID11
Feature activation-2.27e-02

Pos logit contributions
brid+7.555
Honest+6.723
aught+6.715
oping+6.628
oted+6.536
Neg logit contributions
 Cipher-7.728
 dyn-7.255
anke-6.778
 Axis-6.776
SHIP-6.743
Token Fac
Token ID13585
Feature activation-3.64e-01

Pos logit contributions
 Cipher+0.073
 dyn+0.068
 Axis+0.063
anke+0.063
SHIP+0.061
Neg logit contributions
brid-0.078
Honest-0.072
aught-0.070
oping-0.070
ール-0.068
Tokenbe
Token ID1350
Feature activation+1.53e+00

Pos logit contributions
 Cipher+1.133
 dyn+1.074
anke+0.990
 Axis+0.983
SHIP+0.970
Neg logit contributions
brid-1.266
Honest-1.174
aught-1.141
oping-1.125
ール-1.107
Tokenook
Token ID566
Feature activation+1.64e-01

Pos logit contributions
brid+4.587
Honest+4.129
aught+4.002
oping+3.966
uts+3.940
Neg logit contributions
 Cipher-5.428
 dyn-5.163
 Axis-4.816
anke-4.813
SHIP-4.763
Token Page
Token ID7873
Feature activation-1.82e-01

Pos logit contributions
brid+0.556
Honest+0.513
aught+0.499
oping+0.494
ール+0.485
Neg logit contributions
 Cipher-0.529
 dyn-0.500
anke-0.461
 Axis-0.460
SHIP-0.452
Token,
Token ID11
Feature activation-1.90e-02

Pos logit contributions
 Cipher+0.639
 dyn+0.606
anke+0.558
 Axis+0.558
SHIP+0.545
Neg logit contributions
brid-0.701
Honest-0.648
aught-0.630
oping-0.621
ール-0.613
Token is
Token ID318
Feature activation+3.00e-01

Pos logit contributions
 Cipher+0.055
 dyn+0.052
anke+0.047
 Axis+0.047
SHIP+0.046
Neg logit contributions
brid-0.071
Honest-0.066
aught-0.064
oping-0.064
ール-0.063
Token a
Token ID257
Feature activation+4.28e-01

Pos logit contributions
brid+1.054
Honest+0.971
aught+0.949
oping+0.938
uts+0.920
Neg logit contributions
 Cipher-0.937
 dyn-0.877
anke-0.814
 Axis-0.806
SHIP-0.795
Token small
Token ID1402
Feature activation+3.97e-01

Pos logit contributions
brid+1.511
Honest+1.389
aught+1.358
oping+1.344
ール+1.319
Neg logit contributions
 Cipher-1.329
 dyn-1.247
anke-1.155
 Axis-1.146
SHIP-1.128
Token ash
Token ID12530
Feature activation+1.44e-01

Pos logit contributions
brid+1.411
Honest+1.297
aught+1.269
oping+1.256
ingly+1.232
Neg logit contributions
 Cipher-1.222
 dyn-1.146
anke-1.061
 Axis-1.051
SHIP-1.036
Tokentr
Token ID2213
Feature activation+2.38e+00

Pos logit contributions
brid+0.276
Honest+0.241
aught+0.226
oping+0.222
ール+0.216
Neg logit contributions
 Cipher-0.680
 dyn-0.655
 Axis-0.620
anke-0.618
SHIP-0.613
Tokenay
Token ID323
Feature activation+7.45e-01

Pos logit contributions
brid+6.027
aught+5.798
uts+5.639
oping+5.554
ingly+5.417
Neg logit contributions
 Cipher-9.056
 dyn-8.578
SHIP-8.293
 Axis-7.994
 worsen-7.817
Token filled
Token ID5901
Feature activation+3.71e-02

Pos logit contributions
brid+2.646
Honest+2.415
aught+2.373
oping+2.353
ingly+2.314
Neg logit contributions
 Cipher-2.304
 dyn-2.150
anke-2.000
 Axis-1.979
SHIP-1.951
Token with
Token ID351
Feature activation+2.13e-01

Pos logit contributions
brid+0.155
Honest+0.144
aught+0.143
oping+0.142
ingly+0.142
Neg logit contributions
 Cipher-0.091
 dyn-0.084
anke-0.076
 Axis-0.075
SHIP-0.074
Token kn
Token ID638
Feature activation+5.91e-01

Pos logit contributions
brid+0.751
Honest+0.692
aught+0.675
oping+0.668
ール+0.656
Neg logit contributions
 Cipher-0.665
 dyn-0.626
anke-0.577
 Axis-0.574
SHIP-0.564
Tokenitted
Token ID2175
Feature activation+1.69e-01

Pos logit contributions
brid+1.822
aught+1.644
Honest+1.641
oping+1.635
uts+1.604
Neg logit contributions
 Cipher-2.107
 dyn-1.971
 Axis-1.837
anke-1.829
SHIP-1.824
Token to
Token ID284
Feature activation+4.95e-01

Pos logit contributions
brid+1.231
Honest+1.128
aught+1.099
oping+1.087
ール+1.067
Neg logit contributions
 Cipher-1.227
 dyn-1.161
anke-1.074
 Axis-1.071
SHIP-1.053
Token be
Token ID307
Feature activation+3.72e-01

Pos logit contributions
brid+1.825
Honest+1.686
aught+1.647
oping+1.635
ingly+1.613
Neg logit contributions
 Cipher-1.463
 dyn-1.355
anke-1.257
 Axis-1.241
SHIP-1.231
Token nothing
Token ID2147
Feature activation+3.89e-01

Pos logit contributions
brid+1.289
Honest+1.185
aught+1.156
oping+1.144
ール+1.123
Neg logit contributions
 Cipher-1.182
 dyn-1.114
anke-1.029
 Axis-1.024
SHIP-1.007
Token more
Token ID517
Feature activation+4.88e-01

Pos logit contributions
brid+1.514
Honest+1.409
aught+1.376
oping+1.364
ール+1.344
Neg logit contributions
 Cipher-1.067
 dyn-0.999
anke-0.906
 Axis-0.904
SHIP-0.884
Token than
Token ID621
Feature activation+2.10e-01

Pos logit contributions
brid+1.365
Honest+1.230
aught+1.191
oping+1.174
uts+1.150
Neg logit contributions
 Cipher-1.880
 dyn-1.779
anke-1.676
 Axis-1.663
SHIP-1.650
Token personal
Token ID2614
Feature activation+2.37e+00

Pos logit contributions
brid+0.731
Honest+0.672
aught+0.656
oping+0.649
ール+0.637
Neg logit contributions
 Cipher-0.665
 dyn-0.627
anke-0.578
 Axis-0.576
SHIP-0.566
Token dislike
Token ID23109
Feature activation+6.79e-01

Pos logit contributions
brid+7.735
Honest+7.155
oping+7.012
oted+6.987
aught+6.971
Neg logit contributions
 Cipher-7.235
 dyn-6.555
anke-6.547
SHIP-6.353
 Axis-6.143
Token.
Token ID13
Feature activation+2.78e-01

Pos logit contributions
brid+2.442
Honest+2.240
aught+2.192
oping+2.170
ingly+2.134
Neg logit contributions
 Cipher-2.076
 dyn-1.944
anke-1.799
 Axis-1.781
SHIP-1.744
Token
Token ID198
Feature activation+1.23e-01

Pos logit contributions
brid+0.916
Honest+0.830
aught+0.818
oping+0.809
oped+0.794
Neg logit contributions
 Cipher-0.941
 dyn-0.885
 Axis-0.821
anke-0.819
SHIP-0.806
Token
Token ID198
Feature activation+2.10e-01

Pos logit contributions
brid+0.436
Honest+0.398
aught+0.392
oping+0.389
oped+0.383
Neg logit contributions
 Cipher-0.376
 dyn-0.358
 Axis-0.327
anke-0.326
SHIP-0.323
TokenWhen
Token ID2215
Feature activation-3.94e-01

Pos logit contributions
brid+0.692
aught+0.623
Honest+0.620
oping+0.615
ール+0.604
Neg logit contributions
 Cipher-0.694
 dyn-0.655
anke-0.614
SHIP-0.613
 Axis-0.601
Token which
Token ID543
Feature activation-3.13e-02

Pos logit contributions
 Cipher+1.043
 dyn+0.985
 Axis+0.905
anke+0.902
SHIP+0.876
Neg logit contributions
brid-1.129
Honest-1.034
aught-1.006
oping-0.993
ール-0.984
Token were
Token ID547
Feature activation-5.55e-02

Pos logit contributions
 Cipher+0.090
 dyn+0.084
 Axis+0.077
anke+0.076
SHIP+0.075
Neg logit contributions
brid-0.119
Honest-0.110
aught-0.106
ール-0.105
oping-0.105
Token released
Token ID2716
Feature activation-1.13e-01

Pos logit contributions
 Cipher+0.180
 dyn+0.171
 Axis+0.157
anke+0.156
SHIP+0.153
Neg logit contributions
brid-0.189
Honest-0.174
aught-0.167
ール-0.167
oping-0.166
Token by
Token ID416
Feature activation+1.03e+00

Pos logit contributions
 Cipher+0.326
 dyn+0.307
 Axis+0.279
anke+0.279
SHIP+0.272
Neg logit contributions
brid-0.422
Honest-0.393
aught-0.381
oping-0.377
ール-0.372
Token the
Token ID262
Feature activation+5.46e-01

Pos logit contributions
brid+3.113
Honest+2.847
aught+2.775
oping+2.752
ingly+2.677
Neg logit contributions
 Cipher-3.620
 dyn-3.476
anke-3.303
SHIP-3.186
 Axis-3.184
Token anti
Token ID3098
Feature activation+2.36e+00

Pos logit contributions
brid+1.831
Honest+1.679
aught+1.637
oping+1.620
ール+1.586
Neg logit contributions
 Cipher-1.790
 dyn-1.692
anke-1.570
 Axis-1.557
SHIP-1.536
Token-
Token ID12
Feature activation+4.73e-01

Pos logit contributions
brid+6.658
Honest+6.019
oping+5.819
aught+5.806
ingly+5.719
Neg logit contributions
 Cipher-8.778
 dyn-8.264
anke-7.830
 Axis-7.641
SHIP-7.530
Tokenabortion
Token ID32396
Feature activation+1.86e-01

Pos logit contributions
brid+0.951
Honest+0.821
aught+0.783
oping+0.767
ール+0.743
Neg logit contributions
 Cipher-2.186
 dyn-2.099
anke-1.991
 Axis-1.986
SHIP-1.968
Token group
Token ID1448
Feature activation+3.10e-01

Pos logit contributions
brid+0.720
Honest+0.667
aught+0.652
oping+0.646
ール+0.637
Neg logit contributions
 Cipher-0.520
 dyn-0.486
anke-0.442
 Axis-0.442
SHIP-0.432
Token Center
Token ID3337
Feature activation+5.18e-01

Pos logit contributions
brid+0.968
Honest+0.878
aught+0.853
oping+0.842
ール+0.830
Neg logit contributions
 Cipher-1.099
 dyn-1.040
 Axis-0.970
anke-0.962
SHIP-0.952
Token for
Token ID329
Feature activation+8.34e-01

Pos logit contributions
brid+1.826
Honest+1.682
aught+1.641
oping+1.624
ール+1.595
Neg logit contributions
 Cipher-1.617
 dyn-1.524
anke-1.405
 Axis-1.396
SHIP-1.374
Token Fore
Token ID4558
Feature activation+2.08e-02

Pos logit contributions
 Cipher+1.632
 dyn+1.534
 Axis+1.409
anke+1.404
SHIP+1.378
Neg logit contributions
brid-1.676
Honest-1.538
aught-1.489
oping-1.477
ール-1.452
Tokenster
Token ID1706
Feature activation-1.08e-01

Pos logit contributions
brid+0.048
Honest+0.044
aught+0.041
oping+0.041
ール+0.040
Neg logit contributions
 Cipher-0.089
 dyn-0.085
anke-0.081
 Axis-0.080
SHIP-0.079
Token's
Token ID338
Feature activation-1.92e-01

Pos logit contributions
 Cipher+0.343
 dyn+0.323
anke+0.297
 Axis+0.296
SHIP+0.291
Neg logit contributions
brid-0.378
Honest-0.348
aught-0.339
oping-0.335
ール-0.330
Token classic
Token ID6833
Feature activation-6.77e-01

Pos logit contributions
 Cipher+0.619
 dyn+0.586
 Axis+0.539
anke+0.536
SHIP+0.526
Neg logit contributions
brid-0.658
Honest-0.606
aught-0.590
oping-0.584
ール-0.574
Token Horn
Token ID13549
Feature activation+4.15e-02

Pos logit contributions
 Cipher+2.157
 dyn+2.046
 Axis+1.890
anke+1.864
SHIP+1.842
Neg logit contributions
brid-2.321
Honest-2.145
aught-2.085
oping-2.058
ール-2.023
Tokenbl
Token ID2436
Feature activation+2.36e+00

Pos logit contributions
brid+0.130
Honest+0.119
aught+0.115
oping+0.114
ール+0.113
Neg logit contributions
 Cipher-0.146
 dyn-0.138
anke-0.128
 Axis-0.128
SHIP-0.126
Tokenower
Token ID789
Feature activation+5.14e-01

Pos logit contributions
brid+4.106
oping+3.650
aught+3.574
uts+3.538
oped+3.469
Neg logit contributions
 Cipher-11.098
 dyn-10.687
 Axis-10.220
SHIP-10.127
 worsen-9.907
Token saga
Token ID26784
Feature activation-3.91e-01

Pos logit contributions
brid+1.748
Honest+1.602
aught+1.562
oping+1.548
uts+1.520
Neg logit contributions
 Cipher-1.659
 dyn-1.567
anke-1.454
 Axis-1.442
SHIP-1.423
Token,
Token ID11
Feature activation-2.21e-01

Pos logit contributions
 Cipher+1.218
 dyn+1.151
 Axis+1.055
anke+1.050
SHIP+1.034
Neg logit contributions
brid-1.376
Honest-1.273
aught-1.241
oping-1.224
ール-1.208
Token he
Token ID339
Feature activation+3.32e-01

Pos logit contributions
 Cipher+0.761
 dyn+0.713
 Axis+0.665
anke+0.659
SHIP+0.645
Neg logit contributions
brid-0.713
Honest-0.653
aught-0.634
oping-0.624
ール-0.618
Token began
Token ID2540
Feature activation-4.57e-01

Pos logit contributions
brid+1.230
Honest+1.139
aught+1.111
oping+1.099
ール+1.083
Neg logit contributions
 Cipher-0.975
 dyn-0.916
anke-0.837
 Axis-0.834
SHIP-0.818
Token campaign
Token ID1923
Feature activation-1.13e-01

Pos logit contributions
 Cipher+1.359
 dyn+1.291
 Axis+1.181
anke+1.169
SHIP+1.151
Neg logit contributions
brid-1.561
Honest-1.445
aught-1.407
oping-1.380
ール-1.369
Token
Token ID198
Feature activation-4.02e-01

Pos logit contributions
 Cipher+0.363
 dyn+0.342
anke+0.316
 Axis+0.314
SHIP+0.310
Neg logit contributions
brid-0.386
Honest-0.354
aught-0.346
oping-0.343
ール-0.336
Token
Token ID198
Feature activation+3.45e-02

Pos logit contributions
 Cipher+1.148
 dyn+1.093
 Axis+0.995
anke+0.983
SHIP+0.974
Neg logit contributions
brid-1.485
Honest-1.360
aught-1.347
oping-1.341
oppers-1.325
TokenAmericans
Token ID17636
Feature activation+2.82e-01

Pos logit contributions
brid+0.122
Honest+0.112
aught+0.110
oping+0.109
ール+0.107
Neg logit contributions
 Cipher-0.106
 dyn-0.100
anke-0.092
 Axis-0.092
SHIP-0.091
Token for
Token ID329
Feature activation+9.41e-01

Pos logit contributions
brid+0.959
Honest+0.882
aught+0.862
oping+0.857
uts+0.841
Neg logit contributions
 Cipher-0.909
 dyn-0.857
anke-0.794
 Axis-0.785
SHIP-0.783
Token Prosper
Token ID41342
Feature activation+2.36e+00

Pos logit contributions
brid+3.209
Honest+2.959
aught+2.888
oping+2.837
uts+2.809
Neg logit contributions
 Cipher-2.993
 dyn-2.816
anke-2.634
 Axis-2.595
SHIP-2.593
Tokenity
Token ID414
Feature activation+7.73e-01

Pos logit contributions
brid+7.623
Honest+6.987
aught+6.839
oping+6.792
ingly+6.687
Neg logit contributions
 Cipher-7.835
 dyn-7.344
 Axis-6.786
anke-6.732
SHIP-6.646
Token president
Token ID1893
Feature activation+1.00e+00

Pos logit contributions
brid+2.576
Honest+2.371
aught+2.310
oping+2.304
uts+2.257
Neg logit contributions
 Cipher-2.504
 dyn-2.374
anke-2.202
 Axis-2.170
SHIP-2.150
Token Tim
Token ID5045
Feature activation-1.81e-01

Pos logit contributions
brid+3.365
Honest+3.106
aught+3.052
oping+3.033
ingly+2.988
Neg logit contributions
 Cipher-3.185
 dyn-2.985
anke-2.786
 Axis-2.737
SHIP-2.707
Token Phillips
Token ID17630
Feature activation-2.41e-01

Pos logit contributions
 Cipher+0.541
 dyn+0.506
anke+0.465
 Axis+0.463
SHIP+0.449
Neg logit contributions
brid-0.663
Honest-0.615
aught-0.600
oping-0.593
ール-0.585
Token:
Token ID25
Feature activation-4.49e-02

Pos logit contributions
 Cipher+0.746
 dyn+0.702
anke+0.647
 Axis+0.645
SHIP+0.631
Neg logit contributions
brid-0.857
Honest-0.790
aught-0.767
oping-0.758
ール-0.751
Token
Token ID851
Feature activation-9.04e-02

Pos logit contributions
 Cipher+1.039
 dyn+0.972
 Axis+0.909
anke+0.897
SHIP+0.878
Neg logit contributions
brid-1.121
Honest-1.022
oping-1.005
aught-1.000
oped-0.983
Token Dan
Token ID6035
Feature activation-1.46e+00

Pos logit contributions
 Cipher+0.302
 dyn+0.285
 Axis+0.264
anke+0.262
SHIP+0.257
Neg logit contributions
brid-0.300
Honest-0.275
aught-0.269
oping-0.265
ール-0.260
Token Mer
Token ID4638
Feature activation-1.22e-01

Pos logit contributions
 Cipher+4.804
 dyn+4.495
anke+4.167
 Axis+4.149
SHIP+3.909
Neg logit contributions
brid-4.617
Honest-4.422
oping-4.200
aught-4.118
oped-4.093
Tokenica
Token ID3970
Feature activation+3.61e-01

Pos logit contributions
 Cipher+0.438
 dyn+0.416
anke+0.390
 Axis+0.386
SHIP+0.381
Neg logit contributions
brid-0.370
Honest-0.340
aught-0.328
oping-0.324
ール-0.319
Token (@
Token ID4275
Feature activation+3.76e-01

Pos logit contributions
brid+1.039
Honest+0.938
aught+0.916
oping+0.903
uts+0.885
Neg logit contributions
 Cipher-1.355
 dyn-1.291
anke-1.203
 Axis-1.191
SHIP-1.187
Tokendan
Token ID25604
Feature activation-2.77e+00

Pos logit contributions
brid+1.140
Honest+1.034
aught+1.001
oping+0.989
ール+0.968
Neg logit contributions
 Cipher-1.361
 dyn-1.290
 Axis-1.204
anke-1.200
SHIP-1.182
Tokenmeric
Token ID946
Feature activation-9.67e-01

Pos logit contributions
 Cipher+9.042
 dyn+8.803
anke+8.704
SHIP+8.282
 worsen+7.815
Neg logit contributions
Honest-7.147
brid-6.943
uts-6.672
aught-6.522
ール-6.489
Tokena
Token ID64
Feature activation+1.80e-01

Pos logit contributions
 Cipher+2.352
anke+2.199
 dyn+2.194
 Axis+1.922
SHIP+1.903
Neg logit contributions
brid-3.877
Honest-3.755
ingly-3.565
aught-3.544
ール-3.528
TokenCNN
Token ID18474
Feature activation-1.40e+00

Pos logit contributions
brid+0.645
Honest+0.592
aught+0.575
oping+0.575
oped+0.563
Neg logit contributions
 Cipher-0.559
 dyn-0.522
 Axis-0.481
anke-0.481
SHIP-0.468
Token)
Token ID8
Feature activation-4.62e-01

Pos logit contributions
 Cipher+2.944
 dyn+2.824
anke+2.520
SHIP+2.484
 Axis+2.439
Neg logit contributions
brid-5.787
Honest-5.455
oppers-5.445
aught-5.414
oping-5.405
Token October
Token ID3267
Feature activation-9.19e-02

Pos logit contributions
 Cipher+1.561
 dyn+1.468
 Axis+1.367
SHIP+1.316
anke+1.300
Neg logit contributions
brid-1.527
aught-1.375
Honest-1.375
oping-1.349
oped-1.331
Tokenr
Token ID81
Feature activation+4.78e-01

Pos logit contributions
brid+2.444
Honest+2.130
aught+2.100
oping+2.083
uts+2.030
Neg logit contributions
 Cipher-3.422
 dyn-3.240
anke-3.039
 Axis-3.037
SHIP-3.020
Token Gol
Token ID26849
Feature activation-7.54e-01

Pos logit contributions
brid+1.658
Honest+1.512
aught+1.483
oping+1.475
uts+1.451
Neg logit contributions
 Cipher-1.511
 dyn-1.421
anke-1.318
 Axis-1.307
SHIP-1.303
Tokenov
Token ID709
Feature activation+2.05e-01

Pos logit contributions
 Cipher+2.865
 dyn+2.735
anke+2.635
 Axis+2.523
SHIP+2.483
Neg logit contributions
brid-2.040
Honest-1.936
ール-1.798
aught-1.767
oping-1.715
Tokenko
Token ID7204
Feature activation+2.79e-01

Pos logit contributions
brid+0.673
Honest+0.615
aught+0.600
oping+0.593
ール+0.581
Neg logit contributions
 Cipher-0.686
 dyn-0.649
anke-0.602
 Axis-0.600
SHIP-0.590
Token,
Token ID11
Feature activation-1.46e-01

Pos logit contributions
brid+0.911
Honest+0.832
aught+0.811
oping+0.803
uts+0.786
Neg logit contributions
 Cipher-0.940
 dyn-0.888
anke-0.825
 Axis-0.821
SHIP-0.809
Token Ser
Token ID2930
Feature activation-2.76e+00

Pos logit contributions
 Cipher+0.454
 dyn+0.428
anke+0.392
 Axis+0.392
SHIP+0.384
Neg logit contributions
brid-0.517
Honest-0.477
aught-0.465
oping-0.460
ール-0.453
Tokenhi
Token ID5303
Feature activation-5.72e-01

Pos logit contributions
 Cipher+6.657
 dyn+6.451
anke+6.339
SHIP+5.665
 Axis+5.601
Neg logit contributions
brid-9.641
Honest-9.517
ール-9.134
aught-8.900
oping-8.813
Tokeny
Token ID88
Feature activation-8.72e-01

Pos logit contributions
 Cipher+2.590
 dyn+2.486
anke+2.354
 Axis+2.348
SHIP+2.318
Neg logit contributions
brid-1.204
Honest-1.050
aught-1.002
oping-0.982
ール-0.953
Token Kaz
Token ID16385
Feature activation-1.14e+00

Pos logit contributions
 Cipher+2.564
 dyn+2.408
anke+2.204
 Axis+2.196
SHIP+2.151
Neg logit contributions
brid-3.226
Honest-2.986
aught-2.915
oping-2.884
ール-2.840
Tokenarov
Token ID42737
Feature activation+1.96e-02

Pos logit contributions
 Cipher+3.313
 dyn+3.112
anke+2.872
 Axis+2.819
SHIP+2.788
Neg logit contributions
brid-4.147
Honest-3.907
aught-3.755
ール-3.723
oping-3.691
Token,
Token ID11
Feature activation-1.67e-01

Pos logit contributions
brid+0.065
Honest+0.059
aught+0.058
oping+0.057
uts+0.056
Neg logit contributions
 Cipher-0.066
 dyn-0.062
anke-0.058
 Axis-0.057
SHIP-0.057
Token
Token ID851
Feature activation+1.12e-01

Pos logit contributions
 Cipher+0.657
 dyn+0.619
anke+0.569
 Axis+0.567
SHIP+0.557
Neg logit contributions
brid-0.754
Honest-0.695
aught-0.678
oping-0.671
ール-0.660
Token Ze
Token ID9033
Feature activation-1.48e+00

Pos logit contributions
brid+0.372
Honest+0.340
aught+0.332
oping+0.328
ール+0.322
Neg logit contributions
 Cipher-0.372
 dyn-0.352
 Axis-0.325
anke-0.325
SHIP-0.319
Tokenke
Token ID365
Feature activation-6.24e-01

Pos logit contributions
 Cipher+4.828
 dyn+4.550
anke+4.305
 Axis+4.192
SHIP+4.039
Neg logit contributions
brid-4.727
Honest-4.424
aught-4.235
oping-4.197
ール-4.151
Token Miller
Token ID7920
Feature activation-5.11e-01

Pos logit contributions
 Cipher+1.582
 dyn+1.469
 Axis+1.321
anke+1.312
SHIP+1.275
Neg logit contributions
brid-2.564
Honest-2.394
aught-2.343
oping-2.320
uts-2.284
Token (@
Token ID4275
Feature activation+3.50e-01

Pos logit contributions
 Cipher+1.869
 dyn+1.776
anke+1.659
 Axis+1.652
SHIP+1.630
Neg logit contributions
brid-1.526
Honest-1.384
aught-1.344
oping-1.326
ール-1.298
TokenZe
Token ID36056
Feature activation-2.55e+00

Pos logit contributions
brid+1.026
Honest+0.929
aught+0.898
oping+0.888
ール+0.867
Neg logit contributions
 Cipher-1.304
 dyn-1.236
 Axis-1.154
anke-1.151
SHIP-1.136
Tokenke
Token ID365
Feature activation-1.43e+00

Pos logit contributions
 Cipher+7.362
anke+6.984
 dyn+6.949
 Axis+6.302
SHIP+6.243
Neg logit contributions
brid-8.232
Honest-7.909
ール-7.569
aught-7.475
spection-7.427
TokenJ
Token ID41
Feature activation-5.91e-01

Pos logit contributions
 Cipher+4.513
 dyn+4.304
anke+4.009
SHIP+3.962
 Axis+3.946
Neg logit contributions
brid-4.652
Honest-4.232
aught-4.227
oping-4.156
ール-4.128
TokenMiller
Token ID33253
Feature activation-1.20e+00

Pos logit contributions
 Cipher+2.103
 dyn+2.010
anke+1.891
 Axis+1.864
SHIP+1.837
Neg logit contributions
brid-1.789
Honest-1.617
aught-1.572
oping-1.549
ール-1.547
Token)
Token ID8
Feature activation-2.49e-01

Pos logit contributions
 Cipher+3.100
 dyn+2.971
SHIP+2.700
anke+2.681
 Axis+2.674
Neg logit contributions
brid-4.614
Honest-4.290
aught-4.235
ール-4.187
oping-4.172
Token May
Token ID1737
Feature activation-2.36e-01

Pos logit contributions
 Cipher+0.822
 dyn+0.774
 Axis+0.726
SHIP+0.689
anke+0.685
Neg logit contributions
brid-0.842
aught-0.765
Honest-0.762
oping-0.750
oppers-0.742
Token
Token ID247
Feature activation-2.42e-01

Pos logit contributions
 Cipher+1.645
 dyn+1.563
anke+1.469
 Axis+1.463
SHIP+1.453
Neg logit contributions
brid-1.098
Honest-0.980
aught-0.944
oping-0.937
ール-0.921
Tokens
Token ID82
Feature activation-3.72e-01

Pos logit contributions
 Cipher+0.777
 dyn+0.732
 Axis+0.676
anke+0.674
SHIP+0.666
Neg logit contributions
brid-0.832
Honest-0.764
aught-0.747
oping-0.736
ール-0.727
Token refinery
Token ID44133
Feature activation-1.05e+00

Pos logit contributions
 Cipher+1.152
 dyn+1.092
 Axis+1.001
anke+0.982
SHIP+0.968
Neg logit contributions
brid-1.315
Honest-1.214
aught-1.193
oping-1.171
ール-1.161
Token business
Token ID1597
Feature activation-1.31e-01

Pos logit contributions
 Cipher+3.089
 dyn+2.965
 Axis+2.686
anke+2.620
SHIP+2.596
Neg logit contributions
brid-3.844
Honest-3.547
aught-3.468
oping-3.430
ール-3.415
Token.
Token ID13
Feature activation-2.53e-01

Pos logit contributions
 Cipher+0.410
 dyn+0.386
anke+0.356
 Axis+0.354
SHIP+0.348
Neg logit contributions
brid-0.460
Honest-0.423
aught-0.413
oping-0.409
ール-0.401
Token Ser
Token ID2930
Feature activation-2.46e+00

Pos logit contributions
 Cipher+0.856
 dyn+0.808
 Axis+0.749
anke+0.748
SHIP+0.735
Neg logit contributions
brid-0.826
Honest-0.755
aught-0.736
oping-0.727
ール-0.713
Tokenhi
Token ID5303
Feature activation-5.00e-01

Pos logit contributions
 Cipher+4.549
 dyn+4.431
anke+4.349
SHIP+3.993
 Axis+3.695
Neg logit contributions
brid-10.176
Honest-9.840
ール-9.762
oping-9.408
oped-9.384
Tokeny
Token ID88
Feature activation-8.88e-01

Pos logit contributions
 Cipher+1.979
 dyn+1.920
anke+1.802
 Axis+1.771
SHIP+1.703
Neg logit contributions
brid-1.242
Honest-1.181
ール-1.110
aught-1.097
oping-1.069
Token Kaz
Token ID16385
Feature activation-8.32e-01

Pos logit contributions
 Cipher+2.498
 dyn+2.324
anke+2.128
 Axis+2.113
SHIP+2.086
Neg logit contributions
brid-3.413
Honest-3.161
aught-3.096
oping-3.072
uts-3.021
Tokenarov
Token ID42737
Feature activation+8.56e-02

Pos logit contributions
 Cipher+2.077
 dyn+1.946
anke+1.808
SHIP+1.726
 Axis+1.691
Neg logit contributions
brid-3.246
Honest-3.165
ール-3.046
aught-2.965
spection-2.941
Token is
Token ID318
Feature activation-3.56e-01

Pos logit contributions
brid+0.312
Honest+0.287
aught+0.281
oping+0.279
uts+0.273
Neg logit contributions
 Cipher-0.257
 dyn-0.241
anke-0.222
 Axis-0.220
SHIP-0.217
Tokenool
Token ID970
Feature activation+1.71e-01

Pos logit contributions
 Cipher+1.013
 dyn+0.967
anke+0.908
 Axis+0.892
SHIP+0.878
Neg logit contributions
brid-0.894
Honest-0.818
ール-0.778
aught-0.777
oping-0.776
Token,
Token ID11
Feature activation+4.00e-01

Pos logit contributions
brid+0.570
Honest+0.522
aught+0.511
oping+0.505
ール+0.494
Neg logit contributions
 Cipher-0.567
 dyn-0.536
anke-0.497
 Axis-0.495
SHIP-0.488
Token 49
Token ID5125
Feature activation-5.59e-01

Pos logit contributions
brid+1.230
Honest+1.119
aught+1.085
oping+1.072
ール+1.053
Neg logit contributions
 Cipher-1.441
 dyn-1.368
 Axis-1.273
anke-1.263
SHIP-1.243
Token
Token ID198
Feature activation+2.07e-03

Pos logit contributions
 Cipher+2.004
 dyn+1.903
anke+1.773
 Axis+1.755
SHIP+1.741
Neg logit contributions
brid-1.723
Honest-1.559
aught-1.531
oping-1.512
oped-1.482
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
brid+0.007
Honest+0.007
aught+0.007
oping+0.007
oped+0.006
Neg logit contributions
 Cipher-0.006
 dyn-0.006
 Axis-0.006
anke-0.005
SHIP-0.005
TokenBrend
Token ID48015
Feature activation-2.41e+00

Pos logit contributions
 Cipher+0.460
 dyn+0.431
anke+0.398
SHIP+0.395
 Axis+0.394
Neg logit contributions
brid-0.542
Honest-0.496
aught-0.487
oping-0.484
uts-0.476
Tokena
Token ID64
Feature activation-6.80e-01

Pos logit contributions
 Cipher+4.746
anke+4.328
 dyn+4.249
atio+3.660
ula+3.554
Neg logit contributions
brid-10.012
Honest-9.713
spection-9.441
ール-9.235
uts-8.978
Token McC
Token ID5108
Feature activation-3.61e-01

Pos logit contributions
 Cipher+2.111
 dyn+1.978
anke+1.818
 Axis+1.811
SHIP+1.782
Neg logit contributions
brid-2.408
Honest-2.222
aught-2.146
oping-2.120
uts-2.095
Tokenool
Token ID970
Feature activation-4.59e-01

Pos logit contributions
 Cipher+0.723
 dyn+0.675
anke+0.601
 Axis+0.569
SHIP+0.542
Neg logit contributions
brid-1.623
Honest-1.553
ール-1.489
oping-1.464
spection-1.464
Token,
Token ID11
Feature activation+3.55e-02

Pos logit contributions
 Cipher+1.483
 dyn+1.404
anke+1.294
 Axis+1.286
SHIP+1.269
Neg logit contributions
brid-1.551
Honest-1.433
aught-1.374
oping-1.358
ール-1.339
Token 49
Token ID5125
Feature activation-8.92e-01

Pos logit contributions
brid+0.111
Honest+0.101
aught+0.098
oping+0.097
uts+0.095
Neg logit contributions
 Cipher-0.125
 dyn-0.119
 Axis-0.110
anke-0.110
SHIP-0.108
Token
Token ID447
Feature activation-2.46e-01

Pos logit contributions
 Cipher+0.939
 dyn+0.890
 Axis+0.817
anke+0.810
SHIP+0.781
Neg logit contributions
brid-1.137
Honest-1.053
aught-1.023
oping-1.013
ール-1.001
Token
Token ID247
Feature activation-4.97e-03

Pos logit contributions
 Cipher+1.005
 dyn+0.963
anke+0.906
 Axis+0.903
SHIP+0.898
Neg logit contributions
brid-0.628
Honest-0.556
aught-0.537
oping-0.533
oped-0.515
Tokens
Token ID82
Feature activation-3.10e-01

Pos logit contributions
 Cipher+0.014
 dyn+0.013
anke+0.012
 Axis+0.012
SHIP+0.012
Neg logit contributions
brid-0.019
Honest-0.017
aught-0.017
oping-0.017
ール-0.016
Token neo
Token ID19102
Feature activation+6.14e-01

Pos logit contributions
 Cipher+0.930
 dyn+0.886
 Axis+0.811
anke+0.788
SHIP+0.774
Neg logit contributions
brid-1.129
Honest-1.048
aught-1.026
oping-1.016
ール-1.001
Token-
Token ID12
Feature activation-5.67e-01

Pos logit contributions
brid+1.773
Honest+1.580
oping+1.511
aught+1.509
ール+1.492
Neg logit contributions
 Cipher-2.330
 dyn-2.145
anke-2.056
SHIP-2.048
 Axis-1.968
TokenOtt
Token ID49092
Feature activation-2.40e+00

Pos logit contributions
 Cipher+1.623
 dyn+1.545
 Axis+1.400
anke+1.393
SHIP+1.357
Neg logit contributions
brid-2.115
Honest-1.958
aught-1.921
oping-1.906
ール-1.877
Tokenoman
Token ID5185
Feature activation-7.26e-01

Pos logit contributions
 Cipher+6.853
 dyn+6.606
anke+5.979
 Axis+5.871
SHIP+5.599
Neg logit contributions
brid-8.496
Honest-8.089
ingly-7.871
ール-7.645
aught-7.545
Token an
Token ID281
Feature activation-6.96e-01

Pos logit contributions
 Cipher+2.184
 dyn+2.097
 Axis+1.933
anke+1.855
SHIP+1.820
Neg logit contributions
brid-2.610
Honest-2.420
aught-2.371
oping-2.330
ール-2.303
Tokenvil
Token ID2991
Feature activation+1.46e-01

Pos logit contributions
 Cipher+2.420
 dyn+2.326
anke+2.176
 Axis+2.171
SHIP+2.067
Neg logit contributions
brid-2.159
Honest-1.979
aught-1.905
oping-1.890
ール-1.868
Token fall
Token ID2121
Feature activation+4.02e-01

Pos logit contributions
brid+0.532
Honest+0.492
aught+0.480
oping+0.475
ール+0.467
Neg logit contributions
 Cipher-0.435
 dyn-0.409
anke-0.375
 Axis-0.374
SHIP-0.366
Token hardest
Token ID17612
Feature activation-4.11e-01

Pos logit contributions
brid+1.401
Honest+1.287
aught+1.257
oping+1.245
ール+1.221
Neg logit contributions
 Cipher-1.269
 dyn-1.196
anke-1.103
 Axis-1.098
SHIP-1.081
Token Greater
Token ID18169
Feature activation+1.61e-01

Pos logit contributions
 Cipher+0.587
 dyn+0.559
 Axis+0.517
anke+0.512
SHIP+0.507
Neg logit contributions
brid-0.535
Honest-0.493
aught-0.474
oping-0.472
ール-0.465
Token Cincinnati
Token ID16137
Feature activation+9.05e-01

Pos logit contributions
brid+0.553
Honest+0.513
aught+0.495
oping+0.492
ール+0.486
Neg logit contributions
 Cipher-0.519
 dyn-0.483
 Axis-0.449
anke-0.444
SHIP-0.437
Token.
Token ID13
Feature activation-4.98e-02

Pos logit contributions
brid+3.029
Honest+2.758
aught+2.742
oping+2.664
uts+2.646
Neg logit contributions
 Cipher-2.948
 dyn-2.784
anke-2.602
 Axis-2.550
SHIP-2.504
Token Please
Token ID4222
Feature activation-7.24e-01

Pos logit contributions
 Cipher+0.169
 dyn+0.160
anke+0.149
 Axis+0.148
SHIP+0.145
Neg logit contributions
brid-0.162
Honest-0.148
aught-0.144
oping-0.142
uts-0.140
Token try
Token ID1949
Feature activation-2.34e-01

Pos logit contributions
 Cipher+2.277
 dyn+2.158
anke+1.965
 Axis+1.951
 worsen+1.877
Neg logit contributions
brid-2.560
Honest-2.409
aught-2.329
oping-2.254
ール-2.236
Token again
Token ID757
Feature activation-2.40e+00

Pos logit contributions
 Cipher+0.558
 dyn+0.515
anke+0.461
 Axis+0.460
SHIP+0.447
Neg logit contributions
brid-0.997
Honest-0.932
aught-0.913
oping-0.906
ール-0.893
Token soon
Token ID2582
Feature activation-1.37e+00

Pos logit contributions
 Cipher+5.301
 dyn+4.507
anke+4.316
 Axis+4.225
SHIP+4.147
Neg logit contributions
brid-10.243
Honest-9.440
oping-9.304
ール-9.206
oped-9.150
Token,
Token ID11
Feature activation-5.98e-01

Pos logit contributions
 Cipher+3.609
 dyn+3.171
 Axis+2.868
anke+2.767
SHIP+2.675
Neg logit contributions
brid-5.452
Honest-5.148
oping-5.110
oped-5.034
aught-5.006
Token or
Token ID393
Feature activation-7.62e-01

Pos logit contributions
 Cipher+1.644
 dyn+1.595
 Axis+1.466
anke+1.427
SHIP+1.410
Neg logit contributions
brid-2.276
Honest-2.121
oping-2.058
aught-2.023
ール-2.010
Token contact
Token ID2800
Feature activation+4.77e-02

Pos logit contributions
 Cipher+2.095
 dyn+1.985
 Axis+1.818
anke+1.805
 worsen+1.755
Neg logit contributions
brid-2.912
Honest-2.764
oping-2.616
aught-2.598
ール-2.584
Token Customer
Token ID22092
Feature activation-3.05e-01

Pos logit contributions
brid+0.143
Honest+0.133
aught+0.126
oping+0.126
ール+0.125
Neg logit contributions
 Cipher-0.173
 dyn-0.163
 Axis-0.153
anke-0.150
SHIP-0.150
Token today
Token ID1909
Feature activation+2.68e-01

Pos logit contributions
brid+0.599
Honest+0.554
aught+0.541
oping+0.535
ール+0.527
Neg logit contributions
 Cipher-0.487
 dyn-0.458
anke-0.420
 Axis-0.418
SHIP-0.410
Token ruled
Token ID8879
Feature activation-6.81e-01

Pos logit contributions
brid+0.979
Honest+0.908
aught+0.884
oping+0.874
ール+0.862
Neg logit contributions
 Cipher-0.800
 dyn-0.754
anke-0.691
 Axis-0.688
SHIP-0.674
Token one
Token ID530
Feature activation-1.09e+00

Pos logit contributions
 Cipher+2.159
 dyn+2.052
anke+1.883
 Axis+1.879
SHIP+1.827
Neg logit contributions
brid-2.345
Honest-2.181
aught-2.119
oping-2.075
ール-2.069
Token section
Token ID2665
Feature activation-7.95e-01

Pos logit contributions
 Cipher+3.171
 dyn+2.993
 Axis+2.733
anke+2.690
SHIP+2.623
Neg logit contributions
brid-4.019
Honest-3.751
aught-3.649
ール-3.617
oping-3.603
Token of
Token ID286
Feature activation-3.39e-01

Pos logit contributions
 Cipher+2.146
 dyn+2.007
anke+1.812
 Axis+1.792
SHIP+1.760
Neg logit contributions
brid-3.115
Honest-2.924
aught-2.833
ール-2.812
oping-2.804
Token DOM
Token ID24121
Feature activation-2.35e+00

Pos logit contributions
 Cipher+0.996
 dyn+0.936
 Axis+0.852
anke+0.850
SHIP+0.835
Neg logit contributions
brid-1.254
Honest-1.166
aught-1.137
oping-1.125
ール-1.110
TokenA
Token ID32
Feature activation+1.54e-01

Pos logit contributions
 Cipher+3.949
 dyn+3.649
SHIP+3.333
 Axis+3.320
anke+3.176
Neg logit contributions
brid-10.491
oping-9.871
spection-9.789
oppers-9.752
aught-9.724
Token unconstitutional
Token ID20727
Feature activation-1.71e-01

Pos logit contributions
brid+0.573
Honest+0.531
aught+0.518
oping+0.513
ール+0.505
Neg logit contributions
 Cipher-0.454
 dyn-0.426
anke-0.390
 Axis-0.388
SHIP-0.381
Token even
Token ID772
Feature activation-1.06e-01

Pos logit contributions
 Cipher+0.516
 dyn+0.487
anke+0.448
 Axis+0.446
SHIP+0.434
Neg logit contributions
brid-0.619
Honest-0.577
aught-0.561
oping-0.555
ール-0.546
Token though
Token ID996
Feature activation+3.60e-01

Pos logit contributions
 Cipher+0.306
 dyn+0.286
anke+0.260
 Axis+0.259
SHIP+0.253
Neg logit contributions
brid-0.403
Honest-0.374
aught-0.366
oping-0.360
ール-0.357
Token Obama
Token ID2486
Feature activation+2.18e-01

Pos logit contributions
brid+1.187
Honest+1.087
aught+1.058
oping+1.046
ール+1.027
Neg logit contributions
 Cipher-1.206
 dyn-1.140
anke-1.056
 Axis-1.053
SHIP-1.036
Tokenz
Token ID89
Feature activation-1.03e-02

Pos logit contributions
 Cipher+0.949
 dyn+0.901
anke+0.835
 Axis+0.832
SHIP+0.824
Neg logit contributions
brid-0.882
Honest-0.798
aught-0.783
oping-0.771
uts-0.758
Tokenus
Token ID385
Feature activation-1.65e-01

Pos logit contributions
 Cipher+0.041
 dyn+0.039
anke+0.036
 Axis+0.036
SHIP+0.036
Neg logit contributions
brid-0.028
Honest-0.025
aught-0.024
oping-0.024
ール-0.023
Token on
Token ID319
Feature activation-3.82e-01

Pos logit contributions
 Cipher+0.495
 dyn+0.465
anke+0.427
 Axis+0.426
SHIP+0.418
Neg logit contributions
brid-0.602
Honest-0.557
aught-0.543
oping-0.537
ール-0.529
Token Twitter
Token ID3009
Feature activation+9.52e-01

Pos logit contributions
 Cipher+1.163
 dyn+1.093
 Axis+1.002
anke+0.999
SHIP+0.981
Neg logit contributions
brid-1.377
Honest-1.271
aught-1.240
oping-1.228
ール-1.207
Token @
Token ID2488
Feature activation+1.42e-01

Pos logit contributions
brid+2.623
Honest+2.359
aught+2.310
oping+2.268
ingly+2.230
Neg logit contributions
 Cipher-3.599
 dyn-3.436
anke-3.263
 Axis-3.174
SHIP-3.164
Tokendan
Token ID25604
Feature activation-2.34e+00

Pos logit contributions
brid+0.424
Honest+0.384
aught+0.372
oping+0.368
ール+0.360
Neg logit contributions
 Cipher-0.522
 dyn-0.495
anke-0.462
 Axis-0.462
SHIP-0.454
Tokenhan
Token ID7637
Feature activation-2.19e-01

Pos logit contributions
 dyn+6.564
 Cipher+6.455
SHIP+6.355
anke+6.286
 Axis+5.636
Neg logit contributions
brid-7.260
Honest-7.138
ール-6.883
oppers-6.806
oping-6.706
Tokenz
Token ID89
Feature activation-7.75e-01

Pos logit contributions
 Cipher+0.719
 dyn+0.685
anke+0.639
 Axis+0.631
SHIP+0.622
Neg logit contributions
brid-0.721
Honest-0.668
aught-0.644
oping-0.638
ール-0.629
Tokenus
Token ID385
Feature activation-6.04e-01

Pos logit contributions
 Cipher+2.548
 dyn+2.515
anke+2.401
 Axis+2.368
SHIP+2.285
Neg logit contributions
brid-2.312
Honest-2.255
ール-2.138
oping-2.084
aught-2.049
Token and
Token ID290
Feature activation-7.32e-01

Pos logit contributions
 Cipher+1.870
 dyn+1.766
 Axis+1.618
anke+1.617
SHIP+1.591
Neg logit contributions
brid-2.124
Honest-1.965
aught-1.915
oping-1.897
ール-1.867
Token check
Token ID2198
Feature activation-6.90e-01

Pos logit contributions
 Cipher+2.203
 dyn+2.081
 Axis+1.908
anke+1.861
SHIP+1.823
Neg logit contributions
brid-2.649
Honest-2.458
aught-2.380
oping-2.359
oped-2.315
Token Windows
Token ID3964
Feature activation-7.21e-01

Pos logit contributions
 Cipher+0.677
 dyn+0.639
 Axis+0.590
anke+0.581
SHIP+0.570
Neg logit contributions
brid-0.746
Honest-0.691
aught-0.671
oping-0.664
ール-0.656
Token,
Token ID11
Feature activation-3.34e-02

Pos logit contributions
 Cipher+2.215
 dyn+2.094
 Axis+1.935
anke+1.862
SHIP+1.852
Neg logit contributions
brid-2.553
Honest-2.398
aught-2.301
oping-2.284
ール-2.270
Token Core
Token ID7231
Feature activation-4.79e-01

Pos logit contributions
 Cipher+0.111
 dyn+0.105
anke+0.098
 Axis+0.097
SHIP+0.096
Neg logit contributions
brid-0.111
Honest-0.101
aught-0.099
oping-0.098
uts-0.096
TokenAudio
Token ID21206
Feature activation+4.09e-01

Pos logit contributions
 Cipher+1.656
 dyn+1.572
 Axis+1.470
anke+1.450
SHIP+1.422
Neg logit contributions
brid-1.510
Honest-1.391
aught-1.351
oping-1.344
ール-1.326
Token on
Token ID319
Feature activation-2.85e-01

Pos logit contributions
brid+1.461
Honest+1.342
aught+1.313
oping+1.299
uts+1.278
Neg logit contributions
 Cipher-1.255
 dyn-1.181
anke-1.092
 Axis-1.076
SHIP-1.072
Token OS
Token ID7294
Feature activation-2.33e+00

Pos logit contributions
 Cipher+0.809
 dyn+0.757
anke+0.695
 Axis+0.686
SHIP+0.679
Neg logit contributions
brid-1.081
Honest-1.001
aught-0.980
oping-0.970
uts-0.954
TokenX
Token ID55
Feature activation-1.00e+00

Pos logit contributions
 Cipher+3.468
 dyn+3.295
 Axis+2.815
SHIP+2.388
anke+2.300
Neg logit contributions
brid-10.954
ール-10.262
oping-10.241
aught-10.189
Honest-10.187
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
 Cipher+3.265
 dyn+3.083
anke+2.851
 Axis+2.840
SHIP+2.795
Neg logit contributions
brid-3.378
Honest-3.100
aught-3.021
oping-2.988
ール-2.933
Token and
Token ID290
Feature activation-5.14e-01

Pos logit contributions
 Cipher+0.633
 dyn+0.597
anke+0.552
 Axis+0.551
SHIP+0.541
Neg logit contributions
brid-0.654
Honest-0.600
aught-0.585
oping-0.579
ール-0.568
Token AL
Token ID8355
Feature activation+9.99e-02

Pos logit contributions
 Cipher+1.706
 dyn+1.608
 Axis+1.492
anke+1.470
SHIP+1.439
Neg logit contributions
brid-1.715
Honest-1.585
aught-1.540
oping-1.522
ール-1.504
TokenSA
Token ID4090
Feature activation+4.50e-01

Pos logit contributions
brid+0.349
Honest+0.322
aught+0.314
oping+0.311
ール+0.306
Neg logit contributions
 Cipher-0.314
 dyn-0.296
anke-0.272
 Axis-0.272
SHIP-0.267
Token-
Token ID12
Feature activation-2.27e-02

Pos logit contributions
 Cipher+2.031
 dyn+1.932
 Axis+1.764
anke+1.715
SHIP+1.701
Neg logit contributions
brid-2.381
Honest-2.191
ール-2.144
aught-2.133
oping-2.122
TokenAmerican
Token ID7437
Feature activation+1.07e-02

Pos logit contributions
 Cipher+0.071
 dyn+0.067
anke+0.062
 Axis+0.062
SHIP+0.061
Neg logit contributions
brid-0.079
Honest-0.073
aught-0.071
oping-0.071
ール-0.070
Token push
Token ID4574
Feature activation+8.68e-02

Pos logit contributions
brid+0.041
Honest+0.038
aught+0.037
oping+0.037
ール+0.036
Neg logit contributions
 Cipher-0.030
 dyn-0.028
 Axis-0.025
anke-0.025
SHIP-0.025
Token to
Token ID284
Feature activation-1.53e-01

Pos logit contributions
brid+0.304
Honest+0.281
aught+0.274
oping+0.271
ール+0.267
Neg logit contributions
 Cipher-0.272
 dyn-0.256
anke-0.236
 Axis-0.235
SHIP-0.231
Token overthrow
Token ID25525
Feature activation-1.05e+00

Pos logit contributions
 Cipher+0.438
 dyn+0.412
 Axis+0.376
anke+0.373
SHIP+0.365
Neg logit contributions
brid-0.578
Honest-0.537
aught-0.524
oping-0.516
ール-0.513
Token Gaddafi
Token ID40320
Feature activation-2.32e+00

Pos logit contributions
 Cipher+2.995
 dyn+2.859
 Axis+2.581
anke+2.507
SHIP+2.427
Neg logit contributions
brid-3.952
Honest-3.652
ール-3.578
aught-3.576
oping-3.553
Token and
Token ID290
Feature activation+9.13e-02

Pos logit contributions
 Cipher+6.231
 dyn+6.005
 Axis+5.549
anke+5.509
 Hussein+5.325
Neg logit contributions
brid-8.220
Honest-7.611
aught-7.418
ール-7.364
oping-7.290
Token essentially
Token ID6986
Feature activation+3.19e-01

Pos logit contributions
brid+0.332
Honest+0.307
aught+0.299
oping+0.295
ール+0.292
Neg logit contributions
 Cipher-0.275
 dyn-0.260
 Axis-0.237
anke-0.237
SHIP-0.232
Token shove
Token ID38332
Feature activation+4.27e-01

Pos logit contributions
brid+1.194
Honest+1.106
aught+1.080
oping+1.069
ール+1.053
Neg logit contributions
 Cipher-0.925
 dyn-0.870
anke-0.793
 Axis-0.792
SHIP-0.775
Token Gaz
Token ID21347
Feature activation-8.08e-01

Pos logit contributions
brid+1.450
Honest+1.328
aught+1.295
oping+1.280
uts+1.258
Neg logit contributions
 Cipher-1.387
 dyn-1.307
anke-1.213
 Axis-1.204
SHIP-1.191
Tokenprom
Token ID16963
Feature activation+5.67e-01

Pos logit contributions
 Cipher+3.224
 dyn+3.194
anke+3.033
SHIP+2.952
 worsen+2.880
Neg logit contributions
brid-1.905
Honest-1.804
aught-1.662
ール-1.657
oping-1.588
Token year
Token ID614
Feature activation+4.75e-01

Pos logit contributions
brid+0.887
Honest+0.817
aught+0.797
oping+0.790
ール+0.775
Neg logit contributions
 Cipher-0.777
 dyn-0.731
anke-0.673
 Axis-0.670
SHIP-0.660
Token,
Token ID11
Feature activation+9.97e-02

Pos logit contributions
brid+1.521
Honest+1.382
aught+1.346
oping+1.341
ingly+1.313
Neg logit contributions
 Cipher-1.630
 dyn-1.544
anke-1.426
 Axis-1.422
SHIP-1.412
Token their
Token ID511
Feature activation+2.13e-02

Pos logit contributions
brid+0.319
Honest+0.291
aught+0.283
oping+0.280
ール+0.275
Neg logit contributions
 Cipher-0.343
 dyn-0.325
anke-0.302
 Axis-0.301
SHIP-0.296
Token relations
Token ID2316
Feature activation-5.65e-01

Pos logit contributions
brid+0.078
Honest+0.072
aught+0.070
oping+0.069
ール+0.069
Neg logit contributions
 Cipher-0.064
 dyn-0.061
 Axis-0.055
anke-0.055
SHIP-0.054
Token with
Token ID351
Feature activation-3.71e-01

Pos logit contributions
 Cipher+1.261
 dyn+1.180
anke+1.051
 Axis+1.040
SHIP+1.015
Neg logit contributions
brid-2.471
Honest-2.318
aught-2.265
oping-2.240
ール-2.226
Token Gaddafi
Token ID40320
Feature activation-2.32e+00

Pos logit contributions
 Cipher+1.103
 dyn+1.038
 Axis+0.952
anke+0.934
SHIP+0.915
Neg logit contributions
brid-1.363
Honest-1.258
aught-1.231
oping-1.218
ール-1.209
Token had
Token ID550
Feature activation+1.85e-01

Pos logit contributions
 Cipher+6.559
 dyn+6.407
 worsen+5.907
anke+5.818
 Axis+5.808
Neg logit contributions
brid-7.837
Honest-7.327
aught-7.084
ール-7.056
oppers-6.906
Token s
Token ID264
Feature activation+5.35e-01

Pos logit contributions
brid+0.715
Honest+0.664
aught+0.649
oping+0.643
ール+0.633
Neg logit contributions
 Cipher-0.512
 dyn-0.478
anke-0.436
 Axis-0.433
SHIP-0.425
Tokenoured
Token ID8167
Feature activation-4.72e-01

Pos logit contributions
brid+1.472
Honest+1.322
aught+1.292
oping+1.268
oped+1.242
Neg logit contributions
 Cipher-2.079
 dyn-1.974
anke-1.856
 Axis-1.850
SHIP-1.834
Token.
Token ID13
Feature activation-1.68e-02

Pos logit contributions
 Cipher+1.608
 dyn+1.525
anke+1.419
 Axis+1.417
SHIP+1.386
Neg logit contributions
brid-1.514
Honest-1.386
aught-1.362
oping-1.339
ール-1.313
Token The
Token ID383
Feature activation+1.77e-01

Pos logit contributions
 Cipher+0.057
 dyn+0.053
 Axis+0.050
anke+0.049
SHIP+0.048
Neg logit contributions
brid-0.055
Honest-0.050
aught-0.050
oping-0.049
oped-0.048
Token stack
Token ID8931
Feature activation-8.76e-01

Pos logit contributions
 Cipher+1.407
 dyn+1.299
anke+1.198
 Axis+1.185
SHIP+1.169
Neg logit contributions
brid-2.046
Honest-1.895
aught-1.868
oping-1.848
oped-1.819
Token frame
Token ID5739
Feature activation-3.34e-01

Pos logit contributions
 Cipher+2.065
 dyn+1.900
anke+1.701
 Axis+1.693
SHIP+1.655
Neg logit contributions
brid-3.764
Honest-3.513
aught-3.453
oping-3.423
oped-3.370
Token |
Token ID930
Feature activation-3.78e-02

Pos logit contributions
 Cipher+1.032
 dyn+0.974
anke+0.901
 Axis+0.895
SHIP+0.891
Neg logit contributions
brid-1.193
Honest-1.082
aught-1.071
oping-1.055
oped-1.038
Token void
Token ID7951
Feature activation-4.40e-01

Pos logit contributions
 Cipher+0.115
 dyn+0.108
anke+0.100
 Axis+0.099
SHIP+0.098
Neg logit contributions
brid-0.136
Honest-0.125
aught-0.122
oping-0.121
uts-0.119
Token *
Token ID1635
Feature activation+7.39e-02

Pos logit contributions
 Cipher+1.770
 dyn+1.678
anke+1.593
 Axis+1.579
SHIP+1.578
Neg logit contributions
brid-1.157
Honest-1.020
aught-1.009
oping-0.993
oped-0.969
Tokenarg
Token ID853
Feature activation-2.32e+00

Pos logit contributions
brid+0.223
aught+0.196
Honest+0.195
oping+0.194
oped+0.190
Neg logit contributions
 Cipher-0.274
 dyn-0.259
 Axis-0.242
anke-0.240
SHIP-0.239
Token |
Token ID930
Feature activation-1.84e-02

Pos logit contributions
 Cipher+5.979
 dyn+5.596
anke+4.977
 Axis+4.969
SHIP+4.701
Neg logit contributions
brid-9.031
Honest-8.615
aught-8.237
ール-8.201
oping-8.165
Token +
Token ID1343
Feature activation-1.23e-01

Pos logit contributions
 Cipher+0.064
 dyn+0.061
anke+0.057
 Axis+0.056
SHIP+0.056
Neg logit contributions
brid-0.057
Honest-0.052
aught-0.051
oping-0.050
uts-0.049
Token--------------------
Token ID19351
Feature activation-4.99e-01

Pos logit contributions
 Cipher+0.386
 dyn+0.364
anke+0.333
 Axis+0.333
SHIP+0.327
Neg logit contributions
brid-0.434
Honest-0.401
aught-0.390
oping-0.386
ール-0.381
Token--+
Token ID44785
Feature activation-2.08e-03

Pos logit contributions
 Cipher+1.426
 dyn+1.332
 Axis+1.224
anke+1.223
SHIP+1.177
Neg logit contributions
brid-1.851
Honest-1.760
aught-1.676
oping-1.657
ール-1.655
Token |
Token ID930
Feature activation+1.34e-01

Pos logit contributions
 Cipher+0.006
 dyn+0.006
anke+0.006
 Axis+0.006
SHIP+0.005
Neg logit contributions
brid-0.007
Honest-0.007
aught-0.007
oping-0.007
uts-0.006
Token currency
Token ID7395
Feature activation-2.64e-02

Pos logit contributions
brid+1.971
Honest+1.805
aught+1.758
oping+1.742
uts+1.713
Neg logit contributions
 Cipher-1.843
 dyn-1.726
anke-1.607
 Axis-1.589
SHIP-1.582
Token contracts
Token ID8592
Feature activation-9.23e-03

Pos logit contributions
 Cipher+0.082
 dyn+0.077
anke+0.071
 Axis+0.071
SHIP+0.070
Neg logit contributions
brid-0.094
Honest-0.086
aught-0.084
oping-0.083
ール-0.082
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
 Cipher+0.028
 dyn+0.026
anke+0.024
 Axis+0.024
SHIP+0.023
Neg logit contributions
brid-0.034
Honest-0.031
aught-0.030
oping-0.030
ール-0.030
Token
Token ID198
Feature activation+1.43e-01

Pos logit contributions
brid+0.906
Honest+0.828
aught+0.799
oping+0.782
ール+0.777
Neg logit contributions
 Cipher-1.094
 dyn-1.050
anke-0.976
SHIP-0.971
 Axis-0.959
Token
Token ID198
Feature activation+1.68e-01

Pos logit contributions
brid+0.509
Honest+0.466
aught+0.457
oping+0.457
oped+0.449
Neg logit contributions
 Cipher-0.441
 dyn-0.417
 Axis-0.384
SHIP-0.377
anke-0.377
TokenGer
Token ID38069
Feature activation-2.31e+00

Pos logit contributions
brid+0.538
Honest+0.485
aught+0.480
oping+0.477
ール+0.466
Neg logit contributions
 Cipher-0.576
 dyn-0.544
anke-0.509
 Axis-0.505
SHIP-0.503
Tokenathy
Token ID10036
Feature activation-1.39e-02

Pos logit contributions
 dyn+6.447
 Cipher+6.279
anke+6.271
SHIP+5.744
 Axis+5.702
Neg logit contributions
brid-7.238
ール-7.172
Honest-7.035
-6.914
aught-6.741
Token,
Token ID11
Feature activation+4.70e-02

Pos logit contributions
 Cipher+0.042
 dyn+0.039
 Axis+0.036
anke+0.036
SHIP+0.035
Neg logit contributions
brid-0.051
Honest-0.047
aught-0.046
oping-0.045
ール-0.045
Token 60
Token ID3126
Feature activation-2.29e-01

Pos logit contributions
brid+0.158
Honest+0.145
aught+0.141
oping+0.139
ール+0.137
Neg logit contributions
 Cipher-0.155
 dyn-0.147
 Axis-0.136
anke-0.135
SHIP-0.132
Token,
Token ID11
Feature activation-5.43e-02

Pos logit contributions
 Cipher+0.790
 dyn+0.747
anke+0.698
 Axis+0.689
SHIP+0.688
Neg logit contributions
brid-0.736
Honest-0.667
aught-0.654
oping-0.647
ingly-0.634
Token of
Token ID286
Feature activation+8.82e-02

Pos logit contributions
 Cipher+0.157
 dyn+0.148
 Axis+0.135
anke+0.134
SHIP+0.131
Neg logit contributions
brid-0.204
Honest-0.189
aught-0.184
oping-0.182
ール-0.180
Token State
Token ID1812
Feature activation-6.31e-01

Pos logit contributions
 Cipher+2.693
 dyn+2.525
 Axis+2.304
anke+2.292
SHIP+2.215
Neg logit contributions
brid-3.195
Honest-2.959
ール-2.833
aught-2.820
oping-2.801
Token Department
Token ID2732
Feature activation-3.28e-01

Pos logit contributions
 Cipher+1.862
 dyn+1.751
anke+1.609
 Axis+1.601
SHIP+1.573
Neg logit contributions
brid-2.325
Honest-2.147
aught-2.103
oping-2.082
oped-2.043
Token cable
Token ID7862
Feature activation+7.99e-02

Pos logit contributions
 Cipher+0.882
 dyn+0.819
 Axis+0.738
anke+0.734
SHIP+0.714
Neg logit contributions
brid-1.300
Honest-1.219
aught-1.182
oping-1.169
ール-1.164
Token described
Token ID3417
Feature activation+5.86e-01

Pos logit contributions
brid+0.273
Honest+0.253
aught+0.244
oping+0.241
ール+0.239
Neg logit contributions
 Cipher-0.258
 dyn-0.243
anke-0.224
 Axis-0.224
SHIP-0.219
Token how
Token ID703
Feature activation+3.35e-01

Pos logit contributions
brid+2.042
Honest+1.877
aught+1.841
oping+1.822
uts+1.792
Neg logit contributions
 Cipher-1.814
 dyn-1.704
anke-1.593
SHIP-1.570
 Axis-1.567
Token Gaddafi
Token ID40320
Feature activation-2.31e+00

Pos logit contributions
brid+1.193
Honest+1.104
aught+1.078
oping+1.071
uts+1.050
Neg logit contributions
 Cipher-1.017
 dyn-0.957
anke-0.891
 Axis-0.876
SHIP-0.875
Token had
Token ID550
Feature activation-4.05e-01

Pos logit contributions
 Cipher+6.782
 dyn+6.548
 Axis+6.015
anke+5.815
 Hussein+5.813
Neg logit contributions
brid-7.787
Honest-7.211
ール-6.918
aught-6.862
oppers-6.751
Token
Token ID564
Feature activation+4.86e-01

Pos logit contributions
 Cipher+1.243
 dyn+1.188
 Axis+1.082
anke+1.075
SHIP+1.052
Neg logit contributions
brid-1.436
Honest-1.339
aught-1.291
oping-1.275
ール-1.265
Token
Token ID246
Feature activation-5.28e-01

Pos logit contributions
brid+1.576
Honest+1.440
aught+1.402
oping+1.386
ール+1.359
Neg logit contributions
 Cipher-1.655
 dyn-1.567
anke-1.454
 Axis-1.449
SHIP-1.427
Tokenthreat
Token ID19971
Feature activation+4.47e-01

Pos logit contributions
 Cipher+1.971
 dyn+1.888
anke+1.772
 Axis+1.755
SHIP+1.753
Neg logit contributions
brid-1.455
Honest-1.341
aught-1.283
oping-1.265
ール-1.265
Tokenened
Token ID2945
Feature activation+8.80e-02

Pos logit contributions
brid+1.936
Honest+1.799
aught+1.770
oping+1.756
ingly+1.740
Neg logit contributions
 Cipher-1.031
 dyn-0.948
anke-0.852
 Axis-0.846
SHIP-0.831
Token the
Token ID262
Feature activation-4.55e-02

Pos logit contributions
 Cipher+0.160
 dyn+0.152
 Axis+0.138
anke+0.137
SHIP+0.135
Neg logit contributions
brid-0.187
Honest-0.173
aught-0.168
oping-0.166
ール-0.165
Token user
Token ID2836
Feature activation-3.89e-01

Pos logit contributions
 Cipher+0.135
 dyn+0.129
 Axis+0.117
anke+0.114
SHIP+0.112
Neg logit contributions
brid-0.168
Honest-0.157
aught-0.151
oping-0.150
ール-0.149
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
 Cipher+1.180
 dyn+1.121
 Axis+1.030
anke+1.016
SHIP+0.999
Neg logit contributions
brid-1.394
Honest-1.296
aught-1.245
oping-1.240
ール-1.223
Token When
Token ID1649
Feature activation-1.28e-01

Pos logit contributions
 Cipher+0.044
 dyn+0.041
 Axis+0.038
anke+0.037
SHIP+0.037
Neg logit contributions
brid-0.046
Honest-0.042
aught-0.041
oping-0.040
oped-0.040
Token the
Token ID262
Feature activation+5.53e-02

Pos logit contributions
 Cipher+0.419
 dyn+0.399
 Axis+0.366
anke+0.364
SHIP+0.356
Neg logit contributions
brid-0.433
Honest-0.399
aught-0.387
oping-0.380
ール-0.378
Token DOM
Token ID24121
Feature activation-2.29e+00

Pos logit contributions
brid+0.190
Honest+0.176
aught+0.171
oping+0.169
ール+0.166
Neg logit contributions
 Cipher-0.177
 dyn-0.168
 Axis-0.154
anke-0.154
SHIP-0.151
Token is
Token ID318
Feature activation-2.06e-01

Pos logit contributions
 Cipher+5.971
 dyn+5.811
 Axis+5.085
anke+4.835
 worsen+4.711
Neg logit contributions
brid-8.661
Honest-8.080
ール-7.902
aught-7.780
oping-7.680
Token ready
Token ID3492
Feature activation-4.25e-01

Pos logit contributions
 Cipher+0.625
 dyn+0.599
 Axis+0.543
anke+0.537
SHIP+0.523
Neg logit contributions
brid-0.742
Honest-0.688
aught-0.667
ール-0.655
oping-0.655
Token and
Token ID290
Feature activation+1.76e-01

Pos logit contributions
 Cipher+1.358
 dyn+1.283
anke+1.182
 Axis+1.179
SHIP+1.158
Neg logit contributions
brid-1.462
Honest-1.346
aught-1.311
oping-1.296
ール-1.273
Token Angular
Token ID28147
Feature activation-5.20e-02

Pos logit contributions
brid+0.600
Honest+0.551
aught+0.538
oping+0.534
oped+0.522
Neg logit contributions
 Cipher-0.563
 dyn-0.529
anke-0.493
 Axis-0.488
SHIP-0.483
Token calculates
Token ID43707
Feature activation+3.63e-01

Pos logit contributions
 Cipher+0.150
 dyn+0.141
 Axis+0.128
anke+0.128
SHIP+0.126
Neg logit contributions
brid-0.196
Honest-0.181
aught-0.177
oping-0.175
ール-0.173
Token and
Token ID290
Feature activation-1.24e-01

Pos logit contributions
brid+0.987
Honest+0.909
aught+0.887
oping+0.878
ール+0.862
Neg logit contributions
 Cipher-0.874
 dyn-0.823
anke-0.759
 Axis-0.755
SHIP-0.743
Token demanding
Token ID11334
Feature activation-5.80e-01

Pos logit contributions
 Cipher+0.401
 dyn+0.382
 Axis+0.352
anke+0.348
SHIP+0.342
Neg logit contributions
brid-0.422
Honest-0.389
aught-0.380
oping-0.370
ール-0.369
Token that
Token ID326
Feature activation-6.88e-01

Pos logit contributions
 Cipher+1.853
 dyn+1.766
 Axis+1.635
anke+1.595
SHIP+1.566
Neg logit contributions
brid-1.993
Honest-1.844
aught-1.796
oping-1.769
ール-1.765
Token their
Token ID511
Feature activation+7.59e-03

Pos logit contributions
 Cipher+2.103
 dyn+1.989
 Axis+1.849
anke+1.786
SHIP+1.758
Neg logit contributions
brid-2.455
Honest-2.276
aught-2.219
ール-2.183
oping-2.181
Token puppet
Token ID30095
Feature activation-8.37e-01

Pos logit contributions
brid+0.028
Honest+0.026
aught+0.025
oping+0.025
ール+0.025
Neg logit contributions
 Cipher-0.023
 dyn-0.021
 Axis-0.020
anke-0.019
SHIP-0.019
Token regime
Token ID7142
Feature activation-2.26e+00

Pos logit contributions
 Cipher+2.302
 dyn+2.192
 Axis+2.010
anke+1.955
SHIP+1.935
Neg logit contributions
brid-3.206
Honest-3.012
aught-2.932
oping-2.880
ール-2.876
Token be
Token ID307
Feature activation-4.02e-02

Pos logit contributions
 Cipher+5.974
 dyn+5.823
 Axis+5.435
 worsen+5.215
SHIP+4.952
Neg logit contributions
brid-8.444
Honest-7.957
ール-7.711
aught-7.695
oppers-7.424
Token allowed
Token ID3142
Feature activation-5.93e-01

Pos logit contributions
 Cipher+0.116
 dyn+0.109
 Axis+0.099
anke+0.099
SHIP+0.097
Neg logit contributions
brid-0.151
Honest-0.140
aught-0.137
oping-0.136
ール-0.134
Token to
Token ID284
Feature activation+1.80e-01

Pos logit contributions
 Cipher+1.874
 dyn+1.768
anke+1.630
 Axis+1.623
SHIP+1.597
Neg logit contributions
brid-2.062
Honest-1.898
aught-1.851
oping-1.831
ール-1.799
Token murder
Token ID5123
Feature activation+7.58e-02

Pos logit contributions
brid+0.680
Honest+0.630
aught+0.616
oping+0.610
ール+0.600
Neg logit contributions
 Cipher-0.517
 dyn-0.483
anke-0.442
 Axis-0.440
SHIP-0.432
Token east
Token ID7627
Feature activation+4.57e-01

Pos logit contributions
brid+0.265
Honest+0.244
aught+0.238
oping+0.235
uts+0.231
Neg logit contributions
 Cipher-0.239
 dyn-0.225
anke-0.208
 Axis-0.206
SHIP-0.203
Token has
Token ID468
Feature activation-3.35e-01

Pos logit contributions
brid+1.580
Honest+1.494
aught+1.443
oping+1.437
ール+1.426
Neg logit contributions
 Cipher-0.866
 dyn-0.803
anke-0.723
 Axis-0.714
SHIP-0.676
Token occurred
Token ID5091
Feature activation-1.50e-01

Pos logit contributions
 Cipher+0.537
 dyn+0.509
anke+0.427
 Axis+0.410
SHIP+0.387
Neg logit contributions
brid-1.675
Honest-1.585
aught-1.517
oping-1.515
ール-1.515
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
 Cipher+0.403
 dyn+0.382
 Axis+0.344
anke+0.343
SHIP+0.328
Neg logit contributions
brid-0.592
Honest-0.552
aught-0.537
oping-0.531
ール-0.530
Token Please
Token ID4222
Feature activation-4.66e-01

Pos logit contributions
 Cipher+0.395
 dyn+0.368
 Axis+0.346
anke+0.337
SHIP+0.333
Neg logit contributions
brid-0.402
Honest-0.367
aught-0.359
oping-0.357
oped-0.349
Token try
Token ID1949
Feature activation-5.15e-02

Pos logit contributions
 Cipher+1.241
 dyn+1.213
anke+1.078
 Axis+1.063
 worsen+1.032
Neg logit contributions
brid-1.818
Honest-1.721
aught-1.657
oping-1.648
ール-1.640
Token again
Token ID757
Feature activation-2.26e+00

Pos logit contributions
 Cipher+0.105
 dyn+0.100
anke+0.087
 Axis+0.086
SHIP+0.081
Neg logit contributions
brid-0.235
Honest-0.221
aught-0.216
oping-0.214
oped-0.212
Token later
Token ID1568
Feature activation-8.63e-01

Pos logit contributions
 Cipher+4.177
 dyn+3.797
 Axis+3.359
anke+3.280
SHIP+3.028
Neg logit contributions
brid-10.201
Honest-9.622
ール-9.350
oping-9.178
aught-9.162
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
 Cipher+2.037
 dyn+1.902
 Axis+1.720
anke+1.684
SHIP+1.638
Neg logit contributions
brid-3.561
Honest-3.335
oping-3.291
aught-3.278
ール-3.233
Token View
Token ID3582
Feature activation-2.73e-01

Pos logit contributions
 Cipher+1.485
 dyn+1.377
 Axis+1.324
SHIP+1.252
anke+1.231
Neg logit contributions
brid-1.801
Honest-1.637
oping-1.632
aught-1.605
oped-1.589
Token all
Token ID477
Feature activation+1.88e-01

Pos logit contributions
 Cipher+0.810
 dyn+0.766
 Axis+0.707
anke+0.683
SHIP+0.662
Neg logit contributions
brid-0.998
Honest-0.940
aught-0.906
oping-0.905
ール-0.901
Token New
Token ID968
Feature activation-7.19e-01

Pos logit contributions
brid+0.551
Honest+0.495
aught+0.492
oping+0.484
ール+0.472
Neg logit contributions
 Cipher-0.707
 dyn-0.665
 Axis-0.626
anke-0.616
SHIP-0.602
Token has
Token ID468
Feature activation-3.40e-01

Pos logit contributions
brid+1.425
Honest+1.348
aught+1.301
oping+1.295
ール+1.287
Neg logit contributions
 Cipher-0.778
 dyn-0.721
anke-0.649
 Axis-0.641
SHIP-0.605
Token occurred
Token ID5091
Feature activation-1.59e-01

Pos logit contributions
 Cipher+0.548
 dyn+0.518
anke+0.435
 Axis+0.418
SHIP+0.395
Neg logit contributions
brid-1.702
Honest-1.611
aught-1.542
ール-1.541
oping-1.540
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
 Cipher+0.429
 dyn+0.405
 Axis+0.365
anke+0.365
SHIP+0.348
Neg logit contributions
brid-0.623
Honest-0.581
aught-0.565
oping-0.559
ール-0.558
Token Please
Token ID4222
Feature activation-4.36e-01

Pos logit contributions
 Cipher+0.457
 dyn+0.425
 Axis+0.399
anke+0.390
SHIP+0.385
Neg logit contributions
brid-0.463
Honest-0.422
aught-0.413
oping-0.410
ール-0.402
Token try
Token ID1949
Feature activation-6.58e-02

Pos logit contributions
 Cipher+1.177
 dyn+1.148
anke+1.020
 Axis+1.005
 worsen+0.977
Neg logit contributions
brid-1.691
Honest-1.601
aught-1.544
oping-1.532
ール-1.526
Token again
Token ID757
Feature activation-2.25e+00

Pos logit contributions
 Cipher+0.135
 dyn+0.128
anke+0.112
 Axis+0.111
SHIP+0.104
Neg logit contributions
brid-0.300
Honest-0.282
aught-0.276
oping-0.274
ール-0.271
Token later
Token ID1568
Feature activation-9.04e-01

Pos logit contributions
 Cipher+4.157
 dyn+3.751
 Axis+3.332
anke+3.245
SHIP+2.997
Neg logit contributions
brid-10.235
Honest-9.640
ール-9.388
oping-9.201
aught-9.158
Token.
Token ID13
Feature activation-5.27e-01

Pos logit contributions
 Cipher+2.126
 dyn+1.970
 Axis+1.785
anke+1.743
SHIP+1.687
Neg logit contributions
brid-3.748
Honest-3.514
oping-3.463
aught-3.448
ール-3.408
Token View
Token ID3582
Feature activation-3.11e-01

Pos logit contributions
 Cipher+1.585
 dyn+1.469
 Axis+1.412
SHIP+1.335
anke+1.313
Neg logit contributions
brid-1.936
Honest-1.762
oping-1.750
aught-1.720
oped-1.704
Token all
Token ID477
Feature activation+1.54e-01

Pos logit contributions
 Cipher+0.916
 dyn+0.866
 Axis+0.801
anke+0.770
SHIP+0.746
Neg logit contributions
brid-1.141
Honest-1.076
oping-1.036
aught-1.034
ール-1.033
Token New
Token ID968
Feature activation-7.57e-01

Pos logit contributions
brid+0.454
Honest+0.407
aught+0.405
oping+0.399
ール+0.389
Neg logit contributions
 Cipher-0.575
 dyn-0.541
 Axis-0.510
anke-0.500
SHIP-0.488
Token-
Token ID12
Feature activation-4.14e-02

Pos logit contributions
 Cipher+0.923
 dyn+0.866
 Axis+0.792
anke+0.788
SHIP+0.779
Neg logit contributions
brid-1.119
Honest-1.041
aught-1.012
oping-0.996
ール-0.985
Tokenshaped
Token ID16760
Feature activation-1.73e-01

Pos logit contributions
 Cipher+0.140
 dyn+0.132
anke+0.123
 Axis+0.122
SHIP+0.121
Neg logit contributions
brid-0.135
Honest-0.124
aught-0.120
oping-0.119
ール-0.117
Token ep
Token ID2462
Feature activation-9.77e-01

Pos logit contributions
 Cipher+0.473
 dyn+0.441
anke+0.399
 Axis+0.399
SHIP+0.390
Neg logit contributions
brid-0.680
Honest-0.633
aught-0.619
oping-0.613
ール-0.604
Tokenoxy
Token ID23536
Feature activation-1.03e+00

Pos logit contributions
 Cipher+3.708
 dyn+3.507
anke+3.285
 Axis+3.270
SHIP+3.163
Neg logit contributions
brid-2.686
Honest-2.513
aught-2.371
ール-2.341
oping-2.298
Token resin
Token ID36536
Feature activation-1.14e+00

Pos logit contributions
 Cipher+2.876
 dyn+2.722
 Axis+2.463
anke+2.409
SHIP+2.365
Neg logit contributions
brid-3.853
Honest-3.685
aught-3.575
oping-3.501
ール-3.480
Token scaff
Token ID41498
Feature activation-2.24e+00

Pos logit contributions
 Cipher+3.352
 dyn+3.175
 Axis+2.882
anke+2.831
SHIP+2.801
Neg logit contributions
brid-4.107
Honest-3.892
aught-3.761
ール-3.675
oping-3.669
Tokenold
Token ID727
Feature activation-5.93e-01

Pos logit contributions
 Cipher+6.075
 dyn+5.955
 Axis+5.420
anke+5.320
SHIP+5.130
Neg logit contributions
brid-7.784
Honest-7.738
ール-7.244
oppers-6.947
aught-6.901
Token that
Token ID326
Feature activation-4.05e-01

Pos logit contributions
 Cipher+1.773
 dyn+1.679
 Axis+1.527
anke+1.510
SHIP+1.492
Neg logit contributions
brid-2.137
Honest-2.008
aught-1.951
oping-1.919
ール-1.905
Token measures
Token ID5260
Feature activation+8.28e-01

Pos logit contributions
 Cipher+1.150
 dyn+1.086
 Axis+0.984
anke+0.977
SHIP+0.956
Neg logit contributions
brid-1.537
Honest-1.437
aught-1.401
oping-1.375
ール-1.367
Token 0
Token ID657
Feature activation+3.65e-01

Pos logit contributions
brid+2.636
Honest+2.392
aught+2.335
oping+2.316
ingly+2.280
Neg logit contributions
 Cipher-2.831
 dyn-2.661
anke-2.508
 Axis-2.464
SHIP-2.449
Token.
Token ID13
Feature activation-4.44e-01

Pos logit contributions
brid+1.239
Honest+1.125
aught+1.100
oping+1.099
uts+1.081
Neg logit contributions
 Cipher-1.190
 dyn-1.108
anke-1.053
SHIP-1.025
 Axis-1.020
Token both
Token ID1111
Feature activation+5.94e-01

Pos logit contributions
brid+0.559
Honest+0.516
aught+0.501
oping+0.499
ール+0.492
Neg logit contributions
 Cipher-0.488
 dyn-0.464
 Axis-0.425
anke-0.423
SHIP-0.413
Token emotions
Token ID10825
Feature activation+8.67e-01

Pos logit contributions
brid+1.996
Honest+1.832
aught+1.786
oping+1.764
uts+1.729
Neg logit contributions
 Cipher-1.952
 dyn-1.838
anke-1.706
 Axis-1.694
SHIP-1.674
Token and
Token ID290
Feature activation+3.42e-01

Pos logit contributions
brid+2.829
Honest+2.580
aught+2.557
oping+2.507
ingly+2.487
Neg logit contributions
 Cipher-2.903
 dyn-2.696
anke-2.526
 Axis-2.504
SHIP-2.468
Token perceptions
Token ID23574
Feature activation+9.83e-01

Pos logit contributions
brid+1.214
Honest+1.119
aught+1.092
oping+1.081
ール+1.062
Neg logit contributions
 Cipher-1.059
 dyn-0.997
anke-0.917
 Axis-0.914
SHIP-0.898
Token as
Token ID355
Feature activation+2.97e-01

Pos logit contributions
brid+3.401
Honest+3.120
aught+3.081
oping+3.022
ingly+2.992
Neg logit contributions
 Cipher-3.096
 dyn-2.873
anke-2.675
 Axis-2.635
SHIP-2.615
Token reasons
Token ID3840
Feature activation+1.50e+00

Pos logit contributions
brid+1.049
Honest+0.967
aught+0.943
oping+0.933
ール+0.918
Neg logit contributions
 Cipher-0.927
 dyn-0.876
 Axis-0.805
anke-0.804
SHIP-0.787
Token,
Token ID11
Feature activation+5.44e-01

Pos logit contributions
brid+4.727
Honest+4.246
aught+4.213
ingly+4.149
oping+4.126
Neg logit contributions
 Cipher-5.056
 dyn-4.717
anke-4.388
 Axis-4.351
SHIP-4.334
Token they
Token ID484
Feature activation+3.36e-01

Pos logit contributions
brid+1.968
Honest+1.817
aught+1.774
oping+1.756
ール+1.726
Neg logit contributions
 Cipher-1.649
 dyn-1.551
anke-1.424
 Axis-1.418
SHIP-1.393
Token are
Token ID389
Feature activation+2.76e-01

Pos logit contributions
brid+1.257
Honest+1.166
aught+1.137
oping+1.126
ール+1.110
Neg logit contributions
 Cipher-0.972
 dyn-0.913
anke-0.834
 Axis-0.831
SHIP-0.814
Token more
Token ID517
Feature activation+1.07e+00

Pos logit contributions
brid+1.019
Honest+0.942
aught+0.920
oping+0.911
ール+0.897
Neg logit contributions
 Cipher-0.816
 dyn-0.767
anke-0.702
 Axis-0.700
SHIP-0.686
Token sympathetic
Token ID22335
Feature activation+2.91e-01

Pos logit contributions
brid+3.498
Honest+3.219
aught+3.118
oping+3.071
ingly+3.058
Neg logit contributions
 Cipher-3.592
 dyn-3.344
anke-3.178
 Axis-3.103
SHIP-3.101
Token://
Token ID1378
Feature activation+2.82e-01

Pos logit contributions
brid+3.268
Honest+3.059
aught+3.002
oping+2.987
ール+2.945
Neg logit contributions
 Cipher-1.559
 dyn-1.429
anke-1.257
 Axis-1.243
SHIP-1.220
Token.
Token ID764
Feature activation+9.36e-01

Pos logit contributions
brid+0.906
Honest+0.823
aught+0.804
oping+0.792
uts+0.776
Neg logit contributions
 Cipher-0.970
 dyn-0.914
anke-0.860
 Axis-0.853
SHIP-0.843
Tokennice
Token ID44460
Feature activation+3.50e-01

Pos logit contributions
brid+3.093
Honest+2.795
aught+2.713
oping+2.705
ingly+2.653
Neg logit contributions
 Cipher-3.126
 dyn-2.957
anke-2.739
 Axis-2.729
SHIP-2.700
Tokenhash
Token ID17831
Feature activation+1.10e+00

Pos logit contributions
brid+1.442
Honest+1.363
oping+1.333
aught+1.329
ール+1.320
Neg logit contributions
 Cipher-0.847
 dyn-0.810
anke-0.733
SHIP-0.713
 Axis-0.704
Token.
Token ID13
Feature activation+8.34e-01

Pos logit contributions
brid+3.935
Honest+3.628
aught+3.540
oping+3.503
ール+3.443
Neg logit contributions
 Cipher-3.378
 dyn-3.176
anke-2.925
 Axis-2.911
SHIP-2.861
Tokencom
Token ID785
Feature activation+1.53e+00

Pos logit contributions
brid+2.815
Honest+2.603
aught+2.533
oping+2.504
ール+2.466
Neg logit contributions
 Cipher-2.711
 dyn-2.579
anke-2.382
 Axis-2.353
SHIP-2.330
Token/
Token ID14
Feature activation+1.29e+00

Pos logit contributions
brid+4.944
Honest+4.529
aught+4.395
oping+4.359
ール+4.272
Neg logit contributions
 Cipher-5.095
 dyn-4.791
anke-4.507
 Axis-4.456
SHIP-4.394
Tokenapi
Token ID15042
Feature activation-1.72e-02

Pos logit contributions
brid+2.060
Honest+1.547
oping+1.538
aught+1.508
oped+1.444
Neg logit contributions
 Cipher-6.495
 dyn-6.156
 Axis-5.971
anke-5.952
SHIP-5.827
Token is
Token ID318
Feature activation+3.46e-01

Pos logit contributions
 Cipher+0.044
 dyn+0.042
anke+0.037
 Axis+0.037
SHIP+0.035
Neg logit contributions
brid-0.069
Honest-0.065
aught-0.064
ール-0.062
oping-0.062
Token deprecated
Token ID39224
Feature activation+1.38e-01

Pos logit contributions
brid+1.681
Honest+1.614
aught+1.568
oping+1.554
uts+1.535
Neg logit contributions
 Cipher-0.616
 dyn-0.571
 Axis-0.467
anke-0.463
SHIP-0.428
Token and
Token ID290
Feature activation+7.22e-01

Pos logit contributions
brid+0.520
Honest+0.484
aught+0.468
oping+0.465
ール+0.459
Neg logit contributions
 Cipher-0.400
 dyn-0.380
anke-0.343
 Axis-0.341
SHIP-0.325
Token in
Token ID287
Feature activation-1.23e-01

Pos logit contributions
 Cipher+1.150
 dyn+1.084
 Axis+1.000
anke+0.979
SHIP+0.964
Neg logit contributions
brid-1.418
Honest-1.324
aught-1.271
ール-1.264
uts-1.255
Token the
Token ID262
Feature activation-4.52e-01

Pos logit contributions
 Cipher+0.367
 dyn+0.345
 Axis+0.318
anke+0.312
SHIP+0.307
Neg logit contributions
brid-0.450
Honest-0.420
aught-0.405
oping-0.401
ール-0.398
Token territory
Token ID7674
Feature activation-2.48e-01

Pos logit contributions
 Cipher+1.335
 dyn+1.268
 Axis+1.165
anke+1.123
SHIP+1.092
Neg logit contributions
brid-1.641
Honest-1.565
aught-1.490
ール-1.480
oping-1.477
Token.
Token ID13
Feature activation-3.37e-02

Pos logit contributions
 Cipher+0.760
 dyn+0.714
 Axis+0.657
anke+0.655
SHIP+0.640
Neg logit contributions
brid-0.886
Honest-0.824
aught-0.799
oping-0.789
ール-0.785
Token The
Token ID383
Feature activation+8.51e-03

Pos logit contributions
 Cipher+0.113
 dyn+0.106
 Axis+0.099
anke+0.098
SHIP+0.097
Neg logit contributions
brid-0.112
Honest-0.103
aught-0.100
oping-0.098
ール-0.097
Token head
Token ID1182
Feature activation+1.69e+00

Pos logit contributions
brid+0.031
Honest+0.029
aught+0.028
ール+0.027
oping+0.027
Neg logit contributions
 Cipher-0.026
 dyn-0.024
 Axis-0.022
anke-0.022
SHIP-0.021
Token of
Token ID286
Feature activation+9.97e-02

Pos logit contributions
brid+5.658
oping+5.228
aught+5.151
ingly+5.147
uts+5.112
Neg logit contributions
 Cipher-5.323
 dyn-4.916
 Axis-4.574
anke-4.567
SHIP-4.444
Token the
Token ID262
Feature activation+3.53e-02

Pos logit contributions
brid+0.340
Honest+0.314
aught+0.304
oping+0.300
ール+0.297
Neg logit contributions
 Cipher-0.323
 dyn-0.305
 Axis-0.281
anke-0.279
SHIP-0.275
Token WHO
Token ID19494
Feature activation-2.68e-01

Pos logit contributions
brid+0.123
Honest+0.115
aught+0.110
ール+0.109
oping+0.109
Neg logit contributions
 Cipher-0.112
 dyn-0.105
 Axis-0.098
anke-0.096
SHIP-0.094
Token's
Token ID338
Feature activation-3.88e-01

Pos logit contributions
 Cipher+0.879
 dyn+0.827
 Axis+0.767
anke+0.758
SHIP+0.747
Neg logit contributions
brid-0.901
Honest-0.835
aught-0.803
ール-0.797
oping-0.784
Token global
Token ID3298
Feature activation-5.44e-02

Pos logit contributions
 Cipher+1.242
 dyn+1.175
 Axis+1.094
anke+1.057
SHIP+1.033
Neg logit contributions
brid-1.331
Honest-1.252
ール-1.189
aught-1.186
oping-1.162
Token would
Token ID561
Feature activation+3.79e-01

Pos logit contributions
 Cipher+0.973
 dyn+0.918
 Axis+0.835
anke+0.833
SHIP+0.818
Neg logit contributions
brid-1.269
Honest-1.183
aught-1.151
oping-1.135
ール-1.125
Token be
Token ID307
Feature activation+2.48e-01

Pos logit contributions
brid+1.433
Honest+1.316
aught+1.297
oping+1.291
oped+1.267
Neg logit contributions
 Cipher-1.083
 dyn-1.005
anke-0.927
 Axis-0.915
SHIP-0.899
Token the
Token ID262
Feature activation+1.75e-01

Pos logit contributions
brid+0.891
Honest+0.822
aught+0.803
oping+0.795
ール+0.780
Neg logit contributions
 Cipher-0.758
 dyn-0.712
anke-0.655
 Axis-0.652
SHIP-0.641
Token largest
Token ID4387
Feature activation+3.73e-01

Pos logit contributions
brid+0.580
Honest+0.536
aught+0.521
oping+0.514
ール+0.508
Neg logit contributions
 Cipher-0.583
 dyn-0.554
 Axis-0.514
anke-0.505
SHIP-0.495
Token since
Token ID1201
Feature activation-3.80e-01

Pos logit contributions
brid+1.347
Honest+1.237
aught+1.211
oping+1.205
oped+1.182
Neg logit contributions
 Cipher-1.127
 dyn-1.058
anke-0.981
 Axis-0.964
SHIP-0.955
Token vouchers
Token ID41262
Feature activation+1.54e+00

Pos logit contributions
 Cipher+1.270
 dyn+1.203
 Axis+1.121
anke+1.087
SHIP+1.078
Neg logit contributions
brid-1.251
Honest-1.161
aught-1.124
oping-1.100
ール-1.098
Token were
Token ID547
Feature activation+6.42e-01

Pos logit contributions
brid+5.707
aught+5.119
oping+5.110
Honest+5.076
oped+5.032
Neg logit contributions
 Cipher-4.485
 dyn-4.093
anke-3.824
 Axis-3.748
SHIP-3.734
Token introduced
Token ID5495
Feature activation+7.13e-01

Pos logit contributions
brid+2.181
Honest+1.999
aught+1.953
oping+1.933
oped+1.893
Neg logit contributions
 Cipher-2.080
 dyn-1.957
anke-1.814
 Axis-1.804
SHIP-1.779
Token in
Token ID287
Feature activation+6.05e-01

Pos logit contributions
brid+2.645
Honest+2.445
aught+2.390
oping+2.366
ール+2.326
Neg logit contributions
 Cipher-2.093
 dyn-1.962
anke-1.799
 Axis-1.790
SHIP-1.758
Token Milwaukee
Token ID16629
Feature activation+8.51e-01

Pos logit contributions
brid+1.890
Honest+1.724
aught+1.674
oping+1.654
ール+1.623
Neg logit contributions
 Cipher-2.136
 dyn-2.026
anke-1.880
 Axis-1.879
SHIP-1.847
Token in
Token ID287
Feature activation+3.91e-01

Pos logit contributions
brid+3.064
Honest+2.809
aught+2.753
oping+2.729
uts+2.678
Neg logit contributions
 Cipher-2.589
 dyn-2.427
anke-2.244
 Axis-2.226
SHIP-2.187
Token a
Token ID257
Feature activation-1.43e-01

Pos logit contributions
 Cipher+0.482
 dyn+0.454
anke+0.416
 Axis+0.416
SHIP+0.406
Neg logit contributions
brid-0.580
Honest-0.536
aught-0.525
oping-0.518
ール-0.513
Token brand
Token ID4508
Feature activation+1.46e-01

Pos logit contributions
 Cipher+0.449
 dyn+0.426
 Axis+0.389
anke+0.387
SHIP+0.379
Neg logit contributions
brid-0.502
Honest-0.464
aught-0.454
oping-0.446
ール-0.445
Token by
Token ID416
Feature activation+3.37e-01

Pos logit contributions
brid+0.544
Honest+0.505
aught+0.492
oping+0.484
ール+0.483
Neg logit contributions
 Cipher-0.426
 dyn-0.402
anke-0.366
 Axis-0.365
SHIP-0.357
Token protecting
Token ID10192
Feature activation+4.30e-01

Pos logit contributions
brid+1.182
Honest+1.089
aught+1.063
oping+1.049
ール+1.034
Neg logit contributions
 Cipher-1.058
 dyn-0.998
anke-0.919
 Axis-0.914
SHIP-0.899
Token your
Token ID534
Feature activation+7.48e-01

Pos logit contributions
brid+1.528
Honest+1.409
aught+1.374
oping+1.362
uts+1.336
Neg logit contributions
 Cipher-1.326
 dyn-1.247
anke-1.149
 Axis-1.142
SHIP-1.125
Token own
Token ID898
Feature activation+1.98e+00

Pos logit contributions
brid+2.550
Honest+2.342
aught+2.277
oping+2.264
uts+2.218
Neg logit contributions
 Cipher-2.404
 dyn-2.256
anke-2.100
 Axis-2.078
SHIP-2.063
Token brand
Token ID4508
Feature activation+1.06e+00

Pos logit contributions
brid+6.318
Honest+5.790
oping+5.620
ingly+5.599
aught+5.528
Neg logit contributions
 Cipher-6.355
 dyn-5.811
anke-5.577
SHIP-5.531
 Axis-5.447
Token start
Token ID923
Feature activation-3.45e-01

Pos logit contributions
brid+3.725
Honest+3.425
aught+3.341
oping+3.340
ingly+3.295
Neg logit contributions
 Cipher-3.252
 dyn-3.003
anke-2.798
 Axis-2.789
SHIP-2.757
Token inhib
Token ID11062
Feature activation+6.82e-01

Pos logit contributions
 Cipher+1.076
 dyn+1.019
anke+0.938
 Axis+0.928
SHIP+0.915
Neg logit contributions
brid-1.208
Honest-1.118
aught-1.093
oping-1.064
ール-1.061
Tokeniting
Token ID1780
Feature activation+1.91e-01

Pos logit contributions
brid+1.974
Honest+1.792
aught+1.724
oping+1.690
ール+1.688
Neg logit contributions
 Cipher-2.558
 dyn-2.439
anke-2.283
 Axis-2.264
SHIP-2.234
Token consumer
Token ID7172
Feature activation+8.45e-01

Pos logit contributions
brid+0.630
Honest+0.578
aught+0.564
oping+0.554
ール+0.551
Neg logit contributions
 Cipher-0.638
 dyn-0.607
 Axis-0.560
anke-0.557
SHIP-0.547
Token the
Token ID262
Feature activation+2.77e-02

Pos logit contributions
brid+0.991
Honest+0.907
aught+0.885
oping+0.875
uts+0.858
Neg logit contributions
 Cipher-0.998
 dyn-0.943
anke-0.880
 Axis-0.872
SHIP-0.862
Token emphasis
Token ID12476
Feature activation-5.53e-01

Pos logit contributions
brid+0.099
Honest+0.091
aught+0.089
oping+0.088
ール+0.087
Neg logit contributions
 Cipher-0.085
 dyn-0.080
 Axis-0.074
anke-0.073
SHIP-0.072
Token will
Token ID481
Feature activation-8.96e-02

Pos logit contributions
 Cipher+1.574
 dyn+1.491
 Axis+1.353
anke+1.350
SHIP+1.325
Neg logit contributions
brid-2.076
Honest-1.936
aught-1.874
oping-1.865
ール-1.844
Token be
Token ID307
Feature activation+1.75e-01

Pos logit contributions
 Cipher+0.265
 dyn+0.242
anke+0.224
 Axis+0.221
SHIP+0.218
Neg logit contributions
brid-0.334
Honest-0.306
aught-0.305
oping-0.301
ingly-0.297
Token on
Token ID319
Feature activation-1.19e-01

Pos logit contributions
brid+0.617
Honest+0.568
aught+0.556
oping+0.549
uts+0.540
Neg logit contributions
 Cipher-0.542
 dyn-0.508
anke-0.471
 Axis-0.467
SHIP-0.460
Token "
Token ID366
Feature activation+6.88e-01

Pos logit contributions
 Cipher+0.372
 dyn+0.352
 Axis+0.322
anke+0.319
SHIP+0.314
Neg logit contributions
brid-0.417
Honest-0.385
aught-0.374
oping-0.371
ール-0.368
Tokenhel
Token ID2978
Feature activation+1.07e-01

Pos logit contributions
brid+2.022
Honest+1.828
aught+1.771
oping+1.742
uts+1.708
Neg logit contributions
 Cipher-2.563
 dyn-2.436
anke-2.277
 Axis-2.271
SHIP-2.229
Tokenping
Token ID13886
Feature activation-7.05e-01

Pos logit contributions
brid+0.427
Honest+0.397
aught+0.389
oping+0.386
ール+0.380
Neg logit contributions
 Cipher-0.282
 dyn-0.262
anke-0.237
 Axis-0.237
SHIP-0.232
Token students
Token ID2444
Feature activation-6.69e-02

Pos logit contributions
 Cipher+2.257
 dyn+2.143
 Axis+1.982
anke+1.927
SHIP+1.915
Neg logit contributions
brid-2.396
Honest-2.218
aught-2.137
ール-2.136
oping-2.126
Token develop
Token ID1205
Feature activation-1.27e-01

Pos logit contributions
 Cipher+0.176
 dyn+0.166
 Axis+0.149
anke+0.148
SHIP+0.145
Neg logit contributions
brid-0.268
Honest-0.252
aught-0.243
oping-0.241
ール-0.241
Token the
Token ID262
Feature activation+1.20e-01

Pos logit contributions
 Cipher+0.390
 dyn+0.371
 Axis+0.339
anke+0.336
SHIP+0.331
Neg logit contributions
brid-0.450
Honest-0.415
aught-0.404
oping-0.404
ール-0.400
Token Campaign
Token ID13718
Feature activation+1.48e-02

Pos logit contributions
brid+0.197
Honest+0.176
aught+0.170
oping+0.167
ール+0.163
Neg logit contributions
 Cipher-0.319
 dyn-0.305
anke-0.286
 Axis-0.286
SHIP-0.282
Token Committee
Token ID4606
Feature activation-3.27e-01

Pos logit contributions
brid+0.045
Honest+0.040
aught+0.040
oping+0.039
uts+0.038
Neg logit contributions
 Cipher-0.053
 dyn-0.051
anke-0.047
 Axis-0.047
SHIP-0.046
Token W
Token ID370
Feature activation-1.16e+00

Pos logit contributions
 Cipher+1.066
 dyn+1.001
 Axis+0.930
anke+0.929
SHIP+0.910
Neg logit contributions
brid-1.106
Honest-1.012
aught-0.985
oping-0.974
ール-0.962
Tokenants
Token ID1187
Feature activation+9.64e-03

Pos logit contributions
 Cipher+3.802
 dyn+3.510
anke+3.327
 Axis+3.305
SHIP+3.229
Neg logit contributions
brid-3.840
Honest-3.514
ール-3.349
aught-3.345
oping-3.340
Token Some
Token ID2773
Feature activation-1.61e-01

Pos logit contributions
brid+0.030
Honest+0.027
aught+0.027
oping+0.026
ール+0.026
Neg logit contributions
 Cipher-0.034
 dyn-0.032
 Axis-0.030
anke-0.030
SHIP-0.029
Token Of
Token ID3226
Feature activation+4.01e-01

Pos logit contributions
 Cipher+0.581
 dyn+0.546
 Axis+0.515
anke+0.509
SHIP+0.498
Neg logit contributions
brid-0.495
Honest-0.450
aught-0.438
oping-0.433
oped-0.425
Token That
Token ID1320
Feature activation+3.43e-01

Pos logit contributions
brid+1.315
Honest+1.205
aught+1.174
oping+1.159
uts+1.140
Neg logit contributions
 Cipher-1.344
 dyn-1.275
anke-1.179
 Axis-1.172
SHIP-1.157
Token Sweet
Token ID15335
Feature activation-3.84e-01

Pos logit contributions
brid+1.087
Honest+0.991
aught+0.965
oping+0.954
ール+0.934
Neg logit contributions
 Cipher-1.192
 dyn-1.128
anke-1.049
 Axis-1.047
SHIP-1.029
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
 Cipher+1.344
 dyn+1.256
 Axis+1.184
anke+1.176
SHIP+1.141
Neg logit contributions
brid-1.208
Honest-1.110
aught-1.079
oping-1.068
oped-1.049
Token Sweet
Token ID15335
Feature activation-9.45e-01

Pos logit contributions
brid+0.315
Honest+0.284
aught+0.279
oping+0.276
oped+0.270
Neg logit contributions
 Cipher-0.367
 dyn-0.346
 Axis-0.325
anke-0.322
SHIP-0.315
Token Koch
Token ID17009
Feature activation-2.04e-02

Pos logit contributions
 Cipher+3.274
 dyn+3.034
 Axis+2.899
anke+2.860
SHIP+2.773
Neg logit contributions
brid-2.961
Honest-2.713
aught-2.648
oping-2.621
oped-2.585
Token still
Token ID991
Feature activation-2.88e-01

Pos logit contributions
brid+0.287
Honest+0.264
aught+0.257
oping+0.254
ール+0.251
Neg logit contributions
 Cipher-0.254
 dyn-0.240
anke-0.220
 Axis-0.219
SHIP-0.215
Token largely
Token ID5688
Feature activation+3.31e-01

Pos logit contributions
 Cipher+0.878
 dyn+0.831
 Axis+0.755
anke+0.755
SHIP+0.735
Neg logit contributions
brid-1.039
Honest-0.958
aught-0.931
oping-0.918
ール-0.915
Token defined
Token ID5447
Feature activation-3.41e-01

Pos logit contributions
brid+1.206
Honest+1.114
aught+1.090
oping+1.079
uts+1.059
Neg logit contributions
 Cipher-0.988
 dyn-0.925
anke-0.853
 Axis-0.847
SHIP-0.836
Token by
Token ID416
Feature activation+2.74e-03

Pos logit contributions
 Cipher+0.921
 dyn+0.861
anke+0.782
 Axis+0.777
SHIP+0.762
Neg logit contributions
brid-1.341
Honest-1.248
aught-1.219
oping-1.208
ール-1.189
Token books
Token ID3835
Feature activation+7.61e-01

Pos logit contributions
brid+0.010
Honest+0.009
aught+0.009
oping+0.008
ール+0.008
Neg logit contributions
 Cipher-0.009
 dyn-0.008
anke-0.008
 Axis-0.008
SHIP-0.007
Token she
Token ID673
Feature activation+5.52e-01

Pos logit contributions
brid+2.595
Honest+2.379
aught+2.328
oping+2.303
oped+2.256
Neg logit contributions
 Cipher-2.456
 dyn-2.312
anke-2.138
 Axis-2.135
SHIP-2.104
Token wrote
Token ID2630
Feature activation+8.02e-01

Pos logit contributions
brid+2.024
Honest+1.873
aught+1.822
oping+1.800
ール+1.780
Neg logit contributions
 Cipher-1.649
 dyn-1.553
anke-1.420
 Axis-1.413
SHIP-1.385
Token 40
Token ID2319
Feature activation-3.32e-01

Pos logit contributions
brid+2.853
Honest+2.625
aught+2.568
oping+2.540
ingly+2.497
Neg logit contributions
 Cipher-2.464
 dyn-2.310
anke-2.140
 Axis-2.120
SHIP-2.097
Token years
Token ID812
Feature activation+3.89e-01

Pos logit contributions
 Cipher+0.779
 dyn+0.723
 Axis+0.642
anke+0.638
SHIP+0.621
Neg logit contributions
brid-1.420
Honest-1.331
aught-1.301
oping-1.293
ール-1.278
Token ago
Token ID2084
Feature activation+1.04e+00

Pos logit contributions
brid+1.585
Honest+1.470
aught+1.444
oping+1.430
uts+1.415
Neg logit contributions
 Cipher-1.006
 dyn-0.926
anke-0.838
 Axis-0.835
SHIP-0.821
Token,
Token ID11
Feature activation+4.16e-01

Pos logit contributions
brid+3.560
Honest+3.255
aught+3.193
oping+3.153
ingly+3.103
Neg logit contributions
 Cipher-3.365
 dyn-3.160
anke-2.935
 Axis-2.914
SHIP-2.890
Token coaches
Token ID11070
Feature activation+1.44e+00

Pos logit contributions
 Cipher+0.090
 dyn+0.085
 Axis+0.078
anke+0.078
SHIP+0.076
Neg logit contributions
brid-0.098
Honest-0.090
aught-0.089
oping-0.087
ール-0.086
Token are
Token ID389
Feature activation-2.31e-01

Pos logit contributions
brid+4.963
Honest+4.592
aught+4.511
oping+4.477
oped+4.379
Neg logit contributions
 Cipher-4.440
 dyn-4.125
anke-3.890
 Axis-3.840
SHIP-3.789
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 Cipher+0.695
 dyn+0.659
anke+0.599
 Axis+0.597
SHIP+0.589
Neg logit contributions
brid-0.836
Honest-0.769
aught-0.751
oping-0.744
ール-0.732
Token He
Token ID679
Feature activation+7.70e-02

Pos logit contributions
brid+0.334
Honest+0.305
aught+0.297
oping+0.294
ール+0.288
Neg logit contributions
 Cipher-0.353
 dyn-0.334
anke-0.310
 Axis-0.309
SHIP-0.304
Token said
Token ID531
Feature activation-1.62e-01

Pos logit contributions
brid+0.273
Honest+0.255
aught+0.245
oping+0.242
ール+0.239
Neg logit contributions
 Cipher-0.236
 dyn-0.224
anke-0.206
 Axis-0.204
SHIP-0.200
Token [
Token ID685
Feature activation+9.36e-01

Pos logit contributions
 Cipher+0.508
 dyn+0.478
anke+0.440
 Axis+0.438
SHIP+0.429
Neg logit contributions
brid-0.568
Honest-0.524
aught-0.509
oping-0.502
ール-0.495
Tokenass
Token ID562
Feature activation-4.39e-01

Pos logit contributions
brid+3.182
Honest+2.922
aught+2.823
oping+2.798
uts+2.763
Neg logit contributions
 Cipher-3.042
 dyn-2.882
 Axis-2.671
anke-2.650
SHIP-2.584
Tokenistant
Token ID10167
Feature activation+3.58e-01

Pos logit contributions
 Cipher+1.672
 dyn+1.591
anke+1.497
 Axis+1.482
SHIP+1.458
Neg logit contributions
brid-1.231
Honest-1.122
aught-1.079
oping-1.054
ール-1.053
Token coach
Token ID3985
Feature activation+1.23e+00

Pos logit contributions
brid+1.056
Honest+0.956
aught+0.928
oping+0.918
uts+0.897
Neg logit contributions
 Cipher-1.321
 dyn-1.255
anke-1.174
 Axis-1.170
SHIP-1.154
Token]
Token ID60
Feature activation-1.65e-01

Pos logit contributions
brid+4.090
Honest+3.773
aught+3.674
oping+3.666
ingly+3.646
Neg logit contributions
 Cipher-3.912
 dyn-3.653
anke-3.476
 Axis-3.414
SHIP-3.404
Token Ju
Token ID12585
Feature activation+1.59e-02

Pos logit contributions
 Cipher+0.555
 dyn+0.525
anke+0.486
 Axis+0.484
SHIP+0.477
Neg logit contributions
brid-0.542
Honest-0.496
aught-0.483
oping-0.477
ール-0.468
Token be
Token ID307
Feature activation-3.23e-02

Pos logit contributions
 Cipher+0.666
 dyn+0.624
anke+0.567
 Axis+0.565
SHIP+0.553
Neg logit contributions
brid-0.929
Honest-0.864
aught-0.843
oping-0.835
ール-0.823
Token strong
Token ID1913
Feature activation-4.57e-01

Pos logit contributions
 Cipher+0.101
 dyn+0.096
anke+0.088
 Axis+0.088
SHIP+0.086
Neg logit contributions
brid-0.113
Honest-0.104
aught-0.101
oping-0.101
ール-0.099
Token enough
Token ID1576
Feature activation+1.62e-02

Pos logit contributions
 Cipher+1.252
 dyn+1.188
anke+1.085
 Axis+1.070
SHIP+1.039
Neg logit contributions
brid-1.751
Honest-1.644
aught-1.601
oping-1.586
ール-1.560
Token to
Token ID284
Feature activation+4.97e-01

Pos logit contributions
brid+0.056
Honest+0.052
aught+0.050
oping+0.050
ール+0.049
Neg logit contributions
 Cipher-0.051
 dyn-0.048
anke-0.045
 Axis-0.044
SHIP-0.044
Token offset
Token ID11677
Feature activation+7.97e-01

Pos logit contributions
brid+1.840
Honest+1.693
aught+1.665
oping+1.652
oped+1.621
Neg logit contributions
 Cipher-1.457
 dyn-1.353
anke-1.249
 Axis-1.240
SHIP-1.224
Token the
Token ID262
Feature activation+5.31e-01

Pos logit contributions
brid+2.670
Honest+2.420
aught+2.377
oping+2.343
uts+2.325
Neg logit contributions
 Cipher-2.623
 dyn-2.439
 Axis-2.254
anke-2.250
SHIP-2.243
Token revenue
Token ID6426
Feature activation+1.69e-01

Pos logit contributions
brid+1.933
Honest+1.784
aught+1.748
oping+1.732
uts+1.709
Neg logit contributions
 Cipher-1.583
 dyn-1.475
anke-1.360
 Axis-1.349
SHIP-1.339
Token losses
Token ID9089
Feature activation+7.13e-01

Pos logit contributions
brid+0.633
Honest+0.586
aught+0.573
oping+0.567
ール+0.558
Neg logit contributions
 Cipher-0.489
 dyn-0.458
anke-0.419
 Axis-0.417
SHIP-0.409
Token.
Token ID13
Feature activation-5.07e-02

Pos logit contributions
brid+2.534
Honest+2.329
aught+2.281
oping+2.259
oped+2.215
Neg logit contributions
 Cipher-2.200
 dyn-2.060
anke-1.902
 Axis-1.891
SHIP-1.869
Token
Token ID447
Feature activation-2.11e-01

Pos logit contributions
 Cipher+0.186
 dyn+0.176
 Axis+0.165
anke+0.165
SHIP+0.162
Neg logit contributions
brid-0.151
Honest-0.136
aught-0.133
oping-0.132
oped-0.129
Token
Token ID251
Feature activation+3.76e-01

Pos logit contributions
 Cipher+0.889
 dyn+0.851
anke+0.805
 Axis+0.798
SHIP+0.789
Neg logit contributions
brid-0.516
Honest-0.459
aught-0.441
oping-0.435
oped-0.424
Token,
Token ID11
Feature activation+2.07e-03

Pos logit contributions
brid+0.887
Honest+0.817
aught+0.800
oping+0.791
oped+0.775
Neg logit contributions
 Cipher-0.792
 dyn-0.744
anke-0.689
 Axis-0.684
SHIP-0.674
Token Yuri
Token ID38450
Feature activation-8.63e-01

Pos logit contributions
brid+0.007
Honest+0.007
aught+0.007
oping+0.007
ール+0.006
Neg logit contributions
 Cipher-0.006
 dyn-0.006
anke-0.006
 Axis-0.006
SHIP-0.005
Token Kh
Token ID5311
Feature activation+3.32e-01

Pos logit contributions
 Cipher+2.783
 dyn+2.638
 Axis+2.436
anke+2.419
SHIP+2.352
Neg logit contributions
brid-2.914
Honest-2.706
aught-2.602
oping-2.548
ール-2.546
Tokenudi
Token ID47928
Feature activation+2.81e-01

Pos logit contributions
brid+0.972
Honest+0.884
aught+0.853
oping+0.842
ール+0.828
Neg logit contributions
 Cipher-1.233
 dyn-1.173
anke-1.099
 Axis-1.092
SHIP-1.077
Token,
Token ID11
Feature activation-3.69e-01

Pos logit contributions
brid+0.929
Honest+0.846
aught+0.830
oping+0.823
uts+0.808
Neg logit contributions
 Cipher-0.936
 dyn-0.882
anke-0.821
 Axis-0.817
SHIP-0.808
Token and
Token ID290
Feature activation-5.55e-01

Pos logit contributions
 Cipher+1.150
 dyn+1.095
 Axis+1.000
anke+0.993
SHIP+0.973
Neg logit contributions
brid-1.292
Honest-1.196
aught-1.157
oping-1.148
ール-1.143
Token his
Token ID465
Feature activation-5.06e-02

Pos logit contributions
 Cipher+1.657
 dyn+1.557
 Axis+1.424
anke+1.416
SHIP+1.389
Neg logit contributions
brid-2.028
Honest-1.877
aught-1.831
oping-1.814
ール-1.789
Token sons
Token ID11989
Feature activation-4.21e-01

Pos logit contributions
 Cipher+0.105
 dyn+0.095
anke+0.084
 Axis+0.083
SHIP+0.081
Neg logit contributions
brid-0.231
Honest-0.217
aught-0.213
oping-0.212
ール-0.209
Token found
Token ID1043
Feature activation-1.02e-02

Pos logit contributions
 Cipher+1.280
 dyn+1.212
 Axis+1.105
anke+1.090
SHIP+1.079
Neg logit contributions
brid-1.514
Honest-1.398
aught-1.359
oping-1.344
ール-1.337
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
 Cipher+0.033
 dyn+0.032
anke+0.029
 Axis+0.029
SHIP+0.029
Neg logit contributions
brid-0.034
Honest-0.032
aught-0.031
oping-0.030
ール-0.030
Token remains
Token ID3793
Feature activation-1.14e-01

Pos logit contributions
brid+0.419
Honest+0.384
aught+0.373
oping+0.369
ール+0.363
Neg logit contributions
 Cipher-0.430
 dyn-0.407
anke-0.377
 Axis-0.376
SHIP-0.369
Token't
Token ID470
Feature activation-8.89e-01

Pos logit contributions
 Cipher+0.960
 dyn+0.913
anke+0.847
SHIP+0.834
 Axis+0.823
Neg logit contributions
brid-1.145
Honest-1.059
oping-1.038
aught-1.032
oped-1.016
Token mean
Token ID1612
Feature activation+4.14e-01

Pos logit contributions
 Cipher+2.489
 dyn+2.436
 Axis+2.187
 worsen+2.172
anke+2.126
Neg logit contributions
brid-3.327
Honest-3.193
aught-3.034
ール-2.959
oping-2.913
Token back
Token ID736
Feature activation-6.82e-02

Pos logit contributions
brid+1.415
Honest+1.296
aught+1.267
oping+1.259
oped+1.232
Neg logit contributions
 Cipher-1.331
 dyn-1.251
anke-1.162
 Axis-1.155
SHIP-1.136
Token to
Token ID284
Feature activation-4.26e-01

Pos logit contributions
 Cipher+0.224
 dyn+0.212
anke+0.196
 Axis+0.195
SHIP+0.192
Neg logit contributions
brid-0.230
Honest-0.210
aught-0.205
oping-0.203
uts-0.199
Token learning
Token ID4673
Feature activation-6.45e-01

Pos logit contributions
 Cipher+1.296
 dyn+1.221
anke+1.117
 Axis+1.115
SHIP+1.095
Neg logit contributions
brid-1.530
Honest-1.415
aught-1.376
oping-1.361
ール-1.338
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
 Cipher+1.872
 dyn+1.767
 Axis+1.610
anke+1.605
SHIP+1.559
Neg logit contributions
brid-2.396
Honest-2.230
aught-2.155
oping-2.143
ール-2.122
Token Under
Token ID4698
Feature activation+1.46e-01

Pos logit contributions
brid+0.052
Honest+0.047
aught+0.046
oping+0.046
oped+0.045
Neg logit contributions
 Cipher-0.053
 dyn-0.049
 Axis-0.046
anke-0.046
SHIP-0.045
Token the
Token ID262
Feature activation+2.39e-02

Pos logit contributions
brid+0.474
Honest+0.436
aught+0.423
oping+0.419
ール+0.411
Neg logit contributions
 Cipher-0.492
 dyn-0.466
anke-0.431
 Axis-0.430
SHIP-0.424
Token reign
Token ID13580
Feature activation-8.68e-02

Pos logit contributions
brid+0.081
Honest+0.074
aught+0.072
oping+0.071
ール+0.070
Neg logit contributions
 Cipher-0.078
 dyn-0.074
 Axis-0.068
anke-0.068
SHIP-0.066
Token of
Token ID286
Feature activation-2.94e-01

Pos logit contributions
 Cipher+0.231
 dyn+0.219
anke+0.196
 Axis+0.194
SHIP+0.193
Neg logit contributions
brid-0.344
Honest-0.322
aught-0.312
oping-0.308
ール-0.305
Token social
Token ID1919
Feature activation-4.10e-01

Pos logit contributions
 Cipher+0.947
 dyn+0.900
 Axis+0.823
anke+0.819
SHIP+0.803
Neg logit contributions
brid-1.002
Honest-0.922
aught-0.896
oping-0.888
ール-0.877
Token
Token ID198
Feature activation-2.04e-01

Pos logit contributions
 Cipher+0.942
 dyn+0.901
 Axis+0.819
anke+0.811
SHIP+0.802
Neg logit contributions
brid-1.205
Honest-1.106
aught-1.090
oping-1.089
oped-1.072
TokenMiami
Token ID41191
Feature activation+3.57e-01

Pos logit contributions
 Cipher+0.629
 dyn+0.591
SHIP+0.553
anke+0.551
 Axis+0.540
Neg logit contributions
brid-0.708
Honest-0.641
aught-0.640
oping-0.635
ール-0.621
Token Dolphins
Token ID21668
Feature activation+1.93e-01

Pos logit contributions
brid+1.202
Honest+1.104
aught+1.074
oping+1.062
ール+1.045
Neg logit contributions
 Cipher-1.173
 dyn-1.109
anke-1.023
 Axis-1.022
SHIP-1.004
Token:
Token ID25
Feature activation+1.89e-01

Pos logit contributions
brid+0.604
Honest+0.551
aught+0.536
oping+0.534
uts+0.526
Neg logit contributions
 Cipher-0.668
 dyn-0.632
anke-0.598
 Axis-0.587
SHIP-0.579
Token DE
Token ID5550
Feature activation+4.58e-01

Pos logit contributions
brid+0.644
Honest+0.591
aught+0.577
oping+0.570
ール+0.560
Neg logit contributions
 Cipher-0.616
 dyn-0.579
 Axis-0.535
anke-0.534
SHIP-0.523
Token Cameron
Token ID10821
Feature activation-6.57e-01

Pos logit contributions
brid+1.541
Honest+1.416
aught+1.384
oping+1.357
uts+1.340
Neg logit contributions
 Cipher-1.488
 dyn-1.407
anke-1.309
 Axis-1.292
SHIP-1.284
Token Wake
Token ID20441
Feature activation-1.41e-01

Pos logit contributions
 Cipher+2.147
 dyn+1.974
 Axis+1.860
anke+1.823
SHIP+1.767
Neg logit contributions
brid-2.251
Honest-2.082
aught-2.006
oping-1.994
ール-1.976
Token
Token ID198
Feature activation-1.61e-01

Pos logit contributions
 Cipher+0.495
 dyn+0.469
anke+0.437
 Axis+0.435
SHIP+0.430
Neg logit contributions
brid-0.445
Honest-0.404
aught-0.395
oping-0.390
ール-0.382
Token
Token ID198
Feature activation-4.57e-01

Pos logit contributions
 Cipher+0.476
 dyn+0.454
 Axis+0.414
anke+0.410
SHIP+0.404
Neg logit contributions
brid-0.584
Honest-0.536
aught-0.527
oping-0.524
oped-0.516
TokenContract
Token ID45845
Feature activation-1.54e+00

Pos logit contributions
 Cipher+1.883
 dyn+1.800
anke+1.694
 Axis+1.689
SHIP+1.669
Neg logit contributions
brid-1.151
Honest-1.024
aught-0.988
oping-0.973
ール-0.948
Token:
Token ID25
Feature activation-9.00e-02

Pos logit contributions
 Cipher+3.877
 dyn+3.679
 Axis+3.315
SHIP+3.244
anke+3.176
Neg logit contributions
brid-6.091
Honest-5.588
oppers-5.541
oping-5.509
aught-5.480
Token{
Token ID90
Feature activation-1.46e-01

Pos logit contributions
 Cipher+1.113
 dyn+1.056
anke+0.974
 Axis+0.970
SHIP+0.956
Neg logit contributions
brid-1.076
Honest-0.987
aught-0.952
oping-0.942
ール-0.935
Tokeni
Token ID72
Feature activation-4.89e-01

Pos logit contributions
 Cipher+0.586
 dyn+0.560
anke+0.525
 Axis+0.524
SHIP+0.517
Neg logit contributions
brid-0.381
Honest-0.341
aught-0.329
oping-0.324
ール-0.317
Token}
Token ID92
Feature activation-6.57e-01

Pos logit contributions
 Cipher+1.707
 dyn+1.624
 Axis+1.502
anke+1.498
SHIP+1.479
Neg logit contributions
brid-1.535
Honest-1.404
aught-1.359
oping-1.346
ール-1.321
Token "
Token ID366
Feature activation+5.35e-01

Pos logit contributions
 Cipher+2.057
 dyn+1.958
 Axis+1.784
anke+1.781
SHIP+1.725
Neg logit contributions
brid-2.292
Honest-2.130
aught-2.037
ール-2.018
oping-2.018
Token}
Token ID92
Feature activation-4.17e-01

Pos logit contributions
brid+1.923
Honest+1.769
aught+1.738
oping+1.718
oped+1.689
Neg logit contributions
 Cipher-1.623
 dyn-1.513
anke-1.415
 Axis-1.395
SHIP-1.378
Token
Token ID198
Feature activation+3.04e-02

Pos logit contributions
 Cipher+1.383
 dyn+1.315
 Axis+1.202
anke+1.200
SHIP+1.171
Neg logit contributions
brid-1.384
Honest-1.276
aught-1.229
oping-1.217
ール-1.210
Token
Token ID198
Feature activation+1.46e-01

Pos logit contributions
brid+0.109
Honest+0.100
aught+0.098
oping+0.097
ール+0.097
Neg logit contributions
 Cipher-0.092
 dyn-0.088
 Axis-0.080
anke-0.079
SHIP-0.078
Tokenand
Token ID392
Feature activation+3.21e-01

Pos logit contributions
brid+0.435
Honest+0.396
aught+0.385
oping+0.382
ール+0.376
Neg logit contributions
 Cipher-0.532
 dyn-0.503
anke-0.472
 Axis-0.465
SHIP-0.463
Token the
Token ID262
Feature activation+6.44e-01

Pos logit contributions
brid+1.064
Honest+0.970
aught+0.945
oping+0.941
oped+0.921
Neg logit contributions
 Cipher-1.059
 dyn-1.001
anke-0.935
 Axis-0.930
SHIP-0.924
Token do
Token ID466
Feature activation+4.00e-01

Pos logit contributions
brid+2.440
Honest+2.233
aught+2.191
oping+2.188
oped+2.153
Neg logit contributions
 Cipher-1.799
 dyn-1.676
anke-1.573
SHIP-1.552
 Axis-1.541
Token..
Token ID492
Feature activation-7.41e-01

Pos logit contributions
brid+1.464
Honest+1.342
aught+1.318
oping+1.315
oped+1.290
Neg logit contributions
 Cipher-1.192
 dyn-1.117
anke-1.035
 Axis-1.026
SHIP-1.012
Token the
Token ID262
Feature activation-1.68e-01

Pos logit contributions
 Cipher+1.561
 dyn+1.473
anke+1.369
 Axis+1.360
SHIP+1.340
Neg logit contributions
brid-1.548
Honest-1.417
aught-1.382
oping-1.367
ール-1.339
Token most
Token ID749
Feature activation+2.14e-01

Pos logit contributions
 Cipher+0.547
 dyn+0.520
 Axis+0.480
anke+0.473
SHIP+0.465
Neg logit contributions
brid-0.566
Honest-0.523
aught-0.506
oping-0.503
ール-0.497
Token useful
Token ID4465
Feature activation-8.49e-01

Pos logit contributions
brid+0.672
Honest+0.613
aught+0.596
oping+0.590
ール+0.580
Neg logit contributions
 Cipher-0.749
 dyn-0.714
 Axis-0.661
anke-0.658
SHIP-0.646
Token things
Token ID1243
Feature activation+2.68e-01

Pos logit contributions
 Cipher+2.319
 dyn+2.223
 Axis+2.011
anke+1.945
SHIP+1.932
Neg logit contributions
brid-3.255
Honest-3.058
aught-2.938
oping-2.937
ール-2.913
Token to
Token ID284
Feature activation+7.79e-02

Pos logit contributions
brid+0.805
Honest+0.731
aught+0.722
oping+0.706
ingly+0.702
Neg logit contributions
 Cipher-0.977
 dyn-0.916
anke-0.862
 Axis-0.852
SHIP-0.841
Token know
Token ID760
Feature activation-6.52e-01

Pos logit contributions
brid+0.302
Honest+0.281
aught+0.273
oping+0.271
ール+0.268
Neg logit contributions
 Cipher-0.215
 dyn-0.203
 Axis-0.183
anke-0.183
SHIP-0.180
Token how
Token ID703
Feature activation+2.91e-02

Pos logit contributions
 Cipher+1.947
 dyn+1.856
 Axis+1.692
anke+1.678
SHIP+1.669
Neg logit contributions
brid-2.356
Honest-2.181
aught-2.096
oping-2.082
ール-2.066
Token to
Token ID284
Feature activation+7.89e-02

Pos logit contributions
brid+0.095
Honest+0.086
aught+0.085
oping+0.084
ingly+0.083
Neg logit contributions
 Cipher-0.100
 dyn-0.093
anke-0.087
 Axis-0.086
SHIP-0.085
Token do
Token ID466
Feature activation-5.03e-01

Pos logit contributions
brid+0.252
Honest+0.230
aught+0.223
oping+0.221
ール+0.217
Neg logit contributions
 Cipher-0.272
 dyn-0.259
anke-0.240
 Axis-0.239
SHIP-0.236
Token on
Token ID319
Feature activation-5.03e-01

Pos logit contributions
 Cipher+1.349
 dyn+1.278
 Axis+1.148
anke+1.141
SHIP+1.124
Neg logit contributions
brid-1.976
Honest-1.838
aught-1.788
oping-1.770
ール-1.750
Token a
Token ID257
Feature activation-3.98e-01

Pos logit contributions
 Cipher+1.464
 dyn+1.386
 Axis+1.263
anke+1.254
SHIP+1.236
Neg logit contributions
brid-1.863
Honest-1.727
aught-1.678
oping-1.668
ール-1.642
Token from
Token ID422
Feature activation+4.12e-01

Pos logit contributions
brid+1.735
Honest+1.579
aught+1.550
oping+1.533
ingly+1.516
Neg logit contributions
 Cipher-1.645
 dyn-1.534
anke-1.435
 Axis-1.409
SHIP-1.407
Token previous
Token ID2180
Feature activation+5.53e-01

Pos logit contributions
brid+1.368
Honest+1.247
aught+1.222
oping+1.208
uts+1.182
Neg logit contributions
 Cipher-1.363
 dyn-1.280
anke-1.199
 Axis-1.178
SHIP-1.171
Token missions
Token ID10566
Feature activation+1.04e+00

Pos logit contributions
brid+2.026
Honest+1.865
aught+1.829
oping+1.813
uts+1.780
Neg logit contributions
 Cipher-1.647
 dyn-1.534
anke-1.429
 Axis-1.404
SHIP-1.391
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
brid+3.510
Honest+3.164
aught+3.122
oping+3.112
ingly+3.052
Neg logit contributions
 Cipher-3.353
 dyn-3.129
anke-2.919
 Axis-2.884
SHIP-2.869
Token The
Token ID383
Feature activation+1.42e-01

Pos logit contributions
 Cipher+0.451
 dyn+0.424
 Axis+0.395
anke+0.392
SHIP+0.385
Neg logit contributions
brid-0.433
Honest-0.395
aught-0.385
oping-0.382
oped-0.374
Token rock
Token ID3881
Feature activation-1.45e+00

Pos logit contributions
brid+0.496
Honest+0.457
aught+0.445
oping+0.440
ール+0.433
Neg logit contributions
 Cipher-0.447
 dyn-0.422
anke-0.388
 Axis-0.387
SHIP-0.380
Token also
Token ID635
Feature activation-2.93e-01

Pos logit contributions
 Cipher+4.043
 dyn+3.836
 Axis+3.529
anke+3.503
SHIP+3.325
Neg logit contributions
brid-5.349
Honest-5.077
ール-4.835
aught-4.832
oping-4.738
Token resembles
Token ID22960
Feature activation-3.60e-01

Pos logit contributions
 Cipher+0.801
 dyn+0.755
anke+0.682
 Axis+0.680
SHIP+0.659
Neg logit contributions
brid-1.143
Honest-1.069
aught-1.035
oping-1.022
ール-1.018
Token some
Token ID617
Feature activation-5.11e-01

Pos logit contributions
 Cipher+1.088
 dyn+1.029
 Axis+0.941
anke+0.937
SHIP+0.916
Neg logit contributions
brid-1.302
Honest-1.205
aught-1.175
oping-1.162
ール-1.147
Token unusual
Token ID8468
Feature activation-6.56e-01

Pos logit contributions
 Cipher+1.467
 dyn+1.383
 Axis+1.259
anke+1.250
SHIP+1.224
Neg logit contributions
brid-1.929
Honest-1.790
aught-1.743
oping-1.724
ール-1.703
Token rocks
Token ID12586
Feature activation-7.32e-01

Pos logit contributions
 Cipher+1.825
 dyn+1.723
 Axis+1.565
anke+1.521
SHIP+1.497
Neg logit contributions
brid-2.535
Honest-2.371
aught-2.304
oping-2.274
ール-2.257
Tokenal
Token ID282
Feature activation-6.58e-03

Pos logit contributions
 Cipher+2.324
 dyn+2.305
anke+2.184
SHIP+2.072
 Axis+2.051
Neg logit contributions
brid-2.648
Honest-2.444
ール-2.386
oping-2.328
oppers-2.322
Token
Token ID447
Feature activation-4.18e-01

Pos logit contributions
 Cipher+0.022
 dyn+0.021
anke+0.020
 Axis+0.020
SHIP+0.019
Neg logit contributions
brid-0.021
Honest-0.020
aught-0.019
oping-0.019
ール-0.018
Token
Token ID247
Feature activation-3.79e-01

Pos logit contributions
 Cipher+1.570
 dyn+1.519
anke+1.424
SHIP+1.401
 Axis+1.393
Neg logit contributions
brid-1.186
Honest-1.046
oping-1.015
aught-1.014
uts-0.981
Tokens
Token ID82
Feature activation-3.05e-01

Pos logit contributions
 Cipher+1.072
 dyn+1.003
anke+0.919
 Axis+0.912
SHIP+0.900
Neg logit contributions
brid-1.440
Honest-1.336
aught-1.305
oping-1.290
ール-1.270
Token latest
Token ID3452
Feature activation-3.25e-01

Pos logit contributions
 Cipher+0.948
 dyn+0.901
 Axis+0.826
anke+0.810
SHIP+0.799
Neg logit contributions
brid-1.075
Honest-0.996
aught-0.973
oping-0.955
uts-0.939
Token press
Token ID1803
Feature activation-1.48e+00

Pos logit contributions
 Cipher+1.001
 dyn+0.951
 Axis+0.874
anke+0.850
SHIP+0.843
Neg logit contributions
brid-1.158
Honest-1.069
aught-1.044
oping-1.026
uts-1.011
Token tour
Token ID4205
Feature activation-6.17e-02

Pos logit contributions
 Cipher+4.002
 dyn+3.800
anke+3.417
 Axis+3.351
SHIP+3.253
Neg logit contributions
brid-5.659
Honest-5.269
aught-5.099
oping-5.040
oppers-5.013
Token,
Token ID11
Feature activation-4.22e-01

Pos logit contributions
 Cipher+0.182
 dyn+0.171
anke+0.157
 Axis+0.157
SHIP+0.154
Neg logit contributions
brid-0.227
Honest-0.210
aught-0.206
oping-0.203
ール-0.200
Token too
Token ID1165
Feature activation+1.55e-01

Pos logit contributions
 Cipher+1.388
 dyn+1.320
anke+1.224
 Axis+1.217
SHIP+1.184
Neg logit contributions
brid-1.402
Honest-1.293
aught-1.258
oping-1.229
ール-1.222
Token,
Token ID11
Feature activation-2.55e-01

Pos logit contributions
brid+0.518
Honest+0.473
aught+0.461
oping+0.459
ingly+0.450
Neg logit contributions
 Cipher-0.511
 dyn-0.481
 Axis-0.445
anke-0.444
SHIP-0.440
Token with
Token ID351
Feature activation-3.05e-01

Pos logit contributions
 Cipher+0.804
 dyn+0.763
anke+0.705
 Axis+0.701
SHIP+0.682
Neg logit contributions
brid-0.882
Honest-0.818
aught-0.795
oping-0.777
ール-0.773
Tokencz
Token ID26691
Feature activation-8.92e-02

Pos logit contributions
 Cipher+1.176
 dyn+1.121
anke+1.054
 Axis+1.043
SHIP+1.023
Neg logit contributions
brid-0.958
Honest-0.894
aught-0.854
oping-0.832
ール-0.830
Tokenynski
Token ID40008
Feature activation-3.96e-01

Pos logit contributions
 Cipher+0.270
 dyn+0.256
anke+0.236
 Axis+0.233
SHIP+0.230
Neg logit contributions
brid-0.320
Honest-0.298
aught-0.288
oping-0.286
ール-0.282
Token,
Token ID11
Feature activation-1.67e-01

Pos logit contributions
 Cipher+1.174
 dyn+1.108
 Axis+1.012
anke+1.004
SHIP+0.985
Neg logit contributions
brid-1.440
Honest-1.345
aught-1.293
oping-1.285
ール-1.276
Token Marc
Token ID13067
Feature activation-1.64e+00

Pos logit contributions
 Cipher+0.527
 dyn+0.495
 Axis+0.456
anke+0.456
SHIP+0.446
Neg logit contributions
brid-0.586
Honest-0.540
aught-0.526
oping-0.520
ール-0.512
Token R
Token ID371
Feature activation-2.51e-01

Pos logit contributions
 Cipher+4.953
 dyn+4.634
 Axis+4.229
anke+4.194
SHIP+4.135
Neg logit contributions
brid-5.505
Honest-5.220
ール-5.026
aught-4.939
oping-4.876
Tokenz
Token ID89
Feature activation-1.58e+00

Pos logit contributions
 Cipher+0.852
 dyn+0.806
anke+0.749
 Axis+0.746
SHIP+0.734
Neg logit contributions
brid-0.813
Honest-0.743
aught-0.723
oping-0.715
ール-0.702
Tokenep
Token ID538
Feature activation-9.13e-01

Pos logit contributions
 Cipher+3.837
 dyn+3.662
anke+3.566
 Axis+3.159
SHIP+3.133
Neg logit contributions
Honest-5.963
brid-5.897
ール-5.601
spection-5.397
aught-5.389
Tokency
Token ID948
Feature activation+8.38e-02

Pos logit contributions
 Cipher+2.910
 dyn+2.821
anke+2.616
 Axis+2.504
SHIP+2.457
Neg logit contributions
brid-2.984
Honest-2.895
ール-2.714
aught-2.693
oping-2.612
Tokenns
Token ID5907
Feature activation-3.81e-01

Pos logit contributions
brid+0.269
Honest+0.246
aught+0.239
oping+0.236
ール+0.232
Neg logit contributions
 Cipher-0.288
 dyn-0.272
anke-0.253
 Axis-0.252
SHIP-0.248
Tokenki
Token ID4106
Feature activation+4.25e-01

Pos logit contributions
 Cipher+1.180
 dyn+1.113
anke+1.025
 Axis+1.018
SHIP+0.989
Neg logit contributions
brid-1.351
Honest-1.249
aught-1.216
oping-1.195
ール-1.182
Token,
Token ID11
Feature activation-5.49e-01

Pos logit contributions
brid+1.568
Honest+1.450
aught+1.416
oping+1.402
ール+1.379
Neg logit contributions
 Cipher-1.258
 dyn-1.181
anke-1.082
 Axis-1.077
SHIP-1.058
Token do
Token ID466
Feature activation-4.89e-01

Pos logit contributions
brid+0.142
Honest+0.132
aught+0.128
oping+0.126
ール+0.126
Neg logit contributions
 Cipher-0.117
 dyn-0.111
 Axis-0.100
anke-0.099
SHIP-0.098
Token X
Token ID1395
Feature activation-1.79e+00

Pos logit contributions
 Cipher+1.647
 dyn+1.559
 Axis+1.436
anke+1.418
SHIP+1.388
Neg logit contributions
brid-1.602
Honest-1.474
aught-1.422
oping-1.403
ール-1.399
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
 Cipher+5.460
 dyn+5.275
 Axis+4.861
SHIP+4.658
anke+4.616
Neg logit contributions
brid-6.061
Honest-5.686
aught-5.390
ール-5.354
oping-5.336
Token Y
Token ID575
Feature activation-1.58e+00

Pos logit contributions
 Cipher+0.316
 dyn+0.297
 Axis+0.270
anke+0.268
SHIP+0.263
Neg logit contributions
brid-0.429
Honest-0.398
aught-0.388
oping-0.385
ール-0.379
Token and
Token ID290
Feature activation-4.67e-01

Pos logit contributions
 Cipher+4.838
 dyn+4.613
 Axis+4.252
anke+4.185
SHIP+4.094
Neg logit contributions
brid-5.538
Honest-5.147
aught-4.981
oping-4.918
ール-4.853
Token Z
Token ID1168
Feature activation-2.13e+00

Pos logit contributions
 Cipher+1.401
 dyn+1.309
 Axis+1.210
anke+1.190
SHIP+1.159
Neg logit contributions
brid-1.713
Honest-1.589
aught-1.546
oping-1.522
uts-1.497
Token.
Token ID13
Feature activation+6.63e-02

Pos logit contributions
 Cipher+5.956
 dyn+5.691
 Axis+5.191
SHIP+4.804
 worsen+4.708
Neg logit contributions
brid-7.806
Honest-7.348
aught-7.057
ール-6.961
oping-6.955
Token Through
Token ID9561
Feature activation+8.53e-01

Pos logit contributions
brid+0.221
Honest+0.199
aught+0.196
oping+0.196
oped+0.192
Neg logit contributions
 Cipher-0.222
 dyn-0.208
 Axis-0.193
anke-0.191
SHIP-0.189
Token this
Token ID428
Feature activation+1.22e-01

Pos logit contributions
brid+2.948
Honest+2.704
aught+2.643
oping+2.612
uts+2.575
Neg logit contributions
 Cipher-2.683
 dyn-2.518
anke-2.368
 Axis-2.336
SHIP-2.318
Token practice
Token ID3357
Feature activation+2.54e-01

Pos logit contributions
brid+0.483
Honest+0.449
aught+0.438
oping+0.434
ール+0.431
Neg logit contributions
 Cipher-0.328
 dyn-0.306
 Axis-0.275
anke-0.274
SHIP-0.269
Token,
Token ID11
Feature activation-2.70e-01

Pos logit contributions
brid+0.836
Honest+0.768
aught+0.753
oping+0.742
ingly+0.727
Neg logit contributions
 Cipher-0.850
 dyn-0.800
anke-0.748
 Axis-0.738
SHIP-0.727
Token just
Token ID655
Feature activation-4.29e-01

Pos logit contributions
 Cipher+0.826
 dyn+0.779
anke+0.708
 Axis+0.707
SHIP+0.692
Neg logit contributions
brid-1.103
Honest-1.031
aught-0.998
oping-0.989
ール-0.979
Token two
Token ID734
Feature activation-2.01e-01

Pos logit contributions
 Cipher+1.272
 dyn+1.207
anke+1.104
 Axis+1.097
SHIP+1.069
Neg logit contributions
brid-1.570
Honest-1.454
aught-1.413
oping-1.393
ール-1.390
Token minutes
Token ID2431
Feature activation+1.19e-01

Pos logit contributions
 Cipher+0.583
 dyn+0.546
anke+0.500
 Axis+0.498
SHIP+0.489
Neg logit contributions
brid-0.750
Honest-0.694
aught-0.679
oping-0.673
ール-0.661
Token earlier
Token ID2961
Feature activation+9.73e-01

Pos logit contributions
brid+0.449
Honest+0.415
aught+0.406
oping+0.402
ール+0.395
Neg logit contributions
 Cipher-0.343
 dyn-0.321
anke-0.293
 Axis-0.292
SHIP-0.287
Token when
Token ID618
Feature activation-2.16e-01

Pos logit contributions
brid+3.387
Honest+3.083
aught+3.051
oping+3.009
ingly+2.982
Neg logit contributions
 Cipher-3.048
 dyn-2.853
anke-2.641
 Axis-2.632
SHIP-2.603
Token Danny
Token ID15105
Feature activation-1.60e+00

Pos logit contributions
 Cipher+0.679
 dyn+0.640
anke+0.589
 Axis+0.588
SHIP+0.576
Neg logit contributions
brid-0.756
Honest-0.696
aught-0.678
oping-0.669
ール-0.660
Token Kane
Token ID20055
Feature activation+4.98e-01

Pos logit contributions
 Cipher+5.136
 dyn+4.770
 Axis+4.416
anke+4.319
SHIP+4.211
Neg logit contributions
brid-5.349
Honest-5.005
aught-4.740
ール-4.719
oping-4.661
Token hit
Token ID2277
Feature activation+6.11e-01

Pos logit contributions
brid+1.830
Honest+1.690
aught+1.653
oping+1.635
ール+1.606
Neg logit contributions
 Cipher-1.479
 dyn-1.387
anke-1.272
 Axis-1.267
SHIP-1.245
Token the
Token ID262
Feature activation-1.99e-01

Pos logit contributions
brid+2.054
Honest+1.881
aught+1.836
oping+1.820
uts+1.791
Neg logit contributions
 Cipher-1.997
 dyn-1.874
anke-1.740
 Axis-1.731
SHIP-1.716
Token home
Token ID1363
Feature activation+7.21e-02

Pos logit contributions
 Cipher+0.628
 dyn+0.610
 Axis+0.561
anke+0.554
SHIP+0.539
Neg logit contributions
brid-0.686
Honest-0.639
aught-0.615
oping-0.608
ール-0.605
Token post
Token ID1281
Feature activation+1.86e-01

Pos logit contributions
brid+0.248
Honest+0.232
aught+0.223
oping+0.221
ール+0.219
Neg logit contributions
 Cipher-0.227
 dyn-0.217
 Axis-0.199
anke-0.198
SHIP-0.194