TOP ACTIVATIONS
MAX = 2.810

Target ing Computer found within Luke Skywalker s X
're only interested in The Force Awakens . In terms of
, best known for playing Luke Skywalker in the Star Wars
The Guardian s Luke Harding , Stephanie Kir ch
a 90 F OV even double or 4 x 1080 P
the day after the final Force Awakens trailer premiered ( along
To dominate the world by force , wielding nuclear weapons ,
at the end of West Fif teenth Street . Built in
( pictured ), the Latin Academy , some areas in the
Examiner , Donald Trump Donald John Trump READ : Cohen testimony
find them . Advertisement The Big Three
n h ts a . dot . gov / people /
to fly by comet 19 P / B or rell y
repeatedly criticized President Trump Donald John Trump House committee believes it
, investors risk paying an artificially high price .
is , Hitchcock enlisted his g ifted title designer , Saul
Air Station in South We ym outh . This is an
, Joyce Foundation : " R oc kefeller Family Fund ,
exit polls . Christ ie s showing
, such as robotics , artificial intelligence , 3 - D

MIN ACTIVATIONS
MIN = -2.750

your ? s for the Dub s by posting a video
the moment , Cher rene says .
, Mal anch ak and Mak ovi ich uk , the
The property now belongs to Cit ib ank .
names also existed among the Turk ic and Mongol peoples (
one an Iranian - born Turk named Ali Kent . The
in a cable that Pr é val w ants
had 21 Fantasy points vs MIN in Week 8 .
+) "] ' ) AS version , C OUNT ( *
. Right now Dub ov ick has most of
who holds dual British and Pakistani citizenship , said of the
the reply for four . Cher rene and I exchange worried
the decision making and document sharing wasn 't done timely .
( O mega Pharma - Quick step ) s . t
also about forced sex , Zak ov itch suggests literary ass
M ud and sand mode for muddy and sandy
is ar Malik , a Pakistani journalist , and Mark Smith
ART () above is the version number . I want to
the other . See , Zak ov itch ,
} } Catch { Write - Warning - Message " F

INTERVAL 1.420 - 2.810
CONTAINS 0.176%

v 4 against pre - agh s inv oker is crucial
t live in an Orwell ian police state ,
environments . ' Very little was known about the native monkey
the years . Tom C ray was one of
was governor . Advertisement Continue reading the main story

INTERVAL 0.030 - 1.420
CONTAINS 46.865%

and explore the world with kindred spirits . Sign
irect ed edges *) graph = Fl atten [ Table [
and sound " too good to be true ." It 's
like to top mine with frozen cher ries , which are
wrong in principle with this item , but they cost $

INTERVAL -1.360 - 0.030
CONTAINS 52.675%

had ingested them directly . This fact in and of itself
a big fan of Facebook , I don t
more bass slam and better imaging . But in terms of
, located little more than 2 miles away , and called
never know whether people are getting more reasonable over time ,

INTERVAL -2.750 - -1.360
CONTAINS 0.283%

trigger the flashbacks , the memories , the nightmares it
Isaac Newton H agan , friends of the Kaufman ns (
mission , including the Comet Bor rell y encounter .[ 17
pieces : [ Option < Piece > ; 64 ], //
M ud and sand mode for muddy and sandy
Token Target
Token ID12744
Feature activation-8.67e-01

Pos logit contributions
+0.382
enegger+0.321
+0.311
6000+0.305
Son+0.303
Neg logit contributions
@#&-0.320
 deciding-0.310
hemy-0.290
igating-0.288
 offense-0.288
Tokening
Token ID278
Feature activation-3.88e-01

Pos logit contributions
@#&+2.902
 deciding+2.852
 offense+2.605
 automatic+2.559
 Answer+2.543
Neg logit contributions
-5.536
enegger-4.804
-4.659
6000-4.562
Son-4.555
Token Computer
Token ID13851
Feature activation+6.53e-02

Pos logit contributions
@#&+1.561
 deciding+1.519
 offense+1.400
hemy+1.395
igating+1.380
Neg logit contributions
-2.236
enegger-1.901
-1.849
6000-1.817
Son-1.812
Token found
Token ID1043
Feature activation+1.48e-02

Pos logit contributions
+0.370
enegger+0.315
+0.306
6000+0.300
Son+0.300
Neg logit contributions
@#&-0.267
 deciding-0.258
hemy-0.241
igating-0.239
 offense-0.237
Token within
Token ID1626
Feature activation+5.02e-02

Pos logit contributions
+0.080
enegger+0.068
+0.065
6000+0.064
Son+0.064
Neg logit contributions
@#&-0.065
 deciding-0.063
 offense-0.059
hemy-0.058
 Answer-0.058
Token Luke
Token ID11336
Feature activation+2.81e+00

Pos logit contributions
+0.301
enegger+0.256
+0.249
Son+0.244
6000+0.243
Neg logit contributions
@#&-0.190
 deciding-0.184
hemy-0.170
 offense-0.168
igating-0.167
Token Skywalker
Token ID29715
Feature activation+1.20e+00

Pos logit contributions
+11.821
+9.616
Son+9.155
+8.563
6000+8.522
Neg logit contributions
@#&-12.458
hemy-12.261
lege-11.675
 deciding-11.461
 offense-11.333
Token
Token ID447
Feature activation+4.40e-01

Pos logit contributions
+5.318
+4.277
enegger+4.251
Son+4.146
6000+3.984
Neg logit contributions
@#&-6.036
 deciding-5.678
hemy-5.667
igating-5.491
 offense-5.472
Token
Token ID247
Feature activation+4.22e-01

Pos logit contributions
+2.034
+1.614
enegger+1.608
Son+1.557
Lago+1.545
Neg logit contributions
@#&-2.232
 deciding-2.207
 offense-2.064
igating-2.042
hemy-2.036
Tokens
Token ID82
Feature activation+9.50e-01

Pos logit contributions
+2.195
enegger+1.809
+1.788
Son+1.750
6000+1.735
Neg logit contributions
@#&-1.909
 deciding-1.834
hemy-1.735
 offense-1.726
igating-1.706
Token X
Token ID1395
Feature activation+1.16e+00

Pos logit contributions
+5.058
enegger+4.209
+4.140
6000+4.053
Son+4.039
Neg logit contributions
@#&-4.082
 deciding-3.828
hemy-3.739
igating-3.637
lege-3.538
Token're
Token ID821
Feature activation-2.04e-01

Pos logit contributions
@#&+1.010
 deciding+1.002
 offense+0.916
 automatic+0.900
 recall+0.891
Neg logit contributions
-1.902
enegger-1.646
-1.597
Son-1.562
DragonMagazine-1.560
Token only
Token ID691
Feature activation-5.80e-01

Pos logit contributions
@#&+0.751
 deciding+0.740
 offense+0.680
 automatic+0.670
 Answer+0.665
Neg logit contributions
-1.235
enegger-1.063
-1.030
6000-1.006
Son-1.005
Token interested
Token ID4609
Feature activation-1.37e-01

Pos logit contributions
 deciding+2.134
@#&+2.063
 offense+1.934
 automatic+1.917
 Answer+1.885
Neg logit contributions
-3.544
enegger-3.061
-2.955
DragonMagazine-2.885
Lago-2.869
Token in
Token ID287
Feature activation-2.86e-01

Pos logit contributions
@#&+0.611
 deciding+0.588
hemy+0.551
 offense+0.550
 automatic+0.545
Neg logit contributions
-0.723
enegger-0.610
-0.589
Son-0.575
6000-0.574
Token The
Token ID383
Feature activation+1.52e-01

Pos logit contributions
@#&+1.111
 deciding+1.095
 offense+1.012
 automatic+0.996
 Answer+0.991
Neg logit contributions
-1.676
enegger-1.434
-1.387
6000-1.356
Son-1.353
Token Force
Token ID5221
Feature activation+2.39e+00

Pos logit contributions
+0.773
enegger+0.646
+0.620
Son+0.612
6000+0.606
Neg logit contributions
@#&-0.713
 deciding-0.682
 offense-0.641
hemy-0.637
igating-0.635
Token Awakens
Token ID39613
Feature activation+5.81e-01

Pos logit contributions
+8.590
enegger+6.992
Son+6.591
6000+6.569
Marvel+6.320
Neg logit contributions
@#&-11.942
hemy-11.355
igating-11.154
 deciding-11.085
lege-10.905
Token.
Token ID13
Feature activation+4.20e-01

Pos logit contributions
+2.691
enegger+2.261
+2.123
Son+2.100
6000+2.088
Neg logit contributions
@#&-2.922
 deciding-2.745
igating-2.635
hemy-2.616
 offense-2.585
Token In
Token ID554
Feature activation+7.96e-02

Pos logit contributions
+2.493
enegger+2.141
+2.072
Son+2.056
6000+2.051
Neg logit contributions
@#&-1.584
 deciding-1.550
 offense-1.434
hemy-1.425
 automatic-1.414
Token terms
Token ID2846
Feature activation+4.96e-01

Pos logit contributions
+0.554
enegger+0.486
+0.474
6000+0.467
Son+0.467
Neg logit contributions
@#&-0.222
 deciding-0.209
 offense-0.188
hemy-0.187
igating-0.185
Token of
Token ID286
Feature activation+1.23e-01

Pos logit contributions
+2.911
enegger+2.484
+2.421
6000+2.379
Son+2.374
Neg logit contributions
@#&-1.915
 deciding-1.853
 offense-1.714
hemy-1.691
igating-1.686
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
@#&+1.404
 deciding+1.381
 offense+1.290
 automatic+1.273
 Answer+1.267
Neg logit contributions
-1.674
enegger-1.407
-1.356
6000-1.323
Son-1.320
Token best
Token ID1266
Feature activation-6.39e-01

Pos logit contributions
@#&+0.403
 deciding+0.385
hemy+0.354
 offense+0.354
igating+0.352
Neg logit contributions
-0.687
enegger-0.596
-0.577
Son-0.568
6000-0.566
Token known
Token ID1900
Feature activation-1.73e-01

Pos logit contributions
@#&+2.232
 deciding+2.181
 offense+1.995
 automatic+1.963
 Answer+1.949
Neg logit contributions
-4.006
enegger-3.466
-3.363
6000-3.295
Son-3.290
Token for
Token ID329
Feature activation-6.31e-02

Pos logit contributions
@#&+0.778
 deciding+0.757
 offense+0.706
 automatic+0.701
hemy+0.699
Neg logit contributions
-0.915
enegger-0.773
-0.743
6000-0.725
Son-0.722
Token playing
Token ID2712
Feature activation-3.83e-01

Pos logit contributions
@#&+0.262
 deciding+0.250
 offense+0.234
hemy+0.232
 automatic+0.231
Neg logit contributions
-0.355
enegger-0.303
-0.293
Son-0.286
6000-0.285
Token Luke
Token ID11336
Feature activation+2.36e+00

Pos logit contributions
@#&+1.591
 deciding+1.549
 offense+1.432
hemy+1.421
 automatic+1.415
Neg logit contributions
-2.155
enegger-1.839
-1.771
Son-1.733
6000-1.730
Token Skywalker
Token ID29715
Feature activation+6.10e-01

Pos logit contributions
+6.314
+4.323
Son+4.199
enegger+4.167
 Son+3.839
Neg logit contributions
@#&-14.446
hemy-14.214
lege-13.716
 ado-13.512
 deciding-13.342
Token in
Token ID287
Feature activation-1.77e-02

Pos logit contributions
+3.032
enegger+2.597
+2.459
Son+2.418
6000+2.370
Neg logit contributions
@#&-2.848
 deciding-2.701
hemy-2.596
 offense-2.562
 automatic-2.548
Token the
Token ID262
Feature activation+9.57e-02

Pos logit contributions
@#&+0.078
 deciding+0.076
hemy+0.071
 offense+0.071
igating+0.070
Neg logit contributions
-0.095
enegger-0.080
-0.077
Son-0.075
6000-0.075
Token Star
Token ID2907
Feature activation+4.23e-01

Pos logit contributions
+0.498
enegger+0.416
+0.401
6000+0.393
Son+0.393
Neg logit contributions
@#&-0.436
 deciding-0.420
hemy-0.395
 offense-0.392
igating-0.391
Token Wars
Token ID6176
Feature activation+8.32e-01

Pos logit contributions
+1.787
enegger+1.369
Son+1.365
+1.344
6000+1.319
Neg logit contributions
@#&-2.357
 deciding-2.243
hemy-2.172
 automatic-2.124
igating-2.122
Token The
Token ID383
Feature activation+5.14e-03

Pos logit contributions
+0.654
enegger+0.553
+0.538
Son+0.528
6000+0.527
Neg logit contributions
@#&-0.467
 deciding-0.453
 offense-0.421
hemy-0.419
igating-0.414
Token Guardian
Token ID8283
Feature activation-4.26e-01

Pos logit contributions
+0.028
enegger+0.024
+0.023
Son+0.023
6000+0.023
Neg logit contributions
@#&-0.022
 deciding-0.021
 offense-0.020
hemy-0.020
 automatic-0.019
Token
Token ID447
Feature activation+7.58e-02

Pos logit contributions
@#&+1.968
 deciding+1.925
 offense+1.802
 automatic+1.781
igating+1.777
Neg logit contributions
-2.200
enegger-1.829
-1.772
Son-1.721
6000-1.720
Token
Token ID247
Feature activation+3.90e-02

Pos logit contributions
+0.378
enegger+0.314
+0.303
6000+0.294
Son+0.294
Neg logit contributions
@#&-0.360
 deciding-0.355
 offense-0.332
 automatic-0.328
igating-0.327
Tokens
Token ID82
Feature activation-9.65e-02

Pos logit contributions
+0.215
enegger+0.180
+0.176
Son+0.172
6000+0.171
Neg logit contributions
@#&-0.167
 deciding-0.160
 offense-0.149
 automatic-0.147
hemy-0.147
Token Luke
Token ID11336
Feature activation+2.30e+00

Pos logit contributions
@#&+0.407
 deciding+0.395
 offense+0.366
igating+0.362
hemy+0.362
Neg logit contributions
-0.535
enegger-0.453
-0.439
6000-0.430
Son-0.429
Token Harding
Token ID45732
Feature activation+8.79e-01

Pos logit contributions
+9.573
Son+7.967
 Son+7.815
+7.584
Lago+7.304
Neg logit contributions
@#&-10.446
 deciding-10.284
hemy-9.907
 offense-9.706
 automatic-9.661
Token,
Token ID11
Feature activation+6.49e-01

Pos logit contributions
+4.155
enegger+3.373
+3.337
Son+3.169
+3.161
Neg logit contributions
@#&-4.241
 deciding-4.069
hemy-3.934
 offense-3.862
igating-3.846
Token Stephanie
Token ID26085
Feature activation-1.22e-01

Pos logit contributions
+3.687
enegger+3.127
+3.055
Son+3.003
+2.988
Neg logit contributions
@#&-2.498
 deciding-2.372
hemy-2.281
igating-2.248
 offense-2.234
Token Kir
Token ID7385
Feature activation-4.60e-01

Pos logit contributions
@#&+0.558
 deciding+0.552
 offense+0.515
hemy+0.508
 automatic+0.508
Neg logit contributions
-0.627
enegger-0.524
-0.503
6000-0.497
Son-0.496
Tokench
Token ID354
Feature activation+3.21e-01

Pos logit contributions
@#&+2.011
 deciding+1.975
 offense+1.845
 automatic+1.820
 Answer+1.808
Neg logit contributions
-2.478
enegger-2.094
-2.016
6000-1.967
Son-1.966
Token a
Token ID257
Feature activation+3.35e-01

Pos logit contributions
+0.787
enegger+0.688
+0.666
6000+0.653
Son+0.652
Neg logit contributions
@#&-0.365
 deciding-0.360
 offense-0.326
 automatic-0.321
 Answer-0.318
Token 90
Token ID4101
Feature activation-8.48e-02

Pos logit contributions
+1.677
enegger+1.394
+1.342
6000+1.308
Son+1.303
Neg logit contributions
@#&-1.587
 deciding-1.559
 offense-1.463
 automatic-1.444
igating-1.440
Token F
Token ID376
Feature activation+9.50e-02

Pos logit contributions
@#&+0.333
 deciding+0.330
 offense+0.307
 automatic+0.303
 Answer+0.299
Neg logit contributions
-0.494
enegger-0.425
-0.406
Son-0.400
DragonMagazine-0.398
TokenOV
Token ID8874
Feature activation+2.68e-01

Pos logit contributions
+0.410
enegger+0.335
+0.312
DragonMagazine+0.308
Son+0.301
Neg logit contributions
@#&-0.514
 deciding-0.504
 offense-0.482
 automatic-0.476
 Answer-0.473
Token even
Token ID772
Feature activation+5.73e-01

Pos logit contributions
+1.701
enegger+1.474
+1.429
Son+1.399
6000+1.398
Neg logit contributions
@#&-0.919
 deciding-0.901
 offense-0.823
 automatic-0.811
 Answer-0.803
Token double
Token ID4274
Feature activation+2.26e+00

Pos logit contributions
+4.053
enegger+3.583
+3.498
6000+3.457
Son+3.443
Neg logit contributions
@#&-1.506
 deciding-1.418
 offense-1.258
hemy-1.246
igating-1.244
Token or
Token ID393
Feature activation+5.48e-01

Pos logit contributions
+8.278
6000+6.769
enegger+6.609
+6.383
Son+6.305
Neg logit contributions
@#&-12.037
hemy-11.093
 deciding-10.860
igating-10.691
 Explain-10.547
Token 4
Token ID604
Feature activation+1.87e-01

Pos logit contributions
+2.823
enegger+2.359
+2.290
6000+2.256
Son+2.238
Neg logit contributions
@#&-2.488
 deciding-2.411
 offense-2.267
hemy-2.250
igating-2.248
Tokenx
Token ID87
Feature activation+5.91e-01

Pos logit contributions
+1.090
enegger+0.937
+0.900
DragonMagazine+0.877
Son+0.873
Neg logit contributions
@#&-0.726
 deciding-0.717
 offense-0.663
 automatic-0.654
 Answer-0.654
Token 1080
Token ID17729
Feature activation-4.77e-01

Pos logit contributions
+2.605
enegger+2.102
+2.014
6000+1.960
Son+1.947
Neg logit contributions
@#&-3.156
 deciding-3.107
 offense-2.936
 automatic-2.903
igating-2.899
TokenP
Token ID47
Feature activation+3.77e-01

Pos logit contributions
@#&+2.367
 deciding+2.311
 offense+2.205
 Answer+2.198
 automatic+2.181
Neg logit contributions
-2.220
enegger-1.894
-1.744
DragonMagazine-1.725
Lago-1.659
Token the
Token ID262
Feature activation+2.79e-01

Pos logit contributions
+4.361
enegger+3.745
+3.484
Son+3.407
6000+3.377
Neg logit contributions
@#&-4.849
 deciding-4.565
hemy-4.433
 automatic-4.367
 offense-4.358
Token day
Token ID1110
Feature activation-2.97e-01

Pos logit contributions
+1.450
enegger+1.239
+1.185
Son+1.154
6000+1.153
Neg logit contributions
@#&-1.262
 deciding-1.204
hemy-1.147
 offense-1.131
igating-1.122
Token after
Token ID706
Feature activation+1.08e-01

Pos logit contributions
@#&+1.096
 deciding+1.068
 offense+0.984
 automatic+0.969
 Answer+0.964
Neg logit contributions
-1.802
enegger-1.552
-1.505
6000-1.474
Son-1.473
Token the
Token ID262
Feature activation-1.81e-01

Pos logit contributions
+0.609
enegger+0.518
+0.501
Son+0.494
6000+0.492
Neg logit contributions
@#&-0.448
 deciding-0.419
hemy-0.397
 offense-0.396
igating-0.393
Token final
Token ID2457
Feature activation+8.42e-01

Pos logit contributions
@#&+0.758
 deciding+0.730
 offense+0.680
hemy+0.676
igating+0.675
Neg logit contributions
-1.004
enegger-0.855
-0.825
6000-0.809
Son-0.808
Token Force
Token ID5221
Feature activation+2.25e+00

Pos logit contributions
+4.564
enegger+3.823
+3.775
Son+3.694
6000+3.661
Neg logit contributions
@#&-3.577
hemy-3.263
 deciding-3.190
igating-3.078
lege-3.070
Token Awakens
Token ID39613
Feature activation+5.18e-01

Pos logit contributions
+9.170
Son+7.435
6000+7.325
enegger+7.257
Marvel+7.176
Neg logit contributions
@#&-10.573
hemy-10.073
lege-9.722
 deciding-9.707
igating-9.678
Token trailer
Token ID12268
Feature activation+4.55e-01

Pos logit contributions
+2.625
enegger+2.182
+2.131
Son+2.101
6000+2.088
Neg logit contributions
@#&-2.383
 deciding-2.221
hemy-2.146
igating-2.134
 offense-2.096
Token premiered
Token ID44119
Feature activation+6.23e-02

Pos logit contributions
+2.258
enegger+1.865
+1.803
Son+1.785
6000+1.781
Neg logit contributions
@#&-2.153
 deciding-2.010
hemy-1.945
igating-1.911
 offense-1.892
Token (
Token ID357
Feature activation+1.52e-01

Pos logit contributions
+0.318
enegger+0.265
+0.256
6000+0.250
Son+0.250
Neg logit contributions
@#&-0.290
 deciding-0.279
hemy-0.262
 offense-0.262
 Answer-0.260
Tokenalong
Token ID24176
Feature activation-3.19e-01

Pos logit contributions
+0.816
enegger+0.680
+0.662
Son+0.655
6000+0.652
Neg logit contributions
@#&-0.672
 deciding-0.654
 offense-0.612
 automatic-0.602
hemy-0.600
TokenTo
Token ID2514
Feature activation+5.16e-01

Pos logit contributions
+1.155
enegger+0.967
+0.943
Son+0.925
6000+0.922
Neg logit contributions
@#&-0.892
 deciding-0.863
 offense-0.805
hemy-0.797
 automatic-0.793
Token dominate
Token ID17863
Feature activation+1.28e-01

Pos logit contributions
+3.078
enegger+2.616
+2.585
6000+2.542
Son+2.515
Neg logit contributions
@#&-1.931
 deciding-1.821
hemy-1.701
igating-1.682
 offense-1.657
Token the
Token ID262
Feature activation+3.78e-01

Pos logit contributions
+0.717
enegger+0.606
+0.588
6000+0.575
Son+0.573
Neg logit contributions
@#&-0.530
 deciding-0.513
 offense-0.475
igating-0.471
hemy-0.470
Token world
Token ID995
Feature activation-5.71e-02

Pos logit contributions
+1.909
enegger+1.562
+1.540
6000+1.488
Son+1.476
Neg logit contributions
@#&-1.793
 deciding-1.702
hemy-1.619
igating-1.602
 offense-1.593
Token by
Token ID416
Feature activation+4.17e-01

Pos logit contributions
@#&+0.247
 deciding+0.243
 offense+0.227
 automatic+0.224
 Answer+0.223
Neg logit contributions
-0.310
enegger-0.262
-0.252
6000-0.246
Son-0.246
Token force
Token ID2700
Feature activation+2.23e+00

Pos logit contributions
+2.390
enegger+2.024
+1.987
6000+1.936
Son+1.922
Neg logit contributions
@#&-1.686
 deciding-1.552
hemy-1.482
igating-1.464
 Answer-1.461
Token,
Token ID11
Feature activation+5.06e-01

Pos logit contributions
+10.151
enegger+8.127
+7.987
6000+7.802
Son+7.583
Neg logit contributions
@#&-10.327
hemy-9.108
 deciding-9.040
lege-8.954
 Explain-8.833
Token wielding
Token ID33949
Feature activation-6.55e-01

Pos logit contributions
+2.742
enegger+2.282
+2.245
6000+2.187
Son+2.176
Neg logit contributions
@#&-2.184
 deciding-2.057
hemy-1.953
igating-1.940
 offense-1.921
Token nuclear
Token ID4523
Feature activation+8.71e-01

Pos logit contributions
 deciding+2.605
@#&+2.578
 offense+2.429
 automatic+2.401
 recall+2.346
Neg logit contributions
-3.744
enegger-3.222
-3.090
DragonMagazine-3.039
Son-3.032
Token weapons
Token ID3777
Feature activation+3.43e-01

Pos logit contributions
+5.135
enegger+4.261
+4.193
Son+4.083
6000+4.030
Neg logit contributions
@#&-3.449
 deciding-3.036
hemy-2.970
igating-2.910
 Explain-2.887
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
+1.720
enegger+1.414
+1.368
6000+1.337
Son+1.329
Neg logit contributions
@#&-1.645
 deciding-1.568
hemy-1.468
 Answer-1.467
igating-1.466
Token at
Token ID379
Feature activation-3.32e-02

Pos logit contributions
+1.822
enegger+1.506
+1.480
Son+1.434
6000+1.423
Neg logit contributions
@#&-1.691
 deciding-1.644
hemy-1.534
 offense-1.527
 Answer-1.526
Token the
Token ID262
Feature activation+1.11e-01

Pos logit contributions
@#&+0.137
 deciding+0.135
 offense+0.125
hemy+0.124
igating+0.123
Neg logit contributions
-0.185
enegger-0.157
-0.152
6000-0.149
Son-0.149
Token end
Token ID886
Feature activation+5.22e-01

Pos logit contributions
+0.604
enegger+0.517
+0.500
Son+0.489
6000+0.487
Neg logit contributions
@#&-0.473
 deciding-0.462
hemy-0.433
 offense-0.430
igating-0.426
Token of
Token ID286
Feature activation+2.92e-01

Pos logit contributions
+3.089
enegger+2.636
+2.570
6000+2.512
Son+2.512
Neg logit contributions
@#&-1.999
 deciding-1.947
 offense-1.798
hemy-1.774
 automatic-1.773
Token West
Token ID2688
Feature activation+1.23e-01

Pos logit contributions
+1.653
enegger+1.409
+1.351
6000+1.342
Son+1.340
Neg logit contributions
@#&-1.177
 deciding-1.132
hemy-1.073
 offense-1.051
igating-1.048
Token Fif
Token ID12830
Feature activation+2.18e+00

Pos logit contributions
+0.649
enegger+0.540
+0.530
Son+0.521
6000+0.517
Neg logit contributions
@#&-0.551
 deciding-0.537
 offense-0.505
hemy-0.499
 automatic-0.498
Tokenteenth
Token ID20283
Feature activation+1.11e+00

Pos logit contributions
+9.333
+7.945
Son+7.664
izons+7.648
asters+7.609
Neg logit contributions
 deciding-9.469
@#&-9.434
 offense-9.140
hemy-8.794
igating-8.759
Token Street
Token ID3530
Feature activation+6.80e-01

Pos logit contributions
+4.562
+3.670
enegger+3.461
Son+3.407
+3.339
Neg logit contributions
@#&-6.071
 deciding-5.679
hemy-5.604
igating-5.468
 offense-5.422
Token.
Token ID13
Feature activation+3.82e-02

Pos logit contributions
+3.262
enegger+2.667
+2.630
+2.534
Son+2.513
Neg logit contributions
@#&-3.297
 deciding-3.176
hemy-3.055
 offense-2.996
igating-2.979
Token Built
Token ID28477
Feature activation-3.57e-01

Pos logit contributions
+0.218
enegger+0.185
+0.179
Son+0.176
6000+0.175
Neg logit contributions
@#&-0.155
 deciding-0.152
 offense-0.141
hemy-0.139
 automatic-0.139
Token in
Token ID287
Feature activation+1.59e-01

Pos logit contributions
@#&+1.516
 deciding+1.492
 offense+1.388
 automatic+1.371
 Answer+1.362
Neg logit contributions
-1.961
enegger-1.660
-1.600
6000-1.563
Son-1.560
Token (
Token ID357
Feature activation+2.47e-01

Pos logit contributions
+2.392
enegger+2.022
+1.958
6000+1.916
Son+1.911
Neg logit contributions
@#&-1.838
 deciding-1.804
 offense-1.679
 automatic-1.654
igating-1.650
Tokenpictured
Token ID28852
Feature activation+5.13e-01

Pos logit contributions
+1.192
enegger+0.970
+0.935
6000+0.925
Son+0.921
Neg logit contributions
@#&-1.220
 deciding-1.202
 offense-1.143
 Answer-1.122
 automatic-1.119
Token),
Token ID828
Feature activation+9.59e-01

Pos logit contributions
+2.708
enegger+2.285
+2.209
Son+2.189
6000+2.174
Neg logit contributions
@#&-2.268
 deciding-2.201
 offense-2.038
 automatic-2.019
igating-2.016
Token the
Token ID262
Feature activation+2.10e-01

Pos logit contributions
+5.372
enegger+4.530
+4.445
Son+4.378
6000+4.373
Neg logit contributions
@#&-3.887
 deciding-3.746
 offense-3.484
hemy-3.472
igating-3.456
Token Latin
Token ID9133
Feature activation+4.14e-01

Pos logit contributions
+1.042
enegger+0.857
+0.841
Son+0.835
6000+0.818
Neg logit contributions
@#&-0.997
 deciding-0.969
 offense-0.911
 recall-0.903
igating-0.895
Token Academy
Token ID8581
Feature activation+2.17e+00

Pos logit contributions
+2.068
enegger+1.716
+1.655
6000+1.614
Son+1.613
Neg logit contributions
@#&-1.964
 deciding-1.927
 offense-1.806
 automatic-1.783
hemy-1.782
Token,
Token ID11
Feature activation+8.19e-01

Pos logit contributions
+7.702
Son+6.229
6000+6.216
+5.796
+5.796
Neg logit contributions
@#&-12.048
 deciding-11.223
hemy-10.989
igating-10.951
lege-10.714
Token some
Token ID617
Feature activation-5.56e-02

Pos logit contributions
+4.774
enegger+4.052
+3.964
6000+3.915
Son+3.899
Neg logit contributions
@#&-3.165
 deciding-3.033
 offense-2.821
igating-2.804
 Answer-2.784
Token areas
Token ID3006
Feature activation+8.06e-01

Pos logit contributions
 deciding+0.239
@#&+0.232
 offense+0.223
 automatic+0.220
 recall+0.215
Neg logit contributions
-0.306
enegger-0.265
-0.249
DragonMagazine-0.249
Son-0.242
Token in
Token ID287
Feature activation+6.74e-01

Pos logit contributions
+4.170
enegger+3.474
+3.383
6000+3.326
Son+3.322
Neg logit contributions
@#&-3.656
 deciding-3.514
hemy-3.298
 offense-3.279
igating-3.260
Token the
Token ID262
Feature activation+5.82e-01

Pos logit contributions
+3.899
enegger+3.327
+3.235
6000+3.179
Son+3.174
Neg logit contributions
@#&-2.654
 deciding-2.578
 offense-2.384
hemy-2.365
igating-2.350
Token Examiner
Token ID35855
Feature activation-2.54e-01

Pos logit contributions
@#&+1.293
 deciding+1.266
 offense+1.174
 automatic+1.156
igating+1.155
Neg logit contributions
-1.721
enegger-1.457
-1.417
Son-1.385
6000-1.383
Token,
Token ID11
Feature activation-3.13e-01

Pos logit contributions
@#&+1.203
 deciding+1.164
 offense+1.090
 automatic+1.083
 Answer+1.076
Neg logit contributions
-1.292
enegger-1.081
-1.039
Son-1.011
6000-1.009
Token Donald
Token ID3759
Feature activation+9.42e-01

Pos logit contributions
@#&+1.377
 deciding+1.334
 offense+1.243
hemy+1.233
igating+1.229
Neg logit contributions
-1.687
enegger-1.430
-1.371
Son-1.345
6000-1.341
Token Trump
Token ID1301
Feature activation+3.71e-01

Pos logit contributions
+4.050
enegger+3.337
Son+3.153
+3.147
Lago+3.051
Neg logit contributions
@#&-4.918
 deciding-4.749
hemy-4.511
igating-4.483
 offense-4.462
Token Donald
Token ID3759
Feature activation+1.12e+00

Pos logit contributions
+2.043
enegger+1.760
+1.682
Son+1.651
6000+1.632
Neg logit contributions
@#&-1.572
 deciding-1.502
hemy-1.390
 offense-1.388
igating-1.385
Token John
Token ID1757
Feature activation+2.13e+00

Pos logit contributions
+5.051
enegger+4.103
Son+4.006
+3.958
DragonMagazine+3.809
Neg logit contributions
@#&-5.697
 deciding-5.511
 offense-5.243
igating-5.194
hemy-5.190
Token Trump
Token ID1301
Feature activation+2.95e-01

Pos logit contributions
+7.904
Son+6.458
+6.199
enegger+6.025
Lago+5.985
Neg logit contributions
@#&-11.318
 deciding-10.974
hemy-10.910
igating-10.342
 automatic-10.335
TokenREAD
Token ID15675
Feature activation+5.08e-01

Pos logit contributions
+1.292
enegger+1.028
+0.993
Son+0.980
6000+0.966
Neg logit contributions
@#&-1.596
 deciding-1.566
 offense-1.479
 automatic-1.469
hemy-1.461
Token:
Token ID25
Feature activation+2.28e-01

Pos logit contributions
+2.419
enegger+1.990
+1.922
Son+1.881
6000+1.864
Neg logit contributions
@#&-2.521
 deciding-2.424
 offense-2.289
 Answer-2.271
hemy-2.271
Token Cohen
Token ID17154
Feature activation+5.08e-01

Pos logit contributions
+1.279
enegger+1.074
+1.050
Son+1.032
6000+1.024
Neg logit contributions
@#&-0.954
 deciding-0.928
 offense-0.864
 automatic-0.852
hemy-0.844
Token testimony
Token ID9709
Feature activation+8.11e-02

Pos logit contributions
+2.659
enegger+2.197
+2.158
Son+2.131
6000+2.089
Neg logit contributions
@#&-2.304
 deciding-2.149
hemy-2.030
 Answer-2.030
 automatic-2.012
Token find
Token ID1064
Feature activation-5.04e-02

Pos logit contributions
+0.060
enegger+0.052
+0.051
6000+0.050
Son+0.050
Neg logit contributions
@#&-0.028
 deciding-0.027
 offense-0.025
hemy-0.024
igating-0.024
Token them
Token ID606
Feature activation+5.11e-01

Pos logit contributions
@#&+0.197
 deciding+0.190
 offense+0.175
hemy+0.174
igating+0.173
Neg logit contributions
-0.295
enegger-0.252
-0.243
Son-0.239
6000-0.239
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
+2.518
enegger+2.127
+1.988
Son+1.970
6000+1.938
Neg logit contributions
@#&-2.455
 deciding-2.304
hemy-2.197
 offense-2.186
 Answer-2.184
Token
Token ID198
Feature activation-3.67e-02

Pos logit contributions
+1.237
enegger+1.053
Son+1.012
+1.008
6000+0.994
Neg logit contributions
@#&-0.924
 deciding-0.903
hemy-0.841
 offense-0.839
 automatic-0.829
Token
Token ID198
Feature activation+2.94e-02

Pos logit contributions
@#&+0.129
 deciding+0.127
 offense+0.116
 Answer+0.116
 automatic+0.115
Neg logit contributions
-0.229
enegger-0.200
-0.191
DragonMagazine-0.187
6000-0.185
TokenAdvertisement
Token ID4723
Feature activation+2.10e+00

Pos logit contributions
+0.134
enegger+0.108
+0.104
Son+0.101
6000+0.101
Neg logit contributions
@#&-0.154
 deciding-0.152
 offense-0.143
 automatic-0.141
igating-0.141
Token
Token ID198
Feature activation+5.04e-01

Pos logit contributions
+7.720
enegger+5.982
Son+5.926
Marvel+5.838
Sea+5.809
Neg logit contributions
@#&-10.569
 deciding-9.893
 offense-9.419
 recall-9.382
igating-9.252
Token
Token ID198
Feature activation+1.70e-01

Pos logit contributions
+2.944
enegger+2.509
+2.443
6000+2.401
Son+2.398
Neg logit contributions
@#&-1.965
 deciding-1.921
 offense-1.777
 automatic-1.746
hemy-1.743
TokenThe
Token ID464
Feature activation-1.86e-02

Pos logit contributions
+0.793
enegger+0.639
+0.621
Son+0.611
6000+0.608
Neg logit contributions
@#&-0.872
 deciding-0.859
 offense-0.812
 automatic-0.800
hemy-0.797
Token Big
Token ID4403
Feature activation-1.44e-01

Pos logit contributions
@#&+0.073
 deciding+0.070
 offense+0.065
hemy+0.065
igating+0.065
Neg logit contributions
-0.108
enegger-0.092
-0.089
Son-0.088
6000-0.087
Token Three
Token ID7683
Feature activation+3.71e-01

Pos logit contributions
@#&+0.606
 deciding+0.586
 offense+0.549
hemy+0.541
 automatic+0.539
Neg logit contributions
-0.802
enegger-0.682
-0.654
Son-0.649
6000-0.643
Tokenn
Token ID77
Feature activation+1.10e+00

Pos logit contributions
+1.180
enegger+0.981
6000+0.935
Son+0.934
+0.929
Neg logit contributions
@#&-1.312
 deciding-1.299
 offense-1.215
 Answer-1.213
 automatic-1.199
Tokenh
Token ID71
Feature activation+2.08e-01

Pos logit contributions
+5.543
enegger+5.050
6000+4.978
Son+4.820
+4.670
Neg logit contributions
 deciding-4.444
@#&-4.417
 Answer-4.166
 offense-4.052
 automatic-3.967
Tokents
Token ID912
Feature activation+4.62e-01

Pos logit contributions
+0.812
enegger+0.641
6000+0.606
+0.604
Son+0.598
Neg logit contributions
@#&-1.210
 deciding-1.189
 offense-1.132
 Answer-1.118
igating-1.117
Tokena
Token ID64
Feature activation+1.06e+00

Pos logit contributions
+3.157
enegger+2.769
+2.705
6000+2.689
Son+2.673
Neg logit contributions
@#&-1.325
 deciding-1.277
 offense-1.142
 automatic-1.110
hemy-1.096
Token.
Token ID13
Feature activation+3.95e-01

Pos logit contributions
+4.825
enegger+3.915
+3.825
6000+3.817
Son+3.776
Neg logit contributions
@#&-5.361
 deciding-5.239
 offense-4.942
hemy-4.867
igating-4.860
Tokendot
Token ID26518
Feature activation+2.09e+00

Pos logit contributions
+1.751
enegger+1.407
6000+1.351
Son+1.349
+1.347
Neg logit contributions
@#&-2.112
 deciding-2.049
 offense-1.904
hemy-1.902
 automatic-1.896
Token.
Token ID13
Feature activation+4.79e-01

Pos logit contributions
+7.571
6000+6.057
Son+5.987
+5.656
Lago+5.655
Neg logit contributions
@#&-11.500
 deciding-11.138
hemy-10.740
igating-10.529
 offense-10.440
Tokengov
Token ID9567
Feature activation+8.37e-01

Pos logit contributions
+3.048
enegger+2.638
+2.567
6000+2.525
Son+2.518
Neg logit contributions
@#&-1.624
 deciding-1.585
 offense-1.445
 automatic-1.420
hemy-1.410
Token/
Token ID14
Feature activation+1.16e-01

Pos logit contributions
+3.347
enegger+2.639
+2.534
6000+2.481
Son+2.472
Neg logit contributions
@#&-4.766
 deciding-4.692
 offense-4.443
hemy-4.396
igating-4.392
Tokenpeople
Token ID15332
Feature activation-4.56e-01

Pos logit contributions
+0.598
enegger+0.498
+0.482
6000+0.472
Son+0.472
Neg logit contributions
@#&-0.537
 deciding-0.527
 offense-0.490
 automatic-0.485
 Answer-0.484
Token/
Token ID14
Feature activation-1.17e-01

Pos logit contributions
@#&+2.444
 deciding+2.408
 offense+2.274
 automatic+2.253
 Answer+2.244
Neg logit contributions
-2.003
enegger-1.619
-1.549
6000-1.503
Son-1.499
Token to
Token ID284
Feature activation+1.26e-01

Pos logit contributions
+0.553
enegger+0.461
+0.446
6000+0.437
Son+0.434
Neg logit contributions
@#&-0.484
 deciding-0.469
 offense-0.439
hemy-0.436
 Answer-0.435
Token fly
Token ID6129
Feature activation-2.19e-01

Pos logit contributions
+0.707
enegger+0.595
+0.579
6000+0.568
Son+0.566
Neg logit contributions
@#&-0.529
 deciding-0.514
 offense-0.476
hemy-0.475
igating-0.471
Token by
Token ID416
Feature activation+9.35e-01

Pos logit contributions
@#&+0.932
 deciding+0.919
 offense+0.859
 automatic+0.849
 Answer+0.842
Neg logit contributions
-1.199
enegger-1.018
-0.981
Son-0.954
6000-0.952
Token comet
Token ID31733
Feature activation+4.20e-01

Pos logit contributions
+5.001
enegger+4.158
+4.078
6000+4.040
Son+3.983
Neg logit contributions
@#&-4.024
 deciding-3.773
hemy-3.691
igating-3.552
lege-3.514
Token 19
Token ID678
Feature activation+9.76e-02

Pos logit contributions
+2.271
enegger+1.900
+1.857
6000+1.816
Son+1.810
Neg logit contributions
@#&-1.822
 deciding-1.765
hemy-1.642
 offense-1.641
igating-1.626
TokenP
Token ID47
Feature activation+2.08e+00

Pos logit contributions
+0.407
enegger+0.321
+0.308
6000+0.298
Son+0.298
Neg logit contributions
@#&-0.548
 deciding-0.540
 offense-0.511
 automatic-0.506
igating-0.505
Token/
Token ID14
Feature activation+1.03e+00

Pos logit contributions
+7.794
+5.803
6000+5.793
+5.583
enegger+5.565
Neg logit contributions
@#&-11.346
 deciding-10.602
hemy-10.503
 Answer-10.085
igating-10.037
TokenB
Token ID33
Feature activation+1.23e+00

Pos logit contributions
+4.334
Son+3.372
+3.344
enegger+3.335
6000+3.296
Neg logit contributions
@#&-5.607
 deciding-5.498
 offense-5.210
hemy-5.206
igating-5.132
Tokenor
Token ID273
Feature activation-4.03e-01

Pos logit contributions
+5.065
enegger+4.040
6000+3.988
+3.875
Son+3.813
Neg logit contributions
@#&-6.527
 deciding-6.354
 offense-5.895
 automatic-5.855
 Answer-5.806
Tokenrell
Token ID11252
Feature activation-3.51e-01

Pos logit contributions
@#&+1.883
 deciding+1.801
lege+1.723
igating+1.714
 recall+1.689
Neg logit contributions
-2.057
enegger-1.724
-1.648
DragonMagazine-1.618
Son-1.575
Tokeny
Token ID88
Feature activation+8.82e-01

Pos logit contributions
@#&+1.272
 deciding+1.251
 Answer+1.151
 offense+1.146
 automatic+1.140
Neg logit contributions
-2.144
enegger-1.853
-1.776
DragonMagazine-1.749
6000-1.708
Token repeatedly
Token ID7830
Feature activation+4.32e-01

Pos logit contributions
+2.097
enegger+1.806
+1.739
Son+1.705
6000+1.695
Neg logit contributions
@#&-1.563
 deciding-1.473
hemy-1.373
 Answer-1.362
igating-1.355
Token criticized
Token ID12318
Feature activation+1.58e-01

Pos logit contributions
+2.397
enegger+2.076
+1.982
Son+1.935
6000+1.933
Neg logit contributions
@#&-1.811
 deciding-1.696
hemy-1.575
 Answer-1.557
igating-1.552
Token President
Token ID1992
Feature activation+1.70e-01

Pos logit contributions
+0.828
enegger+0.705
+0.674
Son+0.660
6000+0.657
Neg logit contributions
@#&-0.715
 deciding-0.681
hemy-0.640
 offense-0.632
igating-0.632
Token Trump
Token ID1301
Feature activation+5.09e-01

Pos logit contributions
+0.827
enegger+0.690
+0.667
Son+0.653
6000+0.640
Neg logit contributions
@#&-0.826
 deciding-0.798
 offense-0.748
hemy-0.745
igating-0.741
Token Donald
Token ID3759
Feature activation+1.16e+00

Pos logit contributions
+2.434
enegger+2.021
+1.956
Son+1.896
6000+1.869
Neg logit contributions
@#&-2.515
 deciding-2.398
 Answer-2.265
 offense-2.253
igating-2.249
Token John
Token ID1757
Feature activation+2.08e+00

Pos logit contributions
+5.262
enegger+4.267
Son+4.123
+4.100
DragonMagazine+3.922
Neg logit contributions
@#&-5.865
 deciding-5.726
 offense-5.455
igating-5.406
hemy-5.367
Token Trump
Token ID1301
Feature activation+3.73e-01

Pos logit contributions
+7.897
Son+6.517
+6.267
enegger+6.176
Lago+6.064
Neg logit contributions
@#&-11.061
 deciding-10.727
hemy-10.447
igating-10.069
 automatic-10.002
TokenHouse
Token ID18102
Feature activation+2.62e-01

Pos logit contributions
+1.609
enegger+1.273
+1.231
Son+1.218
6000+1.198
Neg logit contributions
@#&-2.035
 deciding-1.999
 offense-1.889
 automatic-1.874
 Answer-1.862
Token committee
Token ID5583
Feature activation-1.29e-01

Pos logit contributions
+1.576
enegger+1.356
+1.313
6000+1.285
Son+1.282
Neg logit contributions
@#&-0.978
 deciding-0.960
 offense-0.885
 automatic-0.870
 Answer-0.865
Token believes
Token ID5804
Feature activation-7.73e-02

Pos logit contributions
@#&+0.512
 deciding+0.511
 offense+0.472
 automatic+0.466
 Answer+0.460
Neg logit contributions
-0.743
enegger-0.638
-0.612
6000-0.597
DragonMagazine-0.597
Token it
Token ID340
Feature activation-6.33e-01

Pos logit contributions
@#&+0.346
 deciding+0.344
 offense+0.323
 automatic+0.319
 recall+0.315
Neg logit contributions
-0.407
enegger-0.346
-0.329
6000-0.321
Son-0.319
Token,
Token ID11
Feature activation+1.42e-01

Pos logit contributions
@#&+3.818
 deciding+3.801
 offense+3.520
 automatic+3.489
 Answer+3.440
Neg logit contributions
-4.744
enegger-4.003
-3.865
Son-3.768
6000-3.767
Token investors
Token ID7713
Feature activation-3.42e-01

Pos logit contributions
+0.797
enegger+0.677
+0.656
6000+0.643
Son+0.642
Neg logit contributions
@#&-0.584
 deciding-0.558
 offense-0.525
igating-0.519
hemy-0.519
Token risk
Token ID2526
Feature activation+3.22e-01

Pos logit contributions
@#&+1.410
 deciding+1.407
 offense+1.300
 automatic+1.285
 recall+1.270
Neg logit contributions
-1.928
enegger-1.632
-1.578
Son-1.541
6000-1.538
Token paying
Token ID5989
Feature activation+4.00e-01

Pos logit contributions
+1.962
enegger+1.693
+1.652
6000+1.619
Son+1.609
Neg logit contributions
@#&-1.172
 deciding-1.089
 Answer-1.026
hemy-1.020
 offense-1.016
Token an
Token ID281
Feature activation+6.75e-01

Pos logit contributions
+2.286
enegger+1.943
+1.887
6000+1.854
Son+1.841
Neg logit contributions
@#&-1.602
 deciding-1.539
hemy-1.423
 Answer-1.422
 offense-1.421
Token artificially
Token ID32455
Feature activation+2.07e+00

Pos logit contributions
+3.870
+3.208
enegger+3.208
6000+3.070
Son+3.067
Neg logit contributions
@#&-2.683
 deciding-2.548
hemy-2.415
 Answer-2.338
igating-2.327
Token high
Token ID1029
Feature activation+7.63e-01

Pos logit contributions
+10.117
+8.438
+8.215
enegger+8.115
Son+7.918
Neg logit contributions
@#&-8.542
hemy-8.114
 Answer-8.053
 deciding-7.891
igating-7.867
Token price
Token ID2756
Feature activation+1.12e-01

Pos logit contributions
+3.938
enegger+3.292
+3.212
6000+3.124
Son+3.119
Neg logit contributions
@#&-3.397
 deciding-3.183
hemy-3.089
igating-3.039
lege-3.011
Token.
Token ID13
Feature activation+2.47e-01

Pos logit contributions
+0.589
enegger+0.488
+0.475
6000+0.464
Son+0.462
Neg logit contributions
@#&-0.510
 deciding-0.491
igating-0.460
hemy-0.460
 offense-0.459
Token
Token ID198
Feature activation+2.11e-01

Pos logit contributions
+1.441
enegger+1.231
+1.194
Son+1.175
6000+1.170
Neg logit contributions
@#&-0.971
 deciding-0.946
 offense-0.882
hemy-0.868
 automatic-0.864
Token
Token ID198
Feature activation+2.36e-01

Pos logit contributions
+1.272
enegger+1.096
+1.059
6000+1.034
Son+1.032
Neg logit contributions
@#&-0.788
 deciding-0.773
 offense-0.712
 automatic-0.702
 Answer-0.699
Token is
Token ID318
Feature activation-2.94e-01

Pos logit contributions
@#&+0.447
 deciding+0.436
 offense+0.412
 automatic+0.408
hemy+0.408
Neg logit contributions
-0.430
enegger-0.355
-0.341
Son-0.333
6000-0.331
Token,
Token ID11
Feature activation-3.50e-01

Pos logit contributions
@#&+1.150
 deciding+1.113
 offense+1.034
hemy+1.019
igating+1.016
Neg logit contributions
-1.728
enegger-1.479
-1.431
6000-1.403
Son-1.403
Token Hitchcock
Token ID49817
Feature activation+2.96e-01

Pos logit contributions
@#&+1.481
 deciding+1.452
 offense+1.352
 automatic+1.333
 Answer+1.328
Neg logit contributions
-1.934
enegger-1.639
-1.582
6000-1.545
Son-1.543
Token enlisted
Token ID30166
Feature activation+1.84e+00

Pos logit contributions
+1.596
enegger+1.360
+1.300
6000+1.271
Son+1.270
Neg logit contributions
@#&-1.293
 deciding-1.254
 offense-1.172
 automatic-1.156
 Answer-1.156
Token his
Token ID465
Feature activation+6.38e-01

Pos logit contributions
+9.523
enegger+8.277
Son+7.909
6000+7.854
+7.829
Neg logit contributions
@#&-7.182
 deciding-6.897
hemy-6.633
 Answer-6.318
 offense-6.278
Tokeng
Token ID70
Feature activation+2.06e+00

Pos logit contributions
+3.609
enegger+3.121
+2.982
Son+2.964
6000+2.936
Neg logit contributions
@#&-2.568
 deciding-2.459
hemy-2.316
 Answer-2.269
 offense-2.265
Tokenifted
Token ID21715
Feature activation+7.48e-01

Pos logit contributions
+9.639
enegger+8.432
6000+7.913
asters+7.869
+7.862
Neg logit contributions
@#&-8.839
 deciding-8.332
 Answer-7.985
 offense-7.911
 recall-7.786
Token title
Token ID3670
Feature activation-4.40e-02

Pos logit contributions
+3.965
enegger+3.396
+3.255
Son+3.226
6000+3.203
Neg logit contributions
@#&-3.265
 deciding-3.122
hemy-2.950
 Answer-2.889
 offense-2.882
Token designer
Token ID11915
Feature activation+7.27e-02

Pos logit contributions
@#&+0.203
 deciding+0.198
 offense+0.185
 Answer+0.183
hemy+0.183
Neg logit contributions
-0.227
enegger-0.190
-0.183
6000-0.179
Son-0.179
Token,
Token ID11
Feature activation+2.24e-01

Pos logit contributions
+0.372
enegger+0.308
+0.298
Son+0.293
6000+0.291
Neg logit contributions
@#&-0.339
 deciding-0.331
 offense-0.310
hemy-0.307
 automatic-0.306
Token Saul
Token ID31603
Feature activation+8.06e-01

Pos logit contributions
+1.164
enegger+0.967
+0.939
Son+0.928
6000+0.916
Neg logit contributions
@#&-1.022
 deciding-0.992
 offense-0.931
hemy-0.930
igating-0.921
Token Air
Token ID3701
Feature activation-1.06e+00

Pos logit contributions
+1.554
enegger+1.262
+1.224
Son+1.184
Lago+1.180
Neg logit contributions
@#&-1.686
 deciding-1.622
igating-1.519
hemy-1.511
 Answer-1.509
Token Station
Token ID9327
Feature activation-6.06e-01

Pos logit contributions
@#&+5.531
 deciding+5.526
 offense+5.285
 automatic+5.173
 Answer+5.158
Neg logit contributions
-4.604
enegger-3.778
-3.582
6000-3.479
DragonMagazine-3.459
Token in
Token ID287
Feature activation-5.44e-01

Pos logit contributions
@#&+2.660
 deciding+2.634
 offense+2.457
 automatic+2.426
 Answer+2.410
Neg logit contributions
-3.228
enegger-2.718
-2.619
6000-2.557
Son-2.550
Token South
Token ID2520
Feature activation+5.38e-02

Pos logit contributions
@#&+2.315
 deciding+2.284
 offense+2.129
 automatic+2.099
 Answer+2.088
Neg logit contributions
-2.983
enegger-2.518
-2.431
6000-2.373
Son-2.369
Token We
Token ID775
Feature activation+4.00e-01

Pos logit contributions
+0.285
enegger+0.242
+0.234
Son+0.232
Lago+0.229
Neg logit contributions
@#&-0.239
 deciding-0.232
 offense-0.215
 automatic-0.213
 Answer-0.210
Tokenym
Token ID4948
Feature activation+2.05e+00

Pos logit contributions
+1.996
enegger+1.669
Son+1.638
6000+1.597
+1.582
Neg logit contributions
@#&-1.834
 deciding-1.796
 automatic-1.689
 offense-1.685
hemy-1.648
Tokenouth
Token ID1536
Feature activation-4.08e-01

Pos logit contributions
+5.469
asters+4.485
enegger+4.119
6000+3.770
asse+3.656
Neg logit contributions
 deciding-12.671
@#&-12.121
 offense-12.114
 automatic-12.037
 Explain-11.967
Token.
Token ID13
Feature activation-1.88e-01

Pos logit contributions
@#&+1.876
 deciding+1.841
 offense+1.725
 automatic+1.704
 Answer+1.697
Neg logit contributions
-2.102
enegger-1.758
-1.694
6000-1.647
Son-1.645
Token This
Token ID770
Feature activation+1.53e-01

Pos logit contributions
@#&+0.747
 deciding+0.730
 offense+0.677
hemy+0.671
 automatic+0.667
Neg logit contributions
-1.094
enegger-0.934
-0.906
6000-0.888
Son-0.887
Token is
Token ID318
Feature activation+4.07e-01

Pos logit contributions
+0.774
enegger+0.645
+0.623
6000+0.608
Son+0.603
Neg logit contributions
@#&-0.714
 deciding-0.697
 offense-0.650
 Answer-0.646
hemy-0.646
Token an
Token ID281
Feature activation+2.62e-01

Pos logit contributions
+2.474
enegger+2.119
+2.052
6000+2.033
Son+2.014
Neg logit contributions
@#&-1.497
 deciding-1.414
 offense-1.312
hemy-1.312
igating-1.308
Token,
Token ID11
Feature activation+1.18e-01

Pos logit contributions
+3.409
enegger+2.752
+2.625
+2.590
DragonMagazine+2.585
Neg logit contributions
@#&-3.798
 deciding-3.627
hemy-3.405
 offense-3.397
 automatic-3.393
Token Joyce
Token ID25936
Feature activation+3.59e-01

Pos logit contributions
+0.607
enegger+0.513
+0.489
+0.477
Lago+0.475
Neg logit contributions
@#&-0.541
 deciding-0.526
hemy-0.492
 offense-0.491
 automatic-0.486
Token Foundation
Token ID5693
Feature activation+7.40e-01

Pos logit contributions
+1.805
enegger+1.499
+1.447
6000+1.411
Son+1.409
Neg logit contributions
@#&-1.697
 deciding-1.661
 offense-1.557
 automatic-1.538
igating-1.537
Token:
Token ID25
Feature activation+7.98e-01

Pos logit contributions
+3.304
enegger+2.687
+2.600
+2.521
Son+2.514
Neg logit contributions
@#&-3.815
 deciding-3.618
hemy-3.440
igating-3.393
 offense-3.388
Token "
Token ID366
Feature activation+8.89e-01

Pos logit contributions
+3.733
enegger+3.134
+2.913
Lago+2.877
Son+2.866
Neg logit contributions
@#&-3.943
 deciding-3.787
 offense-3.558
hemy-3.554
 automatic-3.521
TokenR
Token ID49
Feature activation+2.05e+00

Pos logit contributions
+3.411
enegger+2.594
+2.567
Son+2.548
6000+2.474
Neg logit contributions
@#&-5.190
 deciding-5.089
hemy-4.896
 offense-4.857
 automatic-4.772
Tokenoc
Token ID420
Feature activation+1.23e-01

Pos logit contributions
+8.080
enegger+6.355
6000+6.341
+6.279
Son+6.209
Neg logit contributions
@#&-10.680
 deciding-10.524
 offense-9.814
hemy-9.706
 Answer-9.631
Tokenkefeller
Token ID31424
Feature activation+5.26e-02

Pos logit contributions
+0.408
enegger+0.299
+0.283
DragonMagazine+0.267
6000+0.263
Neg logit contributions
@#&-0.792
 deciding-0.783
 offense-0.749
igating-0.743
 Answer-0.743
Token Family
Token ID7884
Feature activation+8.25e-01

Pos logit contributions
+0.249
enegger+0.205
+0.196
6000+0.190
Son+0.190
Neg logit contributions
@#&-0.264
 deciding-0.260
 offense-0.245
 automatic-0.242
 Answer-0.241
Token Fund
Token ID7557
Feature activation+9.91e-01

Pos logit contributions
+4.070
enegger+3.378
+3.300
6000+3.232
Son+3.218
Neg logit contributions
@#&-3.884
 deciding-3.792
hemy-3.572
 offense-3.542
igating-3.539
Token,
Token ID11
Feature activation+9.25e-01

Pos logit contributions
+4.288
enegger+3.590
+3.320
Son+3.288
Lago+3.260
Neg logit contributions
@#&-5.103
 deciding-4.868
hemy-4.612
 offense-4.538
 Answer-4.529
Token exit
Token ID8420
Feature activation+5.91e-01

Pos logit contributions
@#&+0.468
 deciding+0.451
 offense+0.418
hemy+0.416
igating+0.413
Neg logit contributions
-0.735
enegger-0.626
-0.609
Son-0.599
6000-0.597
Token polls
Token ID9231
Feature activation+6.22e-02

Pos logit contributions
+4.039
+3.438
enegger+3.416
+3.408
6000+3.404
Neg logit contributions
@#&-1.692
 deciding-1.573
hemy-1.496
lege-1.409
igating-1.406
Token.
Token ID13
Feature activation-9.17e-02

Pos logit contributions
+0.310
enegger+0.254
+0.247
6000+0.242
Son+0.241
Neg logit contributions
@#&-0.299
 deciding-0.287
 offense-0.271
hemy-0.270
igating-0.270
Token
Token ID198
Feature activation+1.06e-01

Pos logit contributions
@#&+0.369
 deciding+0.363
 offense+0.336
 automatic+0.331
hemy+0.330
Neg logit contributions
-0.526
enegger-0.447
-0.433
Son-0.426
6000-0.425
Token
Token ID198
Feature activation+6.35e-03

Pos logit contributions
+0.658
enegger+0.573
+0.549
DragonMagazine+0.536
6000+0.533
Neg logit contributions
@#&-0.381
 deciding-0.374
 offense-0.344
 Answer-0.341
 automatic-0.340
TokenChrist
Token ID10684
Feature activation+2.04e+00

Pos logit contributions
+0.029
enegger+0.023
+0.022
Son+0.022
6000+0.022
Neg logit contributions
@#&-0.033
 deciding-0.033
 offense-0.031
 automatic-0.031
 Answer-0.031
Tokenie
Token ID494
Feature activation-1.08e-01

Pos logit contributions
+7.979
Son+6.641
enegger+6.605
6000+6.312
+6.269
Neg logit contributions
@#&-10.107
 deciding-10.015
igating-9.395
 automatic-9.300
lege-9.234
Token
Token ID447
Feature activation+6.03e-01

Pos logit contributions
@#&+0.486
 deciding+0.475
 offense+0.444
 automatic+0.438
 Answer+0.437
Neg logit contributions
-0.573
enegger-0.481
-0.464
6000-0.452
Son-0.451
Token
Token ID247
Feature activation+1.44e-01

Pos logit contributions
+2.182
enegger+1.654
+1.619
Son+1.583
Lago+1.566
Neg logit contributions
@#&-3.633
 deciding-3.576
igating-3.379
 offense-3.377
hemy-3.362
Tokens
Token ID82
Feature activation+2.66e-01

Pos logit contributions
+0.802
enegger+0.679
+0.657
Son+0.642
6000+0.639
Neg logit contributions
@#&-0.608
 deciding-0.592
 offense-0.550
 Answer-0.546
hemy-0.544
Token showing
Token ID4478
Feature activation+3.64e-01

Pos logit contributions
+1.589
enegger+1.362
+1.326
Son+1.300
6000+1.294
Neg logit contributions
@#&-1.003
 deciding-0.949
hemy-0.890
igating-0.877
 Answer-0.868
Token,
Token ID11
Feature activation+1.65e-01

Pos logit contributions
@#&+0.794
 deciding+0.780
 offense+0.726
 automatic+0.716
 Answer+0.712
Neg logit contributions
-1.048
enegger-0.890
-0.859
6000-0.839
Son-0.837
Token such
Token ID884
Feature activation-2.59e-01

Pos logit contributions
+0.924
enegger+0.779
+0.759
6000+0.740
Son+0.739
Neg logit contributions
@#&-0.689
 deciding-0.666
 offense-0.621
hemy-0.616
 Answer-0.615
Token as
Token ID355
Feature activation-1.98e-02

Pos logit contributions
@#&+1.156
 deciding+1.111
hemy+1.045
 Answer+1.041
igating+1.034
Neg logit contributions
-1.381
enegger-1.153
-1.125
6000-1.088
Son-1.085
Token robotics
Token ID36359
Feature activation+7.19e-01

Pos logit contributions
@#&+0.087
 deciding+0.084
 offense+0.078
hemy+0.078
igating+0.078
Neg logit contributions
-0.106
enegger-0.089
-0.086
Son-0.085
6000-0.084
Token,
Token ID11
Feature activation+4.09e-01

Pos logit contributions
+3.356
enegger+2.699
+2.658
Son+2.555
6000+2.517
Neg logit contributions
@#&-3.671
 deciding-3.463
hemy-3.381
 Answer-3.314
igating-3.286
Token artificial
Token ID11666
Feature activation+2.02e+00

Pos logit contributions
+2.135
enegger+1.782
+1.733
Son+1.714
6000+1.688
Neg logit contributions
@#&-1.831
 deciding-1.737
hemy-1.670
igating-1.640
 Answer-1.626
Token intelligence
Token ID4430
Feature activation+2.70e-01

Pos logit contributions
+8.578
enegger+7.242
+7.071
Son+6.946
asters+6.647
Neg logit contributions
@#&-9.415
hemy-8.814
igating-8.373
 Answer-8.315
 deciding-8.150
Token,
Token ID11
Feature activation+4.11e-01

Pos logit contributions
+1.293
enegger+1.040
+1.030
Son+0.998
6000+0.981
Neg logit contributions
@#&-1.368
 deciding-1.296
hemy-1.235
 Answer-1.234
igating-1.219
Token 3
Token ID513
Feature activation+7.11e-02

Pos logit contributions
+2.008
enegger+1.666
+1.606
Son+1.581
6000+1.562
Neg logit contributions
@#&-1.977
 deciding-1.890
hemy-1.813
igating-1.778
 Answer-1.769
Token-
Token ID12
Feature activation-3.21e-01

Pos logit contributions
+0.353
enegger+0.288
+0.282
Son+0.277
6000+0.273
Neg logit contributions
@#&-0.342
 deciding-0.333
 offense-0.311
igating-0.311
 Answer-0.310
TokenD
Token ID35
Feature activation+1.93e-01

Pos logit contributions
@#&+1.434
 deciding+1.411
 offense+1.318
 automatic+1.302
 Answer+1.295
Neg logit contributions
-1.692
enegger-1.421
-1.370
6000-1.331
Son-1.328
Token your
Token ID534
Feature activation-8.81e-01

Pos logit contributions
@#&+1.346
 deciding+1.303
 offense+1.146
 automatic+1.119
igating+1.108
Neg logit contributions
-3.858
enegger-3.408
-3.323
6000-3.266
Son-3.263
Token?
Token ID5633
Feature activation-6.83e-01

Pos logit contributions
@#&+2.248
 deciding+2.243
 offense+2.015
 Answer+1.977
 automatic+1.951
Neg logit contributions
-6.288
enegger-5.520
-5.381
Lago-5.301
6000-5.294
Tokens
Token ID82
Feature activation+3.40e-01

Pos logit contributions
@#&+3.076
 deciding+3.030
 offense+2.837
 Answer+2.799
 automatic+2.796
Neg logit contributions
-3.573
enegger-3.001
-2.886
6000-2.808
Son-2.798
Token for
Token ID329
Feature activation+1.72e-01

Pos logit contributions
+1.717
enegger+1.445
+1.382
6000+1.367
Son+1.365
Neg logit contributions
@#&-1.563
 deciding-1.545
hemy-1.424
 automatic-1.423
 offense-1.421
Token the
Token ID262
Feature activation-9.30e-02

Pos logit contributions
+1.011
enegger+0.874
+0.848
Son+0.832
6000+0.821
Neg logit contributions
@#&-0.670
 deciding-0.626
hemy-0.589
igating-0.579
 offense-0.577
Token Dub
Token ID10322
Feature activation-2.75e+00

Pos logit contributions
@#&+0.364
 deciding+0.350
 offense+0.323
hemy+0.322
igating+0.321
Neg logit contributions
-0.544
enegger-0.465
-0.452
6000-0.442
Son-0.442
Tokens
Token ID82
Feature activation-6.43e-01

Pos logit contributions
 deciding+10.784
@#&+10.722
 offense+10.470
 Answer+10.405
 automatic+9.921
Neg logit contributions
-13.667
enegger-11.864
Lago-11.028
-11.017
6000-10.867
Token by
Token ID416
Feature activation+3.80e-01

Pos logit contributions
@#&+2.759
 deciding+2.723
 offense+2.547
 automatic+2.510
 Answer+2.502
Neg logit contributions
-3.495
enegger-2.961
-2.848
6000-2.778
Son-2.763
Token posting
Token ID10754
Feature activation-6.54e-01

Pos logit contributions
+2.325
enegger+2.002
+1.937
Son+1.907
6000+1.898
Neg logit contributions
@#&-1.376
 deciding-1.312
hemy-1.222
 offense-1.214
igating-1.203
Token a
Token ID257
Feature activation-2.86e-01

Pos logit contributions
@#&+2.520
 deciding+2.482
 offense+2.307
 automatic+2.269
 Answer+2.255
Neg logit contributions
-3.849
enegger-3.289
-3.189
6000-3.120
DragonMagazine-3.100
Token video
Token ID2008
Feature activation-4.32e-01

Pos logit contributions
@#&+1.122
 deciding+1.095
 offense+1.011
 automatic+0.998
igating+0.998
Neg logit contributions
-1.669
enegger-1.428
-1.384
Son-1.355
6000-1.354
Token the
Token ID262
Feature activation-4.28e-01

Pos logit contributions
@#&+1.112
 deciding+1.084
 offense+0.997
 Answer+0.981
igating+0.981
Neg logit contributions
-1.819
enegger-1.560
-1.519
6000-1.487
Son-1.484
Token moment
Token ID2589
Feature activation-1.27e+00

Pos logit contributions
@#&+1.683
 deciding+1.653
 offense+1.529
 automatic+1.507
 Answer+1.497
Neg logit contributions
-2.488
enegger-2.127
-2.057
6000-2.012
Son-2.008
Token,
Token ID11
Feature activation-6.79e-01

Pos logit contributions
 deciding+5.365
@#&+5.320
 offense+4.925
 automatic+4.865
 recall+4.840
Neg logit contributions
-6.729
enegger-5.765
-5.564
DragonMagazine-5.465
6000-5.412
Token
Token ID447
Feature activation-7.51e-01

Pos logit contributions
@#&+2.882
 deciding+2.832
 offense+2.636
 automatic+2.600
 Answer+2.587
Neg logit contributions
-3.738
enegger-3.167
-3.056
6000-2.984
Son-2.977
Token
Token ID247
Feature activation-2.92e-01

Pos logit contributions
@#&+3.137
 deciding+3.081
 offense+2.900
 Answer+2.868
 automatic+2.857
Neg logit contributions
-4.143
enegger-3.584
-3.355
DragonMagazine-3.325
6000-3.316
Token Cher
Token ID19305
Feature activation-2.55e+00

Pos logit contributions
@#&+1.071
 deciding+1.037
 offense+0.956
igating+0.947
hemy+0.943
Neg logit contributions
-1.782
enegger-1.528
-1.487
6000-1.459
Son-1.459
Tokenrene
Token ID25924
Feature activation-1.72e-01

Pos logit contributions
@#&+7.411
lege+7.403
 deciding+7.354
 recall+7.354
igating+7.226
Neg logit contributions
-14.805
DragonMagazine-12.367
-12.171
enegger-11.916
-11.629
Token says
Token ID1139
Feature activation+2.69e-01

Pos logit contributions
@#&+0.732
 deciding+0.707
 offense+0.661
igating+0.657
hemy+0.654
Neg logit contributions
-0.944
enegger-0.799
-0.775
6000-0.756
Son-0.754
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
+1.303
enegger+1.076
+1.049
Son+1.010
+1.009
Neg logit contributions
@#&-1.312
 deciding-1.231
hemy-1.191
 offense-1.169
igating-1.169
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
@#&+0.665
 deciding+0.651
 offense+0.610
hemy+0.608
igating+0.605
Neg logit contributions
-0.774
enegger-0.640
-0.624
Son-0.611
6000-0.606
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
@#&+0.430
 deciding+0.424
 Answer+0.388
 offense+0.387
 automatic+0.384
Neg logit contributions
-0.791
enegger-0.692
-0.661
DragonMagazine-0.650
catentry-0.640
Token,
Token ID11
Feature activation+3.66e-01

Pos logit contributions
+1.841
enegger+1.496
+1.449
Son+1.427
6000+1.381
Neg logit contributions
@#&-2.036
 deciding-1.929
 offense-1.854
 Answer-1.842
 automatic-1.825
Token Mal
Token ID4434
Feature activation-7.73e-01

Pos logit contributions
+1.987
enegger+1.662
+1.621
Son+1.607
6000+1.598
Neg logit contributions
@#&-1.574
 deciding-1.488
igating-1.401
 offense-1.389
hemy-1.389
Tokenanch
Token ID3702
Feature activation-1.08e+00

Pos logit contributions
@#&+2.406
 deciding+2.345
hemy+2.183
igating+2.158
 offense+2.148
Neg logit contributions
-5.075
enegger-4.358
-4.297
DragonMagazine-4.159
6000-4.097
Tokenak
Token ID461
Feature activation+1.63e-02

Pos logit contributions
@#&+3.371
 deciding+3.359
 Answer+3.104
 automatic+3.097
 recall+3.081
Neg logit contributions
-6.961
enegger-6.011
-5.885
DragonMagazine-5.816
-5.723
Token and
Token ID290
Feature activation+1.27e-01

Pos logit contributions
+0.080
enegger+0.066
+0.064
Son+0.062
6000+0.062
Neg logit contributions
@#&-0.079
 deciding-0.078
 offense-0.073
 Answer-0.072
igating-0.072
Token Mak
Token ID15841
Feature activation-2.50e+00

Pos logit contributions
+0.641
enegger+0.529
+0.513
Son+0.506
6000+0.502
Neg logit contributions
@#&-0.596
 deciding-0.574
igating-0.538
 offense-0.536
hemy-0.532
Tokenovi
Token ID47297
Feature activation-1.01e+00

Pos logit contributions
@#&+7.821
igating+7.754
 deciding+7.650
 automatic+7.494
 recall+7.411
Neg logit contributions
-14.262
-11.645
DragonMagazine-11.546
enegger-11.460
-11.452
Tokenich
Token ID488
Feature activation-7.67e-01

Pos logit contributions
@#&+4.045
 deciding+3.955
 recall+3.665
 offense+3.663
 Answer+3.659
Neg logit contributions
-5.696
enegger-4.834
-4.714
DragonMagazine-4.568
Lago-4.540
Tokenuk
Token ID2724
Feature activation-8.26e-01

Pos logit contributions
@#&+3.555
 deciding+3.495
 recall+3.288
igating+3.258
 offense+3.246
Neg logit contributions
-3.892
enegger-3.285
-3.097
DragonMagazine-3.060
-2.993
Token,
Token ID11
Feature activation+8.33e-02

Pos logit contributions
@#&+3.312
 deciding+3.310
 offense+3.079
 automatic+3.044
 recall+3.026
Neg logit contributions
-4.702
enegger-3.981
-3.847
DragonMagazine-3.775
6000-3.762
Token the
Token ID262
Feature activation-4.32e-02

Pos logit contributions
+0.464
enegger+0.391
+0.380
6000+0.374
Son+0.373
Neg logit contributions
@#&-0.349
 deciding-0.336
 offense-0.313
igating-0.311
hemy-0.310
Token The
Token ID383
Feature activation-1.72e-01

Pos logit contributions
@#&+0.199
 deciding+0.195
 offense+0.181
hemy+0.180
 automatic+0.179
Neg logit contributions
-0.278
enegger-0.234
-0.227
Son-0.224
6000-0.222
Token property
Token ID3119
Feature activation-1.45e-01

Pos logit contributions
@#&+0.701
 deciding+0.680
 offense+0.631
hemy+0.624
igating+0.623
Neg logit contributions
-0.979
enegger-0.831
-0.807
Son-0.788
6000-0.787
Token now
Token ID783
Feature activation-8.19e-01

Pos logit contributions
@#&+0.666
 deciding+0.653
 offense+0.610
 automatic+0.603
igating+0.602
Neg logit contributions
-0.750
enegger-0.626
-0.604
Son-0.588
6000-0.587
Token belongs
Token ID14448
Feature activation+5.67e-01

Pos logit contributions
@#&+3.641
 deciding+3.627
 offense+3.406
 automatic+3.369
 recall+3.324
Neg logit contributions
-4.278
enegger-3.606
-3.441
DragonMagazine-3.412
6000-3.394
Token to
Token ID284
Feature activation+2.62e-01

Pos logit contributions
+2.695
enegger+2.208
+2.126
6000+2.074
Son+2.071
Neg logit contributions
@#&-2.819
 deciding-2.760
 offense-2.598
igating-2.570
hemy-2.566
Token Cit
Token ID15792
Feature activation-2.48e+00

Pos logit contributions
+1.409
enegger+1.197
+1.150
Son+1.142
6000+1.132
Neg logit contributions
@#&-1.125
 deciding-1.088
 offense-1.015
hemy-1.013
igating-1.008
Tokenib
Token ID571
Feature activation-1.64e+00

Pos logit contributions
igating+6.768
 deciding+6.509
lege+6.333
 automatic+6.167
@#&+6.128
Neg logit contributions
-15.269
enegger-13.144
DragonMagazine-13.085
-12.695
catentry-12.678
Tokenank
Token ID962
Feature activation-1.01e+00

Pos logit contributions
@#&+2.720
lege+2.542
 deciding+2.399
igating+2.272
 Answer+2.122
Neg logit contributions
-12.546
enegger-11.134
-10.975
DragonMagazine-10.920
-10.666
Token.
Token ID13
Feature activation+5.50e-02

Pos logit contributions
@#&+4.442
 deciding+4.412
 offense+4.106
 automatic+4.077
 Answer+4.062
Neg logit contributions
-5.274
enegger-4.462
-4.276
6000-4.197
DragonMagazine-4.190
Token
Token ID198
Feature activation+1.07e-02

Pos logit contributions
+0.310
enegger+0.259
+0.252
Son+0.250
6000+0.246
Neg logit contributions
@#&-0.229
 deciding-0.225
 offense-0.209
hemy-0.207
 automatic-0.206
Token
Token ID198
Feature activation+1.42e-01

Pos logit contributions
+0.067
enegger+0.058
+0.056
DragonMagazine+0.054
6000+0.054
Neg logit contributions
@#&-0.038
 deciding-0.037
 offense-0.034
 Answer-0.034
 automatic-0.034
Token names
Token ID3891
Feature activation+8.06e-01

Pos logit contributions
+3.176
enegger+2.747
+2.731
Son+2.667
+2.645
Neg logit contributions
@#&-1.499
 deciding-1.434
hemy-1.293
igating-1.286
 offense-1.280
Token also
Token ID635
Feature activation+1.98e-01

Pos logit contributions
+4.327
+3.612
enegger+3.578
Son+3.494
+3.459
Neg logit contributions
@#&-3.394
 deciding-3.222
hemy-3.098
igating-3.041
 offense-3.002
Token existed
Token ID11196
Feature activation+6.90e-01

Pos logit contributions
+1.095
enegger+0.929
+0.908
6000+0.881
Son+0.878
Neg logit contributions
@#&-0.835
 deciding-0.800
hemy-0.746
 offense-0.741
igating-0.738
Token among
Token ID1871
Feature activation+2.99e-01

Pos logit contributions
+3.405
enegger+2.803
+2.756
6000+2.669
Son+2.651
Neg logit contributions
@#&-3.267
 deciding-3.101
hemy-2.985
 Answer-2.943
 offense-2.930
Token the
Token ID262
Feature activation+3.61e-01

Pos logit contributions
+1.461
enegger+1.198
+1.185
Son+1.142
6000+1.140
Neg logit contributions
@#&-1.431
 deciding-1.381
hemy-1.320
 offense-1.303
igating-1.299
Token Turk
Token ID8484
Feature activation-2.37e+00

Pos logit contributions
+1.662
enegger+1.342
+1.324
Son+1.278
6000+1.278
Neg logit contributions
@#&-1.825
 deciding-1.763
hemy-1.693
igating-1.676
 offense-1.671
Tokenic
Token ID291
Feature activation-9.19e-02

Pos logit contributions
 deciding+8.165
@#&+8.164
 offense+7.527
 automatic+7.448
 Answer+7.338
Neg logit contributions
-13.501
enegger-12.014
-11.196
Lago-11.175
catentry-10.989
Token and
Token ID290
Feature activation+9.46e-01

Pos logit contributions
@#&+0.334
 deciding+0.325
 offense+0.298
igating+0.294
hemy+0.294
Neg logit contributions
-0.563
enegger-0.482
-0.470
6000-0.461
Son-0.460
Token Mongol
Token ID42188
Feature activation-5.24e-01

Pos logit contributions
+4.691
+3.801
enegger+3.770
Son+3.725
6000+3.716
Neg logit contributions
@#&-4.253
 deciding-4.106
hemy-3.985
igating-3.958
 offense-3.923
Token peoples
Token ID14366
Feature activation+1.48e-01

Pos logit contributions
@#&+2.639
 deciding+2.598
 offense+2.446
 automatic+2.419
 Answer+2.410
Neg logit contributions
-2.475
enegger-2.031
-1.948
6000-1.892
Son-1.887
Token (
Token ID357
Feature activation-8.38e-02

Pos logit contributions
+0.762
enegger+0.630
+0.616
6000+0.599
Son+0.594
Neg logit contributions
@#&-0.684
 deciding-0.661
 offense-0.622
hemy-0.622
 Answer-0.617
Token one
Token ID530
Feature activation-2.34e-01

Pos logit contributions
@#&+1.899
 deciding+1.847
 offense+1.701
 Answer+1.678
 automatic+1.675
Neg logit contributions
-2.869
enegger-2.457
-2.380
6000-2.330
Son-2.329
Token an
Token ID281
Feature activation+2.28e-01

Pos logit contributions
@#&+0.826
 deciding+0.787
 offense+0.719
 Answer+0.712
igating+0.711
Neg logit contributions
-1.459
enegger-1.258
-1.228
6000-1.205
Son-1.203
Token Iranian
Token ID10704
Feature activation-1.35e+00

Pos logit contributions
+1.197
enegger+0.996
+0.977
Son+0.953
6000+0.945
Neg logit contributions
@#&-1.010
 deciding-0.981
hemy-0.909
igating-0.895
lege-0.890
Token-
Token ID12
Feature activation+8.57e-02

Pos logit contributions
 deciding+5.289
@#&+5.274
 offense+5.029
 automatic+4.925
 recall+4.811
Neg logit contributions
-7.578
enegger-6.551
-6.233
DragonMagazine-6.176
6000-6.086
Tokenborn
Token ID6286
Feature activation-7.20e-01

Pos logit contributions
+0.484
enegger+0.408
+0.401
Son+0.398
6000+0.393
Neg logit contributions
@#&-0.351
 deciding-0.342
 offense-0.319
 Answer-0.314
igating-0.314
Token Turk
Token ID8484
Feature activation-2.33e+00

Pos logit contributions
@#&+3.035
 deciding+3.009
 offense+2.824
 automatic+2.784
 Answer+2.743
Neg logit contributions
-3.967
enegger-3.371
-3.239
6000-3.156
DragonMagazine-3.153
Token named
Token ID3706
Feature activation-4.02e-01

Pos logit contributions
 deciding+9.232
@#&+8.932
 offense+8.784
 automatic+8.648
 whose+8.353
Neg logit contributions
-12.136
enegger-10.530
DragonMagazine-9.853
-9.792
-9.739
Token Ali
Token ID12104
Feature activation-2.52e-01

Pos logit contributions
@#&+1.730
 deciding+1.691
 offense+1.572
igating+1.554
 automatic+1.551
Neg logit contributions
-2.200
enegger-1.858
-1.798
Son-1.760
6000-1.754
Token Kent
Token ID8758
Feature activation-6.65e-01

Pos logit contributions
@#&+1.106
 deciding+1.083
 offense+1.009
 automatic+0.996
igating+0.996
Neg logit contributions
-1.352
enegger-1.139
-1.099
Son-1.078
6000-1.070
Token.
Token ID13
Feature activation-4.49e-01

Pos logit contributions
@#&+2.938
 deciding+2.885
 offense+2.692
 automatic+2.656
 Answer+2.643
Neg logit contributions
-3.550
enegger-3.000
-2.885
6000-2.814
Son-2.812
Token The
Token ID383
Feature activation-3.12e-01

Pos logit contributions
@#&+1.725
 deciding+1.693
 offense+1.566
 Answer+1.546
 automatic+1.540
Neg logit contributions
-2.651
enegger-2.274
-2.198
6000-2.149
Son-2.140
Token in
Token ID287
Feature activation-5.38e-01

Pos logit contributions
+0.733
enegger+0.619
+0.605
Son+0.592
6000+0.592
Neg logit contributions
@#&-0.520
 deciding-0.491
hemy-0.459
 offense-0.458
 Answer-0.457
Token a
Token ID257
Feature activation-2.51e-01

Pos logit contributions
@#&+2.065
 deciding+2.037
 offense+1.880
 automatic+1.852
 Answer+1.842
Neg logit contributions
-3.179
enegger-2.723
-2.635
6000-2.576
Son-2.571
Token cable
Token ID7862
Feature activation-9.14e-02

Pos logit contributions
@#&+0.982
 deciding+0.959
 offense+0.888
 automatic+0.875
 Answer+0.872
Neg logit contributions
-1.466
enegger-1.253
-1.215
6000-1.190
Son-1.185
Token that
Token ID326
Feature activation+3.30e-02

Pos logit contributions
@#&+0.388
 deciding+0.373
 offense+0.348
hemy+0.346
 Answer+0.345
Neg logit contributions
-0.507
enegger-0.428
-0.416
6000-0.405
Son-0.405
Token Pr
Token ID1736
Feature activation-1.31e+00

Pos logit contributions
+0.179
enegger+0.149
+0.145
Son+0.142
6000+0.141
Neg logit contributions
@#&-0.145
 deciding-0.139
 offense-0.130
hemy-0.129
 Answer-0.129
Tokené
Token ID2634
Feature activation-2.31e+00

Pos logit contributions
@#&+3.796
 deciding+3.671
lege+3.489
 automatic+3.482
 Answer+3.424
Neg logit contributions
-8.569
enegger-7.374
DragonMagazine-7.211
-7.162
Lago-6.998
Tokenval
Token ID2100
Feature activation-1.29e-01

Pos logit contributions
lege+7.141
 deciding+6.958
@#&+6.667
 recall+6.628
hemy+6.451
Neg logit contributions
-13.570
enegger-12.067
DragonMagazine-11.524
-11.105
-10.999
Token
Token ID564
Feature activation+3.61e-01

Pos logit contributions
@#&+0.561
 deciding+0.548
 offense+0.512
 automatic+0.506
 Answer+0.504
Neg logit contributions
-0.699
enegger-0.590
-0.570
6000-0.555
Son-0.554
Token
Token ID250
Feature activation-3.14e-01

Pos logit contributions
+1.482
enegger+1.173
+1.134
Son+1.088
6000+1.078
Neg logit contributions
@#&-2.029
 deciding-1.995
 offense-1.888
igating-1.876
 automatic-1.869
Tokenw
Token ID86
Feature activation+4.84e-01

Pos logit contributions
@#&+1.461
 deciding+1.437
 offense+1.346
 automatic+1.330
 Answer+1.324
Neg logit contributions
-1.603
enegger-1.338
-1.287
6000-1.254
Son-1.251
Tokenants
Token ID1187
Feature activation+4.42e-01

Pos logit contributions
+2.329
enegger+1.970
+1.848
6000+1.810
Son+1.791
Neg logit contributions
@#&-2.338
 deciding-2.284
hemy-2.151
 offense-2.149
 automatic-2.135
Token had
Token ID550
Feature activation-5.83e-01

Pos logit contributions
@#&+2.467
 deciding+2.444
 offense+2.304
 automatic+2.262
igating+2.244
Neg logit contributions
-2.812
enegger-2.349
-2.264
DragonMagazine-2.222
6000-2.213
Token 21
Token ID2310
Feature activation-1.41e+00

Pos logit contributions
 deciding+2.211
@#&+2.204
 offense+2.063
 automatic+2.030
 Answer+1.983
Neg logit contributions
-3.454
enegger-2.979
-2.872
DragonMagazine-2.813
6000-2.794
Token Fantasy
Token ID10982
Feature activation-6.57e-01

Pos logit contributions
 deciding+5.013
@#&+4.909
 offense+4.712
 automatic+4.592
 recall+4.440
Neg logit contributions
-8.499
enegger-7.434
-7.063
DragonMagazine-7.046
6000-6.945
Token points
Token ID2173
Feature activation-3.71e-01

Pos logit contributions
 deciding+1.422
@#&+1.397
 offense+1.308
 automatic+1.214
 recall+1.175
Neg logit contributions
-4.971
enegger-4.461
-4.301
DragonMagazine-4.279
6000-4.201
Token vs
Token ID3691
Feature activation-4.56e-01

Pos logit contributions
@#&+1.485
 deciding+1.465
 offense+1.361
 automatic+1.338
igating+1.326
Neg logit contributions
-2.129
enegger-1.820
-1.757
6000-1.717
Son-1.713
Token MIN
Token ID20625
Feature activation-2.31e+00

Pos logit contributions
@#&+2.177
 deciding+2.114
 offense+1.965
igating+1.962
hemy+1.962
Neg logit contributions
-2.286
enegger-1.897
-1.834
Son-1.788
6000-1.774
Token in
Token ID287
Feature activation-1.09e+00

Pos logit contributions
 deciding+7.562
 offense+7.319
@#&+7.224
 automatic+6.808
 recall+6.657
Neg logit contributions
-13.804
enegger-12.332
DragonMagazine-11.752
-11.560
catentry-11.530
Token Week
Token ID6119
Feature activation-2.93e-01

Pos logit contributions
 deciding+3.914
@#&+3.829
 offense+3.629
 automatic+3.550
 recall+3.450
Neg logit contributions
-6.691
enegger-5.813
-5.599
DragonMagazine-5.530
6000-5.485
Token 8
Token ID807
Feature activation-1.52e+00

Pos logit contributions
@#&+1.234
 deciding+1.218
 offense+1.135
 Answer+1.120
 automatic+1.117
Neg logit contributions
-1.629
enegger-1.383
-1.332
DragonMagazine-1.296
6000-1.294
Token.
Token ID13
Feature activation-4.58e-01

Pos logit contributions
 deciding+6.281
@#&+6.170
 offense+5.845
 automatic+5.687
 recall+5.644
Neg logit contributions
-8.230
enegger-7.097
-6.718
DragonMagazine-6.684
6000-6.634
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
@#&+1.928
 deciding+1.893
 offense+1.765
 automatic+1.738
 Answer+1.736
Neg logit contributions
-2.537
enegger-2.151
-2.076
6000-2.023
Son-2.017
Token+)
Token ID28988
Feature activation+8.10e-01

Pos logit contributions
 deciding+0.794
@#&+0.793
 offense+0.736
igating+0.730
 automatic+0.730
Neg logit contributions
-0.830
enegger-0.692
-0.661
6000-0.648
Son-0.644
Token"]
Token ID8973
Feature activation+3.99e-01

Pos logit contributions
+3.708
enegger+3.083
+2.896
6000+2.869
Son+2.849
Neg logit contributions
 deciding-4.010
@#&-3.937
hemy-3.732
igating-3.729
 offense-3.712
Token'
Token ID6
Feature activation-1.93e-01

Pos logit contributions
+1.735
enegger+1.441
+1.348
6000+1.336
Son+1.321
Neg logit contributions
 deciding-2.073
@#&-2.062
hemy-1.937
 offense-1.936
igating-1.930
Token )
Token ID1267
Feature activation-7.02e-01

Pos logit contributions
@#&+0.894
 deciding+0.871
 offense+0.816
igating+0.812
hemy+0.806
Neg logit contributions
-0.994
enegger-0.833
-0.795
6000-0.777
Son-0.775
Token AS
Token ID7054
Feature activation-1.29e+00

Pos logit contributions
@#&+2.736
 deciding+2.681
 offense+2.481
 automatic+2.446
 Answer+2.432
Neg logit contributions
-4.114
enegger-3.521
-3.407
6000-3.329
Son-3.326
Token version
Token ID2196
Feature activation-2.30e+00

Pos logit contributions
 deciding+5.156
@#&+5.055
 offense+4.870
 automatic+4.802
 Answer+4.733
Neg logit contributions
-7.194
enegger-6.095
-5.946
Lago-5.792
DragonMagazine-5.705
Token,
Token ID837
Feature activation-9.89e-01

Pos logit contributions
@#&+8.126
 deciding+8.034
 Answer+7.538
 automatic+7.523
 offense+7.503
Neg logit contributions
-12.810
enegger-11.183
-10.664
Lago-10.528
-10.441
Token C
Token ID327
Feature activation-9.22e-01

Pos logit contributions
@#&+4.055
 deciding+4.005
 offense+3.734
 automatic+3.709
 Answer+3.665
Neg logit contributions
-5.543
enegger-4.721
-4.544
Son-4.432
6000-4.412
TokenOUNT
Token ID28270
Feature activation-6.40e-01

Pos logit contributions
@#&+5.249
 deciding+5.158
 offense+4.887
 automatic+4.881
 Answer+4.860
Neg logit contributions
-3.663
enegger-2.950
-2.773
Lago-2.665
DragonMagazine-2.649
Token (
Token ID357
Feature activation-9.32e-01

Pos logit contributions
@#&+2.873
 deciding+2.824
 offense+2.641
 automatic+2.609
 Answer+2.596
Neg logit contributions
-3.367
enegger-2.822
-2.721
Son-2.647
6000-2.646
Token *
Token ID1635
Feature activation+3.09e-01

Pos logit contributions
@#&+3.796
 deciding+3.741
 offense+3.490
 automatic+3.449
 Answer+3.415
Neg logit contributions
-5.263
enegger-4.474
-4.315
Lago-4.210
Son-4.207
Token.
Token ID13
Feature activation-2.75e-01

Pos logit contributions
+3.161
enegger+2.549
+2.492
6000+2.439
Son+2.425
Neg logit contributions
@#&-3.450
 deciding-3.226
hemy-3.097
 Answer-3.078
igating-3.076
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
@#&+1.062
 deciding+1.040
 offense+0.962
 automatic+0.948
igating+0.941
Neg logit contributions
-1.618
enegger-1.384
-1.342
Son-1.315
6000-1.314
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
+0.715
enegger+0.620
+0.596
DragonMagazine+0.581
6000+0.580
Neg logit contributions
@#&-0.425
 deciding-0.418
 offense-0.384
 automatic-0.379
 Answer-0.379
TokenRight
Token ID11028
Feature activation+8.81e-01

Pos logit contributions
+1.011
enegger+0.808
+0.791
Son+0.780
6000+0.773
Neg logit contributions
@#&-1.134
 deciding-1.119
 offense-1.058
 automatic-1.044
 Answer-1.036
Token now
Token ID783
Feature activation+1.17e-01

Pos logit contributions
+4.721
enegger+3.957
+3.861
Son+3.802
6000+3.793
Neg logit contributions
@#&-3.648
 deciding-3.527
hemy-3.310
 automatic-3.285
 offense-3.264
Token Dub
Token ID10322
Feature activation-2.26e+00

Pos logit contributions
+0.622
enegger+0.518
+0.507
Son+0.497
Lago+0.492
Neg logit contributions
@#&-0.516
 deciding-0.486
 offense-0.459
 Answer-0.455
hemy-0.455
Tokenov
Token ID709
Feature activation-8.50e-01

Pos logit contributions
 deciding+6.129
 offense+5.987
@#&+5.924
 recall+5.770
igating+5.753
Neg logit contributions
-13.983
DragonMagazine-12.251
enegger-12.014
-11.726
catentry-11.500
Tokenick
Token ID624
Feature activation-2.25e-01

Pos logit contributions
@#&+2.278
 deciding+2.083
igating+1.893
 offense+1.892
 Answer+1.870
Neg logit contributions
-6.091
enegger-5.334
-5.179
DragonMagazine-5.110
-5.046
Token has
Token ID468
Feature activation+6.34e-01

Pos logit contributions
@#&+1.055
 deciding+1.043
 offense+0.977
 automatic+0.966
 Answer+0.959
Neg logit contributions
-1.137
enegger-0.947
-0.909
6000-0.885
Son-0.883
Token most
Token ID749
Feature activation-9.89e-02

Pos logit contributions
+3.626
enegger+3.030
+3.001
Son+2.964
6000+2.959
Neg logit contributions
@#&-2.547
 deciding-2.319
hemy-2.216
igating-2.180
 Answer-2.165
Token of
Token ID286
Feature activation+5.08e-01

Pos logit contributions
@#&+0.398
 deciding+0.378
 Answer+0.354
 offense+0.351
hemy+0.348
Neg logit contributions
-0.572
enegger-0.480
-0.472
Son-0.461
6000-0.460
Token who
Token ID508
Feature activation-1.69e-01

Pos logit contributions
+0.105
enegger+0.092
+0.089
Son+0.088
6000+0.088
Neg logit contributions
@#&-0.051
 deciding-0.048
 offense-0.044
igating-0.044
hemy-0.044
Token holds
Token ID6622
Feature activation-6.82e-01

Pos logit contributions
@#&+0.742
 deciding+0.724
 offense+0.676
hemy+0.667
 automatic+0.666
Neg logit contributions
-0.902
enegger-0.767
-0.737
6000-0.719
Son-0.719
Token dual
Token ID10668
Feature activation-5.26e-01

Pos logit contributions
@#&+2.478
 deciding+2.427
 offense+2.230
 automatic+2.195
 Answer+2.181
Neg logit contributions
-4.173
enegger-3.597
-3.487
6000-3.415
Son-3.409
Token British
Token ID3517
Feature activation-5.76e-01

Pos logit contributions
@#&+2.053
 deciding+2.004
 offense+1.857
 Answer+1.820
hemy+1.820
Neg logit contributions
-3.088
enegger-2.634
-2.562
6000-2.505
Son-2.497
Token and
Token ID290
Feature activation+8.40e-01

Pos logit contributions
@#&+2.750
 deciding+2.704
 offense+2.538
 automatic+2.505
 Answer+2.497
Neg logit contributions
-2.870
enegger-2.383
-2.294
6000-2.230
Son-2.228
Token Pakistani
Token ID17807
Feature activation-2.24e+00

Pos logit contributions
+4.057
enegger+3.332
+3.312
Son+3.257
+3.252
Neg logit contributions
@#&-3.840
 deciding-3.657
 offense-3.535
igating-3.514
hemy-3.472
Token citizenship
Token ID14218
Feature activation-8.93e-01

Pos logit contributions
 deciding+7.885
@#&+7.564
 automatic+7.397
 offense+7.078
 recall+6.998
Neg logit contributions
-13.052
enegger-11.187
DragonMagazine-10.698
-10.612
6000-10.436
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
@#&+4.009
 deciding+3.954
 offense+3.688
 automatic+3.645
igating+3.618
Neg logit contributions
-4.671
enegger-3.930
-3.776
6000-3.689
Son-3.675
Token said
Token ID531
Feature activation+2.90e-01

Pos logit contributions
+0.081
enegger+0.070
+0.067
Son+0.065
6000+0.065
Neg logit contributions
@#&-0.059
 deciding-0.057
 offense-0.053
 Answer-0.053
hemy-0.052
Token of
Token ID286
Feature activation+3.78e-02

Pos logit contributions
+1.633
enegger+1.380
+1.346
Son+1.312
6000+1.309
Neg logit contributions
@#&-1.201
 deciding-1.129
 offense-1.062
igating-1.056
hemy-1.050
Token the
Token ID262
Feature activation+5.50e-02

Pos logit contributions
+0.207
enegger+0.175
+0.170
Son+0.165
6000+0.164
Neg logit contributions
@#&-0.162
 deciding-0.155
 offense-0.146
hemy-0.144
igating-0.144
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
@#&+0.715
 deciding+0.693
 offense+0.643
igating+0.639
hemy+0.638
Neg logit contributions
-0.944
enegger-0.798
-0.775
Son-0.758
6000-0.758
Token reply
Token ID10971
Feature activation-3.64e-01

Pos logit contributions
@#&+0.500
 deciding+0.486
 offense+0.451
 automatic+0.445
igating+0.444
Neg logit contributions
-0.709
enegger-0.603
-0.586
6000-0.572
Son-0.572
Token for
Token ID329
Feature activation-2.94e-01

Pos logit contributions
@#&+1.637
 deciding+1.600
 offense+1.492
 automatic+1.474
hemy+1.474
Neg logit contributions
-1.921
enegger-1.606
-1.557
Son-1.520
6000-1.518
Token four
Token ID1440
Feature activation-6.95e-01

Pos logit contributions
@#&+1.223
 deciding+1.190
 offense+1.103
igating+1.092
 automatic+1.090
Neg logit contributions
-1.649
enegger-1.399
-1.357
Son-1.326
6000-1.324
Token.
Token ID13
Feature activation-2.72e-01

Pos logit contributions
@#&+3.078
 deciding+3.065
 offense+2.856
 automatic+2.832
 Answer+2.790
Neg logit contributions
-3.662
enegger-3.105
-2.963
DragonMagazine-2.906
6000-2.898
Token Cher
Token ID19305
Feature activation-2.24e+00

Pos logit contributions
@#&+1.129
 deciding+1.109
 offense+1.030
 automatic+1.016
igating+1.012
Neg logit contributions
-1.530
enegger-1.295
-1.256
Son-1.229
6000-1.227
Tokenrene
Token ID25924
Feature activation-4.58e-02

Pos logit contributions
@#&+7.519
 deciding+7.518
igating+7.372
 recall+7.296
lege+7.290
Neg logit contributions
-12.337
DragonMagazine-10.316
-10.104
enegger-9.974
6000-9.773
Token and
Token ID290
Feature activation+2.20e-01

Pos logit contributions
@#&+0.201
 deciding+0.196
 offense+0.183
igating+0.181
 automatic+0.180
Neg logit contributions
-0.246
enegger-0.207
-0.201
Son-0.195
6000-0.195
Token I
Token ID314
Feature activation-6.22e-01

Pos logit contributions
+1.207
enegger+1.020
+0.995
Son+0.980
6000+0.973
Neg logit contributions
@#&-0.930
 deciding-0.890
igating-0.829
hemy-0.829
 offense-0.824
Token exchange
Token ID5163
Feature activation-5.39e-01

Pos logit contributions
@#&+2.245
 deciding+2.229
 offense+2.047
 automatic+2.023
 recall+2.019
Neg logit contributions
-3.815
enegger-3.277
-3.171
DragonMagazine-3.112
Son-3.101
Token worried
Token ID7960
Feature activation+3.38e-01

Pos logit contributions
@#&+2.021
 deciding+1.993
 offense+1.839
 automatic+1.811
 Answer+1.793
Neg logit contributions
-3.236
enegger-2.780
-2.689
6000-2.631
Son-2.627
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
+0.563
enegger+0.481
+0.462
Son+0.457
6000+0.455
Neg logit contributions
@#&-0.392
 deciding-0.369
 offense-0.341
igating-0.341
hemy-0.341
Token decision
Token ID2551
Feature activation+3.82e-01

Pos logit contributions
@#&+0.660
 deciding+0.632
igating+0.584
hemy+0.584
 offense+0.583
Neg logit contributions
-0.919
enegger-0.781
-0.754
6000-0.743
Son-0.742
Token making
Token ID1642
Feature activation-5.87e-01

Pos logit contributions
+1.954
enegger+1.593
+1.555
Son+1.518
6000+1.516
Neg logit contributions
@#&-1.777
 deciding-1.663
hemy-1.583
 offense-1.580
 Answer-1.578
Token and
Token ID290
Feature activation-1.94e-02

Pos logit contributions
 deciding+2.167
@#&+2.147
 offense+2.000
 automatic+1.970
 Answer+1.923
Neg logit contributions
-3.547
enegger-3.070
-2.965
6000-2.889
Son-2.886
Token document
Token ID3188
Feature activation-4.43e-01

Pos logit contributions
@#&+0.067
 deciding+0.062
hemy+0.058
 offense+0.057
igating+0.057
Neg logit contributions
-0.122
enegger-0.106
-0.103
6000-0.101
Son-0.101
Token sharing
Token ID7373
Feature activation-2.23e+00

Pos logit contributions
@#&+1.778
 deciding+1.745
 offense+1.617
 automatic+1.594
 Answer+1.585
Neg logit contributions
-2.545
enegger-2.171
-2.099
6000-2.052
Son-2.049
Token wasn
Token ID2492
Feature activation+1.26e-01

Pos logit contributions
 deciding+8.188
 offense+7.723
 automatic+7.528
 recall+7.351
@#&+7.125
Neg logit contributions
-12.566
enegger-11.194
-10.548
DragonMagazine-10.385
Lago-10.336
Token't
Token ID470
Feature activation-7.33e-01

Pos logit contributions
+0.668
enegger+0.565
+0.538
DragonMagazine+0.524
Son+0.521
Neg logit contributions
@#&-0.566
 deciding-0.557
 offense-0.520
 Answer-0.517
 automatic-0.516
Token done
Token ID1760
Feature activation-4.98e-01

Pos logit contributions
 deciding+2.702
@#&+2.678
 automatic+2.481
 offense+2.468
 recall+2.400
Neg logit contributions
-4.447
enegger-3.838
-3.715
DragonMagazine-3.636
6000-3.605
Token timely
Token ID19376
Feature activation-3.31e-02

Pos logit contributions
@#&+1.881
 deciding+1.874
 offense+1.725
 automatic+1.711
 Answer+1.678
Neg logit contributions
-2.973
enegger-2.552
-2.469
DragonMagazine-2.412
6000-2.405
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
@#&+0.142
 deciding+0.138
 offense+0.129
hemy+0.127
 automatic+0.127
Neg logit contributions
-0.181
enegger-0.153
-0.148
6000-0.145
Son-0.144
Token (
Token ID357
Feature activation+8.54e-02

Pos logit contributions
@#&+2.348
 deciding+2.307
 offense+2.147
 automatic+2.118
 Answer+2.107
Neg logit contributions
-3.081
enegger-2.610
-2.521
6000-2.462
Son-2.457
TokenO
Token ID46
Feature activation+6.20e-01

Pos logit contributions
+0.385
enegger+0.313
+0.299
6000+0.290
Son+0.289
Neg logit contributions
@#&-0.448
 deciding-0.442
 offense-0.417
 automatic-0.413
 Answer-0.411
Tokenmega
Token ID13731
Feature activation+4.19e-02

Pos logit contributions
+2.644
enegger+2.045
+2.039
6000+1.994
Son+1.986
Neg logit contributions
@#&-3.371
 deciding-3.307
 offense-3.168
 Answer-3.093
hemy-3.086
Token Pharma
Token ID41845
Feature activation+3.40e-01

Pos logit contributions
+0.268
enegger+0.231
+0.225
DragonMagazine+0.220
Son+0.219
Neg logit contributions
@#&-0.141
 deciding-0.139
 offense-0.127
 automatic-0.125
 Answer-0.124
Token -
Token ID532
Feature activation-6.75e-01

Pos logit contributions
+1.737
enegger+1.451
+1.395
6000+1.359
Son+1.355
Neg logit contributions
@#&-1.575
 deciding-1.551
 offense-1.453
 automatic-1.436
 Answer-1.430
Token Quick
Token ID12029
Feature activation-2.22e+00

Pos logit contributions
@#&+1.891
 deciding+1.876
 Answer+1.726
 automatic+1.690
 offense+1.644
Neg logit contributions
-4.619
enegger-4.121
DragonMagazine-3.948
-3.946
6000-3.844
Tokenstep
Token ID9662
Feature activation-4.18e-01

Pos logit contributions
 deciding+4.114
 Answer+4.040
lege+3.955
 recall+3.857
@#&+3.707
Neg logit contributions
-15.799
enegger-14.595
-14.007
DragonMagazine-13.716
-13.341
Token)
Token ID8
Feature activation-3.15e-02

Pos logit contributions
@#&+1.674
 deciding+1.667
 Answer+1.568
 offense+1.558
 automatic+1.541
Neg logit contributions
-2.355
enegger-2.022
-1.924
DragonMagazine-1.900
catentry-1.856
Token s
Token ID264
Feature activation+6.06e-01

Pos logit contributions
@#&+0.143
 deciding+0.137
hemy+0.129
 offense+0.127
lege+0.127
Neg logit contributions
-0.166
enegger-0.139
-0.136
Son-0.133
-0.130
Token.
Token ID13
Feature activation-6.39e-01

Pos logit contributions
+2.945
enegger+2.424
+2.348
6000+2.304
Son+2.295
Neg logit contributions
@#&-2.948
 deciding-2.889
 offense-2.715
igating-2.677
 automatic-2.677
Tokent
Token ID83
Feature activation+5.69e-01

Pos logit contributions
@#&+2.353
 deciding+2.319
 Answer+2.137
 automatic+2.120
 offense+2.114
Neg logit contributions
-3.802
enegger-3.319
DragonMagazine-3.155
-3.154
-3.041
Token also
Token ID635
Feature activation+2.57e-02

Pos logit contributions
+1.855
enegger+1.568
+1.536
Son+1.532
6000+1.510
Neg logit contributions
@#&-1.306
 deciding-1.231
igating-1.170
hemy-1.146
 offense-1.144
Token about
Token ID546
Feature activation-6.54e-01

Pos logit contributions
+0.145
enegger+0.123
+0.120
Son+0.119
6000+0.117
Neg logit contributions
@#&-0.105
 deciding-0.101
igating-0.094
 offense-0.094
hemy-0.093
Token forced
Token ID4137
Feature activation+6.46e-01

Pos logit contributions
@#&+2.258
 deciding+2.231
 offense+2.044
 automatic+2.012
 Answer+1.982
Neg logit contributions
-4.108
enegger-3.562
-3.448
6000-3.380
Son-3.365
Token sex
Token ID1714
Feature activation-6.30e-02

Pos logit contributions
+4.003
enegger+3.417
+3.378
Son+3.327
6000+3.297
Neg logit contributions
@#&-2.264
 deciding-2.011
igating-1.908
 Answer-1.900
hemy-1.879
Token,
Token ID11
Feature activation-2.27e-01

Pos logit contributions
@#&+0.300
 deciding+0.290
 offense+0.269
hemy+0.269
 Answer+0.269
Neg logit contributions
-0.317
enegger-0.263
-0.253
Son-0.247
6000-0.245
Token Zak
Token ID32605
Feature activation-2.22e+00

Pos logit contributions
@#&+0.993
 deciding+0.969
 offense+0.904
 automatic+0.893
igating+0.891
Neg logit contributions
-1.224
enegger-1.030
-0.995
6000-0.972
Son-0.971
Tokenov
Token ID709
Feature activation-1.19e+00

Pos logit contributions
 deciding+5.643
@#&+5.610
 recall+5.408
igating+5.328
 offense+5.311
Neg logit contributions
-14.345
DragonMagazine-12.296
-11.929
MIC-11.706
-11.694
Tokenitch
Token ID2007
Feature activation-2.88e-01

Pos logit contributions
@#&+4.857
 deciding+4.692
hemy+4.466
 Answer+4.445
 offense+4.395
Neg logit contributions
-6.640
enegger-5.543
-5.401
DragonMagazine-5.190
Son-5.081
Token suggests
Token ID5644
Feature activation+9.30e-01

Pos logit contributions
@#&+1.184
 deciding+1.175
 offense+1.090
 automatic+1.075
 Answer+1.067
Neg logit contributions
-1.626
enegger-1.379
-1.332
DragonMagazine-1.300
6000-1.299
Token literary
Token ID16716
Feature activation+7.90e-01

Pos logit contributions
+5.530
enegger+4.675
Son+4.613
+4.609
+4.546
Neg logit contributions
@#&-3.308
 deciding-2.912
hemy-2.850
igating-2.794
 Answer-2.745
Token ass
Token ID840
Feature activation+4.61e-01

Pos logit contributions
+4.577
enegger+3.885
+3.858
Son+3.758
+3.724
Neg logit contributions
@#&-3.047
 deciding-2.683
hemy-2.627
igating-2.592
lege-2.495
Token
Token ID198
Feature activation-3.05e-01

Pos logit contributions
@#&+2.097
 deciding+2.024
 offense+1.871
 Answer+1.849
 automatic+1.841
Neg logit contributions
-3.125
enegger-2.658
-2.585
6000-2.526
Son-2.522
Token
Token ID198
Feature activation-1.58e-01

Pos logit contributions
@#&+0.997
 deciding+0.996
 Answer+0.910
 offense+0.906
 automatic+0.902
Neg logit contributions
-1.962
enegger-1.730
-1.646
DragonMagazine-1.618
catentry-1.594
TokenM
Token ID44
Feature activation+1.12e+00

Pos logit contributions
@#&+0.841
 deciding+0.829
 offense+0.785
 automatic+0.781
 Answer+0.776
Neg logit contributions
-0.688
enegger-0.566
-0.528
6000-0.508
DragonMagazine-0.504
Tokenud
Token ID463
Feature activation-9.86e-01

Pos logit contributions
+4.806
6000+3.912
enegger+3.884
asters+3.754
+3.738
Neg logit contributions
@#&-5.781
 deciding-5.648
 Answer-5.268
 offense-5.217
 automatic-5.196
Token and
Token ID290
Feature activation-4.21e-01

Pos logit contributions
 deciding+4.231
@#&+4.197
 offense+3.957
 automatic+3.927
igating+3.866
Neg logit contributions
-5.267
enegger-4.479
-4.283
6000-4.195
Son-4.170
Token sand
Token ID6450
Feature activation-2.21e+00

Pos logit contributions
@#&+1.885
 deciding+1.839
 offense+1.715
igating+1.691
 automatic+1.690
Neg logit contributions
-2.230
enegger-1.869
-1.805
6000-1.762
Son-1.761
Token mode
Token ID4235
Feature activation-1.74e-01

Pos logit contributions
 deciding+8.795
 automatic+8.396
 offense+8.326
igating+8.010
@#&+8.010
Neg logit contributions
-11.474
enegger-9.896
DragonMagazine-9.403
-9.377
catentry-9.248
Token for
Token ID329
Feature activation+4.12e-01

Pos logit contributions
@#&+0.794
 deciding+0.762
hemy+0.712
 offense+0.711
 Answer+0.708
Neg logit contributions
-0.910
enegger-0.753
-0.738
6000-0.722
Son-0.715
Token muddy
Token ID39085
Feature activation+1.18e-01

Pos logit contributions
+2.272
enegger+1.924
+1.861
6000+1.832
Son+1.821
Neg logit contributions
@#&-1.740
 deciding-1.609
hemy-1.545
 Answer-1.520
igating-1.503
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
+0.668
enegger+0.566
+0.549
6000+0.535
Son+0.532
Neg logit contributions
@#&-0.490
 deciding-0.460
 Answer-0.431
hemy-0.430
 offense-0.427
Token sandy
Token ID44039
Feature activation-1.19e+00

Pos logit contributions
+5.140
enegger+4.304
+4.109
Son+4.072
6000+4.057
Neg logit contributions
@#&-4.672
 deciding-4.257
hemy-4.142
 Answer-4.136
igating-4.041
Tokenis
Token ID271
Feature activation-2.82e-01

Pos logit contributions
+5.227
enegger+4.392
+4.103
6000+4.099
Son+4.048
Neg logit contributions
 deciding-5.606
@#&-5.573
 offense-5.238
 Answer-5.136
 automatic-5.108
Tokenar
Token ID283
Feature activation+4.02e-01

Pos logit contributions
@#&+1.274
 deciding+1.252
 offense+1.169
 automatic+1.152
 Answer+1.150
Neg logit contributions
-1.479
enegger-1.248
-1.196
Son-1.170
6000-1.167
Token Malik
Token ID31745
Feature activation-7.73e-01

Pos logit contributions
+2.144
enegger+1.795
+1.738
Son+1.726
6000+1.719
Neg logit contributions
@#&-1.756
 deciding-1.715
 offense-1.586
igating-1.576
 Answer-1.564
Token,
Token ID11
Feature activation+3.91e-01

Pos logit contributions
@#&+3.416
 deciding+3.385
 offense+3.171
 automatic+3.128
 Answer+3.105
Neg logit contributions
-4.090
enegger-3.429
-3.298
6000-3.222
Son-3.201
Token a
Token ID257
Feature activation+3.40e-04

Pos logit contributions
+2.233
enegger+1.925
+1.867
Son+1.842
+1.820
Neg logit contributions
@#&-1.533
 deciding-1.457
hemy-1.382
igating-1.353
 offense-1.348
Token Pakistani
Token ID17807
Feature activation-2.18e+00

Pos logit contributions
+0.002
enegger+0.002
+0.002
6000+0.002
Son+0.002
Neg logit contributions
@#&-0.001
 deciding-0.001
 offense-0.001
hemy-0.001
igating-0.001
Token journalist
Token ID10099
Feature activation-3.40e-01

Pos logit contributions
 deciding+8.554
@#&+8.291
 offense+8.036
 automatic+8.028
 recall+7.763
Neg logit contributions
-11.808
enegger-10.038
-9.564
Lago-9.306
Son-9.301
Token,
Token ID11
Feature activation+1.76e-01

Pos logit contributions
@#&+1.440
 deciding+1.397
 offense+1.305
 Answer+1.288
 automatic+1.287
Neg logit contributions
-1.876
enegger-1.588
-1.539
6000-1.507
Son-1.504
Token and
Token ID290
Feature activation+2.53e-01

Pos logit contributions
+0.981
enegger+0.840
+0.816
Son+0.799
6000+0.797
Neg logit contributions
@#&-0.727
 deciding-0.693
hemy-0.649
 offense-0.643
igating-0.642
Token Mark
Token ID2940
Feature activation+4.84e-01

Pos logit contributions
+1.364
enegger+1.159
+1.122
Son+1.111
6000+1.102
Neg logit contributions
@#&-1.091
 deciding-1.038
hemy-0.971
igating-0.969
 offense-0.965
Token Smith
Token ID4176
Feature activation-1.47e-01

Pos logit contributions
+2.473
enegger+2.077
+1.999
Son+1.975
6000+1.962
Neg logit contributions
@#&-2.177
 deciding-2.147
 offense-2.010
 automatic-1.978
hemy-1.966
TokenART
Token ID7227
Feature activation-1.06e+00

Pos logit contributions
@#&+0.685
 deciding+0.670
 offense+0.638
 Answer+0.635
 automatic+0.635
Neg logit contributions
-0.491
enegger-0.400
-0.365
DragonMagazine-0.357
6000-0.351
Token()
Token ID3419
Feature activation-3.98e-01

Pos logit contributions
@#&+4.574
 deciding+4.523
 offense+4.210
 automatic+4.192
 Answer+4.127
Neg logit contributions
-5.653
enegger-4.777
-4.570
DragonMagazine-4.506
Son-4.496
Token above
Token ID2029
Feature activation-6.16e-01

Pos logit contributions
@#&+1.810
 deciding+1.782
 offense+1.667
 automatic+1.647
 Answer+1.638
Neg logit contributions
-2.075
enegger-1.739
-1.674
Son-1.630
6000-1.629
Token is
Token ID318
Feature activation-3.05e-01

Pos logit contributions
@#&+2.767
 deciding+2.756
 offense+2.574
 automatic+2.549
 Answer+2.523
Neg logit contributions
-3.225
enegger-2.697
-2.601
DragonMagazine-2.533
Son-2.532
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
@#&+1.163
 deciding+1.140
 offense+1.052
 automatic+1.036
 Answer+1.030
Neg logit contributions
-1.813
enegger-1.555
-1.506
6000-1.474
Son-1.471
Token version
Token ID2196
Feature activation-2.18e+00

Pos logit contributions
 deciding+3.996
@#&+3.965
 automatic+3.690
 offense+3.681
 Answer+3.597
Neg logit contributions
-6.030
enegger-5.159
-5.017
Lago-4.904
DragonMagazine-4.900
Token number
Token ID1271
Feature activation-6.97e-01

Pos logit contributions
 deciding+7.599
@#&+7.480
 automatic+6.998
 offense+6.955
 recall+6.889
Neg logit contributions
-12.706
enegger-11.133
-10.723
DragonMagazine-10.440
Son-10.327
Token.
Token ID13
Feature activation-4.05e-01

Pos logit contributions
@#&+2.762
 deciding+2.727
 offense+2.520
 automatic+2.493
 Answer+2.471
Neg logit contributions
-4.022
enegger-3.437
-3.331
Son-3.247
6000-3.237
Token I
Token ID314
Feature activation-4.93e-01

Pos logit contributions
@#&+1.510
 deciding+1.479
 offense+1.363
 automatic+1.341
 Answer+1.334
Neg logit contributions
-2.436
enegger-2.095
-2.029
6000-1.987
Son-1.983
Token want
Token ID765
Feature activation-3.45e-01

Pos logit contributions
@#&+1.896
 deciding+1.866
 offense+1.719
 automatic+1.697
 recall+1.690
Neg logit contributions
-2.912
enegger-2.506
-2.406
Son-2.364
DragonMagazine-2.359
Token to
Token ID284
Feature activation-5.68e-01

Pos logit contributions
@#&+1.539
 deciding+1.510
 offense+1.405
igating+1.396
hemy+1.393
Neg logit contributions
-1.824
enegger-1.523
-1.475
6000-1.454
Son-1.435
Token the
Token ID262
Feature activation+3.57e-01

Pos logit contributions
+4.022
enegger+3.350
Son+3.322
+3.303
6000+3.264
Neg logit contributions
@#&-2.771
 deciding-2.553
igating-2.462
hemy-2.421
lege-2.387
Token other
Token ID584
Feature activation+8.29e-01

Pos logit contributions
+1.999
enegger+1.686
+1.656
Son+1.645
6000+1.616
Neg logit contributions
@#&-1.457
 deciding-1.365
igating-1.293
hemy-1.284
lege-1.272
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
+4.515
enegger+3.794
Son+3.741
+3.716
6000+3.680
Neg logit contributions
@#&-3.434
 deciding-3.127
hemy-2.998
igating-2.972
lege-2.918
Token See
Token ID4091
Feature activation-1.53e-01

Pos logit contributions
+0.705
enegger+0.594
+0.580
Son+0.570
6000+0.567
Neg logit contributions
@#&-0.517
 deciding-0.507
 offense-0.470
 automatic-0.464
hemy-0.464
Token,
Token ID11
Feature activation+5.07e-01

Pos logit contributions
@#&+0.710
 deciding+0.691
 offense+0.644
igating+0.638
 automatic+0.637
Neg logit contributions
-0.784
enegger-0.655
-0.633
Son-0.618
6000-0.613
Token Zak
Token ID32605
Feature activation-2.18e+00

Pos logit contributions
+2.604
enegger+2.173
+2.140
Son+2.094
+2.085
Neg logit contributions
@#&-2.263
 deciding-2.162
igating-2.039
 offense-2.025
hemy-2.014
Tokenov
Token ID709
Feature activation-1.05e+00

Pos logit contributions
 deciding+5.552
@#&+5.543
 recall+5.506
 offense+5.337
igating+5.310
Neg logit contributions
-14.181
DragonMagazine-11.928
-11.718
enegger-11.572
MIC-11.468
Tokenitch
Token ID2007
Feature activation-1.79e-02

Pos logit contributions
@#&+4.348
 deciding+4.238
hemy+4.036
 Answer+4.013
 offense+3.995
Neg logit contributions
-5.838
enegger-4.818
-4.702
DragonMagazine-4.521
Son-4.427
Token,
Token ID11
Feature activation+3.33e-01

Pos logit contributions
@#&+0.084
 deciding+0.081
 offense+0.076
 automatic+0.076
igating+0.075
Neg logit contributions
-0.091
enegger-0.076
-0.074
Son-0.072
6000-0.072
Token
Token ID564
Feature activation+8.79e-01

Pos logit contributions
+1.736
enegger+1.452
+1.417
Son+1.406
+1.375
Neg logit contributions
@#&-1.478
 deciding-1.436
 offense-1.332
 automatic-1.328
hemy-1.327
Token
Token ID250
Feature activation-1.51e-01

Pos logit contributions
+3.364
enegger+2.541
+2.532
Son+2.406
6000+2.365
Neg logit contributions
@#&-5.120
 deciding-5.034
 offense-4.789
hemy-4.728
igating-4.727
Token }
Token ID1782
Feature activation-6.79e-01

Pos logit contributions
@#&+3.673
 deciding+3.598
 offense+3.321
 automatic+3.283
 Answer+3.258
Neg logit contributions
-6.242
enegger-5.376
-5.214
Son-5.079
DragonMagazine-5.068
Token }
Token ID1782
Feature activation-3.66e-01

Pos logit contributions
@#&+2.610
 deciding+2.558
 offense+2.363
 automatic+2.327
 Answer+2.315
Neg logit contributions
-4.013
enegger-3.439
-3.329
6000-3.257
Son-3.251
Token Catch
Token ID25750
Feature activation+8.92e-01

Pos logit contributions
@#&+1.403
 deciding+1.379
 offense+1.267
hemy+1.260
igating+1.255
Neg logit contributions
-2.148
enegger-1.840
-1.786
6000-1.765
Son-1.755
Token {
Token ID1391
Feature activation-3.60e-01

Pos logit contributions
+4.767
enegger+3.985
+3.944
Son+3.893
6000+3.886
Neg logit contributions
@#&-3.723
 deciding-3.684
hemy-3.494
igating-3.394
 offense-3.378
Token Write
Token ID19430
Feature activation-5.32e-02

Pos logit contributions
@#&+0.873
 deciding+0.849
hemy+0.730
 offense+0.721
igating+0.720
Neg logit contributions
-2.624
enegger-2.333
-2.269
6000-2.261
Son-2.255
Token-
Token ID12
Feature activation-2.18e+00

Pos logit contributions
@#&+0.258
 deciding+0.254
 Answer+0.241
 offense+0.241
 automatic+0.238
Neg logit contributions
-0.259
enegger-0.218
-0.204
DragonMagazine-0.196
6000-0.195
TokenWarning
Token ID20361
Feature activation+2.25e-01

Pos logit contributions
@#&+7.053
 Answer+6.862
 offense+6.686
 deciding+6.474
 automatic+6.459
Neg logit contributions
-11.997
enegger-11.514
izons-10.235
-10.017
-9.748
Token -
Token ID532
Feature activation-1.12e+00

Pos logit contributions
+1.161
enegger+0.971
+0.934
6000+0.908
Son+0.905
Neg logit contributions
@#&-1.036
 deciding-1.019
 offense-0.956
 automatic-0.944
 Answer-0.942
TokenMessage
Token ID12837
Feature activation-1.61e-01

Pos logit contributions
@#&+3.528
 Answer+3.426
 deciding+3.288
 offense+3.160
 automatic+3.116
Neg logit contributions
-6.933
enegger-6.431
-5.901
izons-5.773
-5.604
Token "
Token ID366
Feature activation-5.24e-01

Pos logit contributions
@#&+0.769
 deciding+0.754
 offense+0.704
hemy+0.701
igating+0.698
Neg logit contributions
-0.807
enegger-0.676
-0.643
6000-0.631
Son-0.629
TokenF
Token ID37
Feature activation+2.27e-01

Pos logit contributions
@#&+2.614
 deciding+2.571
 offense+2.421
 automatic+2.392
 Answer+2.381
Neg logit contributions
-2.508
enegger-2.055
-1.979
6000-1.926
Son-1.925
Tokenv
Token ID85
Feature activation+6.97e-01

Pos logit contributions
@#&+1.840
 deciding+1.812
 offense+1.689
 automatic+1.667
 Answer+1.656
Neg logit contributions
-2.323
enegger-1.967
-1.893
6000-1.849
Son-1.847
Token4
Token ID19
Feature activation-2.26e-02

Pos logit contributions
+2.624
6000+1.966
enegger+1.939
+1.917
+1.834
Neg logit contributions
@#&-4.104
 deciding-3.973
 offense-3.797
 Answer-3.792
hemy-3.768
Token against
Token ID1028
Feature activation-2.90e-02

Pos logit contributions
@#&+0.098
 deciding+0.095
 offense+0.088
 Answer+0.087
 automatic+0.087
Neg logit contributions
-0.123
enegger-0.104
-0.100
6000-0.098
Son-0.098
Token pre
Token ID662
Feature activation+6.07e-01

Pos logit contributions
@#&+0.108
 deciding+0.105
igating+0.095
 offense+0.095
hemy+0.095
Neg logit contributions
-0.175
enegger-0.150
-0.146
Son-0.143
6000-0.143
Token-
Token ID12
Feature activation+3.71e-03

Pos logit contributions
+2.992
enegger+2.407
Son+2.387
+2.385
6000+2.324
Neg logit contributions
@#&-2.872
 deciding-2.756
igating-2.608
 Answer-2.587
 offense-2.578
Tokenagh
Token ID10471
Feature activation+1.42e+00

Pos logit contributions
+0.018
enegger+0.015
+0.014
Son+0.014
6000+0.014
Neg logit contributions
@#&-0.018
 deciding-0.018
 offense-0.017
 Answer-0.017
 automatic-0.017
Tokens
Token ID82
Feature activation+6.66e-02

Pos logit contributions
+7.015
6000+5.926
enegger+5.865
+5.712
Son+5.635
Neg logit contributions
@#&-6.085
 deciding-5.669
 Answer-5.429
 Explain-5.336
igating-5.325
Token inv
Token ID800
Feature activation+4.35e-02

Pos logit contributions
+0.359
enegger+0.305
+0.294
6000+0.287
Son+0.285
Neg logit contributions
@#&-0.289
 deciding-0.281
 offense-0.260
 Answer-0.260
igating-0.259
Tokenoker
Token ID11020
Feature activation+4.78e-01

Pos logit contributions
+0.229
enegger+0.197
+0.186
6000+0.184
Son+0.180
Neg logit contributions
@#&-0.195
 deciding-0.190
 automatic-0.176
 Answer-0.175
 offense-0.175
Token is
Token ID318
Feature activation+1.59e-01

Pos logit contributions
+2.451
enegger+2.064
+2.001
Son+1.966
6000+1.957
Neg logit contributions
@#&-2.155
 deciding-2.044
igating-1.931
hemy-1.917
 Answer-1.917
Token crucial
Token ID8780
Feature activation-7.34e-01

Pos logit contributions
+0.940
enegger+0.801
+0.775
6000+0.770
Son+0.764
Neg logit contributions
@#&-0.618
 deciding-0.578
 offense-0.536
igating-0.536
 Answer-0.536
Token
Token ID247
Feature activation-5.12e-01

Pos logit contributions
@#&+0.848
 deciding+0.838
 offense+0.785
igating+0.776
 automatic+0.774
Neg logit contributions
-0.851
enegger-0.696
-0.679
Son-0.656
6000-0.653
Tokent
Token ID83
Feature activation+5.61e-02

Pos logit contributions
@#&+2.202
 deciding+2.165
 offense+2.016
 automatic+1.989
 Answer+1.980
Neg logit contributions
-2.787
enegger-2.356
-2.273
6000-2.217
Son-2.214
Token live
Token ID2107
Feature activation-2.35e-01

Pos logit contributions
+0.348
enegger+0.301
+0.291
6000+0.287
Son+0.285
Neg logit contributions
@#&-0.198
 deciding-0.192
 offense-0.176
igating-0.174
hemy-0.174
Token in
Token ID287
Feature activation-1.54e-01

Pos logit contributions
@#&+0.985
 deciding+0.967
 offense+0.899
 automatic+0.887
 Answer+0.882
Neg logit contributions
-1.308
enegger-1.110
-1.072
6000-1.047
Son-1.045
Token an
Token ID281
Feature activation+1.89e-01

Pos logit contributions
@#&+0.552
 deciding+0.535
 offense+0.488
igating+0.484
hemy+0.483
Neg logit contributions
-0.949
enegger-0.815
-0.789
6000-0.780
Son-0.777
Token Orwell
Token ID32226
Feature activation+1.48e+00

Pos logit contributions
+1.083
enegger+0.917
+0.887
DragonMagazine+0.868
6000+0.865
Neg logit contributions
@#&-0.756
 deciding-0.731
hemy-0.667
igating-0.663
 recall-0.660
Tokenian
Token ID666
Feature activation+2.31e-01

Pos logit contributions
+5.282
+4.173
+3.935
enegger+3.896
DragonMagazine+3.751
Neg logit contributions
@#&-8.369
 deciding-7.988
igating-7.951
hemy-7.907
lege-7.815
Token police
Token ID1644
Feature activation+2.89e-01

Pos logit contributions
+1.137
enegger+0.937
+0.903
6000+0.884
Son+0.878
Neg logit contributions
@#&-1.114
 deciding-1.075
igating-1.010
 offense-1.006
hemy-1.004
Token state
Token ID1181
Feature activation+1.03e-01

Pos logit contributions
+1.802
enegger+1.536
+1.509
6000+1.477
Son+1.472
Neg logit contributions
@#&-1.021
 deciding-0.959
igating-0.887
hemy-0.881
 Answer-0.877
Token,
Token ID11
Feature activation-1.71e-01

Pos logit contributions
+0.553
enegger+0.462
+0.448
6000+0.437
Son+0.436
Neg logit contributions
@#&-0.451
 deciding-0.432
igating-0.407
 offense-0.406
hemy-0.404
Token
Token ID447
Feature activation-3.21e-01

Pos logit contributions
@#&+0.677
 deciding+0.654
 offense+0.608
igating+0.603
hemy+0.601
Neg logit contributions
-0.994
enegger-0.846
-0.822
6000-0.803
Son-0.802
Token environments
Token ID12493
Feature activation-1.32e-01

Pos logit contributions
+0.044
enegger+0.036
+0.034
6000+0.034
Son+0.033
Neg logit contributions
@#&-0.046
 deciding-0.045
 offense-0.043
hemy-0.042
 Answer-0.042
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
@#&+0.638
 deciding+0.621
 offense+0.585
hemy+0.581
igating+0.579
Neg logit contributions
-0.654
enegger-0.537
-0.520
6000-0.508
Son-0.503
Token '
Token ID705
Feature activation+3.95e-01

Pos logit contributions
+0.661
enegger+0.560
+0.549
Son+0.537
6000+0.537
Neg logit contributions
@#&-0.474
 deciding-0.462
 offense-0.432
hemy-0.426
 automatic-0.426
TokenVery
Token ID16371
Feature activation+5.84e-01

Pos logit contributions
+1.737
enegger+1.387
+1.343
Son+1.314
6000+1.307
Neg logit contributions
@#&-2.115
 deciding-2.088
 offense-1.980
 automatic-1.954
hemy-1.948
Token little
Token ID1310
Feature activation+1.27e+00

Pos logit contributions
+3.205
enegger+2.685
+2.614
Son+2.581
6000+2.576
Neg logit contributions
@#&-2.461
 deciding-2.348
hemy-2.230
 offense-2.215
igating-2.201
Token was
Token ID373
Feature activation+1.47e+00

Pos logit contributions
+6.282
enegger+5.242
+5.057
6000+5.044
Son+5.024
Neg logit contributions
@#&-5.553
hemy-5.252
 deciding-5.210
 Answer-4.895
igating-4.873
Token known
Token ID1900
Feature activation+5.65e-01

Pos logit contributions
+7.875
enegger+6.529
6000+6.513
+6.396
Son+6.386
Neg logit contributions
@#&-6.128
hemy-5.596
 deciding-5.414
igating-5.301
lege-5.219
Token about
Token ID546
Feature activation+5.10e-04

Pos logit contributions
+2.963
enegger+2.459
+2.414
6000+2.372
Son+2.342
Neg logit contributions
@#&-2.515
 deciding-2.382
hemy-2.292
 offense-2.261
 Answer-2.236
Token the
Token ID262
Feature activation+1.24e-01

Pos logit contributions
+0.003
enegger+0.003
+0.002
6000+0.002
Son+0.002
Neg logit contributions
@#&-0.002
 deciding-0.002
 offense-0.002
hemy-0.002
 automatic-0.002
Token native
Token ID6868
Feature activation-7.34e-01

Pos logit contributions
+0.684
enegger+0.576
+0.562
6000+0.549
Son+0.548
Neg logit contributions
@#&-0.524
 deciding-0.507
 offense-0.472
hemy-0.470
igating-0.467
Token monkey
Token ID21657
Feature activation+5.64e-01

Pos logit contributions
 deciding+3.190
@#&+3.186
 offense+2.988
 automatic+2.949
 Answer+2.917
Neg logit contributions
-3.927
enegger-3.345
-3.175
DragonMagazine-3.157
catentry-3.108
Token the
Token ID262
Feature activation+4.70e-01

Pos logit contributions
+1.199
enegger+1.024
+0.990
Son+0.972
6000+0.969
Neg logit contributions
@#&-0.889
 deciding-0.846
hemy-0.800
 offense-0.784
igating-0.780
Token years
Token ID812
Feature activation-1.96e-01

Pos logit contributions
+2.440
enegger+2.053
+2.020
6000+1.975
Son+1.975
Neg logit contributions
@#&-2.072
 deciding-1.952
hemy-1.897
igating-1.861
lege-1.840
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
@#&+0.915
 deciding+0.891
 offense+0.839
 automatic+0.828
 Answer+0.827
Neg logit contributions
-1.000
enegger-0.831
-0.802
6000-0.781
Son-0.780
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
+1.061
enegger+0.899
+0.872
Son+0.859
6000+0.854
Neg logit contributions
@#&-0.769
 deciding-0.756
 offense-0.700
hemy-0.697
 automatic-0.694
Token
Token ID198
Feature activation+1.63e-01

Pos logit contributions
+0.906
enegger+0.786
+0.755
DragonMagazine+0.737
6000+0.736
Neg logit contributions
@#&-0.539
 deciding-0.529
 offense-0.486
 automatic-0.481
 Answer-0.480
TokenTom
Token ID13787
Feature activation+1.69e+00

Pos logit contributions
+0.728
enegger+0.579
+0.563
Son+0.554
6000+0.549
Neg logit contributions
@#&-0.865
 deciding-0.856
 offense-0.809
 automatic-0.799
igating-0.795
Token C
Token ID327
Feature activation+8.89e-02

Pos logit contributions
+8.273
enegger+7.082
Son+6.767
+6.657
6000+6.465
Neg logit contributions
@#&-7.231
 deciding-6.960
hemy-6.528
 offense-6.501
 automatic-6.441
Tokenray
Token ID2433
Feature activation-3.43e-01

Pos logit contributions
+0.428
enegger+0.356
+0.338
6000+0.333
Son+0.332
Neg logit contributions
@#&-0.438
 deciding-0.430
 offense-0.403
 automatic-0.399
 Answer-0.395
Token was
Token ID373
Feature activation+7.44e-01

Pos logit contributions
@#&+1.437
 deciding+1.404
 offense+1.306
 automatic+1.287
igating+1.283
Neg logit contributions
-1.915
enegger-1.633
-1.571
6000-1.536
Son-1.534
Token one
Token ID530
Feature activation-4.80e-01

Pos logit contributions
+4.143
enegger+3.536
+3.399
Son+3.389
6000+3.345
Neg logit contributions
@#&-2.971
 deciding-2.747
hemy-2.675
igating-2.632
 offense-2.611
Token of
Token ID286
Feature activation+1.23e-01

Pos logit contributions
@#&+1.893
 deciding+1.831
hemy+1.706
 offense+1.690
igating+1.681
Neg logit contributions
-2.796
enegger-2.363
-2.314
Son-2.260
6000-2.256
Token was
Token ID373
Feature activation+1.33e-01

Pos logit contributions
+0.802
enegger+0.683
+0.663
6000+0.645
Son+0.645
Neg logit contributions
@#&-0.580
 deciding-0.555
 offense-0.517
hemy-0.517
 Answer-0.516
Token governor
Token ID8153
Feature activation-2.17e-01

Pos logit contributions
+0.837
enegger+0.727
+0.702
6000+0.694
Son+0.693
Neg logit contributions
@#&-0.461
 deciding-0.428
hemy-0.405
 offense-0.399
igating-0.395
Token.
Token ID13
Feature activation-1.24e-01

Pos logit contributions
@#&+1.014
 deciding+0.983
 offense+0.933
hemy+0.927
 Answer+0.923
Neg logit contributions
-1.104
enegger-0.914
-0.888
6000-0.860
Son-0.858
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
@#&+0.498
 deciding+0.487
 offense+0.453
 automatic+0.445
hemy+0.444
Neg logit contributions
-0.717
enegger-0.609
-0.590
6000-0.579
Son-0.579
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
@#&+0.374
 deciding+0.367
 Answer+0.338
 offense+0.335
 automatic+0.333
Neg logit contributions
-0.681
enegger-0.598
-0.570
DragonMagazine-0.561
6000-0.552
TokenAdvertisement
Token ID4723
Feature activation+1.71e+00

Pos logit contributions
+0.675
enegger+0.528
+0.516
Son+0.506
6000+0.502
Neg logit contributions
@#&-0.859
 deciding-0.849
 offense-0.805
 automatic-0.794
hemy-0.792
Token Continue
Token ID10054
Feature activation-3.29e-01

Pos logit contributions
+7.995
Son+6.607
6000+6.468
+6.454
+6.346
Neg logit contributions
@#&-7.627
 deciding-7.576
igating-6.991
lege-6.773
 recall-6.756
Token reading
Token ID3555
Feature activation-1.29e-01

Pos logit contributions
 deciding+0.836
@#&+0.804
 offense+0.738
 automatic+0.725
 Answer+0.706
Neg logit contributions
-2.383
enegger-2.120
-2.044
DragonMagazine-2.027
Son-1.993
Token the
Token ID262
Feature activation+3.96e-01

Pos logit contributions
 deciding+0.439
@#&+0.439
 offense+0.401
 automatic+0.397
 Answer+0.392
Neg logit contributions
-0.818
enegger-0.710
-0.685
DragonMagazine-0.675
Son-0.668
Token main
Token ID1388
Feature activation-7.75e-01

Pos logit contributions
+2.139
enegger+1.806
+1.742
6000+1.715
Son+1.704
Neg logit contributions
@#&-1.718
 deciding-1.664
 offense-1.544
igating-1.537
hemy-1.533
Token story
Token ID1621
Feature activation-2.02e-01

Pos logit contributions
 deciding+1.819
@#&+1.672
 offense+1.662
 automatic+1.629
 recall+1.565
Neg logit contributions
-5.732
enegger-5.091
DragonMagazine-4.960
-4.939
Son-4.826
Token and
Token ID290
Feature activation+3.67e-01

Pos logit contributions
+4.472
enegger+3.536
+3.466
Son+3.404
6000+3.393
Neg logit contributions
@#&-5.466
 deciding-5.333
 offense-5.039
hemy-5.036
 automatic-5.004
Token explore
Token ID7301
Feature activation+2.61e-01

Pos logit contributions
+1.999
enegger+1.692
+1.630
6000+1.593
Son+1.588
Neg logit contributions
@#&-1.579
 deciding-1.556
 offense-1.450
 automatic-1.431
igating-1.424
Token the
Token ID262
Feature activation+9.79e-01

Pos logit contributions
+1.609
enegger+1.388
+1.342
6000+1.311
DragonMagazine+1.310
Neg logit contributions
@#&-0.937
 deciding-0.925
 offense-0.850
 automatic-0.838
 Answer-0.831
Token world
Token ID995
Feature activation+3.87e-01

Pos logit contributions
+3.773
enegger+2.925
+2.845
6000+2.751
Son+2.718
Neg logit contributions
@#&-5.696
 deciding-5.473
 Answer-5.220
hemy-5.205
igating-5.189
Token with
Token ID351
Feature activation-4.25e-01

Pos logit contributions
+2.117
enegger+1.796
+1.737
Son+1.700
6000+1.693
Neg logit contributions
@#&-1.650
 deciding-1.611
 offense-1.497
 automatic-1.483
hemy-1.479
Token kindred
Token ID46449
Feature activation+2.70e-01

Pos logit contributions
@#&+2.437
 deciding+2.418
 offense+2.334
 automatic+2.292
 recall+2.291
Neg logit contributions
-1.711
enegger-1.349
-1.246
6000-1.221
DragonMagazine-1.218
Token spirits
Token ID13747
Feature activation+1.92e-01

Pos logit contributions
+1.913
enegger+1.687
+1.640
6000+1.608
Son+1.606
Neg logit contributions
@#&-0.717
 deciding-0.703
 offense-0.624
 automatic-0.609
igating-0.602
Token.
Token ID13
Feature activation+2.27e-01

Pos logit contributions
+0.993
enegger+0.835
+0.799
6000+0.775
DragonMagazine+0.774
Neg logit contributions
@#&-0.881
 deciding-0.868
 offense-0.813
 Answer-0.804
 automatic-0.803
Token
Token ID198
Feature activation-6.70e-03

Pos logit contributions
+1.377
enegger+1.189
+1.146
6000+1.121
Son+1.118
Neg logit contributions
@#&-0.837
 deciding-0.822
 offense-0.757
 Answer-0.746
 automatic-0.745
Token
Token ID198
Feature activation+7.57e-02

Pos logit contributions
@#&+0.023
 deciding+0.023
 offense+0.021
 Answer+0.021
 automatic+0.021
Neg logit contributions
-0.042
enegger-0.037
-0.035
DragonMagazine-0.034
6000-0.034
TokenSign
Token ID11712
Feature activation+1.68e+00

Pos logit contributions
+0.394
enegger+0.337
+0.316
DragonMagazine+0.312
6000+0.310
Neg logit contributions
@#&-0.336
 deciding-0.334
 offense-0.313
igating-0.313
 automatic-0.312
Tokenirect
Token ID1060
Feature activation-1.77e-01

Pos logit contributions
+0.092
enegger+0.075
+0.072
6000+0.070
Son+0.069
Neg logit contributions
@#&-0.108
 deciding-0.106
 offense-0.100
 Answer-0.099
hemy-0.099
Tokened
Token ID276
Feature activation-6.27e-01

Pos logit contributions
@#&+0.776
 deciding+0.756
 offense+0.706
 automatic+0.695
hemy+0.692
Neg logit contributions
-0.949
enegger-0.791
-0.771
6000-0.757
Son-0.754
Token edges
Token ID13015
Feature activation-8.80e-02

Pos logit contributions
@#&+2.797
 deciding+2.767
 offense+2.578
 automatic+2.554
 Answer+2.540
Neg logit contributions
-3.299
enegger-2.792
-2.671
6000-2.597
DragonMagazine-2.589
Token *)
Token ID31936
Feature activation-3.31e-01

Pos logit contributions
@#&+0.371
 deciding+0.362
 offense+0.338
 automatic+0.333
hemy+0.333
Neg logit contributions
-0.487
enegger-0.411
-0.397
6000-0.390
Son-0.389
Token graph
Token ID4823
Feature activation-4.53e-01

Pos logit contributions
@#&+1.378
 deciding+1.348
 offense+1.254
hemy+1.237
igating+1.235
Neg logit contributions
-1.842
enegger-1.560
-1.512
6000-1.481
Son-1.480
Token =
Token ID796
Feature activation+2.19e-01

Pos logit contributions
@#&+2.256
 deciding+2.221
 offense+2.091
 automatic+2.067
 Answer+2.058
Neg logit contributions
-2.158
enegger-1.775
-1.703
6000-1.655
Son-1.652
Token Fl
Token ID1610
Feature activation-7.33e-01

Pos logit contributions
+1.151
enegger+0.956
+0.942
Son+0.926
6000+0.914
Neg logit contributions
@#&-0.965
 deciding-0.930
hemy-0.889
 offense-0.879
igating-0.871
Tokenatten
Token ID41769
Feature activation-8.53e-01

Pos logit contributions
@#&+4.637
 deciding+4.579
 offense+4.365
 automatic+4.332
igating+4.320
Neg logit contributions
-2.509
enegger-1.898
-1.764
6000-1.685
Son-1.674
Token[
Token ID58
Feature activation-5.20e-01

Pos logit contributions
@#&+4.425
 deciding+4.368
 offense+4.128
 automatic+4.083
igating+4.070
Neg logit contributions
-3.844
enegger-3.155
-2.988
6000-2.906
DragonMagazine-2.886
Token Table
Token ID8655
Feature activation-7.60e-01

Pos logit contributions
@#&+2.373
 deciding+2.333
 offense+2.183
 automatic+2.156
 Answer+2.146
Neg logit contributions
-2.702
enegger-2.262
-2.178
6000-2.123
Son-2.119
Token[
Token ID58
Feature activation-6.54e-01

Pos logit contributions
@#&+4.099
 deciding+4.041
 offense+3.822
 automatic+3.782
 Answer+3.768
Neg logit contributions
-3.318
enegger-2.676
-2.553
6000-2.472
Son-2.467
Token and
Token ID290
Feature activation+5.84e-01

Pos logit contributions
+4.134
enegger+3.229
+3.220
6000+3.120
Son+3.110
Neg logit contributions
@#&-4.611
 deciding-4.264
hemy-4.172
igating-4.064
 Answer-4.004
Token sound
Token ID2128
Feature activation-2.47e-01

Pos logit contributions
+3.344
enegger+2.816
+2.777
Son+2.700
6000+2.695
Neg logit contributions
@#&-2.339
 deciding-2.171
hemy-2.061
igating-2.029
 Answer-2.010
Token "
Token ID366
Feature activation+5.64e-01

Pos logit contributions
@#&+1.165
 deciding+1.126
 offense+1.046
hemy+1.044
 Answer+1.044
Neg logit contributions
-1.265
enegger-1.046
-1.015
6000-0.987
Son-0.984
Tokentoo
Token ID18820
Feature activation+4.03e-01

Pos logit contributions
+3.073
enegger+2.538
+2.511
Son+2.497
6000+2.476
Neg logit contributions
@#&-2.408
 deciding-2.335
 offense-2.182
hemy-2.146
 Answer-2.135
Token good
Token ID922
Feature activation+4.82e-01

Pos logit contributions
+2.338
enegger+1.979
+1.932
6000+1.896
Son+1.892
Neg logit contributions
@#&-1.587
 deciding-1.517
hemy-1.407
igating-1.403
 offense-1.383
Token to
Token ID284
Feature activation+7.27e-01

Pos logit contributions
+2.124
enegger+1.681
+1.632
Son+1.588
6000+1.588
Neg logit contributions
@#&-2.574
 deciding-2.461
hemy-2.363
igating-2.327
 Answer-2.316
Token be
Token ID307
Feature activation+1.10e+00

Pos logit contributions
+3.477
enegger+2.893
6000+2.825
+2.808
Son+2.775
Neg logit contributions
@#&-3.474
 deciding-3.338
igating-3.138
hemy-3.104
 Answer-3.064
Token true
Token ID2081
Feature activation+4.19e-01

Pos logit contributions
+4.908
enegger+4.096
6000+4.044
Son+3.980
+3.945
Neg logit contributions
@#&-5.340
 deciding-4.969
igating-4.796
 Answer-4.776
 Explain-4.707
Token."
Token ID526
Feature activation+1.35e-01

Pos logit contributions
+1.753
enegger+1.369
+1.322
Son+1.280
6000+1.271
Neg logit contributions
@#&-2.338
 deciding-2.241
hemy-2.151
igating-2.116
 Answer-2.107
Token It
Token ID632
Feature activation-4.77e-01

Pos logit contributions
+0.777
enegger+0.660
+0.642
Son+0.633
6000+0.632
Neg logit contributions
@#&-0.535
 deciding-0.522
 offense-0.481
hemy-0.480
 automatic-0.475
Token's
Token ID338
Feature activation-4.03e-01

Pos logit contributions
@#&+2.129
 deciding+2.127
 offense+1.988
 automatic+1.964
 recall+1.947
Neg logit contributions
-2.510
enegger-2.098
-2.025
DragonMagazine-1.975
Son-1.971
Token like
Token ID588
Feature activation-6.47e-01

Pos logit contributions
@#&+1.533
 deciding+1.506
 offense+1.384
 automatic+1.364
 Answer+1.358
Neg logit contributions
-2.484
enegger-2.136
-2.066
6000-2.021
Son-2.018
Token to
Token ID284
Feature activation-4.63e-01

Pos logit contributions
@#&+2.854
 deciding+2.840
 offense+2.645
 automatic+2.623
 Answer+2.591
Neg logit contributions
-3.443
enegger-2.912
-2.785
6000-2.711
DragonMagazine-2.709
Token top
Token ID1353
Feature activation-3.66e-01

Pos logit contributions
@#&+1.613
 deciding+1.600
 offense+1.471
 automatic+1.452
 recall+1.444
Neg logit contributions
-2.901
enegger-2.512
-2.419
DragonMagazine-2.383
Son-2.371
Token mine
Token ID6164
Feature activation-1.12e-01

Pos logit contributions
@#&+1.243
 deciding+1.215
 offense+1.109
 automatic+1.090
 Answer+1.084
Neg logit contributions
-2.324
enegger-2.014
-1.957
6000-1.917
Son-1.915
Token with
Token ID351
Feature activation-3.24e-01

Pos logit contributions
@#&+0.475
 deciding+0.467
 offense+0.434
 automatic+0.428
 Answer+0.427
Neg logit contributions
-0.612
enegger-0.518
-0.501
6000-0.488
Son-0.488
Token frozen
Token ID12912
Feature activation+9.54e-01

Pos logit contributions
@#&+1.335
 deciding+1.309
 offense+1.215
 automatic+1.197
 Answer+1.193
Neg logit contributions
-1.823
enegger-1.548
-1.498
6000-1.464
Son-1.462
Token cher
Token ID21382
Feature activation-2.76e-01

Pos logit contributions
+4.683
+3.779
enegger+3.759
6000+3.722
Son+3.640
Neg logit contributions
@#&-4.395
 deciding-4.123
hemy-3.994
lege-3.931
igating-3.927
Tokenries
Token ID1678
Feature activation+2.50e-01

Pos logit contributions
@#&+1.565
 deciding+1.553
 offense+1.469
igating+1.455
 automatic+1.454
Neg logit contributions
-1.132
enegger-0.898
-0.851
DragonMagazine-0.814
Son-0.812
Token,
Token ID11
Feature activation-1.65e-01

Pos logit contributions
+1.256
enegger+1.037
+1.013
6000+0.984
Son+0.980
Neg logit contributions
@#&-1.172
 deciding-1.130
 offense-1.066
igating-1.062
hemy-1.058
Token which
Token ID543
Feature activation-1.65e-01

Pos logit contributions
@#&+0.704
 deciding+0.686
 offense+0.640
igating+0.631
 Answer+0.630
Neg logit contributions
-0.903
enegger-0.762
-0.738
6000-0.722
Son-0.720
Token are
Token ID389
Feature activation-2.62e-01

Pos logit contributions
@#&+0.649
 deciding+0.636
 offense+0.587
igating+0.578
 automatic+0.577
Neg logit contributions
-0.962
enegger-0.821
-0.797
Son-0.779
6000-0.779
Token wrong
Token ID2642
Feature activation+2.33e-01

Pos logit contributions
+3.303
enegger+2.807
+2.719
6000+2.677
Son+2.672
Neg logit contributions
@#&-2.619
 deciding-2.473
hemy-2.340
igating-2.326
 Answer-2.308
Token in
Token ID287
Feature activation+9.32e-03

Pos logit contributions
+1.214
enegger+1.012
+0.988
6000+0.961
Son+0.958
Neg logit contributions
@#&-1.055
 deciding-0.986
hemy-0.942
 Answer-0.941
 offense-0.940
Token principle
Token ID7989
Feature activation+1.87e-01

Pos logit contributions
+0.057
enegger+0.049
+0.047
6000+0.047
Son+0.046
Neg logit contributions
@#&-0.034
 deciding-0.033
 offense-0.030
hemy-0.030
igating-0.030
Token with
Token ID351
Feature activation-1.98e-01

Pos logit contributions
+0.966
enegger+0.804
+0.777
6000+0.761
Son+0.757
Neg logit contributions
@#&-0.865
 deciding-0.822
 offense-0.777
hemy-0.775
 Answer-0.773
Token this
Token ID428
Feature activation-1.56e-01

Pos logit contributions
@#&+0.704
 deciding+0.680
 offense+0.626
 Answer+0.614
hemy+0.614
Neg logit contributions
-1.232
enegger-1.063
-1.036
6000-1.014
Son-1.014
Token item
Token ID2378
Feature activation+8.42e-02

Pos logit contributions
@#&+0.679
 deciding+0.668
 offense+0.623
 automatic+0.615
 Answer+0.612
Neg logit contributions
-0.840
enegger-0.708
-0.683
6000-0.666
Son-0.665
Token,
Token ID11
Feature activation-7.15e-01

Pos logit contributions
+0.440
enegger+0.366
+0.354
6000+0.346
Son+0.345
Neg logit contributions
@#&-0.384
 deciding-0.370
 offense-0.348
hemy-0.345
 Answer-0.345
Token but
Token ID475
Feature activation-8.30e-01

Pos logit contributions
 deciding+2.794
@#&+2.769
 offense+2.581
 automatic+2.550
 recall+2.513
Neg logit contributions
-4.159
enegger-3.574
-3.436
DragonMagazine-3.350
6000-3.338
Token they
Token ID484
Feature activation-5.44e-01

Pos logit contributions
 deciding+3.036
@#&+2.885
 offense+2.753
 automatic+2.734
 recall+2.712
Neg logit contributions
-5.095
enegger-4.400
-4.234
DragonMagazine-4.177
-4.131
Token cost
Token ID1575
Feature activation+4.31e-01

Pos logit contributions
@#&+2.001
 deciding+1.997
 offense+1.841
 automatic+1.813
 recall+1.796
Neg logit contributions
-3.293
enegger-2.828
-2.734
DragonMagazine-2.676
Son-2.675
Token $
Token ID720
Feature activation+4.75e-01

Pos logit contributions
+2.333
enegger+1.967
+1.909
6000+1.889
Son+1.862
Neg logit contributions
@#&-1.862
 deciding-1.783
hemy-1.664
igating-1.649
 offense-1.643
Token had
Token ID550
Feature activation+2.55e-01

Pos logit contributions
+0.614
enegger+0.528
+0.511
6000+0.502
Son+0.500
Neg logit contributions
@#&-0.402
 deciding-0.389
 offense-0.357
hemy-0.354
igating-0.354
Token ingested
Token ID44694
Feature activation+7.74e-02

Pos logit contributions
+1.487
enegger+1.266
+1.227
6000+1.209
Son+1.205
Neg logit contributions
@#&-1.000
 deciding-0.961
 offense-0.883
igating-0.882
hemy-0.877
Token them
Token ID606
Feature activation+6.60e-01

Pos logit contributions
+0.451
enegger+0.384
+0.372
6000+0.366
Son+0.365
Neg logit contributions
@#&-0.304
 deciding-0.295
 offense-0.271
hemy-0.268
igating-0.268
Token directly
Token ID3264
Feature activation+1.14e+00

Pos logit contributions
+3.536
enegger+2.998
+2.913
Son+2.873
6000+2.865
Neg logit contributions
@#&-2.779
 deciding-2.666
igating-2.483
hemy-2.482
 offense-2.472
Token.
Token ID13
Feature activation+6.72e-02

Pos logit contributions
+5.053
enegger+4.145
+3.931
6000+3.906
Son+3.897
Neg logit contributions
@#&-5.676
 deciding-5.326
hemy-5.130
igating-5.056
 Explain-5.049
Token This
Token ID770
Feature activation-9.80e-02

Pos logit contributions
+0.386
enegger+0.329
+0.319
Son+0.313
6000+0.312
Neg logit contributions
@#&-0.269
 deciding-0.264
 offense-0.244
 automatic-0.241
igating-0.240
Token fact
Token ID1109
Feature activation-2.12e-01

Pos logit contributions
@#&+0.431
 deciding+0.420
 offense+0.390
igating+0.388
hemy+0.386
Neg logit contributions
-0.523
enegger-0.440
-0.426
6000-0.416
Son-0.415
Token in
Token ID287
Feature activation-6.36e-01

Pos logit contributions
@#&+1.010
 deciding+0.986
 offense+0.927
 Answer+0.915
 automatic+0.914
Neg logit contributions
-1.061
enegger-0.882
-0.848
6000-0.826
Son-0.823
Token and
Token ID290
Feature activation+4.61e-01

Pos logit contributions
@#&+2.558
 deciding+2.524
 offense+2.335
 automatic+2.303
 Answer+2.289
Neg logit contributions
-3.633
enegger-3.100
-2.996
6000-2.926
Son-2.922
Token of
Token ID286
Feature activation-5.41e-01

Pos logit contributions
+2.456
enegger+2.071
+2.011
DragonMagazine+1.963
+1.940
Neg logit contributions
@#&-1.971
 deciding-1.873
hemy-1.745
igating-1.736
 Answer-1.723
Token itself
Token ID2346
Feature activation-2.50e-02

Pos logit contributions
 deciding+1.950
@#&+1.893
 offense+1.812
 automatic+1.771
 recall+1.731
Neg logit contributions
-3.326
enegger-2.893
-2.779
DragonMagazine-2.736
Son-2.712
Token a
Token ID257
Feature activation-2.83e-01

Pos logit contributions
 deciding+1.865
@#&+1.846
 offense+1.697
 automatic+1.668
 Answer+1.649
Neg logit contributions
-3.309
enegger-2.858
-2.768
DragonMagazine-2.711
Son-2.696
Token big
Token ID1263
Feature activation-8.01e-01

Pos logit contributions
@#&+1.138
 deciding+1.119
 offense+1.037
 automatic+1.022
 Answer+1.015
Neg logit contributions
-1.621
enegger-1.382
-1.335
6000-1.303
Son-1.302
Token fan
Token ID4336
Feature activation-4.16e-01

Pos logit contributions
 deciding+2.859
@#&+2.836
 offense+2.656
 automatic+2.601
 Answer+2.566
Neg logit contributions
-4.916
enegger-4.219
-4.073
DragonMagazine-4.009
6000-3.997
Token of
Token ID286
Feature activation-2.92e-01

Pos logit contributions
@#&+1.598
 deciding+1.561
 offense+1.445
 automatic+1.428
igating+1.417
Neg logit contributions
-2.458
enegger-2.096
-2.039
6000-1.993
Son-1.990
Token Facebook
Token ID3203
Feature activation+1.82e-01

Pos logit contributions
@#&+1.178
 deciding+1.157
 offense+1.072
 automatic+1.058
 Answer+1.051
Neg logit contributions
-1.670
enegger-1.424
-1.375
6000-1.345
Son-1.342
Token,
Token ID11
Feature activation-1.93e-01

Pos logit contributions
+0.955
enegger+0.801
+0.776
Son+0.756
6000+0.752
Neg logit contributions
@#&-0.817
 deciding-0.781
 offense-0.739
igating-0.733
hemy-0.728
Token I
Token ID314
Feature activation-9.45e-01

Pos logit contributions
@#&+0.799
 deciding+0.778
 offense+0.725
 automatic+0.715
igating+0.715
Neg logit contributions
-1.087
enegger-0.921
-0.893
6000-0.872
Son-0.872
Token don
Token ID836
Feature activation+1.22e-01

Pos logit contributions
@#&+3.575
 deciding+3.518
 offense+3.236
 recall+3.201
 automatic+3.187
Neg logit contributions
-5.595
enegger-4.812
-4.637
DragonMagazine-4.543
Son-4.538
Token
Token ID447
Feature activation-8.40e-01

Pos logit contributions
+0.653
enegger+0.548
+0.530
6000+0.517
Son+0.516
Neg logit contributions
@#&-0.541
 deciding-0.532
 offense-0.497
 automatic-0.490
igating-0.488
Token
Token ID247
Feature activation-7.62e-01

Pos logit contributions
@#&+3.526
 deciding+3.408
 offense+3.220
 automatic+3.202
 recall+3.181
Neg logit contributions
-4.636
enegger-4.050
DragonMagazine-3.752
6000-3.723
-3.702
Tokent
Token ID83
Feature activation-2.72e-01

Pos logit contributions
@#&+3.154
 deciding+3.119
 offense+2.887
 automatic+2.846
 recall+2.843
Neg logit contributions
-4.257
enegger-3.619
-3.481
DragonMagazine-3.411
6000-3.387
Token more
Token ID517
Feature activation-3.97e-01

Pos logit contributions
@#&+0.366
 deciding+0.354
 offense+0.327
 Answer+0.323
igating+0.323
Neg logit contributions
-0.540
enegger-0.461
-0.447
6000-0.438
Son-0.438
Token bass
Token ID12702
Feature activation+2.14e-01

Pos logit contributions
@#&+1.680
 deciding+1.647
 offense+1.531
 automatic+1.510
 Answer+1.506
Neg logit contributions
-2.193
enegger-1.857
-1.794
6000-1.752
Son-1.750
Token slam
Token ID21158
Feature activation-3.93e-01

Pos logit contributions
+1.082
enegger+0.892
+0.864
Son+0.850
6000+0.839
Neg logit contributions
@#&-1.008
 deciding-0.980
hemy-0.916
 offense-0.911
igating-0.909
Token and
Token ID290
Feature activation-1.57e-01

Pos logit contributions
@#&+1.862
 deciding+1.825
 offense+1.707
 automatic+1.688
 Answer+1.687
Neg logit contributions
-1.980
enegger-1.649
-1.582
6000-1.544
Son-1.542
Token better
Token ID1365
Feature activation-4.25e-01

Pos logit contributions
@#&+0.638
 deciding+0.619
 offense+0.572
igating+0.569
hemy+0.567
Neg logit contributions
-0.893
enegger-0.758
-0.734
Son-0.720
6000-0.718
Token imaging
Token ID19560
Feature activation-2.92e-01

Pos logit contributions
@#&+1.799
 deciding+1.772
 offense+1.652
 automatic+1.631
 Answer+1.619
Neg logit contributions
-2.347
enegger-1.993
-1.921
6000-1.876
Son-1.867
Token.
Token ID13
Feature activation+1.06e-01

Pos logit contributions
@#&+1.323
 deciding+1.291
 offense+1.205
 automatic+1.190
 Answer+1.190
Neg logit contributions
-1.535
enegger-1.284
-1.238
Son-1.211
6000-1.210
Token But
Token ID887
Feature activation-2.71e-01

Pos logit contributions
+0.615
enegger+0.521
+0.509
Son+0.503
6000+0.501
Neg logit contributions
@#&-0.415
 deciding-0.408
 offense-0.379
hemy-0.375
 automatic-0.373
Token in
Token ID287
Feature activation-6.43e-01

Pos logit contributions
@#&+1.044
 deciding+1.022
 offense+0.944
 automatic+0.930
 Answer+0.925
Neg logit contributions
-1.605
enegger-1.375
-1.331
6000-1.303
Son-1.301
Token terms
Token ID2846
Feature activation+2.06e-01

Pos logit contributions
@#&+2.226
 deciding+2.225
 offense+2.023
 automatic+1.988
 Answer+1.967
Neg logit contributions
-4.020
enegger-3.485
-3.373
6000-3.303
DragonMagazine-3.301
Token of
Token ID286
Feature activation-2.65e-02

Pos logit contributions
+1.241
enegger+1.067
+1.034
6000+1.013
Son+1.011
Neg logit contributions
@#&-0.765
 deciding-0.749
 offense-0.690
 automatic-0.679
 Answer-0.675
Token,
Token ID11
Feature activation-3.52e-01

Pos logit contributions
@#&+1.406
 deciding+1.382
 offense+1.289
 automatic+1.272
 Answer+1.266
Neg logit contributions
-1.744
enegger-1.471
-1.419
6000-1.384
Son-1.382
Token located
Token ID5140
Feature activation-3.58e-01

Pos logit contributions
@#&+1.329
 deciding+1.305
 offense+1.203
 automatic+1.185
 Answer+1.178
Neg logit contributions
-2.104
enegger-1.806
-1.750
6000-1.711
Son-1.709
Token little
Token ID1310
Feature activation+4.50e-01

Pos logit contributions
@#&+1.482
 deciding+1.454
 offense+1.350
 automatic+1.331
 Answer+1.326
Neg logit contributions
-2.009
enegger-1.706
-1.649
6000-1.612
Son-1.608
Token more
Token ID517
Feature activation+2.94e-01

Pos logit contributions
+2.359
enegger+1.946
+1.894
Son+1.859
+1.855
Neg logit contributions
@#&-1.985
 deciding-1.914
hemy-1.852
igating-1.785
 Answer-1.783
Token than
Token ID621
Feature activation-1.53e-01

Pos logit contributions
+1.603
enegger+1.354
+1.307
6000+1.276
Son+1.274
Neg logit contributions
@#&-1.267
 deciding-1.243
 offense-1.158
 automatic-1.143
 Answer-1.138
Token 2
Token ID362
Feature activation-1.98e-01

Pos logit contributions
@#&+0.537
 deciding+0.522
 offense+0.477
 automatic+0.468
 Answer+0.467
Neg logit contributions
-0.952
enegger-0.823
-0.799
6000-0.784
Son-0.782
Token miles
Token ID4608
Feature activation+5.66e-01

Pos logit contributions
@#&+0.861
 deciding+0.838
 offense+0.777
hemy+0.771
 automatic+0.768
Neg logit contributions
-1.085
enegger-0.903
-0.885
Son-0.863
6000-0.860
Token away
Token ID1497
Feature activation+1.86e-01

Pos logit contributions
+2.738
enegger+2.227
+2.178
6000+2.147
Son+2.134
Neg logit contributions
@#&-2.737
 deciding-2.594
hemy-2.510
igating-2.436
lege-2.421
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
+0.903
enegger+0.741
+0.718
Son+0.701
6000+0.700
Neg logit contributions
@#&-0.916
 deciding-0.872
hemy-0.824
 offense-0.818
 Answer-0.817
Token and
Token ID290
Feature activation-1.65e-01

Pos logit contributions
@#&+0.455
 deciding+0.439
 offense+0.408
hemy+0.403
igating+0.403
Neg logit contributions
-0.653
enegger-0.558
-0.539
6000-0.529
Son-0.527
Token called
Token ID1444
Feature activation-8.20e-03

Pos logit contributions
@#&+0.663
 deciding+0.648
 offense+0.601
 automatic+0.592
igating+0.590
Neg logit contributions
-0.943
enegger-0.805
-0.777
6000-0.761
Son-0.759
Token never
Token ID1239
Feature activation+1.20e-01

Pos logit contributions
@#&+1.764
 deciding+1.734
 offense+1.589
 automatic+1.564
 Answer+1.553
Neg logit contributions
-3.105
enegger-2.685
-2.602
6000-2.546
Son-2.544
Token know
Token ID760
Feature activation+6.61e-01

Pos logit contributions
+0.810
enegger+0.711
+0.689
6000+0.683
Son+0.679
Neg logit contributions
@#&-0.362
 deciding-0.345
 offense-0.311
igating-0.306
hemy-0.306
Token whether
Token ID1771
Feature activation+2.24e-01

Pos logit contributions
+3.847
enegger+3.264
+3.160
Son+3.154
+3.126
Neg logit contributions
@#&-2.531
 deciding-2.341
igating-2.229
hemy-2.220
 offense-2.181
Token people
Token ID661
Feature activation-5.35e-01

Pos logit contributions
+1.376
enegger+1.184
+1.151
Son+1.138
6000+1.133
Neg logit contributions
@#&-0.813
 deciding-0.752
igating-0.697
hemy-0.693
 Answer-0.683
Token are
Token ID389
Feature activation-2.05e-02

Pos logit contributions
 deciding+1.983
@#&+1.976
 offense+1.826
 automatic+1.788
 recall+1.778
Neg logit contributions
-3.227
enegger-2.777
-2.679
DragonMagazine-2.628
6000-2.623
Token getting
Token ID1972
Feature activation-7.74e-02

Pos logit contributions
@#&+0.080
 deciding+0.077
 offense+0.071
igating+0.070
hemy+0.070
Neg logit contributions
-0.120
enegger-0.103
-0.100
6000-0.098
Son-0.098
Token more
Token ID517
Feature activation-4.40e-01

Pos logit contributions
@#&+0.303
 deciding+0.292
 offense+0.267
hemy+0.267
igating+0.266
Neg logit contributions
-0.453
enegger-0.388
-0.375
6000-0.369
Son-0.368
Token reasonable
Token ID6397
Feature activation+4.97e-01

Pos logit contributions
@#&+1.776
 deciding+1.764
 offense+1.645
 automatic+1.622
 Answer+1.600
Neg logit contributions
-2.503
enegger-2.132
-2.060
6000-2.011
Son-2.007
Token over
Token ID625
Feature activation+5.46e-01

Pos logit contributions
+2.642
enegger+2.217
+2.140
6000+2.081
Son+2.076
Neg logit contributions
@#&-2.204
 deciding-2.086
hemy-1.994
igating-1.952
lege-1.933
Token time
Token ID640
Feature activation-2.74e-01

Pos logit contributions
+3.218
enegger+2.789
+2.707
6000+2.646
Son+2.640
Neg logit contributions
@#&-2.084
 deciding-1.912
hemy-1.813
igating-1.778
 offense-1.776
Token,
Token ID11
Feature activation+2.17e-01

Pos logit contributions
@#&+1.237
 deciding+1.216
 offense+1.137
 automatic+1.123
 Answer+1.118
Neg logit contributions
-1.434
enegger-1.202
-1.158
6000-1.129
Son-1.127
Token trigger
Token ID7616
Feature activation-3.91e-01

Pos logit contributions
@#&+1.378
 deciding+1.369
 offense+1.253
 automatic+1.234
 Answer+1.211
Neg logit contributions
-2.503
enegger-2.171
-2.098
6000-2.055
Son-2.052
Token the
Token ID262
Feature activation-1.75e-01

Pos logit contributions
@#&+1.438
 deciding+1.415
 offense+1.303
 automatic+1.284
 Answer+1.271
Neg logit contributions
-2.371
enegger-2.041
-1.977
6000-1.934
Son-1.932
Token flashbacks
Token ID49555
Feature activation-4.69e-01

Pos logit contributions
@#&+0.610
 deciding+0.591
 offense+0.540
igating+0.531
hemy+0.531
Neg logit contributions
-1.102
enegger-0.955
-0.926
6000-0.908
Son-0.907
Token,
Token ID11
Feature activation-2.55e-01

Pos logit contributions
@#&+2.164
 deciding+2.147
 offense+2.012
 automatic+1.989
 Answer+1.969
Neg logit contributions
-2.400
enegger-2.011
-1.933
6000-1.878
Son-1.876
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
@#&+1.004
 deciding+0.981
 offense+0.909
 automatic+0.894
 Answer+0.892
Neg logit contributions
-1.484
enegger-1.269
-1.227
6000-1.201
Son-1.199
Token memories
Token ID9846
Feature activation-1.45e+00

Pos logit contributions
 deciding+3.727
@#&+3.617
 automatic+3.424
 offense+3.403
 recall+3.357
Neg logit contributions
-6.790
enegger-5.871
-5.681
DragonMagazine-5.645
Lago-5.564
Token,
Token ID11
Feature activation-2.46e-01

Pos logit contributions
 deciding+6.120
@#&+5.872
 recall+5.670
 offense+5.645
 automatic+5.644
Neg logit contributions
-7.787
enegger-6.745
-6.419
DragonMagazine-6.404
Son-6.256
Token the
Token ID262
Feature activation-8.15e-01

Pos logit contributions
@#&+0.974
 deciding+0.944
 offense+0.876
 Answer+0.863
hemy+0.861
Neg logit contributions
-1.432
enegger-1.225
-1.185
6000-1.161
Son-1.158
Token nightmares
Token ID33301
Feature activation-6.38e-01

Pos logit contributions
 deciding+2.763
@#&+2.723
 offense+2.522
 automatic+2.510
 recall+2.458
Neg logit contributions
-5.159
enegger-4.469
-4.327
DragonMagazine-4.264
6000-4.237
Token
Token ID784
Feature activation-5.53e-01

Pos logit contributions
@#&+2.986
 deciding+2.965
 offense+2.777
 automatic+2.749
 Answer+2.720
Neg logit contributions
-3.214
enegger-2.688
-2.577
6000-2.507
Son-2.504
Token it
Token ID340
Feature activation-7.88e-01

Pos logit contributions
@#&+2.054
 deciding+2.039
 offense+1.872
 automatic+1.854
 recall+1.829
Neg logit contributions
-3.324
enegger-2.856
-2.770
DragonMagazine-2.702
Son-2.700
Token Isaac
Token ID19068
Feature activation+6.33e-01

Pos logit contributions
+3.154
enegger+2.674
+2.592
Son+2.564
6000+2.532
Neg logit contributions
@#&-2.601
 deciding-2.481
hemy-2.350
igating-2.318
 offense-2.310
Token Newton
Token ID17321
Feature activation-1.40e-01

Pos logit contributions
+3.300
enegger+2.735
+2.639
Son+2.597
6000+2.541
Neg logit contributions
@#&-2.840
 deciding-2.737
hemy-2.548
 offense-2.548
 automatic-2.530
Token H
Token ID367
Feature activation+7.59e-02

Pos logit contributions
@#&+0.665
 deciding+0.648
 offense+0.609
 automatic+0.601
igating+0.601
Neg logit contributions
-0.706
enegger-0.588
-0.566
Son-0.549
6000-0.547
Tokenagan
Token ID7329
Feature activation+7.84e-01

Pos logit contributions
+0.366
enegger+0.302
+0.292
6000+0.284
Son+0.284
Neg logit contributions
@#&-0.374
 deciding-0.367
 offense-0.344
 automatic-0.340
 Answer-0.337
Token,
Token ID11
Feature activation-1.54e-01

Pos logit contributions
+3.964
enegger+3.301
+3.217
Son+3.107
+3.084
Neg logit contributions
@#&-3.594
 deciding-3.440
 automatic-3.208
hemy-3.202
igating-3.197
Token friends
Token ID2460
Feature activation-1.60e+00

Pos logit contributions
@#&+0.612
 deciding+0.598
 offense+0.554
 automatic+0.546
 Answer+0.544
Neg logit contributions
-0.893
enegger-0.763
-0.738
6000-0.722
Son-0.721
Token of
Token ID286
Feature activation+2.27e-01

Pos logit contributions
 deciding+6.006
@#&+5.945
 recall+5.511
 offense+5.465
 automatic+5.361
Neg logit contributions
-9.299
enegger-8.025
DragonMagazine-7.618
-7.615
6000-7.463
Token the
Token ID262
Feature activation+4.26e-01

Pos logit contributions
+1.229
enegger+1.046
+1.018
Son+0.985
6000+0.985
Neg logit contributions
@#&-0.972
 deciding-0.931
hemy-0.874
 offense-0.868
igating-0.866
Token Kaufman
Token ID46991
Feature activation+2.06e-01

Pos logit contributions
+2.344
enegger+2.001
+1.946
Son+1.895
6000+1.890
Neg logit contributions
@#&-1.782
 deciding-1.706
hemy-1.606
igating-1.588
 offense-1.580
Tokenns
Token ID5907
Feature activation+1.57e-01

Pos logit contributions
+0.958
enegger+0.797
+0.757
Son+0.743
6000+0.727
Neg logit contributions
@#&-1.041
 deciding-1.019
hemy-0.957
igating-0.955
 offense-0.953
Token (
Token ID357
Feature activation-5.23e-01

Pos logit contributions
+0.812
enegger+0.694
+0.660
Son+0.645
6000+0.642
Neg logit contributions
@#&-0.716
 deciding-0.693
igating-0.649
 offense-0.649
hemy-0.649
Token mission
Token ID4365
Feature activation+9.20e-01

Pos logit contributions
+0.472
enegger+0.398
+0.387
6000+0.379
Son+0.378
Neg logit contributions
@#&-0.376
 deciding-0.364
 offense-0.337
hemy-0.337
 Answer-0.335
Token,
Token ID11
Feature activation+3.72e-01

Pos logit contributions
+4.397
enegger+3.569
+3.553
6000+3.478
Son+3.441
Neg logit contributions
@#&-4.449
 deciding-4.105
hemy-4.093
 Answer-3.989
 offense-3.865
Token including
Token ID1390
Feature activation-1.12e-01

Pos logit contributions
+2.107
enegger+1.775
+1.739
6000+1.714
Son+1.702
Neg logit contributions
@#&-1.517
 deciding-1.417
hemy-1.360
 offense-1.332
 Answer-1.328
Token the
Token ID262
Feature activation-9.36e-02

Pos logit contributions
@#&+0.439
 deciding+0.424
 offense+0.394
hemy+0.389
 Answer+0.387
Neg logit contributions
-0.658
enegger-0.562
-0.546
6000-0.535
Son-0.535
Token Comet
Token ID36238
Feature activation-4.55e-01

Pos logit contributions
@#&+0.368
 deciding+0.355
 offense+0.330
hemy+0.329
igating+0.325
Neg logit contributions
-0.545
enegger-0.465
-0.453
6000-0.444
Son-0.442
Token Bor
Token ID12182
Feature activation-1.55e+00

Pos logit contributions
@#&+2.035
 deciding+2.003
 offense+1.872
 automatic+1.848
 Answer+1.840
Neg logit contributions
-2.399
enegger-2.016
-1.941
6000-1.892
Son-1.887
Tokenrell
Token ID11252
Feature activation-6.53e-01

Pos logit contributions
@#&+6.193
 deciding+6.008
lege+5.902
igating+5.850
 recall+5.766
Neg logit contributions
-8.316
enegger-7.080
-6.802
DragonMagazine-6.763
-6.570
Tokeny
Token ID88
Feature activation-2.00e-02

Pos logit contributions
@#&+2.333
 deciding+2.307
 offense+2.123
 Answer+2.101
 automatic+2.098
Neg logit contributions
-4.008
enegger-3.470
-3.322
DragonMagazine-3.285
6000-3.218
Token encounter
Token ID8791
Feature activation+5.43e-02

Pos logit contributions
@#&+0.088
 deciding+0.086
hemy+0.079
 offense+0.079
igating+0.079
Neg logit contributions
-0.107
enegger-0.090
-0.088
6000-0.086
Son-0.086
Token.[
Token ID3693
Feature activation-5.00e-01

Pos logit contributions
+0.276
enegger+0.228
+0.223
6000+0.218
Son+0.217
Neg logit contributions
@#&-0.256
 deciding-0.246
hemy-0.232
 offense-0.230
 Answer-0.230
Token17
Token ID1558
Feature activation-1.34e-01

Pos logit contributions
@#&+2.665
 deciding+2.629
 offense+2.479
 automatic+2.465
 Answer+2.463
Neg logit contributions
-2.181
enegger-1.786
-1.679
DragonMagazine-1.623
Son-1.618
Token pieces
Token ID5207
Feature activation-8.55e-01

Pos logit contributions
@#&+3.562
 deciding+3.502
 offense+3.236
 automatic+3.193
 Answer+3.179
Neg logit contributions
-5.304
enegger-4.551
-4.390
6000-4.282
Son-4.277
Token :
Token ID1058
Feature activation-5.36e-01

Pos logit contributions
@#&+3.707
 deciding+3.650
 offense+3.397
 automatic+3.353
 Answer+3.335
Neg logit contributions
-4.594
enegger-3.904
-3.755
6000-3.653
DragonMagazine-3.649
Token [
Token ID685
Feature activation-1.21e+00

Pos logit contributions
@#&+2.383
 deciding+2.334
 offense+2.181
 automatic+2.151
igating+2.148
Neg logit contributions
-2.846
enegger-2.385
-2.307
6000-2.257
Son-2.252
Token Option
Token ID16018
Feature activation-1.15e+00

Pos logit contributions
@#&+5.055
 deciding+4.955
 offense+4.591
 automatic+4.581
 Answer+4.538
Neg logit contributions
-6.650
enegger-5.710
-5.452
DragonMagazine-5.327
Lago-5.320
Token <
Token ID1279
Feature activation-1.29e+00

Pos logit contributions
@#&+4.886
 deciding+4.855
 offense+4.509
 automatic+4.463
 Answer+4.462
Neg logit contributions
-6.252
enegger-5.300
-5.118
6000-4.943
Lago-4.941
Token Piece
Token ID27053
Feature activation-1.74e+00

Pos logit contributions
@#&+5.274
 deciding+5.184
 offense+4.816
 Answer+4.766
 automatic+4.758
Neg logit contributions
-7.157
enegger-6.183
-5.900
Lago-5.762
DragonMagazine-5.740
Token >
Token ID1875
Feature activation-1.32e+00

Pos logit contributions
 deciding+6.439
@#&+6.426
 offense+5.949
 automatic+5.858
 Answer+5.787
Neg logit contributions
-9.880
enegger-8.606
-8.252
DragonMagazine-8.088
Lago-8.085
Token ;
Token ID2162
Feature activation-3.38e-01

Pos logit contributions
@#&+4.529
 deciding+4.470
 offense+4.099
 automatic+4.043
 recall+3.966
Neg logit contributions
-8.121
enegger-7.098
-6.830
DragonMagazine-6.694
Lago-6.678
Token 64
Token ID5598
Feature activation-4.98e-01

Pos logit contributions
@#&+1.343
 deciding+1.309
 offense+1.213
hemy+1.206
igating+1.201
Neg logit contributions
-1.945
enegger-1.645
-1.602
Son-1.582
6000-1.579
Token ],
Token ID16589
Feature activation-5.86e-01

Pos logit contributions
@#&+2.022
 deciding+1.975
 offense+1.836
 automatic+1.803
igating+1.802
Neg logit contributions
-2.850
enegger-2.422
-2.349
6000-2.302
Son-2.298
Token //
Token ID3373
Feature activation-4.85e-01

Pos logit contributions
@#&+2.220
 deciding+2.163
 offense+1.993
hemy+1.963
igating+1.963
Neg logit contributions
-3.495
enegger-2.989
-2.909
6000-2.851
Son-2.844
Token
Token ID198
Feature activation-3.05e-01

Pos logit contributions
@#&+2.097
 deciding+2.024
 offense+1.871
 Answer+1.849
 automatic+1.841
Neg logit contributions
-3.125
enegger-2.658
-2.585
6000-2.526
Son-2.522
Token
Token ID198
Feature activation-1.58e-01

Pos logit contributions
@#&+0.997
 deciding+0.996
 Answer+0.910
 offense+0.906
 automatic+0.902
Neg logit contributions
-1.962
enegger-1.730
-1.646
DragonMagazine-1.618
catentry-1.594
TokenM
Token ID44
Feature activation+1.12e+00

Pos logit contributions
@#&+0.841
 deciding+0.829
 offense+0.785
 automatic+0.781
 Answer+0.776
Neg logit contributions
-0.688
enegger-0.566
-0.528
6000-0.508
DragonMagazine-0.504
Tokenud
Token ID463
Feature activation-9.86e-01

Pos logit contributions
+4.806
6000+3.912
enegger+3.884
asters+3.754
+3.738
Neg logit contributions
@#&-5.781
 deciding-5.648
 Answer-5.268
 offense-5.217
 automatic-5.196
Token and
Token ID290
Feature activation-4.21e-01

Pos logit contributions
 deciding+4.231
@#&+4.197
 offense+3.957
 automatic+3.927
igating+3.866
Neg logit contributions
-5.267
enegger-4.479
-4.283
6000-4.195
Son-4.170
Token sand
Token ID6450
Feature activation-2.21e+00

Pos logit contributions
@#&+1.885
 deciding+1.839
 offense+1.715
igating+1.691
 automatic+1.690
Neg logit contributions
-2.230
enegger-1.869
-1.805
6000-1.762
Son-1.761
Token mode
Token ID4235
Feature activation-1.74e-01

Pos logit contributions
 deciding+8.795
 automatic+8.396
 offense+8.326
igating+8.010
@#&+8.010
Neg logit contributions
-11.474
enegger-9.896
DragonMagazine-9.403
-9.377
catentry-9.248
Token for
Token ID329
Feature activation+4.12e-01

Pos logit contributions
@#&+0.794
 deciding+0.762
hemy+0.712
 offense+0.711
 Answer+0.708
Neg logit contributions
-0.910
enegger-0.753
-0.738
6000-0.722
Son-0.715
Token muddy
Token ID39085
Feature activation+1.18e-01

Pos logit contributions
+2.272
enegger+1.924
+1.861
6000+1.832
Son+1.821
Neg logit contributions
@#&-1.740
 deciding-1.609
hemy-1.545
 Answer-1.520
igating-1.503
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
+0.668
enegger+0.566
+0.549
6000+0.535
Son+0.532
Neg logit contributions
@#&-0.490
 deciding-0.460
 Answer-0.431
hemy-0.430
 offense-0.427
Token sandy
Token ID44039
Feature activation-1.19e+00

Pos logit contributions
+5.140
enegger+4.304
+4.109
Son+4.072
6000+4.057
Neg logit contributions
@#&-4.672
 deciding-4.257
hemy-4.142
 Answer-4.136
igating-4.041