TOP ACTIVATIONS
MAX = 1.414

But …. " Ah ." " Ste
10 or 26 hex ade c imal digits ( 40 or
. Ah . Chad started
16 , 2017 at 2 : 29 am PST
be like , Oh , they are killing me
shelter ." Oh my …. How bad is
30 , 2016 at 9 : 01 am PDT
16 , 2017 at 9 : 28 am PST
was like , Oh my god , he thinks
- 20 01 : 44 : 24 # 19 edit ,
Mike Don ais : Oh yeah . Often , when
Charles Burns : Oh yeah , of course .
bit more top - end spark le and superb neutrality ,
data _ full v ) { v . ver tex .
: " I thought , Oh , my g osh ,
by the mess of the inco herent politics that is has
11 - 2014 at 08 : 05 UTC , updated at
that suggestion , Stella opened her scar let eyes wide .
was 18 when he first started making photos at Los Angeles
s like , Oh boy , I have not

MIN ACTIVATIONS
MIN = -1.377

The woman , whose name has not been released
Sanders Are More Similar Than You Think The biggest
Are More Similar Than You Think The biggest difference
doing drugs or enter a mental - health program , now
of the United States , whose App oint ments are not
Medline | ISI My ers , D . (
fix a hem : Another common fix that everyone may run
to the pitch and being mentally free is a massive thing
of criminal convictions . Crim in ology , 35 , 177
. name "> span > < p ng - bind ="
, weapons , furniture and more Capture unique monsters
:// www . bel fast te legraph . co . uk
is the prime challenge Belfast Te legraph . co . uk
. As it won 't find it there , it will
Some kind of bailout , whether granted with soft or tough
feedback gain factor G ( E q . 4 ), equivalent
week that Britain had " black mail ed and made a
Department . The woman , whose name has not
Kick Off will be at midnight ! ( Bring your own
lines then you 'll go off at some speed . Racing

INTERVAL 0.716 - 1.414
CONTAINS 0.181%

House Majority Whip Steve Scal ise ( R - La .),
the super - rich together constit uting the top 10 %
j okes " about Internal Revenue Service audits to the recent
Sw amin ar ayan A ks hard ham Temple
Naval ny and Py ot r Of its er ov ,

INTERVAL 0.018 - 0.716
CONTAINS 45.926%

Chuck Ye ager , http
must orient themselves according to an urban center , which in
no official party publication was made available to the members
them from the inside may be our best hope . In
to Bar st ow , to the much warmer Death Valley

INTERVAL -0.679 - 0.018
CONTAINS 53.649%

actually gets in the way , as it 's possible for
but Honour Among Thieves is a h oot from bold beginning
origin carefully though , as you may find yourself stuck with
( Anti Ali asing is a technique used to eliminate this
oe gee 's episode and I think it 's a real

INTERVAL -1.377 - -0.679
CONTAINS 0.244%

Hai y un , bestowed the name Z hen jin (
Temple . Like the first time you enter the
Mother board and Boot " green , because instances of this
marks on top of the deep routes , another player in
of the quarterback ) like Cover 3 , man free ,
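
The four interval edges listed above look consistent with splitting the range from MIN (-1.377) to MAX (1.414) into four equal-width bins, and the four CONTAINS percentages add up to 100%. This is an inference from the printed numbers, not something the page states. A minimal Python sketch of that check follows; the helper name `equal_width_edges` and the variable names are hypothetical, introduced only for illustration.

```python
# Sketch only: assumes the dashboard bins activations into equal-width
# intervals between the reported MIN and MAX. The page does not confirm this.

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins spanning [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# Values copied from the MIN / MAX lines above.
edges = equal_width_edges(-1.377, 1.414, 4)

# Edges as printed on the INTERVAL lines (three decimals).
listed_edges = [-1.377, -0.679, 0.018, 0.716, 1.414]

# The printed edges are shown to three decimals, so compare with a tolerance.
edges_match = all(abs(e - l) < 1e-3 for e, l in zip(edges, listed_edges))

# CONTAINS percentages, ordered from the lowest interval to the highest.
shares = [0.244, 53.649, 45.926, 0.181]
shares_total = sum(shares)

print("computed edges:", edges)
print("match listed edges within 0.001:", edges_match)
print("sum of shares (%):", shares_total)
```

If the equal-width assumption holds, nearly all tokens (about 99.6%) fall in the two middle intervals, and the TOP and MIN ACTIVATIONS samples come from the two thin outer bins.
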
TokenBut
Token ID1537
Feature activation+3.53e-01

Pos logit contributions
nard+1.377
+1.339
tan+1.335
INESS+1.264
serving+1.246
Neg logit contributions
nder-1.236
Barn-1.230
stead-1.217
phabet-1.163
ð-1.155
Token….
Token ID11580
Feature activation+2.28e-01

Pos logit contributions
nard+3.458
tan+3.335
+3.293
usal+3.075
serving+3.066
Neg logit contributions
Barn-3.839
nder-3.754
stead-3.722
phabet-3.615
ovember-3.602
Token
Token ID198
Feature activation+6.21e-03

Pos logit contributions
nard+2.418
tan+2.342
+2.316
INESS+2.205
usal+2.193
Neg logit contributions
Barn-2.330
nder-2.292
stead-2.255
ð-2.153
ovember-2.146
Token
Token ID198
Feature activation+1.01e-01

Pos logit contributions
nard+0.069
tan+0.066
+0.065
INESS+0.062
serving+0.062
Neg logit contributions
Barn-0.062
nder-0.061
stead-0.060
phabet-0.058
ð-0.058
Token"
Token ID1
Feature activation+5.01e-01

Pos logit contributions
nard+1.124
+1.089
tan+1.087
INESS+1.031
serving+1.016
Neg logit contributions
Barn-0.972
nder-0.967
stead-0.952
ð-0.907
phabet-0.900
TokenAh
Token ID10910
Feature activation+1.41e+00

Pos logit contributions
nard+5.122
tan+5.080
+5.076
Champ+4.761
INESS+4.719
Neg logit contributions
nder-4.925
phabet-4.863
stead-4.772
ovember-4.748
Barn-4.648
Token."
Token ID526
Feature activation+1.21e-01

Pos logit contributions
tan+11.985
nard+11.607
+11.353
hai+11.069
usal+10.791
Neg logit contributions
Barn-12.805
ovember-12.589
phabet-12.039
nder-11.796
stead-11.752
Token
Token ID198
Feature activation+7.85e-02

Pos logit contributions
nard+1.401
+1.344
tan+1.341
INESS+1.280
usal+1.279
Neg logit contributions
Barn-1.139
nder-1.118
stead-1.105
ð-1.051
phabet-1.010
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
nard+0.842
tan+0.806
+0.797
serving+0.750
INESS+0.747
Neg logit contributions
Barn-0.805
nder-0.799
stead-0.783
phabet-0.774
 myster-0.767
Token"
Token ID1
Feature activation+3.82e-01

Pos logit contributions
nard+1.278
+1.242
tan+1.238
INESS+1.167
serving+1.154
Neg logit contributions
nder-1.168
Barn-1.161
stead-1.142
phabet-1.096
ð-1.092
TokenSte
Token ID7447
Feature activation-1.52e-03

Pos logit contributions
nard+4.023
tan+3.988
+3.981
Champ+3.716
INESS+3.711
Neg logit contributions
nder-3.748
phabet-3.691
stead-3.655
Barn-3.565
ovember-3.559
Token 10
Token ID838
Feature activation-2.58e-01

Pos logit contributions
Barn+0.291
nder+0.285
stead+0.282
ð+0.269
phabet+0.259
Neg logit contributions
nard-0.314
-0.300
tan-0.299
usal-0.286
INESS-0.285
Token or
Token ID393
Feature activation-2.77e-01

Pos logit contributions
Barn+2.657
nder+2.637
stead+2.605
ð+2.484
say+2.402
Neg logit contributions
nard-2.750
-2.655
tan-2.604
INESS-2.495
usal-2.495
Token 26
Token ID2608
Feature activation+1.47e-01

Pos logit contributions
Barn+2.555
nder+2.505
stead+2.491
ð+2.363
say+2.260
Neg logit contributions
nard-3.210
-3.136
tan-3.078
INESS-2.977
usal-2.947
Token hex
Token ID17910
Feature activation+2.21e-01

Pos logit contributions
nard+1.359
tan+1.300
+1.293
usal+1.233
INESS+1.225
Neg logit contributions
Barn-1.729
nder-1.697
stead-1.682
ð-1.611
phabet-1.567
Tokenade
Token ID671
Feature activation+2.04e-01

Pos logit contributions
nard+1.949
tan+1.827
+1.821
usal+1.739
INESS+1.706
Neg logit contributions
Barn-2.724
nder-2.694
stead-2.617
ð-2.520
ovember-2.462
Tokenc
Token ID66
Feature activation+1.32e+00

Pos logit contributions
nard+2.498
tan+2.401
+2.391
usal+2.333
serving+2.280
Neg logit contributions
Barn-1.788
nder-1.733
stead-1.693
ð-1.594
phabet-1.545
Tokenimal
Token ID4402
Feature activation-2.93e-03

Pos logit contributions
imal+8.044
usal+6.729
ipal+6.413
nard+6.398
tan+5.650
Neg logit contributions
Barn-18.301
nder-17.429
ovember-16.863
stead-16.771
 myster-16.452
Token digits
Token ID19561
Feature activation+1.17e-01

Pos logit contributions
Barn+0.041
nder+0.041
stead+0.041
ð+0.039
phabet+0.038
Neg logit contributions
nard-0.020
-0.019
tan-0.019
usal-0.018
INESS-0.018
Token (
Token ID357
Feature activation-3.21e-03

Pos logit contributions
nard+1.286
tan+1.225
+1.224
usal+1.167
INESS+1.165
Neg logit contributions
Barn-1.193
nder-1.170
stead-1.153
ð-1.100
phabet-1.059
Token40
Token ID1821
Feature activation-1.76e-01

Pos logit contributions
Barn+0.034
nder+0.033
stead+0.033
ð+0.031
phabet+0.030
Neg logit contributions
nard-0.034
-0.033
tan-0.032
usal-0.031
INESS-0.031
Token or
Token ID393
Feature activation-2.45e-03

Pos logit contributions
Barn+1.738
nder+1.699
stead+1.694
ð+1.609
phabet+1.548
Neg logit contributions
nard-1.958
-1.883
tan-1.853
INESS-1.789
usal-1.777
Token.
Token ID13
Feature activation-1.36e-01

Pos logit contributions
Barn+1.021
nder+1.001
stead+0.994
ð+0.945
phabet+0.901
Neg logit contributions
nard-1.174
-1.131
tan-1.122
INESS-1.077
usal-1.076
Token
Token ID447
Feature activation+8.36e-02

Pos logit contributions
Barn+1.271
nder+1.238
stead+1.230
ð+1.172
say+1.114
Neg logit contributions
nard-1.603
-1.541
tan-1.530
usal-1.476
INESS-1.470
Token
Token ID251
Feature activation-2.09e-01

Pos logit contributions
nard+0.846
+0.817
tan+0.814
usal+0.763
INESS+0.760
Neg logit contributions
Barn-0.904
stead-0.888
nder-0.886
ð-0.837
ovember-0.817
Token
Token ID564
Feature activation-2.84e-01

Pos logit contributions
Barn+1.924
nder+1.863
stead+1.863
ð+1.782
say+1.683
Neg logit contributions
nard-2.478
-2.387
tan-2.362
usal-2.308
INESS-2.282
Token
Token ID250
Feature activation-9.08e-02

Pos logit contributions
Barn+3.367
nder+3.334
stead+3.286
ð+3.180
say+3.069
Neg logit contributions
nard-2.583
-2.434
tan-2.418
usal-2.332
INESS-2.319
TokenAh
Token ID10910
Feature activation+1.24e+00

Pos logit contributions
Barn+0.868
nder+0.853
stead+0.845
ð+0.803
phabet+0.774
Neg logit contributions
nard-1.043
-1.003
tan-1.000
usal-0.953
INESS-0.953
Token.
Token ID13
Feature activation+1.74e-01

Pos logit contributions
tan+11.036
nard+10.881
+10.629
hai+10.204
usal+10.039
Neg logit contributions
Barn-11.722
ovember-10.952
phabet-10.750
nder-10.542
stead-10.400
Token
Token ID447
Feature activation+1.88e-01

Pos logit contributions
nard+1.980
tan+1.920
+1.905
INESS+1.799
usal+1.779
Neg logit contributions
Barn-1.662
nder-1.643
stead-1.623
ð-1.518
phabet-1.516
Token
Token ID251
Feature activation-2.05e-01

Pos logit contributions
nard+1.786
+1.728
tan+1.724
usal+1.593
INESS+1.585
Neg logit contributions
Barn-2.119
stead-2.090
nder-2.070
ð-1.965
ovember-1.946
Token Chad
Token ID19800
Feature activation+2.63e-01

Pos logit contributions
Barn+1.866
nder+1.811
stead+1.806
ð+1.723
say+1.630
Neg logit contributions
nard-2.442
-2.355
tan-2.331
usal-2.274
INESS-2.247
Token started
Token ID2067
Feature activation+9.69e-01

Pos logit contributions
nard+2.667
tan+2.541
+2.523
usal+2.332
INESS+2.324
Neg logit contributions
Barn-2.865
nder-2.774
stead-2.751
phabet-2.637
ð-2.625
Token 16
Token ID1467
Feature activation+1.37e-01

Pos logit contributions
nard+3.370
tan+3.163
+3.119
serving+2.933
minster+2.909
Neg logit contributions
Barn-3.075
phabet-3.048
nder-3.001
stead-2.980
 myster-2.894
Token,
Token ID11
Feature activation+3.16e-01

Pos logit contributions
nard+1.403
tan+1.334
+1.309
serving+1.228
INESS+1.226
Neg logit contributions
Barn-1.492
nder-1.462
phabet-1.435
 myster-1.435
stead-1.432
Token 2017
Token ID2177
Feature activation-2.50e-01

Pos logit contributions
nard+3.123
tan+2.892
+2.877
INESS+2.729
minster+2.691
Neg logit contributions
Barn-3.503
nder-3.437
phabet-3.378
stead-3.364
 myster-3.278
Token at
Token ID379
Feature activation-7.33e-02

Pos logit contributions
Barn+2.514
stead+2.463
nder+2.463
ð+2.337
say+2.242
Neg logit contributions
nard-2.726
-2.636
tan-2.608
INESS-2.520
usal-2.509
Token 2
Token ID362
Feature activation+2.15e-01

Pos logit contributions
Barn+0.715
nder+0.700
stead+0.694
ð+0.660
phabet+0.643
Neg logit contributions
nard-0.833
tan-0.797
-0.796
usal-0.758
INESS-0.753
Token:
Token ID25
Feature activation+1.21e+00

Pos logit contributions
nard+2.163
tan+2.092
+2.039
serving+1.922
minster+1.920
Neg logit contributions
Barn-2.331
 myster-2.291
phabet-2.283
nder-2.258
stead-2.228
Token29
Token ID1959
Feature activation-2.13e-01

Pos logit contributions
tan+6.671
nard+6.470
+6.118
DCS+6.021
serving+5.726
Neg logit contributions
phabet-16.631
 myster-16.523
ovember-14.857
igans-14.648
nder-14.525
Tokenam
Token ID321
Feature activation-1.38e-01

Pos logit contributions
nder+1.940
Barn+1.908
stead+1.902
ð+1.802
say+1.750
Neg logit contributions
nard-2.488
-2.467
tan-2.355
INESS-2.339
usal-2.277
Token PST
Token ID28220
Feature activation+2.73e-01

Pos logit contributions
Barn+1.553
nder+1.524
stead+1.513
ð+1.448
phabet+1.400
Neg logit contributions
nard-1.369
-1.305
tan-1.299
INESS-1.232
usal-1.232
Token
Token ID198
Feature activation+5.13e-02

Pos logit contributions
nard+3.104
+2.963
tan+2.959
INESS+2.826
usal+2.819
Neg logit contributions
Barn-2.624
nder-2.576
stead-2.549
ð-2.428
phabet-2.326
Token
Token ID198
Feature activation-1.01e-02

Pos logit contributions
nard+0.554
tan+0.530
+0.522
serving+0.494
usal+0.493
Neg logit contributions
Barn-0.526
nder-0.517
stead-0.509
phabet-0.499
 myster-0.494
Token be
Token ID307
Feature activation+1.55e-01

Pos logit contributions
nard+1.511
tan+1.425
+1.415
usal+1.354
minster+1.318
Neg logit contributions
Barn-1.598
nder-1.543
stead-1.543
ð-1.448
phabet-1.442
Token like
Token ID588
Feature activation+3.23e-01

Pos logit contributions
nard+1.691
+1.637
tan+1.628
usal+1.550
serving+1.532
Neg logit contributions
Barn-1.561
nder-1.525
stead-1.506
ð-1.421
phabet-1.408
Token,
Token ID11
Feature activation+2.64e-01

Pos logit contributions
nard+3.385
+3.255
tan+3.233
usal+3.072
INESS+3.038
Neg logit contributions
Barn-3.328
nder-3.236
stead-3.203
ovember-3.005
ð-2.989
Token
Token ID564
Feature activation+1.72e-01

Pos logit contributions
nard+2.985
+2.884
tan+2.878
usal+2.742
INESS+2.730
Neg logit contributions
Barn-2.521
nder-2.464
stead-2.436
ð-2.284
ovember-2.236
Token
Token ID250
Feature activation+9.84e-02

Pos logit contributions
nard+1.445
+1.410
tan+1.396
usal+1.299
serving+1.277
Neg logit contributions
Barn-2.138
stead-2.097
nder-2.083
ð-1.983
neys-1.944
TokenOh
Token ID5812
Feature activation+1.21e+00

Pos logit contributions
nard+1.138
+1.102
tan+1.099
usal+1.045
INESS+1.040
Neg logit contributions
Barn-0.919
nder-0.903
stead-0.891
ð-0.843
phabet-0.819
Token,
Token ID11
Feature activation+8.17e-01

Pos logit contributions
+9.412
nard+9.348
tan+9.034
hai+8.518
usal+8.431
Neg logit contributions
Barn-11.917
ovember-11.591
nder-11.357
stead-11.134
neys-11.032
Token they
Token ID484
Feature activation+5.19e-01

Pos logit contributions
nard+7.274
+7.204
tan+7.051
usal+6.541
hai+6.467
Neg logit contributions
Barn-8.283
nder-8.162
stead-8.036
ovember-7.982
neys-7.688
Token are
Token ID389
Feature activation+3.92e-01

Pos logit contributions
nard+4.648
+4.445
tan+4.423
usal+4.099
serving+4.008
Neg logit contributions
Barn-5.964
stead-5.689
nder-5.632
phabet-5.479
ovember-5.410
Token killing
Token ID5170
Feature activation+4.20e-01

Pos logit contributions
nard+3.700
+3.629
tan+3.598
serving+3.354
usal+3.347
Neg logit contributions
Barn-4.346
nder-4.246
stead-4.165
phabet-4.016
ovember-4.011
Token me
Token ID502
Feature activation+2.18e-01

Pos logit contributions
+4.202
nard+4.189
tan+4.104
usal+3.877
ipal+3.790
Neg logit contributions
Barn-4.309
nder-4.253
stead-4.175
ð-3.988
ovember-3.951
Token shelter
Token ID11772
Feature activation+3.81e-02

Pos logit contributions
nard+2.028
tan+1.979
+1.957
usal+1.817
INESS+1.812
Neg logit contributions
Barn-2.584
nder-2.538
stead-2.513
ð-2.418
phabet-2.375
Token."
Token ID526
Feature activation+5.68e-02

Pos logit contributions
nard+0.409
+0.391
tan+0.390
INESS+0.371
usal+0.371
Neg logit contributions
Barn-0.394
nder-0.386
stead-0.383
ð-0.365
phabet-0.353
Token
Token ID198
Feature activation+2.48e-02

Pos logit contributions
nard+0.667
+0.641
tan+0.638
usal+0.612
INESS+0.611
Neg logit contributions
Barn-0.531
nder-0.519
stead-0.515
ð-0.488
phabet-0.466
Token
Token ID198
Feature activation+2.89e-01

Pos logit contributions
nard+0.271
tan+0.260
+0.257
serving+0.243
INESS+0.242
Neg logit contributions
Barn-0.250
nder-0.246
stead-0.243
phabet-0.236
 myster-0.232
Token
Token ID13697
Feature activation+8.08e-02

Pos logit contributions
nard+3.062
+3.030
tan+3.009
INESS+2.804
serving+2.787
Neg logit contributions
nder-2.889
Barn-2.821
stead-2.809
phabet-2.769
ovember-2.720
TokenOh
Token ID5812
Feature activation+1.19e+00

Pos logit contributions
nard+0.909
+0.884
tan+0.881
INESS+0.833
usal+0.823
Neg logit contributions
nder-0.771
Barn-0.770
stead-0.763
ð-0.722
phabet-0.720
Token my
Token ID616
Feature activation+5.18e-01

Pos logit contributions
tan+10.182
+10.015
nard+9.994
hai+9.244
serving+8.936
Neg logit contributions
ovember-11.389
phabet-11.246
Barn-11.201
nder-10.990
stead-10.882
Token….
Token ID11580
Feature activation+3.96e-01

Pos logit contributions
nard+5.308
+5.189
tan+5.153
usal+4.836
INESS+4.813
Neg logit contributions
Barn-5.282
stead-5.040
nder-5.018
ovember-4.975
phabet-4.825
Token How
Token ID1374
Feature activation+1.78e-01

Pos logit contributions
nard+4.393
tan+4.334
+4.220
serving+4.011
INESS+4.007
Neg logit contributions
Barn-3.699
nder-3.646
stead-3.576
ovember-3.459
phabet-3.405
Token bad
Token ID2089
Feature activation+4.13e-01

Pos logit contributions
nard+1.785
+1.720
tan+1.710
usal+1.594
INESS+1.588
Neg logit contributions
Barn-1.957
stead-1.886
nder-1.878
ð-1.785
phabet-1.781
Token is
Token ID318
Feature activation+1.39e-01

Pos logit contributions
nard+4.097
tan+3.935
+3.909
usal+3.674
INESS+3.672
Neg logit contributions
Barn-4.520
stead-4.416
nder-4.323
phabet-4.074
ð-4.073
Token 30
Token ID1542
Feature activation-1.06e-01

Pos logit contributions
nard+3.968
tan+3.736
+3.645
minster+3.417
serving+3.416
Neg logit contributions
Barn-3.716
phabet-3.607
stead-3.580
nder-3.554
 myster-3.489
Token,
Token ID11
Feature activation+8.75e-02

Pos logit contributions
Barn+1.062
nder+1.040
stead+1.030
ð+0.984
phabet+0.956
Neg logit contributions
nard-1.180
-1.127
tan-1.127
usal-1.069
INESS-1.068
Token 2016
Token ID1584
Feature activation-1.35e-01

Pos logit contributions
nard+1.016
+0.975
tan+0.972
usal+0.929
INESS+0.928
Neg logit contributions
Barn-0.828
nder-0.810
stead-0.802
ð-0.763
phabet-0.732
Token at
Token ID379
Feature activation-2.75e-02

Pos logit contributions
Barn+1.428
nder+1.401
stead+1.390
ð+1.328
phabet+1.278
Neg logit contributions
nard-1.412
-1.351
tan-1.345
usal-1.281
INESS-1.279
Token 9
Token ID860
Feature activation+2.08e-01

Pos logit contributions
Barn+0.272
nder+0.267
stead+0.264
ð+0.251
phabet+0.248
Neg logit contributions
nard-0.308
tan-0.294
-0.293
usal-0.279
minster-0.277
Token:
Token ID25
Feature activation+1.16e+00

Pos logit contributions
nard+2.110
tan+2.036
+1.992
serving+1.872
minster+1.866
Neg logit contributions
Barn-2.253
 myster-2.204
phabet-2.201
nder-2.198
stead-2.162
Token01
Token ID486
Feature activation+2.34e-02

Pos logit contributions
tan+6.769
nard+6.601
+6.257
DCS+6.019
serving+5.744
Neg logit contributions
phabet-15.760
 myster-15.742
ovember-14.132
nder-13.956
igans-13.835
Tokenam
Token ID321
Feature activation-5.16e-02

Pos logit contributions
nard+0.250
+0.240
tan+0.237
INESS+0.228
usal+0.226
Neg logit contributions
Barn-0.243
nder-0.239
stead-0.237
ð-0.226
say-0.217
Token PDT
Token ID28288
Feature activation+2.42e-01

Pos logit contributions
Barn+0.557
nder+0.546
stead+0.541
ð+0.518
phabet+0.505
Neg logit contributions
nard-0.534
-0.508
tan-0.508
INESS-0.482
usal-0.479
Token
Token ID198
Feature activation+1.54e-01

Pos logit contributions
nard+2.770
+2.653
tan+2.647
INESS+2.531
usal+2.524
Neg logit contributions
Barn-2.311
nder-2.267
stead-2.241
ð-2.132
phabet-2.045
Token
Token ID198
Feature activation+5.30e-02

Pos logit contributions
nard+1.598
tan+1.524
+1.492
serving+1.415
usal+1.401
Neg logit contributions
Barn-1.632
nder-1.604
 myster-1.594
phabet-1.592
stead-1.577
Token 16
Token ID1467
Feature activation-4.24e-03

Pos logit contributions
nard+1.208
tan+1.145
+1.139
usal+1.073
serving+1.071
Neg logit contributions
Barn-1.010
nder-0.986
stead-0.980
phabet-0.951
ð-0.935
Token,
Token ID11
Feature activation+3.48e-01

Pos logit contributions
Barn+0.044
nder+0.043
stead+0.042
ð+0.041
phabet+0.041
Neg logit contributions
nard-0.046
tan-0.044
-0.043
INESS-0.041
usal-0.041
Token 2017
Token ID2177
Feature activation-1.74e-02

Pos logit contributions
nard+3.628
+3.409
tan+3.396
INESS+3.240
usal+3.199
Neg logit contributions
Barn-3.653
nder-3.571
stead-3.508
phabet-3.393
ð-3.384
Token at
Token ID379
Feature activation+3.44e-02

Pos logit contributions
Barn+0.192
nder+0.188
stead+0.186
ð+0.179
phabet+0.174
Neg logit contributions
nard-0.176
-0.166
tan-0.166
usal-0.157
serving-0.156
Token 9
Token ID860
Feature activation+7.32e-02

Pos logit contributions
nard+0.379
tan+0.362
+0.359
usal+0.341
minster+0.341
Neg logit contributions
Barn-0.347
nder-0.341
stead-0.337
phabet-0.321
ð-0.321
Token:
Token ID25
Feature activation+1.15e+00

Pos logit contributions
nard+0.788
tan+0.758
+0.747
usal+0.705
serving+0.704
Neg logit contributions
Barn-0.756
nder-0.738
stead-0.729
phabet-0.711
ð-0.701
Token28
Token ID2078
Feature activation-1.78e-01

Pos logit contributions
tan+6.738
nard+6.527
+6.167
DCS+5.988
serving+5.726
Neg logit contributions
phabet-15.526
 myster-15.399
ovember-13.942
nder-13.719
igans-13.713
Tokenam
Token ID321
Feature activation-2.30e-01

Pos logit contributions
nder+1.752
Barn+1.745
stead+1.719
ð+1.628
say+1.591
Neg logit contributions
nard-1.944
-1.911
tan-1.834
INESS-1.812
usal-1.773
Token PST
Token ID28220
Feature activation+3.24e-02

Pos logit contributions
Barn+2.502
nder+2.469
stead+2.448
ð+2.350
say+2.240
Neg logit contributions
nard-2.316
-2.230
tan-2.218
usal-2.130
INESS-2.098
Token
Token ID198
Feature activation+2.62e-01

Pos logit contributions
nard+0.387
+0.373
tan+0.371
usal+0.357
INESS+0.356
Neg logit contributions
Barn-0.297
nder-0.290
stead-0.287
ð-0.272
phabet-0.260
Token
Token ID198
Feature activation+8.89e-03

Pos logit contributions
nard+2.587
tan+2.467
+2.400
serving+2.271
minster+2.243
Neg logit contributions
 myster-2.880
Barn-2.849
phabet-2.848
nder-2.807
stead-2.768
Token was
Token ID373
Feature activation-7.20e-02

Pos logit contributions
Barn+1.358
nder+1.332
stead+1.322
ð+1.262
phabet+1.212
Neg logit contributions
nard-1.400
-1.340
tan-1.335
usal-1.272
INESS-1.271
Token like
Token ID588
Feature activation+1.62e-01

Pos logit contributions
Barn+0.723
nder+0.708
stead+0.701
ð+0.668
phabet+0.643
Neg logit contributions
nard-0.795
-0.763
tan-0.759
usal-0.725
INESS-0.723
Token,
Token ID11
Feature activation+2.63e-01

Pos logit contributions
nard+1.761
+1.686
tan+1.680
usal+1.599
INESS+1.589
Neg logit contributions
Barn-1.637
nder-1.597
stead-1.579
ð-1.482
phabet-1.464
Token
Token ID564
Feature activation+1.45e-01

Pos logit contributions
nard+2.804
tan+2.690
+2.679
INESS+2.536
usal+2.535
Neg logit contributions
Barn-2.675
nder-2.617
stead-2.578
ð-2.407
ovember-2.386
Token
Token ID250
Feature activation+6.50e-02

Pos logit contributions
nard+1.339
+1.290
tan+1.285
usal+1.212
INESS+1.201
Neg logit contributions
Barn-1.706
nder-1.668
stead-1.666
ð-1.586
phabet-1.539
TokenOh
Token ID5812
Feature activation+1.13e+00

Pos logit contributions
nard+0.752
+0.725
tan+0.724
usal+0.691
INESS+0.688
Neg logit contributions
Barn-0.610
nder-0.597
stead-0.590
ð-0.559
phabet-0.541
Token my
Token ID616
Feature activation+1.55e-01

Pos logit contributions
nard+9.244
+9.193
tan+8.901
hai+8.508
usal+8.216
Neg logit contributions
Barn-11.016
ovember-10.621
nder-10.543
stead-10.382
neys-10.378
Token god
Token ID5770
Feature activation+5.04e-01

Pos logit contributions
nard+1.578
+1.532
tan+1.505
usal+1.439
INESS+1.425
Neg logit contributions
Barn-1.673
nder-1.631
stead-1.616
ð-1.535
ovember-1.508
Token,
Token ID11
Feature activation+4.72e-01

Pos logit contributions
nard+4.954
+4.775
tan+4.702
INESS+4.465
usal+4.458
Neg logit contributions
Barn-5.285
nder-5.093
stead-5.045
ovember-4.857
say-4.739
Token he
Token ID339
Feature activation+7.22e-01

Pos logit contributions
nard+4.638
+4.480
tan+4.413
usal+4.194
INESS+4.118
Neg logit contributions
Barn-4.907
nder-4.799
stead-4.777
ovember-4.514
phabet-4.428
Token thinks
Token ID6834
Feature activation+2.22e-01

Pos logit contributions
nard+6.560
+6.388
tan+6.158
usal+5.622
ipal+5.596
Neg logit contributions
Barn-7.800
stead-7.472
nder-7.268
phabet-7.213
ovember-7.144
Token-
Token ID12
Feature activation+2.47e-01

Pos logit contributions
Barn+0.062
nder+0.061
stead+0.061
phabet+0.058
ð+0.058
Neg logit contributions
nard-0.066
tan-0.063
-0.062
usal-0.059
INESS-0.059
Token20
Token ID1238
Feature activation+1.54e-01

Pos logit contributions
nard+2.281
tan+2.190
+2.171
serving+2.015
INESS+2.000
Neg logit contributions
phabet-2.846
nder-2.811
 myster-2.796
stead-2.791
Barn-2.773
Token 01
Token ID5534
Feature activation+5.00e-01

Pos logit contributions
nard+1.661
tan+1.576
+1.566
minster+1.484
INESS+1.476
Neg logit contributions
nder-1.550
Barn-1.545
stead-1.542
phabet-1.519
ð-1.450
Token:
Token ID25
Feature activation+1.03e+00

Pos logit contributions
nard+4.280
tan+4.108
+3.920
serving+3.667
minster+3.655
Neg logit contributions
 myster-6.256
phabet-6.162
nder-5.742
Barn-5.717
ovember-5.653
Token44
Token ID2598
Feature activation+1.58e-01

Pos logit contributions
tan+6.215
nard+6.030
+5.724
DCS+5.710
serving+5.316
Neg logit contributions
phabet-14.047
 myster-13.778
ovember-12.633
nder-12.623
igans-12.493
Token:
Token ID25
Feature activation+1.13e+00

Pos logit contributions
nard+1.640
tan+1.567
+1.541
minster+1.452
INESS+1.451
Neg logit contributions
Barn-1.660
stead-1.655
nder-1.639
phabet-1.594
 myster-1.569
Token24
Token ID1731
Feature activation+2.98e-01

Pos logit contributions
tan+6.382
nard+6.278
DCS+5.995
+5.876
serving+5.487
Neg logit contributions
phabet-15.425
 myster-15.111
ovember-13.884
nder-13.872
igans-13.671
Token #
Token ID1303
Feature activation-3.50e-01

Pos logit contributions
nard+2.997
tan+2.968
+2.817
INESS+2.742
minster+2.684
Neg logit contributions
stead-3.171
Barn-3.152
nder-3.115
phabet-3.041
 myster-3.018
Token19
Token ID1129
Feature activation+3.53e-01

Pos logit contributions
Barn+3.128
nder+3.004
stead+2.928
ð+2.845
say+2.694
Neg logit contributions
nard-4.194
-4.126
tan-4.030
INESS-3.953
minster-3.874
Token edit
Token ID4370
Feature activation+3.08e-01

Pos logit contributions
nard+3.704
tan+3.545
+3.502
usal+3.334
minster+3.294
Neg logit contributions
Barn-3.626
nder-3.540
stead-3.501
ð-3.294
phabet-3.254
Token,
Token ID11
Feature activation+2.71e-01

Pos logit contributions
nard+3.318
tan+3.192
+3.131
usal+3.009
serving+2.936
Neg logit contributions
Barn-3.124
nder-3.072
stead-3.005
ð-2.877
phabet-2.787
Token
Token ID198
Feature activation+4.11e-01

Pos logit contributions
nard+1.245
tan+1.185
+1.170
serving+1.105
minster+1.095
Neg logit contributions
Barn-1.230
nder-1.213
phabet-1.193
stead-1.192
 myster-1.190
TokenMike
Token ID16073
Feature activation-1.00e-01

Pos logit contributions
nard+3.816
+3.719
tan+3.628
INESS+3.413
usal+3.405
Neg logit contributions
nder-4.670
Barn-4.648
stead-4.558
phabet-4.332
ð-4.307
Token Don
Token ID2094
Feature activation+7.23e-03

Pos logit contributions
Barn+1.186
nder+1.171
stead+1.158
ð+1.112
phabet+1.094
Neg logit contributions
nard-0.932
tan-0.882
-0.876
INESS-0.821
usal-0.819
Tokenais
Token ID15152
Feature activation-6.08e-02

Pos logit contributions
nard+0.084
tan+0.080
+0.079
usal+0.076
serving+0.075
Neg logit contributions
Barn-0.070
nder-0.069
stead-0.067
ð-0.064
phabet-0.063
Token:
Token ID25
Feature activation+1.69e-01

Pos logit contributions
Barn+0.592
nder+0.580
stead+0.572
ð+0.544
phabet+0.536
Neg logit contributions
nard-0.691
-0.662
tan-0.662
INESS-0.627
usal-0.626
Token Oh
Token ID3966
Feature activation+1.10e+00

Pos logit contributions
nard+1.881
tan+1.804
+1.802
INESS+1.674
serving+1.668
Neg logit contributions
Barn-1.659
nder-1.639
stead-1.622
phabet-1.535
ð-1.527
Token yeah
Token ID10194
Feature activation+5.26e-01

Pos logit contributions
nard+8.633
+8.560
tan+8.529
hai+7.691
usal+7.562
Neg logit contributions
Barn-11.346
ovember-10.872
nder-10.743
stead-10.596
phabet-10.514
Token.
Token ID13
Feature activation+7.51e-02

Pos logit contributions
nard+5.126
+4.943
tan+4.903
usal+4.538
INESS+4.532
Neg logit contributions
Barn-5.628
nder-5.477
stead-5.407
ovember-5.126
phabet-5.109
Token Often
Token ID18023
Feature activation-1.49e-01

Pos logit contributions
nard+0.837
+0.803
tan+0.801
INESS+0.753
usal+0.751
Neg logit contributions
Barn-0.740
nder-0.731
stead-0.723
ð-0.683
phabet-0.672
Token,
Token ID11
Feature activation+9.43e-02

Pos logit contributions
Barn+1.499
nder+1.473
stead+1.457
ð+1.393
say+1.333
Neg logit contributions
nard-1.619
-1.560
tan-1.549
INESS-1.485
usal-1.479
Token when
Token ID618
Feature activation+4.07e-02

Pos logit contributions
nard+0.978
tan+0.931
+0.930
usal+0.884
serving+0.872
Neg logit contributions
Barn-1.011
nder-0.991
stead-0.985
ð-0.935
phabet-0.911
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
nard+1.238
tan+1.191
+1.184
INESS+1.127
usal+1.122
Neg logit contributions
Barn-1.078
nder-1.055
stead-1.043
ð-0.988
phabet-0.970
Token
Token ID198
Feature activation+2.13e-01

Pos logit contributions
Barn+1.661
nder+1.625
stead+1.612
ð+1.528
say+1.455
Neg logit contributions
nard-2.117
-2.037
tan-2.027
usal-1.946
INESS-1.944
TokenCharles
Token ID28711
Feature activation-1.52e-01

Pos logit contributions
nard+2.287
+2.194
tan+2.185
INESS+2.075
usal+2.061
Neg logit contributions
Barn-2.145
nder-2.135
stead-2.115
ð-2.013
phabet-1.960
Token Burns
Token ID23592
Feature activation+5.12e-01

Pos logit contributions
Barn+1.222
nder+1.192
stead+1.179
ð+1.109
phabet+1.053
Neg logit contributions
nard-1.983
-1.913
tan-1.907
usal-1.834
INESS-1.832
Token :
Token ID1058
Feature activation+4.20e-01

Pos logit contributions
nard+3.873
tan+3.639
+3.520
minster+3.238
INESS+3.174
Neg logit contributions
Barn-6.617
 myster-6.607
phabet-6.532
nder-6.371
stead-6.318
Token Oh
Token ID3966
Feature activation+1.09e+00

Pos logit contributions
nard+4.439
tan+4.311
+4.274
INESS+3.929
serving+3.877
Neg logit contributions
Barn-4.133
nder-4.028
stead-3.977
phabet-3.953
ovember-3.940
Token yeah
Token ID10194
Feature activation+5.56e-01

Pos logit contributions
nard+8.492
+8.440
tan+8.401
hai+7.461
usal+7.456
Neg logit contributions
Barn-11.403
ovember-11.090
phabet-10.718
nder-10.637
neys-10.589
Token,
Token ID11
Feature activation+4.74e-01

Pos logit contributions
nard+5.294
tan+5.099
+5.053
INESS+4.726
usal+4.646
Neg logit contributions
Barn-6.021
nder-5.780
stead-5.727
ovember-5.553
phabet-5.521
Token of
Token ID286
Feature activation+2.66e-01

Pos logit contributions
nard+4.615
tan+4.434
+4.425
usal+4.050
serving+4.037
Neg logit contributions
Barn-5.000
nder-4.898
stead-4.830
ovember-4.655
ð-4.574
Token course
Token ID1781
Feature activation+5.49e-02

Pos logit contributions
nard+2.940
+2.842
tan+2.817
usal+2.682
serving+2.653
Neg logit contributions
Barn-2.606
nder-2.527
stead-2.505
ð-2.398
phabet-2.369
Token.
Token ID13
Feature activation+4.90e-02

Pos logit contributions
nard+0.616
+0.591
tan+0.589
usal+0.561
INESS+0.561
Neg logit contributions
Barn-0.542
nder-0.530
stead-0.525
ð-0.499
phabet-0.481
Token bit
Token ID1643
Feature activation-6.13e-02

Pos logit contributions
nard+1.408
+1.335
tan+1.329
usal+1.262
INESS+1.249
Neg logit contributions
Barn-1.610
nder-1.566
stead-1.557
ð-1.492
phabet-1.446
Token more
Token ID517
Feature activation-6.43e-02

Pos logit contributions
Barn+0.665
nder+0.653
stead+0.648
ð+0.620
phabet+0.597
Neg logit contributions
nard-0.628
-0.600
tan-0.597
usal-0.568
INESS-0.567
Token top
Token ID1353
Feature activation-8.15e-02

Pos logit contributions
Barn+0.694
nder+0.681
stead+0.678
ð+0.648
phabet+0.622
Neg logit contributions
nard-0.660
-0.632
tan-0.628
INESS-0.599
usal-0.598
Token-
Token ID12
Feature activation-1.94e-01

Pos logit contributions
Barn+0.795
nder+0.779
stead+0.773
ð+0.735
phabet+0.706
Neg logit contributions
nard-0.921
-0.885
tan-0.881
usal-0.842
INESS-0.842
Tokenend
Token ID437
Feature activation+3.80e-01

Pos logit contributions
Barn+2.104
stead+2.084
nder+2.081
ð+1.974
say+1.923
Neg logit contributions
nard-1.951
-1.889
tan-1.847
INESS-1.791
usal-1.774
Token spark
Token ID9009
Feature activation+1.08e+00

Pos logit contributions
nard+3.749
+3.538
tan+3.505
usal+3.379
INESS+3.328
Neg logit contributions
Barn-4.184
nder-4.053
stead-3.988
ð-3.840
ovember-3.735
Tokenle
Token ID293
Feature activation+4.94e-01

Pos logit contributions
nard+9.899
tan+9.101
+9.018
INESS+8.734
usal+8.565
Neg logit contributions
Barn-10.605
nder-10.600
stead-10.247
ovember-10.191
ð-10.001
Token and
Token ID290
Feature activation+1.11e-01

Pos logit contributions
nard+4.708
+4.387
tan+4.354
usal+4.180
INESS+4.163
Neg logit contributions
Barn-5.527
nder-5.368
stead-5.250
ð-5.095
ovember-4.974
Token superb
Token ID21840
Feature activation+8.04e-02

Pos logit contributions
nard+1.124
tan+1.072
+1.070
usal+1.017
INESS+1.012
Neg logit contributions
Barn-1.208
nder-1.183
stead-1.171
ð-1.120
ovember-1.082
Token neutrality
Token ID20723
Feature activation+2.51e-01

Pos logit contributions
nard+0.823
+0.784
tan+0.783
usal+0.743
INESS+0.741
Neg logit contributions
Barn-0.874
nder-0.854
stead-0.845
ð-0.810
ovember-0.784
Token,
Token ID11
Feature activation+1.93e-01

Pos logit contributions
nard+2.514
+2.371
tan+2.345
INESS+2.262
usal+2.262
Neg logit contributions
Barn-2.772
nder-2.706
stead-2.653
ð-2.563
ovember-2.476
Tokendata
Token ID7890
Feature activation+1.23e-01

Pos logit contributions
nard+3.648
tan+3.448
+3.315
serving+3.137
usal+3.118
Neg logit contributions
Barn-7.563
nder-7.229
stead-7.202
ovember-6.989
ð-6.978
Token_
Token ID62
Feature activation+6.48e-01

Pos logit contributions
nard+1.519
+1.466
tan+1.460
usal+1.401
INESS+1.398
Neg logit contributions
Barn-1.076
nder-1.050
stead-1.042
ð-0.983
phabet-0.937
Tokenfull
Token ID12853
Feature activation+5.72e-01

Pos logit contributions
tan+6.411
nard+6.404
+6.101
serving+5.907
usal+5.882
Neg logit contributions
Barn-6.732
nder-6.350
stead-6.271
ovember-6.130
phabet-6.122
Token v
Token ID410
Feature activation+4.28e-01

Pos logit contributions
nard+5.690
tan+5.529
+5.462
usal+5.071
serving+5.065
Neg logit contributions
Barn-6.012
stead-5.779
nder-5.675
ð-5.395
ovember-5.375
Token)
Token ID8
Feature activation+4.89e-01

Pos logit contributions
nard+4.486
tan+4.373
+4.260
usal+4.011
serving+4.003
Neg logit contributions
Barn-4.391
stead-4.179
nder-4.093
ð-4.016
ovember-3.953
Token {
Token ID1391
Feature activation+1.08e+00

Pos logit contributions
nard+4.854
tan+4.802
+4.700
usal+4.513
minster+4.353
Neg logit contributions
Barn-5.193
nder-4.985
stead-4.939
ð-4.676
phabet-4.670
Token v
Token ID410
Feature activation+2.84e-01

Pos logit contributions
nard+8.985
tan+8.981
+8.370
usal+7.955
 typed+7.931
Neg logit contributions
Barn-11.316
stead-10.712
nder-10.648
phabet-10.285
ð-10.155
Token.
Token ID13
Feature activation+2.61e-01

Pos logit contributions
nard+3.073
tan+2.992
+2.945
usal+2.790
INESS+2.775
Neg logit contributions
Barn-2.865
stead-2.780
nder-2.767
ð-2.632
phabet-2.568
Tokenver
Token ID332
Feature activation+2.24e-02

Pos logit contributions
nard+2.717
tan+2.669
+2.622
usal+2.466
INESS+2.434
Neg logit contributions
Barn-2.775
nder-2.676
stead-2.651
ð-2.530
phabet-2.481
Tokentex
Token ID16886
Feature activation+6.89e-01

Pos logit contributions
nard+0.183
+0.173
tan+0.172
usal+0.161
INESS+0.160
Neg logit contributions
Barn-0.290
nder-0.285
stead-0.283
ð-0.272
phabet-0.265
Token.
Token ID13
Feature activation+3.83e-01

Pos logit contributions
nard+7.449
tan+7.387
+7.324
INESS+6.892
usal+6.756
Neg logit contributions
Barn-6.143
stead-6.082
nder-6.028
phabet-5.703
ð-5.642
Token:
Token ID25
Feature activation-1.29e-02

Pos logit contributions
Barn+0.415
nder+0.407
stead+0.404
ð+0.386
phabet+0.372
Neg logit contributions
nard-0.401
-0.383
tan-0.381
usal-0.363
INESS-0.362
Token "
Token ID366
Feature activation-1.18e-01

Pos logit contributions
Barn+0.129
nder+0.126
stead+0.125
ð+0.119
phabet+0.114
Neg logit contributions
nard-0.144
-0.138
tan-0.137
usal-0.131
INESS-0.131
TokenI
Token ID40
Feature activation-1.62e-01

Pos logit contributions
Barn+1.129
nder+1.102
stead+1.094
ð+1.038
phabet+0.993
Neg logit contributions
nard-1.364
-1.309
tan-1.303
usal-1.249
INESS-1.247
Token thought
Token ID1807
Feature activation+1.13e-01

Pos logit contributions
Barn+1.712
nder+1.682
stead+1.668
ð+1.593
phabet+1.531
Neg logit contributions
nard-1.703
-1.631
tan-1.623
usal-1.546
INESS-1.545
Token,
Token ID11
Feature activation+3.17e-01

Pos logit contributions
nard+1.162
+1.108
tan+1.103
usal+1.052
INESS+1.047
Neg logit contributions
Barn-1.220
nder-1.196
stead-1.185
ð-1.135
ovember-1.109
Token Oh
Token ID3966
Feature activation+1.07e+00

Pos logit contributions
nard+3.405
tan+3.249
+3.226
INESS+3.087
usal+3.083
Neg logit contributions
Barn-3.148
nder-3.107
stead-3.050
ð-2.920
phabet-2.848
Token,
Token ID11
Feature activation+8.64e-01

Pos logit contributions
nard+7.704
+7.489
tan+7.478
usal+6.747
hai+6.619
Neg logit contributions
Barn-11.660
ovember-11.537
nder-11.351
stead-11.197
neys-11.027
Token my
Token ID616
Feature activation+5.67e-02

Pos logit contributions
nard+6.713
tan+6.405
+6.383
serving+5.988
usal+5.932
Neg logit contributions
Barn-9.545
nder-9.524
stead-9.420
ovember-9.396
ð-9.042
Token g
Token ID308
Feature activation+2.92e-01

Pos logit contributions
nard+0.547
+0.526
tan+0.521
usal+0.495
INESS+0.491
Neg logit contributions
Barn-0.644
nder-0.631
stead-0.626
ð-0.601
phabet-0.585
Tokenosh
Token ID3768
Feature activation+4.21e-01

Pos logit contributions
nard+2.505
tan+2.395
usal+2.282
+2.276
ipal+2.175
Neg logit contributions
Barn-3.631
stead-3.487
nder-3.487
ð-3.341
ovember-3.329
Token,
Token ID11
Feature activation+2.30e-01

Pos logit contributions
nard+4.174
tan+3.969
+3.916
INESS+3.760
usal+3.721
Neg logit contributions
Barn-4.513
nder-4.407
stead-4.293
ovember-4.178
ð-4.167
Token by
Token ID416
Feature activation-1.90e-01

Pos logit contributions
Barn+2.008
nder+1.976
stead+1.966
ð+1.886
phabet+1.779
Neg logit contributions
nard-2.129
-2.035
tan-2.034
INESS-1.937
usal-1.934
Token the
Token ID262
Feature activation+7.86e-03

Pos logit contributions
Barn+1.964
nder+1.918
stead+1.909
ð+1.836
phabet+1.745
Neg logit contributions
nard-2.038
-1.943
tan-1.939
INESS-1.857
usal-1.841
Token mess
Token ID2085
Feature activation+5.46e-01

Pos logit contributions
nard+0.078
+0.075
tan+0.075
usal+0.071
INESS+0.071
Neg logit contributions
Barn-0.087
nder-0.086
stead-0.085
ð-0.081
phabet-0.079
Token of
Token ID286
Feature activation+4.77e-02

Pos logit contributions
nard+5.115
+4.899
INESS+4.829
tan+4.825
minster+4.674
Neg logit contributions
Barn-5.992
nder-5.957
stead-5.739
ovember-5.533
ð-5.485
Token the
Token ID262
Feature activation+1.56e-01

Pos logit contributions
nard+0.484
+0.464
tan+0.462
usal+0.440
INESS+0.436
Neg logit contributions
Barn-0.521
nder-0.513
stead-0.507
ð-0.484
phabet-0.469
Token inco
Token ID44237
Feature activation+1.06e+00

Pos logit contributions
nard+1.629
+1.574
tan+1.566
usal+1.486
INESS+1.466
Neg logit contributions
Barn-1.635
nder-1.608
stead-1.587
ð-1.503
ovember-1.475
Tokenherent
Token ID8334
Feature activation+2.16e-01

Pos logit contributions
hesive+5.808
usal+5.663
nard+5.451
minster+5.194
verage+5.187
Neg logit contributions
Barn-14.250
nder-13.764
stead-13.624
ovember-13.387
 myster-13.146
Token politics
Token ID4819
Feature activation+2.65e-01

Pos logit contributions
nard+2.220
+2.136
tan+2.124
usal+2.034
INESS+2.013
Neg logit contributions
Barn-2.277
nder-2.234
stead-2.231
ð-2.098
ovember-2.083
Token that
Token ID326
Feature activation+4.23e-02

Pos logit contributions
nard+2.608
+2.512
tan+2.462
usal+2.365
serving+2.338
Neg logit contributions
Barn-2.929
nder-2.879
stead-2.824
ð-2.685
ovember-2.658
Token is
Token ID318
Feature activation-1.70e-02

Pos logit contributions
nard+0.429
+0.410
tan+0.408
usal+0.388
INESS+0.385
Neg logit contributions
Barn-0.463
nder-0.454
stead-0.451
ð-0.429
phabet-0.416
Token has
Token ID468
Feature activation+4.13e-01

Pos logit contributions
Barn+0.179
nder+0.176
stead+0.174
ð+0.166
phabet+0.160
Neg logit contributions
nard-0.179
-0.172
tan-0.171
usal-0.163
INESS-0.162
Token11
Token ID1157
Feature activation-1.53e-01

Pos logit contributions
nard+0.389
+0.374
tan+0.373
INESS+0.353
usal+0.353
Neg logit contributions
Barn-0.326
nder-0.322
stead-0.318
ð-0.303
phabet-0.297
Token-
Token ID12
Feature activation+2.87e-01

Pos logit contributions
Barn+1.362
nder+1.342
stead+1.320
ð+1.259
ovember+1.195
Neg logit contributions
nard-1.842
-1.776
tan-1.765
usal-1.695
INESS-1.693
Token2014
Token ID4967
Feature activation+3.14e-02

Pos logit contributions
nard+2.841
tan+2.719
+2.708
usal+2.569
INESS+2.567
Neg logit contributions
Barn-3.148
nder-3.092
stead-3.064
ð-2.962
phabet-2.861
Token at
Token ID379
Feature activation+3.87e-01

Pos logit contributions
nard+0.385
+0.370
tan+0.369
INESS+0.354
usal+0.354
Neg logit contributions
Barn-0.277
nder-0.272
stead-0.269
ð-0.254
phabet-0.243
Token 08
Token ID8487
Feature activation+3.66e-01

Pos logit contributions
nard+4.148
+4.016
tan+3.996
usal+3.827
minster+3.786
Neg logit contributions
nder-3.777
Barn-3.768
stead-3.758
ð-3.503
phabet-3.422
Token:
Token ID25
Feature activation+1.06e+00

Pos logit contributions
nard+3.821
tan+3.684
+3.670
usal+3.499
INESS+3.462
Neg logit contributions
Barn-3.745
stead-3.660
nder-3.646
ð-3.463
phabet-3.393
Token05
Token ID2713
Feature activation+9.15e-02

Pos logit contributions
tan+6.802
nard+6.702
+6.491
DCS+6.267
serving+5.890
Neg logit contributions
phabet-13.205
 myster-13.010
nder-12.391
igans-12.196
ovember-12.078
Token UTC
Token ID18119
Feature activation-2.76e-01

Pos logit contributions
nard+1.031
+0.988
tan+0.988
INESS+0.943
usal+0.943
Neg logit contributions
Barn-0.896
nder-0.879
stead-0.872
ð-0.826
phabet-0.795
Token,
Token ID11
Feature activation-1.62e-02

Pos logit contributions
Barn+2.395
nder+2.322
stead+2.310
ð+2.203
phabet+2.084
Neg logit contributions
nard-3.383
-3.248
tan-3.245
usal-3.130
INESS-3.085
Token updated
Token ID6153
Feature activation-7.84e-02

Pos logit contributions
Barn+0.172
nder+0.169
stead+0.167
ð+0.160
phabet+0.154
Neg logit contributions
nard-0.169
-0.162
tan-0.161
usal-0.153
INESS-0.153
Token at
Token ID379
Feature activation-9.20e-02

Pos logit contributions
Barn+0.594
nder+0.579
stead+0.572
ð+0.536
phabet+0.507
Neg logit contributions
nard-1.058
-1.023
tan-1.019
usal-0.982
INESS-0.981
Token that
Token ID326
Feature activation+5.74e-01

Pos logit contributions
nard+6.208
+6.016
tan+5.923
usal+5.653
ipal+5.538
Neg logit contributions
nder-5.852
Barn-5.836
stead-5.669
phabet-5.606
ovember-5.546
Token suggestion
Token ID13052
Feature activation+3.46e-01

Pos logit contributions
nard+5.539
+5.269
tan+5.250
minster+4.998
usal+4.953
Neg logit contributions
Barn-6.112
stead-5.898
nder-5.896
phabet-5.878
ð-5.697
Token,
Token ID11
Feature activation+4.41e-01

Pos logit contributions
nard+3.621
+3.469
tan+3.446
INESS+3.274
usal+3.267
Neg logit contributions
Barn-3.581
nder-3.511
stead-3.471
ð-3.317
phabet-3.295
Token Stella
Token ID45856
Feature activation+3.00e-01

Pos logit contributions
nard+4.090
+3.905
tan+3.895
INESS+3.589
usal+3.588
Neg logit contributions
Barn-4.931
nder-4.877
stead-4.822
phabet-4.688
ovember-4.663
Token opened
Token ID4721
Feature activation+7.46e-01

Pos logit contributions
nard+2.916
tan+2.786
+2.781
usal+2.586
minster+2.523
Neg logit contributions
Barn-3.351
nder-3.208
stead-3.192
phabet-3.153
ð-3.043
Token her
Token ID607
Feature activation+1.06e+00

Pos logit contributions
nard+5.723
+5.636
tan+5.573
usal+5.235
INESS+5.047
Neg logit contributions
stead-8.713
nder-8.701
Barn-8.661
ovember-8.472
phabet-8.255
Token scar
Token ID10153
Feature activation+1.24e-01

Pos logit contributions
tan+7.450
+7.328
nard+7.271
 commanding+7.016
serving+6.573
Neg logit contributions
Barn-11.992
phabet-11.726
stead-11.636
nder-11.461
ovember-11.423
Tokenlet
Token ID1616
Feature activation+2.21e-01

Pos logit contributions
nard+1.269
tan+1.204
+1.194
usal+1.146
INESS+1.133
Neg logit contributions
Barn-1.353
nder-1.328
stead-1.309
ð-1.251
phabet-1.216
Token eyes
Token ID2951
Feature activation+5.06e-01

Pos logit contributions
nard+2.331
tan+2.270
+2.233
usal+2.120
INESS+2.096
Neg logit contributions
Barn-2.294
nder-2.226
stead-2.226
phabet-2.122
ð-2.106
Token wide
Token ID3094
Feature activation+8.89e-01

Pos logit contributions
nard+4.721
+4.508
tan+4.493
INESS+4.235
usal+4.200
Neg logit contributions
Barn-5.694
nder-5.511
stead-5.471
phabet-5.414
ð-5.247
Token.
Token ID13
Feature activation+4.43e-01

Pos logit contributions
nard+7.032
tan+6.830
+6.794
INESS+6.322
usal+6.317
Neg logit contributions
Barn-10.367
phabet-9.790
nder-9.763
stead-9.740
ovember-9.697
Token was
Token ID373
Feature activation+3.87e-01

Pos logit contributions
nard+5.200
+4.892
tan+4.801
minster+4.541
INESS+4.516
Neg logit contributions
Barn-5.287
nder-5.259
stead-5.179
phabet-4.923
ð-4.919
Token 18
Token ID1248
Feature activation+2.76e-01

Pos logit contributions
nard+3.775
tan+3.584
+3.566
serving+3.357
INESS+3.340
Neg logit contributions
Barn-4.251
nder-4.205
stead-4.125
ð-3.958
phabet-3.922
Token when
Token ID618
Feature activation+1.15e-01

Pos logit contributions
nard+2.834
+2.721
tan+2.703
serving+2.563
INESS+2.528
Neg logit contributions
Barn-2.932
nder-2.871
stead-2.814
ð-2.678
phabet-2.672
Token he
Token ID339
Feature activation+1.49e-01

Pos logit contributions
nard+1.185
+1.128
tan+1.120
usal+1.064
INESS+1.058
Neg logit contributions
Barn-1.230
nder-1.212
stead-1.199
ð-1.145
phabet-1.105
Token first
Token ID717
Feature activation+1.56e-01

Pos logit contributions
nard+1.471
tan+1.391
+1.383
usal+1.289
minster+1.279
Neg logit contributions
Barn-1.670
nder-1.634
stead-1.630
ð-1.555
phabet-1.541
Token started
Token ID2067
Feature activation+1.06e+00

Pos logit contributions
nard+1.548
tan+1.467
+1.457
usal+1.363
minster+1.342
Neg logit contributions
Barn-1.728
nder-1.696
stead-1.694
ð-1.607
phabet-1.605
Token making
Token ID1642
Feature activation+5.76e-01

Pos logit contributions
nard+8.209
tan+7.975
serving+7.684
+7.554
usal+7.426
Neg logit contributions
nder-11.306
Barn-11.226
stead-10.813
phabet-10.512
ð-10.469
Token photos
Token ID5205
Feature activation+2.71e-01

Pos logit contributions
nard+5.126
tan+4.886
+4.723
usal+4.618
serving+4.427
Neg logit contributions
nder-6.507
Barn-6.505
stead-6.225
ð-6.085
phabet-6.074
Token at
Token ID379
Feature activation+2.71e-02

Pos logit contributions
nard+2.662
+2.537
tan+2.523
usal+2.419
serving+2.367
Neg logit contributions
Barn-3.024
nder-2.975
stead-2.892
ð-2.798
phabet-2.727
Token Los
Token ID5401
Feature activation-8.09e-02

Pos logit contributions
nard+0.283
+0.270
tan+0.269
usal+0.256
INESS+0.255
Neg logit contributions
Barn-0.288
nder-0.283
stead-0.281
ð-0.268
phabet-0.259
Token Angeles
Token ID5652
Feature activation+2.40e-01

Pos logit contributions
Barn+1.305
nder+1.289
stead+1.282
ð+1.245
phabet+1.216
Neg logit contributions
nard-0.402
-0.364
tan-0.362
usal-0.322
INESS-0.321
Tokens
Token ID82
Feature activation+4.57e-01

Pos logit contributions
Barn+0.997
nder+0.980
stead+0.974
ð+0.931
say+0.889
Neg logit contributions
nard-0.985
-0.941
tan-0.937
INESS-0.895
usal-0.893
Token like
Token ID588
Feature activation+4.95e-02

Pos logit contributions
nard+4.470
+4.341
tan+4.311
usal+3.945
minster+3.915
Neg logit contributions
Barn-4.909
stead-4.785
nder-4.731
phabet-4.566
ovember-4.503
Token,
Token ID11
Feature activation+2.77e-01

Pos logit contributions
nard+0.515
+0.493
tan+0.490
usal+0.464
INESS+0.462
Neg logit contributions
Barn-0.530
nder-0.518
stead-0.517
ð-0.491
phabet-0.477
Token
Token ID564
Feature activation+5.75e-02

Pos logit contributions
nard+2.919
tan+2.825
+2.822
usal+2.649
INESS+2.640
Neg logit contributions
Barn-2.824
nder-2.762
stead-2.745
ð-2.600
phabet-2.537
Token
Token ID246
Feature activation+2.11e-01

Pos logit contributions
nard+0.524
+0.501
tan+0.498
usal+0.470
INESS+0.469
Neg logit contributions
Barn-0.685
nder-0.672
stead-0.670
ð-0.641
phabet-0.621
TokenOh
Token ID5812
Feature activation+1.05e+00

Pos logit contributions
nard+2.379
tan+2.315
+2.314
usal+2.183
serving+2.168
Neg logit contributions
Barn-2.012
nder-1.962
stead-1.947
ð-1.842
phabet-1.805
Token boy
Token ID2933
Feature activation+3.15e-01

Pos logit contributions
+8.816
nard+8.690
tan+8.629
hai+7.770
usal+7.635
Neg logit contributions
Barn-10.492
ovember-10.252
nder-10.150
neys-9.883
stead-9.850
Token,
Token ID11
Feature activation+2.56e-01

Pos logit contributions
nard+3.310
+3.180
tan+3.174
INESS+2.996
usal+2.983
Neg logit contributions
Barn-3.260
nder-3.133
stead-3.095
ð-2.951
ovember-2.909
Token I
Token ID314
Feature activation+5.79e-02

Pos logit contributions
nard+2.781
+2.707
tan+2.675
usal+2.529
INESS+2.518
Neg logit contributions
Barn-2.548
nder-2.492
stead-2.474
ð-2.330
ovember-2.273
Token have
Token ID423
Feature activation+3.94e-02

Pos logit contributions
nard+0.580
+0.552
tan+0.550
usal+0.522
INESS+0.516
Neg logit contributions
Barn-0.643
nder-0.622
stead-0.622
ð-0.589
phabet-0.577
Token not
Token ID407
Feature activation+1.47e-01

Pos logit contributions
nard+0.411
+0.395
tan+0.392
usal+0.372
INESS+0.371
Neg logit contributions
Barn-0.420
nder-0.411
stead-0.408
ð-0.389
phabet-0.376
Token
Token ID198
Feature activation-2.32e-02

Pos logit contributions
Barn+1.162
nder+1.136
stead+1.126
ð+1.071
say+1.024
Neg logit contributions
nard-1.360
-1.306
tan-1.300
usal-1.248
INESS-1.244
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
Barn+0.229
nder+0.225
stead+0.223
ð+0.212
phabet+0.212
Neg logit contributions
nard-0.259
tan-0.248
-0.247
usal-0.234
INESS-0.233
TokenThe
Token ID464
Feature activation-9.28e-02

Pos logit contributions
Barn+1.068
nder+1.044
stead+1.036
ð+0.987
phabet+0.945
Neg logit contributions
nard-1.178
-1.128
tan-1.123
usal-1.074
INESS-1.072
Token woman
Token ID2415
Feature activation-1.01e+00

Pos logit contributions
Barn+1.092
nder+1.073
stead+1.066
ð+1.023
phabet+0.988
Neg logit contributions
nard-0.864
-0.822
tan-0.818
usal-0.774
INESS-0.773
Token,
Token ID11
Feature activation-2.42e-01

Pos logit contributions
nder+8.362
stead+8.179
Barn+7.885
ð+7.769
say+7.332
Neg logit contributions
nard-10.973
-10.907
INESS-10.880
ongyang-10.705
tan-10.445
Token whose
Token ID3025
Feature activation-1.38e+00

Pos logit contributions
Barn+2.517
nder+2.457
stead+2.437
ð+2.322
say+2.248
Neg logit contributions
nard-2.581
-2.484
tan-2.446
INESS-2.357
usal-2.346
Token name
Token ID1438
Feature activation+2.08e-01

Pos logit contributions
stead+8.899
nder+8.850
Barn+8.412
ð+8.147
 Koh+7.884
Neg logit contributions
nard-15.266
ongyang-15.143
minster-14.934
INESS-14.798
serving-14.628
Token has
Token ID468
Feature activation+3.46e-02

Pos logit contributions
nard+2.093
+1.981
tan+1.974
INESS+1.868
usal+1.867
Neg logit contributions
Barn-2.287
nder-2.234
stead-2.233
ð-2.142
say-2.076
Token not
Token ID407
Feature activation-3.10e-01

Pos logit contributions
nard+0.406
+0.390
tan+0.389
INESS+0.373
usal+0.372
Neg logit contributions
Barn-0.322
nder-0.315
stead-0.313
ð-0.297
phabet-0.283
Token been
Token ID587
Feature activation-3.87e-01

Pos logit contributions
Barn+3.016
stead+2.991
nder+2.982
ð+2.857
neys+2.702
Neg logit contributions
nard-3.342
-3.278
tan-3.256
INESS-3.191
minster-3.094
Token released
Token ID2716
Feature activation+1.81e-01

Pos logit contributions
Barn+3.353
stead+3.353
nder+3.329
ð+3.192
say+3.076
Neg logit contributions
nard-4.441
-4.412
tan-4.398
INESS-4.381
serving-4.233
Token Sanders
Token ID5831
Feature activation-6.26e-02

Pos logit contributions
nard+2.826
tan+2.688
+2.683
usal+2.552
INESS+2.528
Neg logit contributions
Barn-2.193
nder-2.154
stead-2.100
ð-1.995
ovember-1.946
Token Are
Token ID4231
Feature activation-3.37e-01

Pos logit contributions
Barn+0.677
nder+0.664
stead+0.658
ð+0.630
phabet+0.609
Neg logit contributions
nard-0.643
-0.612
tan-0.611
usal-0.581
INESS-0.578
Token More
Token ID3125
Feature activation-1.78e-01

Pos logit contributions
Barn+3.282
stead+3.252
nder+3.207
ð+3.065
say+2.943
Neg logit contributions
nard-3.716
-3.618
tan-3.569
INESS-3.486
usal-3.406
Token Similar
Token ID11014
Feature activation+7.07e-02

Pos logit contributions
Barn+1.820
nder+1.787
stead+1.782
ð+1.694
say+1.628
Neg logit contributions
nard-1.911
-1.850
tan-1.824
INESS-1.765
usal-1.746
Token Than
Token ID17924
Feature activation-7.81e-01

Pos logit contributions
nard+0.827
tan+0.790
+0.790
usal+0.760
INESS+0.755
Neg logit contributions
Barn-0.665
nder-0.644
stead-0.640
ð-0.606
phabet-0.585
Token You
Token ID921
Feature activation-1.21e+00

Pos logit contributions
Barn+6.920
stead+6.816
nder+6.738
ð+6.428
say+6.306
Neg logit contributions
-8.537
nard-8.362
INESS-8.277
tan-8.206
usal-8.014
Token Think
Token ID11382
Feature activation-1.17e+00

Pos logit contributions
Barn+8.267
stead+8.192
nder+8.068
say+7.744
ð+7.730
Neg logit contributions
-14.627
nard-13.858
minster-13.543
INESS-13.463
tan-13.404
Token
Token ID198
Feature activation-3.48e-01

Pos logit contributions
Barn+8.260
stead+8.113
nder+8.055
ð+7.702
 Barn+7.159
Neg logit contributions
nard-13.254
-13.180
tan-12.606
INESS-12.523
serving-12.479
Token
Token ID198
Feature activation-1.71e-01

Pos logit contributions
Barn+3.040
nder+2.888
stead+2.886
ð+2.700
say+2.605
Neg logit contributions
nard-4.292
-4.105
tan-4.095
usal-4.028
INESS-3.947
TokenThe
Token ID464
Feature activation+1.38e-02

Pos logit contributions
Barn+1.703
nder+1.654
stead+1.646
ð+1.562
say+1.500
Neg logit contributions
nard-1.913
-1.828
tan-1.821
usal-1.749
INESS-1.742
Token biggest
Token ID4094
Feature activation+6.08e-01

Pos logit contributions
nard+0.144
tan+0.138
+0.138
usal+0.130
INESS+0.130
Neg logit contributions
Barn-0.147
nder-0.144
stead-0.142
ð-0.136
phabet-0.131
Token Are
Token ID4231
Feature activation-3.37e-01

Pos logit contributions
Barn+0.677
nder+0.664
stead+0.658
ð+0.630
phabet+0.609
Neg logit contributions
nard-0.643
-0.612
tan-0.611
usal-0.581
INESS-0.578
Token More
Token ID3125
Feature activation-1.78e-01

Pos logit contributions
Barn+3.282
stead+3.252
nder+3.207
ð+3.065
say+2.943
Neg logit contributions
nard-3.716
-3.618
tan-3.569
INESS-3.486
usal-3.406
Token Similar
Token ID11014
Feature activation+7.07e-02

Pos logit contributions
Barn+1.820
nder+1.787
stead+1.782
ð+1.694
say+1.628
Neg logit contributions
nard-1.911
-1.850
tan-1.824
INESS-1.765
usal-1.746
Token Than
Token ID17924
Feature activation-7.81e-01

Pos logit contributions
nard+0.827
tan+0.790
+0.790
usal+0.760
INESS+0.755
Neg logit contributions
Barn-0.665
nder-0.644
stead-0.640
ð-0.606
phabet-0.585
Token You
Token ID921
Feature activation-1.21e+00

Pos logit contributions
Barn+6.920
stead+6.816
nder+6.738
ð+6.428
say+6.306
Neg logit contributions
-8.537
nard-8.362
INESS-8.277
tan-8.206
usal-8.014
Token Think
Token ID11382
Feature activation-1.17e+00

Pos logit contributions
Barn+8.267
stead+8.192
nder+8.068
say+7.744
ð+7.730
Neg logit contributions
-14.627
nard-13.858
minster-13.543
INESS-13.463
tan-13.404
Token
Token ID198
Feature activation-3.48e-01

Pos logit contributions
Barn+8.260
stead+8.113
nder+8.055
ð+7.702
 Barn+7.159
Neg logit contributions
nard-13.254
-13.180
tan-12.606
INESS-12.523
serving-12.479
Token
Token ID198
Feature activation-1.71e-01

Pos logit contributions
Barn+3.040
nder+2.888
stead+2.886
ð+2.700
say+2.605
Neg logit contributions
nard-4.292
-4.105
tan-4.095
usal-4.028
INESS-3.947
TokenThe
Token ID464
Feature activation+1.38e-02

Pos logit contributions
Barn+1.703
nder+1.654
stead+1.646
ð+1.562
say+1.500
Neg logit contributions
nard-1.913
-1.828
tan-1.821
usal-1.749
INESS-1.742
Token biggest
Token ID4094
Feature activation+6.08e-01

Pos logit contributions
nard+0.144
tan+0.138
+0.138
usal+0.130
INESS+0.130
Neg logit contributions
Barn-0.147
nder-0.144
stead-0.142
ð-0.136
phabet-0.131
Token difference
Token ID3580
Feature activation+4.16e-02

Pos logit contributions
nard+5.535
tan+5.437
+5.224
minster+5.005
usal+4.966
Neg logit contributions
Barn-6.698
nder-6.658
stead-6.496
phabet-6.245
ð-6.215
Token doing
Token ID1804
Feature activation+2.77e-01

Pos logit contributions
nard+0.978
+0.935
tan+0.908
usal+0.864
serving+0.861
Neg logit contributions
Barn-1.461
nder-1.438
stead-1.409
ð-1.363
phabet-1.356
Token drugs
Token ID5010
Feature activation+1.27e-01

Pos logit contributions
nard+2.795
+2.711
tan+2.637
INESS+2.507
usal+2.494
Neg logit contributions
Barn-2.998
nder-2.929
stead-2.886
ð-2.747
phabet-2.717
Token or
Token ID393
Feature activation+2.03e-01

Pos logit contributions
nard+1.389
+1.326
tan+1.318
usal+1.256
INESS+1.252
Neg logit contributions
Barn-1.300
nder-1.271
stead-1.254
ð-1.192
phabet-1.164
Token enter
Token ID3802
Feature activation+3.28e-01

Pos logit contributions
nard+2.045
tan+1.958
+1.950
usal+1.833
INESS+1.822
Neg logit contributions
Barn-2.204
nder-2.176
stead-2.150
ð-2.065
phabet-2.039
Token a
Token ID257
Feature activation-1.44e-01

Pos logit contributions
nard+3.276
+3.120
tan+3.109
minster+2.905
usal+2.898
Neg logit contributions
Barn-3.547
nder-3.519
stead-3.494
ð-3.326
say-3.222
Token mental
Token ID5110
Feature activation-1.16e+00

Pos logit contributions
Barn+1.544
nder+1.517
stead+1.503
ð+1.437
phabet+1.381
Neg logit contributions
nard-1.499
-1.430
tan-1.425
usal-1.358
INESS-1.357
Token-
Token ID12
Feature activation-4.81e-01

Pos logit contributions
nder+7.563
Barn+7.316
stead+7.156
ð+6.626
say+6.278
Neg logit contributions
nard-14.508
-13.897
blance-13.593
ipal-13.471
INESS-13.395
Tokenhealth
Token ID13948
Feature activation-3.82e-01

Pos logit contributions
Barn+4.722
stead+4.653
nder+4.546
ð+4.327
say+4.316
Neg logit contributions
nard-5.280
-5.044
INESS-4.856
tan-4.840
usal-4.784
Token program
Token ID1430
Feature activation-6.06e-03

Pos logit contributions
Barn+3.573
nder+3.542
stead+3.496
ð+3.277
say+3.156
Neg logit contributions
nard-4.403
-4.207
tan-4.149
usal-4.023
INESS-4.009
Token,
Token ID11
Feature activation-3.47e-01

Pos logit contributions
Barn+0.060
nder+0.059
stead+0.058
ð+0.056
phabet+0.053
Neg logit contributions
nard-0.068
-0.065
tan-0.065
usal-0.062
INESS-0.062
Token now
Token ID783
Feature activation-2.75e-01

Pos logit contributions
Barn+3.272
stead+3.211
nder+3.184
ð+3.076
say+2.931
Neg logit contributions
nard-3.925
tan-3.816
-3.754
usal-3.622
INESS-3.606
Token of
Token ID286
Feature activation+4.25e-02

Pos logit contributions
stead+4.966
Barn+4.940
nder+4.937
ð+4.703
 afore+4.430
Neg logit contributions
nard-7.271
-7.139
tan-6.951
usal-6.778
INESS-6.749
Token the
Token ID262
Feature activation-6.94e-02

Pos logit contributions
nard+0.431
tan+0.411
+0.410
INESS+0.389
usal+0.388
Neg logit contributions
Barn-0.466
nder-0.457
stead-0.452
ð-0.433
phabet-0.417
Token United
Token ID1578
Feature activation+1.05e-01

Pos logit contributions
Barn+0.586
nder+0.571
stead+0.568
ð+0.534
phabet+0.507
Neg logit contributions
nard-0.876
-0.846
tan-0.840
usal-0.808
INESS-0.806
Token States
Token ID1829
Feature activation+3.81e-02

Pos logit contributions
nard+0.761
tan+0.713
+0.705
usal+0.653
serving+0.640
Neg logit contributions
Barn-1.458
nder-1.425
stead-1.410
ð-1.379
phabet-1.374
Token,
Token ID11
Feature activation-5.12e-01

Pos logit contributions
nard+0.415
+0.397
tan+0.396
usal+0.378
INESS+0.378
Neg logit contributions
Barn-0.389
nder-0.381
stead-0.375
ð-0.359
phabet-0.346
Token whose
Token ID3025
Feature activation-1.14e+00

Pos logit contributions
Barn+4.832
stead+4.731
nder+4.603
ð+4.485
say+4.265
Neg logit contributions
nard-5.733
-5.515
tan-5.430
usal-5.250
INESS-5.188
Token App
Token ID2034
Feature activation-1.65e-01

Pos logit contributions
nder+10.083
stead+10.066
Barn+9.925
ð+9.446
say+9.395
Neg logit contributions
nard-11.559
-10.961
tan-10.347
usal-10.207
Champ-10.101
Tokenoint
Token ID1563
Feature activation-2.98e-01

Pos logit contributions
Barn+1.473
nder+1.462
stead+1.444
ð+1.361
phabet+1.297
Neg logit contributions
nard-1.991
-1.932
tan-1.912
INESS-1.826
usal-1.826
Tokenments
Token ID902
Feature activation-1.86e-01

Pos logit contributions
Barn+2.725
nder+2.696
stead+2.652
ð+2.518
say+2.412
Neg logit contributions
nard-3.525
-3.393
tan-3.354
INESS-3.240
usal-3.225
Token are
Token ID389
Feature activation-6.61e-01

Pos logit contributions
Barn+1.957
nder+1.946
stead+1.925
ð+1.856
phabet+1.763
Neg logit contributions
nard-1.922
-1.850
tan-1.829
usal-1.747
INESS-1.731
Token not
Token ID407
Feature activation-6.04e-01

Pos logit contributions
stead+6.066
Barn+5.928
nder+5.722
ð+5.560
say+5.344
Neg logit contributions
nard-7.186
-7.064
tan-6.877
INESS-6.764
Champ-6.623
Token Medline
Token ID34582
Feature activation+1.84e-01

Pos logit contributions
Barn+2.063
nder+1.983
stead+1.937
ð+1.808
say+1.679
Neg logit contributions
nard-4.634
-4.478
tan-4.464
usal-4.349
INESS-4.300
Token |
Token ID930
Feature activation-2.55e-01

Pos logit contributions
nard+2.065
+1.989
tan+1.974
usal+1.890
INESS+1.886
Neg logit contributions
Barn-1.812
nder-1.778
stead-1.764
ð-1.672
phabet-1.605
Token ISI
Token ID22048
Feature activation+3.04e-01

Pos logit contributions
Barn+1.755
nder+1.690
stead+1.638
ð+1.535
say+1.425
Neg logit contributions
nard-3.647
tan-3.508
-3.489
usal-3.430
minster-3.385
Token
Token ID198
Feature activation-1.13e-01

Pos logit contributions
nard+3.404
+3.274
tan+3.247
usal+3.105
INESS+3.087
Neg logit contributions
Barn-2.964
nder-2.907
stead-2.892
ð-2.729
ovember-2.618
Token
Token ID198
Feature activation-5.57e-03

Pos logit contributions
Barn+1.083
nder+1.061
stead+1.052
ð+0.999
phabet+0.972
Neg logit contributions
nard-1.310
-1.253
tan-1.253
usal-1.193
INESS-1.190
TokenMy
Token ID3666
Feature activation-1.12e+00

Pos logit contributions
Barn+0.058
nder+0.057
stead+0.057
ð+0.054
phabet+0.053
Neg logit contributions
nard-0.059
-0.056
tan-0.056
usal-0.053
INESS-0.053
Tokeners
Token ID364
Feature activation-3.68e-01

Pos logit contributions
nder+11.699
stead+11.393
Barn+11.045
ð+10.543
neys+10.102
Neg logit contributions
nard-9.370
-9.330
INESS-9.249
minster-8.851
tan-8.610
Token,
Token ID11
Feature activation-5.06e-01

Pos logit contributions
stead+3.198
Barn+3.171
nder+3.144
ð+2.920
phabet+2.832
Neg logit contributions
nard-4.342
-4.311
tan-4.178
INESS-4.118
usal-4.021
Token D
Token ID360
Feature activation-5.90e-01

Pos logit contributions
Barn+4.344
stead+4.259
nder+4.252
ð+4.040
say+3.876
Neg logit contributions
nard-6.012
-5.949
tan-5.782
usal-5.778
ongyang-5.733
Token.
Token ID13
Feature activation-3.82e-01

Pos logit contributions
nder+4.769
Barn+4.613
stead+4.510
ð+4.380
phabet+4.069
Neg logit contributions
-7.079
nard-7.078
tan-6.817
serving-6.606
INESS-6.576
Token (
Token ID357
Feature activation+2.76e-01

Pos logit contributions
Barn+3.340
nder+3.268
stead+3.233
ð+3.090
say+2.960
Neg logit contributions
nard-4.541
-4.478
tan-4.379
usal-4.313
INESS-4.242
Token fix
Token ID4259
Feature activation-4.06e-01

Pos logit contributions
Barn+1.732
nder+1.709
stead+1.690
ð+1.628
say+1.559
Neg logit contributions
nard-1.604
-1.535
tan-1.519
INESS-1.459
usal-1.448
Token a
Token ID257
Feature activation-3.05e-01

Pos logit contributions
Barn+3.786
nder+3.760
stead+3.691
ð+3.577
say+3.358
Neg logit contributions
nard-4.577
-4.400
tan-4.294
INESS-4.249
usal-4.221
Token hem
Token ID16869
Feature activation-4.39e-01

Pos logit contributions
Barn+3.108
nder+3.091
stead+3.039
ð+2.923
say+2.768
Neg logit contributions
nard-3.251
-3.114
tan-3.082
INESS-3.002
usal-2.978
Token:
Token ID25
Feature activation-2.71e-01

Pos logit contributions
Barn+4.426
stead+4.420
nder+4.396
ð+4.204
phabet+3.984
Neg logit contributions
nard-4.633
-4.444
tan-4.399
INESS-4.271
usal-4.212
Token Another
Token ID6023
Feature activation-6.94e-01

Pos logit contributions
Barn+2.686
nder+2.624
stead+2.606
ð+2.496
say+2.398
Neg logit contributions
nard-2.996
-2.875
tan-2.845
INESS-2.763
usal-2.744
Token common
Token ID2219
Feature activation-1.08e+00

Pos logit contributions
stead+6.516
Barn+6.483
nder+6.421
ð+6.234
ovember+5.767
Neg logit contributions
nard-7.396
-6.957
INESS-6.955
tan-6.948
ongyang-6.729
Token fix
Token ID4259
Feature activation-2.97e-01

Pos logit contributions
nder+9.364
Barn+9.209
stead+9.105
ð+8.756
say+8.330
Neg logit contributions
nard-11.160
INESS-10.424
-10.291
ongyang-10.255
tan-10.111
Token that
Token ID326
Feature activation-7.13e-01

Pos logit contributions
Barn+2.927
nder+2.893
stead+2.876
ð+2.741
say+2.619
Neg logit contributions
nard-3.234
-3.095
tan-3.069
INESS-3.000
usal-2.937
Token everyone
Token ID2506
Feature activation-1.67e-01

Pos logit contributions
Barn+6.268
nder+6.139
stead+5.983
ð+5.742
ovember+5.472
Neg logit contributions
nard-8.009
-7.712
tan-7.617
INESS-7.576
ongyang-7.318
Token may
Token ID743
Feature activation+1.48e-01

Pos logit contributions
Barn+1.750
nder+1.722
stead+1.705
ð+1.629
say+1.561
Neg logit contributions
nard-1.757
-1.686
tan-1.675
INESS-1.598
usal-1.596
Token run
Token ID1057
Feature activation+3.20e-01

Pos logit contributions
nard+1.383
tan+1.285
+1.216
usal+1.201
minster+1.173
Neg logit contributions
Barn-1.788
stead-1.727
nder-1.697
ð-1.637
phabet-1.628
Token to
Token ID284
Feature activation+2.77e-01

Pos logit contributions
Barn+0.954
nder+0.935
stead+0.927
ð+0.884
phabet+0.847
Neg logit contributions
nard-1.050
-1.006
tan-1.003
usal-0.957
INESS-0.957
Token the
Token ID262
Feature activation-9.51e-02

Pos logit contributions
nard+2.648
+2.528
tan+2.517
usal+2.388
minster+2.368
Neg logit contributions
Barn-3.081
nder-3.040
stead-3.039
ð-2.864
ovember-2.795
Token pitch
Token ID7078
Feature activation-1.12e-01

Pos logit contributions
Barn+1.045
nder+1.026
stead+1.017
ð+0.974
phabet+0.938
Neg logit contributions
nard-0.961
-0.917
tan-0.913
usal-0.868
INESS-0.867
Token and
Token ID290
Feature activation-2.59e-01

Pos logit contributions
Barn+1.164
nder+1.143
stead+1.135
ð+1.084
phabet+1.037
Neg logit contributions
nard-1.191
-1.140
tan-1.136
usal-1.083
INESS-1.083
Token being
Token ID852
Feature activation-2.86e-01

Pos logit contributions
Barn+2.865
nder+2.814
stead+2.803
ð+2.694
say+2.579
Neg logit contributions
nard-2.578
tan-2.454
-2.448
INESS-2.351
usal-2.336
Token mentally
Token ID14946
Feature activation-1.08e+00

Pos logit contributions
Barn+3.017
nder+2.985
stead+2.954
ð+2.838
say+2.695
Neg logit contributions
nard-3.005
-2.856
tan-2.854
INESS-2.763
usal-2.721
Token free
Token ID1479
Feature activation+1.17e-01

Pos logit contributions
stead+10.519
Barn+10.265
nder+10.245
ð+10.165
igans+9.519
Neg logit contributions
nard-10.573
INESS-10.134
tan-10.009
ongyang-9.987
-9.696
Token is
Token ID318
Feature activation+1.61e-01

Pos logit contributions
nard+1.180
+1.131
tan+1.115
usal+1.064
INESS+1.055
Neg logit contributions
Barn-1.286
nder-1.254
stead-1.234
ð-1.177
phabet-1.160
Token a
Token ID257
Feature activation-2.01e-01

Pos logit contributions
nard+1.609
+1.538
tan+1.529
usal+1.455
minster+1.438
Neg logit contributions
Barn-1.769
nder-1.734
stead-1.709
phabet-1.609
ð-1.606
Token massive
Token ID4858
Feature activation+8.31e-02

Pos logit contributions
Barn+2.203
nder+2.175
stead+2.161
ð+2.078
phabet+1.972
Neg logit contributions
nard-2.030
-1.932
tan-1.927
INESS-1.836
usal-1.829
Token thing
Token ID1517
Feature activation-2.22e-01

Pos logit contributions
nard+0.821
+0.784
tan+0.783
usal+0.742
INESS+0.739
Neg logit contributions
Barn-0.929
nder-0.910
stead-0.903
ð-0.857
phabet-0.839
Token of
Token ID286
Feature activation+7.84e-03

Pos logit contributions
Barn+1.568
stead+1.528
nder+1.526
ð+1.447
phabet+1.387
Neg logit contributions
nard-1.720
tan-1.643
-1.640
INESS-1.560
usal-1.551
Token criminal
Token ID4301
Feature activation-5.02e-02

Pos logit contributions
nard+0.079
+0.075
tan+0.075
usal+0.071
INESS+0.071
Neg logit contributions
Barn-0.086
nder-0.085
stead-0.084
ð-0.081
phabet-0.078
Token convictions
Token ID19131
Feature activation+3.31e-02

Pos logit contributions
Barn+0.552
nder+0.540
stead+0.537
ð+0.514
phabet+0.496
Neg logit contributions
nard-0.507
-0.485
tan-0.483
usal-0.459
INESS-0.458
Token.
Token ID764
Feature activation-1.08e-01

Pos logit contributions
nard+0.352
+0.337
tan+0.335
usal+0.320
INESS+0.319
Neg logit contributions
Barn-0.346
nder-0.338
stead-0.335
ð-0.321
phabet-0.308
Token Crim
Token ID9822
Feature activation-7.65e-02

Pos logit contributions
Barn+1.022
nder+0.998
stead+0.988
ð+0.937
say+0.895
Neg logit contributions
nard-1.256
-1.209
tan-1.205
usal-1.155
INESS-1.150
Tokenin
Token ID259
Feature activation-1.08e+00

Pos logit contributions
Barn+0.663
nder+0.649
stead+0.643
ð+0.608
phabet+0.578
Neg logit contributions
nard-0.948
-0.913
tan-0.910
INESS-0.873
usal-0.872
Tokenology
Token ID1435
Feature activation-6.73e-02

Pos logit contributions
Barn+7.025
nder+6.854
stead+6.299
neys+6.067
ð+5.800
Neg logit contributions
tan-13.642
nard-13.505
-13.270
blance-12.860
INESS-12.761
Token,
Token ID11
Feature activation-2.19e-01

Pos logit contributions
Barn+0.650
nder+0.638
stead+0.632
ð+0.601
say+0.573
Neg logit contributions
nard-0.768
-0.740
tan-0.737
usal-0.703
INESS-0.700
Token 35
Token ID3439
Feature activation-3.49e-01

Pos logit contributions
Barn+1.942
nder+1.892
stead+1.875
ð+1.774
say+1.687
Neg logit contributions
nard-2.676
-2.577
tan-2.568
usal-2.476
INESS-2.464
Token,
Token ID11
Feature activation-1.87e-02

Pos logit contributions
Barn+3.025
stead+2.960
nder+2.956
ð+2.779
phabet+2.597
Neg logit contributions
nard-4.218
-4.073
tan-4.053
usal-3.911
INESS-3.873
Token 177
Token ID26607
Feature activation+4.39e-01

Pos logit contributions
Barn+0.177
nder+0.174
stead+0.172
ð+0.163
phabet+0.158
Neg logit contributions
nard-0.218
-0.209
tan-0.208
INESS-0.199
usal-0.198
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
nard+0.452
+0.434
tan+0.432
usal+0.414
INESS+0.413
Neg logit contributions
Barn-0.388
nder-0.380
stead-0.377
ð-0.359
phabet-0.344
Tokenname
Token ID3672
Feature activation-2.27e-02

Pos logit contributions
Barn+1.134
nder+1.103
stead+1.100
ð+1.029
say+0.997
Neg logit contributions
nard-1.847
-1.791
tan-1.772
INESS-1.720
usal-1.713
Token">
Token ID23984
Feature activation-4.09e-01

Pos logit contributions
Barn+0.209
nder+0.204
stead+0.202
ð+0.192
say+0.183
Neg logit contributions
nard-0.270
-0.260
tan-0.259
usal-0.248
INESS-0.247
Tokenspan
Token ID12626
Feature activation-4.90e-01

Pos logit contributions
stead+2.833
Barn+2.795
nder+2.794
ð+2.539
say+2.476
Neg logit contributions
nard-5.664
-5.575
tan-5.401
INESS-5.366
usal-5.346
Token>
Token ID29
Feature activation-1.95e-01

Pos logit contributions
nder+2.578
Barn+2.554
stead+2.379
ð+2.161
say+2.118
Neg logit contributions
nard-7.327
-7.280
tan-7.179
INESS-7.058
usal-7.005
Token <
Token ID1279
Feature activation-1.07e+00

Pos logit contributions
Barn+1.342
nder+1.307
stead+1.287
ð+1.190
say+1.151
Neg logit contributions
nard-2.748
-2.676
tan-2.667
usal-2.580
INESS-2.573
Tokenp
Token ID79
Feature activation-3.29e-01

Pos logit contributions
stead+8.776
nder+8.426
Barn+8.347
say+8.082
ð+7.515
Neg logit contributions
nard-11.573
INESS-11.164
-11.076
ongyang-10.874
minster-10.761
Token ng
Token ID23370
Feature activation-3.67e-01

Pos logit contributions
Barn+1.865
nder+1.865
stead+1.806
ð+1.625
say+1.549
Neg logit contributions
nard-4.931
-4.783
tan-4.764
INESS-4.621
usal-4.599
Token-
Token ID12
Feature activation-3.14e-01

Pos logit contributions
Barn+3.281
nder+3.226
stead+3.187
ð+3.022
say+2.918
Neg logit contributions
nard-4.379
-4.214
tan-4.165
INESS-4.092
usal-4.047
Tokenbind
Token ID21653
Feature activation-2.36e-01

Pos logit contributions
Barn+2.548
stead+2.516
nder+2.512
ð+2.348
say+2.340
Neg logit contributions
nard-3.935
-3.854
INESS-3.744
tan-3.727
minster-3.659
Token="
Token ID2625
Feature activation+1.81e-01

Pos logit contributions
Barn+1.240
nder+1.193
stead+1.166
ð+1.065
say+1.022
Neg logit contributions
nard-3.695
tan-3.544
-3.538
INESS-3.465
usal-3.458
Token,
Token ID11
Feature activation+5.85e-02

Pos logit contributions
stead+4.766
Barn+4.718
nder+4.624
ð+4.372
neys+4.147
Neg logit contributions
nard-6.652
-6.624
tan-6.430
INESS-6.314
ipal-6.299
Token weapons
Token ID3777
Feature activation-5.29e-01

Pos logit contributions
nard+0.653
tan+0.626
+0.626
usal+0.595
INESS+0.591
Neg logit contributions
Barn-0.582
nder-0.567
stead-0.562
ð-0.538
phabet-0.518
Token,
Token ID11
Feature activation-1.03e-01

Pos logit contributions
stead+4.390
Barn+4.316
nder+4.296
ð+3.991
neys+3.797
Neg logit contributions
nard-6.262
-6.215
tan-6.071
INESS-5.893
ipal-5.857
Token furniture
Token ID13091
Feature activation-8.72e-01

Pos logit contributions
Barn+1.070
nder+1.054
stead+1.045
ð+0.993
phabet+0.954
Neg logit contributions
nard-1.106
-1.060
tan-1.052
INESS-1.008
usal-1.006
Token and
Token ID290
Feature activation-1.86e-01

Pos logit contributions
Barn+6.377
stead+6.353
nder+6.122
ð+5.684
neys+5.469
Neg logit contributions
nard-10.320
-10.275
tan-9.938
ipal-9.716
INESS-9.715
Token more
Token ID517
Feature activation-1.06e+00

Pos logit contributions
Barn+1.968
nder+1.949
stead+1.926
ð+1.836
phabet+1.765
Neg logit contributions
nard-1.924
-1.846
tan-1.826
INESS-1.754
usal-1.748
Token\n
Token ID198
Feature activation-3.55e-02

Pos logit contributions
stead+7.447
Barn+7.232
nder+7.193
ð+6.672
say+6.234
Neg logit contributions
-12.296
nard-12.138
tan-11.747
ipal-11.459
INESS-11.435
Token\n
Token ID198
Feature activation-4.68e-01

Pos logit contributions
Barn+0.350
nder+0.344
stead+0.339
ð+0.324
phabet+0.322
Neg logit contributions
nard-0.399
tan-0.381
-0.379
usal-0.360
INESS-0.359
TokenCapture
Token ID49630
Feature activation-3.29e-01

Pos logit contributions
Barn+4.671
stead+4.394
nder+4.368
ð+4.113
say+4.076
Neg logit contributions
nard-5.263
-4.940
tan-4.875
usal-4.843
minster-4.782
Token unique
Token ID3748
Feature activation-3.84e-01

Pos logit contributions
Barn+3.450
nder+3.396
stead+3.357
ð+3.186
neys+3.050
Neg logit contributions
nard-3.388
-3.307
tan-3.224
INESS-3.148
usal-3.140
Token monsters
Token ID10963
Feature activation-4.95e-01

Pos logit contributions
Barn+4.051
nder+4.046
stead+3.979
ð+3.774
say+3.637
Neg logit contributions
nard-3.895
-3.758
tan-3.726
INESS-3.598
usal-3.594
Token://
Token ID1378
Feature activation+4.12e-01

Pos logit contributions
Barn+0.034
nder+0.033
stead+0.033
ð+0.032
phabet+0.031
Neg logit contributions
nard-0.034
-0.032
tan-0.032
INESS-0.030
serving-0.030
Tokenwww
Token ID2503
Feature activation+1.54e-01

Pos logit contributions
nard+3.208
tan+3.037
+2.893
serving+2.828
minster+2.819
Neg logit contributions
Barn-5.347
stead-5.139
nder-5.135
 myster-5.127
ð-5.093
Token.
Token ID13
Feature activation-7.62e-02

Pos logit contributions
nard+1.492
tan+1.421
+1.401
serving+1.319
INESS+1.310
Neg logit contributions
Barn-1.730
stead-1.680
nder-1.678
 myster-1.677
phabet-1.672
Tokenbel
Token ID6667
Feature activation-3.21e-01

Pos logit contributions
Barn+0.705
nder+0.691
stead+0.685
ð+0.648
phabet+0.618
Neg logit contributions
nard-0.901
-0.869
tan-0.863
INESS-0.829
usal-0.828
Tokenfast
Token ID7217
Feature activation-7.85e-01

Pos logit contributions
Barn+2.383
nder+2.372
stead+2.351
ð+2.193
say+2.123
Neg logit contributions
nard-4.315
-4.158
tan-4.108
INESS-4.040
usal-3.986
Tokente
Token ID660
Feature activation-1.06e+00

Pos logit contributions
nder+4.914
stead+4.732
Barn+4.727
ð+4.384
say+4.284
Neg logit contributions
nard-11.248
usal-10.654
INESS-10.530
ongyang-10.489
-10.421
Tokenlegraph
Token ID16606
Feature activation-5.68e-01

Pos logit contributions
stead+4.311
Barn+4.277
nder+4.265
say+4.157
ð+3.811
Neg logit contributions
nard-16.282
-15.822
 unbeliev-15.497
tan-15.424
INESS-15.403
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
nder+4.597
Barn+4.589
stead+4.513
ð+4.194
say+4.055
Neg logit contributions
nard-7.014
 metic-6.946
-6.879
 unbeliev-6.856
tan-6.678
Tokenco
Token ID1073
Feature activation-2.71e-01

Pos logit contributions
Barn+2.285
nder+2.284
stead+2.248
ð+2.107
say+2.025
Neg logit contributions
nard-3.507
-3.427
tan-3.357
INESS-3.305
usal-3.231
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
Barn+2.549
nder+2.541
stead+2.492
ð+2.366
say+2.263
Neg logit contributions
nard-3.134
-3.039
tan-2.992
INESS-2.896
usal-2.878
Tokenuk
Token ID2724
Feature activation-2.02e-02

Pos logit contributions
Barn+1.207
nder+1.180
stead+1.168
ð+1.105
phabet+1.052
Neg logit contributions
nard-1.691
-1.629
tan-1.621
usal-1.557
INESS-1.556
Token is
Token ID318
Feature activation-3.06e-02

Pos logit contributions
nard+2.523
+2.379
tan+2.378
usal+2.266
INESS+2.265
Neg logit contributions
Barn-2.589
nder-2.546
stead-2.522
ð-2.424
phabet-2.355
Token the
Token ID262
Feature activation+1.20e-01

Pos logit contributions
Barn+0.322
nder+0.315
stead+0.313
ð+0.299
phabet+0.288
Neg logit contributions
nard-0.323
-0.309
tan-0.308
usal-0.294
INESS-0.293
Token prime
Token ID6994
Feature activation+2.12e-02

Pos logit contributions
nard+1.215
tan+1.169
+1.154
usal+1.099
INESS+1.089
Neg logit contributions
Barn-1.314
nder-1.289
stead-1.274
ð-1.219
phabet-1.198
Token challenge
Token ID4427
Feature activation-1.74e-02

Pos logit contributions
nard+0.220
+0.211
tan+0.210
usal+0.200
INESS+0.199
Neg logit contributions
Barn-0.226
nder-0.222
stead-0.220
ð-0.211
phabet-0.203
Token Belfast
Token ID29713
Feature activation-5.24e-01

Pos logit contributions
Barn+0.179
nder+0.175
stead+0.174
ð+0.166
phabet+0.160
Neg logit contributions
nard-0.188
-0.180
tan-0.180
usal-0.171
INESS-0.171
TokenTe
Token ID6767
Feature activation-1.03e+00

Pos logit contributions
Barn+4.881
nder+4.876
stead+4.720
ð+4.419
phabet+4.238
Neg logit contributions
nard-6.082
-5.789
tan-5.788
usal-5.551
minster-5.483
Tokenlegraph
Token ID16606
Feature activation-3.14e-01

Pos logit contributions
Barn+4.627
nder+4.176
stead+3.909
ð+3.704
say+3.467
Neg logit contributions
nard-15.957
blance-15.918
tan-15.470
serving-15.083
INESS-14.983
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
Barn+2.829
nder+2.760
stead+2.698
ð+2.550
phabet+2.445
Neg logit contributions
nard-3.811
tan-3.642
-3.608
usal-3.544
serving-3.438
Tokenco
Token ID1073
Feature activation-2.99e-02

Pos logit contributions
Barn+1.225
nder+1.212
stead+1.167
ð+1.110
say+1.068
Neg logit contributions
nard-1.901
-1.814
tan-1.803
INESS-1.755
usal-1.744
Token.
Token ID13
Feature activation+7.24e-02

Pos logit contributions
Barn+0.312
nder+0.302
stead+0.301
ð+0.287
phabet+0.281
Neg logit contributions
nard-0.321
tan-0.306
-0.305
usal-0.290
INESS-0.288
Tokenuk
Token ID2724
Feature activation+3.48e-02

Pos logit contributions
nard+0.835
tan+0.799
+0.791
usal+0.757
serving+0.753
Neg logit contributions
Barn-0.690
nder-0.666
stead-0.664
ð-0.632
 myster-0.628
Token.
Token ID13
Feature activation+2.44e-01

Pos logit contributions
Barn+0.384
nder+0.376
stead+0.374
ð+0.356
phabet+0.341
Neg logit contributions
nard-0.426
-0.409
tan-0.407
usal-0.389
INESS-0.388
Token As
Token ID1081
Feature activation+7.83e-03

Pos logit contributions
nard+2.586
tan+2.496
+2.459
usal+2.327
INESS+2.313
Neg logit contributions
Barn-2.518
nder-2.477
stead-2.455
ð-2.332
phabet-2.286
Token it
Token ID340
Feature activation-7.38e-02

Pos logit contributions
nard+0.081
tan+0.077
+0.077
usal+0.073
INESS+0.073
Neg logit contributions
Barn-0.085
nder-0.083
stead-0.082
ð-0.079
phabet-0.076
Token won
Token ID1839
Feature activation+2.97e-01

Pos logit contributions
Barn+0.777
nder+0.761
stead+0.756
ð+0.721
phabet+0.694
Neg logit contributions
nard-0.781
-0.746
tan-0.744
usal-0.708
INESS-0.707
Token't
Token ID470
Feature activation-1.45e-01

Pos logit contributions
nard+2.790
tan+2.693
+2.620
usal+2.443
serving+2.434
Neg logit contributions
Barn-3.362
 myster-3.351
nder-3.336
phabet-3.306
stead-3.264
Token find
Token ID1064
Feature activation-1.03e+00

Pos logit contributions
Barn+1.590
nder+1.562
stead+1.550
ð+1.485
phabet+1.427
Neg logit contributions
nard-1.462
-1.402
tan-1.391
INESS-1.324
usal-1.322
Token it
Token ID340
Feature activation-4.69e-01

Pos logit contributions
Barn+7.714
nder+7.443
stead+7.416
ð+7.191
neys+7.023
Neg logit contributions
-11.309
minster-10.952
nard-10.951
ongyang-10.757
Champ-10.716
Token there
Token ID612
Feature activation-2.12e-01

Pos logit contributions
Barn+4.102
nder+4.097
stead+4.028
ð+3.843
phabet+3.642
Neg logit contributions
-5.337
nard-5.313
tan-5.173
INESS-5.010
serving-4.998
Token,
Token ID11
Feature activation-5.40e-02

Pos logit contributions
Barn+1.999
nder+1.971
stead+1.949
ð+1.864
phabet+1.777
Neg logit contributions
nard-2.407
-2.358
tan-2.315
INESS-2.228
usal-2.221
Token it
Token ID340
Feature activation+1.14e-01

Pos logit contributions
Barn+0.588
nder+0.578
stead+0.573
ð+0.549
phabet+0.528
Neg logit contributions
nard-0.550
-0.525
tan-0.522
usal-0.497
INESS-0.496
Token will
Token ID481
Feature activation+1.94e-02

Pos logit contributions
nard+1.110
tan+1.053
+1.040
usal+0.998
minster+0.967
Neg logit contributions
Barn-1.304
nder-1.269
stead-1.269
ð-1.209
phabet-1.190
TokenSome
Token ID4366
Feature activation+3.82e-01

Pos logit contributions
nard+0.509
+0.494
tan+0.488
INESS+0.465
usal+0.462
Neg logit contributions
Barn-0.473
nder-0.471
stead-0.465
ð-0.444
phabet-0.434
Token kind
Token ID1611
Feature activation-9.22e-02

Pos logit contributions
nard+3.786
tan+3.617
+3.543
usal+3.423
serving+3.421
Neg logit contributions
Barn-4.097
nder-4.012
stead-3.974
ð-3.784
phabet-3.738
Token of
Token ID286
Feature activation-1.65e-01

Pos logit contributions
Barn+0.918
nder+0.906
stead+0.899
ð+0.860
phabet+0.830
Neg logit contributions
nard-1.017
-0.976
tan-0.970
INESS-0.921
usal-0.917
Token bailout
Token ID29928
Feature activation-3.58e-01

Pos logit contributions
Barn+1.775
nder+1.743
stead+1.731
ð+1.656
say+1.592
Neg logit contributions
nard-1.696
-1.626
tan-1.615
INESS-1.538
usal-1.536
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
Barn+3.516
nder+3.458
stead+3.454
ð+3.297
say+3.155
Neg logit contributions
nard-3.892
-3.789
tan-3.761
INESS-3.591
usal-3.563
Token whether
Token ID1771
Feature activation-1.03e+00

Pos logit contributions
Barn+3.615
nder+3.541
stead+3.517
ð+3.339
say+3.269
Neg logit contributions
nard-4.026
-3.870
tan-3.857
INESS-3.698
usal-3.673
Token granted
Token ID7520
Feature activation-3.73e-01

Pos logit contributions
Barn+8.759
nder+8.526
stead+8.512
ð+8.212
say+7.992
Neg logit contributions
nard-10.558
-10.288
INESS-10.149
ongyang-10.146
tan-10.133
Token with
Token ID351
Feature activation-4.08e-01

Pos logit contributions
Barn+3.432
nder+3.390
stead+3.361
ð+3.243
phabet+3.019
Neg logit contributions
nard-4.303
-4.175
tan-4.163
usal-3.947
INESS-3.942
Token soft
Token ID2705
Feature activation-5.10e-01

Pos logit contributions
Barn+4.096
stead+4.050
nder+4.037
ð+3.847
say+3.684
Neg logit contributions
nard-4.371
-4.161
tan-4.111
INESS-3.998
usal-3.950
Token or
Token ID393
Feature activation-1.76e-01

Pos logit contributions
nder+5.311
Barn+5.230
stead+5.128
ð+4.865
say+4.744
Neg logit contributions
nard-5.148
-5.030
tan-4.833
usal-4.720
INESS-4.634
Token tough
Token ID5802
Feature activation+8.50e-02

Pos logit contributions
Barn+1.998
nder+1.975
stead+1.958
ð+1.869
phabet+1.793
Neg logit contributions
nard-1.698
-1.622
tan-1.600
INESS-1.537
usal-1.527
Token feedback
Token ID7538
Feature activation-1.12e-01

Pos logit contributions
Barn+1.135
nder+1.115
stead+1.108
ð+1.058
say+1.014
Neg logit contributions
nard-1.101
-1.051
tan-1.042
INESS-0.998
usal-0.996
Token gain
Token ID4461
Feature activation-2.29e-01

Pos logit contributions
Barn+1.165
nder+1.144
stead+1.138
ð+1.086
phabet+1.038
Neg logit contributions
nard-1.193
-1.138
tan-1.131
INESS-1.082
usal-1.080
Token factor
Token ID5766
Feature activation-3.19e-01

Pos logit contributions
Barn+2.280
stead+2.249
nder+2.241
ð+2.134
say+2.024
Neg logit contributions
nard-2.512
-2.416
tan-2.382
usal-2.291
INESS-2.288
Token G
Token ID402
Feature activation-2.25e-01

Pos logit contributions
Barn+2.936
stead+2.901
nder+2.882
ð+2.772
say+2.610
Neg logit contributions
nard-3.684
-3.566
tan-3.489
INESS-3.400
usal-3.365
Token (
Token ID357
Feature activation-4.94e-01

Pos logit contributions
Barn+2.033
nder+2.011
stead+1.984
ð+1.900
say+1.793
Neg logit contributions
nard-2.682
-2.582
tan-2.545
INESS-2.477
usal-2.467
TokenE
Token ID36
Feature activation-1.02e+00

Pos logit contributions
Barn+4.843
stead+4.714
nder+4.599
ð+4.424
say+4.382
Neg logit contributions
nard-5.341
-5.079
INESS-5.000
tan-4.992
usal-4.917
Tokenq
Token ID80
Feature activation-1.59e-01

Pos logit contributions
nder+8.063
stead+7.560
Barn+7.543
ð+7.075
say+6.648
Neg logit contributions
nard-11.723
-11.453
INESS-11.322
ongyang-10.977
usal-10.809
Token.
Token ID13
Feature activation-3.87e-01

Pos logit contributions
nder+1.481
Barn+1.477
stead+1.446
ð+1.390
phabet+1.320
Neg logit contributions
nard-1.845
-1.798
tan-1.757
INESS-1.715
usal-1.686
Token 4
Token ID604
Feature activation-2.23e-01

Pos logit contributions
Barn+3.191
nder+3.126
stead+3.083
ð+2.981
phabet+2.783
Neg logit contributions
nard-4.808
-4.662
tan-4.594
INESS-4.503
usal-4.482
Token),
Token ID828
Feature activation-6.97e-01

Pos logit contributions
Barn+1.960
nder+1.922
stead+1.886
ð+1.811
phabet+1.717
Neg logit contributions
nard-2.700
-2.623
tan-2.579
usal-2.492
INESS-2.490
Token equivalent
Token ID7548
Feature activation-3.70e-01

Pos logit contributions
Barn+6.511
stead+6.476
nder+6.222
ð+6.169
say+6.030
Neg logit contributions
nard-7.372
-7.189
INESS-7.006
ongyang-6.820
tan-6.783
Token week
Token ID1285
Feature activation+2.00e-01

Pos logit contributions
nard+1.033
+0.990
tan+0.989
usal+0.932
INESS+0.923
Neg logit contributions
Barn-1.198
nder-1.166
stead-1.160
ð-1.113
phabet-1.085
Token that
Token ID326
Feature activation-1.63e-01

Pos logit contributions
nard+2.053
+1.977
tan+1.946
usal+1.832
serving+1.830
Neg logit contributions
Barn-2.145
stead-2.100
nder-2.100
ð-1.991
ovember-1.942
Token Britain
Token ID5491
Feature activation-3.83e-02

Pos logit contributions
Barn+1.628
nder+1.596
stead+1.582
ð+1.508
phabet+1.441
Neg logit contributions
nard-1.809
-1.737
tan-1.730
INESS-1.654
usal-1.654
Token had
Token ID550
Feature activation-7.16e-02

Pos logit contributions
Barn+0.403
nder+0.396
stead+0.392
ð+0.375
phabet+0.360
Neg logit contributions
nard-0.405
-0.388
tan-0.386
usal-0.368
INESS-0.368
Token "
Token ID366
Feature activation-5.41e-01

Pos logit contributions
Barn+0.754
nder+0.740
stead+0.734
ð+0.701
phabet+0.673
Neg logit contributions
nard-0.756
-0.724
tan-0.720
usal-0.687
INESS-0.686
Tokenblack
Token ID13424
Feature activation-1.01e+00

Pos logit contributions
stead+5.646
Barn+5.558
nder+5.379
ð+5.122
say+5.106
Neg logit contributions
nard-5.548
-5.282
INESS-5.227
tan-5.099
usal-5.022
Tokenmail
Token ID4529
Feature activation+1.86e-01

Pos logit contributions
nder+7.975
Barn+7.951
stead+7.605
ð+7.261
neys+7.043
Neg logit contributions
nard-10.949
usal-10.801
INESS-10.609
tan-10.592
-10.583
Tokened
Token ID276
Feature activation-1.22e-01

Pos logit contributions
nard+1.440
+1.389
tan+1.358
usal+1.254
INESS+1.243
Neg logit contributions
Barn-2.461
nder-2.417
stead-2.396
ð-2.309
phabet-2.275
Token and
Token ID290
Feature activation-1.35e-01

Pos logit contributions
Barn+1.147
nder+1.120
stead+1.111
ð+1.055
say+1.006
Neg logit contributions
nard-1.430
-1.373
tan-1.368
usal-1.308
INESS-1.308
Token made
Token ID925
Feature activation-2.76e-01

Pos logit contributions
Barn+1.423
nder+1.399
stead+1.385
ð+1.326
say+1.270
Neg logit contributions
nard-1.407
tan-1.345
-1.342
INESS-1.293
usal-1.283
Token a
Token ID257
Feature activation-2.75e-01

Pos logit contributions
Barn+2.884
nder+2.838
stead+2.824
ð+2.699
say+2.570
Neg logit contributions
nard-2.932
tan-2.779
-2.774
INESS-2.673
usal-2.610
Token Department
Token ID2732
Feature activation+1.17e-02

Pos logit contributions
Barn+1.890
nder+1.852
stead+1.847
ð+1.752
say+1.671
Neg logit contributions
nard-2.559
-2.508
tan-2.496
INESS-2.384
serving-2.359
Token.
Token ID13
Feature activation-1.20e-01

Pos logit contributions
nard+0.131
+0.126
tan+0.126
INESS+0.120
usal+0.120
Neg logit contributions
Barn-0.115
nder-0.113
stead-0.112
ð-0.107
say-0.102
Token\n
Token ID198
Feature activation-2.32e-02

Pos logit contributions
Barn+1.162
nder+1.136
stead+1.126
ð+1.071
say+1.024
Neg logit contributions
nard-1.360
-1.306
tan-1.300
usal-1.248
INESS-1.244
Token\n
Token ID198
Feature activation-1.06e-01

Pos logit contributions
Barn+0.229
nder+0.225
stead+0.223
ð+0.212
phabet+0.212
Neg logit contributions
nard-0.259
tan-0.248
-0.247
usal-0.234
INESS-0.233
TokenThe
Token ID464
Feature activation-9.28e-02

Pos logit contributions
Barn+1.068
nder+1.044
stead+1.036
ð+0.987
phabet+0.945
Neg logit contributions
nard-1.178
-1.128
tan-1.123
usal-1.074
INESS-1.072
Token woman
Token ID2415
Feature activation-1.01e+00

Pos logit contributions
Barn+1.092
nder+1.073
stead+1.066
ð+1.023
phabet+0.988
Neg logit contributions
nard-0.864
-0.822
tan-0.818
usal-0.774
INESS-0.773
Token,
Token ID11
Feature activation-2.42e-01

Pos logit contributions
nder+8.362
stead+8.179
Barn+7.885
ð+7.769
say+7.332
Neg logit contributions
nard-10.973
-10.907
INESS-10.880
ongyang-10.705
tan-10.445
Token whose
Token ID3025
Feature activation-1.38e+00

Pos logit contributions
Barn+2.517
nder+2.457
stead+2.437
ð+2.322
say+2.248
Neg logit contributions
nard-2.581
-2.484
tan-2.446
INESS-2.357
usal-2.346
Token name
Token ID1438
Feature activation+2.08e-01

Pos logit contributions
stead+8.899
nder+8.850
Barn+8.412
ð+8.147
 Koh+7.884
Neg logit contributions
nard-15.266
ongyang-15.143
minster-14.934
INESS-14.798
serving-14.628
Token has
Token ID468
Feature activation+3.46e-02

Pos logit contributions
nard+2.093
+1.981
tan+1.974
INESS+1.868
usal+1.867
Neg logit contributions
Barn-2.287
nder-2.234
stead-2.233
ð-2.142
say-2.076
Token not
Token ID407
Feature activation-3.10e-01

Pos logit contributions
nard+0.406
+0.390
tan+0.389
INESS+0.373
usal+0.372
Neg logit contributions
Barn-0.322
nder-0.315
stead-0.313
ð-0.297
phabet-0.283
Token Kick
Token ID10279
Feature activation-5.50e-02

Pos logit contributions
Barn+1.793
nder+1.747
stead+1.731
ð+1.654
say+1.577
Neg logit contributions
nard-2.215
-2.122
tan-2.121
usal-2.041
INESS-2.033
Token Off
Token ID3242
Feature activation+1.10e-01

Pos logit contributions
Barn+0.565
nder+0.553
stead+0.550
ð+0.525
phabet+0.503
Neg logit contributions
nard-0.595
-0.568
tan-0.565
usal-0.541
INESS-0.541
Token will
Token ID481
Feature activation-1.11e-01

Pos logit contributions
nard+1.154
+1.099
tan+1.096
usal+1.036
INESS+1.036
Neg logit contributions
Barn-1.165
nder-1.142
stead-1.132
ð-1.072
phabet-1.049
Token be
Token ID307
Feature activation+7.47e-03

Pos logit contributions
Barn+1.233
nder+1.211
stead+1.202
ð+1.151
phabet+1.108
Neg logit contributions
nard-1.115
-1.065
tan-1.060
usal-1.007
INESS-1.006
Token at
Token ID379
Feature activation-1.44e-01

Pos logit contributions
nard+0.078
+0.075
tan+0.075
usal+0.071
INESS+0.071
Neg logit contributions
Barn-0.079
nder-0.077
stead-0.077
ð-0.073
phabet-0.072
Token midnight
Token ID15896
Feature activation-1.01e+00

Pos logit contributions
Barn+1.339
nder+1.304
stead+1.294
ð+1.232
say+1.171
Neg logit contributions
nard-1.696
-1.634
tan-1.626
INESS-1.562
usal-1.556
Token!
Token ID0
Feature activation-2.46e-01

Pos logit contributions
Barn+7.945
stead+7.782
ð+7.581
nder+7.516
 Koh+6.698
Neg logit contributions
nard-11.239
tan-10.821
-10.784
INESS-10.757
usal-10.582
Token (
Token ID357
Feature activation+3.28e-02

Pos logit contributions
Barn+2.073
nder+2.014
stead+1.982
ð+1.900
say+1.802
Neg logit contributions
nard-3.093
tan-2.975
-2.971
usal-2.872
INESS-2.859
TokenBring
Token ID31416
Feature activation-4.18e-01

Pos logit contributions
nard+0.363
+0.351
tan+0.348
usal+0.332
INESS+0.331
Neg logit contributions
Barn-0.325
nder-0.321
stead-0.318
ð-0.302
phabet-0.293
Token your
Token ID534
Feature activation-3.22e-01

Pos logit contributions
Barn+4.135
nder+4.109
stead+4.026
ð+3.901
igans+3.662
Neg logit contributions
nard-4.509
-4.278
tan-4.246
INESS-4.214
minster-4.140
Token own
Token ID898
Feature activation-3.56e-01

Pos logit contributions
Barn+3.544
nder+3.493
stead+3.432
ð+3.315
say+3.146
Neg logit contributions
nard-3.220
-3.056
tan-3.039
INESS-2.945
usal-2.888
Token lines
Token ID3951
Feature activation+1.84e-01

Pos logit contributions
nard+0.223
+0.209
tan+0.208
usal+0.193
INESS+0.192
Neg logit contributions
Barn-0.427
nder-0.420
stead-0.417
ð-0.403
phabet-0.392
Token then
Token ID788
Feature activation-3.61e-01

Pos logit contributions
nard+1.841
+1.738
tan+1.730
usal+1.647
INESS+1.629
Neg logit contributions
Barn-2.046
nder-1.989
stead-1.961
ð-1.898
ovember-1.851
Token you
Token ID345
Feature activation-9.49e-02

Pos logit contributions
Barn+3.732
nder+3.712
stead+3.659
ð+3.516
say+3.387
Neg logit contributions
nard-3.722
-3.598
tan-3.567
usal-3.452
INESS-3.438
Token'll
Token ID1183
Feature activation-1.80e-01

Pos logit contributions
Barn+1.076
nder+1.050
stead+1.046
ð+1.001
phabet+0.973
Neg logit contributions
nard-0.929
tan-0.881
-0.880
usal-0.834
INESS-0.829
Token go
Token ID467
Feature activation-1.56e-01

Pos logit contributions
Barn+1.996
nder+1.969
stead+1.948
ð+1.868
say+1.797
Neg logit contributions
nard-1.787
-1.717
tan-1.702
INESS-1.623
usal-1.618
Token off
Token ID572
Feature activation-1.00e+00

Pos logit contributions
Barn+1.644
nder+1.619
stead+1.604
ð+1.532
say+1.467
Neg logit contributions
nard-1.646
-1.576
tan-1.568
INESS-1.498
usal-1.494
Token at
Token ID379
Feature activation-3.68e-01

Pos logit contributions
stead+8.340
nder+8.270
Barn+8.116
ð+7.794
say+7.481
Neg logit contributions
nard-10.727
-10.477
INESS-10.439
minster-9.961
tan-9.918
Token some
Token ID617
Feature activation-1.91e-01

Pos logit contributions
Barn+3.634
nder+3.561
stead+3.496
ð+3.343
say+3.251
Neg logit contributions
nard-4.033
-3.940
tan-3.845
INESS-3.798
usal-3.696
Token speed
Token ID2866
Feature activation-3.01e-01

Pos logit contributions
Barn+2.003
nder+1.968
stead+1.948
ð+1.859
say+1.782
Neg logit contributions
nard-2.016
-1.930
tan-1.918
INESS-1.849
usal-1.842
Token.
Token ID13
Feature activation-1.97e-01

Pos logit contributions
Barn+2.898
stead+2.867
nder+2.854
ð+2.690
say+2.588
Neg logit contributions
nard-3.358
-3.260
tan-3.228
INESS-3.101
usal-3.091
Token Racing
Token ID14954
Feature activation+3.81e-01

Pos logit contributions
Barn+1.949
nder+1.906
stead+1.892
ð+1.795
say+1.720
Neg logit contributions
nard-2.207
-2.121
tan-2.110
usal-2.036
INESS-2.024
Token House
Token ID2097
Feature activation+6.31e-01

Pos logit contributions
nard+2.685
tan+2.512
+2.502
serving+2.413
usal+2.392
Neg logit contributions
Barn-2.855
stead-2.814
nder-2.796
ð-2.690
phabet-2.632
Token Majority
Token ID22171
Feature activation+6.22e-01

Pos logit contributions
nard+4.766
+4.398
tan+4.364
serving+4.167
usal+3.962
Neg logit contributions
Barn-8.172
stead-7.838
nder-7.726
ð-7.667
neys-7.423
Token Whip
Token ID40930
Feature activation+2.67e-01

Pos logit contributions
nard+4.023
+3.910
tan+3.878
serving+3.565
usal+3.512
Neg logit contributions
Barn-8.635
stead-8.407
nder-8.269
neys-8.178
ð-8.117
Token Steve
Token ID6542
Feature activation+3.29e-02

Pos logit contributions
nard+2.380
+2.227
tan+2.226
usal+2.116
serving+2.078
Neg logit contributions
Barn-3.244
nder-3.182
stead-3.176
ð-3.056
phabet-2.974
Token Scal
Token ID34529
Feature activation+1.46e-01

Pos logit contributions
nard+0.270
tan+0.253
+0.251
usal+0.237
INESS+0.234
Neg logit contributions
Barn-0.425
stead-0.415
nder-0.414
ð-0.398
phabet-0.389
Tokenise
Token ID786
Feature activation+8.40e-01

Pos logit contributions
nard+1.220
tan+1.148
+1.113
usal+1.104
minster+1.087
Neg logit contributions
Barn-1.837
stead-1.830
nder-1.763
ovember-1.684
ð-1.682
Token (
Token ID357
Feature activation+4.88e-01

Pos logit contributions
nard+7.197
+6.338
tan+6.228
usal+6.105
INESS+6.059
Neg logit contributions
stead-9.464
Barn-9.292
nder-9.182
neys-8.947
phabet-8.790
TokenR
Token ID49
Feature activation-2.32e-02

Pos logit contributions
nard+3.785
+3.647
tan+3.459
usal+3.330
serving+3.326
Neg logit contributions
nder-6.185
Barn-6.115
stead-6.070
phabet-5.913
neys-5.896
Token-
Token ID12
Feature activation+8.02e-01

Pos logit contributions
Barn+0.213
nder+0.208
stead+0.207
ð+0.197
phabet+0.188
Neg logit contributions
nard-0.274
-0.265
tan-0.262
INESS-0.252
usal-0.249
TokenLa
Token ID14772
Feature activation-9.45e-02

Pos logit contributions
nard+6.417
+6.288
serving+5.963
usal+5.908
tan+5.903
Neg logit contributions
stead-9.164
nder-9.032
neys-8.941
phabet-8.892
Barn-8.743
Token.),
Token ID12179
Feature activation+4.40e-01

Pos logit contributions
Barn+0.963
nder+0.941
stead+0.939
ð+0.901
say+0.853
Neg logit contributions
nard-1.024
-0.988
tan-0.976
INESS-0.938
minster-0.920
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
Barn+0.078
nder+0.077
stead+0.076
ð+0.073
phabet+0.071
Neg logit contributions
nard-0.066
-0.063
tan-0.063
usal-0.060
INESS-0.059
Token super
Token ID2208
Feature activation+5.08e-02

Pos logit contributions
nard+0.104
+0.099
tan+0.099
usal+0.093
INESS+0.092
Neg logit contributions
Barn-0.144
nder-0.141
stead-0.140
ð-0.135
phabet-0.131
Token-
Token ID12
Feature activation+1.24e-01

Pos logit contributions
nard+0.533
+0.509
tan+0.508
usal+0.485
INESS+0.481
Neg logit contributions
Barn-0.541
nder-0.527
stead-0.523
ð-0.502
ovember-0.482
Tokenrich
Token ID7527
Feature activation+4.04e-01

Pos logit contributions
nard+1.344
tan+1.285
+1.277
usal+1.228
serving+1.228
Neg logit contributions
Barn-1.270
nder-1.242
stead-1.221
ð-1.180
ovember-1.145
Token together
Token ID1978
Feature activation+2.17e-01

Pos logit contributions
nard+3.772
+3.480
tan+3.404
usal+3.381
serving+3.367
Neg logit contributions
Barn-4.645
nder-4.551
stead-4.440
ovember-4.362
ð-4.293
Token constit
Token ID7892
Feature activation+7.58e-01

Pos logit contributions
nard+2.190
+2.074
tan+2.046
usal+1.965
serving+1.962
Neg logit contributions
Barn-2.387
nder-2.320
stead-2.293
ð-2.194
ovember-2.165
Tokenuting
Token ID15129
Feature activation+1.16e-01

Pos logit contributions
nard+2.002
serving+1.739
usal+1.691
tan+1.300
minster+1.226
Neg logit contributions
Barn-13.315
phabet-12.715
nder-12.627
stead-12.580
ovember-12.487
Token the
Token ID262
Feature activation-5.21e-03

Pos logit contributions
nard+1.192
tan+1.135
+1.134
usal+1.080
INESS+1.070
Neg logit contributions
Barn-1.252
nder-1.226
stead-1.212
ð-1.158
phabet-1.121
Token top
Token ID1353
Feature activation+1.72e-01

Pos logit contributions
Barn+0.065
nder+0.063
stead+0.063
ð+0.061
phabet+0.059
Neg logit contributions
nard-0.045
-0.043
tan-0.043
usal-0.040
INESS-0.040
Token 10
Token ID838
Feature activation+2.28e-01

Pos logit contributions
nard+1.798
tan+1.720
+1.707
usal+1.637
serving+1.608
Neg logit contributions
Barn-1.851
stead-1.783
nder-1.783
ð-1.703
ovember-1.636
Token%
Token ID4
Feature activation+2.72e-01

Pos logit contributions
nard+2.231
tan+2.134
+2.128
usal+2.035
INESS+1.999
Neg logit contributions
Barn-2.558
nder-2.476
stead-2.471
ð-2.360
say-2.282
Tokenj
Token ID73
Feature activation+4.48e-02

Pos logit contributions
Barn+0.861
nder+0.843
stead+0.838
ð+0.797
say+0.767
Neg logit contributions
nard-0.882
-0.843
tan-0.839
INESS-0.802
usal-0.801
Tokenokes
Token ID3369
Feature activation+5.00e-01

Pos logit contributions
nard+0.484
tan+0.463
+0.459
usal+0.446
serving+0.434
Neg logit contributions
Barn-0.460
nder-0.451
stead-0.445
ð-0.429
phabet-0.412
Token"
Token ID1
Feature activation+2.01e-01

Pos logit contributions
nard+5.225
+4.999
tan+4.893
usal+4.741
minster+4.663
Neg logit contributions
Barn-5.134
stead-4.870
nder-4.851
ð-4.694
phabet-4.497
Token about
Token ID546
Feature activation-1.84e-02

Pos logit contributions
nard+2.035
+1.966
tan+1.913
usal+1.834
serving+1.813
Neg logit contributions
Barn-2.185
nder-2.133
stead-2.125
ð-2.042
phabet-1.962
Token Internal
Token ID18628
Feature activation+3.22e-02

Pos logit contributions
Barn+0.191
nder+0.187
stead+0.186
ð+0.178
phabet+0.171
Neg logit contributions
nard-0.197
-0.189
tan-0.188
usal-0.179
INESS-0.178
Token Revenue
Token ID20197
Feature activation+7.63e-01

Pos logit contributions
nard+0.313
tan+0.302
+0.299
usal+0.284
minster+0.280
Neg logit contributions
Barn-0.364
stead-0.356
nder-0.353
ð-0.345
ovember-0.329
Token Service
Token ID4809
Feature activation+3.82e-01

Pos logit contributions
nard+4.241
+4.051
tan+3.873
serving+3.591
usal+3.447
Neg logit contributions
stead-11.117
Barn-10.998
nder-10.788
ð-10.429
phabet-10.259
Token audits
Token ID41973
Feature activation+7.19e-02

Pos logit contributions
nard+3.597
+3.483
tan+3.427
usal+3.194
INESS+3.175
Neg logit contributions
Barn-4.313
stead-4.249
nder-4.190
ð-4.024
phabet-3.977
Token to
Token ID284
Feature activation-6.34e-02

Pos logit contributions
nard+0.757
+0.724
tan+0.718
usal+0.685
INESS+0.682
Neg logit contributions
Barn-0.760
nder-0.744
stead-0.739
ð-0.706
phabet-0.681
Token the
Token ID262
Feature activation-2.20e-01

Pos logit contributions
Barn+0.673
nder+0.661
stead+0.656
ð+0.626
phabet+0.602
Neg logit contributions
nard-0.662
-0.633
tan-0.630
INESS-0.600
usal-0.600
Token recent
Token ID2274
Feature activation-1.00e-01

Pos logit contributions
Barn+2.377
nder+2.341
stead+2.324
ð+2.216
say+2.137
Neg logit contributions
nard-2.245
-2.144
tan-2.120
INESS-2.057
minster-2.030
Token Sw
Token ID2451
Feature activation+3.26e-01

Pos logit contributions
nard+1.179
tan+1.100
+1.080
usal+1.016
INESS+1.010
Neg logit contributions
Barn-1.895
nder-1.858
stead-1.821
ð-1.765
ovember-1.724
Tokenamin
Token ID5669
Feature activation-4.71e-01

Pos logit contributions
nard+3.124
tan+3.001
usal+2.818
serving+2.751
ipal+2.748
Neg logit contributions
Barn-3.755
stead-3.626
nder-3.563
 myster-3.560
ð-3.443
Tokenar
Token ID283
Feature activation-1.64e-01

Pos logit contributions
Barn+3.486
nder+3.436
stead+3.380
ð+3.127
say+2.974
Neg logit contributions
nard-6.156
-6.071
INESS-6.057
tan-5.905
usal-5.842
Tokenayan
Token ID22931
Feature activation+8.91e-02

Pos logit contributions
Barn+1.206
nder+1.175
stead+1.160
ð+1.087
phabet+1.023
Neg logit contributions
nard-2.247
-2.175
tan-2.163
INESS-2.087
usal-2.086
Token A
Token ID317
Feature activation+2.10e-01

Pos logit contributions
nard+1.079
tan+1.047
+1.027
usal+0.990
minster+0.988
Neg logit contributions
Barn-0.791
nder-0.772
stead-0.754
ð-0.722
phabet-0.711
Tokenks
Token ID591
Feature activation+9.43e-01

Pos logit contributions
nard+2.214
tan+2.092
+2.030
usal+1.991
minster+1.952
Neg logit contributions
Barn-2.227
nder-2.122
 myster-2.083
stead-2.081
ovember-2.035
Tokenhard
Token ID10424
Feature activation-3.21e-02

Pos logit contributions
nard+3.202
tan+3.071
hard+2.333
serving+2.227
hi+2.110
Neg logit contributions
 myster-16.436
Barn-15.543
phabet-14.844
nder-14.496
ð-14.447
Tokenham
Token ID2763
Feature activation+2.30e-01

Pos logit contributions
Barn+0.335
nder+0.324
stead+0.322
ð+0.308
phabet+0.301
Neg logit contributions
nard-0.346
tan-0.330
-0.327
usal-0.311
minster-0.310
Token Temple
Token ID10857
Feature activation-1.56e-02

Pos logit contributions
nard+1.434
tan+1.354
+1.336
minster+1.215
usal+1.194
Neg logit contributions
Barn-3.386
nder-3.296
stead-3.211
ovember-3.183
ð-3.180
Token
Token ID198
Feature activation+1.88e-01

Pos logit contributions
Barn+0.153
nder+0.150
stead+0.149
ð+0.142
phabet+0.137
Neg logit contributions
nard-0.176
tan-0.168
-0.168
INESS-0.160
usal-0.160
Token
Token ID198
Feature activation-2.20e-01

Pos logit contributions
nard+1.910
tan+1.828
+1.787
serving+1.693
minster+1.673
Neg logit contributions
Barn-1.999
 myster-1.993
nder-1.981
phabet-1.969
stead-1.946
Token Naval
Token ID20818
Feature activation-1.66e-01

Pos logit contributions
nard+4.674
tan+4.417
+4.409
usal+4.246
serving+4.121
Neg logit contributions
Barn-5.299
stead-5.196
nder-5.143
ovember-4.885
ð-4.829
Tokenny
Token ID3281
Feature activation+2.21e-01

Pos logit contributions
Barn+1.595
nder+1.562
stead+1.549
ð+1.472
phabet+1.410
Neg logit contributions
nard-1.897
-1.821
tan-1.814
usal-1.735
INESS-1.733
Token and
Token ID290
Feature activation+5.58e-01

Pos logit contributions
nard+2.327
+2.223
tan+2.220
usal+2.104
serving+2.077
Neg logit contributions
Barn-2.327
nder-2.256
stead-2.248
ð-2.117
phabet-2.106
Token Py
Token ID9485
Feature activation-9.21e-02

Pos logit contributions
nard+5.005
+4.891
tan+4.791
usal+4.424
serving+4.257
Neg logit contributions
Barn-6.332
stead-6.278
nder-6.246
phabet-5.871
ð-5.783
Tokenot
Token ID313
Feature activation-3.01e-01

Pos logit contributions
Barn+0.730
nder+0.712
stead+0.704
ð+0.662
phabet+0.627
Neg logit contributions
nard-1.211
-1.169
tan-1.165
usal-1.121
INESS-1.120
Tokenr
Token ID81
Feature activation+9.65e-01

Pos logit contributions
Barn+2.655
nder+2.632
stead+2.593
ð+2.454
say+2.356
Neg logit contributions
nard-3.626
-3.545
tan-3.428
INESS-3.410
usal-3.330
Token Of
Token ID3226
Feature activation-2.79e-01

Pos logit contributions
nard+9.636
tan+9.571
usal+9.492
 Zin+8.926
+8.809
Neg logit contributions
Barn-8.616
stead-8.562
nder-8.500
ovember-8.008
phabet-7.878
Tokenits
Token ID896
Feature activation+2.28e-01

Pos logit contributions
Barn+2.823
nder+2.795
stead+2.783
ð+2.658
say+2.562
Neg logit contributions
nard-2.964
-2.894
tan-2.810
INESS-2.802
usal-2.698
Tokener
Token ID263
Feature activation+2.38e-01

Pos logit contributions
nard+2.414
tan+2.317
+2.253
usal+2.190
serving+2.146
Neg logit contributions
Barn-2.407
nder-2.346
stead-2.310
ð-2.199
phabet-2.158
Tokenov
Token ID709
Feature activation+2.66e-01

Pos logit contributions
nard+2.549
tan+2.467
+2.439
usal+2.337
serving+2.283
Neg logit contributions
Barn-2.451
nder-2.367
stead-2.341
ð-2.218
phabet-2.195
Token,
Token ID11
Feature activation+2.49e-01

Pos logit contributions
nard+2.776
tan+2.672
+2.660
usal+2.516
serving+2.475
Neg logit contributions
Barn-2.818
nder-2.714
stead-2.709
ð-2.551
phabet-2.540
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
nard+1.896
+1.824
tan+1.815
usal+1.734
INESS+1.732
Neg logit contributions
Barn-1.623
nder-1.584
stead-1.573
ð-1.497
phabet-1.434
Token
Token ID198
Feature activation+5.36e-02

Pos logit contributions
nard+1.314
tan+1.257
+1.242
serving+1.173
usal+1.162
Neg logit contributions
Barn-1.305
nder-1.286
stead-1.262
phabet-1.262
 myster-1.251
Token
Token ID447
Feature activation-7.87e-02

Pos logit contributions
nard+0.632
+0.610
tan+0.608
INESS+0.580
usal+0.580
Neg logit contributions
Barn-0.493
nder-0.485
stead-0.480
ð-0.454
phabet-0.436
Token
Token ID250
Feature activation-2.12e-01

Pos logit contributions
Barn+0.942
nder+0.929
stead+0.920
ð+0.886
phabet+0.857
Neg logit contributions
nard-0.717
-0.676
tan-0.674
usal-0.637
INESS-0.636
TokenChuck
Token ID44324
Feature activation-3.63e-02

Pos logit contributions
Barn+2.165
nder+2.098
stead+2.091
ð+1.998
say+1.912
Neg logit contributions
nard-2.300
-2.195
tan-2.175
usal-2.105
INESS-2.083
Token Ye
Token ID11609
Feature activation+5.55e-02

Pos logit contributions
Barn+0.373
nder+0.368
stead+0.364
ð+0.348
phabet+0.335
Neg logit contributions
nard-0.391
-0.376
tan-0.372
usal-0.357
INESS-0.355
Tokenager
Token ID3536
Feature activation-1.23e-01

Pos logit contributions
nard+0.504
tan+0.475
+0.461
usal+0.444
serving+0.439
Neg logit contributions
Barn-0.678
nder-0.656
stead-0.653
ð-0.626
 myster-0.619
Token,
Token ID11
Feature activation+2.89e-02

Pos logit contributions
Barn+1.164
nder+1.142
stead+1.131
ð+1.084
phabet+1.026
Neg logit contributions
nard-1.432
-1.375
tan-1.367
usal-1.314
INESS-1.308
Token
Token ID447
Feature activation+7.81e-02

Pos logit contributions
nard+0.367
+0.353
tan+0.352
usal+0.338
INESS+0.338
Neg logit contributions
Barn-0.243
nder-0.237
stead-0.235
ð-0.222
phabet-0.211
Token
Token ID251
Feature activation-1.01e-01

Pos logit contributions
nard+0.843
+0.809
tan+0.805
usal+0.768
INESS+0.767
Neg logit contributions
Barn-0.801
nder-0.786
stead-0.780
ð-0.743
phabet-0.714
Token http
Token ID2638
Feature activation+2.55e-01

Pos logit contributions
Barn+0.986
nder+0.964
stead+0.957
ð+0.912
phabet+0.873
Neg logit contributions
nard-1.147
-1.100
tan-1.094
usal-1.049
INESS-1.046
Token must
Token ID1276
Feature activation+5.10e-01

Pos logit contributions
nard+2.507
tan+2.354
+2.336
usal+2.272
serving+2.246
Neg logit contributions
Barn-2.905
nder-2.834
stead-2.768
ð-2.664
phabet-2.647
Token orient
Token ID11367
Feature activation-1.77e-02

Pos logit contributions
nard+4.544
tan+4.084
usal+3.951
+3.884
minster+3.798
Neg logit contributions
Barn-6.207
stead-5.907
nder-5.842
phabet-5.664
ð-5.581
Token themselves
Token ID2405
Feature activation-2.04e-02

Pos logit contributions
Barn+0.195
nder+0.192
stead+0.190
ð+0.182
phabet+0.176
Neg logit contributions
nard-0.177
-0.169
tan-0.168
usal-0.159
INESS-0.159
Token according
Token ID1864
Feature activation+1.99e-01

Pos logit contributions
Barn+0.222
nder+0.218
stead+0.217
ð+0.207
phabet+0.200
Neg logit contributions
nard-0.208
-0.198
tan-0.198
usal-0.188
INESS-0.187
Token to
Token ID284
Feature activation+9.13e-02

Pos logit contributions
nard+2.023
tan+1.928
+1.927
usal+1.840
INESS+1.826
Neg logit contributions
Barn-2.160
nder-2.129
stead-2.102
ð-1.997
ovember-1.939
Token an
Token ID281
Feature activation+3.47e-01

Pos logit contributions
nard+0.931
tan+0.886
+0.881
usal+0.843
INESS+0.831
Neg logit contributions
Barn-0.992
nder-0.985
stead-0.964
ð-0.924
phabet-0.898
Token urban
Token ID7876
Feature activation+1.94e-02

Pos logit contributions
nard+3.346
tan+3.162
+3.117
usal+3.067
serving+3.011
Neg logit contributions
nder-3.869
Barn-3.829
stead-3.686
ð-3.599
ovember-3.543
Token center
Token ID3641
Feature activation+1.41e-01

Pos logit contributions
nard+0.207
+0.198
tan+0.197
usal+0.188
INESS+0.187
Neg logit contributions
Barn-0.203
nder-0.200
stead-0.197
ð-0.189
phabet-0.182
Token,
Token ID11
Feature activation+2.35e-01

Pos logit contributions
nard+1.483
tan+1.409
+1.407
usal+1.353
serving+1.333
Neg logit contributions
Barn-1.484
nder-1.458
stead-1.419
ð-1.374
ovember-1.339
Token which
Token ID543
Feature activation+4.40e-01

Pos logit contributions
nard+2.295
tan+2.183
+2.156
usal+2.075
serving+2.059
Neg logit contributions
Barn-2.622
nder-2.620
stead-2.556
ð-2.452
ovember-2.407
Token in
Token ID287
Feature activation-4.78e-02

Pos logit contributions
nard+4.068
tan+3.790
+3.677
usal+3.636
serving+3.527
Neg logit contributions
Barn-5.039
nder-4.987
stead-4.887
phabet-4.671
ð-4.642
Token
Token ID960
Feature activation-7.07e-02

Pos logit contributions
Barn+1.969
stead+1.939
nder+1.935
ð+1.813
say+1.744
Neg logit contributions
nard-2.363
-2.282
tan-2.271
usal-2.172
INESS-2.170
Tokenno
Token ID3919
Feature activation-1.74e-01

Pos logit contributions
Barn+0.668
nder+0.654
stead+0.648
ð+0.615
phabet+0.589
Neg logit contributions
nard-0.822
-0.790
tan-0.786
usal-0.753
INESS-0.752
Token official
Token ID1743
Feature activation-2.88e-01

Pos logit contributions
Barn+1.895
nder+1.865
stead+1.852
ð+1.763
say+1.696
Neg logit contributions
nard-1.776
-1.697
tan-1.684
INESS-1.608
usal-1.602
Token party
Token ID2151
Feature activation-1.09e-01

Pos logit contributions
Barn+3.112
nder+3.059
stead+3.044
ð+2.894
say+2.808
Neg logit contributions
nard-2.919
-2.786
tan-2.756
INESS-2.654
usal-2.629
Token publication
Token ID9207
Feature activation-2.74e-01

Pos logit contributions
Barn+1.184
nder+1.162
stead+1.155
ð+1.103
phabet+1.061
Neg logit contributions
nard-1.111
-1.061
tan-1.057
usal-1.004
INESS-1.004
Token was
Token ID373
Feature activation+1.02e-01

Pos logit contributions
Barn+2.640
stead+2.632
nder+2.596
ð+2.445
say+2.360
Neg logit contributions
nard-3.082
-2.960
tan-2.942
INESS-2.804
usal-2.802
Token made
Token ID925
Feature activation-9.41e-02

Pos logit contributions
nard+1.010
+0.972
tan+0.965
usal+0.916
minster+0.904
Neg logit contributions
Barn-1.144
nder-1.111
stead-1.097
ð-1.061
phabet-1.036
Token available
Token ID1695
Feature activation-7.97e-02

Pos logit contributions
Barn+1.007
nder+0.989
stead+0.983
ð+0.939
phabet+0.903
Neg logit contributions
nard-0.975
-0.932
tan-0.928
INESS-0.883
usal-0.883
Token to
Token ID284
Feature activation+1.60e-01

Pos logit contributions
Barn+0.823
nder+0.809
stead+0.804
ð+0.766
phabet+0.734
Neg logit contributions
nard-0.854
-0.818
tan-0.815
INESS-0.778
usal-0.776
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
nard+1.609
+1.534
tan+1.530
usal+1.443
serving+1.427
Neg logit contributions
Barn-1.752
nder-1.736
stead-1.721
ð-1.642
phabet-1.597
Token members
Token ID1866
Feature activation-1.25e-02

Pos logit contributions
Barn+1.462
nder+1.436
stead+1.426
ð+1.364
phabet+1.314
Neg logit contributions
nard-1.300
-1.240
tan-1.235
usal-1.172
INESS-1.172
Token them
Token ID606
Feature activation-1.21e-01

Pos logit contributions
nard+0.083
+0.079
tan+0.079
usal+0.075
INESS+0.075
Neg logit contributions
Barn-0.087
nder-0.085
stead-0.085
ð-0.081
phabet-0.078
Token from
Token ID422
Feature activation-8.76e-03

Pos logit contributions
Barn+1.244
nder+1.225
stead+1.217
ð+1.157
say+1.106
Neg logit contributions
nard-1.302
-1.251
tan-1.243
INESS-1.188
usal-1.185
Token the
Token ID262
Feature activation+1.29e-01

Pos logit contributions
Barn+0.094
nder+0.092
stead+0.091
ð+0.087
phabet+0.085
Neg logit contributions
nard-0.091
-0.087
tan-0.087
usal-0.083
INESS-0.082
Token inside
Token ID2641
Feature activation-3.48e-02

Pos logit contributions
nard+1.112
tan+1.075
+1.070
usal+0.999
serving+0.975
Neg logit contributions
Barn-1.587
nder-1.555
stead-1.528
ð-1.475
phabet-1.470
Token may
Token ID743
Feature activation+2.46e-01

Pos logit contributions
Barn+0.363
nder+0.356
stead+0.353
ð+0.337
phabet+0.325
Neg logit contributions
nard-0.371
-0.354
tan-0.353
usal-0.337
INESS-0.336
Token be
Token ID307
Feature activation+1.00e-01

Pos logit contributions
nard+2.263
tan+2.064
+2.024
usal+1.958
minster+1.931
Neg logit contributions
Barn-2.981
stead-2.896
nder-2.884
phabet-2.752
ð-2.735
Token our
Token ID674
Feature activation-3.55e-01

Pos logit contributions
nard+1.017
tan+0.964
+0.961
usal+0.918
minster+0.904
Neg logit contributions
Barn-1.100
nder-1.084
stead-1.076
ð-1.025
phabet-1.007
Token best
Token ID1266
Feature activation+2.68e-01

Pos logit contributions
Barn+3.532
nder+3.467
stead+3.458
ð+3.318
say+3.149
Neg logit contributions
nard-3.824
-3.645
tan-3.592
INESS-3.514
minster-3.457
Token hope
Token ID2911
Feature activation-2.02e-01

Pos logit contributions
nard+2.435
+2.319
tan+2.308
usal+2.200
serving+2.147
Neg logit contributions
Barn-3.212
nder-3.173
stead-3.094
ð-2.987
phabet-2.970
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
Barn+1.933
stead+1.907
nder+1.895
ð+1.787
say+1.704
Neg logit contributions
nard-2.266
-2.199
tan-2.185
INESS-2.093
usal-2.081
Token In
Token ID554
Feature activation+4.42e-01

Pos logit contributions
nard+1.568
tan+1.502
+1.484
INESS+1.409
serving+1.405
Neg logit contributions
Barn-1.521
nder-1.509
stead-1.480
phabet-1.441
ð-1.423
Token to
Token ID284
Feature activation+4.60e-01

Pos logit contributions
nard+4.329
tan+4.155
+4.073
usal+3.822
serving+3.814
Neg logit contributions
Barn-4.888
nder-4.796
stead-4.613
phabet-4.496
ovember-4.483
Token Bar
Token ID2409
Feature activation-3.57e-01

Pos logit contributions
nard+4.805
tan+4.592
+4.356
usal+4.308
minster+4.201
Neg logit contributions
Barn-4.683
nder-4.664
stead-4.535
ð-4.278
ovember-4.246
Tokenst
Token ID301
Feature activation+1.50e-01

Pos logit contributions
Barn+3.409
stead+3.407
nder+3.396
ð+3.241
neys+3.144
Neg logit contributions
nard-3.939
-3.876
INESS-3.777
tan-3.765
usal-3.594
Tokenow
Token ID322
Feature activation-6.08e-02

Pos logit contributions
nard+1.534
tan+1.449
+1.414
usal+1.382
minster+1.367
Neg logit contributions
Barn-1.632
nder-1.594
stead-1.575
ð-1.511
phabet-1.490
Token,
Token ID11
Feature activation+8.44e-02

Pos logit contributions
Barn+0.620
nder+0.608
stead+0.605
ð+0.576
phabet+0.552
Neg logit contributions
nard-0.660
-0.635
tan-0.630
INESS-0.602
usal-0.602
Token to
Token ID284
Feature activation+3.32e-01

Pos logit contributions
nard+0.911
tan+0.873
+0.863
usal+0.829
serving+0.821
Neg logit contributions
Barn-0.865
nder-0.854
stead-0.838
ð-0.802
phabet-0.777
Token the
Token ID262
Feature activation+2.18e-02

Pos logit contributions
nard+3.435
tan+3.292
+3.170
usal+3.123
minster+3.041
Neg logit contributions
Barn-3.482
nder-3.458
stead-3.380
ð-3.189
ovember-3.145
Token much
Token ID881
Feature activation-5.37e-01

Pos logit contributions
nard+0.233
tan+0.223
+0.222
usal+0.212
INESS+0.210
Neg logit contributions
Barn-0.226
nder-0.222
stead-0.220
ð-0.210
phabet-0.203
Token warmer
Token ID23254
Feature activation-1.99e-01

Pos logit contributions
Barn+4.869
nder+4.823
stead+4.707
ð+4.481
neys+4.342
Neg logit contributions
nard-6.021
-6.002
INESS-5.912
tan-5.799
ongyang-5.635
Token Death
Token ID5830
Feature activation+2.77e-01

Pos logit contributions
Barn+2.030
nder+1.997
stead+1.984
ð+1.885
say+1.803
Neg logit contributions
nard-2.146
-2.068
tan-2.042
INESS-1.970
usal-1.956
Token Valley
Token ID6916
Feature activation-1.05e-01

Pos logit contributions
nard+2.276
tan+2.141
+2.080
minster+1.973
usal+1.947
Neg logit contributions
Barn-3.532
nder-3.469
stead-3.428
ð-3.290
phabet-3.274
Token actually
Token ID1682
Feature activation-6.58e-01

Pos logit contributions
Barn+4.689
stead+4.645
nder+4.630
ð+4.402
igans+4.206
Neg logit contributions
nard-5.882
-5.684
tan-5.637
INESS-5.533
minster-5.368
Token gets
Token ID3011
Feature activation-2.07e-01

Pos logit contributions
stead+5.804
Barn+5.753
nder+5.656
ð+5.437
igans+5.257
Neg logit contributions
nard-7.489
-7.164
tan-7.140
INESS-6.981
ongyang-6.870
Token in
Token ID287
Feature activation+1.06e-01

Pos logit contributions
Barn+2.202
nder+2.167
stead+2.154
ð+2.065
say+1.968
Neg logit contributions
nard-2.156
-2.049
tan-2.047
usal-1.950
INESS-1.945
Token the
Token ID262
Feature activation+1.70e-01

Pos logit contributions
nard+1.065
+1.010
tan+1.007
INESS+0.962
usal+0.961
Neg logit contributions
Barn-1.166
nder-1.150
stead-1.143
ð-1.088
ovember-1.047
Token way
Token ID835
Feature activation-1.57e-02

Pos logit contributions
nard+1.250
+1.189
tan+1.182
usal+1.108
INESS+1.095
Neg logit contributions
Barn-2.307
stead-2.283
nder-2.282
ð-2.187
ovember-2.139
Token,
Token ID11
Feature activation-1.36e-01

Pos logit contributions
Barn+0.159
nder+0.156
stead+0.155
ð+0.148
phabet+0.142
Neg logit contributions
nard-0.172
-0.164
tan-0.164
usal-0.156
INESS-0.156
Token as
Token ID355
Feature activation-3.13e-01

Pos logit contributions
Barn+1.369
nder+1.340
stead+1.329
ð+1.269
say+1.219
Neg logit contributions
nard-1.498
-1.437
tan-1.429
INESS-1.369
usal-1.364
Token it
Token ID340
Feature activation-4.30e-01

Pos logit contributions
Barn+3.181
nder+3.090
stead+3.051
ð+2.950
say+2.831
Neg logit contributions
nard-3.355
tan-3.224
-3.223
INESS-3.096
usal-3.057
Token's
Token ID338
Feature activation-2.34e-01

Pos logit contributions
Barn+4.093
nder+4.083
stead+4.021
ð+3.867
say+3.681
Neg logit contributions
nard-4.740
-4.556
tan-4.550
INESS-4.484
serving-4.342
Token possible
Token ID1744
Feature activation-1.17e-01

Pos logit contributions
Barn+2.422
nder+2.368
stead+2.359
ð+2.263
say+2.171
Neg logit contributions
nard-2.490
-2.374
tan-2.360
INESS-2.267
usal-2.251
Token for
Token ID329
Feature activation-1.49e-01

Pos logit contributions
Barn+1.160
nder+1.129
stead+1.121
ð+1.076
phabet+1.026
Neg logit contributions
nard-1.298
tan-1.243
-1.241
INESS-1.187
usal-1.176
Token but
Token ID475
Feature activation-2.18e-01

Pos logit contributions
Barn+1.700
nder+1.672
stead+1.666
ð+1.593
say+1.531
Neg logit contributions
nard-1.571
-1.502
tan-1.495
INESS-1.428
usal-1.417
Token Honour
Token ID45129
Feature activation-2.32e-01

Pos logit contributions
Barn+2.226
stead+2.200
nder+2.185
ð+2.088
phabet+1.984
Neg logit contributions
nard-2.327
-2.231
tan-2.229
INESS-2.135
usal-2.128
Token Among
Token ID9754
Feature activation-1.40e-01

Pos logit contributions
Barn+2.584
nder+2.545
stead+2.515
ð+2.439
phabet+2.348
Neg logit contributions
nard-2.256
-2.193
tan-2.172
usal-2.062
INESS-2.060
Token Thieves
Token ID40392
Feature activation-2.74e-01

Pos logit contributions
Barn+1.657
nder+1.628
stead+1.612
ð+1.550
phabet+1.505
Neg logit contributions
nard-1.297
-1.227
tan-1.225
usal-1.158
INESS-1.153
Token is
Token ID318
Feature activation+5.74e-02

Pos logit contributions
Barn+2.703
stead+2.668
nder+2.662
ð+2.549
say+2.392
Neg logit contributions
nard-3.015
tan-2.883
-2.880
INESS-2.766
usal-2.753
Token a
Token ID257
Feature activation+1.41e-02

Pos logit contributions
nard+0.598
+0.570
tan+0.570
usal+0.542
INESS+0.537
Neg logit contributions
Barn-0.614
nder-0.599
stead-0.590
ð-0.566
phabet-0.552
Token h
Token ID289
Feature activation+2.06e-01

Pos logit contributions
nard+0.146
+0.140
tan+0.139
usal+0.133
INESS+0.132
Neg logit contributions
Barn-0.150
nder-0.147
stead-0.145
ð-0.139
phabet-0.135
Tokenoot
Token ID1025
Feature activation-8.29e-02

Pos logit contributions
nard+2.117
tan+2.028
+2.004
usal+1.974
serving+1.909
Neg logit contributions
Barn-2.195
nder-2.119
stead-2.094
ð-2.034
ovember-1.978
Token from
Token ID422
Feature activation-5.48e-01

Pos logit contributions
Barn+0.845
nder+0.833
stead+0.829
ð+0.788
phabet+0.754
Neg logit contributions
nard-0.896
-0.861
tan-0.857
usal-0.818
INESS-0.817
Token bold
Token ID10758
Feature activation+8.61e-03

Pos logit contributions
stead+5.092
Barn+5.042
nder+4.874
ð+4.768
say+4.592
Neg logit contributions
nard-6.051
tan-5.869
-5.838
INESS-5.657
serving-5.632
Token beginning
Token ID3726
Feature activation+2.08e-01

Pos logit contributions
nard+0.088
+0.083
tan+0.083
usal+0.079
INESS+0.078
Neg logit contributions
Barn-0.095
nder-0.093
stead-0.092
ð-0.088
phabet-0.085
Tokenorigin
Token ID47103
Feature activation-1.36e-01

Pos logit contributions
nard+6.445
tan+6.001
usal+5.864
+5.606
minster+5.485
Neg logit contributions
Barn-7.705
stead-7.464
nder-7.264
 myster-7.011
phabet-6.828
Token carefully
Token ID7773
Feature activation-1.25e-01

Pos logit contributions
Barn+1.392
nder+1.369
stead+1.358
ð+1.300
phabet+1.238
Neg logit contributions
nard-1.459
-1.395
tan-1.386
usal-1.328
INESS-1.326
Token though
Token ID996
Feature activation-4.15e-01

Pos logit contributions
Barn+1.207
nder+1.184
stead+1.175
ð+1.123
phabet+1.065
Neg logit contributions
nard-1.411
-1.356
tan-1.351
usal-1.294
INESS-1.294
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
Barn+3.537
nder+3.523
stead+3.516
ð+3.405
say+3.127
Neg logit contributions
nard-4.918
-4.721
tan-4.689
minster-4.597
INESS-4.579
Token as
Token ID355
Feature activation+1.80e-01

Pos logit contributions
Barn+1.031
nder+1.010
stead+1.002
ð+0.956
phabet+0.910
Neg logit contributions
nard-1.217
-1.166
tan-1.162
usal-1.113
INESS-1.113
Token you
Token ID345
Feature activation-1.38e-01

Pos logit contributions
nard+1.799
+1.721
tan+1.710
usal+1.623
INESS+1.591
Neg logit contributions
Barn-1.986
nder-1.929
stead-1.929
phabet-1.817
ð-1.808
Token may
Token ID743
Feature activation+3.32e-02

Pos logit contributions
Barn+1.500
nder+1.477
stead+1.462
ð+1.405
say+1.346
Neg logit contributions
nard-1.402
-1.341
tan-1.334
INESS-1.270
usal-1.269
Token find
Token ID1064
Feature activation-3.04e-01

Pos logit contributions
nard+0.332
tan+0.313
+0.310
usal+0.297
INESS+0.292
Neg logit contributions
Barn-0.374
stead-0.364
nder-0.363
ð-0.343
phabet-0.339
Token yourself
Token ID3511
Feature activation-3.58e-01

Pos logit contributions
Barn+2.963
nder+2.940
stead+2.881
ð+2.798
say+2.616
Neg logit contributions
nard-3.359
-3.226
tan-3.224
INESS-3.109
usal-3.077
Token stuck
Token ID7819
Feature activation-3.58e-01

Pos logit contributions
Barn+3.666
nder+3.636
stead+3.598
ð+3.486
say+3.311
Neg logit contributions
nard-3.736
-3.601
tan-3.591
INESS-3.450
usal-3.415
Token with
Token ID351
Feature activation-2.74e-01

Pos logit contributions
Barn+3.512
nder+3.483
stead+3.449
ð+3.355
say+3.131
Neg logit contributions
nard-3.887
-3.731
tan-3.704
INESS-3.586
usal-3.559
Token (
Token ID357
Feature activation+1.61e-01

Pos logit contributions
Barn+3.891
stead+3.826
nder+3.787
ð+3.569
say+3.439
Neg logit contributions
nard-5.421
-5.185
tan-5.156
INESS-5.050
usal-5.042
TokenAnti
Token ID28795
Feature activation-1.20e-01

Pos logit contributions
+1.063
nard+1.058
tan+1.048
usal+0.915
serving+0.902
Neg logit contributions
nder-2.272
Barn-2.245
stead-2.239
phabet-2.211
ð-2.183
TokenAli
Token ID37893
Feature activation+4.56e-01

Pos logit contributions
Barn+1.193
nder+1.170
stead+1.160
ð+1.105
phabet+1.062
Neg logit contributions
nard-1.337
-1.284
tan-1.278
usal-1.219
INESS-1.217
Tokenasing
Token ID2313
Feature activation-9.20e-02

Pos logit contributions
nard+3.612
tan+3.609
+3.468
serving+3.422
usal+3.326
Neg logit contributions
Barn-5.563
 myster-5.446
stead-5.336
neys-5.218
nder-5.170
Token is
Token ID318
Feature activation-4.88e-02

Pos logit contributions
Barn+0.986
nder+0.967
stead+0.959
ð+0.917
phabet+0.885
Neg logit contributions
nard-0.955
-0.914
tan-0.910
usal-0.865
INESS-0.863
Token a
Token ID257
Feature activation-9.74e-02

Pos logit contributions
Barn+0.514
nder+0.501
stead+0.496
ð+0.474
phabet+0.463
Neg logit contributions
nard-0.520
tan-0.497
-0.496
usal-0.473
INESS-0.469
Token technique
Token ID8173
Feature activation+5.49e-02

Pos logit contributions
Barn+0.987
nder+0.964
stead+0.953
ð+0.912
phabet+0.879
Neg logit contributions
nard-1.071
-1.027
tan-1.024
usal-0.979
INESS-0.974
Token used
Token ID973
Feature activation+7.00e-02

Pos logit contributions
nard+0.477
tan+0.454
+0.454
usal+0.430
serving+0.418
Neg logit contributions
Barn-0.684
nder-0.667
stead-0.661
ð-0.638
phabet-0.630
Token to
Token ID284
Feature activation-8.98e-02

Pos logit contributions
nard+0.683
+0.653
tan+0.649
usal+0.616
INESS+0.610
Neg logit contributions
Barn-0.798
nder-0.775
stead-0.766
phabet-0.733
ð-0.733
Token eliminate
Token ID11005
Feature activation+3.26e-01

Pos logit contributions
Barn+0.983
nder+0.961
stead+0.957
ð+0.912
phabet+0.883
Neg logit contributions
nard-0.917
tan-0.873
-0.871
usal-0.830
INESS-0.826
Token this
Token ID428
Feature activation-2.12e-01

Pos logit contributions
nard+2.669
tan+2.596
+2.507
usal+2.341
serving+2.295
Neg logit contributions
Barn-4.104
stead-3.979
nder-3.968
phabet-3.890
 myster-3.878
Tokenoe
Token ID2577
Feature activation-9.79e-03

Pos logit contributions
Barn+4.570
nder+4.540
stead+4.461
ð+4.193
say+4.148
Neg logit contributions
nard-4.204
-4.101
tan-3.996
INESS-3.903
serving-3.819
Tokengee
Token ID29622
Feature activation+3.82e-01

Pos logit contributions
Barn+0.098
nder+0.096
stead+0.095
ð+0.090
phabet+0.087
Neg logit contributions
nard-0.108
-0.104
tan-0.104
usal-0.099
INESS-0.098
Token's
Token ID338
Feature activation-2.87e-01

Pos logit contributions
nard+3.834
tan+3.649
+3.641
serving+3.409
minster+3.383
Neg logit contributions
Barn-4.109
nder-3.945
stead-3.921
ð-3.736
ovember-3.730
Token episode
Token ID4471
Feature activation-3.88e-01

Pos logit contributions
Barn+3.187
nder+3.150
stead+3.138
ð+2.995
say+2.897
Neg logit contributions
nard-2.825
-2.680
tan-2.646
INESS-2.562
usal-2.535
Token and
Token ID290
Feature activation-2.20e-01

Pos logit contributions
Barn+3.601
stead+3.537
nder+3.525
ð+3.300
phabet+3.205
Neg logit contributions
nard-4.402
-4.255
tan-4.242
usal-4.101
INESS-4.064
Token I
Token ID314
Feature activation-2.26e-01

Pos logit contributions
Barn+2.180
nder+2.127
stead+2.119
ð+2.015
phabet+1.929
Neg logit contributions
nard-2.430
tan-2.334
-2.331
usal-2.242
INESS-2.234
Token think
Token ID892
Feature activation-3.89e-01

Pos logit contributions
Barn+2.254
nder+2.234
stead+2.203
ð+2.102
say+2.015
Neg logit contributions
nard-2.492
-2.400
tan-2.380
INESS-2.288
usal-2.276
Token it
Token ID340
Feature activation-1.81e-01

Pos logit contributions
Barn+3.766
stead+3.662
nder+3.648
ð+3.484
phabet+3.326
Neg logit contributions
nard-4.267
tan-4.146
-4.104
usal-3.996
INESS-3.961
Token's
Token ID338
Feature activation+1.79e-01

Pos logit contributions
Barn+1.833
nder+1.801
stead+1.787
ð+1.705
say+1.625
Neg logit contributions
nard-1.960
-1.880
tan-1.867
INESS-1.806
usal-1.793
Token a
Token ID257
Feature activation-2.05e-02

Pos logit contributions
nard+1.803
tan+1.743
+1.726
usal+1.625
serving+1.618
Neg logit contributions
Barn-1.931
nder-1.913
stead-1.880
ð-1.788
phabet-1.771
Token real
Token ID1103
Feature activation-2.15e-01

Pos logit contributions
Barn+0.219
nder+0.214
stead+0.212
ð+0.203
phabet+0.196
Neg logit contributions
nard-0.214
-0.205
tan-0.204
usal-0.194
INESS-0.193
Token Hai
Token ID36876
Feature activation-8.29e-02

Pos logit contributions
nard+1.069
tan+1.028
+1.025
usal+0.975
INESS+0.965
Neg logit contributions
Barn-0.917
nder-0.899
stead-0.894
ð-0.838
phabet-0.815
Tokeny
Token ID88
Feature activation+3.55e-01

Pos logit contributions
Barn+0.805
nder+0.792
stead+0.782
ð+0.750
phabet+0.713
Neg logit contributions
nard-0.940
-0.904
tan-0.895
INESS-0.863
usal-0.858
Tokenun
Token ID403
Feature activation+2.38e-01

Pos logit contributions
nard+3.919
tan+3.811
+3.661
usal+3.607
ongyang+3.563
Neg logit contributions
Barn-3.537
stead-3.445
nder-3.298
phabet-3.100
ovember-3.088
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
nard+2.611
tan+2.540
+2.489
usal+2.378
serving+2.343
Neg logit contributions
Barn-2.400
nder-2.310
stead-2.306
ð-2.155
ovember-2.140
Token bestowed
Token ID40293
Feature activation-2.63e-01

Pos logit contributions
nard+1.183
tan+1.142
+1.137
usal+1.075
INESS+1.058
Neg logit contributions
Barn-1.227
nder-1.212
stead-1.201
ð-1.130
phabet-1.107
Token the
Token ID262
Feature activation-6.83e-01

Pos logit contributions
Barn+2.583
nder+2.545
stead+2.496
ð+2.456
say+2.297
Neg logit contributions
nard-2.933
-2.780
tan-2.737
INESS-2.681
usal-2.653
Token name
Token ID1438
Feature activation+9.85e-02

Pos logit contributions
Barn+5.002
nder+4.866
ð+4.800
stead+4.748
say+4.366
Neg logit contributions
nard-8.722
-8.180
INESS-8.124
minster-8.031
serving-7.940
Token Z
Token ID1168
Feature activation+3.40e-01

Pos logit contributions
nard+1.095
tan+1.056
+1.051
usal+1.000
INESS+0.989
Neg logit contributions
Barn-0.979
nder-0.960
stead-0.955
ð-0.894
phabet-0.873
Tokenhen
Token ID831
Feature activation+3.43e-01

Pos logit contributions
nard+3.731
tan+3.607
+3.501
usal+3.426
ipal+3.342
Neg logit contributions
Barn-3.405
stead-3.264
nder-3.233
ð-3.062
phabet-3.014
Tokenjin
Token ID18594
Feature activation+3.58e-01

Pos logit contributions
nard+3.788
tan+3.717
+3.577
usal+3.452
serving+3.395
Neg logit contributions
Barn-3.427
stead-3.288
nder-3.239
ovember-3.059
ð-3.041
Token (
Token ID357
Feature activation-8.38e-02

Pos logit contributions
nard+3.713
tan+3.662
+3.593
usal+3.376
ipal+3.326
Neg logit contributions
Barn-3.756
stead-3.613
nder-3.610
ovember-3.400
ð-3.322
Token Temple
Token ID10857
Feature activation-2.88e-01

Pos logit contributions
Barn+2.515
stead+2.478
nder+2.455
ð+2.355
say+2.248
Neg logit contributions
nard-2.529
tan-2.395
-2.388
INESS-2.291
usal-2.279
Token
Token ID447
Feature activation-2.43e-01

Pos logit contributions
Barn+2.299
stead+2.286
nder+2.263
ð+2.160
phabet+2.008
Neg logit contributions
nard-3.678
tan-3.544
-3.532
INESS-3.433
usal-3.414
Token
Token ID247
Feature activation-9.25e-02

Pos logit contributions
Barn+2.060
nder+2.033
stead+1.997
ð+1.913
say+1.807
Neg logit contributions
nard-3.029
tan-2.914
-2.889
INESS-2.801
usal-2.789
Token.
Token ID13
Feature activation-7.49e-02

Pos logit contributions
Barn+0.944
nder+0.927
stead+0.923
ð+0.880
phabet+0.843
Neg logit contributions
nard-1.003
-0.961
tan-0.958
INESS-0.914
usal-0.911
Token Like
Token ID4525
Feature activation-8.61e-01

Pos logit contributions
Barn+0.696
nder+0.680
stead+0.675
ð+0.642
say+0.611
Neg logit contributions
nard-0.882
-0.847
tan-0.844
usal-0.812
INESS-0.810
Token the
Token ID262
Feature activation-8.95e-01

Pos logit contributions
stead+7.725
nder+7.689
Barn+7.619
ð+7.293
say+7.015
Neg logit contributions
nard-8.788
tan-8.590
INESS-8.549
-8.416
usal-8.308
Token first
Token ID717
Feature activation-7.98e-01

Pos logit contributions
stead+8.373
Barn+8.336
nder+8.137
ð+7.753
say+7.405
Neg logit contributions
nard-8.797
INESS-8.558
tan-8.512
-8.510
serving-8.366
Token time
Token ID640
Feature activation-4.67e-01

Pos logit contributions
Barn+6.774
nder+6.746
stead+6.706
ð+6.307
say+5.957
Neg logit contributions
nard-8.759
-8.476
tan-8.442
INESS-8.360
serving-8.265
Token you
Token ID345
Feature activation-6.77e-01

Pos logit contributions
Barn+4.410
nder+4.366
stead+4.331
ð+4.211
say+3.870
Neg logit contributions
nard-5.149
tan-4.984
-4.967
INESS-4.747
serving-4.719
Token enter
Token ID3802
Feature activation-7.16e-01

Pos logit contributions
nder+7.019
Barn+6.955
stead+6.839
ð+6.754
say+6.457
Neg logit contributions
nard-6.634
-6.466
INESS-6.317
tan-6.298
minster-6.092
Token the
Token ID262
Feature activation-6.94e-01

Pos logit contributions
Barn+6.237
stead+6.092
nder+6.030
ð+5.879
 Koh+5.480
Neg logit contributions
nard-7.881
-7.680
tan-7.502
serving-7.484
INESS-7.463
TokenMother
Token ID31398
Feature activation-3.62e-01

Pos logit contributions
nard+0.539
+0.516
tan+0.515
usal+0.490
INESS+0.488
Neg logit contributions
Barn-0.521
nder-0.512
stead-0.506
ð-0.485
phabet-0.467
Tokenboard
Token ID3526
Feature activation-6.55e-01

Pos logit contributions
Barn+3.122
stead+3.058
nder+3.053
ð+2.866
say+2.744
Neg logit contributions
nard-4.444
-4.266
tan-4.211
INESS-4.104
minster-4.055
Token and
Token ID290
Feature activation-1.43e-01

Pos logit contributions
stead+6.402
nder+6.388
Barn+6.378
ð+5.985
igans+5.811
Neg logit contributions
nard-6.841
tan-6.433
-6.372
INESS-6.151
serving-6.103
Token Boot
Token ID18892
Feature activation-5.35e-01

Pos logit contributions
Barn+1.540
nder+1.511
stead+1.503
ð+1.434
phabet+1.377
Neg logit contributions
nard-1.481
-1.414
tan-1.408
INESS-1.345
usal-1.341
Token"
Token ID1
Feature activation-2.30e-01

Pos logit contributions
stead+5.290
Barn+5.239
nder+5.163
ð+4.852
neys+4.752
Neg logit contributions
nard-5.640
-5.403
tan-5.389
INESS-5.139
usal-5.128
Token green
Token ID4077
Feature activation-8.74e-01

Pos logit contributions
Barn+2.413
nder+2.356
stead+2.347
ð+2.238
say+2.159
Neg logit contributions
nard-2.419
-2.312
tan-2.293
INESS-2.198
usal-2.176
Token,
Token ID11
Feature activation-2.56e-01

Pos logit contributions
stead+7.052
Barn+6.948
nder+6.921
ð+6.483
igans+6.395
Neg logit contributions
nard-9.667
-9.331
ongyang-9.178
tan-9.083
INESS-8.912
Token because
Token ID780
Feature activation-1.88e-01

Pos logit contributions
Barn+2.601
nder+2.535
stead+2.529
ð+2.408
say+2.332
Neg logit contributions
nard-2.768
-2.637
tan-2.601
INESS-2.524
usal-2.493
Token instances
Token ID10245
Feature activation-2.81e-01

Pos logit contributions
Barn+1.883
nder+1.844
stead+1.830
ð+1.747
phabet+1.667
Neg logit contributions
nard-2.060
-1.973
tan-1.965
INESS-1.884
usal-1.875
Token of
Token ID286
Feature activation-3.18e-01

Pos logit contributions
Barn+2.819
nder+2.796
stead+2.767
ð+2.645
say+2.547
Neg logit contributions
nard-3.042
-2.940
tan-2.896
INESS-2.785
minster-2.751
Token this
Token ID428
Feature activation-3.90e-01

Pos logit contributions
Barn+3.238
nder+3.180
stead+3.164
ð+3.010
say+2.902
Neg logit contributions
nard-3.408
-3.253
tan-3.229
INESS-3.138
usal-3.076
Token marks
Token ID8849
Feature activation+2.44e-01

Pos logit contributions
nard+2.476
+2.387
tan+2.368
usal+2.279
INESS+2.269
Neg logit contributions
Barn-1.414
nder-1.383
stead-1.338
ð-1.306
ovember-1.233
Token on
Token ID319
Feature activation-3.90e-01

Pos logit contributions
nard+2.490
+2.327
tan+2.320
usal+2.201
minster+2.188
Neg logit contributions
Barn-2.642
nder-2.592
stead-2.552
ð-2.466
phabet-2.402
Token top
Token ID1353
Feature activation+2.28e-03

Pos logit contributions
Barn+3.966
stead+3.897
nder+3.861
ð+3.651
say+3.501
Neg logit contributions
nard-4.162
-4.020
tan-3.953
INESS-3.903
serving-3.774
Token of
Token ID286
Feature activation+2.17e-01

Pos logit contributions
nard+0.025
+0.024
tan+0.024
INESS+0.023
usal+0.022
Neg logit contributions
Barn-0.023
nder-0.023
stead-0.023
ð-0.022
phabet-0.021
Token the
Token ID262
Feature activation-2.26e-01

Pos logit contributions
nard+2.147
tan+2.041
+2.023
usal+1.924
serving+1.876
Neg logit contributions
Barn-2.390
nder-2.356
stead-2.320
ð-2.239
phabet-2.195
Token deep
Token ID2769
Feature activation-7.68e-01

Pos logit contributions
Barn+2.452
nder+2.407
stead+2.393
ð+2.263
say+2.159
Neg logit contributions
nard-2.316
-2.220
tan-2.192
INESS-2.125
usal-2.094
Token routes
Token ID11926
Feature activation+1.66e-01

Pos logit contributions
Barn+7.506
nder+7.441
stead+7.279
ð+6.732
neys+6.579
Neg logit contributions
nard-7.598
-7.493
tan-7.261
INESS-7.187
minster-7.131
Token,
Token ID11
Feature activation+9.22e-02

Pos logit contributions
nard+1.742
tan+1.643
+1.639
usal+1.565
INESS+1.537
Neg logit contributions
Barn-1.761
nder-1.732
stead-1.714
ð-1.648
ovember-1.601
Token another
Token ID1194
Feature activation-7.82e-02

Pos logit contributions
nard+0.892
tan+0.845
+0.844
usal+0.803
serving+0.791
Neg logit contributions
Barn-1.045
nder-1.034
stead-1.019
ð-0.980
phabet-0.953
Token player
Token ID2137
Feature activation+1.48e-02

Pos logit contributions
Barn+0.857
nder+0.842
stead+0.835
ð+0.799
phabet+0.767
Neg logit contributions
nard-0.792
-0.756
tan-0.753
INESS-0.716
usal-0.715
Token in
Token ID287
Feature activation+2.41e-01

Pos logit contributions
nard+0.151
+0.144
tan+0.144
usal+0.137
INESS+0.136
Neg logit contributions
Barn-0.161
nder-0.158
stead-0.156
ð-0.150
phabet-0.145
Token of
Token ID286
Feature activation-6.62e-02

Pos logit contributions
Barn+4.610
stead+4.572
nder+4.416
ð+4.228
phabet+4.052
Neg logit contributions
nard-5.961
-5.832
tan-5.820
INESS-5.768
serving-5.557
Token the
Token ID262
Feature activation-3.40e-01

Pos logit contributions
Barn+0.650
nder+0.636
stead+0.631
ð+0.600
phabet+0.574
Neg logit contributions
nard-0.746
-0.716
tan-0.713
INESS-0.684
usal-0.683
Token quarterback
Token ID7880
Feature activation+8.49e-02

Pos logit contributions
Barn+3.688
nder+3.602
stead+3.575
ð+3.351
say+3.219
Neg logit contributions
nard-3.481
-3.321
tan-3.288
INESS-3.210
usal-3.153
Token)
Token ID8
Feature activation+4.49e-02

Pos logit contributions
nard+0.953
+0.913
tan+0.905
usal+0.870
INESS+0.860
Neg logit contributions
Barn-0.835
nder-0.820
stead-0.806
ð-0.773
phabet-0.742
Token like
Token ID588
Feature activation-2.59e-01

Pos logit contributions
nard+0.466
+0.443
tan+0.443
usal+0.420
INESS+0.418
Neg logit contributions
Barn-0.483
nder-0.472
stead-0.468
ð-0.449
phabet-0.435
Token Cover
Token ID17546
Feature activation-8.25e-01

Pos logit contributions
Barn+2.424
nder+2.370
stead+2.346
ð+2.217
say+2.111
Neg logit contributions
nard-3.017
-2.912
tan-2.901
INESS-2.819
usal-2.776
Token 3
Token ID513
Feature activation-1.44e-01

Pos logit contributions
Barn+6.481
nder+6.335
stead+6.067
ð+5.903
say+5.383
Neg logit contributions
nard-10.255
INESS-10.163
-9.739
ongyang-9.394
usal-9.351
Token,
Token ID11
Feature activation-3.53e-01

Pos logit contributions
Barn+1.452
nder+1.423
stead+1.410
ð+1.337
say+1.288
Neg logit contributions
nard-1.596
-1.526
tan-1.518
INESS-1.460
usal-1.450
Token man
Token ID582
Feature activation-1.23e-01

Pos logit contributions
Barn+3.515
nder+3.437
stead+3.421
ð+3.230
say+3.125
Neg logit contributions
nard-3.883
-3.715
tan-3.679
INESS-3.593
minster-3.521
Token free
Token ID1479
Feature activation+1.21e-01

Pos logit contributions
Barn+1.549
nder+1.523
stead+1.518
ð+1.459
phabet+1.413
Neg logit contributions
nard-1.038
-0.986
tan-0.983
INESS-0.925
usal-0.916
Token,
Token ID11
Feature activation-4.35e-01

Pos logit contributions
nard+1.439
+1.382
tan+1.376
usal+1.322
INESS+1.313
Neg logit contributions
Barn-1.116
nder-1.092
stead-1.079
ð-1.023
phabet-0.981