TOP ACTIVATIONS
MAX = 12.132

dreams . Tom Vas el - The Dice Tower
to Marshal Ale ks andr Vas ile vs ky , the
use it to explore a proced urally generated universe
written by Ale ks andr Vas ile vs ky .[ 27
fiction In Peter Matth i essen 's 1978 travel
The Icelandic singer Bj ör k has composed and
hide details of the massive backlog they were am assing as
stealth game set in a proced urally generated city . This
Forest Park Village Attorney Tom Bast ian , advised the electoral
The traditionally designed stone mug costs 16 euros ($ 18
led immortal eld ritch mon stros ities year after year and
process . Contrary to numerous revolutionary workers for whom this was
n aps on a metal bench . A sales woman behind
, whose new series Van Hels ing is a female -
y ane Wade , Chris Bosh , Kevin Durant and others
tten 's enc ore in Minneapolis Sh aron Van
flight path towards approaching lunar surface with the high gain antenna
, TX from the east coast knowing all of five people
oked Hillary Clinton and Deborah Wasserman Schultz will not allow Bernie
that when confronted by a revolutionary movement like the M akh

MIN ACTIVATIONS
MIN = -10.529

flights a week hasn 't fost ered a deep sense of
the settings for Network ( Home / Office ) and enable
impl icit bias dogs police officers ( even African
past the point where the Dev ial et s
But the project has since pet ered out , and has
new Declaration of Principles which infl amed the " Old Guard
ts cars , allowing the canine to tro t across to
owned by Alex Jones that propag ates conspiracy theories like the
, as police investigate whether Ford s close friend
0 . 0 f ); RAW _ ASS ERT ( it
and public health should be fost ered ( 19 ). As
. ( Free ) [ Home page ] [ App Brain
effects and frames ) [ Home page ] [ App Brain
keepers , masc ots , dogs . A video
harassed , t rolled , dog p iled on , dumped
version removes ads ) [ Home page ] [ App Brain
app adds features ) [ Home page ] [ App Brain
thing , he has a dog , Angel , that might
, the video shows a stray dog standing patiently next to
to intrinsic value , it fost ers the creative power of

INTERVAL 6.466 - 12.132
CONTAINS 0.053%

like an inter min able stage whisper ( there are odd
US - UK occupation and civil war that claimed the lives
multi - store y rectangular podium . In the span of
Never Mind the Boll ocks , Here
s 01 . html . Jo iner T . Why people

INTERVAL 0.801 - 6.466
CONTAINS 25.631%

as those who perpetrated the horrific massacre in Har No f
Jerusalem , but will not go crazy
child sexual abuse content in Australia , the spokesperson
ique : Great hike , tough as nails though . The
, a Hispanic man of Puerto Rican heritage , was born

INTERVAL -4.864 - 0.801
CONTAINS 73.820%

actually start grading . Parents who had been thrilled with consistent
. 0 and Oklahoma Strong . In 2014 , The New
. Visit www . play st ations chool sc up .
earlier flashbacks to a super - p reg nant
These I do not wish to slay , even if

INTERVAL -10.529 - -4.864
CONTAINS 0.497%

for deployment to enable these techniques . Even without them I
- wall aby populations are genetically and morph ologically distinct and
, some North Korean authorities care about their international image .
thus shut up to the home supply ; from the works
our vendor has fine - tun ed all components so both
Token dreams
Token ID10625
Feature activation-1.63e+00

Pos logit contributions
alid+0.043
 Agency+0.043
vier+0.042
edin+0.042
cot+0.042
Neg logit contributions
erva-0.048
 depress-0.047
 regress-0.045
ック-0.044
riots-0.043
Token.
Token ID13
Feature activation+6.38e-01

Pos logit contributions
alid+1.108
vier+1.090
 Agency+1.087
cot+1.075
edin+1.070
Neg logit contributions
erva-1.223
 depress-1.176
 regress-1.134
ック-1.104
riots-1.091
Token
Token ID198
Feature activation+4.53e-01

Pos logit contributions
erva+0.490
 depress+0.480
 regress+0.462
ック+0.445
riots+0.442
Neg logit contributions
alid-0.427
 Agency-0.420
vier-0.418
edin-0.412
cot-0.408
Token
Token ID198
Feature activation+5.75e-01

Pos logit contributions
erva+0.347
 depress+0.339
 regress+0.325
ック+0.316
riots+0.311
Neg logit contributions
alid-0.305
vier-0.300
 Agency-0.299
edin-0.295
cot-0.292
TokenTom
Token ID13787
Feature activation+7.66e-01

Pos logit contributions
erva+0.481
 depress+0.472
 regress+0.456
ック+0.439
riots+0.436
Neg logit contributions
alid-0.345
vier-0.337
 Agency-0.334
edin-0.331
cot-0.329
Token Vas
Token ID23663
Feature activation+1.21e+01

Pos logit contributions
erva+0.546
 depress+0.533
 regress+0.510
ック+0.496
riots+0.486
Neg logit contributions
alid-0.553
vier-0.542
 Agency-0.538
edin-0.534
cot-0.530
Tokenel
Token ID417
Feature activation+8.12e-01

Pos logit contributions
erva+8.763
 depress+8.114
uries+7.733
 regress+7.719
riots+7.639
Neg logit contributions
 Agency-7.726
alid-7.548
Saharan-7.471
vier-7.463
edin-7.437
Token -
Token ID532
Feature activation-1.38e+00

Pos logit contributions
erva+0.622
 depress+0.610
 regress+0.585
ック+0.570
riots+0.561
Neg logit contributions
alid-0.543
vier-0.532
 Agency-0.527
edin-0.523
cot-0.519
Token The
Token ID383
Feature activation+1.30e-01

Pos logit contributions
alid+0.906
 Agency+0.895
vier+0.890
edin+0.877
cot+0.874
Neg logit contributions
erva-1.065
 depress-1.038
 regress-1.002
ック-0.968
riots-0.959
Token Dice
Token ID34381
Feature activation+2.47e+00

Pos logit contributions
erva+0.101
 depress+0.098
 regress+0.095
ック+0.092
riots+0.091
Neg logit contributions
alid-0.085
 Agency-0.084
vier-0.083
edin-0.082
cot-0.082
Token Tower
Token ID8765
Feature activation-1.15e+00

Pos logit contributions
erva+1.923
 depress+1.876
 regress+1.802
ック+1.759
riots+1.724
Neg logit contributions
alid-1.611
vier-1.576
 Agency-1.549
edin-1.544
cot-1.533
Token to
Token ID284
Feature activation+1.64e+00

Pos logit contributions
erva+3.510
 depress+3.451
 regress+3.310
ック+3.202
uries+3.161
Neg logit contributions
alid-2.799
vier-2.746
 Agency-2.691
edin-2.684
cot-2.680
Token Marshal
Token ID37899
Feature activation+2.50e+00

Pos logit contributions
erva+1.209
 depress+1.180
 regress+1.130
ック+1.101
riots+1.083
Neg logit contributions
alid-1.140
vier-1.117
 Agency-1.110
edin-1.100
cot-1.092
Token Ale
Token ID9300
Feature activation+3.18e+00

Pos logit contributions
erva+1.820
 depress+1.776
 regress+1.697
ック+1.655
riots+1.626
Neg logit contributions
alid-1.766
vier-1.732
 Agency-1.718
edin-1.706
cot-1.693
Tokenks
Token ID591
Feature activation+1.01e+00

Pos logit contributions
erva+2.487
 depress+2.408
 regress+2.312
ック+2.249
riots+2.234
Neg logit contributions
alid-2.078
 Agency-2.026
vier-2.015
edin-1.989
cot-1.965
Tokenandr
Token ID46273
Feature activation+6.11e-01

Pos logit contributions
erva+0.671
 depress+0.657
 regress+0.627
ック+0.610
riots+0.596
Neg logit contributions
alid-0.783
vier-0.765
 Agency-0.761
edin-0.758
cot-0.751
Token Vas
Token ID23663
Feature activation+1.19e+01

Pos logit contributions
erva+0.436
 depress+0.428
 regress+0.410
ック+0.398
uries+0.391
Neg logit contributions
alid-0.439
vier-0.430
 Agency-0.429
edin-0.425
cot-0.421
Tokenile
Token ID576
Feature activation+2.29e-01

Pos logit contributions
erva+8.258
 depress+7.804
uries+7.379
riots+7.195
 regress+7.172
Neg logit contributions
 Agency-7.899
vier-7.818
alid-7.657
Saharan-7.630
ovember-7.508
Tokenvs
Token ID14259
Feature activation+1.18e+00

Pos logit contributions
erva+0.217
 depress+0.213
 regress+0.206
ック+0.203
riots+0.199
Neg logit contributions
alid-0.110
vier-0.109
edin-0.106
 Agency-0.106
cot-0.105
Tokenky
Token ID2584
Feature activation-1.16e+00

Pos logit contributions
erva+1.116
 depress+1.094
 regress+1.058
ック+1.035
riots+1.023
Neg logit contributions
alid-0.579
vier-0.566
 Agency-0.557
edin-0.552
cot-0.545
Token,
Token ID11
Feature activation+8.41e-02

Pos logit contributions
alid+0.872
 Agency+0.855
vier+0.854
edin+0.842
cot+0.837
Neg logit contributions
erva-0.788
 depress-0.768
 regress-0.733
ック-0.714
riots-0.699
Token the
Token ID262
Feature activation-3.59e-01

Pos logit contributions
erva+0.068
 depress+0.066
 regress+0.064
ック+0.063
uries+0.062
Neg logit contributions
alid-0.052
 Agency-0.051
vier-0.051
edin-0.050
cot-0.050
Token use
Token ID779
Feature activation+2.79e+00

Pos logit contributions
erva+0.723
 depress+0.697
 regress+0.669
ック+0.662
riots+0.652
Neg logit contributions
alid-0.591
vier-0.578
 Agency-0.577
edin-0.571
cot-0.566
Token it
Token ID340
Feature activation+3.05e+00

Pos logit contributions
erva+2.093
 depress+2.051
 regress+1.961
ック+1.913
riots+1.878
Neg logit contributions
alid-1.902
vier-1.866
 Agency-1.847
edin-1.837
cot-1.822
Token to
Token ID284
Feature activation+5.12e-01

Pos logit contributions
erva+2.456
 depress+2.412
 regress+2.316
ック+2.259
riots+2.222
Neg logit contributions
alid-1.914
vier-1.871
 Agency-1.853
edin-1.838
cot-1.823
Token explore
Token ID7301
Feature activation+1.02e+00

Pos logit contributions
erva+0.422
 depress+0.407
 regress+0.391
ック+0.386
riots+0.381
Neg logit contributions
alid-0.315
vier-0.307
 Agency-0.307
edin-0.302
cot-0.301
Token a
Token ID257
Feature activation+8.27e-01

Pos logit contributions
erva+0.804
 depress+0.774
 regress+0.751
ック+0.734
riots+0.724
Neg logit contributions
alid-0.662
 Agency-0.654
vier-0.653
edin-0.634
cot-0.634
Token proced
Token ID5131
Feature activation+1.04e+01

Pos logit contributions
erva+0.682
 depress+0.657
 regress+0.637
ック+0.627
riots+0.620
Neg logit contributions
alid-0.508
 Agency-0.496
vier-0.495
edin-0.487
cot-0.484
Tokenurally
Token ID20221
Feature activation+4.48e+00

Pos logit contributions
erva+2.001
 depress+1.875
uries+1.501
 regress+1.484
riots+1.314
Neg logit contributions
 Agency-12.310
vier-12.185
cot-12.131
alid-12.068
edin-12.039
Token generated
Token ID7560
Feature activation+2.16e+00

Pos logit contributions
 depress+3.645
erva+3.615
 regress+3.483
ック+3.373
uries+3.316
Neg logit contributions
vier-2.658
alid-2.652
edin-2.624
 Agency-2.573
cot-2.569
Token universe
Token ID6881
Feature activation+3.61e+00

Pos logit contributions
erva+1.752
 depress+1.696
 regress+1.639
ック+1.607
riots+1.587
Neg logit contributions
alid-1.365
 Agency-1.333
vier-1.330
edin-1.306
cot-1.302
Token
Token ID198
Feature activation+9.10e-01

Pos logit contributions
erva+2.569
 depress+2.532
 regress+2.401
ック+2.339
riots+2.293
Neg logit contributions
alid-2.575
vier-2.533
 Agency-2.500
edin-2.492
cot-2.468
Token
Token ID198
Feature activation+9.27e-01

Pos logit contributions
erva+0.685
 depress+0.668
 regress+0.641
ック+0.623
riots+0.614
Neg logit contributions
alid-0.623
vier-0.611
 Agency-0.608
edin-0.601
cot-0.596
Token written
Token ID3194
Feature activation-1.87e-01

Pos logit contributions
alid+0.066
 Agency+0.065
vier+0.065
edin+0.064
cot+0.063
Neg logit contributions
erva-0.096
 depress-0.092
 regress-0.089
ック-0.088
riots-0.087
Token by
Token ID416
Feature activation+2.71e+00

Pos logit contributions
alid+0.105
vier+0.103
 Agency+0.102
edin+0.101
cot+0.100
Neg logit contributions
erva-0.162
 depress-0.158
 regress-0.153
ック-0.150
riots-0.148
Token Ale
Token ID9300
Feature activation+2.02e+00

Pos logit contributions
erva+2.020
 depress+1.976
 regress+1.890
ック+1.841
riots+1.810
Neg logit contributions
alid-1.870
vier-1.835
 Agency-1.815
edin-1.805
cot-1.792
Tokenks
Token ID591
Feature activation-3.52e-01

Pos logit contributions
erva+1.610
 depress+1.571
 regress+1.508
ック+1.473
riots+1.453
Neg logit contributions
alid-1.289
vier-1.258
 Agency-1.252
edin-1.238
cot-1.227
Tokenandr
Token ID46273
Feature activation-5.46e-01

Pos logit contributions
alid+0.284
 Agency+0.275
vier+0.275
edin+0.275
cot+0.271
Neg logit contributions
erva-0.222
 depress-0.219
 regress-0.209
ック-0.202
riots-0.197
Token Vas
Token ID23663
Feature activation+1.03e+01

Pos logit contributions
alid+0.396
 Agency+0.388
vier+0.388
edin+0.384
cot+0.378
Neg logit contributions
erva-0.385
 depress-0.379
 regress-0.365
ック-0.352
uries-0.347
Tokenile
Token ID576
Feature activation-5.51e-01

Pos logit contributions
erva+6.242
 depress+5.887
uries+5.519
 regress+5.472
riots+5.383
Neg logit contributions
 Agency-7.908
alid-7.843
vier-7.753
ovember-7.734
edin-7.725
Tokenvs
Token ID14259
Feature activation+2.75e-01

Pos logit contributions
vier+0.229
alid+0.227
cot+0.218
edin+0.216
 Agency+0.212
Neg logit contributions
erva-0.560
 depress-0.553
 regress-0.535
ック-0.527
 Julius-0.514
Tokenky
Token ID2584
Feature activation-1.81e+00

Pos logit contributions
erva+0.271
 depress+0.266
 regress+0.258
ック+0.254
riots+0.249
Neg logit contributions
alid-0.125
vier-0.121
 Agency-0.119
edin-0.118
cot-0.117
Token.[
Token ID3693
Feature activation-5.52e-01

Pos logit contributions
alid+1.288
 Agency+1.269
vier+1.262
cot+1.235
edin+1.235
Neg logit contributions
erva-1.294
 depress-1.258
 regress-1.210
ック-1.181
uries-1.168
Token27
Token ID1983
Feature activation+2.89e+00

Pos logit contributions
alid+0.349
vier+0.342
 Agency+0.339
edin+0.336
cot+0.333
Neg logit contributions
erva-0.443
 depress-0.433
 regress-0.416
ック-0.406
riots-0.400
Tokenfiction
Token ID24046
Feature activation-9.99e-01

Pos logit contributions
erva+0.652
 depress+0.636
 regress+0.611
ック+0.595
uries+0.584
Neg logit contributions
alid-0.570
vier-0.559
 Agency-0.555
edin-0.550
cot-0.549
Token
Token ID198
Feature activation+3.49e-01

Pos logit contributions
alid+0.701
 Agency+0.691
vier+0.691
edin+0.677
cot+0.676
Neg logit contributions
erva-0.730
 depress-0.708
 regress-0.680
ック-0.662
riots-0.652
Token
Token ID198
Feature activation+2.57e-01

Pos logit contributions
erva+0.266
 depress+0.260
 regress+0.249
ック+0.243
riots+0.239
Neg logit contributions
alid-0.236
vier-0.231
 Agency-0.231
edin-0.228
cot-0.226
TokenIn
Token ID818
Feature activation+1.66e-02

Pos logit contributions
erva+0.215
 depress+0.211
 regress+0.203
ック+0.197
uries+0.194
Neg logit contributions
alid-0.154
vier-0.151
 Agency-0.150
edin-0.148
cot-0.147
Token Peter
Token ID5613
Feature activation+3.42e+00

Pos logit contributions
erva+0.013
 depress+0.012
 regress+0.012
ック+0.012
riots+0.011
Neg logit contributions
alid-0.011
vier-0.011
 Agency-0.011
edin-0.011
cot-0.011
Token Matth
Token ID45524
Feature activation+1.01e+01

Pos logit contributions
erva+2.423
 depress+2.365
 regress+2.259
ック+2.199
riots+2.159
Neg logit contributions
alid-2.479
vier-2.432
 Agency-2.414
edin-2.394
cot-2.379
Tokeni
Token ID72
Feature activation+3.61e-01

Pos logit contributions
erva+7.435
 depress+7.350
asking+6.754
 regress+6.691
area+6.667
Neg logit contributions
 Agency-6.493
edin-6.423
vier-6.252
alid-6.227
Saharan-6.226
Tokenessen
Token ID44483
Feature activation-7.35e-01

Pos logit contributions
 depress+0.204
erva+0.204
 regress+0.190
ック+0.185
riots+0.185
Neg logit contributions
alid-0.312
vier-0.306
 Agency-0.304
edin-0.297
cot-0.295
Token's
Token ID338
Feature activation+5.21e-01

Pos logit contributions
alid+0.590
vier+0.580
 Agency+0.577
edin+0.573
cot+0.569
Neg logit contributions
erva-0.463
 depress-0.451
 regress-0.428
ック-0.415
riots-0.407
Token 1978
Token ID15524
Feature activation+4.15e+00

Pos logit contributions
erva+0.396
 depress+0.386
 regress+0.371
ック+0.362
riots+0.356
Neg logit contributions
alid-0.350
vier-0.344
 Agency-0.343
edin-0.338
cot-0.337
Token travel
Token ID3067
Feature activation+7.70e-01

Pos logit contributions
erva+3.190
 depress+3.149
 regress+3.006
ック+2.924
riots+2.870
Neg logit contributions
alid-2.733
vier-2.648
edin-2.617
 Agency-2.610
cot-2.586
Token
Token ID198
Feature activation+5.05e-01

Pos logit contributions
erva+0.172
 depress+0.169
 regress+0.162
ック+0.157
riots+0.155
Neg logit contributions
alid-0.153
 Agency-0.150
vier-0.150
edin-0.148
cot-0.146
Token
Token ID198
Feature activation+4.36e-01

Pos logit contributions
erva+0.383
 depress+0.374
 regress+0.358
ック+0.349
riots+0.344
Neg logit contributions
alid-0.343
vier-0.337
 Agency-0.335
edin-0.332
cot-0.328
TokenThe
Token ID464
Feature activation+5.58e-01

Pos logit contributions
erva+0.360
 depress+0.355
 regress+0.340
ック+0.331
uries+0.326
Neg logit contributions
alid-0.266
vier-0.259
 Agency-0.257
edin-0.255
cot-0.253
Token Icelandic
Token ID41959
Feature activation+1.40e+00

Pos logit contributions
erva+0.416
 depress+0.405
 regress+0.387
ック+0.379
riots+0.372
Neg logit contributions
alid-0.385
 Agency-0.378
vier-0.376
edin-0.370
cot-0.369
Token singer
Token ID14015
Feature activation+1.76e+00

Pos logit contributions
erva+1.056
 depress+1.028
 regress+0.985
ック+0.961
riots+0.946
Neg logit contributions
alid-0.957
 Agency-0.937
vier-0.936
edin-0.923
cot-0.915
Token Bj
Token ID34254
Feature activation+1.00e+01

Pos logit contributions
erva+1.246
 depress+1.218
 regress+1.163
ック+1.134
riots+1.113
Neg logit contributions
alid-1.286
vier-1.260
 Agency-1.255
edin-1.242
cot-1.232
Tokenör
Token ID30570
Feature activation+5.16e+00

Pos logit contributions
erva+6.298
 depress+5.782
 regress+5.424
uries+5.359
riots+5.357
Neg logit contributions
vier-7.692
 Agency-7.601
alid-7.482
Saharan-7.445
cot-7.408
Tokenk
Token ID74
Feature activation+1.72e+00

Pos logit contributions
erva+3.405
 depress+3.309
 regress+3.140
ック+3.046
riots+2.992
Neg logit contributions
alid-3.968
vier-3.888
 Agency-3.850
edin-3.845
cot-3.817
Token has
Token ID468
Feature activation-1.01e-01

Pos logit contributions
erva+1.371
 depress+1.341
 regress+1.289
ック+1.259
riots+1.242
Neg logit contributions
alid-1.102
vier-1.076
 Agency-1.073
edin-1.058
cot-1.049
Token composed
Token ID13160
Feature activation-3.15e-02

Pos logit contributions
alid+0.060
 Agency+0.059
vier+0.058
edin+0.058
cot+0.057
Neg logit contributions
erva-0.085
 depress-0.082
 regress-0.079
ック-0.078
riots-0.078
Token and
Token ID290
Feature activation-4.99e-01

Pos logit contributions
alid+0.022
 Agency+0.021
vier+0.021
edin+0.021
cot+0.021
Neg logit contributions
erva-0.023
 depress-0.022
 regress-0.021
ック-0.021
uries-0.021
Token hide
Token ID7808
Feature activation-7.19e+00

Pos logit contributions
erva+0.024
 depress+0.023
 regress+0.022
ック+0.022
riots+0.021
Neg logit contributions
alid-0.019
vier-0.019
 Agency-0.019
edin-0.019
cot-0.018
Token details
Token ID3307
Feature activation-2.92e+00

Pos logit contributions
 Agency+4.321
alid+4.186
vier+4.082
cot+4.014
edin+4.014
Neg logit contributions
erva-5.858
 depress-5.548
ック-5.466
riots-5.317
 regress-5.295
Token of
Token ID286
Feature activation-1.12e+00

Pos logit contributions
alid+1.721
 Agency+1.688
vier+1.674
edin+1.654
cot+1.651
Neg logit contributions
erva-2.463
 depress-2.360
 regress-2.294
ック-2.267
riots-2.241
Token the
Token ID262
Feature activation-5.14e-01

Pos logit contributions
alid+0.735
 Agency+0.723
vier+0.718
edin+0.705
cot+0.702
Neg logit contributions
erva-0.865
 depress-0.840
 regress-0.803
ック-0.798
riots-0.780
Token massive
Token ID4858
Feature activation+2.69e+00

Pos logit contributions
alid+0.336
 Agency+0.332
vier+0.326
edin+0.322
cot+0.321
Neg logit contributions
erva-0.403
 depress-0.391
 regress-0.373
ック-0.372
riots-0.364
Token backlog
Token ID38780
Feature activation+9.97e+00

Pos logit contributions
erva+2.206
 depress+2.165
 regress+2.086
ック+2.022
riots+1.998
Neg logit contributions
alid-1.651
vier-1.619
 Agency-1.589
edin-1.588
cot-1.571
Token they
Token ID484
Feature activation+8.40e-01

Pos logit contributions
 depress+7.422
erva+7.403
 regress+7.059
asking+6.832
uries+6.814
Neg logit contributions
vier-6.319
alid-6.150
ovember-6.117
cot-5.978
edin-5.938
Token were
Token ID547
Feature activation+3.80e-01

Pos logit contributions
erva+0.651
 depress+0.632
 regress+0.607
ック+0.596
riots+0.586
Neg logit contributions
alid-0.555
vier-0.543
 Agency-0.541
edin-0.536
cot-0.532
Token am
Token ID716
Feature activation+1.00e+00

Pos logit contributions
erva+0.310
 depress+0.300
 regress+0.290
ック+0.286
uries+0.281
Neg logit contributions
alid-0.236
 Agency-0.231
vier-0.230
edin-0.228
cot-0.226
Tokenassing
Token ID19696
Feature activation+1.07e+00

Pos logit contributions
erva+0.696
 depress+0.676
 regress+0.643
ック+0.625
riots+0.616
Neg logit contributions
alid-0.743
vier-0.732
 Agency-0.725
edin-0.718
cot-0.713
Token as
Token ID355
Feature activation+7.11e-01

Pos logit contributions
erva+0.787
 depress+0.767
 regress+0.735
ック+0.717
riots+0.705
Neg logit contributions
alid-0.752
vier-0.736
 Agency-0.733
edin-0.726
cot-0.721
Token stealth
Token ID19159
Feature activation-1.54e-01

Pos logit contributions
erva+1.449
 depress+1.404
 regress+1.350
ック+1.322
riots+1.299
Neg logit contributions
alid-1.361
 Agency-1.335
vier-1.329
edin-1.313
cot-1.302
Token game
Token ID983
Feature activation+2.41e+00

Pos logit contributions
alid+0.110
 Agency+0.108
vier+0.107
edin+0.106
cot+0.105
Neg logit contributions
erva-0.111
 depress-0.108
 regress-0.103
ック-0.101
uries-0.099
Token set
Token ID900
Feature activation+4.02e+00

Pos logit contributions
erva+1.884
 depress+1.848
 regress+1.770
ック+1.724
riots+1.698
Neg logit contributions
alid-1.571
vier-1.537
 Agency-1.520
edin-1.511
cot-1.500
Token in
Token ID287
Feature activation+8.83e-01

Pos logit contributions
erva+3.257
 depress+3.209
 regress+3.074
ック+3.006
riots+2.961
Neg logit contributions
alid-2.493
vier-2.434
 Agency-2.389
cot-2.369
edin-2.365
Token a
Token ID257
Feature activation+8.50e-01

Pos logit contributions
erva+0.669
 depress+0.648
 regress+0.625
ック+0.612
uries+0.599
Neg logit contributions
alid-0.599
 Agency-0.587
vier-0.585
edin-0.575
cot-0.572
Token proced
Token ID5131
Feature activation+9.90e+00

Pos logit contributions
erva+0.648
 depress+0.619
 regress+0.600
ック+0.595
riots+0.581
Neg logit contributions
alid-0.576
 Agency-0.567
vier-0.559
edin-0.553
cot-0.547
Tokenurally
Token ID20221
Feature activation+4.56e+00

Pos logit contributions
erva+1.938
 depress+1.795
 regress+1.463
uries+1.421
riots+1.266
Neg logit contributions
 Agency-11.746
vier-11.658
cot-11.657
alid-11.588
edin-11.520
Token generated
Token ID7560
Feature activation+1.75e+00

Pos logit contributions
 depress+3.724
erva+3.698
 regress+3.568
ック+3.450
uries+3.402
Neg logit contributions
alid-2.691
vier-2.684
edin-2.652
cot-2.610
 Agency-2.596
Token city
Token ID1748
Feature activation+3.05e+00

Pos logit contributions
erva+1.408
 depress+1.362
 regress+1.315
ック+1.290
riots+1.269
Neg logit contributions
alid-1.119
 Agency-1.093
vier-1.088
edin-1.072
cot-1.064
Token.
Token ID13
Feature activation+9.50e-01

Pos logit contributions
erva+2.202
 depress+2.158
 regress+2.060
ック+1.999
riots+1.969
Neg logit contributions
alid-2.161
vier-2.121
 Agency-2.102
edin-2.089
cot-2.074
Token This
Token ID770
Feature activation-1.45e+00

Pos logit contributions
erva+0.715
 depress+0.698
 regress+0.673
ック+0.650
riots+0.640
Neg logit contributions
alid-0.651
 Agency-0.643
vier-0.637
edin-0.629
cot-0.620
Token Forest
Token ID9115
Feature activation-3.85e+00

Pos logit contributions
erva+0.167
 depress+0.162
 regress+0.156
ック+0.153
riots+0.151
Neg logit contributions
alid-0.137
vier-0.135
 Agency-0.134
edin-0.132
cot-0.131
Token Park
Token ID3250
Feature activation+7.25e-01

Pos logit contributions
 Agency+2.378
alid+2.346
vier+2.292
edin+2.263
cot+2.261
Neg logit contributions
erva-3.111
 depress-3.026
 regress-2.923
ック-2.868
riots-2.858
Token Village
Token ID14812
Feature activation-5.94e-01

Pos logit contributions
erva+0.572
 depress+0.558
 regress+0.535
ック+0.525
riots+0.516
Neg logit contributions
alid-0.468
 Agency-0.460
vier-0.459
edin-0.452
cot-0.449
Token Attorney
Token ID8123
Feature activation+3.37e+00

Pos logit contributions
alid+0.371
 Agency+0.370
vier+0.363
edin+0.358
cot+0.356
Neg logit contributions
erva-0.482
 depress-0.465
 regress-0.448
ック-0.443
riots-0.434
Token Tom
Token ID4186
Feature activation+9.81e-01

Pos logit contributions
erva+2.147
 depress+2.097
 regress+1.989
ック+1.918
riots+1.887
Neg logit contributions
alid-2.662
vier-2.621
edin-2.584
 Agency-2.578
cot-2.569
Token Bast
Token ID17520
Feature activation+9.64e+00

Pos logit contributions
erva+0.670
 depress+0.654
 regress+0.624
ック+0.606
riots+0.595
Neg logit contributions
alid-0.737
vier-0.724
 Agency-0.719
edin-0.713
cot-0.709
Tokenian
Token ID666
Feature activation+2.77e+00

Pos logit contributions
erva+6.982
 depress+6.570
 regress+6.319
riots+6.198
asking+6.138
Neg logit contributions
 Agency-6.282
alid-6.150
Saharan-6.125
vier-6.068
edin-5.997
Token,
Token ID11
Feature activation+2.77e-01

Pos logit contributions
erva+2.268
 depress+2.219
 regress+2.131
ック+2.081
riots+2.055
Neg logit contributions
alid-1.696
vier-1.655
 Agency-1.646
edin-1.633
cot-1.614
Token advised
Token ID13030
Feature activation-2.09e+00

Pos logit contributions
erva+0.234
 depress+0.230
 regress+0.221
ック+0.216
riots+0.213
Neg logit contributions
alid-0.163
vier-0.159
 Agency-0.157
edin-0.156
cot-0.155
Token the
Token ID262
Feature activation-3.45e-01

Pos logit contributions
alid+1.383
 Agency+1.381
vier+1.350
edin+1.338
cot+1.329
Neg logit contributions
erva-1.598
 depress-1.546
 regress-1.486
ック-1.470
riots-1.443
Token electoral
Token ID13901
Feature activation+1.29e+00

Pos logit contributions
alid+0.215
 Agency+0.214
vier+0.210
cot+0.208
edin+0.207
Neg logit contributions
erva-0.279
 depress-0.272
 regress-0.261
ック-0.257
riots-0.254
Token
Token ID198
Feature activation+8.94e-02

Pos logit contributions
alid+0.082
vier+0.081
 Agency+0.081
edin+0.079
cot+0.079
Neg logit contributions
erva-0.095
 depress-0.093
 regress-0.089
ック-0.086
riots-0.086
TokenThe
Token ID464
Feature activation+7.40e-01

Pos logit contributions
erva+0.075
 depress+0.074
 regress+0.071
ック+0.068
uries+0.068
Neg logit contributions
alid-0.054
vier-0.053
 Agency-0.052
edin-0.052
cot-0.051
Token traditionally
Token ID16083
Feature activation-2.62e+00

Pos logit contributions
erva+0.564
 depress+0.545
 regress+0.529
ック+0.517
riots+0.510
Neg logit contributions
alid-0.493
 Agency-0.489
vier-0.487
edin-0.480
cot-0.478
Token designed
Token ID3562
Feature activation+2.08e-01

Pos logit contributions
 Agency+1.639
alid+1.637
vier+1.609
cot+1.590
edin+1.586
Neg logit contributions
erva-2.086
 depress-1.976
 regress-1.943
ック-1.913
riots-1.895
Token stone
Token ID7815
Feature activation-7.92e-01

Pos logit contributions
erva+0.157
 depress+0.150
 regress+0.147
ック+0.144
riots+0.143
Neg logit contributions
 Agency-0.139
alid-0.139
vier-0.138
cot-0.136
edin-0.136
Token mug
Token ID25152
Feature activation+9.44e+00

Pos logit contributions
alid+0.524
vier+0.523
 Agency+0.521
edin+0.515
cot+0.514
Neg logit contributions
erva-0.598
 depress-0.575
 regress-0.557
ック-0.548
riots-0.545
Token costs
Token ID3484
Feature activation+1.61e+00

Pos logit contributions
erva+6.780
 depress+6.734
 regress+6.299
uries+6.084
ック+6.058
Neg logit contributions
alid-6.466
vier-6.260
edin-6.171
 Agency-6.141
Saharan-6.085
Token 16
Token ID1467
Feature activation+6.19e-01

Pos logit contributions
erva+1.413
 depress+1.385
 regress+1.336
ック+1.307
riots+1.289
Neg logit contributions
alid-0.897
vier-0.875
 Agency-0.868
edin-0.858
cot-0.850
Token euros
Token ID21138
Feature activation-8.63e-01

Pos logit contributions
erva+0.475
 depress+0.465
 regress+0.449
ック+0.435
riots+0.429
Neg logit contributions
alid-0.410
vier-0.403
 Agency-0.401
edin-0.398
cot-0.396
Token ($
Token ID7198
Feature activation-6.11e-01

Pos logit contributions
alid+0.580
vier+0.572
 Agency+0.570
edin+0.562
cot+0.560
Neg logit contributions
erva-0.654
 depress-0.638
 regress-0.616
ック-0.599
riots-0.589
Token18
Token ID1507
Feature activation-3.04e-01

Pos logit contributions
alid+0.316
vier+0.310
 Agency+0.306
edin+0.303
cot+0.298
Neg logit contributions
erva-0.561
 depress-0.553
 regress-0.535
ック-0.518
riots-0.513
Tokenled
Token ID992
Feature activation-9.58e-01

Pos logit contributions
alid+0.131
vier+0.127
 Agency+0.126
edin+0.125
cot+0.123
Neg logit contributions
erva-0.223
 depress-0.219
 regress-0.211
ック-0.207
riots-0.204
Token immortal
Token ID26156
Feature activation+2.53e+00

Pos logit contributions
alid+0.563
 Agency+0.559
vier+0.542
edin+0.540
cot+0.533
Neg logit contributions
erva-0.806
 depress-0.773
 regress-0.746
ック-0.745
uries-0.729
Token eld
Token ID18441
Feature activation+6.51e+00

Pos logit contributions
erva+1.709
 depress+1.656
 regress+1.572
ック+1.532
riots+1.505
Neg logit contributions
alid-1.924
 Agency-1.891
vier-1.890
edin-1.862
cot-1.855
Tokenritch
Token ID46510
Feature activation+4.99e+00

Pos logit contributions
erva+4.761
 depress+4.644
 regress+4.415
uries+4.311
ック+4.288
Neg logit contributions
alid-4.475
 Agency-4.429
vier-4.417
cot-4.336
edin-4.319
Token mon
Token ID937
Feature activation+2.19e+00

Pos logit contributions
erva+3.230
 depress+3.180
 regress+3.033
ック+2.928
riots+2.864
Neg logit contributions
alid-3.903
vier-3.837
edin-3.790
 Agency-3.790
cot-3.758
Tokenstros
Token ID48288
Feature activation+9.43e+00

Pos logit contributions
erva+1.088
 depress+1.051
 regress+0.985
ック+0.946
riots+0.919
Neg logit contributions
alid-2.056
vier-2.025
 Agency-2.013
edin-2.000
cot-1.993
Tokenities
Token ID871
Feature activation+2.47e+00

Pos logit contributions
erva+5.601
 depress+5.298
 regress+5.049
riots+5.022
uries+5.007
Neg logit contributions
edin-7.600
 Agency-7.506
vier-7.484
alid-7.400
cot-7.280
Token year
Token ID614
Feature activation+6.52e-01

Pos logit contributions
erva+2.051
 depress+2.005
 regress+1.929
ック+1.888
riots+1.859
Neg logit contributions
alid-1.498
vier-1.464
 Agency-1.454
edin-1.438
cot-1.427
Token after
Token ID706
Feature activation+6.75e-01

Pos logit contributions
erva+0.529
 depress+0.518
 regress+0.498
ック+0.485
uries+0.480
Neg logit contributions
alid-0.406
vier-0.396
 Agency-0.393
edin-0.389
cot-0.386
Token year
Token ID614
Feature activation-1.85e-01

Pos logit contributions
erva+0.614
 depress+0.598
 regress+0.577
ック+0.572
riots+0.560
Neg logit contributions
alid-0.356
vier-0.346
 Agency-0.346
edin-0.341
cot-0.336
Token and
Token ID290
Feature activation-1.32e-01

Pos logit contributions
alid+0.134
 Agency+0.131
vier+0.131
edin+0.130
cot+0.130
Neg logit contributions
erva-0.132
 depress-0.127
 regress-0.121
ック-0.119
riots-0.117
Token process
Token ID1429
Feature activation+5.72e-01

Pos logit contributions
erva+1.625
 depress+1.587
 regress+1.521
ック+1.484
riots+1.459
Neg logit contributions
alid-1.461
vier-1.431
 Agency-1.422
edin-1.408
cot-1.398
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
erva+0.444
 depress+0.434
 regress+0.416
ック+0.406
riots+0.400
Neg logit contributions
alid-0.378
vier-0.369
 Agency-0.366
edin-0.363
cot-0.360
Token Contrary
Token ID43607
Feature activation+3.01e+00

Pos logit contributions
alid+0.190
vier+0.186
 Agency+0.186
edin+0.183
cot+0.182
Neg logit contributions
erva-0.231
 depress-0.227
 regress-0.218
ック-0.212
uries-0.209
Token to
Token ID284
Feature activation+1.32e+00

Pos logit contributions
erva+2.337
 depress+2.295
 regress+2.186
ック+2.113
riots+2.090
Neg logit contributions
alid-1.951
vier-1.909
 Agency-1.905
edin-1.897
cot-1.874
Token numerous
Token ID6409
Feature activation+3.63e+00

Pos logit contributions
erva+1.021
 depress+0.998
 regress+0.958
ック+0.934
riots+0.919
Neg logit contributions
alid-0.866
vier-0.848
 Agency-0.842
edin-0.834
cot-0.828
Token revolutionary
Token ID12253
Feature activation+8.79e+00

Pos logit contributions
erva+2.651
 depress+2.609
 regress+2.496
ック+2.412
riots+2.378
Neg logit contributions
alid-2.532
vier-2.487
 Agency-2.457
edin-2.455
cot-2.424
Token workers
Token ID3259
Feature activation+5.38e+00

Pos logit contributions
 depress+6.041
erva+5.945
 regress+5.803
riots+5.422
ック+5.367
Neg logit contributions
alid-6.237
vier-6.170
edin-6.142
 Agency-6.002
cot-5.953
Token for
Token ID329
Feature activation+1.44e+00

Pos logit contributions
erva+3.704
 depress+3.687
 regress+3.511
riots+3.356
ック+3.351
Neg logit contributions
alid-3.897
vier-3.826
edin-3.811
 Agency-3.803
cot-3.723
Token whom
Token ID4150
Feature activation-4.32e-01

Pos logit contributions
erva+1.123
 depress+1.098
 regress+1.054
ック+1.029
riots+1.012
Neg logit contributions
alid-0.941
vier-0.922
 Agency-0.914
edin-0.904
cot-0.900
Token this
Token ID428
Feature activation-2.57e+00

Pos logit contributions
alid+0.284
vier+0.278
 Agency+0.277
edin+0.272
cot+0.271
Neg logit contributions
erva-0.336
 depress-0.328
 regress-0.315
ック-0.309
uries-0.303
Token was
Token ID373
Feature activation-4.99e-01

Pos logit contributions
alid+1.746
 Agency+1.718
vier+1.704
edin+1.681
cot+1.667
Neg logit contributions
erva-1.924
 depress-1.853
ック-1.786
 regress-1.754
riots-1.717
Token n
Token ID299
Feature activation+1.43e+00

Pos logit contributions
erva+0.204
 depress+0.197
 regress+0.189
ック+0.188
riots+0.183
Neg logit contributions
alid-0.189
vier-0.186
 Agency-0.186
edin-0.183
cot-0.182
Tokenaps
Token ID1686
Feature activation-1.01e+00

Pos logit contributions
erva+1.137
 depress+1.113
 regress+1.074
ック+1.048
riots+1.028
Neg logit contributions
alid-0.913
vier-0.893
 Agency-0.883
edin-0.877
cot-0.872
Token on
Token ID319
Feature activation+2.65e-01

Pos logit contributions
alid+0.582
 Agency+0.579
vier+0.565
edin+0.561
cot+0.555
Neg logit contributions
erva-0.859
 depress-0.821
 regress-0.801
ック-0.799
riots-0.784
Token a
Token ID257
Feature activation+6.30e-01

Pos logit contributions
erva+0.203
 depress+0.197
 regress+0.190
ック+0.186
riots+0.183
Neg logit contributions
alid-0.177
 Agency-0.173
vier-0.173
edin-0.171
cot-0.170
Token metal
Token ID6147
Feature activation+1.08e+00

Pos logit contributions
erva+0.505
 depress+0.490
 regress+0.472
ック+0.464
riots+0.457
Neg logit contributions
alid-0.398
 Agency-0.389
vier-0.389
edin-0.383
cot-0.381
Token bench
Token ID7624
Feature activation+8.42e+00

Pos logit contributions
erva+0.882
 depress+0.862
 regress+0.829
ック+0.810
riots+0.799
Neg logit contributions
alid-0.672
vier-0.658
 Agency-0.653
edin-0.646
cot-0.641
Token.
Token ID13
Feature activation+5.89e-01

Pos logit contributions
 depress+6.138
erva+6.071
 regress+5.676
ック+5.502
asking+5.391
Neg logit contributions
alid-5.780
vier-5.621
edin-5.503
 Agency-5.501
cot-5.397
Token A
Token ID317
Feature activation+2.37e+00

Pos logit contributions
erva+0.449
 depress+0.439
 regress+0.421
ック+0.411
riots+0.404
Neg logit contributions
alid-0.395
vier-0.387
 Agency-0.384
edin-0.381
cot-0.378
Token sales
Token ID4200
Feature activation-3.18e+00

Pos logit contributions
erva+1.922
 depress+1.891
 regress+1.813
ック+1.765
riots+1.738
Neg logit contributions
alid-1.470
vier-1.436
 Agency-1.420
edin-1.410
cot-1.397
Tokenwoman
Token ID8580
Feature activation+4.83e-02

Pos logit contributions
alid+1.893
 Agency+1.883
vier+1.878
edin+1.845
cot+1.826
Neg logit contributions
erva-2.616
 depress-2.523
 regress-2.469
ック-2.452
riots-2.445
Token behind
Token ID2157
Feature activation-4.97e-01

Pos logit contributions
erva+0.035
 depress+0.034
 regress+0.033
ック+0.032
riots+0.032
Neg logit contributions
alid-0.034
vier-0.033
 Agency-0.033
edin-0.033
cot-0.033
Token,
Token ID11
Feature activation+6.77e-02

Pos logit contributions
erva+0.712
 depress+0.694
 regress+0.661
ック+0.642
riots+0.630
Neg logit contributions
alid-0.817
vier-0.802
 Agency-0.797
edin-0.791
cot-0.786
Token whose
Token ID3025
Feature activation-1.15e+00

Pos logit contributions
erva+0.055
 depress+0.054
 regress+0.052
ック+0.050
riots+0.050
Neg logit contributions
alid-0.042
vier-0.041
 Agency-0.041
edin-0.041
cot-0.040
Token new
Token ID649
Feature activation+2.92e+00

Pos logit contributions
alid+0.734
 Agency+0.729
vier+0.722
edin+0.709
cot+0.703
Neg logit contributions
erva-0.906
 depress-0.875
 regress-0.841
ック-0.833
riots-0.822
Token series
Token ID2168
Feature activation+2.27e+00

Pos logit contributions
erva+2.208
 depress+2.162
 regress+2.072
ック+2.014
uries+1.978
Neg logit contributions
alid-1.979
vier-1.935
 Agency-1.911
edin-1.905
cot-1.893
Token Van
Token ID6656
Feature activation+2.89e+00

Pos logit contributions
erva+1.719
 depress+1.682
 regress+1.611
ック+1.568
riots+1.542
Neg logit contributions
alid-1.541
vier-1.510
 Agency-1.494
edin-1.483
cot-1.474
Token Hels
Token ID33889
Feature activation+8.41e+00

Pos logit contributions
erva+2.355
 depress+2.305
 regress+2.215
ック+2.163
riots+2.132
Neg logit contributions
alid-1.789
vier-1.745
 Agency-1.731
edin-1.715
cot-1.703
Tokening
Token ID278
Feature activation+1.71e+00

Pos logit contributions
 depress+5.267
erva+5.251
 regress+4.979
ック+4.774
uries+4.721
Neg logit contributions
alid-6.511
vier-6.296
 Agency-6.234
edin-6.220
cot-6.111
Token is
Token ID318
Feature activation-3.15e-03

Pos logit contributions
erva+1.313
 depress+1.283
 regress+1.230
ック+1.200
riots+1.181
Neg logit contributions
alid-1.142
vier-1.118
 Agency-1.111
edin-1.100
cot-1.092
Token a
Token ID257
Feature activation+8.62e-02

Pos logit contributions
alid+0.002
 Agency+0.002
vier+0.002
edin+0.002
cot+0.002
Neg logit contributions
erva-0.003
 depress-0.002
 regress-0.002
ック-0.002
riots-0.002
Token female
Token ID4048
Feature activation-3.41e+00

Pos logit contributions
erva+0.067
 depress+0.064
 regress+0.062
ック+0.061
riots+0.061
Neg logit contributions
alid-0.057
 Agency-0.056
vier-0.055
cot-0.054
edin-0.054
Token-
Token ID12
Feature activation+1.86e-01

Pos logit contributions
 Agency+2.266
alid+2.247
vier+2.191
cot+2.148
edin+2.137
Neg logit contributions
erva-2.629
 depress-2.451
 regress-2.385
uries-2.350
riots-2.346
Tokeny
Token ID88
Feature activation+7.94e-01

Pos logit contributions
erva+2.608
 depress+2.525
 regress+2.422
riots+2.371
ック+2.344
Neg logit contributions
alid-2.451
vier-2.431
 Agency-2.420
edin-2.407
cot-2.354
Tokenane
Token ID1531
Feature activation+3.60e-01

Pos logit contributions
erva+0.383
 depress+0.365
 regress+0.342
ック+0.332
riots+0.317
Neg logit contributions
alid-0.763
vier-0.749
 Agency-0.748
edin-0.742
cot-0.736
Token Wade
Token ID19136
Feature activation+6.44e+00

Pos logit contributions
erva+0.176
 depress+0.168
 regress+0.157
ック+0.153
riots+0.149
Neg logit contributions
alid-0.342
 Agency-0.336
vier-0.336
edin-0.333
cot-0.330
Token,
Token ID11
Feature activation+4.92e-01

Pos logit contributions
erva+4.368
 depress+4.324
 regress+4.173
ック+3.983
asking+3.897
Neg logit contributions
alid-4.675
vier-4.563
 Agency-4.511
edin-4.497
cot-4.458
Token Chris
Token ID5180
Feature activation+6.05e+00

Pos logit contributions
erva+0.348
 depress+0.336
 regress+0.320
ック+0.315
uries+0.309
Neg logit contributions
alid-0.359
 Agency-0.354
vier-0.350
edin-0.347
cot-0.344
Token Bosh
Token ID42776
Feature activation+8.41e+00

Pos logit contributions
erva+2.220
 depress+2.024
 regress+1.910
uries+1.772
 Julius+1.738
Neg logit contributions
alid-6.420
vier-6.360
edin-6.300
cot-6.213
 Agency-6.178
Token,
Token ID11
Feature activation+7.18e-01

Pos logit contributions
erva+5.136
 depress+5.092
 regress+4.889
ック+4.630
uries+4.573
Neg logit contributions
alid-6.440
vier-6.356
 Agency-6.236
edin-6.197
cot-6.162
Token Kevin
Token ID7939
Feature activation+2.79e+00

Pos logit contributions
erva+0.464
 depress+0.447
 regress+0.424
ック+0.418
riots+0.408
Neg logit contributions
alid-0.568
 Agency-0.560
vier-0.554
edin-0.550
cot-0.545
Token Durant
Token ID27686
Feature activation+1.26e+00

Pos logit contributions
erva+1.659
 depress+1.612
 regress+1.527
ック+1.476
riots+1.442
Neg logit contributions
alid-2.342
vier-2.307
 Agency-2.287
edin-2.277
cot-2.263
Token and
Token ID290
Feature activation-5.14e-01

Pos logit contributions
erva+0.859
 depress+0.840
 regress+0.801
ック+0.776
uries+0.762
Neg logit contributions
alid-0.944
vier-0.926
 Agency-0.919
edin-0.913
cot-0.907
Token others
Token ID1854
Feature activation+7.85e-02

Pos logit contributions
 Agency+0.345
alid+0.343
vier+0.336
edin+0.330
cot+0.329
Neg logit contributions
erva-0.393
 depress-0.376
ック-0.362
 regress-0.358
riots-0.351
Tokentten
Token ID32407
Feature activation+1.12e+00

Pos logit contributions
erva+2.997
 depress+2.936
 regress+2.837
ック+2.787
uries+2.751
Neg logit contributions
alid-1.425
vier-1.379
 Agency-1.364
edin-1.337
cot-1.316
Token's
Token ID338
Feature activation+1.45e+00

Pos logit contributions
erva+0.781
 depress+0.764
 regress+0.729
ック+0.709
riots+0.695
Neg logit contributions
alid-0.830
vier-0.816
 Agency-0.812
edin-0.803
cot-0.801
Token enc
Token ID2207
Feature activation+1.64e+00

Pos logit contributions
erva+1.134
 depress+1.106
 regress+1.062
ック+1.040
riots+1.022
Neg logit contributions
alid-0.947
vier-0.926
 Agency-0.925
edin-0.912
cot-0.908
Tokenore
Token ID382
Feature activation+4.27e-01

Pos logit contributions
erva+1.529
 depress+1.497
 regress+1.448
ック+1.411
riots+1.401
Neg logit contributions
alid-0.828
vier-0.806
 Agency-0.799
edin-0.793
cot-0.781
Token in
Token ID287
Feature activation+7.60e-01

Pos logit contributions
erva+0.334
 depress+0.326
 regress+0.314
ック+0.308
riots+0.301
Neg logit contributions
alid-0.277
 Agency-0.271
vier-0.270
edin-0.267
cot-0.266
Token Minneapolis
Token ID20455
Feature activation+8.35e+00

Pos logit contributions
erva+0.567
 depress+0.551
 regress+0.528
ック+0.519
riots+0.509
Neg logit contributions
alid-0.523
 Agency-0.516
vier-0.513
edin-0.505
cot-0.504
Token
Token ID198
Feature activation+2.46e-01

Pos logit contributions
erva+6.187
 depress+6.065
 regress+5.817
uries+5.550
ック+5.548
Neg logit contributions
alid-5.494
vier-5.431
edin-5.243
Saharan-5.204
 Agency-5.185
Token
Token ID198
Feature activation+2.64e-01

Pos logit contributions
erva+0.188
 depress+0.184
 regress+0.177
ック+0.171
riots+0.169
Neg logit contributions
alid-0.165
vier-0.162
 Agency-0.161
edin-0.159
cot-0.158
TokenSh
Token ID2484
Feature activation-1.18e+00

Pos logit contributions
erva+0.218
 depress+0.215
 regress+0.206
ック+0.200
uries+0.197
Neg logit contributions
alid-0.161
vier-0.157
 Agency-0.155
edin-0.154
cot-0.153
Tokenaron
Token ID8045
Feature activation+1.34e+00

Pos logit contributions
alid+0.853
vier+0.821
edin+0.814
cot+0.811
 Agency+0.807
Neg logit contributions
erva-0.845
 depress-0.843
 regress-0.815
ック-0.790
riots-0.759
Token Van
Token ID6656
Feature activation+3.90e+00

Pos logit contributions
erva+0.922
 depress+0.900
 regress+0.860
ック+0.835
riots+0.819
Neg logit contributions
alid-0.995
vier-0.977
 Agency-0.972
edin-0.962
cot-0.955
Token flight
Token ID5474
Feature activation+1.38e-01

Pos logit contributions
alid+0.690
 Agency+0.676
vier+0.663
cot+0.658
edin+0.657
Neg logit contributions
erva-0.906
 depress-0.876
 regress-0.854
ック-0.851
riots-0.837
Token path
Token ID3108
Feature activation+1.54e+00

Pos logit contributions
erva+0.119
 depress+0.116
 regress+0.112
ック+0.111
riots+0.110
Neg logit contributions
alid-0.080
 Agency-0.077
vier-0.076
edin-0.076
cot-0.075
Token towards
Token ID3371
Feature activation-2.15e+00

Pos logit contributions
erva+1.195
 depress+1.167
 regress+1.120
ック+1.094
riots+1.077
Neg logit contributions
alid-1.021
vier-0.999
 Agency-0.993
edin-0.983
cot-0.976
Token approaching
Token ID13885
Feature activation+5.04e-01

Pos logit contributions
alid+1.375
 Agency+1.357
vier+1.326
edin+1.317
cot+1.309
Neg logit contributions
erva-1.705
 depress-1.643
 regress-1.601
ック-1.570
riots-1.547
Token lunar
Token ID25572
Feature activation+4.13e+00

Pos logit contributions
erva+0.398
 depress+0.383
 regress+0.372
ック+0.365
riots+0.359
Neg logit contributions
alid-0.324
 Agency-0.321
vier-0.315
edin-0.312
cot-0.310
Token surface
Token ID4417
Feature activation+8.32e+00

Pos logit contributions
erva+3.072
 depress+3.021
 regress+2.886
ック+2.808
uries+2.753
Neg logit contributions
alid-2.838
vier-2.792
 Agency-2.752
edin-2.743
cot-2.716
Token with
Token ID351
Feature activation+2.32e+00

Pos logit contributions
 depress+5.724
erva+5.646
 regress+5.360
ック+5.136
uries+5.060
Neg logit contributions
alid-6.064
vier-5.938
edin-5.878
cot-5.763
 Agency-5.760
Token the
Token ID262
Feature activation-3.75e-02

Pos logit contributions
erva+1.784
 depress+1.762
 regress+1.677
ック+1.621
uries+1.597
Neg logit contributions
alid-1.530
vier-1.513
edin-1.487
 Agency-1.483
cot-1.471
Token high
Token ID1029
Feature activation+3.12e-01

Pos logit contributions
alid+0.022
 Agency+0.022
vier+0.022
edin+0.021
cot+0.021
Neg logit contributions
erva-0.031
 depress-0.030
 regress-0.029
ック-0.029
riots-0.029
Token gain
Token ID4461
Feature activation-6.66e-01

Pos logit contributions
erva+0.237
 depress+0.231
 regress+0.222
ック+0.217
riots+0.214
Neg logit contributions
alid-0.210
 Agency-0.205
vier-0.204
edin-0.202
cot-0.201
Token antenna
Token ID20509
Feature activation+1.01e+00

Pos logit contributions
alid+0.415
 Agency+0.406
vier+0.399
edin+0.397
cot+0.394
Neg logit contributions
erva-0.541
 depress-0.524
 regress-0.507
ック-0.497
riots-0.495
Token,
Token ID11
Feature activation+7.27e-01

Pos logit contributions
erva+4.398
 depress+4.299
 regress+4.080
ック+3.929
riots+3.897
Neg logit contributions
alid-4.364
vier-4.280
edin-4.200
 Agency-4.185
cot-4.168
Token TX
Token ID15326
Feature activation+6.92e+00

Pos logit contributions
erva+0.498
 depress+0.486
 regress+0.464
ック+0.452
riots+0.443
Neg logit contributions
alid-0.544
vier-0.533
 Agency-0.531
edin-0.526
cot-0.523
Token from
Token ID422
Feature activation+9.46e-01

Pos logit contributions
erva+4.992
 depress+4.819
 regress+4.592
ック+4.406
asking+4.384
Neg logit contributions
alid-4.783
vier-4.689
edin-4.600
 Agency-4.575
cot-4.551
Token the
Token ID262
Feature activation+7.81e-01

Pos logit contributions
erva+0.714
 depress+0.689
 regress+0.662
ック+0.652
riots+0.639
Neg logit contributions
alid-0.645
 Agency-0.634
vier-0.633
edin-0.625
cot-0.619
Token east
Token ID7627
Feature activation-3.07e+00

Pos logit contributions
erva+0.667
 depress+0.643
 regress+0.622
ック+0.614
riots+0.604
Neg logit contributions
alid-0.456
 Agency-0.450
vier-0.445
edin-0.439
cot-0.434
Token coast
Token ID7051
Feature activation+8.29e+00

Pos logit contributions
alid+1.578
 Agency+1.549
vier+1.535
cot+1.506
edin+1.499
Neg logit contributions
erva-2.798
 depress-2.716
 regress-2.645
ック-2.596
uries-2.585
Token knowing
Token ID6970
Feature activation-1.74e+00

Pos logit contributions
erva+6.051
 depress+6.014
 regress+5.668
ック+5.468
riots+5.434
Neg logit contributions
alid-5.547
vier-5.399
cot-5.363
edin-5.293
Saharan-5.230
Token all
Token ID477
Feature activation+3.53e+00

Pos logit contributions
alid+1.060
 Agency+1.054
vier+1.033
edin+1.021
cot+1.004
Neg logit contributions
erva-1.442
 depress-1.364
ック-1.328
 regress-1.325
uries-1.298
Token of
Token ID286
Feature activation+5.07e-01

Pos logit contributions
erva+2.968
 depress+2.952
 regress+2.828
ック+2.744
riots+2.712
Neg logit contributions
alid-2.063
vier-2.015
edin-1.981
 Agency-1.970
cot-1.946
Token five
Token ID1936
Feature activation-1.09e+00

Pos logit contributions
erva+0.392
 depress+0.382
 regress+0.367
ック+0.359
riots+0.353
Neg logit contributions
alid-0.335
vier-0.328
 Agency-0.327
edin-0.322
cot-0.321
Token people
Token ID661
Feature activation+2.08e+00

Pos logit contributions
alid+0.710
 Agency+0.698
vier+0.686
edin+0.677
cot+0.674
Neg logit contributions
erva-0.858
 depress-0.822
 regress-0.788
ック-0.777
riots-0.766
Tokenoked
Token ID6545
Feature activation+2.00e+00

Pos logit contributions
alid+1.146
vier+1.133
edin+1.117
 Agency+1.116
cot+1.111
Neg logit contributions
erva-1.523
 depress-1.482
 regress-1.431
ック-1.404
uries-1.379
Token Hillary
Token ID5073
Feature activation+7.56e-01

Pos logit contributions
erva+1.411
 depress+1.377
 regress+1.316
ック+1.280
riots+1.256
Neg logit contributions
alid-1.464
vier-1.437
 Agency-1.427
edin-1.415
cot-1.406
Token Clinton
Token ID2605
Feature activation-2.56e-01

Pos logit contributions
erva+0.597
 depress+0.581
 regress+0.559
ック+0.547
uries+0.537
Neg logit contributions
alid-0.488
vier-0.479
 Agency-0.477
edin-0.471
cot-0.468
Token and
Token ID290
Feature activation-3.80e-01

Pos logit contributions
alid+0.154
vier+0.151
 Agency+0.151
edin+0.149
cot+0.148
Neg logit contributions
erva-0.213
 depress-0.206
 regress-0.199
ック-0.196
riots-0.192
Token Deborah
Token ID36976
Feature activation+1.98e+00

Pos logit contributions
alid+0.245
 Agency+0.245
vier+0.240
cot+0.238
edin+0.237
Neg logit contributions
erva-0.297
 depress-0.288
ック-0.277
 regress-0.277
uries-0.267
Token Wasserman
Token ID38997
Feature activation+8.28e+00

Pos logit contributions
erva+1.295
 depress+1.260
 regress+1.199
ック+1.165
riots+1.142
Neg logit contributions
alid-1.543
vier-1.517
 Agency-1.510
edin-1.499
cot-1.487
Token Schultz
Token ID26530
Feature activation+3.46e+00

Pos logit contributions
erva+4.537
 depress+4.456
 regress+4.142
ック+4.054
riots+3.898
Neg logit contributions
alid-7.112
vier-6.935
edin-6.916
 Agency-6.825
ovember-6.818
Token will
Token ID481
Feature activation+1.06e+00

Pos logit contributions
erva+2.599
 depress+2.555
 regress+2.444
ック+2.368
riots+2.335
Neg logit contributions
alid-2.353
vier-2.300
 Agency-2.275
edin-2.261
cot-2.239
Token not
Token ID407
Feature activation+6.55e-01

Pos logit contributions
erva+0.856
 depress+0.826
 regress+0.794
ック+0.785
riots+0.773
Neg logit contributions
alid-0.669
vier-0.653
 Agency-0.652
edin-0.646
cot-0.643
Token allow
Token ID1249
Feature activation-9.81e-02

Pos logit contributions
erva+0.528
 depress+0.507
 regress+0.486
ック+0.485
riots+0.477
Neg logit contributions
alid-0.409
vier-0.401
 Agency-0.401
edin-0.399
cot-0.398
Token Bernie
Token ID10477
Feature activation+1.71e+00

Pos logit contributions
alid+0.063
 Agency+0.063
vier+0.062
cot+0.061
edin+0.061
Neg logit contributions
erva-0.076
 depress-0.074
 regress-0.071
ック-0.071
uries-0.069
Token that
Token ID326
Feature activation+3.88e-01

Pos logit contributions
erva+0.700
 depress+0.684
 regress+0.657
ック+0.641
riots+0.630
Neg logit contributions
alid-0.600
vier-0.587
 Agency-0.583
edin-0.578
cot-0.573
Token when
Token ID618
Feature activation-1.26e+00

Pos logit contributions
erva+0.300
 depress+0.291
 regress+0.280
ック+0.274
riots+0.270
Neg logit contributions
alid-0.257
 Agency-0.252
vier-0.251
edin-0.248
cot-0.248
Token confronted
Token ID16675
Feature activation+3.40e+00

Pos logit contributions
alid+0.845
 Agency+0.832
vier+0.825
cot+0.817
edin+0.815
Neg logit contributions
erva-0.954
 depress-0.924
 regress-0.895
ック-0.871
riots-0.860
Token by
Token ID416
Feature activation+1.81e+00

Pos logit contributions
erva+2.721
 depress+2.699
 regress+2.586
ック+2.513
riots+2.466
Neg logit contributions
alid-2.105
vier-2.086
 Agency-2.056
edin-2.028
cot-2.013
Token a
Token ID257
Feature activation+1.90e-01

Pos logit contributions
erva+1.358
 depress+1.325
 regress+1.269
ック+1.238
riots+1.217
Neg logit contributions
alid-1.246
vier-1.221
 Agency-1.213
edin-1.201
cot-1.193
Token revolutionary
Token ID12253
Feature activation+8.22e+00

Pos logit contributions
erva+0.145
 depress+0.140
 regress+0.134
ック+0.132
riots+0.130
Neg logit contributions
alid-0.127
vier-0.125
 Agency-0.125
edin-0.123
cot-0.122
Token movement
Token ID3356
Feature activation+4.16e+00

Pos logit contributions
 depress+6.044
erva+5.919
 regress+5.815
 Julius+5.472
riots+5.428
Neg logit contributions
alid-5.454
vier-5.386
edin-5.224
cot-5.212
ovember-5.176
Token like
Token ID588
Feature activation+2.64e+00

Pos logit contributions
erva+3.255
 depress+3.214
 regress+3.079
ック+2.986
riots+2.939
Neg logit contributions
alid-2.686
vier-2.619
 Agency-2.597
edin-2.580
cot-2.554
Token the
Token ID262
Feature activation+8.18e-02

Pos logit contributions
erva+2.062
 depress+2.024
 regress+1.939
ック+1.890
riots+1.864
Neg logit contributions
alid-1.721
vier-1.691
 Agency-1.668
edin-1.658
cot-1.640
Token M
Token ID337
Feature activation+2.44e+00

Pos logit contributions
erva+0.063
 depress+0.062
 regress+0.059
ック+0.058
uries+0.057
Neg logit contributions
alid-0.054
 Agency-0.053
vier-0.053
edin-0.052
cot-0.052
Tokenakh
Token ID11322
Feature activation-1.29e+00

Pos logit contributions
erva+2.023
 depress+1.985
 regress+1.907
ック+1.864
riots+1.841
Neg logit contributions
alid-1.477
vier-1.446
 Agency-1.431
edin-1.415
cot-1.403
Token flights
Token ID13956
Feature activation-7.83e-02

Pos logit contributions
alid+0.007
vier+0.007
 Agency+0.007
edin+0.007
cot+0.007
Neg logit contributions
erva-0.009
 depress-0.008
 regress-0.008
ック-0.008
riots-0.008
Token a
Token ID257
Feature activation-2.52e-01

Pos logit contributions
alid+0.048
vier+0.047
 Agency+0.047
edin+0.046
cot+0.046
Neg logit contributions
erva-0.064
 depress-0.062
 regress-0.060
ック-0.059
riots-0.058
Token week
Token ID1285
Feature activation-2.83e+00

Pos logit contributions
alid+0.148
vier+0.144
 Agency+0.144
edin+0.142
cot+0.141
Neg logit contributions
erva-0.213
 depress-0.207
 regress-0.201
ック-0.197
riots-0.194
Token hasn
Token ID5818
Feature activation-2.19e+00

Pos logit contributions
alid+1.738
 Agency+1.703
vier+1.688
edin+1.671
cot+1.660
Neg logit contributions
erva-2.307
 depress-2.198
 regress-2.146
ック-2.131
riots-2.071
Token't
Token ID470
Feature activation+1.66e-01

Pos logit contributions
alid+0.999
vier+0.985
 Agency+0.980
edin+0.969
cot+0.937
Neg logit contributions
erva-2.160
 depress-2.105
 regress-2.039
ック-1.984
uries-1.972
Token fost
Token ID26546
Feature activation-1.05e+01

Pos logit contributions
erva+0.138
 depress+0.131
ック+0.129
 regress+0.128
riots+0.125
Neg logit contributions
alid-0.101
vier-0.098
 Agency-0.098
edin-0.097
cot-0.097
Tokenered
Token ID1068
Feature activation-3.65e+00

Pos logit contributions
cot+3.208
edin+3.192
alid+3.102
vier+2.937
eyed+2.747
Neg logit contributions
 depress-11.104
ック-11.029
 regress-10.995
erva-10.957
��-10.696
Token a
Token ID257
Feature activation-1.85e-01

Pos logit contributions
 Agency+2.225
alid+2.211
cot+2.172
vier+2.162
edin+2.115
Neg logit contributions
erva-2.944
 depress-2.771
ック-2.743
 regress-2.707
uries-2.674
Token deep
Token ID2769
Feature activation-8.07e-01

Pos logit contributions
alid+0.116
 Agency+0.114
vier+0.114
edin+0.112
cot+0.112
Neg logit contributions
erva-0.150
 depress-0.144
 regress-0.140
ック-0.138
riots-0.136
Token sense
Token ID2565
Feature activation-7.24e-01

Pos logit contributions
alid+0.517
 Agency+0.510
vier+0.506
cot+0.502
edin+0.498
Neg logit contributions
erva-0.637
 depress-0.613
 regress-0.594
ック-0.587
riots-0.579
Token of
Token ID286
Feature activation-1.93e-02

Pos logit contributions
alid+0.408
vier+0.399
 Agency+0.396
edin+0.391
cot+0.390
Neg logit contributions
erva-0.628
 depress-0.620
 regress-0.593
ック-0.583
riots-0.571
Token the
Token ID262
Feature activation-2.03e-01

Pos logit contributions
alid+0.128
 Agency+0.126
vier+0.125
edin+0.124
cot+0.122
Neg logit contributions
erva-0.175
 depress-0.169
 regress-0.163
ック-0.160
riots-0.159
Token settings
Token ID6460
Feature activation-4.02e+00

Pos logit contributions
alid+0.135
vier+0.132
 Agency+0.132
edin+0.130
cot+0.129
Neg logit contributions
erva-0.157
 depress-0.153
 regress-0.147
ック-0.143
riots-0.141
Token for
Token ID329
Feature activation-2.39e-01

Pos logit contributions
alid+2.465
 Agency+2.433
edin+2.374
vier+2.374
cot+2.367
Neg logit contributions
erva-3.243
 depress-3.123
ック-3.002
 regress-2.996
riots-2.955
Token Network
Token ID7311
Feature activation-4.14e+00

Pos logit contributions
alid+0.154
vier+0.151
 Agency+0.151
edin+0.148
cot+0.147
Neg logit contributions
erva-0.189
 depress-0.184
 regress-0.176
ック-0.174
riots-0.171
Token (
Token ID357
Feature activation-1.43e+00

Pos logit contributions
alid+2.601
 Agency+2.594
edin+2.537
vier+2.514
cot+2.474
Neg logit contributions
erva-3.301
 depress-3.177
 regress-3.083
ック-3.001
riots-2.983
TokenHome
Token ID16060
Feature activation-9.79e+00

Pos logit contributions
alid+0.895
vier+0.874
 Agency+0.870
edin+0.865
cot+0.859
Neg logit contributions
erva-1.148
 depress-1.122
 regress-1.080
ック-1.053
uries-1.033
Token/
Token ID14
Feature activation-3.73e-01

Pos logit contributions
 Agency+5.566
alid+5.542
vier+5.322
cot+5.300
edin+5.280
Neg logit contributions
erva-7.850
 depress-7.707
 regress-7.530
ック-7.411
riots-7.201
TokenOffice
Token ID27743
Feature activation-2.49e+00

Pos logit contributions
alid+0.210
vier+0.205
 Agency+0.204
edin+0.201
cot+0.199
Neg logit contributions
erva-0.325
 depress-0.318
 regress-0.307
ック-0.300
riots-0.295
Token)
Token ID8
Feature activation-7.64e-01

Pos logit contributions
alid+1.423
 Agency+1.402
vier+1.388
edin+1.367
cot+1.359
Neg logit contributions
erva-2.127
 depress-2.077
 regress-2.003
ック-1.974
riots-1.937
Token and
Token ID290
Feature activation-1.84e+00

Pos logit contributions
alid+0.472
 Agency+0.463
vier+0.461
edin+0.455
cot+0.451
Neg logit contributions
erva-0.623
 depress-0.607
 regress-0.583
ック-0.573
riots-0.564
Token enable
Token ID7139
Feature activation-5.13e-01

Pos logit contributions
 Agency+1.070
alid+1.064
vier+1.048
edin+1.037
cot+1.028
Neg logit contributions
erva-1.568
 depress-1.513
 regress-1.460
ック-1.445
riots-1.421
Tokenimpl
Token ID23928
Feature activation-2.97e+00

Pos logit contributions
erva+0.203
 depress+0.198
 regress+0.190
ック+0.185
uries+0.180
Neg logit contributions
alid-0.187
vier-0.182
 Agency-0.181
edin-0.180
cot-0.180
Tokenicit
Token ID3628
Feature activation-1.96e+00

Pos logit contributions
alid+1.621
edin+1.558
vier+1.558
 Agency+1.528
cot+1.504
Neg logit contributions
erva-2.634
 depress-2.585
 regress-2.460
ック-2.425
riots-2.370
Token bias
Token ID10690
Feature activation+7.18e-01

Pos logit contributions
alid+1.346
 Agency+1.336
edin+1.304
vier+1.304
cot+1.300
Neg logit contributions
erva-1.460
 depress-1.406
 regress-1.334
ック-1.313
riots-1.310
Token
Token ID447
Feature activation+3.19e-01

Pos logit contributions
erva+0.597
 depress+0.584
 regress+0.561
ック+0.549
riots+0.541
Neg logit contributions
alid-0.433
vier-0.424
 Agency-0.420
edin-0.417
cot-0.413
Token
Token ID251
Feature activation+5.08e-01

Pos logit contributions
erva+0.161
 depress+0.159
 regress+0.148
ック+0.141
uries+0.139
Neg logit contributions
alid-0.294
vier-0.289
 Agency-0.287
edin-0.287
cot-0.285
Token dogs
Token ID6844
Feature activation-9.49e+00

Pos logit contributions
erva+0.403
 depress+0.393
 regress+0.377
ック+0.369
riots+0.363
Neg logit contributions
alid-0.325
vier-0.319
 Agency-0.317
edin-0.314
cot-0.312
Token police
Token ID1644
Feature activation+4.86e-01

Pos logit contributions
 Agency+5.844
cot+5.768
vier+5.711
alid+5.686
edin+5.604
Neg logit contributions
erva-7.039
 depress-6.550
ック-6.494
riots-6.286
 regress-6.252
Token officers
Token ID3790
Feature activation+1.09e+00

Pos logit contributions
erva+0.354
 depress+0.345
 regress+0.328
ック+0.323
riots+0.316
Neg logit contributions
alid-0.342
vier-0.337
 Agency-0.336
edin-0.332
cot-0.330
Token (
Token ID357
Feature activation-4.76e-01

Pos logit contributions
erva+0.782
 depress+0.764
 regress+0.731
ック+0.711
riots+0.698
Neg logit contributions
alid-0.779
vier-0.763
 Agency-0.758
edin-0.752
cot-0.746
Tokeneven
Token ID10197
Feature activation+3.43e+00

Pos logit contributions
alid+0.314
vier+0.307
 Agency+0.304
cot+0.303
edin+0.303
Neg logit contributions
erva-0.369
 depress-0.360
 regress-0.346
ック-0.334
uries-0.329
Token African
Token ID5510
Feature activation-1.44e+00

Pos logit contributions
erva+2.367
 depress+2.319
 regress+2.209
ック+2.143
uries+2.102
Neg logit contributions
alid-2.544
vier-2.491
 Agency-2.470
edin-2.454
cot-2.440
Token past
Token ID1613
Feature activation-1.44e+00

Pos logit contributions
erva+0.870
 depress+0.855
 regress+0.825
ック+0.808
riots+0.796
Neg logit contributions
alid-0.503
vier-0.489
 Agency-0.484
edin-0.478
cot-0.474
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
alid+0.892
 Agency+0.877
vier+0.876
cot+0.862
edin+0.850
Neg logit contributions
erva-1.174
 depress-1.127
 regress-1.087
riots-1.069
ック-1.065
Token point
Token ID966
Feature activation+3.46e+00

Pos logit contributions
erva+0.022
 depress+0.021
 regress+0.020
ック+0.020
riots+0.020
Neg logit contributions
alid-0.016
 Agency-0.016
vier-0.016
edin-0.016
cot-0.016
Token where
Token ID810
Feature activation-9.32e-01

Pos logit contributions
erva+2.772
 depress+2.728
 regress+2.609
ック+2.549
uries+2.484
Neg logit contributions
alid-2.180
vier-2.131
edin-2.094
 Agency-2.092
cot-2.075
Token the
Token ID262
Feature activation-9.08e-01

Pos logit contributions
alid+0.604
 Agency+0.600
vier+0.599
edin+0.584
cot+0.581
Neg logit contributions
erva-0.727
 depress-0.697
 regress-0.670
ック-0.661
riots-0.660
Token Dev
Token ID6245
Feature activation-9.48e+00

Pos logit contributions
alid+0.607
 Agency+0.602
vier+0.594
edin+0.587
cot+0.583
Neg logit contributions
erva-0.692
 depress-0.665
 regress-0.635
ック-0.627
riots-0.626
Tokenial
Token ID498
Feature activation-2.99e+00

Pos logit contributions
alid+6.112
vier+5.861
edin+5.699
cot+5.485
 Agency+5.458
Neg logit contributions
 depress-7.185
erva-7.036
ック-6.933
 regress-6.752
��-6.659
Tokenet
Token ID316
Feature activation-4.04e+00

Pos logit contributions
alid+2.369
vier+2.357
cot+2.324
edin+2.316
 Agency+2.284
Neg logit contributions
erva-1.875
 depress-1.843
 regress-1.728
ック-1.704
��-1.694
Token
Token ID447
Feature activation-1.70e-01

Pos logit contributions
alid+2.782
 Agency+2.730
vier+2.706
cot+2.696
edin+2.687
Neg logit contributions
erva-2.940
 depress-2.822
 regress-2.685
riots-2.625
ック-2.618
Token
Token ID247
Feature activation-3.31e-01

Pos logit contributions
alid+0.144
 Agency+0.141
edin+0.141
vier+0.140
cot+0.139
Neg logit contributions
 depress-0.096
erva-0.095
 regress-0.089
ック-0.088
uries-0.084
Tokens
Token ID82
Feature activation-1.86e+00

Pos logit contributions
alid+0.268
vier+0.263
 Agency+0.262
edin+0.260
cot+0.259
Neg logit contributions
erva-0.207
 depress-0.201
 regress-0.191
ック-0.185
riots-0.181
TokenBut
Token ID1537
Feature activation-2.84e+00

Pos logit contributions
alid+0.017
vier+0.017
 Agency+0.017
edin+0.017
cot+0.016
Neg logit contributions
erva-0.024
 depress-0.024
 regress-0.023
ック-0.022
uries-0.022
Token the
Token ID262
Feature activation-3.06e-01

Pos logit contributions
alid+1.768
 Agency+1.766
edin+1.733
vier+1.730
cot+1.721
Neg logit contributions
erva-2.245
 depress-2.164
 regress-2.108
ック-2.073
uries-2.041
Token project
Token ID1628
Feature activation+1.13e+00

Pos logit contributions
alid+0.197
 Agency+0.195
vier+0.193
edin+0.191
cot+0.190
Neg logit contributions
erva-0.242
 depress-0.233
 regress-0.226
ック-0.221
riots-0.218
Token has
Token ID468
Feature activation-6.92e-02

Pos logit contributions
erva+0.886
 depress+0.860
 regress+0.826
ック+0.814
uries+0.796
Neg logit contributions
alid-0.739
vier-0.725
 Agency-0.724
edin-0.715
cot-0.705
Token since
Token ID1201
Feature activation+2.80e-01

Pos logit contributions
alid+0.043
 Agency+0.041
vier+0.041
edin+0.041
cot+0.041
Neg logit contributions
erva-0.057
 depress-0.054
ック-0.052
 regress-0.052
uries-0.051
Token pet
Token ID4273
Feature activation-9.41e+00

Pos logit contributions
erva+0.222
 depress+0.213
 regress+0.205
ック+0.204
uries+0.200
Neg logit contributions
alid-0.182
vier-0.177
 Agency-0.177
edin-0.175
cot-0.174
Tokenered
Token ID1068
Feature activation-2.19e+00

Pos logit contributions
alid+2.885
cot+2.779
edin+2.755
 Agency+2.661
vier+2.554
Neg logit contributions
erva-10.158
 depress-9.925
ック-9.805
 regress-9.540
 Julius-9.540
Token out
Token ID503
Feature activation-1.69e+00

Pos logit contributions
alid+1.233
vier+1.202
 Agency+1.201
edin+1.189
cot+1.178
Neg logit contributions
erva-1.902
 depress-1.843
 regress-1.785
ック-1.756
riots-1.737
Token,
Token ID11
Feature activation+6.83e-01

Pos logit contributions
alid+1.099
vier+1.080
 Agency+1.080
edin+1.067
cot+1.057
Neg logit contributions
erva-1.326
 depress-1.276
 regress-1.233
ック-1.219
riots-1.194
Token and
Token ID290
Feature activation+5.56e-01

Pos logit contributions
erva+0.560
 depress+0.546
 regress+0.526
ック+0.515
riots+0.507
Neg logit contributions
alid-0.421
vier-0.411
 Agency-0.409
edin-0.405
cot-0.401
Token has
Token ID468
Feature activation+9.38e-01

Pos logit contributions
erva+0.428
 depress+0.416
 regress+0.401
ック+0.394
riots+0.386
Neg logit contributions
alid-0.369
 Agency-0.363
vier-0.362
edin-0.358
cot-0.356
Token new
Token ID649
Feature activation+2.82e+00

Pos logit contributions
alid+0.102
 Agency+0.100
vier+0.100
edin+0.098
cot+0.097
Neg logit contributions
erva-0.141
 depress-0.137
 regress-0.132
ック-0.130
riots-0.128
Token Declaration
Token ID24720
Feature activation+3.23e+00

Pos logit contributions
erva+2.329
 depress+2.298
 regress+2.206
ック+2.145
riots+2.113
Neg logit contributions
alid-1.701
vier-1.670
edin-1.639
 Agency-1.632
cot-1.615
Token of
Token ID286
Feature activation-8.80e-01

Pos logit contributions
erva+2.757
 depress+2.721
 regress+2.615
ック+2.543
riots+2.516
Neg logit contributions
alid-1.850
vier-1.824
edin-1.786
 Agency-1.763
cot-1.754
Token Principles
Token ID34200
Feature activation-1.52e+00

Pos logit contributions
 Agency+0.488
alid+0.487
vier+0.473
cot+0.469
edin+0.461
Neg logit contributions
erva-0.776
 depress-0.753
 regress-0.731
ック-0.720
uries-0.707
Token which
Token ID543
Feature activation-8.37e-01

Pos logit contributions
alid+0.965
 Agency+0.950
vier+0.942
edin+0.927
cot+0.926
Neg logit contributions
erva-1.219
 depress-1.178
 regress-1.136
ック-1.118
uries-1.100
Token infl
Token ID28472
Feature activation-9.21e+00

Pos logit contributions
alid+0.480
 Agency+0.472
vier+0.468
edin+0.460
cot+0.458
Neg logit contributions
erva-0.722
 depress-0.696
 regress-0.676
ック-0.668
riots-0.654
Tokenamed
Token ID2434
Feature activation-1.75e+00

Pos logit contributions
alid+2.643
vier+2.590
cot+2.457
edin+2.448
ovember+2.291
Neg logit contributions
erva-10.055
 depress-10.011
ック-9.912
 regress-9.753
��-9.691
Token the
Token ID262
Feature activation-4.54e-01

Pos logit contributions
alid+1.119
 Agency+1.110
vier+1.093
cot+1.082
edin+1.073
Neg logit contributions
erva-1.388
 depress-1.323
 regress-1.288
ック-1.276
uries-1.239
Token "
Token ID366
Feature activation+1.19e+00

Pos logit contributions
alid+0.299
 Agency+0.298
vier+0.292
cot+0.290
edin+0.288
Neg logit contributions
erva-0.354
 depress-0.337
 regress-0.326
ック-0.323
uries-0.314
TokenOld
Token ID19620
Feature activation-9.75e-01

Pos logit contributions
erva+0.923
 depress+0.902
 regress+0.866
ック+0.845
uries+0.829
Neg logit contributions
alid-0.782
vier-0.765
 Agency-0.760
edin-0.753
cot-0.748
Token Guard
Token ID4932
Feature activation+1.17e+00

Pos logit contributions
alid+0.771
vier+0.758
 Agency+0.757
edin+0.749
cot+0.744
Neg logit contributions
erva-0.626
 depress-0.610
 regress-0.580
ック-0.564
riots-0.550
Tokents
Token ID912
Feature activation-2.30e-01

Pos logit contributions
alid+0.333
vier+0.329
cot+0.326
 Agency+0.326
edin+0.324
Neg logit contributions
erva-0.212
 depress-0.204
 regress-0.193
ック-0.185
riots-0.181
Token cars
Token ID5006
Feature activation-7.49e-01

Pos logit contributions
alid+0.150
 Agency+0.148
vier+0.147
edin+0.146
cot+0.143
Neg logit contributions
erva-0.180
 depress-0.174
 regress-0.168
ック-0.166
riots-0.163
Token,
Token ID11
Feature activation+3.09e-01

Pos logit contributions
alid+0.480
vier+0.472
 Agency+0.469
edin+0.468
cot+0.463
Neg logit contributions
erva-0.589
 depress-0.569
 regress-0.551
ック-0.546
riots-0.536
Token allowing
Token ID5086
Feature activation+5.36e-01

Pos logit contributions
erva+0.245
 depress+0.237
 regress+0.228
ック+0.224
riots+0.220
Neg logit contributions
alid-0.199
vier-0.195
 Agency-0.194
edin-0.192
cot-0.190
Token the
Token ID262
Feature activation-6.83e-01

Pos logit contributions
erva+0.414
 depress+0.403
 regress+0.388
ック+0.380
riots+0.372
Neg logit contributions
alid-0.354
vier-0.347
 Agency-0.346
edin-0.342
cot-0.338
Token canine
Token ID43003
Feature activation-9.13e+00

Pos logit contributions
alid+0.338
vier+0.331
 Agency+0.330
edin+0.324
cot+0.320
Neg logit contributions
erva-0.641
 depress-0.623
 regress-0.604
ック-0.595
riots-0.586
Token to
Token ID284
Feature activation-8.75e-02

Pos logit contributions
alid+4.603
 Agency+4.545
vier+4.475
edin+4.400
cot+4.229
Neg logit contributions
erva-8.067
 depress-7.484
ック-7.379
uries-7.297
riots-7.234
Token tro
Token ID4161
Feature activation-4.11e+00

Pos logit contributions
alid+0.057
vier+0.056
 Agency+0.056
edin+0.055
cot+0.055
Neg logit contributions
erva-0.068
 depress-0.066
 regress-0.063
ック-0.063
riots-0.062
Tokent
Token ID83
Feature activation-2.99e-01

Pos logit contributions
alid+3.398
vier+3.360
cot+3.297
edin+3.295
 Agency+3.261
Neg logit contributions
erva-2.510
 depress-2.408
 regress-2.297
ック-2.228
riots-2.197
Token across
Token ID1973
Feature activation+8.38e-01

Pos logit contributions
alid+0.177
vier+0.174
 Agency+0.173
edin+0.171
cot+0.170
Neg logit contributions
erva-0.252
 depress-0.244
 regress-0.235
ック-0.232
riots-0.228
Token to
Token ID284
Feature activation+1.17e+00

Pos logit contributions
erva+0.645
 depress+0.630
 regress+0.605
ック+0.590
riots+0.580
Neg logit contributions
alid-0.557
vier-0.545
 Agency-0.542
edin-0.537
cot-0.533
Token owned
Token ID6898
Feature activation-4.16e+00

Pos logit contributions
erva+1.021
 depress+0.995
 regress+0.959
ック+0.936
riots+0.922
Neg logit contributions
alid-0.786
vier-0.771
 Agency-0.769
edin-0.757
cot-0.753
Token by
Token ID416
Feature activation+1.02e+00

Pos logit contributions
alid+2.262
vier+2.220
 Agency+2.213
edin+2.178
cot+2.156
Neg logit contributions
erva-3.661
 depress-3.584
 regress-3.468
ック-3.386
riots-3.343
Token Alex
Token ID4422
Feature activation-1.76e+00

Pos logit contributions
erva+0.772
 depress+0.751
 regress+0.722
ック+0.706
uries+0.694
Neg logit contributions
alid-0.696
 Agency-0.687
vier-0.684
edin-0.671
cot-0.669
Token Jones
Token ID5437
Feature activation+1.25e+00

Pos logit contributions
alid+1.200
 Agency+1.187
vier+1.158
edin+1.156
cot+1.139
Neg logit contributions
erva-1.340
 depress-1.300
 regress-1.251
ック-1.201
uries-1.196
Token that
Token ID326
Feature activation-3.02e-01

Pos logit contributions
erva+0.863
 depress+0.842
 regress+0.803
ック+0.781
riots+0.767
Neg logit contributions
alid-0.932
vier-0.915
 Agency-0.908
edin-0.901
cot-0.895
Token propag
Token ID8928
Feature activation-9.10e+00

Pos logit contributions
alid+0.179
vier+0.177
 Agency+0.177
edin+0.172
cot+0.172
Neg logit contributions
erva-0.256
 depress-0.246
 regress-0.239
ック-0.233
riots-0.231
Tokenates
Token ID689
Feature activation-2.62e+00

Pos logit contributions
alid+4.071
vier+3.936
cot+3.755
 Agency+3.722
edin+3.653
Neg logit contributions
erva-8.739
 depress-8.535
��-8.290
 regress-8.190
ック-8.116
Token conspiracy
Token ID10086
Feature activation+1.67e+00

Pos logit contributions
 Agency+1.760
alid+1.750
vier+1.714
cot+1.689
edin+1.673
Neg logit contributions
erva-1.989
 depress-1.872
 regress-1.807
ック-1.805
uries-1.778
Token theories
Token ID10946
Feature activation-1.22e+00

Pos logit contributions
erva+1.070
 depress+1.037
 regress+0.987
ック+0.958
riots+0.941
Neg logit contributions
alid-1.324
vier-1.302
 Agency-1.295
edin-1.283
cot-1.276
Token like
Token ID588
Feature activation+2.22e+00

Pos logit contributions
alid+0.725
 Agency+0.712
vier+0.711
edin+0.696
cot+0.695
Neg logit contributions
erva-1.024
 depress-0.986
 regress-0.957
ック-0.936
riots-0.930
Token the
Token ID262
Feature activation-3.82e-01

Pos logit contributions
erva+1.736
 depress+1.697
 regress+1.629
ック+1.589
riots+1.564
Neg logit contributions
alid-1.454
vier-1.424
 Agency-1.414
edin-1.400
cot-1.390
Token,
Token ID11
Feature activation+4.50e-01

Pos logit contributions
erva+0.184
 depress+0.178
 regress+0.171
ック+0.167
riots+0.166
Neg logit contributions
alid-0.153
vier-0.150
 Agency-0.150
edin-0.148
cot-0.147
Token as
Token ID355
Feature activation+1.61e+00

Pos logit contributions
erva+0.363
 depress+0.353
 regress+0.340
ック+0.332
riots+0.328
Neg logit contributions
alid-0.284
vier-0.277
 Agency-0.277
edin-0.273
cot-0.271
Token police
Token ID1644
Feature activation-8.10e-01

Pos logit contributions
erva+1.222
 depress+1.189
 regress+1.141
ック+1.114
riots+1.096
Neg logit contributions
alid-1.094
vier-1.067
 Agency-1.067
edin-1.053
cot-1.048
Token investigate
Token ID9161
Feature activation-3.01e-01

Pos logit contributions
alid+0.465
 Agency+0.459
vier+0.447
edin+0.444
cot+0.442
Neg logit contributions
erva-0.704
 depress-0.675
 regress-0.651
ック-0.642
riots-0.635
Token whether
Token ID1771
Feature activation-8.99e-01

Pos logit contributions
alid+0.201
 Agency+0.198
vier+0.194
cot+0.192
edin+0.191
Neg logit contributions
erva-0.235
 depress-0.225
 regress-0.214
ック-0.212
riots-0.208
Token Ford
Token ID8092
Feature activation-9.05e+00

Pos logit contributions
alid+0.587
 Agency+0.578
vier+0.567
edin+0.566
cot+0.560
Neg logit contributions
erva-0.706
 depress-0.682
 regress-0.656
ック-0.641
riots-0.635
Token
Token ID447
Feature activation-1.03e+00

Pos logit contributions
alid+5.192
 Agency+5.148
vier+5.094
edin+5.087
 breached+4.996
Neg logit contributions
erva-7.359
 depress-6.867
 regress-6.811
riots-6.659
uries-6.578
Token
Token ID247
Feature activation-6.27e-01

Pos logit contributions
alid+0.887
vier+0.870
edin+0.868
 Agency+0.867
cot+0.864
Neg logit contributions
erva-0.571
 depress-0.570
 regress-0.532
ック-0.514
uries-0.506
Tokens
Token ID82
Feature activation-1.43e+00

Pos logit contributions
alid+0.492
vier+0.483
 Agency+0.481
edin+0.477
cot+0.476
Neg logit contributions
erva-0.410
 depress-0.396
 regress-0.380
ック-0.366
uries-0.359
Token close
Token ID1969
Feature activation-6.52e-01

Pos logit contributions
alid+0.895
 Agency+0.889
edin+0.858
cot+0.857
vier+0.856
Neg logit contributions
erva-1.163
 depress-1.119
 regress-1.076
ック-1.066
riots-1.053
Token friend
Token ID1545
Feature activation+1.80e+00

Pos logit contributions
alid+0.409
 Agency+0.400
vier+0.397
edin+0.392
cot+0.389
Neg logit contributions
erva-0.528
 depress-0.511
 regress-0.495
ック-0.483
riots-0.477
Token 0
Token ID657
Feature activation+1.75e+00

Pos logit contributions
alid+0.428
 Agency+0.418
vier+0.415
edin+0.409
cot+0.407
Neg logit contributions
erva-0.603
 depress-0.586
 regress-0.562
ック-0.555
riots-0.551
Token.
Token ID13
Feature activation-2.54e+00

Pos logit contributions
erva+1.439
 depress+1.409
 regress+1.354
ック+1.323
riots+1.303
Neg logit contributions
alid-1.074
vier-1.052
 Agency-1.042
edin-1.031
cot-1.023
Token0
Token ID15
Feature activation+4.01e-01

Pos logit contributions
alid+1.425
edin+1.379
 Agency+1.373
vier+1.370
cot+1.348
Neg logit contributions
erva-2.201
 depress-2.158
 regress-2.091
ック-2.046
riots-2.009
Tokenf
Token ID69
Feature activation-1.17e+00

Pos logit contributions
erva+0.339
 depress+0.332
 regress+0.320
ック+0.313
riots+0.309
Neg logit contributions
alid-0.237
vier-0.230
 Agency-0.229
edin-0.227
cot-0.224
Token);
Token ID1776
Feature activation+6.67e-01

Pos logit contributions
alid+0.652
 Agency+0.631
vier+0.627
edin+0.624
cot+0.616
Neg logit contributions
erva-1.015
 depress-0.997
 regress-0.962
ック-0.939
uries-0.931
Token RAW
Token ID33782
Feature activation-8.95e+00

Pos logit contributions
erva+0.533
 depress+0.518
 regress+0.497
ック+0.490
riots+0.483
Neg logit contributions
alid-0.426
vier-0.415
 Agency-0.415
edin-0.409
cot-0.406
Token_
Token ID62
Feature activation-1.08e+00

Pos logit contributions
alid+4.961
cot+4.853
vier+4.843
 Agency+4.772
edin+4.612
Neg logit contributions
erva-7.513
 depress-7.165
ック-6.899
riots-6.830
 regress-6.802
TokenASS
Token ID10705
Feature activation+2.90e+00

Pos logit contributions
alid+0.666
vier+0.654
 Agency+0.644
edin+0.638
cot+0.638
Neg logit contributions
erva-0.880
 depress-0.860
 regress-0.826
ック-0.808
riots-0.795
TokenERT
Token ID17395
Feature activation-2.21e-01

Pos logit contributions
erva+1.750
 depress+1.696
 regress+1.604
ック+1.545
uries+1.512
Neg logit contributions
alid-2.408
vier-2.373
 Agency-2.353
edin-2.344
cot-2.324
Token(
Token ID7
Feature activation+5.64e-01

Pos logit contributions
alid+0.144
vier+0.141
 Agency+0.140
cot+0.138
edin+0.138
Neg logit contributions
erva-0.173
 depress-0.168
 regress-0.162
ック-0.158
riots-0.156
Tokenit
Token ID270
Feature activation-4.81e-01

Pos logit contributions
erva+0.417
 depress+0.407
 regress+0.390
ック+0.379
riots+0.373
Neg logit contributions
alid-0.393
vier-0.386
 Agency-0.383
edin-0.380
cot-0.378
Token and
Token ID290
Feature activation-2.79e-01

Pos logit contributions
alid+1.545
vier+1.531
 Agency+1.526
edin+1.511
cot+1.481
Neg logit contributions
erva-2.079
 depress-2.020
 regress-1.932
ック-1.925
riots-1.893
Token public
Token ID1171
Feature activation+1.77e+00

Pos logit contributions
alid+0.182
 Agency+0.180
vier+0.179
edin+0.177
cot+0.175
Neg logit contributions
erva-0.217
 depress-0.210
 regress-0.201
ック-0.201
riots-0.196
Token health
Token ID1535
Feature activation+6.23e-01

Pos logit contributions
erva+1.311
 depress+1.281
 regress+1.226
ック+1.196
riots+1.175
Neg logit contributions
alid-1.232
vier-1.208
 Agency-1.200
edin-1.189
cot-1.181
Token should
Token ID815
Feature activation-1.47e+00

Pos logit contributions
erva+0.508
 depress+0.496
 regress+0.476
ック+0.469
riots+0.462
Neg logit contributions
alid-0.384
vier-0.378
 Agency-0.377
edin-0.372
cot-0.368
Token be
Token ID307
Feature activation-2.34e+00

Pos logit contributions
alid+0.908
 Agency+0.887
vier+0.882
edin+0.880
cot+0.870
Neg logit contributions
erva-1.194
 depress-1.137
ック-1.107
 regress-1.096
riots-1.091
Token fost
Token ID26546
Feature activation-8.93e+00

Pos logit contributions
alid+1.295
 Agency+1.280
vier+1.259
edin+1.245
cot+1.228
Neg logit contributions
erva-2.061
 depress-1.971
ック-1.925
 regress-1.919
riots-1.884
Tokenered
Token ID1068
Feature activation-2.82e+00

Pos logit contributions
alid+2.758
edin+2.684
vier+2.645
cot+2.543
eyed+2.361
Neg logit contributions
 depress-9.611
erva-9.554
ック-9.516
 regress-9.471
��-9.192
Token (
Token ID357
Feature activation-1.08e+00

Pos logit contributions
alid+1.873
 Agency+1.873
vier+1.848
cot+1.826
edin+1.820
Neg logit contributions
erva-2.132
 depress-2.051
ック-1.969
 regress-1.968
riots-1.937
Token19
Token ID1129
Feature activation+3.38e+00

Pos logit contributions
alid+0.612
vier+0.597
 Agency+0.592
edin+0.586
cot+0.581
Neg logit contributions
erva-0.932
 depress-0.913
 regress-0.881
ック-0.861
riots-0.848
Token).
Token ID737
Feature activation-8.49e-01

Pos logit contributions
erva+2.688
 depress+2.640
 regress+2.529
ック+2.455
uries+2.431
Neg logit contributions
alid-2.108
vier-2.062
 Agency-2.022
cot-2.014
edin-2.013
Token As
Token ID1081
Feature activation+1.20e+00

Pos logit contributions
alid+0.564
 Agency+0.558
vier+0.553
edin+0.546
cot+0.540
Neg logit contributions
erva-0.653
 depress-0.641
 regress-0.615
ック-0.596
riots-0.588
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
alid+1.129
 Agency+1.113
vier+1.101
edin+1.091
cot+1.075
Neg logit contributions
erva-1.124
 depress-1.074
 regress-1.028
ック-1.017
riots-1.005
Token (
Token ID357
Feature activation-9.21e-01

Pos logit contributions
erva+0.017
 depress+0.016
 regress+0.016
ック+0.015
riots+0.015
Neg logit contributions
alid-0.014
 Agency-0.014
vier-0.014
edin-0.014
cot-0.013
TokenFree
Token ID11146
Feature activation-2.08e+00

Pos logit contributions
alid+0.648
vier+0.638
 Agency+0.634
edin+0.631
cot+0.622
Neg logit contributions
erva-0.675
 depress-0.659
 regress-0.631
ック-0.610
riots-0.600
Token)
Token ID8
Feature activation-2.96e-01

Pos logit contributions
alid+1.468
 Agency+1.455
vier+1.444
edin+1.436
cot+1.411
Neg logit contributions
erva-1.500
 depress-1.464
 regress-1.394
ック-1.368
riots-1.346
Token [
Token ID685
Feature activation-1.84e+00

Pos logit contributions
alid+0.197
vier+0.193
 Agency+0.190
cot+0.189
edin+0.189
Neg logit contributions
erva-0.227
 depress-0.222
 regress-0.213
ック-0.207
uries-0.205
TokenHome
Token ID16060
Feature activation-8.89e+00

Pos logit contributions
alid+0.554
vier+0.528
 Agency+0.519
edin+0.507
cot+0.500
Neg logit contributions
erva-2.083
 depress-2.051
 regress-1.995
ック-1.962
riots-1.943
Tokenpage
Token ID7700
Feature activation-7.24e-01

Pos logit contributions
vier+2.042
alid+1.905
edin+1.854
 Agency+1.791
cot+1.677
Neg logit contributions
erva-10.451
 depress-10.389
��-10.133
 regress-10.107
ック-10.067
Token]
Token ID60
Feature activation-2.30e+00

Pos logit contributions
alid+0.455
 Agency+0.452
vier+0.444
edin+0.441
cot+0.433
Neg logit contributions
erva-0.586
 depress-0.571
 regress-0.548
ック-0.536
uries-0.529
Token [
Token ID685
Feature activation-1.24e+00

Pos logit contributions
alid+1.482
vier+1.450
 Agency+1.439
edin+1.426
cot+1.414
Neg logit contributions
erva-1.818
 depress-1.779
 regress-1.708
ック-1.667
riots-1.641
TokenApp
Token ID4677
Feature activation-2.69e+00

Pos logit contributions
alid+0.636
vier+0.619
 Agency+0.612
edin+0.604
cot+0.600
Neg logit contributions
erva-1.146
 depress-1.125
 regress-1.086
ック-1.065
riots-1.051
TokenBrain
Token ID44687
Feature activation+1.46e-01

Pos logit contributions
vier+1.005
alid+0.997
edin+0.979
 Agency+0.975
Saharan+0.927
Neg logit contributions
erva-2.845
 depress-2.823
 regress-2.748
ック-2.677
riots-2.633
Token effects
Token ID3048
Feature activation+2.56e+00

Pos logit contributions
 Agency+1.551
alid+1.546
vier+1.505
edin+1.489
cot+1.487
Neg logit contributions
erva-1.795
 depress-1.713
 regress-1.645
ック-1.625
riots-1.616
Token and
Token ID290
Feature activation-7.27e-01

Pos logit contributions
erva+2.022
 depress+1.981
 regress+1.903
ック+1.855
riots+1.823
Neg logit contributions
alid-1.651
vier-1.616
 Agency-1.600
edin-1.588
cot-1.575
Token frames
Token ID13431
Feature activation+8.69e-01

Pos logit contributions
alid+0.492
 Agency+0.489
vier+0.486
edin+0.479
cot+0.475
Neg logit contributions
erva-0.553
 depress-0.524
 regress-0.503
ック-0.503
riots-0.496
Token)
Token ID8
Feature activation+4.95e-01

Pos logit contributions
erva+0.694
 depress+0.679
 regress+0.651
ック+0.637
riots+0.629
Neg logit contributions
alid-0.553
vier-0.541
 Agency-0.538
edin-0.531
cot-0.528
Token [
Token ID685
Feature activation-2.59e+00

Pos logit contributions
erva+0.374
 depress+0.365
 regress+0.350
ック+0.341
riots+0.336
Neg logit contributions
alid-0.337
vier-0.330
 Agency-0.328
edin-0.325
cot-0.322
TokenHome
Token ID16060
Feature activation-8.86e+00

Pos logit contributions
alid+1.593
vier+1.570
 Agency+1.558
edin+1.558
cot+1.535
Neg logit contributions
erva-2.120
 depress-2.065
 regress-1.991
ック-1.932
riots-1.910
Tokenpage
Token ID7700
Feature activation-9.68e-01

Pos logit contributions
alid+5.109
 Agency+5.052
vier+4.992
edin+4.968
cot+4.856
Neg logit contributions
erva-7.264
 depress-7.010
 regress-6.826
ック-6.761
riots-6.574
Token]
Token ID60
Feature activation-2.15e+00

Pos logit contributions
alid+0.632
 Agency+0.619
vier+0.616
edin+0.612
cot+0.602
Neg logit contributions
erva-0.758
 depress-0.738
 regress-0.710
ック-0.694
riots-0.683
Token [
Token ID685
Feature activation-1.61e+00

Pos logit contributions
alid+1.355
 Agency+1.329
vier+1.325
edin+1.316
cot+1.285
Neg logit contributions
erva-1.719
 depress-1.681
 regress-1.620
ック-1.576
riots-1.556
TokenApp
Token ID4677
Feature activation-2.47e+00

Pos logit contributions
alid+0.938
vier+0.921
 Agency+0.917
edin+0.914
cot+0.894
Neg logit contributions
erva-1.380
 depress-1.349
 regress-1.299
ック-1.269
riots-1.247
TokenBrain
Token ID44687
Feature activation+1.64e+00

Pos logit contributions
alid+1.601
 Agency+1.549
vier+1.541
edin+1.531
cot+1.515
Neg logit contributions
erva-1.956
 depress-1.909
 regress-1.844
ック-1.808
riots-1.758
Tokenkeepers
Token ID24952
Feature activation+8.96e-01

Pos logit contributions
erva+0.030
 depress+0.029
 regress+0.028
ック+0.027
riots+0.027
Neg logit contributions
alid-0.034
 Agency-0.034
vier-0.034
cot-0.034
edin-0.033
Token,
Token ID11
Feature activation+1.59e+00

Pos logit contributions
erva+0.600
 depress+0.586
 regress+0.558
ック+0.542
riots+0.531
Neg logit contributions
alid-0.684
vier-0.672
 Agency-0.666
edin-0.663
cot-0.657
Token masc
Token ID23977
Feature activation-5.87e-01

Pos logit contributions
erva+1.142
 depress+1.115
 regress+1.066
ック+1.037
riots+1.019
Neg logit contributions
alid-1.138
vier-1.116
 Agency-1.109
edin-1.099
cot-1.092
Tokenots
Token ID1747
Feature activation-6.74e-01

Pos logit contributions
alid+0.377
vier+0.372
cot+0.367
edin+0.363
 Agency+0.363
Neg logit contributions
erva-0.465
 depress-0.455
 regress-0.434
ック-0.428
 Julius-0.410
Token,
Token ID11
Feature activation+1.35e+00

Pos logit contributions
alid+0.497
vier+0.488
 Agency+0.485
edin+0.481
cot+0.478
Neg logit contributions
erva-0.469
 depress-0.457
 regress-0.436
ック-0.425
riots-0.417
Token dogs
Token ID6844
Feature activation-8.77e+00

Pos logit contributions
erva+0.969
 depress+0.944
 regress+0.902
ック+0.879
riots+0.864
Neg logit contributions
alid-0.975
vier-0.956
 Agency-0.951
edin-0.941
cot-0.936
Token.
Token ID13
Feature activation+2.12e-01

Pos logit contributions
alid+5.728
 Agency+5.726
cot+5.713
vier+5.660
edin+5.549
Neg logit contributions
erva-6.361
 depress-6.048
 regress-5.833
ック-5.783
riots-5.646
Token
Token ID198
Feature activation+2.41e-01

Pos logit contributions
erva+0.161
 depress+0.157
 regress+0.151
ック+0.146
riots+0.144
Neg logit contributions
alid-0.143
 Agency-0.142
vier-0.141
edin-0.139
cot-0.138
Token
Token ID198
Feature activation+5.20e-01

Pos logit contributions
erva+0.184
 depress+0.180
 regress+0.172
ック+0.167
riots+0.165
Neg logit contributions
alid-0.163
vier-0.160
 Agency-0.159
edin-0.157
cot-0.156
TokenA
Token ID32
Feature activation+2.03e+00

Pos logit contributions
erva+0.437
 depress+0.428
 regress+0.412
ック+0.402
uries+0.396
Neg logit contributions
alid-0.309
vier-0.303
 Agency-0.300
edin-0.298
cot-0.295
Token video
Token ID2008
Feature activation-1.99e+00

Pos logit contributions
erva+1.563
 depress+1.533
 regress+1.465
ック+1.430
riots+1.407
Neg logit contributions
alid-1.349
vier-1.319
 Agency-1.307
edin-1.297
cot-1.284
Token harassed
Token ID26775
Feature activation-9.29e-01

Pos logit contributions
erva+0.950
 depress+0.928
 regress+0.894
ック+0.874
riots+0.860
Neg logit contributions
alid-0.714
vier-0.698
 Agency-0.693
edin-0.686
cot-0.681
Token,
Token ID11
Feature activation+9.70e-01

Pos logit contributions
alid+0.625
vier+0.615
 Agency+0.609
edin+0.602
cot+0.601
Neg logit contributions
erva-0.706
 depress-0.683
 regress-0.656
ック-0.643
riots-0.635
Token t
Token ID256
Feature activation+1.86e+00

Pos logit contributions
erva+0.793
 depress+0.777
 regress+0.747
ック+0.729
riots+0.718
Neg logit contributions
alid-0.598
vier-0.585
 Agency-0.579
edin-0.574
cot-0.569
Tokenrolled
Token ID8375
Feature activation-1.11e+00

Pos logit contributions
erva+1.302
 depress+1.273
 regress+1.216
ック+1.184
riots+1.159
Neg logit contributions
alid-1.364
vier-1.336
 Agency-1.327
edin-1.318
cot-1.309
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
alid+0.832
vier+0.817
 Agency+0.811
edin+0.804
cot+0.800
Neg logit contributions
erva-0.758
 depress-0.734
 regress-0.703
ック-0.684
riots-0.673
Token dog
Token ID3290
Feature activation-8.76e+00

Pos logit contributions
erva+1.065
 depress+1.047
 regress+1.005
ック+0.979
riots+0.965
Neg logit contributions
alid-0.820
vier-0.803
 Agency-0.794
edin-0.788
cot-0.781
Tokenp
Token ID79
Feature activation+2.13e+00

Pos logit contributions
cot+5.634
alid+5.620
 Agency+5.424
edin+5.346
vier+5.317
Neg logit contributions
erva-6.872
 depress-6.485
 regress-6.260
ック-6.072
uries-6.055
Tokeniled
Token ID3902
Feature activation-1.02e+00

Pos logit contributions
erva+1.707
 depress+1.689
 regress+1.596
uries+1.585
riots+1.565
Neg logit contributions
alid-1.312
 Agency-1.302
vier-1.292
edin-1.267
cot-1.239
Token on
Token ID319
Feature activation+8.22e-01

Pos logit contributions
alid+0.754
vier+0.739
 Agency+0.735
edin+0.727
cot+0.725
Neg logit contributions
erva-0.709
 depress-0.686
 regress-0.658
ック-0.640
riots-0.630
Token,
Token ID11
Feature activation+8.90e-01

Pos logit contributions
erva+0.742
 depress+0.727
 regress+0.702
ック+0.688
riots+0.678
Neg logit contributions
alid-0.437
vier-0.426
 Agency-0.423
edin-0.417
cot-0.414
Token dumped
Token ID24105
Feature activation-1.26e+00

Pos logit contributions
erva+0.719
 depress+0.706
 regress+0.678
ック+0.661
riots+0.651
Neg logit contributions
alid-0.557
vier-0.545
 Agency-0.540
edin-0.535
cot-0.530
Token version
Token ID2196
Feature activation+1.45e+00

Pos logit contributions
alid+2.136
 Agency+2.132
vier+2.098
edin+2.097
cot+2.052
Neg logit contributions
erva-3.143
 depress-3.046
 regress-2.923
ック-2.897
uries-2.843
Token removes
Token ID20694
Feature activation-2.28e+00

Pos logit contributions
erva+0.916
 depress+0.890
 regress+0.845
ック+0.821
riots+0.804
Neg logit contributions
alid-1.167
vier-1.148
 Agency-1.141
edin-1.132
cot-1.125
Token ads
Token ID9011
Feature activation-5.75e+00

Pos logit contributions
alid+1.479
 Agency+1.470
vier+1.440
cot+1.417
edin+1.415
Neg logit contributions
erva-1.793
 depress-1.700
riots-1.628
ック-1.620
 regress-1.619
Token)
Token ID8
Feature activation-1.96e-01

Pos logit contributions
alid+3.528
vier+3.455
 Agency+3.452
cot+3.413
edin+3.409
Neg logit contributions
erva-4.650
 depress-4.479
 regress-4.287
ック-4.240
riots-4.190
Token [
Token ID685
Feature activation-1.67e+00

Pos logit contributions
alid+0.130
vier+0.127
 Agency+0.125
cot+0.125
edin+0.124
Neg logit contributions
erva-0.151
 depress-0.148
 regress-0.141
ック-0.137
uries-0.136
TokenHome
Token ID16060
Feature activation-8.71e+00

Pos logit contributions
alid+0.438
vier+0.414
 Agency+0.406
edin+0.395
cot+0.389
Neg logit contributions
erva-1.954
 depress-1.926
 regress-1.874
ック-1.845
riots-1.827
Tokenpage
Token ID7700
Feature activation-1.17e+00

Pos logit contributions
vier+2.007
alid+1.900
edin+1.833
 Agency+1.772
cot+1.641
Neg logit contributions
erva-10.239
 depress-10.196
��-9.913
 regress-9.890
ック-9.876
Token]
Token ID60
Feature activation-2.26e+00

Pos logit contributions
alid+0.722
 Agency+0.720
vier+0.704
edin+0.700
cot+0.686
Neg logit contributions
erva-0.955
 depress-0.932
 regress-0.895
ック-0.875
uries-0.864
Token [
Token ID685
Feature activation-9.71e-01

Pos logit contributions
alid+1.445
vier+1.413
 Agency+1.404
edin+1.390
cot+1.378
Neg logit contributions
erva-1.789
 depress-1.751
 regress-1.682
ック-1.642
riots-1.616
TokenApp
Token ID4677
Feature activation-2.62e+00

Pos logit contributions
alid+0.491
vier+0.477
 Agency+0.472
edin+0.466
cot+0.462
Neg logit contributions
erva-0.901
 depress-0.884
 regress-0.855
ック-0.837
riots-0.827
TokenBrain
Token ID44687
Feature activation+3.98e-01

Pos logit contributions
vier+0.979
alid+0.972
edin+0.953
 Agency+0.950
cot+0.902
Neg logit contributions
erva-2.767
 depress-2.746
 regress-2.672
ック-2.603
riots-2.557
Token app
Token ID598
Feature activation-2.84e+00

Pos logit contributions
alid+2.379
vier+2.351
 Agency+2.339
edin+2.313
cot+2.288
Neg logit contributions
erva-2.969
 depress-2.880
 regress-2.764
ック-2.710
riots-2.671
Token adds
Token ID6673
Feature activation-7.44e-01

Pos logit contributions
alid+2.004
vier+1.979
 Agency+1.966
edin+1.958
cot+1.927
Neg logit contributions
erva-2.041
 depress-1.965
 regress-1.880
ック-1.853
riots-1.808
Token features
Token ID3033
Feature activation-1.65e+00

Pos logit contributions
alid+0.484
 Agency+0.479
vier+0.475
edin+0.469
cot+0.468
Neg logit contributions
erva-0.583
 depress-0.558
 regress-0.536
ック-0.526
riots-0.525
Token)
Token ID8
Feature activation-3.62e-01

Pos logit contributions
alid+1.064
 Agency+1.045
vier+1.044
edin+1.034
cot+1.022
Neg logit contributions
erva-1.297
 depress-1.254
 regress-1.207
ック-1.188
riots-1.169
Token [
Token ID685
Feature activation-1.87e+00

Pos logit contributions
alid+0.233
vier+0.229
 Agency+0.226
edin+0.224
cot+0.223
Neg logit contributions
erva-0.286
 depress-0.279
 regress-0.268
ック-0.262
riots-0.258
TokenHome
Token ID16060
Feature activation-8.69e+00

Pos logit contributions
alid+0.705
vier+0.684
 Agency+0.677
edin+0.673
cot+0.655
Neg logit contributions
erva-1.974
 depress-1.942
 regress-1.886
ック-1.849
riots-1.821
Tokenpage
Token ID7700
Feature activation-7.25e-01

Pos logit contributions
vier+1.991
alid+1.890
edin+1.822
 Agency+1.803
cot+1.631
Neg logit contributions
erva-10.197
 depress-10.171
ック-9.869
 regress-9.853
��-9.781
Token]
Token ID60
Feature activation-2.35e+00

Pos logit contributions
alid+0.499
vier+0.490
 Agency+0.484
edin+0.480
cot+0.479
Neg logit contributions
erva-0.541
 depress-0.528
 regress-0.506
ック-0.493
riots-0.484
Token [
Token ID685
Feature activation-1.07e+00

Pos logit contributions
alid+1.523
vier+1.490
 Agency+1.482
edin+1.468
cot+1.453
Neg logit contributions
erva-1.848
 depress-1.808
 regress-1.737
ック-1.694
riots-1.668
TokenApp
Token ID4677
Feature activation-2.42e+00

Pos logit contributions
alid+0.551
vier+0.539
 Agency+0.533
edin+0.532
cot+0.523
Neg logit contributions
erva-0.986
 depress-0.969
 regress-0.936
ック-0.914
riots-0.899
TokenBrain
Token ID44687
Feature activation+7.15e-01

Pos logit contributions
alid+1.078
vier+1.059
 Agency+1.038
edin+1.035
cot+1.012
Neg logit contributions
erva-2.376
 depress-2.342
 regress-2.271
ック-2.229
riots-2.189
Token thing
Token ID1517
Feature activation+3.56e-01

Pos logit contributions
alid+2.205
 Agency+2.181
vier+2.152
edin+2.129
cot+2.107
Neg logit contributions
erva-2.576
 depress-2.432
 regress-2.349
ック-2.337
riots-2.314
Token,
Token ID11
Feature activation+2.28e-01

Pos logit contributions
erva+0.222
 depress+0.216
 regress+0.205
ック+0.199
riots+0.195
Neg logit contributions
alid-0.287
vier-0.283
 Agency-0.281
edin-0.279
cot-0.277
Token he
Token ID339
Feature activation-7.09e-01

Pos logit contributions
erva+0.174
 depress+0.169
 regress+0.162
ック+0.158
riots+0.156
Neg logit contributions
alid-0.154
 Agency-0.150
vier-0.149
edin-0.147
cot-0.146
Token has
Token ID468
Feature activation+5.24e-01

Pos logit contributions
alid+0.427
vier+0.418
 Agency+0.414
edin+0.412
cot+0.407
Neg logit contributions
erva-0.591
 depress-0.573
 regress-0.555
ック-0.541
riots-0.535
Token a
Token ID257
Feature activation-1.45e-01

Pos logit contributions
erva+0.416
 depress+0.400
 regress+0.388
ック+0.379
riots+0.374
Neg logit contributions
alid-0.340
 Agency-0.331
vier-0.330
edin-0.327
cot-0.325
Token dog
Token ID3290
Feature activation-8.68e+00

Pos logit contributions
alid+0.096
 Agency+0.094
vier+0.094
cot+0.092
edin+0.092
Neg logit contributions
erva-0.113
 depress-0.108
 regress-0.105
ック-0.103
riots-0.101
Token,
Token ID11
Feature activation+3.86e-01

Pos logit contributions
 Agency+5.128
alid+5.077
vier+5.067
cot+5.063
edin+4.818
Neg logit contributions
erva-6.895
 depress-6.452
 regress-6.213
uries-6.186
riots-6.164
Token Angel
Token ID3905
Feature activation+1.54e+00

Pos logit contributions
erva+0.303
 depress+0.296
 regress+0.285
ック+0.278
riots+0.274
Neg logit contributions
alid-0.251
vier-0.245
 Agency-0.244
edin-0.241
cot-0.239
Token,
Token ID11
Feature activation+6.76e-01

Pos logit contributions
erva+1.015
 depress+0.988
 regress+0.941
ック+0.915
riots+0.896
Neg logit contributions
alid-1.189
vier-1.167
 Agency-1.160
edin-1.153
cot-1.143
Token that
Token ID326
Feature activation-1.09e-02

Pos logit contributions
erva+0.560
 depress+0.548
 regress+0.527
ック+0.515
riots+0.508
Neg logit contributions
alid-0.410
vier-0.400
 Agency-0.397
edin-0.393
cot-0.390
Token might
Token ID1244
Feature activation-1.92e+00

Pos logit contributions
alid+0.007
vier+0.007
 Agency+0.007
edin+0.007
cot+0.007
Neg logit contributions
erva-0.009
 depress-0.008
 regress-0.008
ック-0.008
riots-0.008
Token,
Token ID11
Feature activation-1.67e-01

Pos logit contributions
erva+1.024
 depress+1.002
 regress+0.960
ック+0.936
riots+0.921
Neg logit contributions
alid-0.877
vier-0.860
 Agency-0.852
edin-0.845
cot-0.839
Token the
Token ID262
Feature activation-2.87e-01

Pos logit contributions
alid+0.110
vier+0.108
 Agency+0.107
edin+0.106
cot+0.105
Neg logit contributions
erva-0.130
 depress-0.127
 regress-0.122
ック-0.119
riots-0.117
Token video
Token ID2008
Feature activation-9.54e-01

Pos logit contributions
alid+0.161
 Agency+0.157
vier+0.157
edin+0.154
cot+0.153
Neg logit contributions
erva-0.250
 depress-0.244
 regress-0.236
ック-0.232
riots-0.228
Token shows
Token ID2523
Feature activation-1.85e-01

Pos logit contributions
alid+0.607
 Agency+0.597
vier+0.590
edin+0.586
cot+0.575
Neg logit contributions
erva-0.764
 depress-0.730
 regress-0.706
ック-0.701
riots-0.684
Token a
Token ID257
Feature activation-5.85e-01

Pos logit contributions
alid+0.123
 Agency+0.121
vier+0.120
edin+0.118
cot+0.117
Neg logit contributions
erva-0.143
 depress-0.138
 regress-0.133
ック-0.131
uries-0.128
Token stray
Token ID28583
Feature activation-8.67e+00

Pos logit contributions
alid+0.360
 Agency+0.355
vier+0.353
edin+0.348
cot+0.345
Neg logit contributions
erva-0.477
 depress-0.463
 regress-0.447
ック-0.440
riots-0.433
Token dog
Token ID3290
Feature activation-7.81e+00

Pos logit contributions
 Agency+4.372
vier+4.199
alid+4.184
cot+4.053
edin+4.039
Neg logit contributions
erva-7.752
 depress-7.344
uries-7.180
 regress-7.113
ック-7.107
Token standing
Token ID5055
Feature activation-3.19e-01

Pos logit contributions
 Agency+4.765
vier+4.708
alid+4.703
cot+4.617
edin+4.602
Neg logit contributions
erva-6.154
 depress-5.589
uries-5.570
ック-5.499
riots-5.497
Token patiently
Token ID37220
Feature activation-1.67e+00

Pos logit contributions
alid+0.184
vier+0.181
 Agency+0.180
edin+0.178
cot+0.176
Neg logit contributions
erva-0.272
 depress-0.265
 regress-0.256
ック-0.251
riots-0.248
Token next
Token ID1306
Feature activation+1.35e+00

Pos logit contributions
alid+0.941
vier+0.940
 Agency+0.937
edin+0.925
cot+0.917
Neg logit contributions
erva-1.418
 depress-1.355
 regress-1.320
ック-1.312
riots-1.296
Token to
Token ID284
Feature activation+2.27e-01

Pos logit contributions
erva+1.115
 depress+1.093
 regress+1.050
ック+1.026
riots+1.010
Neg logit contributions
alid-0.827
vier-0.808
 Agency-0.800
edin-0.793
cot-0.787
Token to
Token ID284
Feature activation+1.03e+00

Pos logit contributions
alid+2.983
vier+2.948
 Agency+2.898
cot+2.871
edin+2.858
Neg logit contributions
erva-3.050
 depress-2.915
 regress-2.779
ック-2.754
riots-2.741
Token intrinsic
Token ID28327
Feature activation+1.95e+00

Pos logit contributions
erva+0.809
 depress+0.785
 regress+0.752
ック+0.738
riots+0.731
Neg logit contributions
alid-0.675
vier-0.661
 Agency-0.659
edin-0.648
cot-0.646
Token value
Token ID1988
Feature activation+1.67e-01

Pos logit contributions
erva+1.404
 depress+1.354
 regress+1.289
ック+1.266
riots+1.253
Neg logit contributions
alid-1.395
 Agency-1.373
vier-1.368
edin-1.345
cot-1.344
Token,
Token ID11
Feature activation+4.16e-01

Pos logit contributions
erva+0.115
 depress+0.112
 regress+0.106
ック+0.104
riots+0.103
Neg logit contributions
alid-0.125
vier-0.122
 Agency-0.122
edin-0.120
cot-0.120
Token it
Token ID340
Feature activation+2.26e+00

Pos logit contributions
erva+0.335
 depress+0.326
 regress+0.313
ック+0.308
riots+0.305
Neg logit contributions
alid-0.261
 Agency-0.255
vier-0.255
edin-0.250
cot-0.249
Token fost
Token ID26546
Feature activation-8.61e+00

Pos logit contributions
erva+1.774
 depress+1.743
 regress+1.673
ック+1.626
riots+1.601
Neg logit contributions
alid-1.469
vier-1.437
 Agency-1.425
edin-1.412
cot-1.400
Tokeners
Token ID364
Feature activation-2.40e+00

Pos logit contributions
alid+5.767
cot+5.709
edin+5.556
vier+5.541
 Agency+5.411
Neg logit contributions
��-5.896
erva-5.861
ック-5.738
 depress-5.697
 regress-5.553
Token the
Token ID262
Feature activation-6.16e-01

Pos logit contributions
 Agency+1.590
alid+1.586
vier+1.542
cot+1.528
edin+1.512
Neg logit contributions
erva-1.834
 depress-1.723
ック-1.670
riots-1.657
uries-1.653
Token creative
Token ID7325
Feature activation+1.20e+00

Pos logit contributions
alid+0.416
 Agency+0.409
vier+0.407
edin+0.399
cot+0.398
Neg logit contributions
erva-0.468
 depress-0.450
 regress-0.428
ック-0.426
riots-0.422
Token power
Token ID1176
Feature activation-1.59e-01

Pos logit contributions
erva+0.794
 depress+0.763
 regress+0.720
ック+0.710
riots+0.701
Neg logit contributions
alid-0.937
 Agency-0.924
vier-0.919
edin-0.906
cot-0.902
Token of
Token ID286
Feature activation-7.20e-01

Pos logit contributions
alid+0.089
vier+0.086
 Agency+0.086
edin+0.085
cot+0.084
Neg logit contributions
erva-0.140
 depress-0.136
 regress-0.132
ック-0.129
riots-0.127
Token like
Token ID588
Feature activation+3.42e+00

Pos logit contributions
erva+0.390
 depress+0.379
 regress+0.366
ック+0.358
riots+0.353
Neg logit contributions
alid-0.312
vier-0.304
 Agency-0.302
edin-0.299
cot-0.297
Token an
Token ID281
Feature activation-6.55e-01

Pos logit contributions
erva+2.542
 depress+2.492
 regress+2.382
ック+2.319
riots+2.279
Neg logit contributions
alid-2.361
vier-2.316
 Agency-2.295
edin-2.277
cot-2.259
Token inter
Token ID987
Feature activation+2.46e-01

Pos logit contributions
alid+0.425
 Agency+0.416
vier+0.414
edin+0.408
cot+0.405
Neg logit contributions
erva-0.517
 depress-0.500
 regress-0.482
ック-0.473
riots-0.466
Tokenmin
Token ID1084
Feature activation+5.05e+00

Pos logit contributions
erva+0.163
 depress+0.158
 regress+0.151
ック+0.147
uries+0.143
Neg logit contributions
alid-0.190
vier-0.187
 Agency-0.186
edin-0.184
cot-0.184
Tokenable
Token ID540
Feature activation+4.61e+00

Pos logit contributions
erva+3.499
 depress+3.318
 regress+3.233
ック+3.096
uries+3.079
Neg logit contributions
alid-3.726
 Agency-3.661
vier-3.634
cot-3.602
edin-3.590
Token stage
Token ID3800
Feature activation+6.66e+00

Pos logit contributions
erva+3.638
 depress+3.614
 regress+3.464
ック+3.341
uries+3.292
Neg logit contributions
alid-2.933
vier-2.877
 Agency-2.823
edin-2.812
cot-2.794
Token whisper
Token ID31992
Feature activation-1.48e+00

Pos logit contributions
erva+5.103
 depress+5.055
 regress+4.845
ック+4.627
riots+4.542
Neg logit contributions
alid-4.311
vier-4.275
 Agency-4.166
edin-4.141
cot-4.092
Token (
Token ID357
Feature activation+2.42e-01

Pos logit contributions
alid+1.007
vier+0.991
 Agency+0.981
edin+0.973
cot+0.972
Neg logit contributions
erva-1.117
 depress-1.075
 regress-1.039
ック-1.015
riots-1.004
Tokenthere
Token ID8117
Feature activation-5.96e-01

Pos logit contributions
erva+0.187
 depress+0.183
 regress+0.176
ック+0.170
uries+0.168
Neg logit contributions
alid-0.161
vier-0.157
 Agency-0.156
edin-0.155
cot-0.154
Token are
Token ID389
Feature activation+3.93e-01

Pos logit contributions
alid+0.381
vier+0.371
 Agency+0.369
edin+0.367
cot+0.364
Neg logit contributions
erva-0.474
 depress-0.465
 regress-0.446
ック-0.434
riots-0.429
Token odd
Token ID5629
Feature activation+2.48e+00

Pos logit contributions
erva+0.302
 depress+0.288
 regress+0.279
ック+0.272
riots+0.269
Neg logit contributions
alid-0.264
 Agency-0.259
vier-0.258
edin-0.254
cot-0.252
Token US
Token ID1294
Feature activation-4.37e-01

Pos logit contributions
erva+1.626
 depress+1.585
 regress+1.516
ック+1.478
riots+1.451
Neg logit contributions
alid-1.619
vier-1.585
 Agency-1.577
edin-1.562
cot-1.552
Token-
Token ID12
Feature activation+7.47e-01

Pos logit contributions
alid+0.314
 Agency+0.307
vier+0.305
cot+0.302
edin+0.301
Neg logit contributions
erva-0.313
 depress-0.303
 regress-0.291
ック-0.285
riots-0.280
TokenUK
Token ID15039
Feature activation-1.33e+00

Pos logit contributions
erva+0.635
 depress+0.624
 regress+0.597
ック+0.584
riots+0.580
Neg logit contributions
alid-0.431
vier-0.426
 Agency-0.418
edin-0.415
cot-0.409
Token occupation
Token ID13755
Feature activation+1.41e+00

Pos logit contributions
alid+0.927
 Agency+0.906
vier+0.898
cot+0.888
edin+0.884
Neg logit contributions
erva-0.984
 depress-0.952
 regress-0.915
ック-0.896
riots-0.879
Token and
Token ID290
Feature activation-1.29e-01

Pos logit contributions
erva+1.025
 depress+1.000
 regress+0.957
ック+0.933
riots+0.917
Neg logit contributions
alid-0.994
vier-0.974
 Agency-0.969
edin-0.959
cot-0.954
Token civil
Token ID3026
Feature activation+6.71e+00

Pos logit contributions
alid+0.092
 Agency+0.090
vier+0.089
edin+0.088
cot+0.088
Neg logit contributions
erva-0.094
 depress-0.090
 regress-0.086
ック-0.086
riots-0.084
Token war
Token ID1175
Feature activation+2.51e+00

Pos logit contributions
erva+4.862
 depress+4.795
 regress+4.587
ック+4.413
riots+4.374
Neg logit contributions
alid-4.620
vier-4.572
edin-4.567
 Agency-4.497
cot-4.410
Token that
Token ID326
Feature activation+4.48e-01

Pos logit contributions
erva+1.770
 depress+1.727
 regress+1.649
ック+1.605
riots+1.577
Neg logit contributions
alid-1.834
vier-1.799
 Agency-1.787
edin-1.772
cot-1.761
Token claimed
Token ID4752
Feature activation-9.15e-01

Pos logit contributions
erva+0.370
 depress+0.358
 regress+0.345
ック+0.342
riots+0.336
Neg logit contributions
alid-0.274
 Agency-0.267
vier-0.267
edin-0.262
cot-0.261
Token the
Token ID262
Feature activation-1.35e+00

Pos logit contributions
alid+0.552
 Agency+0.542
vier+0.533
edin+0.525
cot+0.525
Neg logit contributions
erva-0.760
 depress-0.734
 regress-0.711
ック-0.703
riots-0.692
Token lives
Token ID3160
Feature activation-1.87e+00

Pos logit contributions
alid+0.832
 Agency+0.825
vier+0.802
cot+0.793
edin+0.790
Neg logit contributions
erva-1.105
 depress-1.071
 regress-1.031
ック-1.018
uries-1.002
Token multi
Token ID5021
Feature activation+2.34e+00

Pos logit contributions
erva+0.451
 depress+0.438
 regress+0.422
ック+0.415
riots+0.409
Neg logit contributions
alid-0.353
vier-0.346
 Agency-0.345
edin-0.340
cot-0.339
Token-
Token ID12
Feature activation+1.64e+00

Pos logit contributions
erva+1.538
 depress+1.519
 regress+1.446
ック+1.400
uries+1.379
Neg logit contributions
alid-1.772
vier-1.743
edin-1.742
 Agency-1.723
ovember-1.698
Tokenstore
Token ID8095
Feature activation+1.08e+00

Pos logit contributions
erva+1.206
 depress+1.180
 regress+1.129
ック+1.098
riots+1.084
Neg logit contributions
alid-1.135
vier-1.115
 Agency-1.106
edin-1.097
cot-1.083
Tokeny
Token ID88
Feature activation+1.26e+00

Pos logit contributions
erva+0.885
 depress+0.867
 regress+0.834
ック+0.816
riots+0.802
Neg logit contributions
alid-0.667
vier-0.652
 Agency-0.648
cot-0.639
edin-0.633
Token rectangular
Token ID36954
Feature activation+3.41e+00

Pos logit contributions
erva+1.040
 depress+1.012
 regress+0.976
ック+0.957
riots+0.944
Neg logit contributions
alid-0.768
vier-0.750
 Agency-0.750
edin-0.739
cot-0.734
Token podium
Token ID27941
Feature activation+6.92e+00

Pos logit contributions
erva+2.534
 depress+2.491
 regress+2.381
ック+2.312
riots+2.268
Neg logit contributions
alid-2.350
vier-2.308
 Agency-2.280
edin-2.264
cot-2.248
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
erva+5.184
 depress+5.175
 regress+4.880
ック+4.701
uries+4.626
Neg logit contributions
alid-4.603
vier-4.537
 Agency-4.441
edin-4.422
cot-4.342
Token In
Token ID554
Feature activation+4.99e-01

Pos logit contributions
erva+0.457
 depress+0.448
 regress+0.430
ック+0.416
riots+0.410
Neg logit contributions
alid-0.420
 Agency-0.413
vier-0.412
edin-0.406
cot-0.402
Token the
Token ID262
Feature activation+5.95e-01

Pos logit contributions
erva+0.387
 depress+0.378
 regress+0.364
ック+0.356
riots+0.352
Neg logit contributions
alid-0.327
vier-0.320
 Agency-0.320
edin-0.316
cot-0.314
Token span
Token ID11506
Feature activation+6.02e-01

Pos logit contributions
erva+0.457
 depress+0.443
 regress+0.425
ック+0.417
riots+0.411
Neg logit contributions
alid-0.398
 Agency-0.390
vier-0.389
edin-0.384
cot-0.382
Token of
Token ID286
Feature activation-3.82e-01

Pos logit contributions
erva+0.530
 depress+0.519
 regress+0.502
ック+0.490
uries+0.484
Neg logit contributions
alid-0.332
vier-0.326
 Agency-0.323
edin-0.319
cot-0.316
Token
Token ID447
Feature activation+1.01e+00

Pos logit contributions
erva+0.550
 depress+0.539
 regress+0.516
ック+0.499
riots+0.494
Neg logit contributions
alid-0.530
 Agency-0.521
vier-0.521
edin-0.514
cot-0.509
Token
Token ID251
Feature activation+6.38e-01

Pos logit contributions
erva+0.528
 depress+0.516
 regress+0.483
ック+0.465
riots+0.454
Neg logit contributions
alid-0.916
vier-0.898
 Agency-0.896
edin-0.891
cot-0.888
Token Never
Token ID7236
Feature activation-8.14e-01

Pos logit contributions
erva+0.470
 depress+0.460
 regress+0.441
ック+0.428
riots+0.421
Neg logit contributions
alid-0.444
vier-0.435
 Agency-0.434
edin-0.430
cot-0.427
Token Mind
Token ID10175
Feature activation+3.34e+00

Pos logit contributions
alid+0.508
vier+0.505
 Agency+0.499
edin+0.493
cot+0.491
Neg logit contributions
erva-0.654
 depress-0.633
 regress-0.604
ック-0.597
riots-0.593
Token the
Token ID262
Feature activation+8.53e-01

Pos logit contributions
erva+2.511
 depress+2.453
 regress+2.349
ック+2.292
riots+2.253
Neg logit contributions
alid-2.286
vier-2.231
 Agency-2.209
edin-2.198
cot-2.173
Token Boll
Token ID43479
Feature activation+6.47e+00

Pos logit contributions
erva+0.643
 depress+0.628
 regress+0.601
ック+0.588
riots+0.576
Neg logit contributions
alid-0.581
vier-0.569
 Agency-0.569
edin-0.560
cot-0.557
Tokenocks
Token ID3320
Feature activation+4.48e-01

Pos logit contributions
erva+2.762
 depress+2.666
 regress+2.460
uries+2.321
ック+2.320
Neg logit contributions
alid-6.424
vier-6.324
 Agency-6.274
edin-6.223
Saharan-6.222
Token,
Token ID11
Feature activation+2.06e-01

Pos logit contributions
erva+0.326
 depress+0.317
 regress+0.303
ック+0.297
riots+0.293
Neg logit contributions
alid-0.316
 Agency-0.310
vier-0.310
edin-0.305
cot-0.304
Token Here
Token ID3423
Feature activation-6.07e-01

Pos logit contributions
erva+0.154
 depress+0.149
 regress+0.144
ック+0.140
riots+0.139
Neg logit contributions
alid-0.141
 Agency-0.138
vier-0.138
edin-0.136
cot-0.136
Token
Token ID447
Feature activation+9.81e-01

Pos logit contributions
alid+0.328
vier+0.322
 Agency+0.321
cot+0.316
edin+0.315
Neg logit contributions
erva-0.538
 depress-0.531
 regress-0.514
ック-0.501
riots-0.492
Token
Token ID247
Feature activation+7.62e-02

Pos logit contributions
erva+0.529
 depress+0.514
 regress+0.483
ック+0.466
riots+0.455
Neg logit contributions
alid-0.876
vier-0.861
 Agency-0.857
edin-0.852
cot-0.848
Tokens
Token ID82
Feature activation-3.75e-01

Pos logit contributions
erva+1.301
 depress+1.267
 regress+1.209
ック+1.158
riots+1.146
Neg logit contributions
alid-1.701
vier-1.675
 Agency-1.670
edin-1.650
cot-1.643
Token01
Token ID486
Feature activation+1.67e+00

Pos logit contributions
alid+0.269
vier+0.264
 Agency+0.263
edin+0.260
cot+0.258
Neg logit contributions
erva-0.269
 depress-0.262
 regress-0.251
ック-0.244
riots-0.240
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
erva+1.123
 depress+1.091
 regress+1.045
ック+1.003
riots+0.992
Neg logit contributions
alid-1.274
vier-1.249
 Agency-1.244
edin-1.232
cot-1.225
Tokenhtml
Token ID6494
Feature activation-1.10e+00

Pos logit contributions
alid+0.578
vier+0.562
 Agency+0.558
edin+0.548
cot+0.540
Neg logit contributions
erva-1.463
 depress-1.437
 regress-1.395
ック-1.372
uries-1.352
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
alid+0.893
vier+0.879
 Agency+0.869
edin+0.866
cot+0.860
Neg logit contributions
erva-0.681
 depress-0.662
 regress-0.628
ック-0.607
riots-0.596
Token Jo
Token ID5302
Feature activation+6.51e+00

Pos logit contributions
alid+0.612
 Agency+0.598
vier+0.597
edin+0.586
cot+0.580
Neg logit contributions
erva-0.982
 depress-0.962
 regress-0.929
ック-0.910
riots-0.897
Tokeniner
Token ID7274
Feature activation+3.63e+00

Pos logit contributions
erva+4.403
 depress+4.265
 regress+4.025
ック+3.924
uries+3.881
Neg logit contributions
alid-4.801
vier-4.743
 Agency-4.718
cot-4.666
edin-4.657
Token T
Token ID309
Feature activation+2.59e+00

Pos logit contributions
erva+2.605
 depress+2.545
 regress+2.429
ック+2.357
riots+2.324
Neg logit contributions
alid-2.578
vier-2.521
edin-2.491
 Agency-2.481
cot-2.470
Token.
Token ID13
Feature activation-5.81e-01

Pos logit contributions
erva+1.674
 depress+1.629
 regress+1.548
ック+1.502
riots+1.474
Neg logit contributions
alid-2.039
vier-2.004
 Agency-1.990
edin-1.976
cot-1.964
Token Why
Token ID4162
Feature activation-5.07e+00

Pos logit contributions
alid+0.418
 Agency+0.413
vier+0.411
edin+0.405
cot+0.401
Neg logit contributions
erva-0.415
 depress-0.406
 regress-0.391
ック-0.376
riots-0.371
Token people
Token ID661
Feature activation+2.10e-01

Pos logit contributions
vier+3.240
alid+3.234
cot+3.216
edin+3.202
 Agency+3.178
Neg logit contributions
erva-3.897
 depress-3.830
ック-3.652
 regress-3.647
uries-3.550
Token as
Token ID355
Feature activation+2.02e+00

Pos logit contributions
erva+1.854
 depress+1.828
 regress+1.738
ック+1.705
riots+1.652
Neg logit contributions
alid-2.064
vier-2.041
 Agency-2.008
cot-2.003
edin-1.999
Token those
Token ID883
Feature activation+6.16e-02

Pos logit contributions
erva+1.461
 depress+1.422
 regress+1.362
ック+1.329
uries+1.306
Neg logit contributions
alid-1.439
 Agency-1.405
vier-1.404
edin-1.387
cot-1.378
Token who
Token ID508
Feature activation-1.00e+00

Pos logit contributions
erva+0.053
 depress+0.052
 regress+0.050
ック+0.049
riots+0.048
Neg logit contributions
alid-0.035
 Agency-0.034
vier-0.034
edin-0.034
cot-0.033
Token perpetrated
Token ID35524
Feature activation-1.69e+00

Pos logit contributions
alid+0.599
 Agency+0.582
vier+0.579
edin+0.573
cot+0.570
Neg logit contributions
erva-0.842
 depress-0.812
 regress-0.786
ック-0.778
riots-0.765
Token the
Token ID262
Feature activation-6.99e-01

Pos logit contributions
alid+1.105
 Agency+1.076
vier+1.055
cot+1.054
edin+1.038
Neg logit contributions
erva-1.320
 depress-1.277
 regress-1.229
ック-1.217
uries-1.195
Token horrific
Token ID19447
Feature activation+1.62e+00

Pos logit contributions
alid+0.458
 Agency+0.448
vier+0.440
cot+0.436
edin+0.434
Neg logit contributions
erva-0.546
 depress-0.529
 regress-0.511
ック-0.502
riots-0.492
Token massacre
Token ID19050
Feature activation+2.59e+00

Pos logit contributions
erva+1.189
 depress+1.160
 regress+1.111
ック+1.083
riots+1.063
Neg logit contributions
alid-1.142
vier-1.113
 Agency-1.111
edin-1.098
cot-1.093
Token in
Token ID287
Feature activation+3.59e-01

Pos logit contributions
erva+2.181
 depress+2.138
 regress+2.052
ック+2.007
riots+1.978
Neg logit contributions
alid-1.530
vier-1.505
 Agency-1.485
edin-1.471
cot-1.463
Token Har
Token ID2113
Feature activation-3.39e-01

Pos logit contributions
erva+0.276
 depress+0.269
 regress+0.258
ック+0.253
uries+0.249
Neg logit contributions
alid-0.241
 Agency-0.235
vier-0.233
edin-0.230
cot-0.230
Token No
Token ID1400
Feature activation+2.43e+00

Pos logit contributions
alid+0.241
 Agency+0.234
vier+0.234
edin+0.232
cot+0.231
Neg logit contributions
erva-0.247
 depress-0.243
 regress-0.232
ック-0.227
uries-0.221
Tokenf
Token ID69
Feature activation+1.85e-01

Pos logit contributions
erva+1.866
 depress+1.819
 regress+1.745
ック+1.701
riots+1.676
Neg logit contributions
alid-1.620
vier-1.587
 Agency-1.573
edin-1.561
cot-1.550
Token Jerusalem
Token ID10843
Feature activation-6.51e-01

Pos logit contributions
alid+0.048
 Agency+0.047
vier+0.046
cot+0.046
edin+0.046
Neg logit contributions
erva-0.070
 depress-0.067
 regress-0.065
ック-0.064
riots-0.063
Token,
Token ID11
Feature activation+1.74e-01

Pos logit contributions
alid+0.467
 Agency+0.457
vier+0.457
cot+0.451
edin+0.448
Neg logit contributions
erva-0.469
 depress-0.453
 regress-0.436
ック-0.424
riots-0.419
Token but
Token ID475
Feature activation-2.69e+00

Pos logit contributions
erva+0.137
 depress+0.134
 regress+0.129
ック+0.125
riots+0.124
Neg logit contributions
alid-0.114
vier-0.111
 Agency-0.110
edin-0.109
cot-0.108
Token
Token ID564
Feature activation+1.00e+00

Pos logit contributions
alid+1.622
 Agency+1.606
vier+1.575
edin+1.573
cot+1.560
Neg logit contributions
erva-2.216
 depress-2.121
 regress-2.063
ック-2.032
riots-1.995
Token
Token ID250
Feature activation+3.73e-01

Pos logit contributions
erva+0.385
 depress+0.368
 regress+0.337
ック+0.319
riots+0.308
Neg logit contributions
alid-1.054
vier-1.040
 Agency-1.035
edin-1.029
cot-1.024
Tokenwill
Token ID10594
Feature activation+1.94e+00

Pos logit contributions
erva+0.272
 depress+0.266
 regress+0.255
ック+0.248
uries+0.243
Neg logit contributions
alid-0.263
vier-0.258
 Agency-0.255
edin-0.254
cot-0.254
Token not
Token ID407
Feature activation+1.29e+00

Pos logit contributions
erva+1.639
 depress+1.602
 regress+1.542
ック+1.510
riots+1.488
Neg logit contributions
alid-1.146
vier-1.118
 Agency-1.109
edin-1.100
cot-1.092
Token go
Token ID467
Feature activation-8.58e-01

Pos logit contributions
erva+1.028
 depress+0.990
 regress+0.952
ック+0.941
riots+0.926
Neg logit contributions
alid-0.822
vier-0.800
 Agency-0.798
edin-0.792
cot-0.792
Token crazy
Token ID7165
Feature activation+2.16e+00

Pos logit contributions
alid+0.547
vier+0.540
 Agency+0.539
cot+0.527
edin+0.526
Neg logit contributions
erva-0.683
 depress-0.657
 regress-0.636
ック-0.621
riots-0.612
Token
Token ID447
Feature activation+3.81e-01

Pos logit contributions
erva+1.708
 depress+1.675
 regress+1.607
ック+1.570
riots+1.543
Neg logit contributions
alid-1.394
vier-1.361
 Agency-1.353
edin-1.340
cot-1.327
Token
Token ID251
Feature activation+4.18e-01

Pos logit contributions
erva+0.189
 depress+0.184
 regress+0.172
ック+0.165
riots+0.160
Neg logit contributions
alid-0.356
vier-0.351
 Agency-0.349
edin-0.347
cot-0.345
Token child
Token ID1200
Feature activation-4.38e+00

Pos logit contributions
alid+0.216
 Agency+0.212
vier+0.211
edin+0.208
cot+0.207
Neg logit contributions
erva-0.269
 depress-0.261
 regress-0.250
ック-0.245
riots-0.242
Token sexual
Token ID3206
Feature activation-8.62e-02

Pos logit contributions
alid+2.813
cot+2.792
 Agency+2.780
vier+2.748
edin+2.722
Neg logit contributions
erva-3.407
 depress-3.275
 regress-3.116
ック-3.082
uries-3.047
Token abuse
Token ID5076
Feature activation-3.46e+00

Pos logit contributions
alid+0.055
vier+0.054
 Agency+0.054
edin+0.053
cot+0.053
Neg logit contributions
erva-0.068
 depress-0.067
 regress-0.064
ック-0.063
riots-0.062
Token content
Token ID2695
Feature activation-3.67e-01

Pos logit contributions
 Agency+2.157
alid+2.151
vier+2.093
cot+2.092
edin+2.084
Neg logit contributions
erva-2.781
 depress-2.661
 regress-2.547
ック-2.544
riots-2.537
Token in
Token ID287
Feature activation+3.12e-02

Pos logit contributions
alid+0.224
vier+0.220
 Agency+0.218
edin+0.216
cot+0.214
Neg logit contributions
erva-0.303
 depress-0.296
 regress-0.285
ック-0.279
riots-0.276
Token Australia
Token ID4505
Feature activation+2.43e+00

Pos logit contributions
erva+0.026
 depress+0.025
 regress+0.024
ック+0.023
riots+0.023
Neg logit contributions
alid-0.019
 Agency-0.019
vier-0.019
edin-0.019
cot-0.018
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
erva+1.799
 depress+1.760
 regress+1.685
ック+1.639
uries+1.612
Neg logit contributions
alid-1.675
vier-1.638
 Agency-1.621
edin-1.612
cot-1.601
Token
Token ID447
Feature activation-5.56e-02

Pos logit contributions
alid+0.067
vier+0.065
 Agency+0.065
edin+0.064
cot+0.064
Neg logit contributions
erva-0.086
 depress-0.084
 regress-0.081
ック-0.079
riots-0.078
Token
Token ID251
Feature activation-1.39e-01

Pos logit contributions
alid+0.054
vier+0.053
 Agency+0.053
edin+0.053
cot+0.053
Neg logit contributions
erva-0.025
 depress-0.025
 regress-0.023
ック-0.022
uries-0.021
Token the
Token ID262
Feature activation-7.73e-01

Pos logit contributions
alid+0.088
vier+0.085
 Agency+0.085
edin+0.084
cot+0.084
Neg logit contributions
erva-0.112
 depress-0.109
 regress-0.105
ック-0.102
uries-0.101
Token spokesperson
Token ID11821
Feature activation-2.26e+00

Pos logit contributions
alid+0.341
vier+0.332
 Agency+0.331
edin+0.324
cot+0.321
Neg logit contributions
erva-0.768
 depress-0.753
 regress-0.729
ック-0.715
riots-0.707
Tokenique
Token ID2350
Feature activation-2.26e+00

Pos logit contributions
alid+0.972
 Agency+0.956
vier+0.951
edin+0.941
cot+0.937
Neg logit contributions
erva-0.711
 depress-0.697
 regress-0.664
ック-0.635
riots-0.620
Token:
Token ID25
Feature activation-2.72e-01

Pos logit contributions
alid+1.552
 Agency+1.522
vier+1.515
edin+1.499
cot+1.484
Neg logit contributions
erva-1.700
 depress-1.654
 regress-1.586
ック-1.547
riots-1.523
Token Great
Token ID3878
Feature activation-5.89e-01

Pos logit contributions
alid+0.176
 Agency+0.174
vier+0.173
edin+0.170
cot+0.169
Neg logit contributions
erva-0.214
 depress-0.208
 regress-0.200
ック-0.196
riots-0.193
Token hike
Token ID18390
Feature activation+1.19e+00

Pos logit contributions
 Agency+0.378
alid+0.376
vier+0.370
edin+0.367
cot+0.366
Neg logit contributions
erva-0.465
 depress-0.443
 regress-0.432
ック-0.428
riots-0.423
Token,
Token ID11
Feature activation+2.93e-01

Pos logit contributions
erva+0.890
 depress+0.865
 regress+0.834
ック+0.813
riots+0.804
Neg logit contributions
alid-0.814
 Agency-0.799
vier-0.796
edin-0.787
cot-0.783
Token tough
Token ID5802
Feature activation+2.04e+00

Pos logit contributions
erva+0.228
 depress+0.220
 regress+0.213
ック+0.210
riots+0.205
Neg logit contributions
alid-0.193
 Agency-0.189
vier-0.188
edin-0.186
cot-0.185
Token as
Token ID355
Feature activation+1.81e+00

Pos logit contributions
erva+1.572
 depress+1.521
 regress+1.469
ック+1.439
uries+1.417
Neg logit contributions
alid-1.356
 Agency-1.331
vier-1.324
edin-1.316
cot-1.304
Token nails
Token ID23361
Feature activation-5.60e+00

Pos logit contributions
erva+1.348
 depress+1.316
 regress+1.262
ック+1.229
riots+1.207
Neg logit contributions
alid-1.242
vier-1.218
 Agency-1.213
edin-1.201
cot-1.195
Token though
Token ID996
Feature activation+5.19e-01

Pos logit contributions
 Agency+3.669
alid+3.577
cot+3.551
edin+3.519
vier+3.424
Neg logit contributions
erva-4.212
ック-3.923
 depress-3.922
 regress-3.897
riots-3.824
Token.
Token ID13
Feature activation+4.13e-01

Pos logit contributions
erva+0.364
 depress+0.352
 regress+0.339
ック+0.331
riots+0.325
Neg logit contributions
alid-0.380
 Agency-0.373
vier-0.372
edin-0.369
cot-0.367
Token The
Token ID383
Feature activation+5.79e-01

Pos logit contributions
erva+0.313
 depress+0.305
 regress+0.293
ック+0.286
riots+0.281
Neg logit contributions
alid-0.280
 Agency-0.275
vier-0.274
edin-0.271
cot-0.269
Token,
Token ID11
Feature activation+9.93e-02

Pos logit contributions
erva+0.220
 depress+0.214
 regress+0.206
ック+0.202
riots+0.198
Neg logit contributions
alid-0.183
vier-0.179
 Agency-0.179
edin-0.176
cot-0.175
Token a
Token ID257
Feature activation+5.31e-01

Pos logit contributions
erva+0.085
 depress+0.083
 regress+0.080
ック+0.079
riots+0.077
Neg logit contributions
alid-0.057
vier-0.056
 Agency-0.056
edin-0.055
cot-0.054
Token Hispanic
Token ID16949
Feature activation-9.35e-01

Pos logit contributions
erva+0.407
 depress+0.392
 regress+0.379
ック+0.371
riots+0.366
Neg logit contributions
alid-0.355
 Agency-0.348
vier-0.347
edin-0.341
cot-0.340
Token man
Token ID582
Feature activation-2.30e+00

Pos logit contributions
alid+0.595
 Agency+0.587
vier+0.584
edin+0.571
cot+0.568
Neg logit contributions
erva-0.747
 depress-0.718
 regress-0.700
ック-0.680
uries-0.671
Token of
Token ID286
Feature activation-8.27e-01

Pos logit contributions
alid+1.503
vier+1.467
 Agency+1.462
edin+1.436
cot+1.436
Neg logit contributions
erva-1.790
 depress-1.743
 regress-1.681
ック-1.641
riots-1.618
Token Puerto
Token ID15295
Feature activation+4.96e+00

Pos logit contributions
alid+0.521
vier+0.511
 Agency+0.504
edin+0.502
cot+0.497
Neg logit contributions
erva-0.664
 depress-0.654
 regress-0.626
ック-0.610
riots-0.601
Token Rican
Token ID44459
Feature activation+2.26e+00

Pos logit contributions
erva+2.332
 depress+2.278
 regress+2.063
ック+1.997
riots+1.959
Neg logit contributions
alid-4.708
vier-4.651
edin-4.648
 Agency-4.616
cot-4.577
Token heritage
Token ID15012
Feature activation+6.58e-02

Pos logit contributions
erva+1.629
 depress+1.613
 regress+1.528
ック+1.482
riots+1.452
Neg logit contributions
alid-1.606
vier-1.581
edin-1.560
 Agency-1.556
cot-1.536
Token,
Token ID11
Feature activation+5.48e-01

Pos logit contributions
erva+0.042
 depress+0.041
 regress+0.039
ック+0.038
riots+0.037
Neg logit contributions
alid-0.052
vier-0.051
edin-0.051
 Agency-0.051
cot-0.050
Token was
Token ID373
Feature activation+4.72e-01

Pos logit contributions
erva+0.445
 depress+0.435
 regress+0.419
ック+0.409
riots+0.403
Neg logit contributions
alid-0.341
vier-0.333
 Agency-0.331
edin-0.328
cot-0.325
Token born
Token ID4642
Feature activation+8.56e-01

Pos logit contributions
erva+0.404
 depress+0.395
 regress+0.381
ック+0.374
riots+0.368
Neg logit contributions
alid-0.272
vier-0.265
 Agency-0.264
edin-0.261
cot-0.259
Token actually
Token ID1682
Feature activation+3.82e+00

Pos logit contributions
erva+0.570
 depress+0.548
 regress+0.525
ック+0.521
riots+0.515
Neg logit contributions
alid-0.450
 Agency-0.441
vier-0.440
edin-0.434
cot-0.432
Token start
Token ID923
Feature activation+3.32e+00

Pos logit contributions
erva+3.004
 depress+2.989
 regress+2.877
ック+2.765
uries+2.705
Neg logit contributions
alid-2.440
vier-2.393
 Agency-2.356
edin-2.337
cot-2.323
Token grading
Token ID43165
Feature activation-2.37e+00

Pos logit contributions
erva+2.565
 depress+2.520
 regress+2.420
ック+2.350
riots+2.305
Neg logit contributions
alid-2.185
vier-2.133
 Agency-2.111
edin-2.094
cot-2.079
Token.
Token ID13
Feature activation+7.60e-01

Pos logit contributions
 Agency+1.683
alid+1.662
vier+1.614
edin+1.613
cot+1.610
Neg logit contributions
erva-1.715
 depress-1.618
ック-1.541
riots-1.534
��-1.513
Token Parents
Token ID28231
Feature activation-2.53e+00

Pos logit contributions
erva+0.587
 depress+0.575
 regress+0.551
ック+0.535
riots+0.530
Neg logit contributions
alid-0.504
 Agency-0.498
vier-0.494
edin-0.487
cot-0.482
Token who
Token ID508
Feature activation-2.64e-01

Pos logit contributions
alid+1.541
 Agency+1.541
vier+1.533
cot+1.514
edin+1.489
Neg logit contributions
erva-2.039
 depress-1.941
ック-1.881
 regress-1.862
riots-1.846
Token had
Token ID550
Feature activation+8.12e-01

Pos logit contributions
alid+0.166
vier+0.164
 Agency+0.163
edin+0.161
cot+0.160
Neg logit contributions
erva-0.212
 depress-0.204
 regress-0.197
ック-0.195
riots-0.192
Token been
Token ID587
Feature activation-2.55e+00

Pos logit contributions
erva+0.680
 depress+0.659
 regress+0.635
ック+0.627
riots+0.617
Neg logit contributions
alid-0.487
vier-0.477
 Agency-0.475
edin-0.468
cot-0.465
Token thrilled
Token ID20536
Feature activation-1.36e+00

Pos logit contributions
 Agency+1.494
alid+1.487
vier+1.474
edin+1.462
cot+1.453
Neg logit contributions
erva-2.142
 depress-2.021
ック-1.964
 regress-1.960
riots-1.937
Token with
Token ID351
Feature activation+1.26e+00

Pos logit contributions
alid+0.759
vier+0.744
 Agency+0.735
edin+0.730
cot+0.721
Neg logit contributions
erva-1.188
 depress-1.158
 regress-1.121
ック-1.100
riots-1.082
Token consistent
Token ID6414
Feature activation+1.26e+00

Pos logit contributions
erva+0.960
 depress+0.938
 regress+0.898
ック+0.878
riots+0.864
Neg logit contributions
alid-0.843
vier-0.825
 Agency-0.821
edin-0.812
cot-0.807
Token.
Token ID13
Feature activation-1.74e+00

Pos logit contributions
erva+0.378
 depress+0.369
 regress+0.352
ック+0.342
riots+0.335
Neg logit contributions
alid-0.409
vier-0.403
 Agency-0.402
edin-0.396
cot-0.394
Token0
Token ID15
Feature activation-9.36e-01

Pos logit contributions
alid+0.968
 Agency+0.958
vier+0.951
edin+0.929
cot+0.924
Neg logit contributions
erva-1.524
 depress-1.494
 regress-1.443
ック-1.392
riots-1.386
Token and
Token ID290
Feature activation-1.61e+00

Pos logit contributions
alid+0.601
 Agency+0.596
vier+0.590
cot+0.578
edin+0.578
Neg logit contributions
erva-0.734
 depress-0.721
 regress-0.690
ック-0.672
riots-0.665
Token Oklahoma
Token ID10433
Feature activation+2.61e-01

Pos logit contributions
alid+1.056
 Agency+1.053
vier+1.038
cot+1.027
edin+1.013
Neg logit contributions
erva-1.248
 depress-1.217
 regress-1.167
ック-1.139
riots-1.124
Token Strong
Token ID13535
Feature activation-4.85e+00

Pos logit contributions
erva+0.205
 depress+0.200
 regress+0.192
ック+0.187
riots+0.185
Neg logit contributions
alid-0.169
vier-0.166
 Agency-0.165
edin-0.163
cot-0.162
Token.
Token ID13
Feature activation-6.49e-01

Pos logit contributions
 Agency+3.320
vier+3.211
edin+3.200
alid+3.177
cot+3.154
Neg logit contributions
erva-3.603
 depress-3.527
 regress-3.383
��-3.370
ック-3.278
Token In
Token ID554
Feature activation-3.41e-01

Pos logit contributions
alid+0.437
 Agency+0.431
vier+0.426
cot+0.420
edin+0.418
Neg logit contributions
erva-0.494
 depress-0.485
 regress-0.466
ック-0.449
riots-0.445
Token 2014
Token ID1946
Feature activation+3.47e+00

Pos logit contributions
alid+0.226
 Agency+0.223
vier+0.222
cot+0.218
edin+0.218
Neg logit contributions
erva-0.261
 depress-0.254
 regress-0.243
ック-0.240
riots-0.237
Token,
Token ID11
Feature activation-6.61e-01

Pos logit contributions
erva+2.145
 depress+2.092
 regress+1.998
ック+1.932
uries+1.881
Neg logit contributions
alid-2.805
vier-2.777
edin-2.746
 Agency-2.700
cot-2.688
Token The
Token ID383
Feature activation-9.92e-01

Pos logit contributions
alid+0.443
 Agency+0.439
vier+0.430
cot+0.427
edin+0.421
Neg logit contributions
erva-0.505
 depress-0.489
 regress-0.468
ック-0.460
riots-0.453
Token New
Token ID968
Feature activation+1.67e+00

Pos logit contributions
alid+0.628
 Agency+0.617
vier+0.617
edin+0.601
cot+0.599
Neg logit contributions
erva-0.796
 depress-0.780
 regress-0.748
ック-0.731
riots-0.719
Token.
Token ID13
Feature activation-6.98e-01

Pos logit contributions
 Agency+2.415
alid+2.377
vier+2.369
cot+2.324
edin+2.313
Neg logit contributions
erva-2.672
 depress-2.603
 regress-2.501
ック-2.430
riots-2.422
Token Visit
Token ID16440
Feature activation+3.61e-01

Pos logit contributions
alid+0.437
vier+0.428
 Agency+0.424
edin+0.420
cot+0.417
Neg logit contributions
erva-0.565
 depress-0.553
 regress-0.531
ック-0.519
riots-0.511
Token www
Token ID7324
Feature activation+2.35e+00

Pos logit contributions
erva+0.300
 depress+0.293
 regress+0.281
ック+0.275
riots+0.271
Neg logit contributions
alid-0.219
vier-0.214
 Agency-0.213
edin-0.210
cot-0.209
Token.
Token ID13
Feature activation-9.83e-01

Pos logit contributions
erva+1.412
 depress+1.371
 regress+1.298
ック+1.267
uries+1.234
Neg logit contributions
alid-1.943
vier-1.907
 Agency-1.895
edin-1.882
cot-1.869
Tokenplay
Token ID1759
Feature activation-2.96e-01

Pos logit contributions
alid+0.701
vier+0.688
 Agency+0.679
edin+0.679
cot+0.678
Neg logit contributions
erva-0.709
 depress-0.695
 regress-0.665
ック-0.644
riots-0.630
Tokenst
Token ID301
Feature activation+5.18e-01

Pos logit contributions
alid+0.195
vier+0.191
 Agency+0.189
edin+0.188
cot+0.188
Neg logit contributions
erva-0.228
 depress-0.224
 regress-0.215
ック-0.210
riots-0.206
Tokenations
Token ID602
Feature activation-1.19e+00

Pos logit contributions
erva+0.361
 depress+0.354
 regress+0.338
ック+0.329
riots+0.322
Neg logit contributions
alid-0.381
vier-0.374
 Agency-0.370
edin-0.368
cot-0.367
Tokenchool
Token ID1251
Feature activation-2.47e+00

Pos logit contributions
alid+0.890
vier+0.874
 Agency+0.864
edin+0.861
cot+0.859
Neg logit contributions
erva-0.816
 depress-0.799
 regress-0.763
ック-0.745
riots-0.726
Tokensc
Token ID1416
Feature activation+4.39e-01

Pos logit contributions
alid+1.962
vier+1.930
 Agency+1.909
cot+1.906
edin+1.904
Neg logit contributions
erva-1.585
 depress-1.555
 regress-1.474
ック-1.438
riots-1.388
Tokenup
Token ID929
Feature activation+1.44e+00

Pos logit contributions
erva+0.313
 depress+0.306
 regress+0.293
ック+0.286
riots+0.279
Neg logit contributions
alid-0.317
vier-0.310
 Agency-0.308
edin-0.306
cot-0.304
Token.
Token ID13
Feature activation+8.18e-04

Pos logit contributions
erva+0.970
 depress+0.941
 regress+0.898
ック+0.868
riots+0.858
Neg logit contributions
alid-1.100
vier-1.077
 Agency-1.072
edin-1.063
cot-1.054
Token earlier
Token ID2961
Feature activation+6.77e-01

Pos logit contributions
alid+1.014
vier+1.000
 Agency+0.992
edin+0.978
cot+0.973
Neg logit contributions
erva-1.579
 depress-1.539
 regress-1.481
ック-1.458
riots-1.445
Token
Token ID447
Feature activation+4.77e-01

Pos logit contributions
erva+0.529
 depress+0.514
 regress+0.492
ック+0.481
riots+0.476
Neg logit contributions
alid-0.443
vier-0.436
 Agency-0.435
edin-0.426
cot-0.425
Token
Token ID251
Feature activation+5.16e-01

Pos logit contributions
erva+0.260
 depress+0.258
 regress+0.241
ック+0.234
riots+0.226
Neg logit contributions
alid-0.418
vier-0.410
 Agency-0.409
edin-0.407
cot-0.405
Token flashbacks
Token ID49555
Feature activation+7.56e-01

Pos logit contributions
erva+0.350
 depress+0.342
 regress+0.326
ック+0.318
riots+0.310
Neg logit contributions
alid-0.390
vier-0.383
 Agency-0.380
edin-0.378
cot-0.376
Token to
Token ID284
Feature activation+5.94e-01

Pos logit contributions
erva+0.613
 depress+0.599
 regress+0.576
ック+0.563
riots+0.555
Neg logit contributions
alid-0.472
vier-0.461
 Agency-0.458
edin-0.453
cot-0.450
Token a
Token ID257
Feature activation+3.92e-01

Pos logit contributions
erva+0.468
 depress+0.452
 regress+0.434
ック+0.426
riots+0.419
Neg logit contributions
alid-0.387
 Agency-0.379
vier-0.378
edin-0.372
cot-0.370
Token super
Token ID2208
Feature activation+3.17e+00

Pos logit contributions
erva+0.307
 depress+0.296
 regress+0.285
ック+0.280
riots+0.275
Neg logit contributions
alid-0.255
vier-0.251
 Agency-0.250
edin-0.247
cot-0.245
Token-
Token ID12
Feature activation+1.66e+00

Pos logit contributions
erva+2.280
 depress+2.248
 regress+2.151
ック+2.078
riots+2.055
Neg logit contributions
alid-2.244
vier-2.191
 Agency-2.178
edin-2.177
cot-2.141
Tokenp
Token ID79
Feature activation+1.01e+00

Pos logit contributions
erva+1.233
 depress+1.207
 regress+1.155
ック+1.124
riots+1.109
Neg logit contributions
alid-1.149
vier-1.125
 Agency-1.118
edin-1.109
cot-1.099
Tokenreg
Token ID2301
Feature activation+1.61e+00

Pos logit contributions
erva+0.678
 depress+0.662
 regress+0.632
ック+0.617
riots+0.599
Neg logit contributions
alid-0.780
vier-0.762
 Agency-0.755
edin-0.751
cot-0.746
Tokennant
Token ID22057
Feature activation+3.10e+00

Pos logit contributions
erva+1.271
 depress+1.244
 regress+1.187
ック+1.155
riots+1.143
Neg logit contributions
vier-1.018
alid-1.015
 Agency-1.002
edin-0.989
cot-0.983
Token
Token ID198
Feature activation-2.48e-01

Pos logit contributions
alid+0.306
 Agency+0.301
vier+0.301
edin+0.296
cot+0.293
Neg logit contributions
erva-0.361
 depress-0.352
 regress-0.338
ック-0.328
riots-0.324
TokenThese
Token ID4711
Feature activation-5.83e-01

Pos logit contributions
alid+0.151
vier+0.149
 Agency+0.147
edin+0.145
cot+0.145
Neg logit contributions
erva-0.204
 depress-0.201
 regress-0.193
ック-0.187
riots-0.184
Token I
Token ID314
Feature activation-1.52e+00

Pos logit contributions
alid+0.390
vier+0.386
 Agency+0.385
cot+0.379
edin+0.378
Neg logit contributions
erva-0.443
 depress-0.426
 regress-0.407
ック-0.404
riots-0.395
Token do
Token ID466
Feature activation-6.49e-01

Pos logit contributions
alid+0.940
vier+0.916
 Agency+0.912
cot+0.910
edin+0.899
Neg logit contributions
erva-1.227
 depress-1.186
 regress-1.137
ック-1.136
riots-1.112
Token not
Token ID407
Feature activation+3.94e-01

Pos logit contributions
alid+0.351
vier+0.343
 Agency+0.342
cot+0.336
edin+0.334
Neg logit contributions
erva-0.578
 depress-0.560
 regress-0.540
ック-0.535
riots-0.530
Token wish
Token ID4601
Feature activation-5.10e-01

Pos logit contributions
erva+0.309
 depress+0.298
 regress+0.286
ック+0.283
riots+0.279
Neg logit contributions
alid-0.256
vier-0.251
 Agency-0.250
edin-0.247
cot-0.247
Token to
Token ID284
Feature activation-7.37e-01

Pos logit contributions
alid+0.322
vier+0.317
 Agency+0.316
edin+0.311
cot+0.310
Neg logit contributions
erva-0.406
 depress-0.395
 regress-0.379
ック-0.374
riots-0.368
Token slay
Token ID42596
Feature activation+2.42e+00

Pos logit contributions
alid+0.459
vier+0.449
 Agency+0.447
edin+0.444
cot+0.442
Neg logit contributions
erva-0.598
 depress-0.574
 regress-0.551
ック-0.549
riots-0.543
Token,
Token ID11
Feature activation-9.04e-02

Pos logit contributions
erva+1.659
 depress+1.621
 regress+1.546
ック+1.499
riots+1.470
Neg logit contributions
alid-1.808
vier-1.775
 Agency-1.760
edin-1.746
cot-1.734
Token even
Token ID772
Feature activation+3.43e+00

Pos logit contributions
alid+0.056
vier+0.055
 Agency+0.055
edin+0.054
cot+0.053
Neg logit contributions
erva-0.073
 depress-0.072
 regress-0.069
ック-0.068
riots-0.067
Token if
Token ID611
Feature activation+5.23e-01

Pos logit contributions
erva+2.617
 depress+2.567
 regress+2.459
ック+2.388
uries+2.352
Neg logit contributions
alid-2.308
vier-2.258
 Agency-2.237
edin-2.221
cot-2.203
Token for
Token ID329
Feature activation+9.51e-01

Pos logit contributions
erva+0.112
 depress+0.109
 regress+0.104
ック+0.103
riots+0.101
Neg logit contributions
alid-0.088
vier-0.086
 Agency-0.086
edin-0.085
cot-0.084
Token deployment
Token ID14833
Feature activation+4.66e-01

Pos logit contributions
erva+0.738
 depress+0.717
 regress+0.685
ック+0.676
riots+0.666
Neg logit contributions
alid-0.629
vier-0.617
 Agency-0.615
edin-0.603
cot-0.601
Token to
Token ID284
Feature activation+1.98e-01

Pos logit contributions
erva+0.346
 depress+0.336
 regress+0.321
ック+0.316
riots+0.311
Neg logit contributions
alid-0.322
 Agency-0.315
vier-0.315
edin-0.311
cot-0.310
Token enable
Token ID7139
Feature activation-3.85e-01

Pos logit contributions
erva+0.156
 depress+0.151
 regress+0.144
ック+0.142
riots+0.141
Neg logit contributions
alid-0.129
vier-0.127
 Agency-0.126
edin-0.124
cot-0.124
Token these
Token ID777
Feature activation-5.61e-01

Pos logit contributions
alid+0.245
 Agency+0.241
vier+0.240
edin+0.234
cot+0.234
Neg logit contributions
erva-0.308
 depress-0.298
 regress-0.284
ック-0.282
riots-0.279
Token techniques
Token ID7605
Feature activation-5.67e+00

Pos logit contributions
alid+0.409
 Agency+0.406
vier+0.401
cot+0.398
edin+0.395
Neg logit contributions
erva-0.396
 depress-0.376
ック-0.357
riots-0.354
 regress-0.352
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
alid+3.713
vier+3.630
 Agency+3.627
cot+3.570
edin+3.551
Neg logit contributions
erva-4.315
 depress-4.101
ック-3.954
riots-3.921
 regress-3.883
Token Even
Token ID3412
Feature activation+3.53e+00

Pos logit contributions
alid+0.221
 Agency+0.218
vier+0.216
edin+0.213
cot+0.211
Neg logit contributions
erva-0.258
 depress-0.252
 regress-0.241
ック-0.235
riots-0.234
Token without
Token ID1231
Feature activation-5.40e-01

Pos logit contributions
erva+2.714
 depress+2.662
 regress+2.559
ック+2.481
riots+2.440
Neg logit contributions
alid-2.341
vier-2.288
 Agency-2.266
edin-2.249
cot-2.237
Token them
Token ID606
Feature activation+1.74e+00

Pos logit contributions
alid+0.356
 Agency+0.350
vier+0.348
edin+0.341
cot+0.339
Neg logit contributions
erva-0.421
 depress-0.405
ック-0.385
 regress-0.384
riots-0.381
Token I
Token ID314
Feature activation-1.96e+00

Pos logit contributions
erva+1.173
 depress+1.143
 regress+1.088
ック+1.059
riots+1.039
Neg logit contributions
alid-1.325
vier-1.301
 Agency-1.292
edin-1.282
cot-1.274
Token-
Token ID12
Feature activation+1.16e+00

Pos logit contributions
erva+1.554
 depress+1.526
 regress+1.448
ック+1.401
riots+1.370
Neg logit contributions
alid-1.693
 Agency-1.656
vier-1.652
edin-1.635
cot-1.609
Tokenwall
Token ID11930
Feature activation+2.83e+00

Pos logit contributions
erva+0.674
 depress+0.651
 regress+0.614
ック+0.593
riots+0.587
Neg logit contributions
alid-0.994
 Agency-0.977
vier-0.974
edin-0.964
cot-0.953
Tokenaby
Token ID3930
Feature activation+1.87e+00

Pos logit contributions
erva+1.992
 depress+1.944
 regress+1.857
ック+1.808
riots+1.772
Neg logit contributions
alid-2.076
vier-2.037
 Agency-2.022
edin-2.006
cot-1.993
Token populations
Token ID9684
Feature activation+1.09e+00

Pos logit contributions
erva+1.275
 depress+1.229
 regress+1.172
ック+1.150
riots+1.126
Neg logit contributions
alid-1.416
vier-1.389
 Agency-1.386
edin-1.366
cot-1.365
Token are
Token ID389
Feature activation+2.68e-01

Pos logit contributions
erva+0.872
 depress+0.847
 regress+0.812
ック+0.800
riots+0.787
Neg logit contributions
alid-0.690
vier-0.676
 Agency-0.670
edin-0.663
cot-0.659
Token genetically
Token ID20807
Feature activation-4.97e+00

Pos logit contributions
erva+0.241
 depress+0.234
 regress+0.226
ック+0.223
uries+0.220
Neg logit contributions
alid-0.144
vier-0.140
 Agency-0.140
edin-0.137
cot-0.137
Token and
Token ID290
Feature activation-3.72e-01

Pos logit contributions
alid+1.852
 Agency+1.824
vier+1.753
 differentiated+1.736
cot+1.692
Neg logit contributions
erva-5.255
 depress-4.957
ック-4.869
uries-4.839
 regress-4.806
Token morph
Token ID17488
Feature activation-2.03e+00

Pos logit contributions
alid+0.226
vier+0.221
 Agency+0.220
edin+0.217
cot+0.216
Neg logit contributions
erva-0.307
 depress-0.299
 regress-0.288
ック-0.283
riots-0.279
Tokenologically
Token ID13437
Feature activation+5.76e-02

Pos logit contributions
alid+0.958
vier+0.916
 Agency+0.905
edin+0.900
cot+0.890
Neg logit contributions
erva-1.968
 depress-1.923
 regress-1.862
ック-1.834
riots-1.811
Token distinct
Token ID7310
Feature activation+1.74e-02

Pos logit contributions
erva+0.057
 depress+0.056
 regress+0.054
ック+0.054
riots+0.053
Neg logit contributions
alid-0.025
vier-0.024
 Agency-0.024
edin-0.024
cot-0.024
Token and
Token ID290
Feature activation+5.37e-01

Pos logit contributions
erva+0.013
 depress+0.012
 regress+0.012
ック+0.011
riots+0.011
Neg logit contributions
alid-0.012
 Agency-0.012
vier-0.012
cot-0.012
edin-0.012
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
alid+0.373
vier+0.366
 Agency+0.364
edin+0.361
cot+0.359
Neg logit contributions
erva-0.290
 depress-0.282
 regress-0.268
ック-0.260
riots-0.255
Token some
Token ID617
Feature activation-6.29e-01

Pos logit contributions
erva+0.083
 depress+0.080
 regress+0.078
ック+0.077
riots+0.075
Neg logit contributions
alid-0.064
 Agency-0.063
vier-0.063
edin-0.062
cot-0.061
Token North
Token ID2258
Feature activation-4.89e+00

Pos logit contributions
alid+0.368
 Agency+0.364
vier+0.360
cot+0.354
edin+0.351
Neg logit contributions
erva-0.534
 depress-0.514
 regress-0.498
ック-0.491
riots-0.484
Token Korean
Token ID6983
Feature activation+1.52e-01

Pos logit contributions
alid+2.588
 Agency+2.572
vier+2.515
edin+2.489
cot+2.474
Neg logit contributions
erva-4.378
 depress-4.219
 regress-4.113
ック-4.000
uries-3.967
Token authorities
Token ID4773
Feature activation+1.03e+00

Pos logit contributions
erva+0.116
 depress+0.111
 regress+0.107
ック+0.105
uries+0.103
Neg logit contributions
alid-0.102
 Agency-0.101
vier-0.100
edin-0.098
cot-0.098
Token care
Token ID1337
Feature activation-5.64e+00

Pos logit contributions
erva+0.826
 depress+0.796
 regress+0.766
ック+0.759
riots+0.743
Neg logit contributions
alid-0.649
vier-0.637
 Agency-0.636
edin-0.627
cot-0.622
Token about
Token ID546
Feature activation-4.55e-01

Pos logit contributions
alid+2.815
vier+2.780
 Agency+2.749
cot+2.686
edin+2.672
Neg logit contributions
erva-5.183
ック-4.906
 depress-4.905
 regress-4.810
uries-4.735
Token their
Token ID511
Feature activation+1.36e+00

Pos logit contributions
alid+0.292
 Agency+0.287
vier+0.286
edin+0.280
cot+0.280
Neg logit contributions
erva-0.361
 depress-0.348
 regress-0.335
ック-0.331
riots-0.324
Token international
Token ID3230
Feature activation+1.60e-01

Pos logit contributions
erva+1.029
 depress+1.009
 regress+0.967
ック+0.940
riots+0.925
Neg logit contributions
alid-0.921
vier-0.904
 Agency-0.893
edin-0.888
cot-0.880
Token image
Token ID2939
Feature activation-2.69e+00

Pos logit contributions
erva+0.130
 depress+0.126
 regress+0.121
ック+0.119
riots+0.117
Neg logit contributions
alid-0.100
 Agency-0.098
vier-0.098
cot-0.096
edin-0.096
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
alid+1.737
 Agency+1.727
vier+1.719
cot+1.681
edin+1.676
Neg logit contributions
erva-2.095
 depress-2.003
 regress-1.936
ック-1.915
riots-1.888
Token thus
Token ID4145
Feature activation+1.09e+00

Pos logit contributions
erva+0.770
 depress+0.745
 regress+0.720
ック+0.708
riots+0.696
Neg logit contributions
alid-0.582
vier-0.570
 Agency-0.568
edin-0.562
cot-0.558
Token shut
Token ID4423
Feature activation+2.40e+00

Pos logit contributions
erva+0.901
 depress+0.877
 regress+0.845
ック+0.830
riots+0.816
Neg logit contributions
alid-0.669
vier-0.653
 Agency-0.652
edin-0.644
cot-0.639
Token up
Token ID510
Feature activation+1.86e+00

Pos logit contributions
erva+1.937
 depress+1.907
 regress+1.828
ック+1.783
riots+1.754
Neg logit contributions
alid-1.499
vier-1.466
 Agency-1.451
edin-1.426
cot-1.419
Token to
Token ID284
Feature activation+8.59e-01

Pos logit contributions
erva+1.521
 depress+1.489
 regress+1.432
ック+1.399
riots+1.378
Neg logit contributions
alid-1.148
vier-1.123
 Agency-1.114
edin-1.103
cot-1.094
Token the
Token ID262
Feature activation+1.64e-01

Pos logit contributions
erva+0.681
 depress+0.660
 regress+0.637
ック+0.623
riots+0.613
Neg logit contributions
alid-0.551
vier-0.541
 Agency-0.538
edin-0.531
cot-0.528
Token home
Token ID1363
Feature activation-5.76e+00

Pos logit contributions
erva+0.130
 depress+0.125
 regress+0.121
ック+0.119
riots+0.117
Neg logit contributions
alid-0.106
 Agency-0.104
vier-0.103
edin-0.102
cot-0.101
Token supply
Token ID5127
Feature activation+3.45e-01

Pos logit contributions
 Agency+3.507
alid+3.495
vier+3.468
cot+3.454
edin+3.452
Neg logit contributions
erva-4.540
 depress-4.300
 regress-4.213
��-4.116
ック-4.100
Token;
Token ID26
Feature activation+8.64e-01

Pos logit contributions
erva+0.287
 depress+0.278
 regress+0.268
ック+0.264
riots+0.259
Neg logit contributions
alid-0.208
 Agency-0.204
vier-0.203
edin-0.200
cot-0.199
Token from
Token ID422
Feature activation+1.23e+00

Pos logit contributions
erva+0.685
 depress+0.665
 regress+0.642
ック+0.630
riots+0.617
Neg logit contributions
alid-0.553
vier-0.542
 Agency-0.541
edin-0.533
cot-0.530
Token the
Token ID262
Feature activation+6.16e-01

Pos logit contributions
erva+0.979
 depress+0.951
 regress+0.916
ック+0.899
riots+0.883
Neg logit contributions
alid-0.779
vier-0.765
 Agency-0.763
edin-0.751
cot-0.747
Token works
Token ID2499
Feature activation+5.46e-01

Pos logit contributions
erva+0.489
 depress+0.472
 regress+0.455
ック+0.449
riots+0.440
Neg logit contributions
alid-0.394
 Agency-0.389
vier-0.386
edin-0.380
cot-0.378
Token our
Token ID674
Feature activation-2.64e-01

Pos logit contributions
alid+0.373
 Agency+0.368
vier+0.365
edin+0.358
cot+0.357
Neg logit contributions
erva-0.493
 depress-0.476
 regress-0.457
ック-0.453
riots-0.447
Token vendor
Token ID18371
Feature activation+4.95e-02

Pos logit contributions
alid+0.162
 Agency+0.159
vier+0.158
cot+0.155
edin+0.155
Neg logit contributions
erva-0.218
 depress-0.212
 regress-0.203
ック-0.200
riots-0.197
Token has
Token ID468
Feature activation-6.19e-02

Pos logit contributions
erva+0.039
 depress+0.039
 regress+0.037
ック+0.036
riots+0.036
Neg logit contributions
alid-0.032
vier-0.031
 Agency-0.031
edin-0.030
cot-0.030
Token fine
Token ID3734
Feature activation-1.60e+00

Pos logit contributions
alid+0.035
vier+0.034
 Agency+0.034
edin+0.033
cot+0.033
Neg logit contributions
erva-0.054
 depress-0.053
 regress-0.051
ック-0.050
riots-0.050
Token-
Token ID12
Feature activation+7.17e-01

Pos logit contributions
alid+1.016
 Agency+0.991
vier+0.990
edin+0.977
cot+0.977
Neg logit contributions
erva-1.273
 depress-1.237
 regress-1.195
ック-1.165
riots-1.161
Tokentun
Token ID28286
Feature activation-5.15e+00

Pos logit contributions
erva+0.478
 depress+0.466
 regress+0.444
ック+0.431
riots+0.424
Neg logit contributions
alid-0.549
vier-0.540
 Agency-0.536
edin-0.532
cot-0.527
Tokened
Token ID276
Feature activation+1.76e-01

Pos logit contributions
alid+2.115
edin+2.115
vier+2.089
 Agency+1.987
cot+1.970
Neg logit contributions
erva-5.217
 depress-5.106
 regress-4.884
ック-4.871
riots-4.853
Token all
Token ID477
Feature activation+3.57e+00

Pos logit contributions
erva+0.134
 depress+0.132
 regress+0.126
ック+0.123
riots+0.121
Neg logit contributions
alid-0.118
vier-0.116
 Agency-0.115
edin-0.114
cot-0.113
Token components
Token ID6805
Feature activation+2.51e+00

Pos logit contributions
erva+2.803
 depress+2.771
 regress+2.676
ック+2.595
uries+2.534
Neg logit contributions
alid-2.281
vier-2.243
edin-2.216
 Agency-2.178
cot-2.153
Token so
Token ID523
Feature activation+4.71e-01

Pos logit contributions
erva+1.993
 depress+1.964
 regress+1.888
ック+1.832
uries+1.803
Neg logit contributions
alid-1.600
vier-1.566
 Agency-1.549
edin-1.538
cot-1.522
Token both
Token ID1111
Feature activation+1.37e+00

Pos logit contributions
erva+0.364
 depress+0.355
 regress+0.341
ック+0.333
riots+0.328
Neg logit contributions
alid-0.313
vier-0.305
 Agency-0.304
edin-0.300
cot-0.298