TOP ACTIVATIONS
MAX = 5.925

me later . Rated 5 out of 5 by
9 , 2010 Subject : R ipe for M
15 , 2010 Subject : Que er magnet ?
13 , 2010 Subject : boys beware This film
was about homosexuality Reviewer : k imp unk rock
in America . Everyone gave themselves the rhetorical authority to tweet
6 , 2010 Subject : R iff Tra x
t already , sign up to our mailing list
5 , 2009 Subject : Que ers are Here
warns : Question marks over the rights of EU
4 , 2010 Subject : User reviews are a
object pointers ). You may hear that GHC s
, I think Reviewer : B iz r Pix
S aki , who proclaimed himself to be the Sh red
Crit ically , these approaches strike right at the heart of
and night ! Rated 5 out of 5 by
iff Tra x Reviewer : V the Happy L
26 , 2009 Subject : THE IR ONY OF
sound system . Rated 5 out of 5 by
: " Why in the hell do journalists insist on coming

MIN ACTIVATIONS
MIN = -5.916

to him : " Th ou hast judged with the judgement
ALTH AM , Mass . -- Boston Celtics swing man Jae
L ANS ING , MI -- The Michigan House is looking
TA WA ( Reuters ) - Canada s top
T AM PA , FL Tampa Bay area residents are
SON VILLE , Fla . -- A Jacksonville sheriff 's officer
alling together in a cr um pled mess of mutual uncertainty
next time ! <|endoftext|> BR USS ELS ( Reuters ) -
Activision . B ung ie 's contract with Activision
geometry , by adding a vertex displacement function . This can
said Trump could learn several lessons from former President Harry Truman
UAL A L UMP UR -- With barely a month leading
V ANA ( Reuters ) - Latin American and Caribbean leaders
C OOK COUNTY CR IM INAL COUR TH OU SE
SAN FR ANC IS CO - Reddit and Google are taking
. <|endoftext|> ED MON TON Omar Kh adr has agreed
pal ek . <|endoftext|> WASHINGTON A Cy gn us cargo
IGN : Got cha . Just a slight side
USS ELS ( Reuters ) - The world s
BRE ATH ING AND AC OU ST IC RES ON ANCE

INTERVAL 2.964 - 5.925
CONTAINS 0.373%

unpopular as ever with the public . Only 24 percent of
[ 10 : 47 ] < met ros xual th ug
This DIY is going to be focusing on those
optimized Store . As you land on the Store from Home
the Sch m oes on Twitter ! Subscribe to

INTERVAL 0.004 - 2.964
CONTAINS 49.632%

addition , Hamas activities in Turkey in general will be limited
, the first modified Block III spacecraft , designated Ranger A
with already athlet ically fit people , then it offers you
2017 salary and bonuses : $ 1 million (
. d Now we re going to

INTERVAL -2.956 - 0.004
CONTAINS 49.611%

Ivy League universities . Photo : Luis Sin co / Los
low commodity prices have all hurt Argentina ; true enough .
, and four panel ists voting in favour of a redesign
of Honey comb - optim ized apps . Since Android devices
The O Reilly Factor

INTERVAL -5.916 - -2.956
CONTAINS 0.383%

protests ? WASHINGTON -- The United Nations envoy for
. get Struct uring Element ( cv 2 . M OR
you out in a single turn ! One way to avoid
next to a matte black muscle car , suggesting that the
met with D endi getting up and dancing about nearby .
Token me
Token ID502
Feature activation-5.65e-01

Pos logit contributions
eta+0.824
adh+0.783
�醒+0.698
chin+0.686
mast+0.680
Neg logit contributions
 Fisher-0.507
 APPLIC-0.468
 Drinking-0.468
 dams-0.457
 Tile-0.453
Token later
Token ID1568
Feature activation+2.21e+00

Pos logit contributions
 Fisher+0.349
 Drinking+0.323
 APPLIC+0.321
 dams+0.315
 Tile+0.312
Neg logit contributions
eta-0.528
adh-0.501
�醒-0.448
chin-0.438
arms-0.434
Token.
Token ID13
Feature activation-3.44e-01

Pos logit contributions
eta+1.865
adh+1.762
�醒+1.536
chin+1.507
arms+1.493
Neg logit contributions
 Fisher-1.546
 APPLIC-1.450
 Drinking-1.449
 dams-1.431
 Toll-1.412
Token
Token ID198
Feature activation+6.60e-01

Pos logit contributions
 Fisher+0.218
 APPLIC+0.203
 Drinking+0.202
 dams+0.200
 Tile+0.196
Neg logit contributions
eta-0.314
adh-0.298
�醒-0.264
chin-0.260
arms-0.257
Token
Token ID198
Feature activation+1.21e-01

Pos logit contributions
eta+0.590
adh+0.558
�醒+0.495
chin+0.484
arms+0.479
Neg logit contributions
 Fisher-0.436
 Drinking-0.405
 APPLIC-0.404
 dams-0.396
 Tile-0.393
TokenRated
Token ID15322
Feature activation+5.92e+00

Pos logit contributions
eta+0.101
adh+0.096
�醒+0.084
chin+0.082
arms+0.081
Neg logit contributions
 Fisher-0.087
 Drinking-0.081
 APPLIC-0.081
 dams-0.080
 Tile-0.079
Token 5
Token ID642
Feature activation+1.33e+00

Pos logit contributions
eta+5.524
adh+5.292
�醒+4.627
 bombshell+4.615
chin+4.599
Neg logit contributions
 Fisher-3.270
 dams-3.145
 Drinking-3.090
 APPLIC-3.060
stanbul-3.025
Token out
Token ID503
Feature activation+1.45e+00

Pos logit contributions
eta+1.111
adh+1.049
�醒+0.908
chin+0.894
arms+0.887
Neg logit contributions
 Fisher-0.944
 Drinking-0.887
 APPLIC-0.883
 dams-0.869
 Tile-0.865
Token of
Token ID286
Feature activation+8.51e-01

Pos logit contributions
eta+1.296
adh+1.224
chin+1.071
mast+1.059
arms+1.046
Neg logit contributions
 Fisher-0.939
 Drinking-0.881
 dams-0.874
 APPLIC-0.865
 Tile-0.864
Token 5
Token ID642
Feature activation-5.77e-01

Pos logit contributions
eta+0.879
adh+0.834
chin+0.751
mast+0.741
�醒+0.735
Neg logit contributions
 Fisher-0.426
 Drinking-0.400
 APPLIC-0.392
 dams-0.392
 Tile-0.386
Token by
Token ID416
Feature activation+3.80e-01

Pos logit contributions
 Fisher+0.399
 Drinking+0.371
 APPLIC+0.371
 dams+0.364
 Tile+0.361
Neg logit contributions
eta-0.499
adh-0.471
�醒-0.417
chin-0.406
arms-0.402
Token 9
Token ID860
Feature activation+7.81e-01

Pos logit contributions
eta+0.532
adh+0.501
chin+0.433
mast+0.428
arms+0.427
Neg logit contributions
 Fisher-0.409
 Drinking-0.383
 APPLIC-0.380
 dams-0.380
 Tile-0.374
Token,
Token ID11
Feature activation+9.82e-01

Pos logit contributions
eta+0.693
adh+0.651
chin+0.566
mast+0.563
arms+0.560
Neg logit contributions
 Fisher-0.519
 Drinking-0.488
 APPLIC-0.483
 dams-0.479
 Tile-0.474
Token 2010
Token ID3050
Feature activation+6.29e-01

Pos logit contributions
eta+0.718
adh+0.669
�醒+0.579
chin+0.557
arms+0.553
Neg logit contributions
 Fisher-0.812
 Drinking-0.767
 APPLIC-0.765
 dams-0.753
 Toll-0.748
Token
Token ID198
Feature activation+3.59e-01

Pos logit contributions
eta+0.544
adh+0.513
chin+0.443
arms+0.439
mast+0.439
Neg logit contributions
 Fisher-0.430
 Drinking-0.404
 APPLIC-0.400
 dams-0.397
 Tile-0.393
Token
Token ID198
Feature activation-1.02e-01

Pos logit contributions
eta+0.314
adh+0.296
�醒+0.256
chin+0.256
arms+0.254
Neg logit contributions
 Fisher-0.243
 Drinking-0.227
 APPLIC-0.226
 dams-0.223
 Tile-0.221
TokenSubject
Token ID19776
Feature activation+5.82e+00

Pos logit contributions
 Fisher+0.091
 APPLIC+0.086
 Drinking+0.086
 DUI+0.084
 Toll+0.084
Neg logit contributions
eta-0.065
adh-0.061
�醒-0.057
chin-0.049
 bombshell-0.049
Token:
Token ID25
Feature activation+2.57e+00

Pos logit contributions
eta+4.326
adh+3.996
mast+3.476
chin+3.418
arms+3.322
Neg logit contributions
 Fisher-4.472
 dams-4.294
 Drinking-4.289
-+-+-4.230
 Tile-4.230
Token R
Token ID371
Feature activation-2.86e-01

Pos logit contributions
eta+2.372
adh+2.261
chin+1.987
�醒+1.974
 bombshell+1.967
Neg logit contributions
 Fisher-1.541
 APPLIC-1.435
 dams-1.420
 Drinking-1.408
stanbul-1.398
Tokenipe
Token ID3757
Feature activation+9.96e-02

Pos logit contributions
 Fisher+0.205
 APPLIC+0.193
 Drinking+0.192
 dams+0.189
 Tile+0.187
Neg logit contributions
eta-0.240
adh-0.226
�醒-0.197
chin-0.194
arms-0.192
Token for
Token ID329
Feature activation+7.55e-02

Pos logit contributions
eta+0.091
adh+0.086
�醒+0.076
chin+0.075
arms+0.074
Neg logit contributions
 Fisher-0.064
 APPLIC-0.059
 Drinking-0.059
 dams-0.058
 Tile-0.058
Token M
Token ID337
Feature activation-1.66e-01

Pos logit contributions
eta+0.071
adh+0.067
�醒+0.060
chin+0.059
mast+0.059
Neg logit contributions
 Fisher-0.046
 APPLIC-0.043
 Drinking-0.042
 dams-0.042
 Tile-0.041
Token 15
Token ID1315
Feature activation-2.57e-01

Pos logit contributions
eta+0.038
adh+0.036
chin+0.031
�醒+0.031
mast+0.031
Neg logit contributions
 Fisher-0.030
 Drinking-0.029
 APPLIC-0.028
 dams-0.028
 Tile-0.028
Token,
Token ID11
Feature activation+4.23e-01

Pos logit contributions
 Fisher+0.167
 Drinking+0.156
 APPLIC+0.155
 dams+0.153
 Tile+0.151
Neg logit contributions
eta-0.232
adh-0.219
�醒-0.192
chin-0.191
mast-0.189
Token 2010
Token ID3050
Feature activation+3.81e-01

Pos logit contributions
eta+0.315
adh+0.294
�醒+0.255
chin+0.245
arms+0.244
Neg logit contributions
 Fisher-0.345
 Drinking-0.326
 APPLIC-0.324
 dams-0.319
 Toll-0.318
Token
Token ID198
Feature activation+3.14e-01

Pos logit contributions
eta+0.332
adh+0.313
chin+0.271
�醒+0.270
arms+0.269
Neg logit contributions
 Fisher-0.258
 Drinking-0.242
 APPLIC-0.240
 dams-0.238
 Tile-0.235
Token
Token ID198
Feature activation-1.65e-01

Pos logit contributions
eta+0.275
adh+0.259
�醒+0.225
chin+0.224
arms+0.222
Neg logit contributions
 Fisher-0.213
 Drinking-0.199
 APPLIC-0.198
 dams-0.195
 Tile-0.193
TokenSubject
Token ID19776
Feature activation+5.79e+00

Pos logit contributions
 Fisher+0.149
 APPLIC+0.140
 Drinking+0.140
 Toll+0.137
 DUI+0.137
Neg logit contributions
eta-0.105
adh-0.099
�醒-0.090
chin-0.079
 bombshell-0.078
Token:
Token ID25
Feature activation+2.71e+00

Pos logit contributions
eta+4.318
adh+3.990
mast+3.467
chin+3.407
arms+3.320
Neg logit contributions
 Fisher-4.437
 dams-4.261
 Drinking-4.253
 Tile-4.193
-+-+-4.192
Token Que
Token ID4670
Feature activation-1.31e+00

Pos logit contributions
eta+2.500
adh+2.386
chin+2.094
�醒+2.082
 bombshell+2.073
Neg logit contributions
 Fisher-1.627
 APPLIC-1.511
 dams-1.499
 Drinking-1.487
stanbul-1.477
Tokener
Token ID263
Feature activation-1.12e+00

Pos logit contributions
 Fisher+1.079
 Drinking+1.018
 APPLIC+1.015
 dams+0.998
 Tile+0.994
Neg logit contributions
eta-0.946
adh-0.886
�醒-0.762
chin-0.736
arms-0.731
Token magnet
Token ID19972
Feature activation+2.22e-01

Pos logit contributions
 Fisher+0.711
 APPLIC+0.659
 Drinking+0.658
 dams+0.646
 Tile+0.640
Neg logit contributions
eta-1.022
adh-0.969
�醒-0.860
chin-0.843
arms-0.835
Token?
Token ID30
Feature activation-4.42e-01

Pos logit contributions
eta+0.206
adh+0.196
�醒+0.173
chin+0.170
arms+0.169
Neg logit contributions
 Fisher-0.139
 APPLIC-0.129
 Drinking-0.128
 dams-0.126
 Tile-0.125
Token 13
Token ID1511
Feature activation+5.17e-01

Pos logit contributions
eta+0.297
adh+0.279
chin+0.242
mast+0.239
arms+0.238
Neg logit contributions
 Fisher-0.230
 Drinking-0.215
 APPLIC-0.213
 dams-0.213
 Tile-0.210
Token,
Token ID11
Feature activation+8.27e-01

Pos logit contributions
eta+0.465
adh+0.440
�醒+0.385
chin+0.382
mast+0.379
Neg logit contributions
 Fisher-0.337
 Drinking-0.314
 APPLIC-0.313
 dams-0.308
 Tile-0.305
Token 2010
Token ID3050
Feature activation+5.24e-01

Pos logit contributions
eta+0.563
adh+0.524
�醒+0.441
chin+0.431
mast+0.425
Neg logit contributions
 Fisher-0.719
 Drinking-0.681
 APPLIC-0.680
 dams-0.671
 Tile-0.667
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
eta+0.457
adh+0.432
chin+0.374
�醒+0.373
mast+0.370
Neg logit contributions
 Fisher-0.354
 Drinking-0.331
 APPLIC-0.329
 dams-0.326
 Tile-0.322
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
eta+1.212
adh+1.138
chin+0.984
arms+0.976
mast+0.974
Neg logit contributions
 Fisher-0.980
 Drinking-0.920
 APPLIC-0.911
 dams-0.907
 Tile-0.895
TokenSubject
Token ID19776
Feature activation+5.76e+00

Pos logit contributions
eta+0.658
adh+0.599
�醒+0.528
chin+0.490
mast+0.480
Neg logit contributions
 Fisher-0.935
 Drinking-0.888
 APPLIC-0.886
 dams-0.868
 DUI-0.861
Token:
Token ID25
Feature activation+2.68e+00

Pos logit contributions
eta+4.280
adh+3.998
mast+3.436
chin+3.404
robe+3.298
Neg logit contributions
 Fisher-4.379
 dams-4.200
 Drinking-4.167
-+-+-4.157
 Tile-4.095
Token boys
Token ID6510
Feature activation-5.79e-01

Pos logit contributions
eta+2.469
adh+2.359
chin+2.073
�醒+2.054
 bombshell+2.049
Neg logit contributions
 Fisher-1.606
 APPLIC-1.494
 dams-1.483
 Drinking-1.470
stanbul-1.461
Token beware
Token ID40600
Feature activation-2.46e+00

Pos logit contributions
 Fisher+0.368
 Drinking+0.341
 APPLIC+0.341
 dams+0.334
 Tile+0.331
Neg logit contributions
eta-0.531
adh-0.503
�醒-0.448
chin-0.438
arms-0.434
Token This
Token ID770
Feature activation-5.65e-01

Pos logit contributions
 Fisher+1.583
 Drinking+1.469
 APPLIC+1.461
 dams+1.435
 Tile+1.422
Neg logit contributions
eta-2.245
adh-2.126
�醒-1.896
chin-1.849
arms-1.834
Token film
Token ID2646
Feature activation+1.48e+00

Pos logit contributions
 Fisher+0.326
 Drinking+0.299
 APPLIC+0.299
 dams+0.292
 Tile+0.289
Neg logit contributions
eta-0.552
adh-0.525
�醒-0.470
chin-0.462
arms-0.458
Token was
Token ID373
Feature activation-7.50e-01

Pos logit contributions
 Fisher+1.016
 Drinking+0.933
 APPLIC+0.930
 dams+0.912
 Tile+0.903
Neg logit contributions
eta-1.601
adh-1.523
�醒-1.368
chin-1.325
arms-1.322
Token about
Token ID546
Feature activation-2.74e-02

Pos logit contributions
 Fisher+0.479
 APPLIC+0.446
 Drinking+0.445
 dams+0.437
 Tile+0.433
Neg logit contributions
eta-0.683
adh-0.648
�醒-0.571
chin-0.563
mast-0.558
Token homosexuality
Token ID19822
Feature activation+7.27e-01

Pos logit contributions
 Fisher+0.017
 APPLIC+0.016
 Drinking+0.016
 dams+0.016
 Tile+0.016
Neg logit contributions
eta-0.025
adh-0.024
�醒-0.021
chin-0.021
mast-0.021
Token
Token ID198
Feature activation+1.83e+00

Pos logit contributions
eta+0.649
adh+0.614
�醒+0.542
chin+0.532
mast+0.530
Neg logit contributions
 Fisher-0.478
 APPLIC-0.446
 Drinking-0.445
 dams-0.437
 Tile-0.434
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
eta+1.610
adh+1.521
�醒+1.334
chin+1.316
arms+1.306
Neg logit contributions
 Fisher-1.233
 APPLIC-1.148
 Drinking-1.147
 dams-1.131
 Tile-1.119
TokenReviewer
Token ID35407
Feature activation+5.64e+00

Pos logit contributions
eta+0.899
adh+0.848
�醒+0.743
chin+0.729
arms+0.722
Neg logit contributions
 Fisher-0.752
 APPLIC-0.702
 Drinking-0.702
 dams-0.690
 Tile-0.684
Token:
Token ID25
Feature activation+1.63e+00

Pos logit contributions
eta+4.298
adh+3.979
mast+3.376
chin+3.351
arms+3.337
Neg logit contributions
 Fisher-4.179
 Drinking-3.972
 APPLIC-3.949
 dams-3.910
stanbul-3.898
Token k
Token ID479
Feature activation-3.63e-01

Pos logit contributions
eta+1.489
adh+1.415
�醒+1.244
chin+1.238
mast+1.224
Neg logit contributions
 Fisher-1.021
 APPLIC-0.953
 Drinking-0.948
 dams-0.934
 Tile-0.919
Tokenimp
Token ID11011
Feature activation-1.63e+00

Pos logit contributions
 Fisher+0.282
 Drinking+0.265
 APPLIC+0.264
 dams+0.260
 Tile+0.258
Neg logit contributions
eta-0.282
adh-0.264
�醒-0.230
chin-0.223
arms-0.221
Tokenunk
Token ID2954
Feature activation-7.30e-01

Pos logit contributions
 Fisher+1.215
 Drinking+1.136
 APPLIC+1.135
 dams+1.116
 Tile+1.107
Neg logit contributions
eta-1.299
adh-1.223
�醒-1.081
chin-1.039
arms-1.029
Tokenrock
Token ID10823
Feature activation-8.28e-01

Pos logit contributions
 Fisher+0.488
 Drinking+0.453
 APPLIC+0.452
 dams+0.444
 Tile+0.440
Neg logit contributions
eta-0.645
adh-0.610
�醒-0.542
chin-0.528
arms-0.523
Token in
Token ID287
Feature activation-1.00e-01

Pos logit contributions
 Fisher+0.657
 Drinking+0.608
 APPLIC+0.608
 dams+0.595
 Tile+0.590
Neg logit contributions
eta-0.972
adh-0.921
�醒-0.819
chin-0.803
arms-0.796
Token America
Token ID2253
Feature activation+1.71e+00

Pos logit contributions
 Fisher+0.057
 Drinking+0.052
 APPLIC+0.052
 dams+0.051
 Tile+0.050
Neg logit contributions
eta-0.099
adh-0.094
�醒-0.085
chin-0.083
arms-0.082
Token.
Token ID13
Feature activation+9.95e-01

Pos logit contributions
eta+1.457
adh+1.372
�醒+1.199
chin+1.184
mast+1.173
Neg logit contributions
 Fisher-1.188
 APPLIC-1.112
 Drinking-1.107
 dams-1.089
 Tile-1.082
Token Everyone
Token ID11075
Feature activation+5.66e-01

Pos logit contributions
eta+0.893
adh+0.841
chin+0.735
�醒+0.729
arms+0.727
Neg logit contributions
 Fisher-0.627
 dams-0.591
 APPLIC-0.587
 Drinking-0.578
 Tile-0.569
Token gave
Token ID2921
Feature activation+4.96e-01

Pos logit contributions
eta+0.534
adh+0.503
chin+0.444
arms+0.439
�醒+0.439
Neg logit contributions
 Fisher-0.342
 APPLIC-0.319
 Drinking-0.317
 dams-0.308
 Tile-0.307
Token themselves
Token ID2405
Feature activation+5.62e+00

Pos logit contributions
eta+0.456
adh+0.432
�醒+0.382
chin+0.376
arms+0.372
Neg logit contributions
 Fisher-0.314
 APPLIC-0.292
 Drinking-0.291
 dams-0.285
 Tile-0.283
Token the
Token ID262
Feature activation+6.36e-01

Pos logit contributions
eta+4.953
adh+4.661
�醒+4.083
chin+4.051
 bombshell+4.050
Neg logit contributions
 Fisher-3.484
 APPLIC-3.360
 Drinking-3.329
 Tile-3.227
 Toll-3.196
Token rhetorical
Token ID38505
Feature activation+2.03e+00

Pos logit contributions
eta+0.601
adh+0.568
�醒+0.506
chin+0.498
arms+0.495
Neg logit contributions
 Fisher-0.383
 APPLIC-0.357
 Drinking-0.355
 dams-0.346
 Tile-0.344
Token authority
Token ID4934
Feature activation+2.24e+00

Pos logit contributions
eta+1.862
adh+1.759
�醒+1.543
arms+1.539
 bombshell+1.532
Neg logit contributions
 Fisher-1.262
 Drinking-1.183
 APPLIC-1.181
 Tile-1.139
 dams-1.132
Token to
Token ID284
Feature activation+1.91e+00

Pos logit contributions
eta+2.112
adh+2.004
�醒+1.771
chin+1.750
arms+1.747
Neg logit contributions
 Fisher-1.337
 APPLIC-1.245
 Drinking-1.241
 Tile-1.204
 dams-1.194
Token tweet
Token ID6126
Feature activation+1.42e+00

Pos logit contributions
eta+1.729
adh+1.631
 bombshell+1.431
chin+1.425
mast+1.421
Neg logit contributions
 Fisher-1.201
 Drinking-1.127
 APPLIC-1.127
 Tile-1.091
-+-+-1.088
Token 6
Token ID718
Feature activation+7.30e-01

Pos logit contributions
eta+0.501
adh+0.470
chin+0.406
mast+0.402
arms+0.400
Neg logit contributions
 Fisher-0.410
 Drinking-0.385
 APPLIC-0.382
 dams-0.382
 Tile-0.377
Token,
Token ID11
Feature activation+7.81e-01

Pos logit contributions
eta+0.652
adh+0.613
chin+0.533
mast+0.530
�醒+0.528
Neg logit contributions
 Fisher-0.482
 Drinking-0.452
 APPLIC-0.448
 dams-0.443
 Tile-0.439
Token 2010
Token ID3050
Feature activation-1.28e-01

Pos logit contributions
eta+0.578
adh+0.539
�醒+0.466
chin+0.449
arms+0.446
Neg logit contributions
 Fisher-0.640
 Drinking-0.603
 APPLIC-0.603
 dams-0.593
 Tile-0.589
Token
Token ID198
Feature activation+1.78e-01

Pos logit contributions
 Fisher+0.086
 Drinking+0.080
 APPLIC+0.080
 dams+0.079
 Tile+0.078
Neg logit contributions
eta-0.113
adh-0.107
�醒-0.093
chin-0.093
arms-0.092
Token
Token ID198
Feature activation-1.77e-01

Pos logit contributions
eta+0.157
adh+0.148
�醒+0.129
chin+0.128
arms+0.127
Neg logit contributions
 Fisher-0.120
 Drinking-0.112
 APPLIC-0.111
 dams-0.110
 Tile-0.109
TokenSubject
Token ID19776
Feature activation+5.53e+00

Pos logit contributions
 Fisher+0.156
 APPLIC+0.148
 Drinking+0.148
 DUI+0.144
 Toll+0.143
Neg logit contributions
eta-0.116
adh-0.108
�醒-0.102
chin-0.088
 bombshell-0.088
Token:
Token ID25
Feature activation+2.42e+00

Pos logit contributions
eta+4.143
adh+3.838
mast+3.338
chin+3.281
arms+3.190
Neg logit contributions
 Fisher-4.230
 dams-4.059
 Drinking-4.046
-+-+-4.000
 Tile-3.984
Token R
Token ID371
Feature activation-4.98e-01

Pos logit contributions
eta+2.237
adh+2.135
chin+1.874
�醒+1.861
 bombshell+1.858
Neg logit contributions
 Fisher-1.441
 APPLIC-1.342
 dams-1.325
 Drinking-1.316
stanbul-1.306
Tokeniff
Token ID733
Feature activation+2.33e-01

Pos logit contributions
 Fisher+0.376
 Drinking+0.353
 APPLIC+0.353
 dams+0.347
 Tile+0.344
Neg logit contributions
eta-0.397
adh-0.373
�醒-0.324
chin-0.317
arms-0.314
TokenTra
Token ID15721
Feature activation-1.94e-01

Pos logit contributions
eta+0.201
adh+0.189
�醒+0.167
chin+0.163
arms+0.162
Neg logit contributions
 Fisher-0.160
 APPLIC-0.150
 Drinking-0.150
 dams-0.147
 Tile-0.145
Tokenx
Token ID87
Feature activation-1.33e+00

Pos logit contributions
 Fisher+0.168
 Drinking+0.159
 APPLIC+0.158
 dams+0.156
 Tile+0.155
Neg logit contributions
eta-0.134
adh-0.125
�醒-0.106
chin-0.103
arms-0.102
Token
Token ID447
Feature activation-6.57e-01

Pos logit contributions
eta+0.966
adh+0.909
chin+0.800
mast+0.792
�醒+0.792
Neg logit contributions
 Fisher-0.626
 Drinking-0.582
 APPLIC-0.574
 dams-0.567
 Tile-0.563
Token
Token ID247
Feature activation-3.14e-01

Pos logit contributions
 Fisher+0.469
 Drinking+0.439
 APPLIC+0.438
 dams+0.431
 Tile+0.427
Neg logit contributions
eta-0.551
adh-0.520
�醒-0.456
chin-0.446
arms-0.441
Tokent
Token ID83
Feature activation-1.84e-01

Pos logit contributions
 Fisher+0.217
 Drinking+0.203
 APPLIC+0.203
 dams+0.199
 Tile+0.197
Neg logit contributions
eta-0.271
adh-0.255
�醒-0.224
chin-0.220
arms-0.218
Token already
Token ID1541
Feature activation-9.11e-01

Pos logit contributions
 Fisher+0.118
 APPLIC+0.110
 Drinking+0.109
 dams+0.107
 Tile+0.106
Neg logit contributions
eta-0.169
adh-0.159
�醒-0.140
chin-0.139
mast-0.137
Token,
Token ID11
Feature activation+4.73e-01

Pos logit contributions
 Fisher+0.590
 APPLIC+0.548
 Drinking+0.547
 dams+0.537
 Tile+0.532
Neg logit contributions
eta-0.824
adh-0.780
�醒-0.690
chin-0.678
arms-0.671
Token sign
Token ID1051
Feature activation+5.50e+00

Pos logit contributions
eta+0.447
adh+0.424
chin+0.373
�醒+0.372
arms+0.368
Neg logit contributions
 Fisher-0.283
 APPLIC-0.264
 Drinking-0.262
 dams-0.257
 Tile-0.254
Token up
Token ID510
Feature activation+2.92e+00

Pos logit contributions
eta+4.245
adh+3.950
chin+3.409
�醒+3.352
arms+3.298
Neg logit contributions
 Fisher-4.155
 APPLIC-3.949
 Drinking-3.913
 Tile-3.866
 dams-3.831
Token to
Token ID284
Feature activation+3.03e+00

Pos logit contributions
eta+2.720
adh+2.552
�醒+2.255
chin+2.230
arms+2.212
Neg logit contributions
 Fisher-1.799
 APPLIC-1.667
 Drinking-1.658
 dams-1.634
 DUI-1.607
Token our
Token ID674
Feature activation+7.23e-01

Pos logit contributions
eta+2.790
adh+2.625
�醒+2.314
chin+2.297
mast+2.269
Neg logit contributions
 Fisher-1.895
 APPLIC-1.774
 Drinking-1.738
 dams-1.726
 DUI-1.702
Token mailing
Token ID21898
Feature activation+1.67e-01

Pos logit contributions
eta+0.674
adh+0.638
�醒+0.568
chin+0.558
arms+0.552
Neg logit contributions
 Fisher-0.449
 APPLIC-0.415
 Drinking-0.415
 dams-0.407
 Tile-0.403
Token list
Token ID1351
Feature activation+2.58e-01

Pos logit contributions
eta+0.166
adh+0.158
�醒+0.142
chin+0.139
arms+0.138
Neg logit contributions
 Fisher-0.094
 Drinking-0.086
 APPLIC-0.086
 dams-0.084
 Tile-0.083
Token 5
Token ID642
Feature activation-6.69e-02

Pos logit contributions
eta+0.425
adh+0.400
chin+0.346
mast+0.343
arms+0.341
Neg logit contributions
 Fisher-0.336
 Drinking-0.315
 APPLIC-0.314
 dams-0.312
 Tile-0.308
Token,
Token ID11
Feature activation+5.92e-01

Pos logit contributions
 Fisher+0.043
 Drinking+0.040
 APPLIC+0.040
 dams+0.040
 Tile+0.039
Neg logit contributions
eta-0.061
adh-0.057
�醒-0.050
chin-0.050
mast-0.049
Token 2009
Token ID3717
Feature activation+2.22e-01

Pos logit contributions
eta+0.436
adh+0.407
�醒+0.352
chin+0.339
arms+0.336
Neg logit contributions
 Fisher-0.488
 Drinking-0.462
 APPLIC-0.459
 Toll-0.451
 dams-0.451
Token
Token ID198
Feature activation+1.40e-01

Pos logit contributions
eta+0.194
adh+0.183
�醒+0.159
chin+0.159
arms+0.157
Neg logit contributions
 Fisher-0.149
 Drinking-0.140
 APPLIC-0.139
 dams-0.137
 Tile-0.136
Token
Token ID198
Feature activation-3.43e-01

Pos logit contributions
eta+0.123
adh+0.116
�醒+0.101
chin+0.100
arms+0.100
Neg logit contributions
 Fisher-0.095
 Drinking-0.088
 APPLIC-0.088
 dams-0.087
 Tile-0.086
TokenSubject
Token ID19776
Feature activation+5.43e+00

Pos logit contributions
 Fisher+0.317
 APPLIC+0.300
 Drinking+0.300
 Toll+0.293
 dams+0.293
Neg logit contributions
eta-0.214
adh-0.199
�醒-0.179
chin-0.160
 bombshell-0.156
Token:
Token ID25
Feature activation+2.51e+00

Pos logit contributions
eta+4.117
adh+3.798
mast+3.307
chin+3.259
arms+3.175
Neg logit contributions
 Fisher-4.141
 Drinking-3.987
 dams-3.949
 Tile-3.912
-+-+-3.896
Token Que
Token ID4670
Feature activation-1.96e+00

Pos logit contributions
eta+2.316
adh+2.210
chin+1.934
�醒+1.932
 bombshell+1.918
Neg logit contributions
 Fisher-1.501
 APPLIC-1.390
 dams-1.384
 Drinking-1.372
stanbul-1.360
Tokeners
Token ID364
Feature activation-1.44e+00

Pos logit contributions
 Fisher+1.594
 Drinking+1.497
 APPLIC+1.485
 Tile+1.462
 dams+1.450
Neg logit contributions
eta-1.425
adh-1.356
�醒-1.167
arms-1.127
mast-1.126
Token are
Token ID389
Feature activation+3.55e-01

Pos logit contributions
 Fisher+0.924
 Drinking+0.858
 APPLIC+0.854
 dams+0.838
 Tile+0.831
Neg logit contributions
eta-1.311
adh-1.242
�醒-1.107
chin-1.080
arms-1.071
Token Here
Token ID3423
Feature activation-3.31e-01

Pos logit contributions
eta+0.330
adh+0.312
�醒+0.276
chin+0.272
mast+0.270
Neg logit contributions
 Fisher-0.219
 APPLIC-0.204
 Drinking-0.203
 dams-0.199
 Tile-0.198
Token warns
Token ID22145
Feature activation-2.99e-01

Pos logit contributions
eta+0.162
adh+0.155
�醒+0.138
chin+0.137
arms+0.135
Neg logit contributions
 Fisher-0.088
 Drinking-0.082
 APPLIC-0.081
 Tile-0.079
 dams-0.078
Token:
Token ID25
Feature activation-1.34e+00

Pos logit contributions
 Fisher+0.194
 Drinking+0.180
 APPLIC+0.179
 dams+0.176
 Tile+0.174
Neg logit contributions
eta-0.272
adh-0.257
�醒-0.228
chin-0.224
arms-0.221
Token
Token ID198
Feature activation-1.61e+00

Pos logit contributions
 Fisher+0.808
 Drinking+0.743
 APPLIC+0.736
 dams+0.727
 Tile+0.712
Neg logit contributions
eta-1.283
adh-1.212
�醒-1.084
chin-1.066
mast-1.059
Token
Token ID198
Feature activation-1.86e+00

Pos logit contributions
 Fisher+1.033
 APPLIC+0.957
 Drinking+0.955
 dams+0.933
 Tile+0.926
Neg logit contributions
eta-1.462
adh-1.389
�醒-1.253
chin-1.206
arms-1.193
TokenQuestion
Token ID24361
Feature activation+1.88e+00

Pos logit contributions
 Fisher+1.105
 APPLIC+1.016
 Drinking+1.015
 dams+0.989
 Tile+0.979
Neg logit contributions
eta-1.774
adh-1.681
�醒-1.520
chin-1.477
mast-1.461
Token marks
Token ID8849
Feature activation+5.40e+00

Pos logit contributions
eta+1.619
adh+1.528
�醒+1.350
chin+1.312
mast+1.309
Neg logit contributions
 Fisher-1.291
 Drinking-1.205
 APPLIC-1.203
 Tile-1.179
 dams-1.179
Token over
Token ID625
Feature activation+3.87e-01

Pos logit contributions
eta+4.356
adh+4.126
chin+3.528
 bombshell+3.441
mast+3.431
Neg logit contributions
 Fisher-3.790
 Drinking-3.561
 APPLIC-3.532
 Tile-3.523
stanbul-3.491
Token the
Token ID262
Feature activation+6.98e-01

Pos logit contributions
eta+0.370
adh+0.352
�醒+0.313
chin+0.307
mast+0.304
Neg logit contributions
 Fisher-0.231
 Drinking-0.214
 APPLIC-0.214
 dams-0.208
 Tile-0.207
Token rights
Token ID2489
Feature activation+6.71e-01

Pos logit contributions
eta+0.657
adh+0.625
�醒+0.555
chin+0.545
arms+0.539
Neg logit contributions
 Fisher-0.425
 APPLIC-0.393
 Drinking-0.393
 dams-0.384
 Tile-0.381
Token of
Token ID286
Feature activation-1.07e+00

Pos logit contributions
eta+0.626
adh+0.595
�醒+0.529
chin+0.518
arms+0.513
Neg logit contributions
 Fisher-0.415
 Drinking-0.384
 APPLIC-0.383
 dams-0.375
 Tile-0.373
Token EU
Token ID4576
Feature activation+1.26e+00

Pos logit contributions
 Fisher+0.566
 Drinking+0.515
 APPLIC+0.514
 dams+0.502
 Tile+0.496
Neg logit contributions
eta-1.095
adh-1.043
�醒-0.943
chin-0.925
arms-0.917
Token 4
Token ID604
Feature activation+6.00e-01

Pos logit contributions
 Fisher+0.135
 Drinking+0.126
 APPLIC+0.125
 dams+0.125
 Tile+0.123
Neg logit contributions
eta-0.195
adh-0.184
chin-0.161
�醒-0.159
mast-0.159
Token,
Token ID11
Feature activation+7.09e-01

Pos logit contributions
eta+0.536
adh+0.505
chin+0.439
�醒+0.437
mast+0.436
Neg logit contributions
 Fisher-0.395
 Drinking-0.370
 APPLIC-0.367
 dams-0.363
 Tile-0.359
Token 2010
Token ID3050
Feature activation+3.05e-01

Pos logit contributions
eta+0.498
adh+0.464
�醒+0.390
chin+0.385
mast+0.381
Neg logit contributions
 Fisher-0.601
 Drinking-0.567
 APPLIC-0.567
 dams-0.560
 Tile-0.555
Token
Token ID198
Feature activation+1.16e-01

Pos logit contributions
eta+0.267
adh+0.251
chin+0.218
�醒+0.217
arms+0.216
Neg logit contributions
 Fisher-0.206
 Drinking-0.193
 APPLIC-0.192
 dams-0.190
 Tile-0.188
Token
Token ID198
Feature activation-3.39e-01

Pos logit contributions
eta+0.102
adh+0.096
�醒+0.083
chin+0.083
arms+0.082
Neg logit contributions
 Fisher-0.079
 Drinking-0.073
 APPLIC-0.073
 dams-0.072
 Tile-0.071
TokenSubject
Token ID19776
Feature activation+5.38e+00

Pos logit contributions
 Fisher+0.314
 APPLIC+0.298
 Drinking+0.297
 dams+0.291
 Toll+0.291
Neg logit contributions
eta-0.209
adh-0.195
�醒-0.175
chin-0.156
 bombshell-0.153
Token:
Token ID25
Feature activation+2.56e+00

Pos logit contributions
eta+4.061
adh+3.751
mast+3.267
chin+3.218
arms+3.133
Neg logit contributions
 Fisher-4.112
 Drinking-3.951
 dams-3.930
 Tile-3.882
-+-+-3.876
Token User
Token ID11787
Feature activation-3.07e-01

Pos logit contributions
eta+2.366
adh+2.258
chin+1.980
�醒+1.975
 bombshell+1.963
Neg logit contributions
 Fisher-1.528
 APPLIC-1.417
 dams-1.406
 Drinking-1.395
stanbul-1.380
Token reviews
Token ID8088
Feature activation+2.03e+00

Pos logit contributions
 Fisher+0.192
 Drinking+0.178
 APPLIC+0.178
 dams+0.174
 Tile+0.173
Neg logit contributions
eta-0.284
adh-0.269
�醒-0.239
chin-0.235
arms-0.233
Token are
Token ID389
Feature activation+1.08e-01

Pos logit contributions
eta+1.857
adh+1.764
chin+1.533
�醒+1.523
 bombshell+1.522
Neg logit contributions
 Fisher-1.248
 APPLIC-1.172
 dams-1.151
 Drinking-1.150
stanbul-1.139
Token a
Token ID257
Feature activation-3.59e-01

Pos logit contributions
eta+0.100
adh+0.095
�醒+0.083
chin+0.083
arms+0.082
Neg logit contributions
 Fisher-0.068
 Drinking-0.063
 APPLIC-0.063
 dams-0.061
 Tile-0.061
Token object
Token ID2134
Feature activation+4.08e-01

Pos logit contributions
 Fisher+0.965
 Drinking+0.889
 APPLIC+0.889
 dams+0.872
 Tile+0.867
Neg logit contributions
eta-1.437
adh-1.365
�醒-1.218
chin-1.189
arms-1.179
Token pointers
Token ID32007
Feature activation+1.31e+00

Pos logit contributions
eta+0.298
adh+0.278
�醒+0.238
chin+0.232
arms+0.229
Neg logit contributions
 Fisher-0.336
 Drinking-0.317
 APPLIC-0.317
 dams-0.312
 Tile-0.310
Token).
Token ID737
Feature activation-5.34e-01

Pos logit contributions
eta+1.288
adh+1.219
�醒+1.084
chin+1.077
mast+1.064
Neg logit contributions
 Fisher-0.748
 Drinking-0.690
 APPLIC-0.686
 dams-0.670
 Tile-0.662
Token You
Token ID921
Feature activation+5.80e-01

Pos logit contributions
 Fisher+0.339
 Drinking+0.314
 APPLIC+0.314
 dams+0.313
 DUI+0.305
Neg logit contributions
eta-0.487
adh-0.459
�醒-0.404
chin-0.401
mast-0.395
Token may
Token ID743
Feature activation+7.28e-01

Pos logit contributions
eta+0.500
adh+0.473
chin+0.409
�醒+0.406
mast+0.402
Neg logit contributions
 Fisher-0.399
 Drinking-0.375
 APPLIC-0.371
 dams-0.364
 Tile-0.362
Token hear
Token ID3285
Feature activation+5.38e+00

Pos logit contributions
eta+0.661
adh+0.627
chin+0.545
mast+0.541
�醒+0.538
Neg logit contributions
 Fisher-0.467
 Drinking-0.437
 APPLIC-0.431
 dams-0.423
 Tile-0.418
Token that
Token ID326
Feature activation+8.76e-01

Pos logit contributions
eta+4.588
adh+4.426
 bombshell+3.856
arms+3.799
mast+3.787
Neg logit contributions
 Fisher-3.378
 Drinking-3.290
 APPLIC-3.218
 dams-3.129
 DUI-3.111
Token GHC
Token ID46615
Feature activation-1.71e-02

Pos logit contributions
eta+0.817
adh+0.774
chin+0.683
�醒+0.681
arms+0.672
Neg logit contributions
 Fisher-0.534
 Drinking-0.498
 APPLIC-0.492
 dams-0.482
 DUI-0.479
Token
Token ID447
Feature activation+7.82e-01

Pos logit contributions
 Fisher+0.010
 Drinking+0.009
 APPLIC+0.009
 dams+0.009
 Tile+0.009
Neg logit contributions
eta-0.017
adh-0.016
�醒-0.014
chin-0.014
mast-0.014
Token
Token ID247
Feature activation+2.95e-01

Pos logit contributions
eta+0.647
adh+0.608
�醒+0.532
chin+0.521
arms+0.517
Neg logit contributions
 Fisher-0.569
 Drinking-0.531
 APPLIC-0.531
 dams-0.519
 Tile-0.518
Tokens
Token ID82
Feature activation+1.32e+00

Pos logit contributions
eta+0.262
adh+0.247
�醒+0.215
chin+0.214
mast+0.212
Neg logit contributions
 Fisher-0.198
 Drinking-0.186
 APPLIC-0.184
 dams-0.181
 Tile-0.179
Token,
Token ID11
Feature activation+3.45e-01

Pos logit contributions
eta+1.553
adh+1.464
chin+1.259
mast+1.252
arms+1.249
Neg logit contributions
 Fisher-1.209
 Drinking-1.132
 APPLIC-1.123
 dams-1.119
 Tile-1.110
Token I
Token ID314
Feature activation-3.43e-01

Pos logit contributions
eta+0.325
adh+0.309
�醒+0.276
chin+0.270
arms+0.268
Neg logit contributions
 Fisher-0.210
 Drinking-0.194
 APPLIC-0.194
 dams-0.190
 Tile-0.188
Token think
Token ID892
Feature activation-7.11e-01

Pos logit contributions
 Fisher+0.172
 Drinking+0.157
 APPLIC+0.156
 dams+0.151
 Tile+0.151
Neg logit contributions
eta-0.362
adh-0.344
chin-0.309
mast-0.304
�醒-0.303
Token
Token ID198
Feature activation+4.46e-02

Pos logit contributions
 Fisher+0.464
 Drinking+0.431
 APPLIC+0.429
 dams+0.420
 Tile+0.416
Neg logit contributions
eta-0.643
adh-0.608
�醒-0.538
chin-0.528
arms-0.524
Token
Token ID198
Feature activation-7.79e-01

Pos logit contributions
eta+0.039
adh+0.037
�醒+0.032
chin+0.032
arms+0.032
Neg logit contributions
 Fisher-0.030
 Drinking-0.028
 APPLIC-0.028
 dams-0.027
 Tile-0.027
TokenReviewer
Token ID35407
Feature activation+5.35e+00

Pos logit contributions
 Fisher+0.604
 Drinking+0.566
 APPLIC+0.565
 dams+0.555
 Tile+0.550
Neg logit contributions
eta-0.602
adh-0.565
�醒-0.495
chin-0.475
mast-0.471
Token:
Token ID25
Feature activation+1.21e+00

Pos logit contributions
eta+4.151
adh+3.814
mast+3.330
arms+3.243
chin+3.227
Neg logit contributions
 Fisher-4.009
 Drinking-3.849
 dams-3.817
 APPLIC-3.785
 Tile-3.771
Token B
Token ID347
Feature activation-1.94e-01

Pos logit contributions
eta+1.110
adh+1.053
�醒+0.928
chin+0.921
mast+0.910
Neg logit contributions
 Fisher-0.758
 APPLIC-0.707
 Drinking-0.704
 dams-0.693
 Tile-0.682
Tokeniz
Token ID528
Feature activation+2.48e-01

Pos logit contributions
 Fisher+0.142
 Drinking+0.133
 APPLIC+0.133
 dams+0.131
 Tile+0.130
Neg logit contributions
eta-0.160
adh-0.150
�醒-0.131
chin-0.128
arms-0.127
Tokenr
Token ID81
Feature activation+1.23e+00

Pos logit contributions
eta+0.205
adh+0.193
�醒+0.168
chin+0.165
mast+0.163
Neg logit contributions
 Fisher-0.181
 Drinking-0.169
 APPLIC-0.169
 dams-0.167
 Tile-0.165
Token Pix
Token ID21642
Feature activation+1.01e+00

Pos logit contributions
eta+1.044
adh+0.986
�醒+0.860
chin+0.847
mast+0.839
Neg logit contributions
 Fisher-0.866
 APPLIC-0.813
 Drinking-0.810
 dams-0.799
 Tile-0.791
Token S
Token ID311
Feature activation+1.00e+00

Pos logit contributions
eta+0.617
adh+0.582
�醒+0.514
chin+0.510
mast+0.504
Neg logit contributions
 Fisher-0.412
 APPLIC-0.386
 Drinking-0.383
 dams-0.376
 Tile-0.370
Tokenaki
Token ID8182
Feature activation+1.52e+00

Pos logit contributions
eta+0.738
adh+0.688
�醒+0.585
chin+0.575
arms+0.568
Neg logit contributions
 Fisher-0.824
 Drinking-0.777
 APPLIC-0.777
 dams-0.765
 Tile-0.759
Token,
Token ID11
Feature activation+8.16e-01

Pos logit contributions
eta+1.374
adh+1.294
�醒+1.135
chin+1.128
arms+1.119
Neg logit contributions
 Fisher-0.979
 APPLIC-0.917
 Drinking-0.913
 dams-0.897
 Tile-0.884
Token who
Token ID508
Feature activation+3.81e-01

Pos logit contributions
eta+0.730
adh+0.690
�醒+0.607
chin+0.600
arms+0.595
Neg logit contributions
 Fisher-0.532
 APPLIC-0.499
 Drinking-0.497
 dams-0.487
 Tile-0.482
Token proclaimed
Token ID25546
Feature activation-1.17e+00

Pos logit contributions
eta+0.362
adh+0.341
chin+0.300
�醒+0.300
arms+0.298
Neg logit contributions
 Fisher-0.227
 APPLIC-0.215
 Drinking-0.213
 dams-0.207
 Tile-0.204
Token himself
Token ID2241
Feature activation+5.33e+00

Pos logit contributions
 Fisher+0.749
 Drinking+0.688
 APPLIC+0.684
 dams+0.678
 Tile+0.670
Neg logit contributions
eta-1.076
adh-1.021
�醒-0.922
chin-0.889
arms-0.879
Token to
Token ID284
Feature activation+1.14e+00

Pos logit contributions
eta+4.855
adh+4.556
chin+3.990
arms+3.983
robe+3.950
Neg logit contributions
 Fisher-3.195
 APPLIC-3.099
 Drinking-3.053
 dams-2.969
stanbul-2.933
Token be
Token ID307
Feature activation+8.13e-01

Pos logit contributions
eta+1.059
adh+1.004
�醒+0.894
chin+0.875
arms+0.867
Neg logit contributions
 Fisher-0.718
 Drinking-0.664
 APPLIC-0.663
 dams-0.650
 Tile-0.644
Token the
Token ID262
Feature activation-9.51e-01

Pos logit contributions
eta+0.777
adh+0.735
�醒+0.651
chin+0.645
arms+0.641
Neg logit contributions
 Fisher-0.481
 APPLIC-0.449
 Drinking-0.446
 dams-0.437
 Tile-0.431
Token Sh
Token ID911
Feature activation-1.99e-01

Pos logit contributions
 Fisher+0.623
 Drinking+0.575
 APPLIC+0.572
 dams+0.563
 Tile+0.559
Neg logit contributions
eta-0.858
adh-0.813
�醒-0.728
chin-0.705
mast-0.700
Tokenred
Token ID445
Feature activation-1.86e+00

Pos logit contributions
 Fisher+0.142
 Drinking+0.133
 APPLIC+0.133
 dams+0.131
 Tile+0.129
Neg logit contributions
eta-0.168
adh-0.158
�醒-0.137
chin-0.136
arms-0.134
Token Crit
Token ID10056
Feature activation+6.30e-01

Pos logit contributions
eta+0.915
adh+0.864
�醒+0.764
chin+0.756
arms+0.743
Neg logit contributions
 Fisher-0.629
 Drinking-0.583
 dams-0.583
 APPLIC-0.582
 DUI-0.568
Tokenically
Token ID1146
Feature activation+1.43e+00

Pos logit contributions
eta+0.523
adh+0.493
�醒+0.430
chin+0.423
arms+0.418
Neg logit contributions
 Fisher-0.456
 Drinking-0.427
 APPLIC-0.427
 dams-0.419
 Tile-0.415
Token,
Token ID11
Feature activation+9.63e-02

Pos logit contributions
eta+1.279
adh+1.206
�醒+1.063
chin+1.050
arms+1.039
Neg logit contributions
 Fisher-0.938
 Drinking-0.874
 APPLIC-0.870
 dams-0.852
 Tile-0.847
Token these
Token ID777
Feature activation+3.42e-01

Pos logit contributions
eta+0.091
adh+0.087
�醒+0.077
chin+0.076
arms+0.075
Neg logit contributions
 Fisher-0.057
 Drinking-0.053
 APPLIC-0.053
 dams-0.051
 DUI-0.051
Token approaches
Token ID10581
Feature activation-3.62e-02

Pos logit contributions
eta+0.324
adh+0.308
�醒+0.271
chin+0.270
arms+0.267
Neg logit contributions
 Fisher-0.205
 Drinking-0.190
 APPLIC-0.188
 Tile-0.182
 dams-0.182
Token strike
Token ID5587
Feature activation+5.30e+00

Pos logit contributions
 Fisher+0.023
 Drinking+0.022
 APPLIC+0.022
 dams+0.021
 Tile+0.021
Neg logit contributions
eta-0.033
adh-0.031
�醒-0.027
chin-0.027
mast-0.027
Token right
Token ID826
Feature activation+2.48e+00

Pos logit contributions
eta+4.633
adh+4.361
chin+3.855
arms+3.806
 bombshell+3.790
Neg logit contributions
 Fisher-3.323
 Drinking-3.205
 APPLIC-3.164
 Tile-3.030
 DUI-3.024
Token at
Token ID379
Feature activation+6.25e-01

Pos logit contributions
eta+2.376
adh+2.257
�醒+2.009
chin+1.986
arms+1.973
Neg logit contributions
 Fisher-1.440
 Drinking-1.358
 APPLIC-1.336
 DUI-1.302
 dams-1.292
Token the
Token ID262
Feature activation+5.96e-01

Pos logit contributions
eta+0.610
adh+0.578
�醒+0.518
chin+0.511
mast+0.508
Neg logit contributions
 Fisher-0.354
 Drinking-0.328
 APPLIC-0.326
 dams-0.316
 Tile-0.315
Token heart
Token ID2612
Feature activation-4.62e-01

Pos logit contributions
eta+0.544
adh+0.514
�醒+0.455
chin+0.450
mast+0.445
Neg logit contributions
 Fisher-0.377
 Drinking-0.352
 APPLIC-0.350
 dams-0.341
 Tile-0.340
Token of
Token ID286
Feature activation-1.58e+00

Pos logit contributions
 Fisher+0.273
 Drinking+0.251
 APPLIC+0.251
 dams+0.247
 Tile+0.244
Neg logit contributions
eta-0.445
adh-0.422
�醒-0.378
chin-0.370
arms-0.366
Token and
Token ID290
Feature activation+3.12e-01

Pos logit contributions
eta+0.785
adh+0.744
�醒+0.652
chin+0.647
mast+0.639
Neg logit contributions
 Fisher-0.550
 APPLIC-0.512
 Drinking-0.507
 dams-0.500
 Tile-0.497
Token night
Token ID1755
Feature activation+7.12e-01

Pos logit contributions
eta+0.279
adh+0.264
�醒+0.231
chin+0.230
mast+0.227
Neg logit contributions
 Fisher-0.204
 APPLIC-0.190
 Drinking-0.189
 dams-0.186
 Tile-0.185
Token!
Token ID0
Feature activation+4.59e-01

Pos logit contributions
eta+0.621
adh+0.586
�醒+0.513
chin+0.506
mast+0.501
Neg logit contributions
 Fisher-0.484
 APPLIC-0.452
 Drinking-0.449
 dams-0.443
 Tile-0.440
Token
Token ID198
Feature activation-2.67e-01

Pos logit contributions
eta+0.420
adh+0.399
�醒+0.351
chin+0.348
arms+0.344
Neg logit contributions
 Fisher-0.290
 APPLIC-0.270
 Drinking-0.268
 dams-0.265
 Tile-0.261
Token
Token ID198
Feature activation-7.60e-01

Pos logit contributions
 Fisher+0.177
 Drinking+0.164
 APPLIC+0.164
 dams+0.161
 Tile+0.159
Neg logit contributions
eta-0.238
adh-0.225
�醒-0.199
chin-0.195
arms-0.193
TokenRated
Token ID15322
Feature activation+5.30e+00

Pos logit contributions
 Fisher+0.604
 Drinking+0.568
 APPLIC+0.567
 dams+0.557
 Tile+0.553
Neg logit contributions
eta-0.575
adh-0.539
�醒-0.471
chin-0.453
arms-0.448
Token 5
Token ID642
Feature activation+9.94e-01

Pos logit contributions
eta+5.029
adh+4.792
�醒+4.216
chin+4.198
 bombshell+4.179
Neg logit contributions
 Fisher-2.887
 dams-2.774
 Drinking-2.700
 APPLIC-2.674
stanbul-2.661
Token out
Token ID503
Feature activation+1.24e+00

Pos logit contributions
eta+0.828
adh+0.778
chin+0.666
�醒+0.661
mast+0.660
Neg logit contributions
 Fisher-0.713
 Drinking-0.672
 APPLIC-0.667
 dams-0.656
 Tile-0.653
Token of
Token ID286
Feature activation+3.19e-01

Pos logit contributions
eta+1.105
adh+1.043
chin+0.912
mast+0.908
arms+0.893
Neg logit contributions
 Fisher-0.810
 Drinking-0.764
 dams-0.754
 APPLIC-0.752
 Tile-0.746
Token 5
Token ID642
Feature activation-5.69e-01

Pos logit contributions
eta+0.333
adh+0.317
chin+0.284
mast+0.281
�醒+0.279
Neg logit contributions
 Fisher-0.157
 Drinking-0.146
 APPLIC-0.144
 dams-0.143
 Tile-0.140
Token by
Token ID416
Feature activation+9.33e-02

Pos logit contributions
 Fisher+0.394
 Drinking+0.366
 APPLIC+0.366
 dams+0.359
 Tile+0.357
Neg logit contributions
eta-0.491
adh-0.464
�醒-0.411
chin-0.400
arms-0.396
Tokeniff
Token ID733
Feature activation-1.02e+00

Pos logit contributions
 Fisher+1.151
 Drinking+1.076
 APPLIC+1.074
 dams+1.056
 Tile+1.047
Neg logit contributions
eta-1.375
adh-1.297
�醒-1.137
chin-1.114
arms-1.103
TokenTra
Token ID15721
Feature activation-8.02e-01

Pos logit contributions
 Fisher+0.644
 Drinking+0.597
 APPLIC+0.596
 dams+0.584
 Tile+0.579
Neg logit contributions
eta-0.936
adh-0.889
�醒-0.794
chin-0.773
arms-0.766
Tokenx
Token ID87
Feature activation-7.33e-01

Pos logit contributions
 Fisher+0.690
 APPLIC+0.652
 Drinking+0.652
 dams+0.642
 Tile+0.638
Neg logit contributions
eta-0.551
adh-0.515
�醒-0.446
chin-0.422
arms-0.418
Token
Token ID198
Feature activation-5.55e-01

Pos logit contributions
 Fisher+0.480
 Drinking+0.446
 APPLIC+0.445
 dams+0.436
 Tile+0.432
Neg logit contributions
eta-0.659
adh-0.625
�醒-0.554
chin-0.542
arms-0.537
Token
Token ID198
Feature activation-9.09e-01

Pos logit contributions
 Fisher+0.369
 Drinking+0.343
 APPLIC+0.342
 dams+0.337
 Tile+0.333
Neg logit contributions
eta-0.493
adh-0.466
�醒-0.410
chin-0.404
arms-0.400
TokenReviewer
Token ID35407
Feature activation+5.30e+00

Pos logit contributions
 Fisher+0.715
 Drinking+0.672
 APPLIC+0.671
 dams+0.659
 Tile+0.654
Neg logit contributions
eta-0.694
adh-0.651
�醒-0.567
chin-0.547
arms-0.541
Token:
Token ID25
Feature activation+1.03e+00

Pos logit contributions
eta+4.116
adh+3.784
mast+3.302
arms+3.214
chin+3.196
Neg logit contributions
 Fisher-3.977
 Drinking-3.817
 dams-3.773
 APPLIC-3.751
 Tile-3.739
Token V
Token ID569
Feature activation-4.47e-03

Pos logit contributions
eta+0.947
adh+0.899
�醒+0.792
chin+0.785
mast+0.777
Neg logit contributions
 Fisher-0.646
 APPLIC-0.603
 Drinking-0.600
 dams-0.591
 Tile-0.582
Tokenthe
Token ID1169
Feature activation-6.60e-01

Pos logit contributions
 Fisher+0.003
 Drinking+0.003
 APPLIC+0.003
 dams+0.003
 Tile+0.003
Neg logit contributions
eta-0.004
adh-0.003
�醒-0.003
chin-0.003
arms-0.003
TokenHappy
Token ID25082
Feature activation+7.86e-01

Pos logit contributions
 Fisher+0.479
 Drinking+0.448
 APPLIC+0.447
 dams+0.440
 Tile+0.436
Neg logit contributions
eta-0.546
adh-0.514
�醒-0.451
chin-0.440
arms-0.436
TokenL
Token ID43
Feature activation-1.64e+00

Pos logit contributions
eta+0.687
adh+0.648
�醒+0.570
chin+0.561
mast+0.557
Neg logit contributions
 Fisher-0.532
 APPLIC-0.498
 Drinking-0.496
 dams-0.490
 Tile-0.484
Token 26
Token ID2608
Feature activation+1.22e-01

Pos logit contributions
eta+0.553
adh+0.521
chin+0.449
mast+0.445
arms+0.442
Neg logit contributions
 Fisher-0.443
 Drinking-0.415
 APPLIC-0.413
 dams-0.412
 Tile-0.407
Token,
Token ID11
Feature activation+5.38e-01

Pos logit contributions
eta+0.110
adh+0.104
�醒+0.090
chin+0.090
mast+0.090
Neg logit contributions
 Fisher-0.080
 Drinking-0.075
 APPLIC-0.074
 dams-0.073
 Tile-0.073
Token 2009
Token ID3717
Feature activation-1.43e-01

Pos logit contributions
eta+0.380
adh+0.354
�醒+0.303
chin+0.294
arms+0.289
Neg logit contributions
 Fisher-0.458
 Drinking-0.433
 APPLIC-0.432
 dams-0.426
 Toll-0.423
Token
Token ID198
Feature activation+5.95e-01

Pos logit contributions
 Fisher+0.095
 Drinking+0.089
 APPLIC+0.088
 dams+0.087
 Tile+0.086
Neg logit contributions
eta-0.126
adh-0.119
�醒-0.104
chin-0.103
arms-0.102
Token
Token ID198
Feature activation+8.53e-02

Pos logit contributions
eta+0.517
adh+0.487
chin+0.421
�醒+0.419
arms+0.417
Neg logit contributions
 Fisher-0.407
 Drinking-0.381
 APPLIC-0.378
 dams-0.375
 Tile-0.370
TokenSubject
Token ID19776
Feature activation+5.25e+00

Pos logit contributions
eta+0.052
adh+0.049
�醒+0.043
chin+0.039
robe+0.038
Neg logit contributions
 Fisher-0.080
 APPLIC-0.075
 Drinking-0.075
 dams-0.074
 Toll-0.074
Token:
Token ID25
Feature activation+2.32e+00

Pos logit contributions
eta+4.015
adh+3.707
mast+3.227
chin+3.182
arms+3.106
Neg logit contributions
 Fisher-3.971
 Drinking-3.816
 dams-3.795
 Tile-3.743
-+-+-3.719
Token THE
Token ID3336
Feature activation-9.26e-01

Pos logit contributions
eta+2.146
adh+2.045
�醒+1.794
chin+1.792
mast+1.775
Neg logit contributions
 Fisher-1.395
 APPLIC-1.295
 dams-1.285
 Drinking-1.275
stanbul-1.264
Token IR
Token ID14826
Feature activation-1.31e+00

Pos logit contributions
 Fisher+0.559
 Drinking+0.515
 APPLIC+0.513
 dams+0.506
 Tile+0.501
Neg logit contributions
eta-0.878
adh-0.833
�醒-0.743
chin-0.729
mast-0.723
TokenONY
Token ID40508
Feature activation-1.78e+00

Pos logit contributions
 Fisher+0.841
 Drinking+0.780
 APPLIC+0.779
 dams+0.763
 Tile+0.756
Neg logit contributions
eta-1.196
adh-1.133
�醒-1.007
chin-0.986
arms-0.977
Token OF
Token ID3963
Feature activation-1.78e+00

Pos logit contributions
 Fisher+0.992
 Drinking+0.908
 APPLIC+0.906
 dams+0.890
 Tile+0.878
Neg logit contributions
eta-1.772
adh-1.687
�醒-1.512
chin-1.486
arms-1.474
Token sound
Token ID2128
Feature activation+1.89e-01

Pos logit contributions
eta+0.802
adh+0.762
�醒+0.673
chin+0.671
arms+0.662
Neg logit contributions
 Fisher-0.514
 APPLIC-0.483
 Drinking-0.480
 dams-0.470
 Tile-0.465
Token system
Token ID1080
Feature activation+2.54e-01

Pos logit contributions
eta+0.197
adh+0.188
�醒+0.168
chin+0.167
arms+0.166
Neg logit contributions
 Fisher-0.096
 APPLIC-0.088
 Drinking-0.088
 dams-0.085
 Toll-0.084
Token.
Token ID13
Feature activation-5.74e-01

Pos logit contributions
eta+0.225
adh+0.212
�醒+0.187
chin+0.184
arms+0.182
Neg logit contributions
 Fisher-0.169
 APPLIC-0.158
 Drinking-0.158
 dams-0.154
 Tile-0.153
Token
Token ID198
Feature activation-3.96e-01

Pos logit contributions
 Fisher+0.342
 APPLIC+0.316
 Drinking+0.315
 dams+0.310
 Tile+0.306
Neg logit contributions
eta-0.547
adh-0.520
�醒-0.464
chin-0.456
arms-0.451
Token
Token ID198
Feature activation-7.80e-01

Pos logit contributions
 Fisher+0.261
 Drinking+0.242
 APPLIC+0.242
 dams+0.237
 Tile+0.235
Neg logit contributions
eta-0.354
adh-0.335
�醒-0.297
chin-0.291
arms-0.288
TokenRated
Token ID15322
Feature activation+5.22e+00

Pos logit contributions
 Fisher+0.618
 Drinking+0.582
 APPLIC+0.581
 dams+0.570
 Tile+0.567
Neg logit contributions
eta-0.591
adh-0.553
�醒-0.485
chin-0.465
arms-0.460
Token 5
Token ID642
Feature activation+1.03e+00

Pos logit contributions
eta+4.933
adh+4.702
�醒+4.124
chin+4.116
 bombshell+4.101
Neg logit contributions
 Fisher-2.860
 dams-2.748
 Drinking-2.681
 APPLIC-2.655
stanbul-2.640
Token out
Token ID503
Feature activation+1.11e+00

Pos logit contributions
eta+0.857
adh+0.805
chin+0.689
�醒+0.684
mast+0.682
Neg logit contributions
 Fisher-0.739
 Drinking-0.697
 APPLIC-0.692
 dams-0.681
 Tile-0.677
Token of
Token ID286
Feature activation+3.26e-01

Pos logit contributions
eta+0.990
adh+0.935
chin+0.818
mast+0.813
arms+0.801
Neg logit contributions
 Fisher-0.721
 Drinking-0.679
 dams-0.671
 APPLIC-0.669
 Tile-0.663
Token 5
Token ID642
Feature activation-4.85e-01

Pos logit contributions
eta+0.341
adh+0.324
chin+0.291
mast+0.287
�醒+0.286
Neg logit contributions
 Fisher-0.161
 Drinking-0.150
 APPLIC-0.148
 dams-0.147
 Tile-0.144
Token by
Token ID416
Feature activation+5.95e-02

Pos logit contributions
 Fisher+0.336
 Drinking+0.314
 APPLIC+0.313
 dams+0.307
 Tile+0.305
Neg logit contributions
eta-0.417
adh-0.394
�醒-0.348
chin-0.339
arms-0.336
Token:
Token ID25
Feature activation-1.26e+00

Pos logit contributions
 Fisher+0.830
 Drinking+0.766
 APPLIC+0.763
 dams+0.751
 Tile+0.742
Neg logit contributions
eta-1.194
adh-1.132
�醒-1.010
chin-0.984
arms-0.976
Token "
Token ID366
Feature activation-1.01e+00

Pos logit contributions
 Fisher+0.768
 Drinking+0.711
 APPLIC+0.700
 dams+0.689
 Tile+0.678
Neg logit contributions
eta-1.188
adh-1.123
�醒-1.007
chin-0.983
arms-0.976
TokenWhy
Token ID5195
Feature activation+2.57e+00

Pos logit contributions
 Fisher+0.545
 Drinking+0.497
 APPLIC+0.497
 dams+0.486
 Tile+0.480
Neg logit contributions
eta-1.022
adh-0.973
�醒-0.875
chin-0.860
arms-0.853
Token in
Token ID287
Feature activation+1.08e+00

Pos logit contributions
eta+2.341
adh+2.219
chin+1.952
mast+1.922
 bombshell+1.914
Neg logit contributions
 Fisher-1.587
 APPLIC-1.487
 Drinking-1.474
 Tile-1.444
-+-+-1.444
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
eta+1.051
adh+1.000
�醒+0.892
chin+0.877
mast+0.870
Neg logit contributions
 Fisher-0.629
 APPLIC-0.580
 Drinking-0.580
 dams-0.566
 Tile-0.561
Token hell
Token ID5968
Feature activation+5.16e+00

Pos logit contributions
eta+1.876
adh+1.806
�醒+1.645
chin+1.626
mast+1.618
Neg logit contributions
 Fisher-0.539
 APPLIC-0.467
 Drinking-0.466
 dams-0.449
 Tile-0.441
Token do
Token ID466
Feature activation+1.57e+00

Pos logit contributions
eta+4.554
adh+4.321
chin+3.777
mast+3.742
 bombshell+3.695
Neg logit contributions
 Fisher-3.258
 APPLIC-3.048
 Drinking-3.047
 Tile-2.982
-+-+-2.973
Token journalists
Token ID9046
Feature activation+3.18e-01

Pos logit contributions
eta+1.513
adh+1.436
chin+1.265
�醒+1.262
mast+1.247
Neg logit contributions
 Fisher-0.918
 APPLIC-0.852
 Drinking-0.847
 dams-0.825
 Tile-0.824
Token insist
Token ID16361
Feature activation+7.89e-01

Pos logit contributions
eta+0.295
adh+0.278
chin+0.244
�醒+0.244
mast+0.242
Neg logit contributions
 Fisher-0.198
 Drinking-0.183
 APPLIC-0.183
 Tile-0.178
 dams-0.178
Token on
Token ID319
Feature activation+1.05e+00

Pos logit contributions
eta+0.694
adh+0.656
�醒+0.577
chin+0.568
mast+0.563
Neg logit contributions
 Fisher-0.528
 APPLIC-0.494
 Drinking-0.492
 dams-0.482
 Tile-0.480
Token coming
Token ID2406
Feature activation-6.84e-01

Pos logit contributions
eta+1.009
adh+0.959
�醒+0.843
mast+0.842
 bombshell+0.842
Neg logit contributions
 Fisher-0.601
 APPLIC-0.562
 Drinking-0.554
 Tile-0.541
 dams-0.539
Token to
Token ID284
Feature activation+1.52e+00

Pos logit contributions
eta+0.999
adh+0.948
�醒+0.837
chin+0.820
mast+0.814
Neg logit contributions
 Fisher-0.720
 APPLIC-0.671
 Drinking-0.670
 dams-0.657
 Tile-0.651
Token him
Token ID683
Feature activation+7.70e-01

Pos logit contributions
eta+1.426
adh+1.362
�醒+1.183
chin+1.177
mast+1.171
Neg logit contributions
 Fisher-0.911
 APPLIC-0.865
 Drinking-0.849
 dams-0.831
 Tile-0.823
Token:
Token ID25
Feature activation-4.53e-01

Pos logit contributions
eta+0.700
adh+0.663
�醒+0.589
chin+0.576
arms+0.571
Neg logit contributions
 Fisher-0.496
 Drinking-0.460
 APPLIC-0.460
 dams-0.451
 Tile-0.447
Token "
Token ID366
Feature activation-8.16e-01

Pos logit contributions
 Fisher+0.286
 APPLIC+0.266
 Drinking+0.265
 dams+0.261
 Tile+0.258
Neg logit contributions
eta-0.416
adh-0.394
�醒-0.350
chin-0.343
arms-0.340
TokenTh
Token ID817
Feature activation-1.49e+00

Pos logit contributions
 Fisher+0.499
 APPLIC+0.461
 Drinking+0.461
 dams+0.453
 Tile+0.447
Neg logit contributions
eta-0.769
adh-0.730
�醒-0.649
chin-0.637
arms-0.633
Tokenou
Token ID280
Feature activation-5.92e+00

Pos logit contributions
 Fisher+1.054
 Drinking+0.986
 APPLIC+0.972
 dams+0.964
 Tile+0.961
Neg logit contributions
eta-1.244
adh-1.171
�醒-1.044
chin-1.010
mast-1.002
Token hast
Token ID19338
Feature activation-1.98e+00

Pos logit contributions
 Fisher+3.508
 Drinking+3.232
 dams+3.205
 APPLIC+3.188
 Tile+3.151
Neg logit contributions
eta-5.539
adh-5.247
�醒-4.894
chin-4.668
arms-4.639
Token judged
Token ID19589
Feature activation-1.43e+00

Pos logit contributions
 Fisher+1.246
 Drinking+1.153
 APPLIC+1.152
 dams+1.129
 Tile+1.118
Neg logit contributions
eta-1.827
adh-1.732
�醒-1.541
chin-1.509
arms-1.496
Token with
Token ID351
Feature activation+1.30e+00

Pos logit contributions
 Fisher+0.883
 Drinking+0.818
 APPLIC+0.817
 dams+0.799
 Tile+0.793
Neg logit contributions
eta-1.328
adh-1.262
�醒-1.117
chin-1.098
mast-1.091
Token the
Token ID262
Feature activation+6.78e-01

Pos logit contributions
eta+1.193
adh+1.141
�醒+0.994
arms+0.985
mast+0.984
Neg logit contributions
 Fisher-0.814
 APPLIC-0.763
 Drinking-0.759
 Tile-0.739
 dams-0.738
Token judgement
Token ID23071
Feature activation+1.95e+00

Pos logit contributions
eta+0.612
adh+0.584
�醒+0.508
chin+0.502
mast+0.502
Neg logit contributions
 Fisher-0.428
 APPLIC-0.405
 Drinking-0.401
 dams-0.391
stanbul-0.390
TokenALTH
Token ID40818
Feature activation+1.07e+00

Pos logit contributions
 Fisher+0.857
 Drinking+0.803
 APPLIC+0.800
 dams+0.788
 Tile+0.782
Neg logit contributions
eta-0.929
adh-0.875
�醒-0.773
chin-0.749
mast-0.743
TokenAM
Token ID2390
Feature activation-3.80e+00

Pos logit contributions
eta+0.669
adh+0.615
�醒+0.503
arms+0.491
mast+0.490
Neg logit contributions
 Fisher-0.978
 Drinking-0.934
 dams-0.925
 APPLIC-0.921
stanbul-0.915
Token,
Token ID11
Feature activation-1.91e+00

Pos logit contributions
 Fisher+2.443
 APPLIC+2.259
 Drinking+2.249
 Tile+2.184
 Toll+2.178
Neg logit contributions
eta-3.433
adh-3.270
�醒-2.927
chin-2.859
arms-2.825
Token Mass
Token ID5674
Feature activation-2.72e+00

Pos logit contributions
 Fisher+1.147
 Drinking+1.058
 APPLIC+1.056
 dams+1.035
 Tile+1.023
Neg logit contributions
eta-1.819
adh-1.728
�醒-1.542
chin-1.512
arms-1.501
Token.
Token ID13
Feature activation-2.55e+00

Pos logit contributions
 Fisher+1.792
 APPLIC+1.657
 Drinking+1.649
 Toll+1.616
 dams+1.607
Neg logit contributions
eta-2.401
adh-2.263
�醒-2.041
mast-1.989
chin-1.984
Token --
Token ID1377
Feature activation-5.81e+00

Pos logit contributions
 Fisher+1.815
 Drinking+1.691
 APPLIC+1.686
 Toll+1.641
 Tile+1.640
Neg logit contributions
eta-2.163
adh-2.033
�醒-1.805
chin-1.751
arms-1.736
Token Boston
Token ID6182
Feature activation-1.01e+00

Pos logit contributions
 Fisher+3.788
 Drinking+3.498
 APPLIC+3.426
 Tile+3.374
 Toll+3.362
Neg logit contributions
eta-5.287
adh-4.994
�醒-4.523
chin-4.391
arms-4.319
Token Celtics
Token ID24116
Feature activation-5.03e-01

Pos logit contributions
 Fisher+0.544
 Drinking+0.493
 APPLIC+0.491
 dams+0.480
 Tile+0.476
Neg logit contributions
eta-1.031
adh-0.985
�醒-0.889
chin-0.871
mast-0.861
Token swing
Token ID9628
Feature activation-1.00e+00

Pos logit contributions
 Fisher+0.323
 Drinking+0.300
 APPLIC+0.300
 dams+0.294
 Tile+0.291
Neg logit contributions
eta-0.458
adh-0.433
�醒-0.384
chin-0.376
arms-0.374
Tokenman
Token ID805
Feature activation-2.87e+00

Pos logit contributions
 Fisher+0.831
 Drinking+0.784
 APPLIC+0.783
 dams+0.771
 Tile+0.766
Neg logit contributions
eta-0.725
adh-0.676
�醒-0.580
chin-0.564
arms-0.557
Token Jae
Token ID34521
Feature activation-1.27e-01

Pos logit contributions
 Fisher+1.820
 Drinking+1.659
 APPLIC+1.650
 dams+1.629
 Tile+1.623
Neg logit contributions
eta-2.656
adh-2.529
�醒-2.269
chin-2.211
mast-2.181
TokenL
Token ID43
Feature activation+3.17e-01

Pos logit contributions
 Fisher+0.437
 Drinking+0.406
 APPLIC+0.406
 dams+0.397
 Tile+0.394
Neg logit contributions
eta-0.584
adh-0.552
�醒-0.491
chin-0.478
arms-0.473
TokenANS
Token ID15037
Feature activation+1.45e+00

Pos logit contributions
eta+0.268
adh+0.252
�醒+0.222
chin+0.217
arms+0.215
Neg logit contributions
 Fisher-0.224
 Drinking-0.209
 APPLIC-0.209
 dams-0.205
 Tile-0.203
TokenING
Token ID2751
Feature activation-1.95e-01

Pos logit contributions
eta+1.400
adh+1.333
chin+1.177
robe+1.167
arms+1.158
Neg logit contributions
 Fisher-0.807
 Drinking-0.754
 dams-0.737
 Toll-0.730
 APPLIC-0.728
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
 Fisher+0.134
 Drinking+0.125
 APPLIC+0.125
 dams+0.123
 Tile+0.122
Neg logit contributions
eta-0.169
adh-0.160
�醒-0.141
chin-0.138
arms-0.136
Token MI
Token ID15789
Feature activation-1.00e+00

Pos logit contributions
 Fisher+1.351
 Drinking+1.264
 APPLIC+1.263
 dams+1.242
 Tile+1.231
Neg logit contributions
eta-1.523
adh-1.434
�醒-1.256
chin-1.226
arms-1.214
Token --
Token ID1377
Feature activation-5.80e+00

Pos logit contributions
 Fisher+0.736
 Drinking+0.689
 APPLIC+0.688
 dams+0.675
 Tile+0.671
Neg logit contributions
eta-0.818
adh-0.770
�醒-0.676
chin-0.657
mast-0.651
Token The
Token ID383
Feature activation-9.70e-01

Pos logit contributions
 Fisher+3.680
 Drinking+3.415
 APPLIC+3.339
 DUI+3.277
 Toll+3.260
Neg logit contributions
eta-5.389
adh-5.105
�醒-4.641
chin-4.482
robe-4.413
Token Michigan
Token ID7055
Feature activation-1.23e+00

Pos logit contributions
 Fisher+0.575
 Drinking+0.530
 APPLIC+0.528
 dams+0.517
 Tile+0.512
Neg logit contributions
eta-0.932
adh-0.886
�醒-0.793
chin-0.777
arms-0.770
Token House
Token ID2097
Feature activation-1.04e+00

Pos logit contributions
 Fisher+0.531
 Drinking+0.474
 APPLIC+0.473
 dams+0.459
 Tile+0.453
Neg logit contributions
eta-1.376
adh-1.316
�醒-1.198
chin-1.178
arms-1.170
Token is
Token ID318
Feature activation-9.32e-01

Pos logit contributions
 Fisher+0.638
 Drinking+0.590
 APPLIC+0.589
 dams+0.577
 Tile+0.572
Neg logit contributions
eta-0.978
adh-0.928
�醒-0.827
chin-0.811
arms-0.804
Token looking
Token ID2045
Feature activation-3.11e-01

Pos logit contributions
 Fisher+0.552
 APPLIC+0.509
 Drinking+0.509
 dams+0.498
 Tile+0.495
Neg logit contributions
eta-0.893
adh-0.846
�醒-0.752
chin-0.747
mast-0.736
TokenTA
Token ID5603
Feature activation-1.26e+00

Pos logit contributions
 Fisher+0.500
 Drinking+0.462
 APPLIC+0.461
 dams+0.452
 Tile+0.448
Neg logit contributions
eta-0.760
adh-0.720
�醒-0.641
chin-0.629
arms-0.624
TokenWA
Token ID15543
Feature activation-2.53e+00

Pos logit contributions
 Fisher+0.840
 Drinking+0.793
 APPLIC+0.785
 dams+0.777
 Tile+0.775
Neg logit contributions
eta-1.104
adh-1.043
�醒-0.908
chin-0.907
arms-0.889
Token (
Token ID357
Feature activation-3.19e+00

Pos logit contributions
 Fisher+1.856
 Drinking+1.746
 APPLIC+1.735
 dams+1.691
 Toll+1.687
Neg logit contributions
eta-2.077
adh-1.957
�醒-1.735
chin-1.673
arms-1.665
TokenReuters
Token ID12637
Feature activation-2.15e+00

Pos logit contributions
 Fisher+2.194
 Drinking+2.037
 APPLIC+2.037
 dams+1.977
 Toll+1.973
Neg logit contributions
eta-2.747
adh-2.604
�醒-2.307
chin-2.230
mast-2.207
Token)
Token ID8
Feature activation-3.50e+00

Pos logit contributions
 Fisher+1.093
 Drinking+0.997
 APPLIC+0.995
 dams+0.987
 Tile+0.967
Neg logit contributions
eta-2.236
adh-2.126
chin-1.894
�醒-1.882
arms-1.876
Token -
Token ID532
Feature activation-5.62e+00

Pos logit contributions
 Fisher+2.066
 Drinking+1.902
 APPLIC+1.899
 dams+1.860
 Tile+1.841
Neg logit contributions
eta-3.371
adh-3.202
�醒-2.864
chin-2.809
arms-2.785
Token Canada
Token ID3340
Feature activation-1.36e+00

Pos logit contributions
 Fisher+3.585
 Drinking+3.331
 APPLIC+3.255
 Toll+3.200
 Tile+3.164
Neg logit contributions
eta-5.176
adh-4.928
�醒-4.480
chin-4.314
robe-4.263
Token
Token ID447
Feature activation-1.29e+00

Pos logit contributions
 Fisher+0.870
 Drinking+0.806
 APPLIC+0.803
 dams+0.789
 Tile+0.781
Neg logit contributions
eta-1.242
adh-1.178
�醒-1.049
chin-1.024
arms-1.015
Token
Token ID247
Feature activation-5.18e-01

Pos logit contributions
 Fisher+1.004
 Drinking+0.945
 APPLIC+0.943
 dams+0.929
 Tile+0.921
Neg logit contributions
eta-0.992
adh-0.931
�醒-0.807
chin-0.786
mast-0.778
Tokens
Token ID82
Feature activation+3.09e-01

Pos logit contributions
 Fisher+0.324
 Drinking+0.300
 APPLIC+0.300
 dams+0.293
 Tile+0.292
Neg logit contributions
eta-0.480
adh-0.456
�醒-0.404
chin-0.397
arms-0.394
Token top
Token ID1353
Feature activation+1.47e-01

Pos logit contributions
eta+0.295
adh+0.280
�醒+0.248
chin+0.245
arms+0.244
Neg logit contributions
 Fisher-0.183
 APPLIC-0.170
 Drinking-0.169
 Tile-0.164
 dams-0.164
TokenT
Token ID51
Feature activation-7.03e-01

Pos logit contributions
 Fisher+0.224
 Drinking+0.209
 APPLIC+0.208
 dams+0.204
 Tile+0.202
Neg logit contributions
eta-0.306
adh-0.289
�醒-0.257
chin-0.251
arms-0.249
TokenAM
Token ID2390
Feature activation-6.94e-01

Pos logit contributions
 Fisher+0.536
 Drinking+0.503
 APPLIC+0.503
 dams+0.494
 Tile+0.491
Neg logit contributions
eta-0.553
adh-0.519
�醒-0.456
chin-0.441
arms-0.436
TokenPA
Token ID4537
Feature activation-1.51e+00

Pos logit contributions
 Fisher+0.507
 Drinking+0.475
 APPLIC+0.474
 dams+0.468
 Tile+0.462
Neg logit contributions
eta-0.570
adh-0.536
�醒-0.467
chin-0.458
arms-0.454
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 Fisher+1.071
 Drinking+0.998
 APPLIC+0.996
 dams+0.979
 Tile+0.972
Neg logit contributions
eta-1.271
adh-1.201
�醒-1.057
chin-1.029
arms-1.019
Token FL
Token ID9977
Feature activation-1.03e+00

Pos logit contributions
 Fisher+0.690
 Drinking+0.641
 APPLIC+0.640
 dams+0.629
 Tile+0.623
Neg logit contributions
eta-0.914
adh-0.864
�醒-0.765
chin-0.748
arms-0.741
Token
Token ID851
Feature activation-5.55e+00

Pos logit contributions
 Fisher+0.719
 Drinking+0.671
 APPLIC+0.670
 dams+0.659
 Tile+0.653
Neg logit contributions
eta-0.874
adh-0.825
�醒-0.726
chin-0.710
arms-0.703
Token Tampa
Token ID15528
Feature activation-2.61e+00

Pos logit contributions
 Fisher+3.643
 Drinking+3.365
 APPLIC+3.321
 Toll+3.233
 Tile+3.223
Neg logit contributions
eta-5.030
adh-4.767
�醒-4.350
chin-4.197
robe-4.130
Token Bay
Token ID4696
Feature activation+2.43e-01

Pos logit contributions
 Fisher+1.377
 Drinking+1.232
 APPLIC+1.224
 dams+1.201
 Tile+1.189
Neg logit contributions
eta-2.686
adh-2.581
�醒-2.342
chin-2.282
mast-2.266
Token area
Token ID1989
Feature activation-3.61e-01

Pos logit contributions
eta+0.227
adh+0.215
�醒+0.192
chin+0.188
arms+0.186
Neg logit contributions
 Fisher-0.151
 Drinking-0.139
 APPLIC-0.139
 dams-0.136
 Tile-0.135
Token residents
Token ID5085
Feature activation-2.49e+00

Pos logit contributions
 Fisher+0.221
 Drinking+0.204
 APPLIC+0.203
 dams+0.199
 Tile+0.197
Neg logit contributions
eta-0.340
adh-0.323
�醒-0.288
chin-0.282
arms-0.280
Token are
Token ID389
Feature activation-1.61e+00

Pos logit contributions
 Fisher+1.626
 Drinking+1.506
 APPLIC+1.495
 dams+1.476
 DUI+1.455
Neg logit contributions
eta-2.245
adh-2.124
�醒-1.922
chin-1.847
arms-1.834
TokenSON
Token ID11782
Feature activation-2.90e+00

Pos logit contributions
 Fisher+1.453
 Drinking+1.277
 APPLIC+1.270
 dams+1.219
 Tile+1.206
Neg logit contributions
eta-4.032
adh-3.848
�醒-3.564
chin-3.468
mast-3.442
TokenVILLE
Token ID38526
Feature activation-2.70e+00

Pos logit contributions
 Fisher+0.859
 APPLIC+0.717
 Drinking+0.717
 dams+0.681
 Tile+0.668
Neg logit contributions
eta-3.650
adh-3.507
�醒-3.234
chin-3.182
arms-3.160
Token,
Token ID11
Feature activation-2.35e+00

Pos logit contributions
 Fisher+1.709
 APPLIC+1.583
 Drinking+1.581
 dams+1.539
 Tile+1.531
Neg logit contributions
eta-2.494
adh-2.367
�醒-2.110
chin-2.057
mast-2.036
Token Fla
Token ID22026
Feature activation-2.77e+00

Pos logit contributions
 Fisher+1.548
 Drinking+1.438
 APPLIC+1.434
 dams+1.406
 Tile+1.395
Neg logit contributions
eta-2.111
adh-1.997
�醒-1.772
chin-1.731
mast-1.715
Token.
Token ID13
Feature activation-2.57e+00

Pos logit contributions
 Fisher+1.838
 Drinking+1.693
 APPLIC+1.693
 dams+1.649
 Tile+1.647
Neg logit contributions
eta-2.455
adh-2.322
�醒-2.087
mast-2.020
chin-2.019
Token --
Token ID1377
Feature activation-5.54e+00

Pos logit contributions
 Fisher+1.782
 Drinking+1.655
 APPLIC+1.650
 dams+1.606
 Tile+1.605
Neg logit contributions
eta-2.230
adh-2.101
�醒-1.868
chin-1.813
arms-1.798
Token A
Token ID317
Feature activation-2.14e+00

Pos logit contributions
 Fisher+3.557
 Drinking+3.271
 APPLIC+3.216
 Tile+3.145
 DUI+3.141
Neg logit contributions
eta-5.135
adh-4.851
�醒-4.395
chin-4.246
arms-4.205
Token Jacksonville
Token ID23194
Feature activation-1.71e+00

Pos logit contributions
 Fisher+1.294
 Drinking+1.187
 APPLIC+1.179
 dams+1.154
 DUI+1.150
Neg logit contributions
eta-2.041
adh-1.938
�醒-1.737
chin-1.693
arms-1.686
Token sheriff
Token ID16570
Feature activation-1.30e+00

Pos logit contributions
 Fisher+1.062
 Drinking+0.980
 APPLIC+0.977
 dams+0.959
 Tile+0.949
Neg logit contributions
eta-1.592
adh-1.512
�醒-1.349
chin-1.319
arms-1.307
Token's
Token ID338
Feature activation-2.18e+00

Pos logit contributions
 Fisher+0.831
 Drinking+0.765
 APPLIC+0.761
 dams+0.752
 DUI+0.742
Neg logit contributions
eta-1.190
adh-1.129
�醒-1.010
chin-0.986
mast-0.974
Token officer
Token ID3818
Feature activation-1.94e+00

Pos logit contributions
 Fisher+1.356
 Drinking+1.238
 DUI+1.232
 APPLIC+1.228
 dams+1.217
Neg logit contributions
eta-2.029
adh-1.934
�醒-1.752
chin-1.692
mast-1.684
Tokenalling
Token ID9221
Feature activation-8.38e-01

Pos logit contributions
 Fisher+0.518
 Drinking+0.487
 APPLIC+0.484
 dams+0.478
 Tile+0.473
Neg logit contributions
eta-0.511
adh-0.480
�醒-0.422
chin-0.406
mast-0.402
Token together
Token ID1978
Feature activation+6.66e-01

Pos logit contributions
 Fisher+0.543
 Drinking+0.504
 APPLIC+0.503
 dams+0.493
 Tile+0.489
Neg logit contributions
eta-0.758
adh-0.718
�醒-0.637
chin-0.623
arms-0.618
Token in
Token ID287
Feature activation-4.24e-01

Pos logit contributions
eta+0.615
adh+0.584
�醒+0.515
chin+0.509
arms+0.504
Neg logit contributions
 Fisher-0.417
 APPLIC-0.388
 Drinking-0.385
 dams-0.378
 Tile-0.376
Token a
Token ID257
Feature activation-8.80e-01

Pos logit contributions
 Fisher+0.274
 Drinking+0.254
 APPLIC+0.252
 dams+0.248
 Tile+0.245
Neg logit contributions
eta-0.387
adh-0.366
�醒-0.327
chin-0.319
mast-0.315
Token cr
Token ID1067
Feature activation-8.99e-02

Pos logit contributions
 Fisher+0.553
 Drinking+0.512
 APPLIC+0.510
 dams+0.501
 Tile+0.496
Neg logit contributions
eta-0.816
adh-0.772
�醒-0.688
chin-0.673
arms-0.667
Tokenum
Token ID388
Feature activation-5.46e+00

Pos logit contributions
 Fisher+0.061
 Drinking+0.057
 APPLIC+0.057
 dams+0.056
 Tile+0.056
Neg logit contributions
eta-0.078
adh-0.073
�醒-0.065
chin-0.064
mast-0.063
Tokenpled
Token ID10137
Feature activation-1.61e+00

Pos logit contributions
 Fisher+4.132
 Drinking+3.894
 APPLIC+3.842
 dams+3.831
 Tile+3.812
Neg logit contributions
eta-4.136
adh-3.938
�醒-3.419
arms-3.361
mast-3.312
Token mess
Token ID2085
Feature activation-7.14e-01

Pos logit contributions
 Fisher+1.005
 Drinking+0.924
 APPLIC+0.916
 dams+0.905
 Tile+0.896
Neg logit contributions
eta-1.497
adh-1.424
�醒-1.271
chin-1.241
arms-1.232
Token of
Token ID286
Feature activation+1.29e+00

Pos logit contributions
 Fisher+0.444
 Drinking+0.412
 APPLIC+0.409
 dams+0.402
 Tile+0.398
Neg logit contributions
eta-0.663
adh-0.629
�醒-0.562
chin-0.549
arms-0.544
Token mutual
Token ID13584
Feature activation+4.87e-01

Pos logit contributions
eta+1.157
adh+1.095
�醒+0.966
chin+0.952
mast+0.943
Neg logit contributions
 Fisher-0.839
 APPLIC-0.781
 Drinking-0.780
 dams-0.762
 Tile-0.759
Token uncertainty
Token ID13479
Feature activation-7.84e-01

Pos logit contributions
eta+0.427
adh+0.404
�醒+0.355
chin+0.348
arms+0.346
Neg logit contributions
 Fisher-0.328
 APPLIC-0.307
 Drinking-0.306
 Tile-0.299
 dams-0.299
Token next
Token ID1306
Feature activation-6.27e-02

Pos logit contributions
 Fisher+1.338
 Drinking+1.232
 APPLIC+1.230
 dams+1.206
 Tile+1.193
Neg logit contributions
eta-2.114
adh-2.009
�醒-1.801
chin-1.758
arms-1.746
Token time
Token ID640
Feature activation-1.47e+00

Pos logit contributions
 Fisher+0.040
 Drinking+0.037
 APPLIC+0.037
 dams+0.037
 Tile+0.036
Neg logit contributions
eta-0.057
adh-0.054
�醒-0.048
chin-0.047
arms-0.047
Token!
Token ID0
Feature activation+4.28e-01

Pos logit contributions
 Fisher+0.897
 Drinking+0.826
 APPLIC+0.823
 dams+0.802
 Tile+0.800
Neg logit contributions
eta-1.389
adh-1.319
�醒-1.193
chin-1.153
mast-1.147
Token<|endoftext|>
Token ID50256
Feature activation-1.76e-01

Pos logit contributions
eta+0.374
adh+0.353
�醒+0.311
chin+0.305
arms+0.302
Neg logit contributions
 Fisher-0.291
 Drinking-0.271
 APPLIC-0.270
 dams-0.266
 Tile-0.263
TokenBR
Token ID11473
Feature activation-9.62e-01

Pos logit contributions
 Fisher+0.116
 Drinking+0.108
 APPLIC+0.107
 dams+0.105
 Tile+0.105
Neg logit contributions
eta-0.157
adh-0.148
�醒-0.132
chin-0.129
arms-0.128
TokenUSS
Token ID32835
Feature activation-5.44e+00

Pos logit contributions
 Fisher+0.614
 Drinking+0.569
 APPLIC+0.568
 dams+0.558
 Tile+0.552
Neg logit contributions
eta-0.880
adh-0.833
�醒-0.740
chin-0.725
arms-0.718
TokenELS
Token ID37142
Feature activation-1.20e+00

Pos logit contributions
 Fisher+2.850
 APPLIC+2.568
 Drinking+2.554
 Tile+2.499
 dams+2.479
Neg logit contributions
eta-5.526
adh-5.324
�醒-4.852
chin-4.817
mast-4.705
Token (
Token ID357
Feature activation-2.99e+00

Pos logit contributions
 Fisher+0.834
 Drinking+0.778
 APPLIC+0.777
 dams+0.761
 Tile+0.755
Neg logit contributions
eta-1.033
adh-0.974
�醒-0.863
chin-0.840
arms-0.832
TokenReuters
Token ID12637
Feature activation-1.98e+00

Pos logit contributions
 Fisher+2.065
 APPLIC+1.923
 Drinking+1.918
 dams+1.869
 Tile+1.864
Neg logit contributions
eta-2.564
adh-2.422
�醒-2.157
chin-2.085
mast-2.060
Token)
Token ID8
Feature activation-3.15e+00

Pos logit contributions
 Fisher+1.009
 Drinking+0.921
 APPLIC+0.919
 dams+0.917
 Tile+0.895
Neg logit contributions
eta-2.045
adh-1.942
chin-1.730
arms-1.712
mast-1.712
Token -
Token ID532
Feature activation-4.90e+00

Pos logit contributions
 Fisher+1.878
 Drinking+1.730
 APPLIC+1.726
 dams+1.699
 Tile+1.678
Neg logit contributions
eta-3.005
adh-2.850
�醒-2.538
chin-2.501
arms-2.478
Token Activision
Token ID40723
Feature activation-2.94e+00

Pos logit contributions
 Fisher+0.032
 Drinking+0.030
 APPLIC+0.029
 dams+0.029
 Tile+0.028
Neg logit contributions
eta-0.059
adh-0.057
�醒-0.051
chin-0.050
arms-0.049
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
 Fisher+1.914
 Drinking+1.757
 APPLIC+1.752
 Toll+1.711
 dams+1.709
Neg logit contributions
eta-2.643
adh-2.524
�醒-2.273
chin-2.199
mast-2.186
Token
Token ID198
Feature activation+4.36e-01

Pos logit contributions
eta+0.257
adh+0.243
�醒+0.215
chin+0.211
mast+0.209
Neg logit contributions
 Fisher-0.176
 Drinking-0.163
 APPLIC-0.163
 dams-0.163
 Tile-0.159
Token
Token ID198
Feature activation-1.53e-01

Pos logit contributions
eta+0.380
adh+0.358
chin+0.310
�醒+0.309
arms+0.307
Neg logit contributions
 Fisher-0.296
 Drinking-0.277
 APPLIC-0.275
 dams-0.272
 Tile-0.269
TokenB
Token ID33
Feature activation-1.13e+00

Pos logit contributions
 Fisher+0.094
 Drinking+0.087
 APPLIC+0.087
 dams+0.085
 Tile+0.084
Neg logit contributions
eta-0.144
adh-0.136
�醒-0.122
chin-0.119
arms-0.118
Tokenung
Token ID2150
Feature activation-5.38e+00

Pos logit contributions
 Fisher+0.946
 APPLIC+0.890
 Drinking+0.890
 dams+0.877
 Tile+0.872
Neg logit contributions
eta-0.802
adh-0.751
�醒-0.661
chin-0.628
mast-0.616
Tokenie
Token ID494
Feature activation-1.15e+00

Pos logit contributions
 Fisher+5.295
 Drinking+5.005
 APPLIC+4.983
 Tile+4.940
 Toll+4.927
Neg logit contributions
eta-2.946
adh-2.724
�醒-2.425
chin-2.146
mast-2.090
Token's
Token ID338
Feature activation-5.98e-01

Pos logit contributions
 Fisher+0.708
 Drinking+0.655
 APPLIC+0.653
 dams+0.640
 Tile+0.634
Neg logit contributions
eta-1.081
adh-1.026
�醒-0.914
chin-0.896
arms-0.888
Token contract
Token ID2775
Feature activation+2.25e-01

Pos logit contributions
 Fisher+0.374
 Drinking+0.347
 APPLIC+0.346
 dams+0.338
 Tile+0.336
Neg logit contributions
eta-0.555
adh-0.525
�醒-0.464
chin-0.456
arms-0.453
Token with
Token ID351
Feature activation+4.38e-01

Pos logit contributions
eta+0.205
adh+0.194
�醒+0.172
chin+0.169
arms+0.167
Neg logit contributions
 Fisher-0.145
 Drinking-0.135
 APPLIC-0.134
 dams-0.131
 Tile-0.130
Token Activision
Token ID40723
Feature activation-2.12e+00

Pos logit contributions
eta+0.414
adh+0.392
�醒+0.348
chin+0.342
arms+0.340
Neg logit contributions
 Fisher-0.264
 Drinking-0.246
 APPLIC-0.245
 dams-0.240
 Tile-0.237
Token geometry
Token ID22939
Feature activation-1.63e+00

Pos logit contributions
 Fisher+0.165
 Drinking+0.153
 APPLIC+0.152
 dams+0.150
 Tile+0.147
Neg logit contributions
eta-0.255
adh-0.241
�醒-0.214
chin-0.211
arms-0.209
Token,
Token ID11
Feature activation+4.25e-01

Pos logit contributions
 Fisher+1.043
 Drinking+0.967
 APPLIC+0.964
 dams+0.945
 Tile+0.936
Neg logit contributions
eta-1.494
adh-1.414
�醒-1.255
chin-1.231
arms-1.220
Token by
Token ID416
Feature activation+7.87e-01

Pos logit contributions
eta+0.397
adh+0.374
�醒+0.328
chin+0.327
arms+0.326
Neg logit contributions
 Fisher-0.262
 Drinking-0.244
 APPLIC-0.240
 dams-0.236
 Toll-0.234
Token adding
Token ID4375
Feature activation-7.36e-01

Pos logit contributions
eta+0.759
adh+0.719
�醒+0.637
chin+0.633
mast+0.633
Neg logit contributions
 Fisher-0.454
 Drinking-0.420
 APPLIC-0.414
 dams-0.409
 Toll-0.401
Token a
Token ID257
Feature activation+1.26e+00

Pos logit contributions
 Fisher+0.463
 Drinking+0.430
 APPLIC+0.428
 dams+0.420
 Tile+0.414
Neg logit contributions
eta-0.679
adh-0.642
�醒-0.569
chin-0.560
arms-0.556
Token vertex
Token ID37423
Feature activation-5.27e+00

Pos logit contributions
eta+1.178
adh+1.110
�醒+0.981
chin+0.975
 bombshell+0.972
Neg logit contributions
 Fisher-0.770
 Drinking-0.717
 APPLIC-0.715
 dams-0.707
 DUI-0.685
Token displacement
Token ID29358
Feature activation-6.50e-01

Pos logit contributions
 Fisher+3.098
 APPLIC+2.872
 Tile+2.826
 Drinking+2.822
 dams+2.784
Neg logit contributions
eta-5.019
adh-4.801
�醒-4.316
chin-4.203
mast-4.170
Token function
Token ID2163
Feature activation+3.93e-01

Pos logit contributions
 Fisher+0.382
 Drinking+0.354
 APPLIC+0.351
 dams+0.345
 Toll+0.338
Neg logit contributions
eta-0.627
adh-0.593
�醒-0.526
chin-0.522
arms-0.516
Token.
Token ID13
Feature activation-5.65e-01

Pos logit contributions
eta+0.355
adh+0.335
�醒+0.295
chin+0.291
arms+0.288
Neg logit contributions
 Fisher-0.254
 Drinking-0.236
 APPLIC-0.235
 dams-0.232
 Toll-0.227
Token This
Token ID770
Feature activation-2.75e-01

Pos logit contributions
 Fisher+0.368
 Drinking+0.341
 APPLIC+0.340
 dams+0.336
 DUI+0.329
Neg logit contributions
eta-0.509
adh-0.479
�醒-0.422
chin-0.418
arms-0.412
Token can
Token ID460
Feature activation+2.16e-01

Pos logit contributions
 Fisher+0.159
 Drinking+0.148
 APPLIC+0.147
 dams+0.144
 Tile+0.141
Neg logit contributions
eta-0.267
adh-0.253
�醒-0.224
chin-0.222
mast-0.220
Token said
Token ID531
Feature activation-1.86e+00

Pos logit contributions
 Fisher+0.535
 Drinking+0.496
 APPLIC+0.495
 dams+0.486
 Tile+0.482
Neg logit contributions
eta-0.742
adh-0.703
�醒-0.625
chin-0.610
arms-0.605
Token Trump
Token ID1301
Feature activation-2.02e+00

Pos logit contributions
 Fisher+1.159
 Drinking+1.065
 APPLIC+1.059
 dams+1.047
 Tile+1.030
Neg logit contributions
eta-1.738
adh-1.647
�醒-1.475
chin-1.438
arms-1.429
Token could
Token ID714
Feature activation-5.17e-01

Pos logit contributions
 Fisher+1.232
 Drinking+1.130
 APPLIC+1.127
 dams+1.112
 Tile+1.094
Neg logit contributions
eta-1.909
adh-1.808
�醒-1.626
chin-1.584
arms-1.573
Token learn
Token ID2193
Feature activation+4.78e-01

Pos logit contributions
 Fisher+0.335
 Drinking+0.313
 APPLIC+0.311
 dams+0.303
 Tile+0.302
Neg logit contributions
eta-0.468
adh-0.444
�醒-0.389
chin-0.386
mast-0.382
Token several
Token ID1811
Feature activation-9.24e-01

Pos logit contributions
eta+0.434
adh+0.412
�醒+0.363
chin+0.358
arms+0.356
Neg logit contributions
 Fisher-0.304
 Drinking-0.285
 APPLIC-0.285
 dams-0.277
 Tile-0.275
Token lessons
Token ID11658
Feature activation-5.15e+00

Pos logit contributions
 Fisher+0.549
 Drinking+0.508
 APPLIC+0.507
 dams+0.495
 Tile+0.491
Neg logit contributions
eta-0.885
adh-0.841
�醒-0.750
chin-0.736
arms-0.730
Token from
Token ID422
Feature activation-1.71e+00

Pos logit contributions
 Fisher+3.148
 APPLIC+2.857
 Drinking+2.842
 dams+2.808
 Toll+2.768
Neg logit contributions
eta-4.867
adh-4.605
�醒-4.231
arms-4.035
chin-4.035
Token former
Token ID1966
Feature activation-1.08e+00

Pos logit contributions
 Fisher+1.016
 Drinking+0.932
 APPLIC+0.928
 dams+0.912
 Tile+0.902
Neg logit contributions
eta-1.639
adh-1.556
�醒-1.396
chin-1.366
arms-1.354
Token President
Token ID1992
Feature activation+7.03e-01

Pos logit contributions
 Fisher+0.619
 Drinking+0.570
 APPLIC+0.570
 dams+0.556
 Tile+0.551
Neg logit contributions
eta-1.049
adh-0.998
�醒-0.893
chin-0.877
arms-0.869
Token Harry
Token ID5850
Feature activation+1.67e+00

Pos logit contributions
eta+0.580
adh+0.551
�醒+0.473
chin+0.466
mast+0.466
Neg logit contributions
 Fisher-0.501
 APPLIC-0.478
 Drinking-0.474
 dams-0.467
 Tile-0.462
Token Truman
Token ID33027
Feature activation-1.51e+00

Pos logit contributions
eta+1.474
adh+1.391
�醒+1.209
chin+1.203
arms+1.202
Neg logit contributions
 Fisher-1.104
 APPLIC-1.056
 Drinking-1.045
 dams-1.019
 Tile-1.011
TokenUAL
Token ID25620
Feature activation-2.44e+00

Pos logit contributions
 Fisher+1.320
 Drinking+1.237
 APPLIC+1.233
 dams+1.217
 Tile+1.206
Neg logit contributions
eta-1.289
adh-1.209
�醒-1.066
chin-1.026
mast-1.014
TokenA
Token ID32
Feature activation-1.84e+00

Pos logit contributions
 Fisher+1.535
 Drinking+1.424
 APPLIC+1.419
 dams+1.389
 Tile+1.375
Neg logit contributions
eta-2.248
adh-2.131
�醒-1.905
chin-1.864
arms-1.842
Token L
Token ID406
Feature activation-7.58e-01

Pos logit contributions
 Fisher+1.180
 Drinking+1.093
 APPLIC+1.093
 dams+1.070
 Tile+1.061
Neg logit contributions
eta-1.678
adh-1.590
�醒-1.413
chin-1.384
arms-1.371
TokenUMP
Token ID20476
Feature activation-1.28e+00

Pos logit contributions
 Fisher+0.506
 APPLIC+0.469
 Drinking+0.469
 dams+0.458
 Tile+0.455
Neg logit contributions
eta-0.671
adh-0.636
�醒-0.567
chin-0.552
arms-0.548
TokenUR
Token ID4261
Feature activation-1.33e+00

Pos logit contributions
 Fisher+0.709
 Drinking+0.649
 APPLIC+0.646
 dams+0.636
 Tile+0.626
Neg logit contributions
eta-1.285
adh-1.223
�醒-1.094
chin-1.076
mast-1.064
Token --
Token ID1377
Feature activation-5.09e+00

Pos logit contributions
 Fisher+0.876
 Drinking+0.814
 APPLIC+0.813
 dams+0.796
 Tile+0.788
Neg logit contributions
eta-1.183
adh-1.119
�醒-0.994
chin-0.971
arms-0.962
Token With
Token ID2080
Feature activation-1.77e+00

Pos logit contributions
 Fisher+3.211
 Drinking+2.963
 APPLIC+2.926
 Tile+2.839
 Toll+2.838
Neg logit contributions
eta-4.735
adh-4.487
�醒-4.057
chin-3.922
arms-3.901
Token barely
Token ID8523
Feature activation+5.68e-01

Pos logit contributions
 Fisher+1.053
 Drinking+0.968
 APPLIC+0.966
 dams+0.948
 Tile+0.937
Neg logit contributions
eta-1.700
adh-1.614
�醒-1.448
chin-1.416
arms-1.403
Token a
Token ID257
Feature activation-1.82e-01

Pos logit contributions
eta+0.506
adh+0.478
�醒+0.419
chin+0.415
arms+0.410
Neg logit contributions
 Fisher-0.373
 Drinking-0.349
 APPLIC-0.348
 dams-0.340
 Tile-0.339
Token month
Token ID1227
Feature activation+1.90e-01

Pos logit contributions
 Fisher+0.120
 APPLIC+0.112
 Drinking+0.112
 dams+0.110
 Tile+0.109
Neg logit contributions
eta-0.161
adh-0.152
�醒-0.134
chin-0.132
 bombshell-0.130
Token leading
Token ID3756
Feature activation+1.49e-01

Pos logit contributions
eta+0.165
adh+0.156
�醒+0.137
chin+0.135
arms+0.133
Neg logit contributions
 Fisher-0.131
 Drinking-0.122
 APPLIC-0.122
 dams-0.119
 Tile-0.118
TokenV
Token ID53
Feature activation-3.07e+00

Pos logit contributions
 Fisher+0.402
 Drinking+0.374
 APPLIC+0.373
 dams+0.366
 Tile+0.363
Neg logit contributions
eta-0.538
adh-0.509
�醒-0.449
chin-0.441
arms-0.436
TokenANA
Token ID31574
Feature activation-1.38e+00

Pos logit contributions
 Fisher+2.184
 APPLIC+2.055
 Drinking+2.052
 dams+2.000
 Tile+1.993
Neg logit contributions
eta-2.567
adh-2.418
�醒-2.150
chin-2.065
mast-2.064
Token (
Token ID357
Feature activation-3.40e+00

Pos logit contributions
 Fisher+0.935
 Drinking+0.870
 APPLIC+0.869
 dams+0.853
 Tile+0.844
Neg logit contributions
eta-1.203
adh-1.137
�醒-1.008
chin-0.983
arms-0.975
TokenReuters
Token ID12637
Feature activation-1.89e+00

Pos logit contributions
 Fisher+2.338
 APPLIC+2.179
 Drinking+2.172
 dams+2.121
 Toll+2.111
Neg logit contributions
eta-2.916
adh-2.760
�醒-2.457
chin-2.373
mast-2.346
Token)
Token ID8
Feature activation-3.29e+00

Pos logit contributions
 Fisher+0.966
 Drinking+0.881
 APPLIC+0.878
 dams+0.876
 Tile+0.857
Neg logit contributions
eta-1.952
adh-1.854
chin-1.650
arms-1.635
mast-1.634
Token -
Token ID532
Feature activation-5.07e+00

Pos logit contributions
 Fisher+1.956
 Drinking+1.801
 APPLIC+1.797
 dams+1.766
 Tile+1.746
Neg logit contributions
eta-3.153
adh-2.992
�醒-2.669
chin-2.625
arms-2.602
Token Latin
Token ID9133
Feature activation-1.47e+00

Pos logit contributions
 Fisher+3.110
 Drinking+2.909
 APPLIC+2.829
 DUI+2.765
 Toll+2.752
Neg logit contributions
eta-4.768
adh-4.526
�醒-4.150
chin-3.982
robe-3.963
Token American
Token ID1605
Feature activation+1.80e-01

Pos logit contributions
 Fisher+0.961
 Drinking+0.892
 APPLIC+0.887
 dams+0.875
 Tile+0.865
Neg logit contributions
eta-1.329
adh-1.260
�醒-1.121
chin-1.093
mast-1.083
Token and
Token ID290
Feature activation-7.62e-03

Pos logit contributions
eta+0.163
adh+0.154
�醒+0.135
chin+0.134
arms+0.133
Neg logit contributions
 Fisher-0.115
 APPLIC-0.108
 Drinking-0.107
 dams-0.105
 Tile-0.104
Token Caribbean
Token ID18020
Feature activation-3.27e-01

Pos logit contributions
 Fisher+0.005
 APPLIC+0.004
 Drinking+0.004
 dams+0.004
 Tile+0.004
Neg logit contributions
eta-0.007
adh-0.007
�醒-0.006
chin-0.006
arms-0.006
Token leaders
Token ID2766
Feature activation-4.54e-01

Pos logit contributions
 Fisher+0.200
 APPLIC+0.185
 Drinking+0.185
 dams+0.180
 Tile+0.179
Neg logit contributions
eta-0.308
adh-0.292
�醒-0.259
chin-0.255
arms-0.253
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
eta+1.473
adh+1.376
chin+1.187
arms+1.180
mast+1.177
Neg logit contributions
 Fisher-1.235
 Drinking-1.166
 dams-1.157
 APPLIC-1.152
 Tile-1.135
TokenC
Token ID34
Feature activation+2.02e-01

Pos logit contributions
eta+1.169
adh+1.102
�醒+0.952
chin+0.941
mast+0.934
Neg logit contributions
 Fisher-1.031
 Drinking-0.974
 APPLIC-0.967
 dams-0.954
 Tile-0.948
TokenOOK
Token ID15308
Feature activation-1.43e+00

Pos logit contributions
eta+0.165
adh+0.155
�醒+0.135
chin+0.132
arms+0.131
Neg logit contributions
 Fisher-0.149
 Drinking-0.140
 APPLIC-0.140
 dams-0.137
 Tile-0.136
Token COUNTY
Token ID41012
Feature activation-2.23e+00

Pos logit contributions
 Fisher+0.840
 Drinking+0.774
 APPLIC+0.770
 dams+0.756
 Tile+0.748
Neg logit contributions
eta-1.385
adh-1.315
�醒-1.177
chin-1.154
arms-1.145
Token CR
Token ID8740
Feature activation-4.56e-01

Pos logit contributions
 Fisher+1.411
 APPLIC+1.306
 Drinking+1.304
 dams+1.272
 Tile+1.261
Neg logit contributions
eta-2.060
adh-1.955
�醒-1.745
chin-1.703
mast-1.685
TokenIM
Token ID3955
Feature activation-5.05e+00

Pos logit contributions
 Fisher+0.301
 Drinking+0.280
 APPLIC+0.279
 dams+0.274
 Tile+0.272
Neg logit contributions
eta-0.408
adh-0.386
�醒-0.341
chin-0.334
arms-0.332
TokenINAL
Token ID17961
Feature activation-2.85e+00

Pos logit contributions
 Fisher+2.963
 APPLIC+2.758
 Drinking+2.714
 dams+2.640
 Tile+2.631
Neg logit contributions
eta-4.854
adh-4.671
�醒-4.191
chin-4.090
mast-4.051
Token COUR
Token ID37384
Feature activation-1.34e+00

Pos logit contributions
 Fisher+1.578
 APPLIC+1.448
 Drinking+1.441
 dams+1.399
 DUI+1.386
Neg logit contributions
eta-2.851
adh-2.718
�醒-2.452
chin-2.403
mast-2.375
TokenTH
Token ID4221
Feature activation-1.43e+00

Pos logit contributions
 Fisher+0.926
 Drinking+0.864
 APPLIC+0.863
 dams+0.846
 Tile+0.840
Neg logit contributions
eta-1.158
adh-1.093
�醒-0.966
chin-0.941
mast-0.933
TokenOU
Token ID2606
Feature activation-3.56e+00

Pos logit contributions
 Fisher+0.893
 Drinking+0.826
 APPLIC+0.824
 dams+0.809
 Tile+0.801
Neg logit contributions
eta-1.325
adh-1.256
�醒-1.119
chin-1.096
arms-1.086
TokenSE
Token ID5188
Feature activation-2.32e+00

Pos logit contributions
 Fisher+2.643
 Drinking+2.476
 APPLIC+2.473
 dams+2.431
 Tile+2.413
Neg logit contributions
eta-2.889
adh-2.718
�醒-2.376
chin-2.317
arms-2.293
TokenSAN
Token ID36753
Feature activation-1.49e+00

Pos logit contributions
 Fisher+0.422
 Drinking+0.392
 APPLIC+0.391
 dams+0.383
 Tile+0.380
Neg logit contributions
eta-0.572
adh-0.541
�醒-0.481
chin-0.469
arms-0.464
Token FR
Token ID8782
Feature activation-1.27e+00

Pos logit contributions
 Fisher+0.803
 APPLIC+0.734
 Drinking+0.732
 dams+0.711
 Tile+0.705
Neg logit contributions
eta-1.513
adh-1.442
�醒-1.305
chin-1.273
arms-1.263
TokenANC
Token ID20940
Feature activation-2.24e+00

Pos logit contributions
 Fisher+1.131
 Drinking+1.070
 APPLIC+1.067
 dams+1.057
 Tile+1.048
Neg logit contributions
eta-0.840
adh-0.780
�醒-0.652
chin-0.634
arms-0.629
TokenIS
Token ID1797
Feature activation-4.31e+00

Pos logit contributions
 Fisher+0.916
 Drinking+0.819
 APPLIC+0.805
 dams+0.804
 Tile+0.778
Neg logit contributions
eta-2.560
adh-2.450
�醒-2.194
chin-2.179
arms-2.163
TokenCO
Token ID8220
Feature activation-2.07e+00

Pos logit contributions
 Fisher+1.613
 APPLIC+1.411
 Drinking+1.405
 dams+1.342
 Tile+1.330
Neg logit contributions
eta-5.083
adh-4.883
�醒-4.484
chin-4.389
arms-4.373
Token -
Token ID532
Feature activation-4.98e+00

Pos logit contributions
 Fisher+1.494
 Drinking+1.395
 APPLIC+1.395
 dams+1.364
 Tile+1.355
Neg logit contributions
eta-1.726
adh-1.627
�醒-1.434
chin-1.392
arms-1.380
Token Reddit
Token ID10750
Feature activation+6.51e-01

Pos logit contributions
 Fisher+3.257
 Drinking+3.026
 APPLIC+2.998
 Tile+2.896
 DUI+2.891
Neg logit contributions
eta-4.512
adh-4.279
�醒-3.877
chin-3.738
robe-3.703
Token and
Token ID290
Feature activation+4.98e-02

Pos logit contributions
eta+0.618
adh+0.584
�醒+0.519
chin+0.512
arms+0.507
Neg logit contributions
 Fisher-0.391
 APPLIC-0.362
 Drinking-0.361
 dams-0.356
 Tile-0.351
Token Google
Token ID3012
Feature activation+5.04e-01

Pos logit contributions
eta+0.048
adh+0.045
�醒+0.040
chin+0.040
mast+0.039
Neg logit contributions
 Fisher-0.030
 Drinking-0.027
 APPLIC-0.027
 dams-0.027
 Tile-0.026
Token are
Token ID389
Feature activation-1.79e-01

Pos logit contributions
eta+0.456
adh+0.431
�醒+0.377
chin+0.375
arms+0.370
Neg logit contributions
 Fisher-0.325
 Drinking-0.303
 APPLIC-0.302
 dams-0.296
 Tile-0.292
Token taking
Token ID2263
Feature activation+7.81e-01

Pos logit contributions
 Fisher+0.110
 APPLIC+0.103
 Drinking+0.102
 Tile+0.100
 dams+0.099
Neg logit contributions
eta-0.167
adh-0.157
chin-0.138
 bombshell-0.137
�醒-0.137
Token.
Token ID13
Feature activation-7.09e-01

Pos logit contributions
eta+0.479
adh+0.453
�醒+0.397
chin+0.392
arms+0.390
Neg logit contributions
 Fisher-0.353
 Drinking-0.328
 APPLIC-0.327
 Tile-0.322
 dams-0.320
Token<|endoftext|>
Token ID50256
Feature activation-6.96e-01

Pos logit contributions
 Fisher+0.461
 Drinking+0.429
 APPLIC+0.427
 dams+0.422
 Tile+0.417
Neg logit contributions
eta-0.639
adh-0.605
�醒-0.534
chin-0.525
arms-0.518
TokenED
Token ID1961
Feature activation+1.83e-02

Pos logit contributions
 Fisher+0.457
 Drinking+0.425
 APPLIC+0.425
 dams+0.415
 Tile+0.412
Neg logit contributions
eta-0.622
adh-0.589
�醒-0.524
chin-0.510
arms-0.505
TokenMON
Token ID27857
Feature activation-1.62e+00

Pos logit contributions
eta+0.014
adh+0.013
�醒+0.012
chin+0.011
arms+0.011
Neg logit contributions
 Fisher-0.014
 Drinking-0.013
 APPLIC-0.013
 dams-0.013
 Tile-0.013
TokenTON
Token ID11357
Feature activation-1.31e+00

Pos logit contributions
 Fisher+0.454
 APPLIC+0.379
 Drinking+0.378
 dams+0.360
 Tile+0.351
Neg logit contributions
eta-2.055
adh-1.974
�醒-1.819
chin-1.794
arms-1.783
Token
Token ID784
Feature activation-4.95e+00

Pos logit contributions
 Fisher+0.934
 Drinking+0.873
 APPLIC+0.872
 dams+0.857
 Tile+0.850
Neg logit contributions
eta-1.100
adh-1.037
�醒-0.911
chin-0.890
arms-0.881
Token Omar
Token ID24980
Feature activation-9.91e-01

Pos logit contributions
 Fisher+3.220
 Drinking+2.997
 APPLIC+2.950
 Toll+2.874
 DUI+2.865
Neg logit contributions
eta-4.526
adh-4.281
�醒-3.872
chin-3.738
robe-3.689
Token Kh
Token ID5311
Feature activation+8.20e-01

Pos logit contributions
 Fisher+0.629
 Drinking+0.582
 APPLIC+0.581
 dams+0.570
 Tile+0.565
Neg logit contributions
eta-0.912
adh-0.864
�醒-0.768
chin-0.752
arms-0.745
Tokenadr
Token ID41909
Feature activation+1.36e+00

Pos logit contributions
eta+0.746
adh+0.704
�醒+0.634
chin+0.614
mast+0.611
Neg logit contributions
 Fisher-0.526
 Drinking-0.486
 APPLIC-0.485
 dams-0.476
 Tile-0.472
Token has
Token ID468
Feature activation-1.79e+00

Pos logit contributions
eta+1.278
adh+1.212
�醒+1.061
chin+1.055
 bombshell+1.048
Neg logit contributions
 Fisher-0.830
 Drinking-0.775
 APPLIC-0.772
 Tile-0.754
 dams-0.753
Token agreed
Token ID4987
Feature activation-1.35e+00

Pos logit contributions
 Fisher+1.135
 Drinking+1.047
 APPLIC+1.047
 dams+1.029
 Tile+1.016
Neg logit contributions
eta-1.648
adh-1.562
�醒-1.396
chin-1.362
arms-1.350
Tokenpal
Token ID18596
Feature activation-6.27e-01

Pos logit contributions
 Fisher+0.147
 Drinking+0.138
 APPLIC+0.138
 dams+0.136
 Tile+0.135
Neg logit contributions
eta-0.156
adh-0.147
�醒-0.128
chin-0.125
arms-0.123
Tokenek
Token ID988
Feature activation-4.63e-01

Pos logit contributions
 Fisher+0.456
 Drinking+0.427
 APPLIC+0.425
 dams+0.419
 Tile+0.416
Neg logit contributions
eta-0.516
adh-0.486
�醒-0.428
chin-0.416
arms-0.412
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
 Fisher+0.309
 Drinking+0.287
 APPLIC+0.286
 dams+0.281
 Tile+0.279
Neg logit contributions
eta-0.409
adh-0.387
�醒-0.344
chin-0.335
arms-0.333
Token<|endoftext|>
Token ID50256
Feature activation+5.37e-02

Pos logit contributions
eta+0.181
adh+0.171
�醒+0.152
chin+0.149
mast+0.148
Neg logit contributions
 Fisher-0.132
 Drinking-0.123
 APPLIC-0.122
 dams-0.119
 Toll-0.119
TokenWASHINGTON
Token ID21793
Feature activation-3.99e+00

Pos logit contributions
eta+0.048
adh+0.046
�醒+0.040
chin+0.040
arms+0.039
Neg logit contributions
 Fisher-0.035
 Drinking-0.033
 APPLIC-0.033
 dams-0.032
 Tile-0.032
Token
Token ID851
Feature activation-4.94e+00

Pos logit contributions
 Fisher+2.881
 APPLIC+2.705
 Drinking+2.697
 Toll+2.621
 dams+2.621
Neg logit contributions
eta-3.324
adh-3.115
�醒-2.791
chin-2.676
arms-2.667
Token A
Token ID317
Feature activation-2.13e+00

Pos logit contributions
 Fisher+3.249
 Drinking+3.024
 APPLIC+2.968
 Toll+2.883
 DUI+2.877
Neg logit contributions
eta-4.461
adh-4.205
�醒-3.810
chin-3.679
arms-3.641
Token Cy
Token ID5934
Feature activation+2.83e-01

Pos logit contributions
 Fisher+1.395
 Drinking+1.291
 APPLIC+1.281
 dams+1.258
 DUI+1.251
Neg logit contributions
eta-1.918
adh-1.817
�醒-1.621
chin-1.569
arms-1.568
Tokengn
Token ID4593
Feature activation-1.79e+00

Pos logit contributions
eta+0.234
adh+0.220
�醒+0.191
chin+0.189
arms+0.187
Neg logit contributions
 Fisher-0.205
 APPLIC-0.193
 Drinking-0.193
 dams-0.189
 Tile-0.188
Tokenus
Token ID385
Feature activation+8.84e-01

Pos logit contributions
 Fisher+1.357
 Drinking+1.272
 APPLIC+1.272
 dams+1.252
 Tile+1.241
Neg logit contributions
eta-1.415
adh-1.331
�醒-1.166
chin-1.135
arms-1.120
Token cargo
Token ID15892
Feature activation-3.28e-01

Pos logit contributions
eta+0.837
adh+0.790
chin+0.695
�醒+0.695
 bombshell+0.688
Neg logit contributions
 Fisher-0.531
 Drinking-0.495
 APPLIC-0.494
 dams-0.482
 Tile-0.477
Token
Token ID198
Feature activation+6.69e-01

Pos logit contributions
eta+1.272
adh+1.186
�醒+1.036
chin+1.031
mast+1.024
Neg logit contributions
 Fisher-0.974
 dams-0.931
 APPLIC-0.923
stanbul-0.906
 Drinking-0.896
Token
Token ID198
Feature activation-5.85e-02

Pos logit contributions
eta+0.576
adh+0.540
chin+0.468
mast+0.464
arms+0.463
Neg logit contributions
 Fisher-0.462
 Drinking-0.434
 APPLIC-0.430
 dams-0.427
 Tile-0.421
TokenIGN
Token ID16284
Feature activation+2.31e+00

Pos logit contributions
 Fisher+0.029
 Drinking+0.026
 APPLIC+0.026
 dams+0.026
 Tile+0.025
Neg logit contributions
eta-0.062
adh-0.059
�醒-0.053
chin-0.052
arms-0.052
Token:
Token ID25
Feature activation-1.96e-01

Pos logit contributions
eta+1.910
adh+1.779
mast+1.527
chin+1.524
arms+1.517
Neg logit contributions
 Fisher-1.649
 Drinking-1.564
 dams-1.551
 APPLIC-1.550
 Tile-1.521
Token Got
Token ID11853
Feature activation-9.31e-01

Pos logit contributions
 Fisher+0.118
 APPLIC+0.110
 dams+0.110
 Drinking+0.108
stanbul+0.106
Neg logit contributions
eta-0.183
adh-0.172
�醒-0.152
chin-0.151
arms-0.150
Tokencha
Token ID11693
Feature activation-4.90e+00

Pos logit contributions
 Fisher+0.640
 Drinking+0.598
 APPLIC+0.598
 dams+0.586
 Tile+0.581
Neg logit contributions
eta-0.805
adh-0.760
�醒-0.669
chin-0.656
arms-0.650
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 Fisher+3.227
 Drinking+2.970
 APPLIC+2.937
 dams+2.903
 Tile+2.897
Neg logit contributions
eta-4.386
adh-4.147
�醒-3.723
chin-3.580
mast-3.566
Token Just
Token ID2329
Feature activation-9.93e-01

Pos logit contributions
 Fisher+0.807
 APPLIC+0.748
 Drinking+0.744
 dams+0.741
 Tile+0.723
Neg logit contributions
eta-1.203
adh-1.140
�醒-1.009
chin-0.997
arms-0.986
Token a
Token ID257
Feature activation-1.56e+00

Pos logit contributions
 Fisher+0.613
 APPLIC+0.569
 Drinking+0.569
 dams+0.556
 Tile+0.551
Neg logit contributions
eta-0.924
adh-0.877
�醒-0.774
chin-0.766
arms-0.759
Token slight
Token ID3731
Feature activation-3.20e-01

Pos logit contributions
 Fisher+0.851
 Drinking+0.778
 APPLIC+0.778
 dams+0.759
 Tile+0.751
Neg logit contributions
eta-1.570
adh-1.496
�醒-1.345
chin-1.321
arms-1.310
Token side
Token ID1735
Feature activation-6.75e-01

Pos logit contributions
 Fisher+0.210
 APPLIC+0.197
 Drinking+0.196
 dams+0.192
 Tile+0.190
Neg logit contributions
eta-0.286
adh-0.270
�醒-0.238
chin-0.235
arms-0.233
TokenUSS
Token ID32835
Feature activation-5.44e+00

Pos logit contributions
 Fisher+0.614
 Drinking+0.569
 APPLIC+0.568
 dams+0.558
 Tile+0.552
Neg logit contributions
eta-0.880
adh-0.833
�醒-0.740
chin-0.725
arms-0.718
TokenELS
Token ID37142
Feature activation-1.20e+00

Pos logit contributions
 Fisher+2.850
 APPLIC+2.568
 Drinking+2.554
 Tile+2.499
 dams+2.479
Neg logit contributions
eta-5.526
adh-5.324
�醒-4.852
chin-4.817
mast-4.705
Token (
Token ID357
Feature activation-2.99e+00

Pos logit contributions
 Fisher+0.834
 Drinking+0.778
 APPLIC+0.777
 dams+0.761
 Tile+0.755
Neg logit contributions
eta-1.033
adh-0.974
�醒-0.863
chin-0.840
arms-0.832
TokenReuters
Token ID12637
Feature activation-1.98e+00

Pos logit contributions
 Fisher+2.065
 APPLIC+1.923
 Drinking+1.918
 dams+1.869
 Tile+1.864
Neg logit contributions
eta-2.564
adh-2.422
�醒-2.157
chin-2.085
mast-2.060
Token)
Token ID8
Feature activation-3.15e+00

Pos logit contributions
 Fisher+1.009
 Drinking+0.921
 APPLIC+0.919
 dams+0.917
 Tile+0.895
Neg logit contributions
eta-2.045
adh-1.942
chin-1.730
arms-1.712
mast-1.712
Token -
Token ID532
Feature activation-4.90e+00

Pos logit contributions
 Fisher+1.878
 Drinking+1.730
 APPLIC+1.726
 dams+1.699
 Tile+1.678
Neg logit contributions
eta-3.005
adh-2.850
�醒-2.538
chin-2.501
arms-2.478
Token The
Token ID383
Feature activation-9.29e-01

Pos logit contributions
 Fisher+2.978
 Drinking+2.755
 APPLIC+2.707
 Tile+2.637
 Toll+2.630
Neg logit contributions
eta-4.667
adh-4.426
�醒-4.038
chin-3.913
mast-3.853
Token world
Token ID995
Feature activation-3.53e-01

Pos logit contributions
 Fisher+0.484
 Drinking+0.441
 APPLIC+0.439
 dams+0.429
 Tile+0.424
Neg logit contributions
eta-0.959
adh-0.915
�醒-0.826
chin-0.810
arms-0.803
Token
Token ID447
Feature activation-2.07e-01

Pos logit contributions
 Fisher+0.230
 Drinking+0.214
 APPLIC+0.213
 dams+0.209
 Tile+0.207
Neg logit contributions
eta-0.318
adh-0.301
�醒-0.267
chin-0.261
arms-0.259
Token
Token ID247
Feature activation-2.22e-02

Pos logit contributions
 Fisher+0.162
 Drinking+0.152
 APPLIC+0.152
 dams+0.150
 Tile+0.148
Neg logit contributions
eta-0.160
adh-0.150
�醒-0.130
chin-0.127
arms-0.125
Tokens
Token ID82
Feature activation+1.45e+00

Pos logit contributions
 Fisher+0.015
 Drinking+0.014
 APPLIC+0.014
 dams+0.014
 Tile+0.014
Neg logit contributions
eta-0.019
adh-0.018
�醒-0.016
chin-0.016
arms-0.015
Token BRE
Token ID29377
Feature activation+9.15e-01

Pos logit contributions
 Fisher+1.345
 Drinking+1.238
 APPLIC+1.229
 dams+1.211
 Tile+1.197
Neg logit contributions
eta-2.267
adh-2.154
�醒-1.929
chin-1.892
arms-1.878
TokenATH
Token ID12599
Feature activation-1.63e+00

Pos logit contributions
eta+0.779
adh+0.735
�醒+0.635
chin+0.634
arms+0.633
Neg logit contributions
 Fisher-0.639
 dams-0.595
 Drinking-0.593
 APPLIC-0.588
 Tile-0.583
TokenING
Token ID2751
Feature activation-8.95e-02

Pos logit contributions
 Fisher+0.952
 Drinking+0.876
 APPLIC+0.873
 dams+0.858
 Tile+0.848
Neg logit contributions
eta-1.573
adh-1.493
�醒-1.338
chin-1.311
arms-1.302
Token AND
Token ID5357
Feature activation-2.15e+00

Pos logit contributions
 Fisher+0.054
 Drinking+0.050
 dams+0.050
 APPLIC+0.049
 Tile+0.049
Neg logit contributions
eta-0.084
adh-0.079
�醒-0.070
chin-0.069
arms-0.069
Token AC
Token ID7125
Feature activation-1.18e+00

Pos logit contributions
 Fisher+1.285
 Drinking+1.184
 APPLIC+1.181
 dams+1.158
 Tile+1.146
Neg logit contributions
eta-2.060
adh-1.956
�醒-1.749
chin-1.714
arms-1.700
TokenOU
Token ID2606
Feature activation-4.89e+00

Pos logit contributions
 Fisher+0.657
 Drinking+0.602
 APPLIC+0.600
 dams+0.590
 Tile+0.583
Neg logit contributions
eta-1.181
adh-1.122
�醒-1.006
chin-0.991
arms-0.983
TokenST
Token ID2257
Feature activation-3.79e+00

Pos logit contributions
 Fisher+2.866
 Drinking+2.656
 APPLIC+2.636
 Tile+2.523
 DUI+2.519
Neg logit contributions
eta-4.738
adh-4.523
�醒-4.087
chin-3.964
 bombshell-3.936
TokenIC
Token ID2149
Feature activation+5.42e-01

Pos logit contributions
 Fisher+2.193
 Drinking+2.013
 APPLIC+2.012
 dams+1.948
 Tile+1.942
Neg logit contributions
eta-3.701
adh-3.534
�醒-3.164
chin-3.102
mast-3.072
Token RES
Token ID15731
Feature activation-1.50e+00

Pos logit contributions
eta+0.523
adh+0.493
�醒+0.443
chin+0.436
arms+0.434
Neg logit contributions
 Fisher-0.308
 Drinking-0.285
 dams-0.283
 Tile-0.277
 APPLIC-0.276
TokenON
Token ID1340
Feature activation-2.58e+00

Pos logit contributions
 Fisher+0.951
 Drinking+0.881
 APPLIC+0.877
 dams+0.864
 Tile+0.855
Neg logit contributions
eta-1.377
adh-1.300
�醒-1.154
chin-1.135
arms-1.125
TokenANCE
Token ID19240
Feature activation-1.12e+00

Pos logit contributions
 Fisher+1.747
 Drinking+1.626
 APPLIC+1.625
 dams+1.593
 Tile+1.581
Neg logit contributions
eta-2.256
adh-2.133
�醒-1.885
chin-1.842
arms-1.825
Token unpopular
Token ID24166
Feature activation+2.87e+00

Pos logit contributions
eta+0.964
adh+0.911
�醒+0.804
chin+0.789
arms+0.779
Neg logit contributions
 Fisher-0.718
 Drinking-0.670
 APPLIC-0.668
 dams-0.654
 Tile-0.651
Token as
Token ID355
Feature activation+2.92e+00

Pos logit contributions
eta+2.582
adh+2.444
�醒+2.167
chin+2.122
arms+2.101
Neg logit contributions
 Fisher-1.870
 Drinking-1.738
 APPLIC-1.734
 dams-1.701
 Tile-1.687
Token ever
Token ID1683
Feature activation+1.63e+00

Pos logit contributions
eta+2.286
adh+2.143
�醒+1.827
chin+1.816
 bombshell+1.803
Neg logit contributions
 Fisher-2.198
 APPLIC-2.105
 Drinking-2.081
 Tile-2.051
 dams-2.033
Token with
Token ID351
Feature activation+2.30e+00

Pos logit contributions
eta+1.435
adh+1.355
�醒+1.194
chin+1.173
arms+1.160
Neg logit contributions
 Fisher-1.098
 Drinking-1.024
 APPLIC-1.022
 dams-1.003
 Tile-0.997
Token the
Token ID262
Feature activation+1.63e+00

Pos logit contributions
eta+2.072
adh+1.943
�醒+1.709
chin+1.698
 bombshell+1.677
Neg logit contributions
 Fisher-1.474
 Drinking-1.395
 APPLIC-1.390
 dams-1.357
 Tile-1.344
Token public
Token ID1171
Feature activation+3.17e+00

Pos logit contributions
eta+1.485
adh+1.396
�醒+1.235
chin+1.219
arms+1.207
Neg logit contributions
 Fisher-1.041
 APPLIC-0.976
 Drinking-0.975
 dams-0.953
 Tile-0.944
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
eta+2.752
adh+2.594
�醒+2.264
chin+2.239
arms+2.221
Neg logit contributions
 Fisher-2.159
 APPLIC-2.014
 Drinking-2.012
 dams-1.970
 Tile-1.968
Token Only
Token ID5514
Feature activation+1.60e+00

Pos logit contributions
eta+1.552
adh+1.459
�醒+1.286
chin+1.271
arms+1.249
Neg logit contributions
 Fisher-1.143
 dams-1.071
 APPLIC-1.069
 Drinking-1.062
 Tile-1.044
Token 24
Token ID1987
Feature activation+1.72e+00

Pos logit contributions
eta+1.380
adh+1.300
�醒+1.135
chin+1.127
arms+1.110
Neg logit contributions
 Fisher-1.076
 Drinking-1.016
 APPLIC-1.011
 dams-0.993
 Tile-0.987
Token percent
Token ID1411
Feature activation+2.65e+00

Pos logit contributions
eta+1.347
adh+1.256
�醒+1.093
chin+1.071
arms+1.059
Neg logit contributions
 Fisher-1.318
 Drinking-1.246
 APPLIC-1.236
 dams-1.215
 Tile-1.214
Token of
Token ID286
Feature activation+3.06e-01

Pos logit contributions
eta+2.430
adh+2.309
�醒+2.036
chin+2.036
mast+1.991
Neg logit contributions
 Fisher-1.652
 Drinking-1.541
 APPLIC-1.529
 dams-1.498
 Tile-1.483
Token [
Token ID685
Feature activation+1.38e+00

Pos logit contributions
 Fisher+0.014
 Drinking+0.013
 APPLIC+0.013
 dams+0.013
 Tile+0.013
Neg logit contributions
eta-0.022
adh-0.020
�醒-0.018
chin-0.018
mast-0.018
Token10
Token ID940
Feature activation+3.27e+00

Pos logit contributions
eta+1.163
adh+1.091
chin+0.944
arms+0.938
�醒+0.937
Neg logit contributions
 Fisher-0.978
 Drinking-0.919
 APPLIC-0.915
 dams-0.899
 Tile-0.892
Token:
Token ID25
Feature activation+1.90e+00

Pos logit contributions
eta+2.803
adh+2.652
�醒+2.302
chin+2.272
arms+2.269
Neg logit contributions
 Fisher-2.261
 Drinking-2.123
 APPLIC-2.107
 dams-2.066
 Tile-2.048
Token47
Token ID2857
Feature activation+6.13e-01

Pos logit contributions
eta+1.576
adh+1.473
chin+1.271
arms+1.256
mast+1.254
Neg logit contributions
 Fisher-1.371
 Drinking-1.289
 dams-1.287
 APPLIC-1.271
 Tile-1.256
Token]
Token ID60
Feature activation+3.66e+00

Pos logit contributions
eta+0.514
adh+0.484
�醒+0.426
chin+0.415
arms+0.411
Neg logit contributions
 Fisher-0.440
 Drinking-0.410
 APPLIC-0.410
 dams-0.404
 Tile-0.400
Token <
Token ID1279
Feature activation+3.20e+00

Pos logit contributions
eta+2.668
adh+2.486
�醒+2.121
chin+2.102
arms+2.084
Neg logit contributions
 Fisher-2.888
 APPLIC-2.770
 dams-2.727
 Drinking-2.722
 DUI-2.690
Tokenmet
Token ID4164
Feature activation-3.89e-01

Pos logit contributions
eta+2.771
adh+2.607
chin+2.276
mast+2.249
arms+2.240
Neg logit contributions
 Fisher-2.202
 Drinking-2.070
 APPLIC-2.067
 dams-2.043
 DUI-2.017
Tokenros
Token ID4951
Feature activation-5.41e-01

Pos logit contributions
 Fisher+0.331
 Drinking+0.312
 APPLIC+0.312
 dams+0.307
 Tile+0.305
Neg logit contributions
eta-0.273
adh-0.254
�醒-0.219
chin-0.211
arms-0.208
Tokenxual
Token ID5541
Feature activation+1.70e+00

Pos logit contributions
 Fisher+0.403
 Drinking+0.377
 APPLIC+0.377
 dams+0.370
 Tile+0.367
Neg logit contributions
eta-0.436
adh-0.410
�醒-0.361
chin-0.349
arms-0.345
Tokenth
Token ID400
Feature activation+5.88e-01

Pos logit contributions
eta+1.307
adh+1.221
�醒+1.047
chin+1.033
mast+1.026
Neg logit contributions
 Fisher-1.344
 Drinking-1.266
 APPLIC-1.266
 dams-1.243
 Tile-1.237
Tokenug
Token ID1018
Feature activation+6.52e-01

Pos logit contributions
eta+0.457
adh+0.429
�醒+0.369
chin+0.362
arms+0.359
Neg logit contributions
 Fisher-0.458
 APPLIC-0.431
 Drinking-0.430
 dams-0.423
 Tile-0.420
Token
Token ID198
Feature activation+1.66e+00

Pos logit contributions
 Fisher+0.217
 Drinking+0.203
 APPLIC+0.202
 dams+0.199
 Tile+0.197
Neg logit contributions
eta-0.287
adh-0.272
�醒-0.238
chin-0.235
arms-0.233
Token
Token ID198
Feature activation+3.82e-01

Pos logit contributions
eta+1.435
adh+1.348
chin+1.167
arms+1.161
mast+1.156
Neg logit contributions
 Fisher-1.152
 Drinking-1.081
 APPLIC-1.069
 dams-1.064
 Tile-1.052
TokenThis
Token ID1212
Feature activation+7.61e-01

Pos logit contributions
eta+0.331
adh+0.312
�醒+0.275
chin+0.269
arms+0.267
Neg logit contributions
 Fisher-0.263
 Drinking-0.245
 APPLIC-0.245
 dams-0.240
 Tile-0.238
Token DIY
Token ID25134
Feature activation-4.97e-01

Pos logit contributions
eta+0.735
adh+0.697
�醒+0.619
chin+0.612
mast+0.607
Neg logit contributions
 Fisher-0.442
 Drinking-0.409
 APPLIC-0.405
 dams-0.399
 Tile-0.391
Token is
Token ID318
Feature activation+1.64e+00

Pos logit contributions
 Fisher+0.299
 Drinking+0.277
 APPLIC+0.277
 dams+0.272
 Tile+0.267
Neg logit contributions
eta-0.471
adh-0.447
�醒-0.397
chin-0.391
mast-0.388
Token going
Token ID1016
Feature activation+4.29e+00

Pos logit contributions
eta+1.528
adh+1.442
chin+1.265
�醒+1.259
arms+1.249
Neg logit contributions
 Fisher-1.001
 Drinking-0.933
 APPLIC-0.926
 dams-0.920
 Tile-0.898
Token to
Token ID284
Feature activation+5.73e-01

Pos logit contributions
eta+4.102
adh+3.905
�醒+3.473
chin+3.398
mast+3.373
Neg logit contributions
 Fisher-2.502
 Drinking-2.336
 APPLIC-2.312
 dams-2.282
 Toll-2.235
Token be
Token ID307
Feature activation+1.13e+00

Pos logit contributions
eta+0.535
adh+0.507
�醒+0.449
chin+0.443
mast+0.439
Neg logit contributions
 Fisher-0.353
 Drinking-0.329
 APPLIC-0.327
 dams-0.320
 Tile-0.316
Token focusing
Token ID10759
Feature activation+1.27e+00

Pos logit contributions
eta+1.057
adh+1.001
chin+0.878
�醒+0.873
arms+0.867
Neg logit contributions
 Fisher-0.685
 Drinking-0.638
 APPLIC-0.631
 dams-0.627
 Tile-0.614
Token on
Token ID319
Feature activation-8.91e-01

Pos logit contributions
eta+1.127
adh+1.066
�醒+0.943
chin+0.923
arms+0.914
Neg logit contributions
 Fisher-0.849
 Drinking-0.790
 APPLIC-0.789
 dams-0.774
 Tile-0.767
Token those
Token ID883
Feature activation+7.60e-02

Pos logit contributions
 Fisher+0.519
 Drinking+0.478
 APPLIC+0.478
 dams+0.467
 Tile+0.461
Neg logit contributions
eta-0.861
adh-0.818
�醒-0.728
chin-0.718
mast-0.713
Token optimized
Token ID23392
Feature activation-9.67e-02

Pos logit contributions
 Fisher+0.172
 Drinking+0.159
 APPLIC+0.159
 dams+0.156
 Tile+0.154
Neg logit contributions
eta-0.252
adh-0.239
�醒-0.212
chin-0.209
arms-0.207
Token Store
Token ID9363
Feature activation+6.38e-02

Pos logit contributions
 Fisher+0.059
 Drinking+0.054
 APPLIC+0.054
 dams+0.053
 Tile+0.052
Neg logit contributions
eta-0.091
adh-0.086
�醒-0.077
chin-0.075
arms-0.075
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
eta+0.056
adh+0.053
�醒+0.047
chin+0.046
arms+0.046
Neg logit contributions
 Fisher-0.043
 Drinking-0.040
 APPLIC-0.040
 dams-0.039
 Tile-0.039
Token As
Token ID1081
Feature activation-1.79e-01

Pos logit contributions
eta+1.045
adh+0.986
�醒+0.869
chin+0.855
mast+0.851
Neg logit contributions
 Fisher-0.759
 dams-0.710
 APPLIC-0.708
 Drinking-0.701
stanbul-0.685
Token you
Token ID345
Feature activation-3.52e-01

Pos logit contributions
 Fisher+0.110
 APPLIC+0.102
 Drinking+0.102
 dams+0.100
 Tile+0.099
Neg logit contributions
eta-0.167
adh-0.159
�醒-0.140
chin-0.138
arms-0.137
Token land
Token ID1956
Feature activation+3.03e+00

Pos logit contributions
 Fisher+0.228
 Drinking+0.212
 APPLIC+0.212
 dams+0.207
 Tile+0.206
Neg logit contributions
eta-0.319
adh-0.302
�醒-0.268
chin-0.263
arms-0.260
Token on
Token ID319
Feature activation-1.59e-01

Pos logit contributions
eta+2.735
adh+2.582
�醒+2.264
chin+2.248
arms+2.242
Neg logit contributions
 Fisher-1.927
 Drinking-1.813
 APPLIC-1.788
 dams-1.749
 Toll-1.727
Token the
Token ID262
Feature activation-4.66e-01

Pos logit contributions
 Fisher+0.095
 Drinking+0.087
 APPLIC+0.087
 dams+0.085
 Tile+0.084
Neg logit contributions
eta-0.152
adh-0.144
�醒-0.128
chin-0.126
arms-0.126
Token Store
Token ID9363
Feature activation-1.48e+00

Pos logit contributions
 Fisher+0.284
 Drinking+0.262
 APPLIC+0.261
 dams+0.257
 Tile+0.253
Neg logit contributions
eta-0.439
adh-0.416
�醒-0.369
chin-0.364
arms-0.361
Token from
Token ID422
Feature activation+3.55e-01

Pos logit contributions
 Fisher+0.941
 Drinking+0.872
 APPLIC+0.871
 dams+0.853
 Tile+0.848
Neg logit contributions
eta-1.360
adh-1.289
�醒-1.148
chin-1.121
arms-1.113
Token Home
Token ID5995
Feature activation-1.70e-01

Pos logit contributions
eta+0.334
adh+0.317
�醒+0.281
chin+0.277
arms+0.275
Neg logit contributions
 Fisher-0.215
 Drinking-0.199
 APPLIC-0.198
 dams-0.195
 Tile-0.192
Token the
Token ID262
Feature activation-1.23e-02

Pos logit contributions
eta+0.801
adh+0.759
�醒+0.669
chin+0.658
mast+0.652
Neg logit contributions
 Fisher-0.572
 APPLIC-0.532
 Drinking-0.531
 dams-0.521
 Tile-0.515
Token Sch
Token ID3059
Feature activation-2.46e-01

Pos logit contributions
 Fisher+0.008
 Drinking+0.007
 APPLIC+0.007
 dams+0.007
 Tile+0.007
Neg logit contributions
eta-0.011
adh-0.011
�醒-0.009
chin-0.009
arms-0.009
Tokenm
Token ID76
Feature activation-1.94e+00

Pos logit contributions
 Fisher+0.185
 Drinking+0.173
 APPLIC+0.173
 dams+0.170
 Tile+0.169
Neg logit contributions
eta-0.196
adh-0.185
�醒-0.162
chin-0.157
arms-0.155
Tokenoes
Token ID3028
Feature activation-3.40e-02

Pos logit contributions
 Fisher+1.486
 Drinking+1.393
 APPLIC+1.383
 Tile+1.363
 dams+1.361
Neg logit contributions
eta-1.500
adh-1.416
�醒-1.263
chin-1.198
mast-1.191
Token on
Token ID319
Feature activation+4.93e-01

Pos logit contributions
 Fisher+0.023
 Drinking+0.021
 APPLIC+0.021
 dams+0.021
 Tile+0.021
Neg logit contributions
eta-0.030
adh-0.028
�醒-0.025
chin-0.025
arms-0.024
Token Twitter
Token ID3009
Feature activation+3.27e+00

Pos logit contributions
eta+0.440
adh+0.416
�醒+0.367
chin+0.361
mast+0.357
Neg logit contributions
 Fisher-0.326
 APPLIC-0.302
 Drinking-0.302
 dams-0.297
 Tile-0.294
Token!
Token ID0
Feature activation+6.97e-01

Pos logit contributions
eta+2.730
adh+2.577
�醒+2.212
chin+2.198
mast+2.186
Neg logit contributions
 Fisher-2.286
 APPLIC-2.138
 Drinking-2.125
 dams-2.117
 DUI-2.079
Token
Token ID198
Feature activation+3.76e-01

Pos logit contributions
eta+0.624
adh+0.589
�醒+0.521
chin+0.511
mast+0.507
Neg logit contributions
 Fisher-0.456
 APPLIC-0.425
 Drinking-0.423
 dams-0.418
 Tile-0.411
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
eta+0.327
adh+0.308
chin+0.266
�醒+0.265
arms+0.264
Neg logit contributions
 Fisher-0.256
 Drinking-0.239
 APPLIC-0.238
 dams-0.236
 Tile-0.233
TokenSubscribe
Token ID27125
Feature activation+2.16e+00

Pos logit contributions
 Fisher+0.088
 Drinking+0.082
 APPLIC+0.082
 dams+0.080
 Tile+0.080
Neg logit contributions
eta-0.118
adh-0.111
�醒-0.099
chin-0.096
arms-0.095
Token to
Token ID284
Feature activation+1.73e+00

Pos logit contributions
eta+2.076
adh+1.968
�醒+1.758
chin+1.723
mast+1.715
Neg logit contributions
 Fisher-1.263
 APPLIC-1.168
 Drinking-1.161
 dams-1.143
 DUI-1.122
Token addition
Token ID3090
Feature activation+7.69e-01

Pos logit contributions
eta+0.582
adh+0.556
�醒+0.495
chin+0.489
arms+0.486
Neg logit contributions
 Fisher-0.308
 Drinking-0.285
 APPLIC-0.285
 dams-0.278
 Tile-0.273
Token,
Token ID11
Feature activation-1.23e+00

Pos logit contributions
eta+0.709
adh+0.672
�醒+0.598
chin+0.585
arms+0.581
Neg logit contributions
 Fisher-0.484
 Drinking-0.449
 APPLIC-0.448
 dams-0.438
 Tile-0.435
Token Hamas
Token ID15767
Feature activation-5.88e-01

Pos logit contributions
 Fisher+0.498
 Drinking+0.442
 APPLIC+0.440
 dams+0.425
 Tile+0.420
Neg logit contributions
eta-1.405
adh-1.347
�醒-1.227
chin-1.207
arms-1.201
Token activities
Token ID4568
Feature activation-1.09e+00

Pos logit contributions
 Fisher+0.383
 Drinking+0.356
 APPLIC+0.355
 dams+0.347
 Tile+0.345
Neg logit contributions
eta-0.530
adh-0.502
�醒-0.444
chin-0.436
arms-0.432
Token in
Token ID287
Feature activation-9.49e-01

Pos logit contributions
 Fisher+0.651
 Drinking+0.600
 APPLIC+0.599
 dams+0.587
 Tile+0.581
Neg logit contributions
eta-1.048
adh-0.996
�醒-0.891
chin-0.872
arms-0.865
Token Turkey
Token ID7137
Feature activation+4.18e-01

Pos logit contributions
 Fisher+0.437
 APPLIC+0.397
 Drinking+0.394
 dams+0.381
 Tile+0.376
Neg logit contributions
eta-1.033
adh-0.989
�醒-0.893
chin-0.880
arms-0.876
Token in
Token ID287
Feature activation-4.93e-01

Pos logit contributions
eta+0.367
adh+0.347
�醒+0.305
chin+0.300
arms+0.296
Neg logit contributions
 Fisher-0.283
 Drinking-0.264
 APPLIC-0.263
 dams-0.258
 Tile-0.256
Token general
Token ID2276
Feature activation-7.51e-02

Pos logit contributions
 Fisher+0.312
 Drinking+0.289
 APPLIC+0.289
 dams+0.282
 Tile+0.280
Neg logit contributions
eta-0.454
adh-0.431
�醒-0.381
chin-0.374
arms-0.372
Token will
Token ID481
Feature activation+3.84e-02

Pos logit contributions
 Fisher+0.051
 Drinking+0.047
 APPLIC+0.047
 dams+0.046
 Tile+0.046
Neg logit contributions
eta-0.066
adh-0.062
�醒-0.055
chin-0.054
arms-0.054
Token be
Token ID307
Feature activation+9.97e-02

Pos logit contributions
eta+0.034
adh+0.032
�醒+0.029
chin+0.028
arms+0.028
Neg logit contributions
 Fisher-0.025
 Drinking-0.024
 APPLIC-0.024
 dams-0.023
 Tile-0.023
Token limited
Token ID3614
Feature activation-1.79e-01

Pos logit contributions
eta+0.092
adh+0.087
�醒+0.077
chin+0.076
mast+0.075
Neg logit contributions
 Fisher-0.063
 Drinking-0.058
 APPLIC-0.058
 dams-0.057
 Tile-0.057
Token,
Token ID11
Feature activation-6.46e-01

Pos logit contributions
 Fisher+0.931
 Drinking+0.864
 APPLIC+0.862
 dams+0.846
 Tile+0.839
Neg logit contributions
eta-1.306
adh-1.237
�醒-1.097
chin-1.074
arms-1.065
Token the
Token ID262
Feature activation+5.68e-01

Pos logit contributions
 Fisher+0.381
 Drinking+0.352
 APPLIC+0.351
 dams+0.343
 Tile+0.340
Neg logit contributions
eta-0.622
adh-0.591
�醒-0.527
chin-0.518
arms-0.514
Token first
Token ID717
Feature activation+3.03e-01

Pos logit contributions
eta+0.530
adh+0.502
�醒+0.444
chin+0.438
mast+0.437
Neg logit contributions
 Fisher-0.346
 Drinking-0.324
 APPLIC-0.322
 dams-0.314
 Tile-0.313
Token modified
Token ID9518
Feature activation+1.50e+00

Pos logit contributions
eta+0.279
adh+0.264
�醒+0.233
chin+0.230
mast+0.229
Neg logit contributions
 Fisher-0.190
 Drinking-0.177
 APPLIC-0.177
 dams-0.173
 Tile-0.172
Token Block
Token ID9726
Feature activation+9.92e-01

Pos logit contributions
eta+1.396
adh+1.320
�醒+1.160
chin+1.153
mast+1.153
Neg logit contributions
 Fisher-0.915
 Drinking-0.860
 APPLIC-0.853
 dams-0.833
 Tile-0.831
Token III
Token ID6711
Feature activation+1.88e+00

Pos logit contributions
eta+0.912
adh+0.862
�醒+0.765
chin+0.756
mast+0.747
Neg logit contributions
 Fisher-0.626
 Drinking-0.583
 APPLIC-0.580
 dams-0.569
 Tile-0.562
Token spacecraft
Token ID16807
Feature activation+7.62e-01

Pos logit contributions
eta+1.774
adh+1.681
�醒+1.481
chin+1.475
mast+1.472
Neg logit contributions
 Fisher-1.120
 Drinking-1.047
 APPLIC-1.040
 dams-1.013
 Tile-1.011
Token,
Token ID11
Feature activation+1.66e+00

Pos logit contributions
eta+0.702
adh+0.664
�醒+0.584
chin+0.578
mast+0.576
Neg logit contributions
 Fisher-0.478
 Drinking-0.446
 APPLIC-0.445
 dams-0.434
 Tile-0.433
Token designated
Token ID11032
Feature activation+1.90e+00

Pos logit contributions
eta+1.548
adh+1.472
�醒+1.293
chin+1.286
mast+1.284
Neg logit contributions
 Fisher-1.000
 APPLIC-0.934
 Drinking-0.932
 dams-0.914
 Tile-0.905
Token Ranger
Token ID21888
Feature activation-5.64e-01

Pos logit contributions
eta+1.741
adh+1.654
�醒+1.449
chin+1.444
mast+1.433
Neg logit contributions
 Fisher-1.174
 APPLIC-1.094
 Drinking-1.091
 dams-1.076
 Tile-1.059
Token A
Token ID317
Feature activation+1.50e+00

Pos logit contributions
 Fisher+0.353
 Drinking+0.327
 APPLIC+0.326
 dams+0.320
 Tile+0.316
Neg logit contributions
eta-0.524
adh-0.497
�醒-0.444
chin-0.433
arms-0.429
Token with
Token ID351
Feature activation+1.92e-01

Pos logit contributions
eta+0.147
adh+0.139
�醒+0.122
chin+0.120
arms+0.119
Neg logit contributions
 Fisher-0.111
 Drinking-0.104
 APPLIC-0.104
 dams-0.102
 Tile-0.101
Token already
Token ID1541
Feature activation-4.73e-01

Pos logit contributions
eta+0.177
adh+0.168
�醒+0.149
chin+0.146
arms+0.145
Neg logit contributions
 Fisher-0.122
 Drinking-0.113
 APPLIC-0.113
 dams-0.110
 Tile-0.109
Token athlet
Token ID6976
Feature activation+3.29e+00

Pos logit contributions
 Fisher+0.317
 Drinking+0.295
 APPLIC+0.295
 dams+0.289
 Tile+0.287
Neg logit contributions
eta-0.417
adh-0.394
�醒-0.349
chin-0.341
arms-0.338
Tokenically
Token ID1146
Feature activation+1.82e+00

Pos logit contributions
eta+2.409
adh+2.266
chin+1.896
arms+1.881
mast+1.867
Neg logit contributions
 Fisher-2.661
 APPLIC-2.516
 Drinking-2.503
 Toll-2.466
 dams-2.462
Token fit
Token ID4197
Feature activation+1.04e+00

Pos logit contributions
eta+1.518
adh+1.433
�醒+1.244
chin+1.229
arms+1.220
Neg logit contributions
 Fisher-1.297
 Drinking-1.220
 APPLIC-1.219
 dams-1.188
 Tile-1.184
Token people
Token ID661
Feature activation+1.39e-02

Pos logit contributions
eta+1.017
adh+0.968
�醒+0.863
chin+0.849
arms+0.845
Neg logit contributions
 Fisher-0.601
 Drinking-0.554
 APPLIC-0.554
 dams-0.539
 Tile-0.535
Token,
Token ID11
Feature activation+6.73e-01

Pos logit contributions
eta+0.012
adh+0.011
�醒+0.010
chin+0.010
arms+0.010
Neg logit contributions
 Fisher-0.010
 Drinking-0.009
 APPLIC-0.009
 dams-0.009
 Tile-0.009
Token then
Token ID788
Feature activation+9.92e-02

Pos logit contributions
eta+0.617
adh+0.583
�醒+0.509
chin+0.508
arms+0.505
Neg logit contributions
 Fisher-0.423
 APPLIC-0.393
 Drinking-0.391
 dams-0.385
 Tile-0.380
Token it
Token ID340
Feature activation+1.16e+00

Pos logit contributions
eta+0.092
adh+0.087
�醒+0.077
chin+0.077
arms+0.076
Neg logit contributions
 Fisher-0.061
 APPLIC-0.057
 Drinking-0.056
 dams-0.055
 Tile-0.055
Token offers
Token ID4394
Feature activation+6.29e-01

Pos logit contributions
eta+1.022
adh+0.960
chin+0.840
�醒+0.832
mast+0.831
Neg logit contributions
 Fisher-0.768
 Drinking-0.727
 APPLIC-0.715
 Tile-0.698
-+-+-0.695
Token you
Token ID345
Feature activation+5.87e-01

Pos logit contributions
eta+0.575
adh+0.543
�醒+0.479
chin+0.473
arms+0.470
Neg logit contributions
 Fisher-0.400
 Drinking-0.372
 APPLIC-0.371
 dams-0.362
 Tile-0.359
Token
Token ID198
Feature activation+4.31e-01

Pos logit contributions
 Fisher+0.250
 Drinking+0.233
 APPLIC+0.232
 dams+0.228
 Tile+0.226
Neg logit contributions
eta-0.333
adh-0.315
�醒-0.278
chin-0.273
arms-0.270
Token
Token ID198
Feature activation-4.82e-01

Pos logit contributions
eta+0.375
adh+0.353
chin+0.305
�醒+0.304
arms+0.302
Neg logit contributions
 Fisher-0.294
 Drinking-0.275
 APPLIC-0.274
 dams-0.271
 Tile-0.268
Token2017
Token ID5539
Feature activation-7.57e-01

Pos logit contributions
 Fisher+0.379
 Drinking+0.356
 APPLIC+0.355
 dams+0.349
 Tile+0.347
Neg logit contributions
eta-0.369
adh-0.346
�醒-0.301
chin-0.292
arms-0.289
Token salary
Token ID9588
Feature activation-2.55e+00

Pos logit contributions
 Fisher+0.441
 Drinking+0.409
 APPLIC+0.408
 dams+0.399
 Tile+0.395
Neg logit contributions
eta-0.731
adh-0.693
�醒-0.617
chin-0.608
arms-0.604
Token and
Token ID290
Feature activation-4.28e-01

Pos logit contributions
 Fisher+1.763
 APPLIC+1.633
 Drinking+1.629
 dams+1.615
 Tile+1.590
Neg logit contributions
eta-2.202
adh-2.085
�醒-1.869
chin-1.795
arms-1.773
Token bonuses
Token ID17568
Feature activation+1.29e+00

Pos logit contributions
 Fisher+0.182
 Drinking+0.163
 APPLIC+0.163
 Tile+0.156
 dams+0.155
Neg logit contributions
eta-0.482
adh-0.460
�醒-0.417
chin-0.412
arms-0.411
Token:
Token ID25
Feature activation-2.18e-01

Pos logit contributions
eta+1.095
adh+1.029
chin+0.884
mast+0.879
arms+0.877
Neg logit contributions
 Fisher-0.900
 Drinking-0.852
 APPLIC-0.843
 dams-0.833
 Tile-0.827
Token $
Token ID720
Feature activation-1.34e+00

Pos logit contributions
 Fisher+0.151
 Drinking+0.140
 APPLIC+0.139
 dams+0.136
 Tile+0.136
Neg logit contributions
eta-0.190
adh-0.179
�醒-0.158
chin-0.155
mast-0.153
Token1
Token ID16
Feature activation-5.88e-01

Pos logit contributions
 Fisher+0.857
 Drinking+0.795
 APPLIC+0.792
 dams+0.781
 Tile+0.772
Neg logit contributions
eta-1.230
adh-1.165
�醒-1.029
chin-1.014
arms-1.006
Token million
Token ID1510
Feature activation-4.49e-01

Pos logit contributions
 Fisher+0.379
 Drinking+0.349
 APPLIC+0.347
 dams+0.343
 Toll+0.338
Neg logit contributions
eta-0.536
adh-0.512
�醒-0.450
chin-0.445
arms-0.438
Token (
Token ID357
Feature activation+3.12e-01

Pos logit contributions
 Fisher+0.295
 Drinking+0.272
 APPLIC+0.272
 dams+0.267
 Tile+0.265
Neg logit contributions
eta-0.404
adh-0.382
�醒-0.339
chin-0.332
arms-0.329
Token.
Token ID13
Feature activation-5.37e-01

Pos logit contributions
eta+0.637
adh+0.601
�醒+0.528
chin+0.517
arms+0.512
Neg logit contributions
 Fisher-0.523
 Drinking-0.488
 APPLIC-0.488
 dams-0.479
 Tile-0.475
Tokend
Token ID67
Feature activation+2.28e+00

Pos logit contributions
 Fisher+0.352
 Drinking+0.330
 APPLIC+0.327
 dams+0.322
 Tile+0.319
Neg logit contributions
eta-0.480
adh-0.454
�醒-0.405
chin-0.394
arms-0.390
Token
Token ID198
Feature activation+2.43e-01

Pos logit contributions
eta+1.920
adh+1.808
�醒+1.571
chin+1.541
arms+1.536
Neg logit contributions
 Fisher-1.636
 APPLIC-1.519
 Drinking-1.518
 dams-1.488
 Tile-1.480
Token
Token ID198
Feature activation-4.33e-01

Pos logit contributions
eta+0.212
adh+0.199
chin+0.173
�醒+0.172
arms+0.171
Neg logit contributions
 Fisher-0.166
 Drinking-0.155
 APPLIC-0.154
 dams-0.152
 Tile-0.151
TokenNow
Token ID3844
Feature activation+1.09e+00

Pos logit contributions
 Fisher+0.331
 Drinking+0.310
 APPLIC+0.310
 dams+0.305
 Tile+0.303
Neg logit contributions
eta-0.342
adh-0.321
�醒-0.278
chin-0.272
arms-0.270
Token we
Token ID356
Feature activation+1.12e+00

Pos logit contributions
eta+1.020
adh+0.965
chin+0.847
�醒+0.847
arms+0.839
Neg logit contributions
 Fisher-0.673
 Drinking-0.622
 APPLIC-0.619
 dams-0.605
 Toll-0.602
Token
Token ID447
Feature activation+3.81e-01

Pos logit contributions
eta+1.026
adh+0.972
�醒+0.851
chin+0.851
mast+0.840
Neg logit contributions
 Fisher-0.708
 Drinking-0.657
 APPLIC-0.654
 dams-0.638
 Tile-0.633
Token
Token ID247
Feature activation+6.88e-01

Pos logit contributions
eta+0.313
adh+0.294
�醒+0.257
chin+0.252
arms+0.250
Neg logit contributions
 Fisher-0.279
 APPLIC-0.261
 Drinking-0.260
 dams-0.255
 Tile-0.254
Tokenre
Token ID260
Feature activation+1.07e+00

Pos logit contributions
eta+0.500
adh+0.467
�醒+0.391
chin+0.391
mast+0.386
Neg logit contributions
 Fisher-0.571
 Drinking-0.541
 APPLIC-0.540
 dams-0.531
 Tile-0.528
Token going
Token ID1016
Feature activation+1.71e+00

Pos logit contributions
eta+1.014
adh+0.955
chin+0.843
�醒+0.841
mast+0.837
Neg logit contributions
 Fisher-0.642
 Drinking-0.589
 APPLIC-0.584
 dams-0.574
 Toll-0.569
Token to
Token ID284
Feature activation-1.17e+00

Pos logit contributions
eta+1.695
adh+1.612
�醒+1.448
chin+1.420
arms+1.408
Neg logit contributions
 Fisher-0.959
 Drinking-0.879
 APPLIC-0.877
 dams-0.857
 Tile-0.848
Token Ivy
Token ID27386
Feature activation-1.81e+00

Pos logit contributions
 Fisher+0.897
 Drinking+0.831
 APPLIC+0.829
 dams+0.814
 Tile+0.806
Neg logit contributions
eta-1.262
adh-1.196
�醒-1.064
chin-1.040
mast-1.030
Token League
Token ID4041
Feature activation-2.86e-01

Pos logit contributions
 Fisher+0.448
 Drinking+0.364
 APPLIC+0.358
 dams+0.341
 Tile+0.331
Neg logit contributions
eta-2.367
adh-2.285
�醒-2.115
chin-2.080
mast-2.068
Token universities
Token ID11155
Feature activation-7.08e-01

Pos logit contributions
 Fisher+0.142
 Drinking+0.128
 APPLIC+0.128
 dams+0.125
 Tile+0.124
Neg logit contributions
eta-0.302
adh-0.288
�醒-0.260
chin-0.256
arms-0.254
Token.
Token ID13
Feature activation-4.09e-02

Pos logit contributions
 Fisher+0.480
 Drinking+0.447
 APPLIC+0.446
 dams+0.438
 Tile+0.434
Neg logit contributions
eta-0.620
adh-0.586
�醒-0.519
chin-0.506
arms-0.502
Token Photo
Token ID5555
Feature activation+1.22e+00

Pos logit contributions
 Fisher+0.027
 APPLIC+0.025
 Drinking+0.025
 dams+0.025
 Tile+0.024
Neg logit contributions
eta-0.036
adh-0.034
�醒-0.030
chin-0.030
arms-0.029
Token:
Token ID25
Feature activation-2.88e-01

Pos logit contributions
eta+1.089
adh+1.031
�醒+0.909
chin+0.893
arms+0.884
Neg logit contributions
 Fisher-0.811
 Drinking-0.756
 APPLIC-0.755
 dams-0.740
 Tile-0.734
Token Luis
Token ID20894
Feature activation-8.99e-01

Pos logit contributions
 Fisher+0.199
 APPLIC+0.186
 Drinking+0.186
 dams+0.183
 Tile+0.181
Neg logit contributions
eta-0.248
adh-0.234
�醒-0.205
chin-0.202
arms-0.199
Token Sin
Token ID10884
Feature activation+1.20e+00

Pos logit contributions
 Fisher+0.638
 APPLIC+0.599
 Drinking+0.597
 dams+0.590
 Tile+0.583
Neg logit contributions
eta-0.757
adh-0.712
�醒-0.622
chin-0.610
arms-0.604
Tokenco
Token ID1073
Feature activation-1.73e+00

Pos logit contributions
eta+0.940
adh+0.882
�醒+0.761
chin+0.746
arms+0.738
Neg logit contributions
 Fisher-0.920
 Drinking-0.866
 APPLIC-0.865
 dams-0.852
 Tile-0.845
Token/
Token ID14
Feature activation+1.44e+00

Pos logit contributions
 Fisher+1.230
 Drinking+1.142
 APPLIC+1.135
 dams+1.114
 Tile+1.111
Neg logit contributions
eta-1.455
adh-1.379
�醒-1.222
chin-1.181
mast-1.178
TokenLos
Token ID28903
Feature activation-7.41e-01

Pos logit contributions
eta+1.127
adh+1.055
�醒+0.902
chin+0.901
arms+0.888
Neg logit contributions
 Fisher-1.107
 Drinking-1.038
 APPLIC-1.036
 dams-1.025
 Tile-1.011
Token low
Token ID1877
Feature activation-8.29e-01

Pos logit contributions
 Fisher+0.078
 Drinking+0.073
 APPLIC+0.073
 dams+0.071
 Tile+0.070
Neg logit contributions
eta-0.128
adh-0.122
�醒-0.108
chin-0.106
arms-0.106
Token commodity
Token ID19464
Feature activation-1.88e+00

Pos logit contributions
 Fisher+0.474
 APPLIC+0.437
 Drinking+0.437
 dams+0.425
 Tile+0.423
Neg logit contributions
eta-0.811
adh-0.771
�醒-0.688
chin-0.676
arms-0.673
Token prices
Token ID4536
Feature activation-2.44e-01

Pos logit contributions
 Fisher+1.262
 Drinking+1.171
 APPLIC+1.169
 dams+1.152
 Tile+1.138
Neg logit contributions
eta-1.654
adh-1.566
�醒-1.389
chin-1.355
arms-1.342
Token have
Token ID423
Feature activation-6.37e-01

Pos logit contributions
 Fisher+0.153
 Drinking+0.142
 APPLIC+0.142
 dams+0.138
 Tile+0.138
Neg logit contributions
eta-0.225
adh-0.214
�醒-0.188
chin-0.186
mast-0.184
Token all
Token ID477
Feature activation-6.22e-01

Pos logit contributions
 Fisher+0.391
 Drinking+0.365
 APPLIC+0.364
 Tile+0.353
 dams+0.351
Neg logit contributions
eta-0.597
adh-0.566
�醒-0.500
chin-0.494
arms-0.488
Token hurt
Token ID5938
Feature activation-8.70e-02

Pos logit contributions
 Fisher+0.372
 Drinking+0.346
 APPLIC+0.345
 Tile+0.334
 dams+0.333
Neg logit contributions
eta-0.593
adh-0.563
�醒-0.498
chin-0.494
mast-0.488
Token Argentina
Token ID16519
Feature activation-1.65e+00

Pos logit contributions
 Fisher+0.050
 APPLIC+0.047
 Drinking+0.047
 dams+0.045
 Tile+0.045
Neg logit contributions
eta-0.084
adh-0.080
�醒-0.071
chin-0.070
arms-0.070
Token;
Token ID26
Feature activation+9.57e-01

Pos logit contributions
 Fisher+1.027
 Drinking+0.948
 APPLIC+0.945
 dams+0.930
 Tile+0.918
Neg logit contributions
eta-1.531
adh-1.454
�醒-1.299
chin-1.265
arms-1.259
Token true
Token ID2081
Feature activation+6.08e-01

Pos logit contributions
eta+0.892
adh+0.847
chin+0.738
�醒+0.736
arms+0.732
Neg logit contributions
 Fisher-0.581
 APPLIC-0.546
 Drinking-0.544
 Tile-0.527
 dams-0.526
Token enough
Token ID1576
Feature activation-1.19e+00

Pos logit contributions
eta+0.553
adh+0.524
�醒+0.461
chin+0.455
arms+0.452
Neg logit contributions
 Fisher-0.387
 Drinking-0.364
 APPLIC-0.362
 Tile-0.351
 dams-0.350
Token.
Token ID13
Feature activation-7.39e-01

Pos logit contributions
 Fisher+0.758
 Drinking+0.703
 APPLIC+0.702
 dams+0.688
 Tile+0.682
Neg logit contributions
eta-1.094
adh-1.036
�醒-0.920
chin-0.902
arms-0.894
Token,
Token ID11
Feature activation+4.14e-01

Pos logit contributions
eta+0.924
adh+0.873
�醒+0.771
chin+0.756
mast+0.748
Neg logit contributions
 Fisher-0.692
 Drinking-0.645
 APPLIC-0.643
 dams-0.631
 Tile-0.626
Token and
Token ID290
Feature activation-2.24e-01

Pos logit contributions
eta+0.373
adh+0.353
�醒+0.311
chin+0.305
arms+0.303
Neg logit contributions
 Fisher-0.269
 Drinking-0.250
 APPLIC-0.250
 dams-0.245
 Tile-0.242
Token four
Token ID1440
Feature activation-9.10e-01

Pos logit contributions
 Fisher+0.134
 Drinking+0.124
 APPLIC+0.124
 dams+0.121
 Tile+0.120
Neg logit contributions
eta-0.213
adh-0.202
�醒-0.180
chin-0.177
arms-0.176
Token panel
Token ID6103
Feature activation+1.83e-01

Pos logit contributions
 Fisher+0.579
 Drinking+0.538
 APPLIC+0.537
 dams+0.525
 Tile+0.521
Neg logit contributions
eta-0.833
adh-0.789
�醒-0.700
chin-0.687
arms-0.680
Tokenists
Token ID1023
Feature activation+1.77e+00

Pos logit contributions
eta+0.141
adh+0.131
chin+0.112
�醒+0.112
mast+0.111
Neg logit contributions
 Fisher-0.142
 Drinking-0.135
 APPLIC-0.134
 Tile-0.132
 dams-0.131
Token voting
Token ID6709
Feature activation-7.19e-01

Pos logit contributions
eta+1.624
adh+1.530
chin+1.348
�醒+1.345
mast+1.331
Neg logit contributions
 Fisher-1.096
 Drinking-1.032
 APPLIC-1.027
 dams-1.003
 Tile-0.993
Token in
Token ID287
Feature activation-4.62e-01

Pos logit contributions
 Fisher+0.500
 Drinking+0.467
 APPLIC+0.466
 dams+0.458
 Tile+0.454
Neg logit contributions
eta-0.616
adh-0.582
�醒-0.512
chin-0.501
arms-0.496
Token favour
Token ID7075
Feature activation-1.61e+00

Pos logit contributions
 Fisher+0.243
 Drinking+0.219
 APPLIC+0.219
 dams+0.215
 Tile+0.212
Neg logit contributions
eta-0.477
adh-0.453
�醒-0.410
chin-0.402
mast-0.399
Token of
Token ID286
Feature activation-9.45e-01

Pos logit contributions
 Fisher+0.951
 APPLIC+0.870
 Drinking+0.865
 dams+0.854
 Tile+0.839
Neg logit contributions
eta-1.556
adh-1.477
�醒-1.321
chin-1.296
arms-1.293
Token a
Token ID257
Feature activation-6.06e-01

Pos logit contributions
 Fisher+0.588
 Drinking+0.543
 APPLIC+0.543
 dams+0.532
 Tile+0.527
Neg logit contributions
eta-0.881
adh-0.836
�醒-0.745
chin-0.729
arms-0.723
Token redesign
Token ID27135
Feature activation-1.08e+00

Pos logit contributions
 Fisher+0.395
 Drinking+0.366
 APPLIC+0.365
 dams+0.358
 Tile+0.355
Neg logit contributions
eta-0.547
adh-0.519
�醒-0.460
chin-0.450
arms-0.446
Token of
Token ID286
Feature activation+1.68e+00

Pos logit contributions
eta+0.202
adh+0.191
�醒+0.169
chin+0.166
arms+0.164
Neg logit contributions
 Fisher-0.145
 Drinking-0.135
 APPLIC-0.135
 dams-0.132
 Tile-0.131
Token Honey
Token ID21788
Feature activation-1.61e+00

Pos logit contributions
eta+1.685
adh+1.591
chin+1.419
 bombshell+1.414
arms+1.409
Neg logit contributions
 Fisher-0.887
 Drinking-0.830
 APPLIC-0.824
stanbul-0.792
 dams-0.790
Tokencomb
Token ID24011
Feature activation-1.24e+00

Pos logit contributions
 Fisher+0.998
 Drinking+0.923
 APPLIC+0.922
 dams+0.903
 Tile+0.894
Neg logit contributions
eta-1.506
adh-1.428
�醒-1.278
chin-1.246
arms-1.234
Token-
Token ID12
Feature activation-6.98e-02

Pos logit contributions
 Fisher+0.676
 Drinking+0.619
 APPLIC+0.617
 dams+0.603
 Tile+0.596
Neg logit contributions
eta-1.243
adh-1.183
�醒-1.064
chin-1.045
arms-1.037
Tokenoptim
Token ID40085
Feature activation-7.62e-01

Pos logit contributions
 Fisher+0.047
 Drinking+0.044
 APPLIC+0.044
 dams+0.043
 Tile+0.043
Neg logit contributions
eta-0.061
adh-0.058
�醒-0.051
chin-0.050
arms-0.050
Tokenized
Token ID1143
Feature activation-9.48e-01

Pos logit contributions
 Fisher+0.639
 Drinking+0.603
 APPLIC+0.602
 dams+0.593
 Tile+0.589
Neg logit contributions
eta-0.545
adh-0.508
�醒-0.436
chin-0.422
arms-0.417
Token apps
Token ID6725
Feature activation-8.81e-01

Pos logit contributions
 Fisher+0.497
 Drinking+0.455
 APPLIC+0.454
 dams+0.441
 Tile+0.435
Neg logit contributions
eta-0.973
adh-0.926
�醒-0.830
chin-0.821
arms-0.815
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
 Fisher+0.583
 Drinking+0.540
 APPLIC+0.540
 dams+0.531
 Tile+0.526
Neg logit contributions
eta-0.786
adh-0.745
�醒-0.662
chin-0.644
arms-0.639
Token Since
Token ID4619
Feature activation+3.35e-01

Pos logit contributions
eta+0.132
adh+0.124
�醒+0.109
chin+0.108
arms+0.107
Neg logit contributions
 Fisher-0.093
 APPLIC-0.087
 Drinking-0.087
 dams-0.086
 Tile-0.084
Token Android
Token ID5565
Feature activation+1.03e-01

Pos logit contributions
eta+0.315
adh+0.298
�醒+0.263
chin+0.261
arms+0.259
Neg logit contributions
 Fisher-0.202
 APPLIC-0.189
 Drinking-0.189
 dams-0.184
 Tile-0.182
Token devices
Token ID4410
Feature activation-4.73e-01

Pos logit contributions
eta+0.098
adh+0.093
�醒+0.083
chin+0.081
arms+0.081
Neg logit contributions
 Fisher-0.062
 Drinking-0.057
 APPLIC-0.057
 dams-0.056
 Tile-0.055
Token
Token ID198
Feature activation-3.91e-02

Pos logit contributions
 Fisher+0.188
 Drinking+0.175
 APPLIC+0.175
 dams+0.172
 Tile+0.170
Neg logit contributions
eta-0.249
adh-0.235
�醒-0.206
chin-0.204
arms-0.202
TokenThe
Token ID464
Feature activation+8.98e-01

Pos logit contributions
 Fisher+0.025
 Drinking+0.023
 APPLIC+0.023
 dams+0.022
 Tile+0.022
Neg logit contributions
eta-0.036
adh-0.034
�醒-0.030
chin-0.030
arms-0.030
Token
Token ID564
Feature activation+6.14e-01

Pos logit contributions
eta+0.827
adh+0.783
�醒+0.692
chin+0.682
arms+0.676
Neg logit contributions
 Fisher-0.564
 APPLIC-0.526
 Drinking-0.524
 dams-0.514
 Tile-0.509
Token
Token ID250
Feature activation+1.69e+00

Pos logit contributions
eta+0.518
adh+0.488
�醒+0.430
chin+0.420
arms+0.415
Neg logit contributions
 Fisher-0.435
 APPLIC-0.407
 Drinking-0.406
 dams-0.399
 Tile-0.397
TokenO
Token ID46
Feature activation-9.48e-01

Pos logit contributions
eta+1.360
adh+1.272
�醒+1.093
chin+1.083
arms+1.074
Neg logit contributions
 Fisher-1.268
 Drinking-1.194
 APPLIC-1.192
 dams-1.181
 Tile-1.169
Token
Token ID447
Feature activation-1.72e-01

Pos logit contributions
 Fisher+0.665
 Drinking+0.621
 APPLIC+0.620
 dams+0.610
 Tile+0.605
Neg logit contributions
eta-0.808
adh-0.762
�醒-0.670
chin-0.656
mast-0.649
Token
Token ID247
Feature activation+3.05e-01

Pos logit contributions
 Fisher+0.119
 APPLIC+0.110
 Drinking+0.110
 dams+0.108
 Tile+0.107
Neg logit contributions
eta-0.149
adh-0.140
�醒-0.124
chin-0.121
arms-0.120
TokenReilly
Token ID25819
Feature activation+9.28e-01

Pos logit contributions
eta+0.267
adh+0.252
�醒+0.221
chin+0.217
mast+0.215
Neg logit contributions
 Fisher-0.208
 Drinking-0.194
 APPLIC-0.194
 dams-0.190
 Tile-0.189
Token Factor
Token ID27929
Feature activation-1.25e-01

Pos logit contributions
eta+0.992
adh+0.947
�醒+0.853
chin+0.842
mast+0.837
Neg logit contributions
 Fisher-0.447
 APPLIC-0.406
 Drinking-0.402
 dams-0.395
 Tile-0.388
Token
Token ID447
Feature activation-1.05e+00

Pos logit contributions
 Fisher+0.082
 APPLIC+0.076
 Drinking+0.076
 dams+0.074
 Tile+0.074
Neg logit contributions
eta-0.113
adh-0.107
�醒-0.095
chin-0.093
mast-0.092
Token
Token ID251
Feature activation-3.81e-01

Pos logit contributions
 Fisher+0.690
 Drinking+0.643
 APPLIC+0.638
 dams+0.629
 Tile+0.621
Neg logit contributions
eta-0.944
adh-0.894
�醒-0.790
chin-0.774
mast-0.770
Token protests
Token ID8536
Feature activation+1.07e+00

Pos logit contributions
eta+0.089
adh+0.083
�醒+0.072
chin+0.070
arms+0.070
Neg logit contributions
 Fisher-0.088
 APPLIC-0.083
 Drinking-0.083
 dams-0.081
 Tile-0.081
Token?
Token ID30
Feature activation+5.95e-01

Pos logit contributions
eta+1.012
adh+0.959
�醒+0.843
chin+0.839
arms+0.831
Neg logit contributions
 Fisher-0.649
 Drinking-0.601
 APPLIC-0.600
 dams-0.586
 Tile-0.584
Token
Token ID198
Feature activation-2.29e-01

Pos logit contributions
eta+0.546
adh+0.517
chin+0.451
�醒+0.450
arms+0.447
Neg logit contributions
 Fisher-0.368
 dams-0.343
 APPLIC-0.343
 Drinking-0.340
 Tile-0.334
Token
Token ID198
Feature activation-7.38e-01

Pos logit contributions
 Fisher+0.154
 Drinking+0.144
 APPLIC+0.143
 dams+0.142
 Tile+0.140
Neg logit contributions
eta-0.202
adh-0.190
�醒-0.166
chin-0.165
arms-0.163
TokenWASHINGTON
Token ID21793
Feature activation-3.02e+00

Pos logit contributions
 Fisher+0.438
 Drinking+0.403
 APPLIC+0.402
 dams+0.392
 Tile+0.389
Neg logit contributions
eta-0.709
adh-0.673
�醒-0.605
chin-0.590
arms-0.584
Token --
Token ID1377
Feature activation-4.68e+00

Pos logit contributions
 Fisher+2.202
 APPLIC+2.062
 Drinking+2.060
 dams+2.017
 DUI+2.000
Neg logit contributions
eta-2.490
adh-2.338
�醒-2.075
chin-2.002
arms-1.988
Token The
Token ID383
Feature activation-8.26e-01

Pos logit contributions
 Fisher+2.968
 Drinking+2.755
 APPLIC+2.709
 DUI+2.638
 Toll+2.617
Neg logit contributions
eta-4.330
adh-4.104
�醒-3.749
chin-3.595
mast-3.553
Token United
Token ID1578
Feature activation+4.13e-01

Pos logit contributions
 Fisher+0.497
 Drinking+0.458
 APPLIC+0.457
 dams+0.447
 Tile+0.443
Neg logit contributions
eta-0.787
adh-0.747
�醒-0.668
chin-0.654
arms-0.648
Token Nations
Token ID7973
Feature activation-1.56e+00

Pos logit contributions
eta+0.394
adh+0.372
chin+0.327
�醒+0.326
arms+0.325
Neg logit contributions
 Fisher-0.245
 APPLIC-0.228
 Drinking-0.227
 dams-0.223
 Tile-0.220
Token envoy
Token ID39137
Feature activation+1.19e+00

Pos logit contributions
 Fisher+0.899
 Drinking+0.827
 APPLIC+0.824
 dams+0.806
 Tile+0.797
Neg logit contributions
eta-1.518
adh-1.442
�醒-1.297
chin-1.269
arms-1.257
Token for
Token ID329
Feature activation+1.63e+00

Pos logit contributions
eta+1.122
adh+1.065
�醒+0.934
chin+0.926
mast+0.920
Neg logit contributions
 Fisher-0.717
 APPLIC-0.674
 Drinking-0.665
 dams-0.656
 Tile-0.652
Token.
Token ID13
Feature activation-5.89e-01

Pos logit contributions
 Fisher+0.206
 APPLIC+0.192
 Drinking+0.192
 dams+0.188
 Tile+0.187
Neg logit contributions
eta-0.259
adh-0.245
�醒-0.216
chin-0.211
mast-0.209
Tokenget
Token ID1136
Feature activation-8.43e-01

Pos logit contributions
 Fisher+0.423
 Drinking+0.396
 APPLIC+0.395
 dams+0.388
 Tile+0.385
Neg logit contributions
eta-0.492
adh-0.464
�醒-0.407
chin-0.398
arms-0.394
TokenStruct
Token ID44909
Feature activation-1.04e+00

Pos logit contributions
 Fisher+0.566
 Drinking+0.527
 APPLIC+0.526
 dams+0.516
 Tile+0.512
Neg logit contributions
eta-0.743
adh-0.703
�醒-0.622
chin-0.607
arms-0.602
Tokenuring
Token ID870
Feature activation+5.22e-01

Pos logit contributions
 Fisher+0.793
 Drinking+0.744
 APPLIC+0.742
 dams+0.730
 Tile+0.727
Neg logit contributions
eta-0.821
adh-0.772
�醒-0.677
chin-0.653
arms-0.648
TokenElement
Token ID20180
Feature activation-4.57e-01

Pos logit contributions
eta+0.444
adh+0.417
�醒+0.365
chin+0.359
arms+0.355
Neg logit contributions
 Fisher-0.368
 Drinking-0.344
 APPLIC-0.343
 dams-0.338
 Tile-0.333
Token(
Token ID7
Feature activation-3.45e+00

Pos logit contributions
 Fisher+0.321
 Drinking+0.300
 APPLIC+0.299
 dams+0.294
 Tile+0.291
Neg logit contributions
eta-0.390
adh-0.367
�醒-0.322
chin-0.316
arms-0.313
Tokencv
Token ID33967
Feature activation-2.61e-01

Pos logit contributions
 Fisher+2.404
 APPLIC+2.268
 Drinking+2.224
 dams+2.208
 Tile+2.206
Neg logit contributions
eta-2.881
adh-2.710
�醒-2.450
chin-2.341
 bombshell-2.333
Token2
Token ID17
Feature activation-2.17e-01

Pos logit contributions
 Fisher+0.158
 Drinking+0.146
 APPLIC+0.146
 dams+0.143
 Tile+0.141
Neg logit contributions
eta-0.248
adh-0.235
�醒-0.209
chin-0.205
arms-0.204
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
 Fisher+0.149
 Drinking+0.139
 APPLIC+0.139
 dams+0.137
 Tile+0.135
Neg logit contributions
eta-0.187
adh-0.177
�醒-0.156
chin-0.153
arms-0.151
TokenM
Token ID44
Feature activation-1.86e-01

Pos logit contributions
 Fisher+0.847
 APPLIC+0.794
 Drinking+0.791
 dams+0.778
 Tile+0.774
Neg logit contributions
eta-0.950
adh-0.895
�醒-0.786
chin-0.763
arms-0.756
TokenOR
Token ID1581
Feature activation-5.61e-01

Pos logit contributions
 Fisher+0.126
 Drinking+0.117
 APPLIC+0.116
 dams+0.115
 Tile+0.113
Neg logit contributions
eta-0.165
adh-0.155
�醒-0.136
chin-0.134
arms-0.133
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 Fisher+0.347
 Drinking+0.322
 APPLIC+0.322
 dams+0.315
 Tile+0.312
Neg logit contributions
eta-0.494
adh-0.468
�醒-0.415
chin-0.408
arms-0.404
Token out
Token ID503
Feature activation-2.14e+00

Pos logit contributions
eta+0.909
adh+0.860
�醒+0.757
chin+0.752
arms+0.740
Neg logit contributions
 Fisher-0.640
 Drinking-0.595
 APPLIC-0.594
 dams-0.581
 Tile-0.573
Token in
Token ID287
Feature activation+8.60e-01

Pos logit contributions
 Fisher+1.401
 Drinking+1.301
 APPLIC+1.297
 dams+1.274
 Tile+1.262
Neg logit contributions
eta-1.928
adh-1.827
�醒-1.627
chin-1.583
arms-1.572
Token a
Token ID257
Feature activation+5.74e-02

Pos logit contributions
eta+0.800
adh+0.761
�醒+0.672
chin+0.663
arms+0.661
Neg logit contributions
 Fisher-0.529
 APPLIC-0.493
 Drinking-0.490
 dams-0.483
 Tile-0.474
Token single
Token ID2060
Feature activation-8.47e-01

Pos logit contributions
eta+0.052
adh+0.049
�醒+0.044
chin+0.043
mast+0.042
Neg logit contributions
 Fisher-0.037
 APPLIC-0.034
 Drinking-0.034
 dams-0.034
 Tile-0.033
Token turn
Token ID1210
Feature activation-3.30e+00

Pos logit contributions
 Fisher+0.352
 APPLIC+0.313
 Drinking+0.312
 dams+0.305
 Tile+0.294
Neg logit contributions
eta-0.961
adh-0.920
�醒-0.837
chin-0.828
mast-0.821
Token!
Token ID0
Feature activation+6.78e-01

Pos logit contributions
 Fisher+2.188
 APPLIC+2.037
 Drinking+2.036
 dams+1.985
 Tile+1.982
Neg logit contributions
eta-2.914
adh-2.767
�醒-2.468
chin-2.393
arms-2.380
Token One
Token ID1881
Feature activation+4.12e-01

Pos logit contributions
eta+0.599
adh+0.562
�醒+0.492
chin+0.490
mast+0.482
Neg logit contributions
 Fisher-0.449
 dams-0.418
 APPLIC-0.417
 Drinking-0.414
 Tile-0.404
Token way
Token ID835
Feature activation-2.22e+00

Pos logit contributions
eta+0.393
adh+0.371
�醒+0.329
chin+0.327
mast+0.323
Neg logit contributions
 Fisher-0.243
 APPLIC-0.224
 Drinking-0.224
 dams-0.221
 DUI-0.216
Token to
Token ID284
Feature activation-7.52e-02

Pos logit contributions
 Fisher+1.280
 Drinking+1.176
 APPLIC+1.174
 dams+1.148
 Tile+1.137
Neg logit contributions
eta-2.162
adh-2.055
�醒-1.842
chin-1.806
arms-1.791
Token avoid
Token ID3368
Feature activation-2.08e-02

Pos logit contributions
 Fisher+0.045
 APPLIC+0.042
 Drinking+0.042
 dams+0.041
 DUI+0.040
Neg logit contributions
eta-0.071
adh-0.068
�醒-0.060
chin-0.060
mast-0.059
Token next
Token ID1306
Feature activation+4.27e-01

Pos logit contributions
 Fisher+0.128
 Drinking+0.120
 APPLIC+0.120
 dams+0.117
 Tile+0.116
Neg logit contributions
eta-0.175
adh-0.165
�醒-0.145
chin-0.144
mast-0.142
Token to
Token ID284
Feature activation+3.22e-01

Pos logit contributions
eta+0.412
adh+0.388
chin+0.341
�醒+0.341
arms+0.339
Neg logit contributions
 Fisher-0.251
 Drinking-0.235
 APPLIC-0.232
 dams-0.226
 Tile-0.225
Token a
Token ID257
Feature activation-2.54e-01

Pos logit contributions
eta+0.295
adh+0.279
�醒+0.247
chin+0.243
arms+0.241
Neg logit contributions
 Fisher-0.204
 APPLIC-0.190
 Drinking-0.189
 dams-0.186
 Tile-0.184
Token matte
Token ID36908
Feature activation+6.28e-01

Pos logit contributions
 Fisher+0.159
 APPLIC+0.148
 Drinking+0.148
 dams+0.145
 Tile+0.143
Neg logit contributions
eta-0.235
adh-0.222
�醒-0.198
chin-0.194
mast-0.192
Token black
Token ID2042
Feature activation+7.97e-01

Pos logit contributions
eta+0.585
adh+0.554
�醒+0.493
chin+0.483
mast+0.479
Neg logit contributions
 Fisher-0.390
 Drinking-0.361
 APPLIC-0.361
 dams-0.354
 Tile-0.350
Token muscle
Token ID8280
Feature activation-3.09e+00

Pos logit contributions
eta+0.735
adh+0.691
�醒+0.614
chin+0.605
mast+0.599
Neg logit contributions
 Fisher-0.502
 APPLIC-0.467
 Drinking-0.466
 dams-0.459
 Tile-0.450
Token car
Token ID1097
Feature activation+3.32e-01

Pos logit contributions
 Fisher+1.968
 Drinking+1.818
 APPLIC+1.813
 Tile+1.782
 DUI+1.772
Neg logit contributions
eta-2.799
adh-2.654
�醒-2.394
chin-2.286
robe-2.267
Token,
Token ID11
Feature activation+6.04e-01

Pos logit contributions
eta+0.300
adh+0.284
�醒+0.252
chin+0.247
arms+0.244
Neg logit contributions
 Fisher-0.216
 Drinking-0.200
 APPLIC-0.200
 dams-0.196
 Tile-0.194
Token suggesting
Token ID9524
Feature activation+4.98e-01

Pos logit contributions
eta+0.548
adh+0.519
�醒+0.459
chin+0.451
arms+0.448
Neg logit contributions
 Fisher-0.389
 APPLIC-0.361
 Drinking-0.361
 dams-0.354
 Tile-0.351
Token that
Token ID326
Feature activation+2.43e-01

Pos logit contributions
eta+0.450
adh+0.425
�醒+0.376
chin+0.370
arms+0.367
Neg logit contributions
 Fisher-0.322
 APPLIC-0.300
 Drinking-0.299
 dams-0.294
 Tile-0.291
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
eta+0.232
adh+0.219
�醒+0.195
chin+0.192
arms+0.191
Neg logit contributions
 Fisher-0.145
 APPLIC-0.135
 Drinking-0.134
 dams-0.131
 Tile-0.130
Token met
Token ID1138
Feature activation+8.17e-01

Pos logit contributions
eta+0.991
adh+0.932
chin+0.816
mast+0.810
�醒+0.809
Neg logit contributions
 Fisher-0.696
 APPLIC-0.648
 Drinking-0.642
 dams-0.632
 Tile-0.625
Token with
Token ID351
Feature activation+6.81e-01

Pos logit contributions
eta+0.709
adh+0.670
�醒+0.591
chin+0.578
arms+0.572
Neg logit contributions
 Fisher-0.561
 Drinking-0.522
 APPLIC-0.521
 dams-0.512
 Tile-0.508
Token D
Token ID360
Feature activation+1.31e+00

Pos logit contributions
eta+0.597
adh+0.565
�醒+0.496
chin+0.488
arms+0.485
Neg logit contributions
 Fisher-0.457
 Drinking-0.426
 APPLIC-0.426
 dams-0.418
 Tile-0.414
Tokenendi
Token ID43109
Feature activation+1.76e+00

Pos logit contributions
eta+0.890
adh+0.826
�醒+0.688
chin+0.678
arms+0.669
Neg logit contributions
 Fisher-1.153
 APPLIC-1.094
 Drinking-1.092
 dams-1.077
 Tile-1.069
Token getting
Token ID1972
Feature activation+4.20e-01

Pos logit contributions
eta+1.589
adh+1.501
�醒+1.317
chin+1.314
mast+1.300
Neg logit contributions
 Fisher-1.127
 APPLIC-1.049
 Drinking-1.041
 dams-1.026
 Tile-1.009
Token up
Token ID510
Feature activation-3.29e+00

Pos logit contributions
eta+0.380
adh+0.359
�醒+0.316
chin+0.313
mast+0.309
Neg logit contributions
 Fisher-0.270
 Drinking-0.252
 APPLIC-0.252
 dams-0.246
 Tile-0.244
Token and
Token ID290
Feature activation+1.24e-02

Pos logit contributions
 Fisher+2.154
 Drinking+2.001
 APPLIC+1.991
 dams+1.956
 Tile+1.942
Neg logit contributions
eta-2.935
adh-2.784
�醒-2.496
chin-2.412
arms-2.382
Token dancing
Token ID15360
Feature activation+7.52e-01

Pos logit contributions
eta+0.012
adh+0.011
�醒+0.010
chin+0.010
mast+0.010
Neg logit contributions
 Fisher-0.007
 APPLIC-0.007
 Drinking-0.007
 dams-0.007
 Tile-0.006
Token about
Token ID546
Feature activation+5.66e-01

Pos logit contributions
eta+0.671
adh+0.636
�醒+0.558
chin+0.550
mast+0.547
Neg logit contributions
 Fisher-0.492
 APPLIC-0.460
 Drinking-0.460
 dams-0.451
 Tile-0.446
Token nearby
Token ID6716
Feature activation+3.52e-01

Pos logit contributions
eta+0.522
adh+0.494
�醒+0.437
chin+0.430
mast+0.430
Neg logit contributions
 Fisher-0.353
 APPLIC-0.330
 Drinking-0.327
 dams-0.321
 Tile-0.318
Token.
Token ID13
Feature activation+3.35e-01

Pos logit contributions
eta+0.310
adh+0.294
�醒+0.258
chin+0.253
mast+0.253
Neg logit contributions
 Fisher-0.234
 APPLIC-0.219
 Drinking-0.218
 dams-0.213
 Tile-0.212