TOP ACTIVATIONS
MAX = 5.977

. You essentially lay your che v ron template down ,
bought this sub and it messed up and I had to
that had read it and che wed on it and thought
. So my game was screwed up by world events .
cake y and / or che wy baking recipes and not
t nailed down , and screwed their way through floor staff
run the risk of having che aters at tournaments . We
how to catch these workshop che ats and potentially even catch
separation between Gaza and the West Bank . The hope was
northern part of China 's Guang d ong province turns redd
no holds barred to the che qu ered flag ! Race
ko , Ser hi y Kaz arov , Vital y Nem
. The fact that the West Bank separation ( and encro
Let 's make sure these che ats can 't ever be
, was the presence of che wed grass . She may
overthrow Gaddafi and essentially shove Gaz prom out . I mean
region of China in southern Guang d ong province , nest
one lamb ste aks from Scot mid .' The
ran za s che c kered history at the
under g ird ed by che wy caramel mal ts with

MIN ACTIVATIONS
MIN = -6.254

the local chapter of the NAACP , has asked the Dublin
lot dirt ier ," said Natalie Koz l owski , who
in 1945 , representing the NAACP with W . E .
oley , director of fund raising for the Bomber Command Association
, head of environmental group Greenpeace s International Nuclear
involving major campaign supporters , raising real doubts about their impartial
. For others , like Natalie Port man and E wan
My Family ' dedicated to raising awareness of food insecurity in
fish we ate . Growing up in the ghetto , we
about her support for Planned Parenthood , the pro - choice
the anti - abortion group Center for Medical Progress .
car killing Dec atur High School seniors Derek King and Nicholas
- up small donors , raising more than $ 23 million
Chan , an official with Greenpeace in southern China .
example , in 2010 the ACLU sued on behalf of a
judges : Stan Walker , Natalie B assing th wa ight
out a television ad Thursday raising questions about the singer -
и я , т о
Aaron Larson , associate editor (@ Aaron
to zero - out Planned Parenthood s federal funding

INTERVAL 2.919 - 5.977
CONTAINS 0.372%

Employees Email Print Whats app Menu Whats app Google
head of the division that scrutin ized the tea party applications
, he used the student dat ap ad 's telephone function
held its first summit in Chile last year , having been
82 , 000 who were handed severe disciplinary punishments

INTERVAL -0.138 - 2.919
CONTAINS 56.689%

the Bur m ese authorities said there had been no casualties
aring such different molecules as minerals and organ ics from the
government access es AIR s information but the organization
truck drove by , and a strange can ister flew out
a & ' a T . This is a somewhat hack

INTERVAL -3.196 - -0.138
CONTAINS 42.491%

). From the start , Hitchcock seemed intent on
han em warned an Exxon Mobil executive that Libya might
He takes her arm , brings her gently towards the consulting
may be able to give . ' You cast
in public speeches . Newsletter Sign Up

INTERVAL -6.254 - -3.196
CONTAINS 0.447%

the real - estate boom raised rents beyond the reach of
, you have finished the initiation ceremony , and you are
hearing on Thursday , Mel inda Thompson , the leading lawyer
he could easily be fl irting with one at a real
post and snap my idea up . George , we could
Token.
Token ID13
Feature activation-3.84e-02

Pos logit contributions
adders+0.380
ensical+0.336
udi+0.330
 millenn+0.329
��+0.323
Neg logit contributions
oire-0.469
ドラ-0.451
 Sins-0.447
avering-0.413
gerald-0.398
Token You
Token ID921
Feature activation+1.02e+00

Pos logit contributions
adders+0.063
ensical+0.058
 millenn+0.057
udi+0.056
��+0.055
Neg logit contributions
oire-0.047
 Sins-0.045
ドラ-0.045
avering-0.040
gerald-0.038
Token essentially
Token ID6986
Feature activation-1.15e-02

Pos logit contributions
oire+1.431
ドラ+1.359
 Sins+1.351
avering+1.244
gerald+1.168
Neg logit contributions
adders-1.501
 millenn-1.381
ensical-1.357
��-1.334
udi-1.329
Token lay
Token ID3830
Feature activation+2.32e-01

Pos logit contributions
adders+0.017
 millenn+0.015
ensical+0.015
udi+0.015
��+0.015
Neg logit contributions
oire-0.017
ドラ-0.016
 Sins-0.016
avering-0.014
gerald-0.014
Token your
Token ID534
Feature activation-1.31e-01

Pos logit contributions
oire+0.347
 Sins+0.332
ドラ+0.330
avering+0.306
gerald+0.292
Neg logit contributions
adders-0.318
ensical-0.286
 millenn-0.279
udi-0.278
��-0.276
Token che
Token ID1125
Feature activation+5.98e+00

Pos logit contributions
adders+0.166
ensical+0.148
udi+0.145
 millenn+0.142
��+0.141
Neg logit contributions
oire-0.209
 Sins-0.201
ドラ-0.199
avering-0.185
gerald-0.178
Tokenv
Token ID85
Feature activation+1.02e+00

Pos logit contributions
oire+8.040
ドラ+7.240
 Sins+7.186
avering+7.063
gerald+6.466
Neg logit contributions
adders-8.047
 millenn-7.866
��-7.566
ensical-7.485
udi-6.995
Tokenron
Token ID1313
Feature activation+1.15e+00

Pos logit contributions
oire+1.665
ドラ+1.578
 Sins+1.572
avering+1.454
gerald+1.406
Neg logit contributions
adders-1.274
ensical-1.133
 millenn-1.115
��-1.099
udi-1.097
Token template
Token ID11055
Feature activation+2.95e-01

Pos logit contributions
oire+1.693
 Sins+1.604
ドラ+1.574
avering+1.463
gerald+1.398
Neg logit contributions
adders-1.612
ensical-1.459
 millenn-1.432
��-1.426
udi-1.422
Token down
Token ID866
Feature activation-3.64e-01

Pos logit contributions
oire+0.427
 Sins+0.407
ドラ+0.399
avering+0.371
gerald+0.353
Neg logit contributions
adders-0.423
ensical-0.381
 millenn-0.372
��-0.371
udi-0.369
Token,
Token ID11
Feature activation-6.18e-01

Pos logit contributions
adders+0.515
ensical+0.462
udi+0.452
 millenn+0.451
��+0.446
Neg logit contributions
oire-0.532
ドラ-0.508
 Sins-0.507
avering-0.465
gerald-0.445
Token bought
Token ID5839
Feature activation-3.26e-01

Pos logit contributions
adders+0.804
ensical+0.728
 millenn+0.719
udi+0.711
��+0.704
Neg logit contributions
oire-0.747
ドラ-0.713
 Sins-0.706
avering-0.646
gerald-0.623
Token this
Token ID428
Feature activation-8.40e-01

Pos logit contributions
adders+0.473
ensical+0.428
udi+0.419
 millenn+0.414
��+0.407
Neg logit contributions
oire-0.461
ドラ-0.444
 Sins-0.441
avering-0.401
gerald-0.386
Token sub
Token ID850
Feature activation-4.97e-01

Pos logit contributions
adders+1.131
ensical+1.011
udi+0.986
 millenn+0.973
��+0.960
Neg logit contributions
oire-1.284
ドラ-1.230
 Sins-1.226
avering-1.121
gerald-1.086
Token and
Token ID290
Feature activation+4.57e-01

Pos logit contributions
adders+0.685
ensical+0.617
udi+0.600
 millenn+0.597
��+0.583
Neg logit contributions
oire-0.742
ドラ-0.709
 Sins-0.702
avering-0.647
gerald-0.624
Token it
Token ID340
Feature activation+4.12e-01

Pos logit contributions
oire+0.641
 Sins+0.611
ドラ+0.610
avering+0.557
gerald+0.535
Neg logit contributions
adders-0.667
ensical-0.607
 millenn-0.589
udi-0.588
��-0.579
Token messed
Token ID32621
Feature activation+5.95e+00

Pos logit contributions
oire+0.581
ドラ+0.558
 Sins+0.555
avering+0.507
gerald+0.486
Neg logit contributions
adders-0.599
ensical-0.542
 millenn-0.528
udi-0.523
��-0.516
Token up
Token ID510
Feature activation-2.67e+00

Pos logit contributions
oire+7.207
 Sins+7.062
ドラ+6.902
avering+6.301
gerald+6.077
Neg logit contributions
adders-8.630
ensical-7.992
��-7.814
 millenn-7.761
udi-7.735
Token and
Token ID290
Feature activation+3.72e-01

Pos logit contributions
adders+3.592
ensical+3.207
udi+3.152
 millenn+3.092
��+3.026
Neg logit contributions
oire-4.003
ドラ-3.891
 Sins-3.815
avering-3.546
gerald-3.414
Token I
Token ID314
Feature activation-1.63e-01

Pos logit contributions
oire+0.512
 Sins+0.495
ドラ+0.495
avering+0.446
gerald+0.428
Neg logit contributions
adders-0.550
ensical-0.498
udi-0.483
 millenn-0.482
��-0.473
Token had
Token ID550
Feature activation-3.34e-01

Pos logit contributions
adders+0.230
ensical+0.207
 millenn+0.205
udi+0.203
��+0.199
Neg logit contributions
oire-0.237
ドラ-0.225
 Sins-0.225
avering-0.206
gerald-0.199
Token to
Token ID284
Feature activation+9.26e-01

Pos logit contributions
adders+0.397
ensical+0.349
udi+0.343
 millenn+0.337
��+0.332
Neg logit contributions
oire-0.560
ドラ-0.541
 Sins-0.541
avering-0.498
gerald-0.483
Token that
Token ID326
Feature activation-1.27e+00

Pos logit contributions
oire+1.348
ドラ+1.308
 Sins+1.294
avering+1.206
gerald+1.165
Neg logit contributions
adders-0.924
ensical-0.807
udi-0.790
 millenn-0.775
��-0.760
Token had
Token ID550
Feature activation-1.17e-01

Pos logit contributions
adders+1.761
ensical+1.580
udi+1.543
 millenn+1.528
��+1.501
Neg logit contributions
oire-1.880
ドラ-1.810
 Sins-1.794
avering-1.648
gerald-1.588
Token read
Token ID1100
Feature activation+1.60e+00

Pos logit contributions
adders+0.163
ensical+0.147
udi+0.144
 millenn+0.142
��+0.141
Neg logit contributions
oire-0.171
ドラ-0.164
 Sins-0.164
avering-0.150
gerald-0.143
Token it
Token ID340
Feature activation-8.91e-02

Pos logit contributions
oire+2.258
 Sins+2.170
ドラ+2.136
avering+1.925
gerald+1.845
Neg logit contributions
adders-2.316
ensical-2.092
 millenn-2.044
udi-2.039
��-2.011
Token and
Token ID290
Feature activation+2.87e-01

Pos logit contributions
adders+0.121
ensical+0.109
udi+0.106
 millenn+0.105
��+0.104
Neg logit contributions
oire-0.135
ドラ-0.129
 Sins-0.129
avering-0.118
gerald-0.114
Token che
Token ID1125
Feature activation+5.86e+00

Pos logit contributions
oire+0.412
 Sins+0.393
ドラ+0.392
avering+0.357
gerald+0.344
Neg logit contributions
adders-0.414
ensical-0.374
 millenn-0.363
udi-0.363
��-0.357
Tokenwed
Token ID19103
Feature activation+9.41e-01

Pos logit contributions
oire+7.006
 Sins+6.558
ドラ+6.516
avering+6.222
gerald+5.769
Neg logit contributions
adders-8.900
 millenn-8.513
ensical-8.303
��-8.129
 awareness-7.650
Token on
Token ID319
Feature activation-6.41e-01

Pos logit contributions
oire+1.307
ドラ+1.253
 Sins+1.250
avering+1.137
gerald+1.085
Neg logit contributions
adders-1.388
ensical-1.254
udi-1.223
 millenn-1.219
��-1.208
Token it
Token ID340
Feature activation-1.74e-01

Pos logit contributions
adders+0.944
ensical+0.851
udi+0.836
 millenn+0.826
��+0.815
Neg logit contributions
oire-0.895
ドラ-0.859
 Sins-0.853
avering-0.778
gerald-0.745
Token and
Token ID290
Feature activation+1.89e-01

Pos logit contributions
adders+0.242
ensical+0.216
udi+0.212
 millenn+0.211
��+0.208
Neg logit contributions
oire-0.258
ドラ-0.246
 Sins-0.245
avering-0.225
gerald-0.216
Token thought
Token ID1807
Feature activation-2.70e-01

Pos logit contributions
oire+0.273
 Sins+0.261
ドラ+0.261
avering+0.238
gerald+0.229
Neg logit contributions
adders-0.270
ensical-0.244
udi-0.237
 millenn-0.237
��-0.232
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
adders+1.361
ensical+1.229
udi+1.207
 millenn+1.198
��+1.177
Neg logit contributions
oire-1.219
ドラ-1.166
 Sins-1.162
avering-1.052
gerald-1.013
Token So
Token ID1406
Feature activation-1.06e-01

Pos logit contributions
adders+0.296
ensical+0.271
 millenn+0.267
udi+0.265
��+0.260
Neg logit contributions
oire-0.220
 Sins-0.212
ドラ-0.210
avering-0.187
gerald-0.178
Token my
Token ID616
Feature activation-1.07e+00

Pos logit contributions
adders+0.156
ensical+0.140
udi+0.137
 millenn+0.137
��+0.134
Neg logit contributions
oire-0.149
 Sins-0.143
ドラ-0.143
avering-0.130
gerald-0.124
Token game
Token ID983
Feature activation+6.97e-01

Pos logit contributions
adders+1.435
ensical+1.275
udi+1.261
 millenn+1.237
��+1.216
Neg logit contributions
oire-1.651
ドラ-1.590
 Sins-1.587
avering-1.447
gerald-1.401
Token was
Token ID373
Feature activation-4.37e-01

Pos logit contributions
oire+1.003
ドラ+0.963
 Sins+0.962
avering+0.866
gerald+0.832
Neg logit contributions
adders-0.997
ensical-0.897
udi-0.881
 millenn-0.872
��-0.856
Token screwed
Token ID27527
Feature activation+5.80e+00

Pos logit contributions
adders+0.625
ensical+0.560
udi+0.550
 millenn+0.545
��+0.534
Neg logit contributions
oire-0.629
ドラ-0.608
 Sins-0.604
avering-0.552
gerald-0.529
Token up
Token ID510
Feature activation-2.99e+00

Pos logit contributions
oire+7.117
 Sins+6.885
ドラ+6.763
avering+6.078
gerald+5.843
Neg logit contributions
adders-8.600
 millenn-7.910
ensical-7.891
��-7.721
udi-7.678
Token by
Token ID416
Feature activation+4.23e-01

Pos logit contributions
adders+3.998
ensical+3.567
udi+3.514
 millenn+3.401
��+3.354
Neg logit contributions
oire-4.446
ドラ-4.343
 Sins-4.206
avering-3.959
gerald-3.795
Token world
Token ID995
Feature activation-4.80e-01

Pos logit contributions
oire+0.569
 Sins+0.553
ドラ+0.543
avering+0.495
gerald+0.470
Neg logit contributions
adders-0.642
ensical-0.581
udi-0.569
 millenn-0.562
��-0.557
Token events
Token ID2995
Feature activation-8.60e-01

Pos logit contributions
adders+0.640
ensical+0.568
udi+0.560
 millenn+0.548
��+0.541
Neg logit contributions
oire-0.735
ドラ-0.712
 Sins-0.704
avering-0.651
gerald-0.624
Token.
Token ID13
Feature activation-3.27e-01

Pos logit contributions
adders+1.236
ensical+1.106
udi+1.087
 millenn+1.077
��+1.060
Neg logit contributions
oire-1.235
ドラ-1.185
 Sins-1.184
avering-1.077
gerald-1.036
Token cake
Token ID12187
Feature activation-3.70e-01

Pos logit contributions
adders+0.672
ensical+0.594
udi+0.588
 millenn+0.573
��+0.569
Neg logit contributions
oire-0.768
ドラ-0.736
 Sins-0.734
avering-0.679
gerald-0.656
Tokeny
Token ID88
Feature activation-1.36e-01

Pos logit contributions
adders+0.554
ensical+0.498
udi+0.493
 millenn+0.485
��+0.480
Neg logit contributions
oire-0.509
ドラ-0.487
 Sins-0.484
avering-0.443
gerald-0.424
Token and
Token ID290
Feature activation+2.95e-01

Pos logit contributions
adders+0.190
ensical+0.170
udi+0.168
 millenn+0.165
��+0.163
Neg logit contributions
oire-0.200
 Sins-0.192
ドラ-0.191
avering-0.175
gerald-0.168
Token/
Token ID14
Feature activation+7.68e-01

Pos logit contributions
oire+0.446
 Sins+0.425
ドラ+0.423
avering+0.392
gerald+0.379
Neg logit contributions
adders-0.401
ensical-0.355
udi-0.349
 millenn-0.343
��-0.340
Tokenor
Token ID273
Feature activation+9.50e-01

Pos logit contributions
oire+0.842
ドラ+0.760
 Sins+0.755
avering+0.693
gerald+0.646
Neg logit contributions
 millenn-1.373
adders-1.364
ensical-1.332
��-1.260
udi-1.225
Token che
Token ID1125
Feature activation+5.73e+00

Pos logit contributions
oire+1.422
 Sins+1.359
ドラ+1.345
avering+1.252
gerald+1.206
Neg logit contributions
adders-1.299
ensical-1.149
udi-1.138
 millenn-1.113
��-1.103
Tokenwy
Token ID21768
Feature activation+1.27e+00

Pos logit contributions
oire+6.402
 Sins+5.713
ドラ+5.665
avering+5.428
gerald+5.017
Neg logit contributions
adders-9.317
 millenn-9.100
��-8.670
ensical-8.623
��-8.226
Token baking
Token ID16871
Feature activation+4.91e-01

Pos logit contributions
oire+1.826
 Sins+1.751
ドラ+1.714
avering+1.583
gerald+1.518
Neg logit contributions
adders-1.811
ensical-1.623
udi-1.595
��-1.569
 millenn-1.567
Token recipes
Token ID14296
Feature activation-5.65e-01

Pos logit contributions
oire+0.702
 Sins+0.680
ドラ+0.675
avering+0.620
gerald+0.592
Neg logit contributions
adders-0.700
ensical-0.630
udi-0.612
 millenn-0.606
��-0.602
Token and
Token ID290
Feature activation+3.24e-01

Pos logit contributions
adders+0.825
ensical+0.742
udi+0.725
 millenn+0.725
��+0.713
Neg logit contributions
oire-0.799
 Sins-0.763
ドラ-0.762
avering-0.695
gerald-0.666
Token not
Token ID407
Feature activation-7.44e-01

Pos logit contributions
oire+0.467
 Sins+0.449
ドラ+0.443
avering+0.408
gerald+0.393
Neg logit contributions
adders-0.461
ensical-0.416
udi-0.404
 millenn-0.401
��-0.398
Tokent
Token ID83
Feature activation+2.21e-01

Pos logit contributions
adders+0.873
ensical+0.791
udi+0.779
 millenn+0.772
��+0.759
Neg logit contributions
oire-0.715
ドラ-0.685
 Sins-0.676
avering-0.615
gerald-0.588
Token nailed
Token ID37708
Feature activation+2.24e+00

Pos logit contributions
oire+0.334
ドラ+0.322
 Sins+0.321
avering+0.294
gerald+0.285
Neg logit contributions
adders-0.299
ensical-0.263
 millenn-0.260
udi-0.260
��-0.254
Token down
Token ID866
Feature activation-4.55e-01

Pos logit contributions
oire+3.196
ドラ+3.042
 Sins+3.020
avering+2.758
gerald+2.667
Neg logit contributions
adders-3.220
ensical-2.871
 millenn-2.835
udi-2.834
��-2.827
Token,
Token ID11
Feature activation-6.01e-01

Pos logit contributions
adders+0.645
ensical+0.578
udi+0.568
 millenn+0.560
��+0.551
Neg logit contributions
oire-0.659
ドラ-0.635
 Sins-0.628
avering-0.576
gerald-0.554
Token and
Token ID290
Feature activation-6.27e-02

Pos logit contributions
adders+0.814
ensical+0.727
udi+0.714
 millenn+0.705
��+0.692
Neg logit contributions
oire-0.907
ドラ-0.875
 Sins-0.866
avering-0.799
gerald-0.769
Token screwed
Token ID27527
Feature activation+5.72e+00

Pos logit contributions
adders+0.088
ensical+0.079
udi+0.077
 millenn+0.077
��+0.075
Neg logit contributions
oire-0.092
ドラ-0.089
 Sins-0.088
avering-0.081
gerald-0.078
Token their
Token ID511
Feature activation-8.89e-01

Pos logit contributions
oire+7.365
 Sins+7.102
ドラ+7.047
avering+6.257
gerald+6.227
Neg logit contributions
adders-8.058
ensical-7.530
 millenn-7.424
��-7.323
udi-7.263
Token way
Token ID835
Feature activation+6.04e-01

Pos logit contributions
adders+1.123
ensical+0.997
udi+0.978
 millenn+0.961
��+0.946
Neg logit contributions
oire-1.422
ドラ-1.378
 Sins-1.366
avering-1.261
gerald-1.219
Token through
Token ID832
Feature activation-1.75e+00

Pos logit contributions
oire+0.938
ドラ+0.894
 Sins+0.882
avering+0.814
gerald+0.787
Neg logit contributions
adders-0.789
ensical-0.715
 millenn-0.708
udi-0.699
��-0.690
Token floor
Token ID4314
Feature activation-4.77e-01

Pos logit contributions
adders+2.422
ensical+2.165
udi+2.128
 millenn+2.095
��+2.058
Neg logit contributions
oire-2.598
ドラ-2.508
 Sins-2.479
avering-2.285
gerald-2.198
Token staff
Token ID3085
Feature activation+7.95e-01

Pos logit contributions
adders+0.618
ensical+0.556
udi+0.540
 millenn+0.534
��+0.527
Neg logit contributions
oire-0.750
ドラ-0.722
 Sins-0.710
avering-0.656
gerald-0.632
Token run
Token ID1057
Feature activation+2.34e-02

Pos logit contributions
adders+1.961
ensical+1.751
 millenn+1.713
udi+1.712
��+1.670
Neg logit contributions
oire-2.154
ドラ-2.070
 Sins-2.063
avering-1.903
gerald-1.823
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
oire+0.034
 Sins+0.033
ドラ+0.032
avering+0.030
gerald+0.028
Neg logit contributions
adders-0.033
ensical-0.030
udi-0.029
 millenn-0.029
��-0.029
Token risk
Token ID2526
Feature activation+7.23e-02

Pos logit contributions
adders+0.022
ensical+0.019
udi+0.019
 millenn+0.019
��+0.019
Neg logit contributions
oire-0.023
 Sins-0.022
ドラ-0.022
avering-0.020
gerald-0.019
Token of
Token ID286
Feature activation+2.93e-01

Pos logit contributions
oire+0.116
ドラ+0.112
 Sins+0.111
avering+0.103
gerald+0.099
Neg logit contributions
adders-0.091
ensical-0.081
udi-0.079
 millenn-0.078
��-0.076
Token having
Token ID1719
Feature activation-3.74e-01

Pos logit contributions
oire+0.434
 Sins+0.426
ドラ+0.418
avering+0.387
gerald+0.367
Neg logit contributions
adders-0.399
ensical-0.359
udi-0.352
��-0.346
 millenn-0.345
Token che
Token ID1125
Feature activation+5.67e+00

Pos logit contributions
adders+0.516
ensical+0.465
udi+0.457
 millenn+0.453
��+0.451
Neg logit contributions
oire-0.548
 Sins-0.531
ドラ-0.529
avering-0.483
gerald-0.461
Tokenaters
Token ID8605
Feature activation+2.03e+00

Pos logit contributions
oire+6.882
 Sins+6.645
avering+6.389
ドラ+6.362
gerald+5.851
Neg logit contributions
adders-8.282
 millenn-8.013
ensical-7.859
��-7.734
 awareness-7.132
Token at
Token ID379
Feature activation-2.26e-01

Pos logit contributions
oire+2.874
 Sins+2.805
ドラ+2.748
avering+2.552
gerald+2.392
Neg logit contributions
adders-2.856
ensical-2.594
 millenn-2.583
��-2.555
udi-2.521
Token tournaments
Token ID18130
Feature activation+1.23e+00

Pos logit contributions
adders+0.319
ensical+0.288
udi+0.282
 millenn+0.280
��+0.274
Neg logit contributions
oire-0.327
 Sins-0.316
ドラ-0.315
avering-0.288
gerald-0.274
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
oire+1.636
 Sins+1.586
ドラ+1.554
avering+1.429
gerald+1.345
Neg logit contributions
adders-1.853
ensical-1.684
 millenn-1.665
udi-1.640
��-1.635
Token We
Token ID775
Feature activation-2.37e-01

Pos logit contributions
oire+0.139
 Sins+0.134
ドラ+0.132
avering+0.118
gerald+0.112
Neg logit contributions
adders-0.191
ensical-0.177
 millenn-0.174
udi-0.171
��-0.169
Token how
Token ID703
Feature activation+2.90e-02

Pos logit contributions
adders+1.363
ensical+1.235
udi+1.217
 millenn+1.205
��+1.189
Neg logit contributions
oire-1.127
ドラ-1.079
 Sins-1.075
avering-0.973
gerald-0.929
Token to
Token ID284
Feature activation+7.18e-01

Pos logit contributions
oire+0.047
 Sins+0.046
ドラ+0.045
avering+0.042
gerald+0.040
Neg logit contributions
adders-0.036
ensical-0.032
udi-0.031
 millenn-0.031
��-0.031
Token catch
Token ID4929
Feature activation+9.21e-01

Pos logit contributions
oire+1.009
 Sins+0.967
ドラ+0.961
avering+0.883
gerald+0.844
Neg logit contributions
adders-1.042
 millenn-0.939
ensical-0.938
��-0.928
udi-0.908
Token these
Token ID777
Feature activation-1.22e+00

Pos logit contributions
oire+1.225
 Sins+1.195
ドラ+1.162
avering+1.073
gerald+1.020
Neg logit contributions
adders-1.386
ensical-1.268
��-1.244
 millenn-1.242
udi-1.237
Token workshop
Token ID20243
Feature activation-4.00e-01

Pos logit contributions
adders+1.882
ensical+1.697
udi+1.675
 millenn+1.654
��+1.639
Neg logit contributions
oire-1.621
ドラ-1.558
 Sins-1.555
avering-1.405
gerald-1.343
Token che
Token ID1125
Feature activation+5.67e+00

Pos logit contributions
adders+0.514
ensical+0.453
udi+0.446
 millenn+0.443
��+0.437
Neg logit contributions
oire-0.630
ドラ-0.613
 Sins-0.613
avering-0.561
gerald-0.539
Tokenats
Token ID1381
Feature activation+2.51e+00

Pos logit contributions
oire+6.822
 Sins+6.625
ドラ+6.400
avering+6.292
gerald+5.847
Neg logit contributions
adders-8.140
 millenn-7.982
ensical-7.763
��-7.660
udi-7.123
Token and
Token ID290
Feature activation+2.89e-01

Pos logit contributions
oire+3.323
 Sins+3.198
ドラ+3.123
avering+2.903
gerald+2.777
Neg logit contributions
adders-3.750
 millenn-3.462
ensical-3.389
��-3.359
udi-3.300
Token potentially
Token ID6196
Feature activation-1.43e+00

Pos logit contributions
oire+0.409
 Sins+0.394
ドラ+0.392
avering+0.357
gerald+0.342
Neg logit contributions
adders-0.416
ensical-0.378
 millenn-0.373
udi-0.368
��-0.366
Token even
Token ID772
Feature activation-2.91e-01

Pos logit contributions
adders+2.038
ensical+1.831
udi+1.793
 millenn+1.788
��+1.757
Neg logit contributions
oire-2.058
ドラ-1.982
 Sins-1.971
avering-1.803
gerald-1.727
Token catch
Token ID4929
Feature activation+1.60e+00

Pos logit contributions
adders+0.413
ensical+0.372
 millenn+0.366
udi+0.363
��+0.359
Neg logit contributions
oire-0.421
ドラ-0.406
 Sins-0.405
avering-0.369
gerald-0.352
Token separation
Token ID14139
Feature activation-1.34e+00

Pos logit contributions
oire+0.802
 Sins+0.773
ドラ+0.764
avering+0.712
gerald+0.689
Neg logit contributions
adders-0.485
ensical-0.418
udi-0.405
��-0.400
 millenn-0.397
Token between
Token ID1022
Feature activation-1.02e+00

Pos logit contributions
adders+1.751
ensical+1.557
udi+1.528
 millenn+1.500
��+1.469
Neg logit contributions
oire-2.075
ドラ-2.011
 Sins-1.986
avering-1.839
gerald-1.775
Token Gaza
Token ID11010
Feature activation+2.47e+00

Pos logit contributions
adders+1.499
ensical+1.350
udi+1.324
 millenn+1.307
��+1.288
Neg logit contributions
oire-1.436
ドラ-1.382
 Sins-1.371
avering-1.252
gerald-1.200
Token and
Token ID290
Feature activation+6.32e-03

Pos logit contributions
oire+3.479
 Sins+3.351
ドラ+3.306
avering+3.035
gerald+2.878
Neg logit contributions
adders-3.528
ensical-3.215
 millenn-3.081
udi-3.049
��-3.039
Token the
Token ID262
Feature activation+5.04e-01

Pos logit contributions
oire+0.008
ドラ+0.008
 Sins+0.008
avering+0.007
gerald+0.007
Neg logit contributions
adders-0.010
ensical-0.009
udi-0.009
 millenn-0.008
��-0.008
Token West
Token ID2688
Feature activation+5.59e+00

Pos logit contributions
oire+0.688
 Sins+0.661
ドラ+0.649
avering+0.590
gerald+0.559
Neg logit contributions
adders-0.759
ensical-0.686
 millenn-0.669
udi-0.665
��-0.659
Token Bank
Token ID5018
Feature activation+9.12e-02

Pos logit contributions
oire+4.717
 Sins+4.417
ドラ+4.092
avering+3.759
gerald+3.121
Neg logit contributions
 millenn-10.694
adders-10.560
ensical-10.259
��-9.854
udi-9.522
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
oire+0.129
 Sins+0.122
ドラ+0.122
avering+0.111
gerald+0.106
Neg logit contributions
adders-0.133
ensical-0.120
udi-0.118
 millenn-0.117
��-0.115
Token The
Token ID383
Feature activation+1.27e-01

Pos logit contributions
oire+0.141
 Sins+0.137
ドラ+0.136
avering+0.120
gerald+0.113
Neg logit contributions
adders-0.199
ensical-0.185
 millenn-0.182
udi-0.178
��-0.175
Token hope
Token ID2911
Feature activation-1.35e+00

Pos logit contributions
oire+0.192
 Sins+0.185
ドラ+0.183
avering+0.168
gerald+0.161
Neg logit contributions
adders-0.171
ensical-0.153
udi-0.149
 millenn-0.148
��-0.146
Token was
Token ID373
Feature activation-4.31e-01

Pos logit contributions
adders+1.880
ensical+1.685
udi+1.656
 millenn+1.630
��+1.602
Neg logit contributions
oire-1.993
ドラ-1.922
 Sins-1.902
avering-1.751
gerald-1.683
Token northern
Token ID7840
Feature activation+2.38e+00

Pos logit contributions
oire+0.160
 Sins+0.157
ドラ+0.153
avering+0.142
gerald+0.136
Neg logit contributions
adders-0.142
ensical-0.125
udi-0.121
 millenn-0.121
��-0.118
Token part
Token ID636
Feature activation+2.37e+00

Pos logit contributions
oire+3.023
 Sins+3.005
ドラ+2.766
avering+2.559
gerald+2.400
Neg logit contributions
adders-3.786
ensical-3.381
udi-3.262
 millenn-3.248
��-3.236
Token of
Token ID286
Feature activation+1.42e-01

Pos logit contributions
oire+3.322
 Sins+3.199
ドラ+3.087
avering+2.841
gerald+2.696
Neg logit contributions
adders-3.402
 millenn-3.348
ensical-3.316
��-3.092
udi-2.975
Token China
Token ID2807
Feature activation+1.49e+00

Pos logit contributions
oire+0.192
 Sins+0.186
ドラ+0.183
avering+0.167
gerald+0.159
Neg logit contributions
adders-0.216
ensical-0.195
udi-0.190
 millenn-0.189
��-0.186
Token's
Token ID338
Feature activation+2.67e-01

Pos logit contributions
oire+2.016
 Sins+1.965
ドラ+1.916
avering+1.733
gerald+1.645
Neg logit contributions
adders-2.246
ensical-2.015
 millenn-1.978
��-1.946
udi-1.939
Token Guang
Token ID36980
Feature activation+5.49e+00

Pos logit contributions
oire+0.360
 Sins+0.352
ドラ+0.340
avering+0.312
gerald+0.297
Neg logit contributions
adders-0.409
ensical-0.365
udi-0.355
 millenn-0.354
��-0.350
Tokend
Token ID67
Feature activation+1.44e+00

Pos logit contributions
oire+6.120
 Sins+5.667
ドラ+5.461
avering+5.111
gerald+4.793
Neg logit contributions
adders-9.382
 millenn-8.352
ensical-8.326
��-8.315
udi-7.851
Tokenong
Token ID506
Feature activation+4.96e-01

Pos logit contributions
oire+1.568
 Sins+1.458
ドラ+1.415
avering+1.329
gerald+1.207
Neg logit contributions
adders-2.544
ensical-2.375
 millenn-2.324
��-2.308
udi-2.265
Token province
Token ID8473
Feature activation+1.15e+00

Pos logit contributions
oire+0.596
 Sins+0.568
ドラ+0.567
avering+0.506
gerald+0.479
Neg logit contributions
adders-0.831
ensical-0.753
udi-0.736
 millenn-0.733
��-0.724
Token turns
Token ID4962
Feature activation-1.51e+00

Pos logit contributions
oire+1.606
 Sins+1.559
ドラ+1.523
avering+1.395
gerald+1.328
Neg logit contributions
adders-1.686
ensical-1.502
 millenn-1.481
��-1.456
udi-1.456
Token redd
Token ID14688
Feature activation-3.24e-01

Pos logit contributions
adders+1.882
ensical+1.653
udi+1.612
 millenn+1.582
��+1.561
Neg logit contributions
oire-2.464
 Sins-2.378
ドラ-2.372
avering-2.184
gerald-2.111
Token no
Token ID645
Feature activation-3.44e-01

Pos logit contributions
adders+0.307
ensical+0.276
udi+0.273
 millenn+0.268
��+0.266
Neg logit contributions
oire-0.300
 Sins-0.289
ドラ-0.288
avering-0.262
gerald-0.253
Token holds
Token ID6622
Feature activation+1.08e+00

Pos logit contributions
adders+0.448
ensical+0.399
udi+0.391
 millenn+0.386
��+0.379
Neg logit contributions
oire-0.539
 Sins-0.517
ドラ-0.516
avering-0.475
gerald-0.457
Token barred
Token ID23902
Feature activation-1.16e+00

Pos logit contributions
oire+1.831
ドラ+1.740
 Sins+1.737
avering+1.649
gerald+1.577
Neg logit contributions
adders-1.262
ensical-1.112
 millenn-1.106
udi-1.092
��-1.081
Token to
Token ID284
Feature activation+8.73e-02

Pos logit contributions
adders+1.575
ensical+1.409
udi+1.385
 millenn+1.364
��+1.344
Neg logit contributions
oire-1.742
ドラ-1.675
 Sins-1.668
avering-1.531
gerald-1.476
Token the
Token ID262
Feature activation-4.74e-01

Pos logit contributions
oire+0.125
 Sins+0.121
ドラ+0.121
avering+0.111
gerald+0.105
Neg logit contributions
adders-0.124
ensical-0.112
 millenn-0.111
udi-0.110
��-0.109
Token che
Token ID1125
Feature activation+5.46e+00

Pos logit contributions
adders+0.627
ensical+0.555
udi+0.554
 millenn+0.541
��+0.536
Neg logit contributions
oire-0.728
 Sins-0.707
ドラ-0.700
avering-0.646
gerald-0.619
Tokenqu
Token ID421
Feature activation-1.53e-01

Pos logit contributions
oire+7.233
 Sins+6.814
ドラ+6.654
avering+6.579
gerald+6.170
Neg logit contributions
adders-7.576
 millenn-7.251
ensical-6.986
��-6.928
udi-6.567
Tokenered
Token ID1068
Feature activation+2.00e-01

Pos logit contributions
adders+0.224
ensical+0.203
 millenn+0.199
udi+0.199
��+0.195
Neg logit contributions
oire-0.214
 Sins-0.204
ドラ-0.204
avering-0.186
gerald-0.178
Token flag
Token ID6056
Feature activation+1.03e+00

Pos logit contributions
oire+0.366
ドラ+0.352
 Sins+0.350
avering+0.328
gerald+0.318
Neg logit contributions
adders-0.207
ensical-0.180
udi-0.177
 millenn-0.171
��-0.170
Token!
Token ID0
Feature activation+7.72e-01

Pos logit contributions
oire+1.519
 Sins+1.453
ドラ+1.452
avering+1.338
gerald+1.280
Neg logit contributions
adders-1.424
ensical-1.290
udi-1.270
 millenn-1.267
��-1.231
Token Race
Token ID12588
Feature activation-2.72e+00

Pos logit contributions
oire+0.941
 Sins+0.916
ドラ+0.895
avering+0.801
gerald+0.754
Neg logit contributions
adders-1.263
ensical-1.177
 millenn-1.165
udi-1.145
��-1.123
Tokenko
Token ID7204
Feature activation+1.64e+00

Pos logit contributions
oire+1.968
 Sins+1.868
ドラ+1.831
avering+1.709
gerald+1.620
Neg logit contributions
adders-2.101
 millenn-1.920
ensical-1.915
��-1.869
udi-1.849
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
oire+2.239
 Sins+2.145
ドラ+2.122
avering+1.962
gerald+1.859
Neg logit contributions
adders-2.410
ensical-2.196
 millenn-2.156
udi-2.120
��-2.108
Token Ser
Token ID2930
Feature activation+6.46e-01

Pos logit contributions
oire+0.253
 Sins+0.245
ドラ+0.242
avering+0.221
gerald+0.211
Neg logit contributions
adders-0.291
ensical-0.264
udi-0.256
 millenn-0.256
��-0.254
Tokenhi
Token ID5303
Feature activation+1.40e+00

Pos logit contributions
oire+0.760
ドラ+0.715
 Sins+0.708
avering+0.651
gerald+0.617
Neg logit contributions
adders-1.081
ensical-0.996
 millenn-0.983
��-0.969
udi-0.962
Tokeny
Token ID88
Feature activation-1.05e-02

Pos logit contributions
oire+1.901
ドラ+1.812
 Sins+1.806
avering+1.647
gerald+1.573
Neg logit contributions
adders-2.094
ensical-1.899
udi-1.852
 millenn-1.850
��-1.814
Token Kaz
Token ID16385
Feature activation+5.45e+00

Pos logit contributions
adders+0.017
ensical+0.015
udi+0.015
 millenn+0.015
��+0.014
Neg logit contributions
oire-0.014
ドラ-0.013
 Sins-0.013
avering-0.012
gerald-0.011
Tokenarov
Token ID42737
Feature activation+9.18e-01

Pos logit contributions
oire+8.161
 Sins+7.609
ドラ+7.509
avering+7.251
gerald+6.867
Neg logit contributions
adders-6.631
 millenn-6.344
ensical-6.054
��-5.923
udi-5.655
Token,
Token ID11
Feature activation+2.84e-01

Pos logit contributions
oire+1.363
ドラ+1.313
 Sins+1.302
avering+1.199
gerald+1.153
Neg logit contributions
adders-1.266
ensical-1.135
udi-1.108
 millenn-1.098
��-1.079
Token Vital
Token ID28476
Feature activation+4.22e-01

Pos logit contributions
oire+0.378
 Sins+0.367
ドラ+0.360
avering+0.331
gerald+0.314
Neg logit contributions
adders-0.434
ensical-0.395
udi-0.382
 millenn-0.381
��-0.379
Tokeny
Token ID88
Feature activation-2.94e-01

Pos logit contributions
oire+0.525
ドラ+0.500
 Sins+0.496
avering+0.449
gerald+0.426
Neg logit contributions
adders-0.684
ensical-0.624
udi-0.612
 millenn-0.608
��-0.600
Token Nem
Token ID22547
Feature activation+3.95e-01

Pos logit contributions
adders+0.443
ensical+0.401
udi+0.393
 millenn+0.390
��+0.384
Neg logit contributions
oire-0.400
ドラ-0.384
 Sins-0.383
avering-0.348
gerald-0.332
Token.
Token ID13
Feature activation-3.42e-01

Pos logit contributions
oire+1.101
ドラ+1.052
 Sins+1.047
avering+0.960
gerald+0.912
Neg logit contributions
adders-1.199
ensical-1.081
 millenn-1.066
udi-1.057
��-1.042
Token The
Token ID383
Feature activation-1.39e-01

Pos logit contributions
adders+0.539
ensical+0.495
 millenn+0.486
udi+0.479
��+0.470
Neg logit contributions
oire-0.442
 Sins-0.425
ドラ-0.423
avering-0.379
gerald-0.361
Token fact
Token ID1109
Feature activation+1.41e+00

Pos logit contributions
adders+0.185
ensical+0.166
udi+0.161
 millenn+0.159
��+0.158
Neg logit contributions
oire-0.213
 Sins-0.205
ドラ-0.203
avering-0.186
gerald-0.179
Token that
Token ID326
Feature activation-8.17e-01

Pos logit contributions
oire+1.839
ドラ+1.765
 Sins+1.735
avering+1.580
gerald+1.510
Neg logit contributions
adders-2.184
ensical-1.991
udi-1.949
 millenn-1.937
��-1.895
Token the
Token ID262
Feature activation+3.76e-01

Pos logit contributions
adders+1.200
ensical+1.086
udi+1.053
 millenn+1.048
��+1.035
Neg logit contributions
oire-1.143
ドラ-1.098
 Sins-1.097
avering-0.996
gerald-0.948
Token West
Token ID2688
Feature activation+5.42e+00

Pos logit contributions
oire+0.544
 Sins+0.524
ドラ+0.517
avering+0.469
gerald+0.449
Neg logit contributions
adders-0.533
ensical-0.480
udi-0.467
 millenn-0.464
��-0.460
Token Bank
Token ID5018
Feature activation+1.63e-01

Pos logit contributions
oire+4.680
 Sins+4.540
ドラ+4.275
avering+3.678
gerald+3.369
Neg logit contributions
adders-9.879
 millenn-9.851
ensical-9.604
��-9.223
udi-8.968
Token separation
Token ID14139
Feature activation-5.49e-01

Pos logit contributions
oire+0.238
ドラ+0.227
 Sins+0.227
avering+0.206
gerald+0.199
Neg logit contributions
adders-0.230
ensical-0.207
udi-0.201
 millenn-0.200
��-0.196
Token (
Token ID357
Feature activation+5.92e-01

Pos logit contributions
adders+0.734
ensical+0.657
udi+0.639
 millenn+0.638
��+0.630
Neg logit contributions
oire-0.839
ドラ-0.808
 Sins-0.797
avering-0.733
gerald-0.708
Tokenand
Token ID392
Feature activation+8.38e-01

Pos logit contributions
oire+0.772
ドラ+0.740
 Sins+0.725
avering+0.665
gerald+0.631
Neg logit contributions
adders-0.917
ensical-0.850
 millenn-0.833
udi-0.818
��-0.805
Token encro
Token ID45121
Feature activation-1.34e+00

Pos logit contributions
oire+1.205
 Sins+1.160
ドラ+1.155
avering+1.045
gerald+1.001
Neg logit contributions
adders-1.189
ensical-1.077
 millenn-1.035
udi-1.023
��-1.014
Token Let
Token ID3914
Feature activation+3.63e-01

Pos logit contributions
oire+0.147
 Sins+0.142
ドラ+0.140
avering+0.126
gerald+0.119
Neg logit contributions
adders-0.196
ensical-0.182
 millenn-0.180
udi-0.176
��-0.173
Token's
Token ID338
Feature activation-4.92e-01

Pos logit contributions
oire+0.483
ドラ+0.462
 Sins+0.461
avering+0.417
gerald+0.397
Neg logit contributions
adders-0.553
ensical-0.508
 millenn-0.493
udi-0.491
��-0.483
Token make
Token ID787
Feature activation+1.58e-01

Pos logit contributions
adders+0.725
ensical+0.661
 millenn+0.658
udi+0.639
��+0.638
Neg logit contributions
oire-0.684
 Sins-0.657
ドラ-0.651
avering-0.599
gerald-0.566
Token sure
Token ID1654
Feature activation-6.73e-02

Pos logit contributions
oire+0.222
 Sins+0.217
ドラ+0.214
avering+0.195
gerald+0.186
Neg logit contributions
adders-0.225
ensical-0.206
 millenn-0.200
udi-0.200
��-0.197
Token these
Token ID777
Feature activation-9.29e-01

Pos logit contributions
adders+0.101
ensical+0.093
udi+0.091
 millenn+0.090
��+0.089
Neg logit contributions
oire-0.090
 Sins-0.087
ドラ-0.087
avering-0.079
gerald-0.074
Token che
Token ID1125
Feature activation+5.39e+00

Pos logit contributions
adders+1.309
ensical+1.168
udi+1.154
 millenn+1.136
��+1.129
Neg logit contributions
oire-1.354
ドラ-1.305
 Sins-1.305
avering-1.190
gerald-1.135
Tokenats
Token ID1381
Feature activation+2.65e+00

Pos logit contributions
oire+6.802
 Sins+6.635
ドラ+6.400
avering+6.213
gerald+5.751
Neg logit contributions
adders-7.679
 millenn-7.368
ensical-7.205
��-7.068
udi-6.728
Token can
Token ID460
Feature activation+7.75e-02

Pos logit contributions
oire+3.475
 Sins+3.268
ドラ+3.249
avering+2.979
gerald+2.816
Neg logit contributions
adders-4.019
 millenn-3.778
��-3.661
ensical-3.626
udi-3.579
Token't
Token ID470
Feature activation+2.34e-01

Pos logit contributions
oire+0.107
ドラ+0.102
 Sins+0.101
avering+0.093
gerald+0.089
Neg logit contributions
adders-0.115
 millenn-0.104
ensical-0.104
udi-0.102
��-0.101
Token ever
Token ID1683
Feature activation+8.57e-01

Pos logit contributions
oire+0.322
 Sins+0.306
ドラ+0.305
avering+0.279
gerald+0.265
Neg logit contributions
adders-0.351
 millenn-0.317
ensical-0.315
udi-0.308
��-0.306
Token be
Token ID307
Feature activation+1.50e+00

Pos logit contributions
oire+1.148
 Sins+1.085
ドラ+1.077
avering+0.987
gerald+0.931
Neg logit contributions
adders-1.308
 millenn-1.203
ensical-1.179
udi-1.155
��-1.151
Token,
Token ID11
Feature activation-5.26e-01

Pos logit contributions
oire+0.318
 Sins+0.303
ドラ+0.302
avering+0.276
gerald+0.267
Neg logit contributions
adders-0.300
ensical-0.269
udi-0.263
 millenn-0.263
��-0.258
Token was
Token ID373
Feature activation-5.37e-01

Pos logit contributions
adders+0.749
ensical+0.679
udi+0.665
 millenn+0.656
��+0.652
Neg logit contributions
oire-0.752
 Sins-0.724
ドラ-0.722
avering-0.661
gerald-0.631
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
adders+0.792
ensical+0.713
udi+0.702
 millenn+0.692
��+0.681
Neg logit contributions
oire-0.748
ドラ-0.720
 Sins-0.711
avering-0.651
gerald-0.625
Token presence
Token ID4931
Feature activation+4.22e-01

Pos logit contributions
adders+0.330
ensical+0.294
udi+0.290
��+0.283
 millenn+0.282
Neg logit contributions
oire-0.360
ドラ-0.345
 Sins-0.343
avering-0.316
gerald-0.303
Token of
Token ID286
Feature activation+8.58e-02

Pos logit contributions
oire+0.634
ドラ+0.605
 Sins+0.605
avering+0.559
gerald+0.532
Neg logit contributions
adders-0.573
ensical-0.535
 millenn-0.528
��-0.508
udi-0.502
Token che
Token ID1125
Feature activation+5.35e+00

Pos logit contributions
oire+0.128
 Sins+0.123
ドラ+0.123
avering+0.113
gerald+0.108
Neg logit contributions
adders-0.118
ensical-0.106
udi-0.104
 millenn-0.101
��-0.101
Tokenwed
Token ID19103
Feature activation+9.21e-01

Pos logit contributions
oire+6.756
 Sins+6.200
ドラ+6.198
avering+6.148
gerald+5.622
Neg logit contributions
 millenn-7.709
adders-7.623
��-7.205
ensical-7.165
 acceptance-6.785
Token grass
Token ID8701
Feature activation-5.91e-02

Pos logit contributions
oire+1.348
ドラ+1.287
 Sins+1.281
avering+1.181
gerald+1.135
Neg logit contributions
adders-1.284
ensical-1.167
udi-1.138
��-1.121
 millenn-1.118
Token.
Token ID13
Feature activation-4.99e-01

Pos logit contributions
adders+0.086
ensical+0.077
udi+0.076
 millenn+0.076
��+0.074
Neg logit contributions
oire-0.083
ドラ-0.080
 Sins-0.079
avering-0.073
gerald-0.070
Token She
Token ID1375
Feature activation-5.55e-01

Pos logit contributions
adders+0.792
ensical+0.723
 millenn+0.710
udi+0.707
��+0.692
Neg logit contributions
oire-0.640
ドラ-0.613
 Sins-0.612
avering-0.550
gerald-0.524
Token may
Token ID743
Feature activation-1.55e+00

Pos logit contributions
adders+0.759
ensical+0.681
 millenn+0.674
udi+0.664
��+0.653
Neg logit contributions
oire-0.828
ドラ-0.794
 Sins-0.787
avering-0.728
gerald-0.703
Token overthrow
Token ID25525
Feature activation+7.71e-01

Pos logit contributions
oire+0.448
ドラ+0.431
 Sins+0.427
avering+0.392
gerald+0.373
Neg logit contributions
adders-0.464
ensical-0.419
 millenn-0.412
��-0.408
udi-0.407
Token Gaddafi
Token ID40320
Feature activation+1.55e+00

Pos logit contributions
oire+0.950
 Sins+0.907
ドラ+0.899
avering+0.811
gerald+0.770
Neg logit contributions
adders-1.254
ensical-1.153
 millenn-1.118
��-1.116
udi-1.113
Token and
Token ID290
Feature activation+3.01e-01

Pos logit contributions
oire+2.080
ドラ+1.973
 Sins+1.962
avering+1.780
gerald+1.704
Neg logit contributions
adders-2.361
ensical-2.151
 millenn-2.102
��-2.085
udi-2.072
Token essentially
Token ID6986
Feature activation+6.03e-01

Pos logit contributions
oire+0.428
 Sins+0.408
ドラ+0.406
avering+0.370
gerald+0.352
Neg logit contributions
adders-0.433
ensical-0.395
 millenn-0.385
��-0.382
udi-0.379
Token shove
Token ID38332
Feature activation+2.26e+00

Pos logit contributions
oire+0.880
ドラ+0.838
 Sins+0.834
avering+0.769
gerald+0.726
Neg logit contributions
adders-0.851
ensical-0.764
 millenn-0.751
udi-0.745
��-0.744
Token Gaz
Token ID21347
Feature activation+5.32e+00

Pos logit contributions
oire+3.138
 Sins+2.985
ドラ+2.958
avering+2.711
gerald+2.557
Neg logit contributions
adders-3.266
ensical-2.986
 millenn-2.912
��-2.880
udi-2.831
Tokenprom
Token ID16963
Feature activation+1.12e+00

Pos logit contributions
oire+4.538
 Sins+3.681
ドラ+3.642
avering+3.443
gerald+3.241
Neg logit contributions
 millenn-10.313
adders-10.039
ensical-9.866
��-9.683
��-9.179
Token out
Token ID503
Feature activation-6.44e-01

Pos logit contributions
oire+1.703
ドラ+1.631
 Sins+1.618
avering+1.485
gerald+1.433
Neg logit contributions
adders-1.504
ensical-1.352
��-1.314
 millenn-1.312
udi-1.310
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
adders+0.826
ensical+0.735
udi+0.718
 millenn+0.710
��+0.697
Neg logit contributions
oire-1.019
ドラ-0.986
 Sins-0.975
avering-0.904
gerald-0.870
Token I
Token ID314
Feature activation-9.64e-01

Pos logit contributions
adders+0.234
ensical+0.216
 millenn+0.212
udi+0.210
��+0.207
Neg logit contributions
oire-0.172
ドラ-0.164
 Sins-0.164
avering-0.146
gerald-0.139
Token mean
Token ID1612
Feature activation+4.76e-02

Pos logit contributions
adders+1.316
ensical+1.174
udi+1.159
 millenn+1.157
��+1.132
Neg logit contributions
oire-1.449
ドラ-1.391
 Sins-1.378
avering-1.274
gerald-1.225
Token region
Token ID3814
Feature activation-9.10e-02

Pos logit contributions
adders+1.195
ensical+1.065
udi+1.044
 millenn+1.026
��+1.016
Neg logit contributions
oire-1.291
ドラ-1.243
 Sins-1.237
avering-1.137
gerald-1.093
Token of
Token ID286
Feature activation+3.34e-01

Pos logit contributions
adders+0.119
ensical+0.105
udi+0.103
 millenn+0.102
��+0.100
Neg logit contributions
oire-0.142
ドラ-0.137
 Sins-0.136
avering-0.126
gerald-0.121
Token China
Token ID2807
Feature activation+1.42e+00

Pos logit contributions
oire+0.569
 Sins+0.552
ドラ+0.548
avering+0.510
gerald+0.490
Neg logit contributions
adders-0.390
ensical-0.338
udi-0.328
 millenn-0.321
��-0.319
Token in
Token ID287
Feature activation+9.44e-01

Pos logit contributions
oire+1.826
 Sins+1.753
ドラ+1.735
avering+1.574
gerald+1.479
Neg logit contributions
adders-2.239
ensical-2.028
 millenn-1.978
udi-1.972
��-1.950
Token southern
Token ID8372
Feature activation+2.01e+00

Pos logit contributions
oire+1.448
 Sins+1.420
ドラ+1.374
avering+1.288
gerald+1.222
Neg logit contributions
adders-1.250
ensical-1.127
udi-1.077
 millenn-1.068
��-1.061
Token Guang
Token ID36980
Feature activation+5.30e+00

Pos logit contributions
oire+2.604
 Sins+2.530
ドラ+2.420
avering+2.228
gerald+2.082
Neg logit contributions
adders-3.151
ensical-2.862
udi-2.780
��-2.745
 millenn-2.740
Tokend
Token ID67
Feature activation+1.41e+00

Pos logit contributions
oire+6.013
 Sins+5.418
ドラ+5.198
avering+4.901
gerald+4.659
Neg logit contributions
adders-8.880
 millenn-8.534
ensical-8.426
��-8.156
udi-7.596
Tokenong
Token ID506
Feature activation+5.98e-01

Pos logit contributions
oire+1.566
 Sins+1.434
ドラ+1.405
avering+1.307
gerald+1.207
Neg logit contributions
adders-2.465
ensical-2.338
 millenn-2.331
��-2.261
udi-2.223
Token province
Token ID8473
Feature activation+9.84e-01

Pos logit contributions
oire+0.781
ドラ+0.746
 Sins+0.743
avering+0.674
gerald+0.642
Neg logit contributions
adders-0.935
ensical-0.844
udi-0.829
 millenn-0.821
��-0.811
Token,
Token ID11
Feature activation+9.52e-02

Pos logit contributions
oire+1.314
 Sins+1.260
ドラ+1.240
avering+1.136
gerald+1.074
Neg logit contributions
adders-1.509
ensical-1.362
 millenn-1.328
udi-1.325
��-1.307
Token nest
Token ID16343
Feature activation-1.77e+00

Pos logit contributions
oire+0.130
 Sins+0.127
ドラ+0.123
avering+0.113
gerald+0.106
Neg logit contributions
adders-0.143
ensical-0.130
udi-0.126
 millenn-0.125
��-0.124
Tokenone
Token ID505
Feature activation+1.62e+00

Pos logit contributions
adders+0.282
ensical+0.257
udi+0.253
 millenn+0.250
��+0.246
Neg logit contributions
oire-0.212
ドラ-0.203
 Sins-0.200
avering-0.181
gerald-0.172
Token lamb
Token ID19343
Feature activation-1.39e-01

Pos logit contributions
oire+2.382
 Sins+2.311
ドラ+2.292
avering+2.099
gerald+2.030
Neg logit contributions
adders-2.204
ensical-1.951
udi-1.929
��-1.920
 millenn-1.885
Token ste
Token ID2876
Feature activation+7.85e-01

Pos logit contributions
adders+0.195
ensical+0.176
udi+0.172
 millenn+0.172
��+0.169
Neg logit contributions
oire-0.202
ドラ-0.193
 Sins-0.193
avering-0.177
gerald-0.170
Tokenaks
Token ID4730
Feature activation+9.37e-01

Pos logit contributions
oire+1.328
ドラ+1.273
 Sins+1.269
avering+1.206
gerald+1.154
Neg logit contributions
adders-0.890
 millenn-0.824
ensical-0.802
��-0.770
udi-0.753
Token from
Token ID422
Feature activation-4.46e-01

Pos logit contributions
oire+1.328
 Sins+1.273
ドラ+1.255
avering+1.162
gerald+1.115
Neg logit contributions
adders-1.343
ensical-1.209
 millenn-1.194
��-1.180
udi-1.178
Token Scot
Token ID7130
Feature activation+5.27e+00

Pos logit contributions
adders+0.667
ensical+0.602
udi+0.590
 millenn+0.584
��+0.578
Neg logit contributions
oire-0.611
 Sins-0.586
ドラ-0.583
avering-0.532
gerald-0.510
Tokenmid
Token ID13602
Feature activation-5.49e-01

Pos logit contributions
oire+6.905
ドラ+6.693
 Sins+6.619
avering+6.155
gerald+6.061
Neg logit contributions
adders-7.247
ensical-6.842
��-6.777
 millenn-6.715
udi-6.592
Token.'
Token ID2637
Feature activation+9.03e-02

Pos logit contributions
adders+0.791
ensical+0.712
udi+0.698
 millenn+0.693
��+0.680
Neg logit contributions
oire-0.781
ドラ-0.751
 Sins-0.744
avering-0.683
gerald-0.658
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
oire+0.130
ドラ+0.126
 Sins+0.123
avering+0.113
gerald+0.109
Neg logit contributions
adders-0.128
ensical-0.116
udi-0.114
 millenn-0.111
��-0.109
Token
Token ID198
Feature activation+9.23e-02

Pos logit contributions
adders+0.456
 millenn+0.432
ensical+0.429
��+0.406
udi+0.405
Neg logit contributions
oire-0.392
 Sins-0.372
ドラ-0.371
avering-0.340
gerald-0.323
TokenThe
Token ID464
Feature activation+7.39e-02

Pos logit contributions
oire+0.121
ドラ+0.117
 Sins+0.113
avering+0.105
gerald+0.102
Neg logit contributions
adders-0.143
ensical-0.132
 millenn-0.130
udi-0.128
��-0.127
Tokenran
Token ID2596
Feature activation+7.08e-01

Pos logit contributions
adders+1.445
ensical+1.307
udi+1.285
 millenn+1.269
��+1.251
Neg logit contributions
oire-1.284
ドラ-1.233
 Sins-1.219
avering-1.114
gerald-1.066
Tokenza
Token ID4496
Feature activation+5.44e-01

Pos logit contributions
oire+0.754
ドラ+0.707
 Sins+0.707
avering+0.616
gerald+0.579
Neg logit contributions
adders-1.276
ensical-1.184
udi-1.148
 millenn-1.148
��-1.136
Token
Token ID447
Feature activation+7.65e-02

Pos logit contributions
oire+0.771
ドラ+0.740
 Sins+0.735
avering+0.672
gerald+0.654
Neg logit contributions
adders-0.786
ensical-0.712
 millenn-0.696
udi-0.688
��-0.682
Token
Token ID247
Feature activation-1.09e-01

Pos logit contributions
oire+0.128
ドラ+0.123
 Sins+0.122
avering+0.114
gerald+0.110
Neg logit contributions
adders-0.092
ensical-0.083
udi-0.079
 millenn-0.079
��-0.075
Tokens
Token ID82
Feature activation+3.32e-01

Pos logit contributions
adders+0.179
ensical+0.164
 millenn+0.161
udi+0.159
��+0.158
Neg logit contributions
oire-0.135
ドラ-0.129
 Sins-0.127
avering-0.116
gerald-0.111
Token che
Token ID1125
Feature activation+5.27e+00

Pos logit contributions
oire+0.494
 Sins+0.474
ドラ+0.472
avering+0.434
gerald+0.421
Neg logit contributions
adders-0.458
ensical-0.412
 millenn-0.398
udi-0.398
��-0.393
Tokenc
Token ID66
Feature activation-3.08e-01

Pos logit contributions
oire+6.529
 Sins+5.952
ドラ+5.824
avering+5.784
gerald+5.305
Neg logit contributions
adders-7.857
 millenn-7.545
ensical-7.369
��-7.232
udi-6.861
Tokenkered
Token ID28970
Feature activation+1.01e+00

Pos logit contributions
adders+0.530
ensical+0.485
 millenn+0.477
udi+0.474
��+0.471
Neg logit contributions
oire-0.355
ドラ-0.335
 Sins-0.332
avering-0.298
gerald-0.284
Token history
Token ID2106
Feature activation+2.00e-01

Pos logit contributions
oire+1.454
 Sins+1.377
ドラ+1.375
avering+1.250
gerald+1.206
Neg logit contributions
adders-1.425
ensical-1.312
 millenn-1.257
udi-1.256
��-1.251
Token at
Token ID379
Feature activation-6.45e-01

Pos logit contributions
oire+0.295
 Sins+0.280
ドラ+0.280
avering+0.256
gerald+0.249
Neg logit contributions
adders-0.279
ensical-0.250
 millenn-0.247
udi-0.243
��-0.241
Token the
Token ID262
Feature activation-3.50e-01

Pos logit contributions
adders+0.955
ensical+0.862
udi+0.845
 millenn+0.837
��+0.820
Neg logit contributions
oire-0.895
ドラ-0.858
 Sins-0.855
avering-0.780
gerald-0.747
Token under
Token ID739
Feature activation-4.65e-01

Pos logit contributions
oire+0.424
 Sins+0.409
ドラ+0.406
avering+0.377
gerald+0.362
Neg logit contributions
adders-0.376
ensical-0.333
udi-0.328
 millenn-0.322
��-0.318
Tokeng
Token ID70
Feature activation+2.02e+00

Pos logit contributions
adders+0.636
ensical+0.568
udi+0.557
 millenn+0.551
��+0.539
Neg logit contributions
oire-0.697
ドラ-0.671
 Sins-0.664
avering-0.615
gerald-0.590
Tokenird
Token ID1447
Feature activation+1.15e+00

Pos logit contributions
oire+2.638
 Sins+2.444
ドラ+2.408
avering+2.373
gerald+2.165
Neg logit contributions
adders-3.069
 millenn-2.910
ensical-2.824
��-2.774
udi-2.728
Tokened
Token ID276
Feature activation+8.83e-01

Pos logit contributions
oire+1.515
 Sins+1.446
ドラ+1.431
avering+1.332
gerald+1.253
Neg logit contributions
adders-1.768
ensical-1.595
 millenn-1.573
udi-1.570
��-1.532
Token by
Token ID416
Feature activation+8.04e-01

Pos logit contributions
oire+1.321
ドラ+1.264
 Sins+1.262
avering+1.167
gerald+1.122
Neg logit contributions
adders-1.208
ensical-1.069
udi-1.062
 millenn-1.042
��-1.030
Token che
Token ID1125
Feature activation+5.24e+00

Pos logit contributions
oire+1.202
 Sins+1.165
ドラ+1.146
avering+1.063
gerald+1.030
Neg logit contributions
adders-1.090
ensical-0.978
udi-0.960
 millenn-0.937
��-0.927
Tokenwy
Token ID21768
Feature activation+9.44e-01

Pos logit contributions
oire+6.504
 Sins+5.848
avering+5.743
ドラ+5.698
gerald+5.316
Neg logit contributions
 millenn-7.760
adders-7.687
ensical-7.162
��-7.043
��-6.898
Token caramel
Token ID32913
Feature activation-2.05e+00

Pos logit contributions
oire+1.355
 Sins+1.306
ドラ+1.280
avering+1.187
gerald+1.158
Neg logit contributions
adders-1.335
ensical-1.203
udi-1.179
 millenn-1.156
��-1.143
Token mal
Token ID6428
Feature activation+3.31e-01

Pos logit contributions
adders+2.962
ensical+2.664
udi+2.615
 millenn+2.574
��+2.531
Neg logit contributions
oire-2.884
ドラ-2.793
 Sins-2.752
avering-2.526
gerald-2.429
Tokents
Token ID912
Feature activation+1.22e+00

Pos logit contributions
oire+0.348
ドラ+0.327
 Sins+0.323
avering+0.289
gerald+0.272
Neg logit contributions
adders-0.600
ensical-0.555
 millenn-0.548
udi-0.545
��-0.537
Token with
Token ID351
Feature activation-4.19e-01

Pos logit contributions
oire+1.647
ドラ+1.564
 Sins+1.556
avering+1.436
gerald+1.363
Neg logit contributions
adders-1.830
ensical-1.650
 millenn-1.643
udi-1.631
��-1.592
Token the
Token ID262
Feature activation-2.00e-01

Pos logit contributions
adders+0.897
ensical+0.810
udi+0.794
 millenn+0.781
��+0.772
Neg logit contributions
oire-0.859
 Sins-0.826
ドラ-0.825
avering-0.752
gerald-0.722
Token local
Token ID1957
Feature activation+5.84e-01

Pos logit contributions
adders+0.280
ensical+0.250
udi+0.245
 millenn+0.241
��+0.239
Neg logit contributions
oire-0.294
 Sins-0.286
ドラ-0.281
avering-0.260
gerald-0.250
Token chapter
Token ID6843
Feature activation-2.04e+00

Pos logit contributions
oire+0.857
 Sins+0.835
ドラ+0.816
avering+0.752
gerald+0.726
Neg logit contributions
adders-0.814
ensical-0.723
udi-0.706
��-0.698
 millenn-0.697
Token of
Token ID286
Feature activation-1.04e-02

Pos logit contributions
adders+2.374
ensical+2.093
udi+2.052
 millenn+2.024
��+1.957
Neg logit contributions
oire-3.413
ドラ-3.326
 Sins-3.271
avering-3.054
gerald-2.981
Token the
Token ID262
Feature activation-1.09e-01

Pos logit contributions
adders+0.016
ensical+0.014
udi+0.014
 millenn+0.014
��+0.014
Neg logit contributions
oire-0.014
 Sins-0.013
ドラ-0.013
avering-0.012
gerald-0.012
Token NAACP
Token ID39829
Feature activation-6.25e+00

Pos logit contributions
adders+0.165
ensical+0.150
udi+0.147
 millenn+0.145
��+0.144
Neg logit contributions
oire-0.147
 Sins-0.143
ドラ-0.141
avering-0.129
gerald-0.123
Token,
Token ID11
Feature activation-5.29e-02

Pos logit contributions
adders+8.019
udi+7.264
ensical+7.196
 millenn+6.892
��+6.702
Neg logit contributions
oire-9.070
ドラ-8.951
 Sins-8.545
avering-8.029
gerald-7.748
Token has
Token ID468
Feature activation+1.59e-01

Pos logit contributions
adders+0.076
ensical+0.068
udi+0.066
 millenn+0.066
��+0.066
Neg logit contributions
oire-0.075
 Sins-0.073
ドラ-0.072
avering-0.066
gerald-0.064
Token asked
Token ID1965
Feature activation+1.97e-01

Pos logit contributions
oire+0.233
 Sins+0.225
ドラ+0.223
avering+0.206
gerald+0.196
Neg logit contributions
adders-0.223
ensical-0.199
udi-0.196
 millenn-0.193
��-0.192
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
oire+0.271
 Sins+0.263
ドラ+0.258
avering+0.239
gerald+0.228
Neg logit contributions
adders-0.291
ensical-0.262
udi-0.256
��-0.255
 millenn-0.253
Token Dublin
Token ID18220
Feature activation+2.22e+00

Pos logit contributions
adders+0.216
ensical+0.196
udi+0.192
 millenn+0.189
��+0.189
Neg logit contributions
oire-0.190
 Sins-0.183
ドラ-0.179
avering-0.165
gerald-0.159
Token lot
Token ID1256
Feature activation+8.62e-01

Pos logit contributions
oire+0.887
 Sins+0.854
ドラ+0.838
avering+0.770
gerald+0.733
Neg logit contributions
adders-1.025
ensical-0.923
udi-0.897
 millenn-0.888
��-0.878
Token dirt
Token ID13647
Feature activation+1.10e+00

Pos logit contributions
oire+1.214
 Sins+1.173
ドラ+1.160
avering+1.064
gerald+1.018
Neg logit contributions
adders-1.243
ensical-1.112
udi-1.086
 millenn-1.074
��-1.071
Tokenier
Token ID959
Feature activation+3.57e-01

Pos logit contributions
oire+1.438
 Sins+1.360
ドラ+1.350
avering+1.234
gerald+1.180
Neg logit contributions
adders-1.703
ensical-1.539
 millenn-1.514
udi-1.510
��-1.496
Token,"
Token ID553
Feature activation-1.10e-02

Pos logit contributions
oire+0.485
 Sins+0.467
ドラ+0.461
avering+0.419
gerald+0.402
Neg logit contributions
adders-0.539
ensical-0.486
udi-0.474
 millenn-0.474
��-0.467
Token said
Token ID531
Feature activation-5.43e-01

Pos logit contributions
adders+0.016
ensical+0.014
udi+0.014
 millenn+0.014
��+0.014
Neg logit contributions
oire-0.015
 Sins-0.015
ドラ-0.015
avering-0.013
gerald-0.013
Token Natalie
Token ID35571
Feature activation-6.12e+00

Pos logit contributions
adders+0.830
ensical+0.749
udi+0.733
 millenn+0.728
��+0.718
Neg logit contributions
oire-0.728
ドラ-0.696
 Sins-0.696
avering-0.633
gerald-0.608
Token Koz
Token ID40772
Feature activation+2.59e+00

Pos logit contributions
adders+8.634
udi+7.750
ensical+7.500
abus+7.225
��+7.084
Neg logit contributions
ドラ-7.862
oire-7.808
 Sins-7.113
avering-6.825
gerald-6.574
Tokenl
Token ID75
Feature activation-1.08e+00

Pos logit contributions
oire+3.602
 Sins+3.325
ドラ+3.280
avering+3.111
gerald+2.976
Neg logit contributions
adders-3.728
 millenn-3.472
ensical-3.403
��-3.329
udi-3.283
Tokenowski
Token ID12079
Feature activation-1.05e-01

Pos logit contributions
adders+1.562
ensical+1.406
udi+1.381
 millenn+1.366
��+1.342
Neg logit contributions
oire-1.518
ドラ-1.460
 Sins-1.445
avering-1.325
gerald-1.272
Token,
Token ID11
Feature activation+1.19e-01

Pos logit contributions
adders+0.147
ensical+0.132
udi+0.130
 millenn+0.127
��+0.125
Neg logit contributions
oire-0.153
ドラ-0.148
 Sins-0.146
avering-0.135
gerald-0.129
Token who
Token ID508
Feature activation+2.61e-01

Pos logit contributions
oire+0.168
 Sins+0.161
ドラ+0.161
avering+0.147
gerald+0.140
Neg logit contributions
adders-0.174
ensical-0.157
udi-0.153
 millenn-0.151
��-0.149
Token in
Token ID287
Feature activation+7.69e-01

Pos logit contributions
oire+0.513
 Sins+0.490
ドラ+0.487
avering+0.445
gerald+0.430
Neg logit contributions
adders-0.527
ensical-0.473
udi-0.462
 millenn-0.461
��-0.453
Token 1945
Token ID15761
Feature activation+2.20e+00

Pos logit contributions
oire+0.983
 Sins+0.948
ドラ+0.915
avering+0.833
gerald+0.804
Neg logit contributions
adders-1.224
ensical-1.128
udi-1.091
��-1.090
 millenn-1.084
Token,
Token ID11
Feature activation-5.73e-01

Pos logit contributions
oire+2.926
 Sins+2.765
ドラ+2.740
avering+2.496
gerald+2.400
Neg logit contributions
adders-3.359
ensical-3.052
 millenn-3.007
udi-2.967
��-2.943
Token representing
Token ID10200
Feature activation-8.09e-01

Pos logit contributions
adders+0.806
ensical+0.724
udi+0.705
 millenn+0.702
��+0.693
Neg logit contributions
oire-0.843
 Sins-0.808
ドラ-0.803
avering-0.739
gerald-0.711
Token the
Token ID262
Feature activation-4.38e-01

Pos logit contributions
adders+1.176
ensical+1.061
udi+1.040
 millenn+1.025
��+1.011
Neg logit contributions
oire-1.144
ドラ-1.097
 Sins-1.097
avering-0.998
gerald-0.959
Token NAACP
Token ID39829
Feature activation-5.99e+00

Pos logit contributions
adders+0.613
ensical+0.548
udi+0.539
 millenn+0.526
��+0.522
Neg logit contributions
oire-0.643
 Sins-0.625
ドラ-0.612
avering-0.567
gerald-0.543
Token with
Token ID351
Feature activation-3.89e-01

Pos logit contributions
adders+7.625
udi+6.804
ensical+6.739
 millenn+6.560
��+6.366
Neg logit contributions
oire-8.908
ドラ-8.836
 Sins-8.527
avering-7.968
gerald-7.722
Token W
Token ID370
Feature activation+8.12e-01

Pos logit contributions
adders+0.551
ensical+0.495
udi+0.485
 millenn+0.479
��+0.474
Neg logit contributions
oire-0.566
 Sins-0.541
ドラ-0.541
avering-0.498
gerald-0.479
Token.
Token ID13
Feature activation-4.76e-01

Pos logit contributions
oire+1.108
ドラ+1.050
 Sins+1.037
avering+0.982
gerald+0.914
Neg logit contributions
adders-1.203
ensical-1.097
 millenn-1.074
udi-1.063
��-1.059
Token E
Token ID412
Feature activation-1.22e+00

Pos logit contributions
adders+0.725
ensical+0.660
 millenn+0.647
udi+0.646
��+0.632
Neg logit contributions
oire-0.641
ドラ-0.613
 Sins-0.611
avering-0.558
gerald-0.530
Token.
Token ID13
Feature activation-6.00e-01

Pos logit contributions
adders+1.753
udi+1.549
ensical+1.545
 millenn+1.502
��+1.477
Neg logit contributions
oire-1.739
ドラ-1.703
 Sins-1.660
avering-1.520
gerald-1.460
Tokenoley
Token ID48437
Feature activation+1.64e+00

Pos logit contributions
oire+0.661
ドラ+0.602
 Sins+0.600
avering+0.529
gerald+0.487
Neg logit contributions
adders-1.417
ensical-1.312
 millenn-1.291
udi-1.286
��-1.278
Token,
Token ID11
Feature activation+2.31e-01

Pos logit contributions
oire+2.156
ドラ+2.036
 Sins+2.029
avering+1.850
gerald+1.764
Neg logit contributions
adders-2.524
ensical-2.320
 millenn-2.244
udi-2.237
��-2.209
Token director
Token ID3437
Feature activation-4.94e-01

Pos logit contributions
oire+0.328
 Sins+0.313
ドラ+0.309
avering+0.287
gerald+0.272
Neg logit contributions
adders-0.336
ensical-0.307
udi-0.297
 millenn-0.294
��-0.292
Token of
Token ID286
Feature activation+5.15e-02

Pos logit contributions
adders+0.591
ensical+0.517
udi+0.508
 millenn+0.500
��+0.488
Neg logit contributions
oire-0.820
ドラ-0.800
 Sins-0.792
avering-0.731
gerald-0.713
Token fund
Token ID1814
Feature activation-1.05e+00

Pos logit contributions
oire+0.074
 Sins+0.071
ドラ+0.070
avering+0.066
gerald+0.062
Neg logit contributions
adders-0.073
ensical-0.066
udi-0.065
 millenn-0.064
��-0.063
Token raising
Token ID8620
Feature activation-5.75e+00

Pos logit contributions
adders+1.509
ensical+1.361
udi+1.337
 millenn+1.318
��+1.297
Neg logit contributions
oire-1.491
ドラ-1.431
 Sins-1.416
avering-1.303
gerald-1.247
Token for
Token ID329
Feature activation-1.26e+00

Pos logit contributions
adders+6.866
udi+5.984
ensical+5.937
 millenn+5.786
 awareness+5.734
Neg logit contributions
oire-9.152
ドラ-9.074
 Sins-8.748
avering-8.137
gerald-7.963
Token the
Token ID262
Feature activation-3.47e-01

Pos logit contributions
adders+1.801
ensical+1.623
udi+1.592
 millenn+1.564
��+1.549
Neg logit contributions
oire-1.818
 Sins-1.740
ドラ-1.735
avering-1.601
gerald-1.531
Token Bomber
Token ID38474
Feature activation+1.03e+00

Pos logit contributions
adders+0.473
ensical+0.423
udi+0.413
��+0.405
 millenn+0.404
Neg logit contributions
oire-0.523
 Sins-0.504
ドラ-0.493
avering-0.466
gerald-0.446
Token Command
Token ID9455
Feature activation+1.71e+00

Pos logit contributions
oire+1.068
ドラ+1.014
 Sins+1.005
avering+0.888
gerald+0.837
Neg logit contributions
adders-1.876
ensical-1.729
udi-1.708
 millenn-1.688
��-1.669
Token Association
Token ID5396
Feature activation-3.46e+00

Pos logit contributions
oire+2.296
 Sins+2.202
ドラ+2.152
avering+2.000
gerald+1.911
Neg logit contributions
adders-2.565
ensical-2.329
udi-2.255
 millenn-2.247
��-2.220
Token,
Token ID11
Feature activation+1.41e-01

Pos logit contributions
adders+2.138
ensical+1.919
udi+1.889
 millenn+1.856
��+1.827
Neg logit contributions
oire-2.140
ドラ-2.066
 Sins-2.041
avering-1.874
gerald-1.801
Token head
Token ID1182
Feature activation+9.27e-01

Pos logit contributions
oire+0.211
ドラ+0.203
 Sins+0.202
avering+0.186
gerald+0.177
Neg logit contributions
adders-0.195
ensical-0.176
udi-0.170
 millenn-0.168
��-0.166
Token of
Token ID286
Feature activation-2.12e-01

Pos logit contributions
oire+1.522
ドラ+1.475
 Sins+1.464
avering+1.356
gerald+1.315
Neg logit contributions
adders-1.134
ensical-0.998
udi-0.979
 millenn-0.962
��-0.943
Token environmental
Token ID6142
Feature activation-1.69e+00

Pos logit contributions
adders+0.306
ensical+0.276
udi+0.271
 millenn+0.266
��+0.263
Neg logit contributions
oire-0.303
ドラ-0.292
 Sins-0.289
avering-0.265
gerald-0.253
Token group
Token ID1448
Feature activation-3.26e+00

Pos logit contributions
adders+2.129
ensical+1.880
udi+1.848
 millenn+1.816
��+1.775
Neg logit contributions
oire-2.715
ドラ-2.630
 Sins-2.602
avering-2.414
gerald-2.334
Token Greenpeace
Token ID42793
Feature activation-5.50e+00

Pos logit contributions
adders+4.710
udi+4.201
ensical+4.199
 millenn+4.075
��+4.011
Neg logit contributions
oire-4.516
ドラ-4.365
 Sins-4.233
avering-3.939
gerald-3.831
Token
Token ID447
Feature activation+5.42e-01

Pos logit contributions
adders+7.298
udi+6.720
ensical+6.514
 millenn+6.253
 awareness+6.138
Neg logit contributions
oire-7.702
ドラ-7.449
 Sins-7.348
avering-6.839
gerald-6.687
Token
Token ID247
Feature activation+4.65e-02

Pos logit contributions
oire+0.866
ドラ+0.835
 Sins+0.829
avering+0.770
gerald+0.743
Neg logit contributions
adders-0.689
ensical-0.613
udi-0.597
 millenn-0.586
��-0.567
Tokens
Token ID82
Feature activation+3.30e-01

Pos logit contributions
oire+0.057
ドラ+0.054
 Sins+0.054
avering+0.049
gerald+0.046
Neg logit contributions
adders-0.076
ensical-0.070
 millenn-0.068
udi-0.068
��-0.067
Token International
Token ID4037
Feature activation-1.34e+00

Pos logit contributions
oire+0.465
 Sins+0.448
ドラ+0.445
avering+0.405
gerald+0.385
Neg logit contributions
adders-0.482
ensical-0.434
udi-0.424
��-0.420
 millenn-0.420
Token Nuclear
Token ID19229
Feature activation+3.59e-01

Pos logit contributions
adders+2.149
ensical+1.952
udi+1.925
 millenn+1.896
��+1.868
Neg logit contributions
oire-1.675
ドラ-1.606
 Sins-1.581
avering-1.436
gerald-1.374
Token involving
Token ID7411
Feature activation+3.01e-01

Pos logit contributions
oire+1.223
 Sins+1.162
ドラ+1.151
avering+1.061
gerald+0.995
Neg logit contributions
adders-1.313
ensical-1.191
 millenn-1.174
udi-1.171
��-1.149
Token major
Token ID1688
Feature activation-3.10e-01

Pos logit contributions
oire+0.477
 Sins+0.465
ドラ+0.457
avering+0.428
gerald+0.412
Neg logit contributions
adders-0.378
ensical-0.338
udi-0.329
 millenn-0.327
��-0.323
Token campaign
Token ID1923
Feature activation-3.24e+00

Pos logit contributions
adders+0.382
ensical+0.339
udi+0.332
 millenn+0.328
��+0.319
Neg logit contributions
oire-0.504
ドラ-0.488
 Sins-0.487
avering-0.450
gerald-0.436
Token supporters
Token ID5941
Feature activation-1.63e+00

Pos logit contributions
adders+3.329
ensical+2.832
udi+2.766
 millenn+2.689
��+2.636
Neg logit contributions
oire-5.910
ドラ-5.754
 Sins-5.702
avering-5.349
gerald-5.184
Token,
Token ID11
Feature activation-4.83e-01

Pos logit contributions
adders+2.324
ensical+2.084
udi+2.052
 millenn+2.015
��+1.983
Neg logit contributions
oire-2.346
ドラ-2.265
 Sins-2.237
avering-2.053
gerald-1.974
Token raising
Token ID8620
Feature activation-5.50e+00

Pos logit contributions
adders+0.630
ensical+0.570
 millenn+0.547
udi+0.546
��+0.542
Neg logit contributions
oire-0.747
 Sins-0.728
ドラ-0.713
avering-0.668
gerald-0.641
Token real
Token ID1103
Feature activation-2.29e-01

Pos logit contributions
adders+6.528
ensical+5.667
udi+5.567
 millenn+5.372
 awareness+5.326
Neg logit contributions
oire-9.029
ドラ-8.741
 Sins-8.575
avering-7.941
gerald-7.715
Token doubts
Token ID17188
Feature activation-1.20e-01

Pos logit contributions
adders+0.317
ensical+0.290
 millenn+0.285
udi+0.283
��+0.277
Neg logit contributions
oire-0.331
ドラ-0.322
 Sins-0.317
avering-0.292
gerald-0.281
Token about
Token ID546
Feature activation-1.00e+00

Pos logit contributions
adders+0.194
ensical+0.174
 millenn+0.174
udi+0.173
��+0.170
Neg logit contributions
oire-0.151
ドラ-0.143
 Sins-0.143
avering-0.130
gerald-0.123
Token their
Token ID511
Feature activation-9.28e-01

Pos logit contributions
adders+1.568
ensical+1.425
udi+1.402
 millenn+1.387
��+1.369
Neg logit contributions
oire-1.313
 Sins-1.252
ドラ-1.248
avering-1.137
gerald-1.085
Token impartial
Token ID32521
Feature activation+1.85e+00

Pos logit contributions
adders+1.147
ensical+1.014
udi+1.005
 millenn+0.977
��+0.975
Neg logit contributions
oire-1.514
 Sins-1.457
ドラ-1.454
avering-1.349
gerald-1.298
Token.
Token ID13
Feature activation-1.36e-01

Pos logit contributions
adders+3.932
ensical+3.526
udi+3.469
 millenn+3.363
��+3.339
Neg logit contributions
oire-4.132
ドラ-4.047
 Sins-3.956
avering-3.647
gerald-3.506
Token For
Token ID1114
Feature activation-1.93e-01

Pos logit contributions
adders+0.227
ensical+0.209
 millenn+0.207
udi+0.204
��+0.200
Neg logit contributions
oire-0.162
 Sins-0.156
ドラ-0.155
avering-0.138
gerald-0.131
Token others
Token ID1854
Feature activation+4.82e-01

Pos logit contributions
adders+0.325
ensical+0.297
udi+0.291
 millenn+0.290
��+0.285
Neg logit contributions
oire-0.229
 Sins-0.220
ドラ-0.217
avering-0.196
gerald-0.188
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
oire+0.686
ドラ+0.661
 Sins+0.653
avering+0.599
gerald+0.576
Neg logit contributions
adders-0.695
ensical-0.625
udi-0.614
 millenn-0.605
��-0.595
Token like
Token ID588
Feature activation-1.08e+00

Pos logit contributions
adders+0.155
ensical+0.140
udi+0.137
 millenn+0.136
��+0.134
Neg logit contributions
oire-0.144
 Sins-0.141
ドラ-0.139
avering-0.127
gerald-0.121
Token Natalie
Token ID35571
Feature activation-5.44e+00

Pos logit contributions
adders+1.631
ensical+1.468
udi+1.444
 millenn+1.433
��+1.409
Neg logit contributions
oire-1.463
 Sins-1.406
ドラ-1.405
avering-1.272
gerald-1.222
Token Port
Token ID4347
Feature activation-2.74e-01

Pos logit contributions
adders+6.815
ensical+5.987
udi+5.962
 millenn+5.670
��+5.551
Neg logit contributions
oire-8.686
ドラ-8.459
 Sins-8.249
avering-7.694
gerald-7.473
Tokenman
Token ID805
Feature activation+3.18e-01

Pos logit contributions
adders+0.571
ensical+0.532
 millenn+0.523
udi+0.523
��+0.517
Neg logit contributions
oire-0.214
 Sins-0.198
ドラ-0.198
avering-0.166
gerald-0.155
Token and
Token ID290
Feature activation+6.62e-01

Pos logit contributions
oire+0.461
ドラ+0.444
 Sins+0.440
avering+0.404
gerald+0.388
Neg logit contributions
adders-0.450
ensical-0.403
udi-0.396
 millenn-0.391
��-0.384
Token E
Token ID412
Feature activation-7.28e-01

Pos logit contributions
oire+0.873
 Sins+0.853
ドラ+0.840
avering+0.764
gerald+0.733
Neg logit contributions
adders-1.017
ensical-0.912
 millenn-0.894
udi-0.893
��-0.881
Tokenwan
Token ID8149
Feature activation+5.94e-01

Pos logit contributions
adders+1.235
ensical+1.129
udi+1.113
 millenn+1.099
��+1.084
Neg logit contributions
oire-0.851
ドラ-0.813
 Sins-0.801
avering-0.720
gerald-0.684
Token My
Token ID2011
Feature activation-8.56e-01

Pos logit contributions
adders+1.544
ensical+1.401
udi+1.370
 millenn+1.361
��+1.350
Neg logit contributions
oire-1.315
 Sins-1.254
ドラ-1.251
avering-1.128
gerald-1.078
Token Family
Token ID7884
Feature activation-1.81e+00

Pos logit contributions
adders+1.434
ensical+1.316
udi+1.292
 millenn+1.290
��+1.266
Neg logit contributions
oire-1.023
 Sins-0.977
ドラ-0.968
avering-0.862
gerald-0.821
Token'
Token ID6
Feature activation+6.41e-01

Pos logit contributions
adders+2.670
ensical+2.407
udi+2.370
 millenn+2.330
��+2.287
Neg logit contributions
oire-2.500
ドラ-2.414
 Sins-2.375
avering-2.180
gerald-2.091
Token dedicated
Token ID7256
Feature activation-8.05e-01

Pos logit contributions
oire+0.939
 Sins+0.912
ドラ+0.896
avering+0.819
gerald+0.799
Neg logit contributions
adders-0.896
ensical-0.804
 millenn-0.780
��-0.779
udi-0.776
Token to
Token ID284
Feature activation-2.37e-01

Pos logit contributions
adders+0.811
ensical+0.695
udi+0.677
 millenn+0.665
��+0.644
Neg logit contributions
oire-1.491
ドラ-1.451
 Sins-1.437
avering-1.348
gerald-1.308
Token raising
Token ID8620
Feature activation-5.23e+00

Pos logit contributions
adders+0.303
udi+0.264
ensical+0.264
��+0.260
 millenn+0.254
Neg logit contributions
oire-0.379
 Sins-0.366
ドラ-0.361
avering-0.339
gerald-0.323
Token awareness
Token ID9359
Feature activation-2.55e+00

Pos logit contributions
adders+3.789
ensical+3.067
udi+2.900
 millenn+2.853
 awareness+2.645
Neg logit contributions
oire-11.040
ドラ-10.802
 Sins-10.641
avering-10.071
gerald-9.858
Token of
Token ID286
Feature activation-6.35e-01

Pos logit contributions
adders+3.448
ensical+3.096
udi+3.030
 millenn+2.962
��+2.900
Neg logit contributions
oire-3.819
ドラ-3.713
 Sins-3.649
avering-3.362
gerald-3.265
Token food
Token ID2057
Feature activation-1.79e+00

Pos logit contributions
adders+0.909
ensical+0.810
udi+0.796
��+0.783
 millenn+0.781
Neg logit contributions
oire-0.916
 Sins-0.879
ドラ-0.874
avering-0.803
gerald-0.769
Token insecurity
Token ID33837
Feature activation-1.08e+00

Pos logit contributions
adders+1.979
ensical+1.713
udi+1.664
 millenn+1.643
��+1.625
Neg logit contributions
oire-3.191
ドラ-3.079
 Sins-3.060
avering-2.849
gerald-2.762
Token in
Token ID287
Feature activation+2.34e-01

Pos logit contributions
adders+1.540
ensical+1.379
udi+1.351
 millenn+1.333
��+1.324
Neg logit contributions
oire-1.576
ドラ-1.513
 Sins-1.507
avering-1.376
gerald-1.324
Token fish
Token ID5916
Feature activation+9.30e-01

Pos logit contributions
oire+0.282
ドラ+0.270
 Sins+0.269
avering+0.246
gerald+0.240
Neg logit contributions
adders-0.251
ensical-0.219
udi-0.216
 millenn-0.215
��-0.211
Token we
Token ID356
Feature activation-3.44e-01

Pos logit contributions
oire+1.258
ドラ+1.190
 Sins+1.178
avering+1.075
gerald+1.047
Neg logit contributions
adders-1.410
ensical-1.261
 millenn-1.253
udi-1.241
��-1.229
Token ate
Token ID15063
Feature activation+5.88e-03

Pos logit contributions
adders+0.496
ensical+0.445
 millenn+0.442
udi+0.436
��+0.429
Neg logit contributions
oire-0.492
ドラ-0.472
 Sins-0.463
avering-0.427
gerald-0.412
Token.
Token ID13
Feature activation-6.93e-02

Pos logit contributions
oire+0.008
ドラ+0.008
 Sins+0.008
avering+0.007
gerald+0.007
Neg logit contributions
adders-0.009
ensical-0.008
 millenn-0.008
udi-0.008
��-0.008
Token Growing
Token ID37487
Feature activation-3.73e+00

Pos logit contributions
adders+0.118
ensical+0.109
 millenn+0.107
udi+0.107
��+0.105
Neg logit contributions
oire-0.080
ドラ-0.077
 Sins-0.076
avering-0.067
gerald-0.064
Token up
Token ID510
Feature activation-5.17e+00

Pos logit contributions
adders+4.724
ensical+4.179
udi+4.113
 millenn+4.022
��+3.930
Neg logit contributions
oire-5.916
ドラ-5.755
 Sins-5.652
avering-5.267
gerald-5.075
Token in
Token ID287
Feature activation-8.30e-02

Pos logit contributions
adders+6.579
ensical+5.932
udi+5.924
 millenn+5.592
��+5.542
Neg logit contributions
oire-7.844
ドラ-7.686
 Sins-7.386
avering-6.939
gerald-6.706
Token the
Token ID262
Feature activation-6.47e-01

Pos logit contributions
adders+0.119
ensical+0.108
udi+0.105
 millenn+0.104
��+0.102
Neg logit contributions
oire-0.118
 Sins-0.114
ドラ-0.113
avering-0.103
gerald-0.099
Token ghetto
Token ID46482
Feature activation+3.47e-01

Pos logit contributions
adders+0.892
ensical+0.801
udi+0.777
 millenn+0.769
��+0.755
Neg logit contributions
oire-0.965
 Sins-0.929
ドラ-0.928
avering-0.852
gerald-0.818
Token,
Token ID11
Feature activation-3.98e-01

Pos logit contributions
oire+0.485
 Sins+0.464
ドラ+0.462
avering+0.419
gerald+0.402
Neg logit contributions
adders-0.510
ensical-0.459
 millenn-0.446
udi-0.446
��-0.440
Token we
Token ID356
Feature activation-5.14e-01

Pos logit contributions
adders+0.607
ensical+0.552
udi+0.537
 millenn+0.535
��+0.528
Neg logit contributions
oire-0.533
ドラ-0.510
 Sins-0.508
avering-0.461
gerald-0.442
Token about
Token ID546
Feature activation-7.78e-01

Pos logit contributions
adders+0.054
ensical+0.048
udi+0.048
 millenn+0.047
��+0.047
Neg logit contributions
oire-0.048
 Sins-0.046
ドラ-0.046
avering-0.041
gerald-0.040
Token her
Token ID607
Feature activation-1.28e+00

Pos logit contributions
adders+1.157
ensical+1.044
udi+1.025
 millenn+1.012
��+0.997
Neg logit contributions
oire-1.075
ドラ-1.034
 Sins-1.026
avering-0.934
gerald-0.897
Token support
Token ID1104
Feature activation-8.34e-01

Pos logit contributions
adders+1.664
ensical+1.473
udi+1.445
 millenn+1.417
��+1.404
Neg logit contributions
oire-2.024
ドラ-1.952
 Sins-1.942
avering-1.790
gerald-1.725
Token for
Token ID329
Feature activation-8.62e-01

Pos logit contributions
adders+1.055
ensical+0.939
udi+0.917
 millenn+0.912
��+0.902
Neg logit contributions
oire-1.336
ドラ-1.284
 Sins-1.272
avering-1.186
gerald-1.124
Token Planned
Token ID18451
Feature activation-2.53e+00

Pos logit contributions
adders+1.135
ensical+1.011
udi+0.992
 millenn+0.974
��+0.961
Neg logit contributions
oire-1.338
ドラ-1.290
 Sins-1.286
avering-1.186
gerald-1.141
Token Parenthood
Token ID18461
Feature activation-5.14e+00

Pos logit contributions
adders+2.867
ensical+2.494
udi+2.440
 millenn+2.394
��+2.339
Neg logit contributions
oire-4.389
ドラ-4.258
 Sins-4.217
avering-3.936
gerald-3.808
Token,
Token ID11
Feature activation-2.42e-01

Pos logit contributions
adders+6.464
udi+5.747
ensical+5.687
 millenn+5.547
 awareness+5.416
Neg logit contributions
oire-7.714
ドラ-7.538
 Sins-7.339
avering-6.901
gerald-6.639
Token the
Token ID262
Feature activation+1.45e-01

Pos logit contributions
adders+0.349
ensical+0.313
udi+0.309
 millenn+0.306
��+0.302
Neg logit contributions
oire-0.346
 Sins-0.332
ドラ-0.331
avering-0.301
gerald-0.288
Token pro
Token ID386
Feature activation-4.40e-01

Pos logit contributions
oire+0.228
 Sins+0.219
ドラ+0.218
avering+0.201
gerald+0.193
Neg logit contributions
adders-0.188
ensical-0.167
udi-0.164
 millenn-0.160
��-0.160
Token-
Token ID12
Feature activation-9.94e-01

Pos logit contributions
adders+0.711
ensical+0.648
udi+0.638
 millenn+0.631
��+0.621
Neg logit contributions
oire-0.548
ドラ-0.527
 Sins-0.517
avering-0.470
gerald-0.448
Tokenchoice
Token ID25541
Feature activation-2.95e+00

Pos logit contributions
adders+1.470
ensical+1.331
udi+1.304
 millenn+1.291
��+1.271
Neg logit contributions
oire-1.377
ドラ-1.326
 Sins-1.305
avering-1.200
gerald-1.155
Token the
Token ID262
Feature activation-1.77e-01

Pos logit contributions
oire+0.641
 Sins+0.620
ドラ+0.609
avering+0.564
gerald+0.536
Neg logit contributions
adders-0.680
ensical-0.616
udi-0.608
 millenn-0.599
��-0.590
Token anti
Token ID3098
Feature activation-1.73e+00

Pos logit contributions
adders+0.231
ensical+0.206
udi+0.202
 millenn+0.198
��+0.196
Neg logit contributions
oire-0.275
 Sins-0.266
ドラ-0.263
avering-0.244
gerald-0.235
Token-
Token ID12
Feature activation-1.04e+00

Pos logit contributions
adders+2.831
ensical+2.608
 millenn+2.545
udi+2.540
��+2.485
Neg logit contributions
oire-2.128
ドラ-2.039
 Sins-1.999
avering-1.816
gerald-1.731
Tokenabortion
Token ID32396
Feature activation-1.84e+00

Pos logit contributions
adders+1.467
ensical+1.318
udi+1.292
 millenn+1.275
��+1.253
Neg logit contributions
oire-1.528
ドラ-1.474
 Sins-1.452
avering-1.340
gerald-1.291
Token group
Token ID1448
Feature activation-2.90e+00

Pos logit contributions
adders+1.621
ensical+1.353
udi+1.313
 millenn+1.279
��+1.241
Neg logit contributions
oire-3.652
ドラ-3.556
 Sins-3.528
avering-3.322
gerald-3.231
Token Center
Token ID3337
Feature activation-5.13e+00

Pos logit contributions
adders+4.257
ensical+3.820
udi+3.758
 millenn+3.687
��+3.650
Neg logit contributions
oire-4.029
ドラ-3.891
 Sins-3.799
avering-3.499
gerald-3.372
Token for
Token ID329
Feature activation-6.78e-01

Pos logit contributions
adders+5.942
udi+5.228
ensical+5.208
 millenn+4.951
��+4.803
Neg logit contributions
oire-8.321
 Sins-7.945
ドラ-7.941
avering-7.359
gerald-7.329
Token Medical
Token ID8366
Feature activation+8.19e-01

Pos logit contributions
adders+1.173
ensical+1.073
udi+1.057
 millenn+1.045
��+1.030
Neg logit contributions
oire-0.769
ドラ-0.736
 Sins-0.717
avering-0.644
gerald-0.614
Token Progress
Token ID18387
Feature activation-1.57e+00

Pos logit contributions
oire+1.076
ドラ+1.034
 Sins+1.030
avering+0.933
gerald+0.887
Neg logit contributions
adders-1.267
ensical-1.152
udi-1.133
 millenn-1.117
��-1.102
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
adders+2.264
ensical+2.035
udi+2.005
 millenn+1.973
��+1.939
Neg logit contributions
oire-2.216
ドラ-2.138
 Sins-2.109
avering-1.938
gerald-1.858
Token
Token ID564
Feature activation-1.01e+00

Pos logit contributions
adders+0.336
ensical+0.305
udi+0.300
 millenn+0.299
��+0.293
Neg logit contributions
oire-0.274
ドラ-0.263
 Sins-0.261
avering-0.236
gerald-0.225
Token car
Token ID1097
Feature activation+7.43e-02

Pos logit contributions
adders+0.326
ensical+0.290
udi+0.284
 millenn+0.279
��+0.274
Neg logit contributions
oire-0.391
 Sins-0.378
ドラ-0.375
avering-0.346
gerald-0.336
Token killing
Token ID5170
Feature activation+1.08e+00

Pos logit contributions
oire+0.106
 Sins+0.102
ドラ+0.102
avering+0.093
gerald+0.090
Neg logit contributions
adders-0.107
ensical-0.096
udi-0.095
 millenn-0.094
��-0.092
Token Dec
Token ID4280
Feature activation+6.23e-01

Pos logit contributions
oire+1.454
 Sins+1.430
ドラ+1.365
avering+1.273
gerald+1.222
Neg logit contributions
adders-1.616
ensical-1.469
 millenn-1.440
��-1.415
udi-1.408
Tokenatur
Token ID2541
Feature activation-5.94e-02

Pos logit contributions
oire+0.861
 Sins+0.796
ドラ+0.784
avering+0.738
gerald+0.707
Neg logit contributions
adders-0.936
 millenn-0.867
ensical-0.855
��-0.840
udi-0.831
Token High
Token ID3334
Feature activation-8.87e-01

Pos logit contributions
adders+0.088
ensical+0.080
udi+0.078
 millenn+0.078
��+0.076
Neg logit contributions
oire-0.082
ドラ-0.078
 Sins-0.078
avering-0.071
gerald-0.069
Token School
Token ID3961
Feature activation-5.12e+00

Pos logit contributions
adders+1.262
ensical+1.132
udi+1.113
 millenn+1.102
��+1.078
Neg logit contributions
oire-1.279
ドラ-1.233
 Sins-1.225
avering-1.119
gerald-1.081
Token seniors
Token ID23481
Feature activation-1.90e+00

Pos logit contributions
adders+5.050
udi+4.416
ensical+4.344
 millenn+4.157
��+4.125
Neg logit contributions
oire-9.128
ドラ-9.023
 Sins-8.670
avering-8.288
gerald-7.944
Token Derek
Token ID20893
Feature activation-1.29e+00

Pos logit contributions
adders+2.802
ensical+2.527
udi+2.493
 millenn+2.443
��+2.405
Neg logit contributions
oire-2.625
ドラ-2.533
 Sins-2.481
avering-2.283
gerald-2.183
Token King
Token ID2677
Feature activation-2.38e-01

Pos logit contributions
adders+1.909
ensical+1.717
udi+1.693
 millenn+1.659
��+1.635
Neg logit contributions
oire-1.773
ドラ-1.712
 Sins-1.680
avering-1.542
gerald-1.478
Token and
Token ID290
Feature activation+2.57e-01

Pos logit contributions
adders+0.322
ensical+0.287
udi+0.281
 millenn+0.278
��+0.273
Neg logit contributions
oire-0.360
ドラ-0.347
 Sins-0.344
avering-0.317
gerald-0.306
Token Nicholas
Token ID20320
Feature activation-2.41e+00

Pos logit contributions
oire+0.344
 Sins+0.335
ドラ+0.327
avering+0.299
gerald+0.289
Neg logit contributions
adders-0.393
ensical-0.358
 millenn-0.349
udi-0.346
��-0.341
Token-
Token ID12
Feature activation-3.27e-01

Pos logit contributions
oire+1.020
ドラ+0.983
 Sins+0.957
avering+0.896
gerald+0.869
Neg logit contributions
adders-0.940
ensical-0.869
udi-0.834
��-0.831
 millenn-0.827
Tokenup
Token ID929
Feature activation-2.01e+00

Pos logit contributions
adders+0.528
ensical+0.482
udi+0.474
 millenn+0.472
��+0.463
Neg logit contributions
oire-0.407
ドラ-0.390
 Sins-0.383
avering-0.350
gerald-0.334
Token small
Token ID1402
Feature activation-2.06e+00

Pos logit contributions
adders+2.435
ensical+2.155
udi+2.104
 millenn+2.067
��+2.039
Neg logit contributions
oire-3.339
ドラ-3.227
 Sins-3.204
avering-2.973
gerald-2.877
Token donors
Token ID13334
Feature activation-7.43e-01

Pos logit contributions
adders+3.000
ensical+2.699
udi+2.657
 millenn+2.623
��+2.583
Neg logit contributions
oire-2.911
ドラ-2.808
 Sins-2.770
avering-2.540
gerald-2.443
Token,
Token ID11
Feature activation-3.15e-01

Pos logit contributions
adders+1.044
ensical+0.940
udi+0.916
 millenn+0.913
��+0.902
Neg logit contributions
oire-1.088
ドラ-1.042
 Sins-1.036
avering-0.959
gerald-0.919
Token raising
Token ID8620
Feature activation-5.11e+00

Pos logit contributions
adders+0.437
ensical+0.397
udi+0.383
��+0.381
 millenn+0.381
Neg logit contributions
oire-0.459
 Sins-0.443
ドラ-0.436
avering-0.408
gerald-0.390
Token more
Token ID517
Feature activation-6.62e-01

Pos logit contributions
adders+6.706
ensical+5.951
udi+5.817
 millenn+5.702
��+5.544
Neg logit contributions
oire-7.735
ドラ-7.551
 Sins-7.394
avering-6.869
gerald-6.594
Token than
Token ID621
Feature activation-8.86e-01

Pos logit contributions
adders+0.942
ensical+0.852
udi+0.841
��+0.829
 millenn+0.827
Neg logit contributions
oire-0.942
ドラ-0.905
 Sins-0.904
avering-0.822
gerald-0.793
Token $
Token ID720
Feature activation+1.59e+00

Pos logit contributions
adders+1.347
udi+1.211
ensical+1.210
 millenn+1.189
��+1.173
Neg logit contributions
oire-1.199
 Sins-1.146
ドラ-1.134
avering-1.038
gerald-1.003
Token23
Token ID1954
Feature activation-1.25e-01

Pos logit contributions
oire+2.057
ドラ+1.986
 Sins+1.919
avering+1.761
gerald+1.709
Neg logit contributions
 millenn-2.468
adders-2.438
ensical-2.378
��-2.212
udi-2.201
Token million
Token ID1510
Feature activation-4.44e-01

Pos logit contributions
adders+0.202
ensical+0.184
udi+0.181
 millenn+0.179
��+0.176
Neg logit contributions
oire-0.155
ドラ-0.149
 Sins-0.147
avering-0.133
gerald-0.127
Token Chan
Token ID18704
Feature activation-8.02e-01

Pos logit contributions
adders+2.825
ensical+2.530
udi+2.520
 millenn+2.423
��+2.411
Neg logit contributions
oire-2.631
ドラ-2.556
 Sins-2.478
avering-2.289
gerald-2.207
Token,
Token ID11
Feature activation+1.58e-01

Pos logit contributions
adders+1.171
ensical+1.055
udi+1.035
 millenn+1.023
��+1.007
Neg logit contributions
oire-1.126
ドラ-1.084
 Sins-1.072
avering-0.982
gerald-0.942
Token an
Token ID281
Feature activation+4.28e-01

Pos logit contributions
oire+0.237
 Sins+0.227
ドラ+0.227
avering+0.208
gerald+0.198
Neg logit contributions
adders-0.218
ensical-0.199
udi-0.190
 millenn-0.189
��-0.185
Token official
Token ID1743
Feature activation+8.28e-01

Pos logit contributions
oire+0.730
 Sins+0.705
ドラ+0.701
avering+0.653
gerald+0.622
Neg logit contributions
adders-0.494
ensical-0.442
 millenn-0.422
udi-0.416
��-0.415
Token with
Token ID351
Feature activation-5.19e-02

Pos logit contributions
oire+1.162
ドラ+1.106
 Sins+1.103
avering+1.015
gerald+0.960
Neg logit contributions
adders-1.221
ensical-1.096
udi-1.060
 millenn-1.060
��-1.057
Token Greenpeace
Token ID42793
Feature activation-5.10e+00

Pos logit contributions
adders+0.078
ensical+0.071
udi+0.069
 millenn+0.068
��+0.067
Neg logit contributions
oire-0.071
 Sins-0.068
ドラ-0.068
avering-0.062
gerald-0.059
Token in
Token ID287
Feature activation+9.83e-01

Pos logit contributions
adders+6.801
udi+6.193
ensical+6.010
 millenn+5.815
 awareness+5.700
Neg logit contributions
oire-7.328
ドラ-7.133
 Sins-6.993
avering-6.477
gerald-6.336
Token southern
Token ID8372
Feature activation+1.89e+00

Pos logit contributions
oire+1.382
 Sins+1.348
ドラ+1.301
avering+1.211
gerald+1.143
Neg logit contributions
adders-1.431
ensical-1.303
udi-1.255
 millenn-1.253
��-1.231
Token China
Token ID2807
Feature activation+1.19e+00

Pos logit contributions
oire+2.377
 Sins+2.302
ドラ+2.167
avering+2.020
gerald+1.879
Neg logit contributions
adders-3.022
ensical-2.760
udi-2.664
 millenn-2.653
��-2.650
Token.
Token ID13
Feature activation+9.66e-02

Pos logit contributions
oire+1.580
ドラ+1.507
 Sins+1.506
avering+1.360
gerald+1.298
Neg logit contributions
adders-1.834
ensical-1.664
 millenn-1.620
udi-1.615
��-1.600
Token
Token ID198
Feature activation-1.71e-01

Pos logit contributions
oire+0.133
ドラ+0.128
 Sins+0.127
avering+0.116
gerald+0.111
Neg logit contributions
adders-0.144
ensical-0.130
udi-0.127
 millenn-0.127
��-0.124
Token example
Token ID1672
Feature activation-1.05e+00

Pos logit contributions
adders+0.718
ensical+0.606
udi+0.588
 millenn+0.574
��+0.557
Neg logit contributions
oire-1.486
ドラ-1.444
 Sins-1.437
avering-1.351
gerald-1.313
Token,
Token ID11
Feature activation+3.81e-02

Pos logit contributions
adders+1.410
ensical+1.273
udi+1.247
 millenn+1.210
��+1.194
Neg logit contributions
oire-1.586
ドラ-1.517
 Sins-1.510
avering-1.389
gerald-1.340
Token in
Token ID287
Feature activation+9.98e-01

Pos logit contributions
oire+0.055
 Sins+0.053
ドラ+0.053
avering+0.048
gerald+0.047
Neg logit contributions
adders-0.054
ensical-0.048
 millenn-0.047
udi-0.047
��-0.046
Token 2010
Token ID3050
Feature activation+6.06e-02

Pos logit contributions
oire+1.274
 Sins+1.243
ドラ+1.205
avering+1.106
gerald+1.057
Neg logit contributions
adders-1.575
ensical-1.440
 millenn-1.415
udi-1.403
��-1.380
Token the
Token ID262
Feature activation-1.72e-01

Pos logit contributions
oire+0.085
ドラ+0.082
 Sins+0.081
avering+0.074
gerald+0.072
Neg logit contributions
adders-0.088
ensical-0.079
udi-0.078
 millenn-0.077
��-0.076
Token ACLU
Token ID24381
Feature activation-5.07e+00

Pos logit contributions
adders+0.254
ensical+0.229
udi+0.225
 millenn+0.223
��+0.218
Neg logit contributions
oire-0.240
 Sins-0.231
ドラ-0.229
avering-0.209
gerald-0.202
Token sued
Token ID16334
Feature activation+1.95e-01

Pos logit contributions
adders+6.194
ensical+5.540
udi+5.482
 millenn+5.282
��+5.160
Neg logit contributions
oire-7.830
ドラ-7.674
 Sins-7.503
avering-6.992
gerald-6.677
Token on
Token ID319
Feature activation-6.39e-01

Pos logit contributions
oire+0.270
ドラ+0.260
 Sins+0.259
avering+0.235
gerald+0.227
Neg logit contributions
adders-0.288
ensical-0.260
udi-0.256
 millenn-0.253
��-0.248
Token behalf
Token ID8378
Feature activation+1.33e+00

Pos logit contributions
adders+0.780
ensical+0.688
udi+0.676
 millenn+0.662
��+0.645
Neg logit contributions
oire-1.048
 Sins-1.014
ドラ-1.013
avering-0.940
gerald-0.907
Token of
Token ID286
Feature activation+1.68e-01

Pos logit contributions
oire+1.930
ドラ+1.852
 Sins+1.845
avering+1.693
gerald+1.604
Neg logit contributions
adders-1.849
ensical-1.784
 millenn-1.764
��-1.654
udi-1.619
Token a
Token ID257
Feature activation-4.27e-02

Pos logit contributions
oire+0.243
 Sins+0.235
ドラ+0.233
avering+0.213
gerald+0.207
Neg logit contributions
adders-0.240
ensical-0.215
udi-0.211
 millenn-0.208
��-0.205
Token judges
Token ID10266
Feature activation+1.57e+00

Pos logit contributions
adders+1.143
ensical+1.033
udi+1.005
 millenn+0.997
��+0.984
Neg logit contributions
oire-1.166
ドラ-1.118
 Sins-1.118
avering-1.015
gerald-0.982
Token:
Token ID25
Feature activation+2.07e+00

Pos logit contributions
oire+1.953
 Sins+1.868
ドラ+1.836
avering+1.653
gerald+1.593
Neg logit contributions
adders-2.503
ensical-2.287
 millenn-2.235
��-2.235
udi-2.219
Token Stan
Token ID7299
Feature activation+1.34e+00

Pos logit contributions
oire+2.537
 Sins+2.446
ドラ+2.384
avering+2.138
gerald+2.059
Neg logit contributions
adders-3.341
ensical-3.062
��-2.975
 millenn-2.973
udi-2.956
Token Walker
Token ID10120
Feature activation-1.99e+00

Pos logit contributions
oire+1.834
 Sins+1.751
ドラ+1.737
avering+1.596
gerald+1.516
Neg logit contributions
adders-1.989
ensical-1.793
 millenn-1.768
udi-1.758
��-1.731
Token,
Token ID11
Feature activation+3.12e-01

Pos logit contributions
adders+2.808
ensical+2.524
udi+2.494
 millenn+2.429
��+2.394
Neg logit contributions
oire-2.860
ドラ-2.760
 Sins-2.718
avering-2.509
gerald-2.405
Token Natalie
Token ID35571
Feature activation-5.06e+00

Pos logit contributions
oire+0.415
 Sins+0.401
ドラ+0.395
avering+0.358
gerald+0.345
Neg logit contributions
adders-0.480
ensical-0.436
udi-0.423
 millenn-0.422
��-0.416
Token B
Token ID347
Feature activation-8.43e-01

Pos logit contributions
adders+7.372
udi+6.653
ensical+6.534
 millenn+6.170
��+6.168
Neg logit contributions
oire-6.584
ドラ-6.523
 Sins-6.033
avering-5.729
gerald-5.521
Tokenassing
Token ID19696
Feature activation+1.02e+00

Pos logit contributions
adders+1.209
ensical+1.088
udi+1.069
 millenn+1.061
��+1.041
Neg logit contributions
oire-1.205
ドラ-1.155
 Sins-1.145
avering-1.055
gerald-1.011
Tokenth
Token ID400
Feature activation-8.64e-02

Pos logit contributions
oire+1.505
 Sins+1.431
ドラ+1.428
avering+1.323
gerald+1.275
Neg logit contributions
adders-1.408
ensical-1.254
 millenn-1.246
udi-1.210
��-1.209
Tokenwa
Token ID10247
Feature activation+4.43e-01

Pos logit contributions
adders+0.113
ensical+0.103
 millenn+0.101
��+0.098
udi+0.098
Neg logit contributions
oire-0.134
 Sins-0.127
ドラ-0.127
avering-0.119
gerald-0.113
Tokenight
Token ID432
Feature activation+2.29e+00

Pos logit contributions
oire+0.597
 Sins+0.569
ドラ+0.563
avering+0.524
gerald+0.495
Neg logit contributions
adders-0.668
ensical-0.603
 millenn-0.594
udi-0.590
��-0.584
Token out
Token ID503
Feature activation-1.13e+00

Pos logit contributions
adders+0.647
ensical+0.577
udi+0.569
 millenn+0.559
��+0.549
Neg logit contributions
oire-0.770
ドラ-0.743
 Sins-0.737
avering-0.678
gerald-0.658
Token a
Token ID257
Feature activation-3.05e-01

Pos logit contributions
adders+1.526
ensical+1.365
udi+1.344
 millenn+1.322
��+1.298
Neg logit contributions
oire-1.701
ドラ-1.641
 Sins-1.628
avering-1.498
gerald-1.447
Token television
Token ID5581
Feature activation+5.39e-01

Pos logit contributions
adders+0.398
ensical+0.356
udi+0.349
 millenn+0.341
��+0.334
Neg logit contributions
oire-0.479
ドラ-0.462
 Sins-0.459
avering-0.423
gerald-0.410
Token ad
Token ID512
Feature activation-2.69e-01

Pos logit contributions
oire+0.909
 Sins+0.861
ドラ+0.857
avering+0.802
gerald+0.774
Neg logit contributions
adders-0.655
ensical-0.580
udi-0.560
��-0.546
 millenn-0.543
Token Thursday
Token ID3635
Feature activation+9.36e-01

Pos logit contributions
adders+0.375
ensical+0.335
udi+0.328
 millenn+0.324
��+0.320
Neg logit contributions
oire-0.402
ドラ-0.385
 Sins-0.383
avering-0.351
gerald-0.337
Token raising
Token ID8620
Feature activation-5.05e+00

Pos logit contributions
oire+1.389
 Sins+1.326
ドラ+1.322
avering+1.216
gerald+1.164
Neg logit contributions
adders-1.294
ensical-1.153
udi-1.143
 millenn-1.134
��-1.111
Token questions
Token ID2683
Feature activation-3.37e-01

Pos logit contributions
adders+6.382
ensical+5.644
udi+5.441
 millenn+5.388
 awareness+5.288
Neg logit contributions
oire-7.898
ドラ-7.664
 Sins-7.535
avering-6.960
gerald-6.722
Token about
Token ID546
Feature activation-1.05e+00

Pos logit contributions
adders+0.541
ensical+0.488
udi+0.483
 millenn+0.481
��+0.473
Neg logit contributions
oire-0.426
ドラ-0.407
 Sins-0.404
avering-0.367
gerald-0.347
Token the
Token ID262
Feature activation-3.03e-01

Pos logit contributions
adders+1.542
ensical+1.393
udi+1.371
 millenn+1.350
��+1.333
Neg logit contributions
oire-1.470
ドラ-1.414
 Sins-1.406
avering-1.286
gerald-1.234
Token singer
Token ID14015
Feature activation-2.50e+00

Pos logit contributions
adders+0.409
ensical+0.365
udi+0.360
��+0.352
 millenn+0.350
Neg logit contributions
oire-0.461
 Sins-0.445
ドラ-0.442
avering-0.407
gerald-0.392
Token-
Token ID12
Feature activation-7.13e-01

Pos logit contributions
adders+3.649
ensical+3.287
udi+3.257
 millenn+3.175
��+3.123
Neg logit contributions
oire-3.464
ドラ-3.349
 Sins-3.286
avering-3.025
gerald-2.898
Tokenи
Token ID18849
Feature activation+1.45e-01

Pos logit contributions
adders+4.895
udi+4.318
ensical+4.309
 millenn+4.120
��+4.097
Neg logit contributions
oire-5.226
ドラ-5.031
 Sins-4.996
avering-4.582
gerald-4.476
Tokenя
Token ID40623
Feature activation-1.71e-01

Pos logit contributions
oire+0.165
ドラ+0.159
 Sins+0.156
avering+0.139
gerald+0.130
Neg logit contributions
adders-0.252
ensical-0.231
 millenn-0.226
udi-0.226
��-0.222
Token,
Token ID11
Feature activation-5.39e-01

Pos logit contributions
adders+0.309
ensical+0.284
udi+0.280
 millenn+0.278
��+0.275
Neg logit contributions
oire-0.182
ドラ-0.173
 Sins-0.171
avering-0.151
gerald-0.142
Token
Token ID220
Feature activation-4.02e-01

Pos logit contributions
adders+0.992
ensical+0.914
udi+0.900
 millenn+0.891
��+0.882
Neg logit contributions
oire-0.555
ドラ-0.528
 Sins-0.522
avering-0.459
gerald-0.430
Token
Token ID141
Feature activation+6.74e-01

Pos logit contributions
adders+0.768
ensical+0.712
udi+0.702
 millenn+0.696
��+0.686
Neg logit contributions
oire-0.382
ドラ-0.364
 Sins-0.356
avering-0.311
gerald-0.290
Token
Token ID229
Feature activation-4.98e+00

Pos logit contributions
oire+1.056
ドラ+1.014
 Sins+0.996
avering+0.923
gerald+0.891
Neg logit contributions
adders-0.880
ensical-0.788
udi-0.764
 millenn-0.760
��-0.727
Tokenт
Token ID20375
Feature activation-9.41e-01

Pos logit contributions
adders+8.040
udi+7.299
ensical+7.228
��+7.002
 millenn+6.967
Neg logit contributions
oire-5.756
ドラ-5.514
 Sins-5.424
gerald-4.874
avering-4.840
Tokenо
Token ID15166
Feature activation-1.34e+00

Pos logit contributions
adders+1.579
ensical+1.446
udi+1.421
 millenn+1.410
��+1.387
Neg logit contributions
oire-1.118
ドラ-1.068
 Sins-1.051
avering-0.948
gerald-0.901
Token
Token ID12466
Feature activation+1.58e+00

Pos logit contributions
adders+2.529
ensical+2.330
udi+2.303
 millenn+2.277
��+2.250
Neg logit contributions
oire-1.291
ドラ-1.221
 Sins-1.201
avering-1.053
gerald-0.989
Token
Token ID110
Feature activation-3.77e+00

Pos logit contributions
oire+2.577
ドラ+2.518
 Sins+2.455
avering+2.294
gerald+2.208
Neg logit contributions
adders-1.922
ensical-1.716
udi-1.656
 millenn-1.611
��-1.524
Token
Token ID12466
Feature activation+1.50e+00

Pos logit contributions
adders+6.496
udi+5.892
ensical+5.872
 millenn+5.736
��+5.712
Neg logit contributions
oire-4.141
ドラ-3.915
 Sins-3.888
avering-3.479
gerald-3.309
Token
Token ID251
Feature activation-3.36e-01

Pos logit contributions
adders+0.422
ensical+0.377
udi+0.371
 millenn+0.365
��+0.359
Neg logit contributions
oire-0.455
ドラ-0.439
 Sins-0.434
avering-0.400
gerald-0.385
Token
Token ID198
Feature activation-3.33e-01

Pos logit contributions
adders+0.520
ensical+0.474
 millenn+0.465
udi+0.463
��+0.453
Neg logit contributions
oire-0.442
ドラ-0.425
 Sins-0.421
avering-0.383
gerald-0.365
Token
Token ID198
Feature activation+6.68e-02

Pos logit contributions
adders+0.507
ensical+0.474
 millenn+0.473
udi+0.451
��+0.447
Neg logit contributions
oire-0.448
ドラ-0.426
 Sins-0.424
avering-0.389
gerald-0.370
Token
Token ID960
Feature activation+1.03e+00

Pos logit contributions
oire+0.090
ドラ+0.087
 Sins+0.084
avering+0.078
gerald+0.075
Neg logit contributions
adders-0.101
ensical-0.093
 millenn-0.092
udi-0.091
��-0.089
TokenAaron
Token ID34451
Feature activation-1.29e+00

Pos logit contributions
oire+1.391
ドラ+1.348
 Sins+1.333
avering+1.214
gerald+1.169
Neg logit contributions
adders-1.552
ensical-1.416
 millenn-1.381
udi-1.375
��-1.351
Token Larson
Token ID42630
Feature activation-4.96e+00

Pos logit contributions
adders+1.908
ensical+1.711
udi+1.693
 millenn+1.648
��+1.631
Neg logit contributions
oire-1.768
ドラ-1.706
 Sins-1.671
avering-1.535
gerald-1.475
Token,
Token ID11
Feature activation+7.64e-02

Pos logit contributions
adders+6.699
ensical+5.937
udi+5.921
 millenn+5.645
��+5.612
Neg logit contributions
oire-7.142
ドラ-6.806
 Sins-6.742
avering-6.300
gerald-5.906
Token associate
Token ID11602
Feature activation-9.34e-01

Pos logit contributions
oire+0.102
ドラ+0.098
 Sins+0.098
avering+0.088
gerald+0.085
Neg logit contributions
adders-0.117
ensical-0.107
udi-0.104
 millenn-0.103
��-0.102
Token editor
Token ID5464
Feature activation+1.22e+00

Pos logit contributions
adders+1.172
ensical+1.038
udi+1.002
 millenn+0.987
��+0.974
Neg logit contributions
oire-1.517
ドラ-1.463
 Sins-1.452
avering-1.347
gerald-1.299
Token (@
Token ID4275
Feature activation-1.69e-01

Pos logit contributions
oire+1.676
ドラ+1.621
 Sins+1.596
avering+1.464
gerald+1.404
Neg logit contributions
adders-1.804
ensical-1.627
udi-1.594
 millenn-1.586
��-1.547
TokenAaron
Token ID34451
Feature activation-1.50e+00

Pos logit contributions
adders+0.241
ensical+0.219
 millenn+0.214
udi+0.213
��+0.209
Neg logit contributions
oire-0.242
ドラ-0.233
 Sins-0.229
avering-0.212
gerald-0.206
Token to
Token ID284
Feature activation+1.22e-01

Pos logit contributions
adders+0.081
ensical+0.070
udi+0.069
 millenn+0.067
��+0.066
Neg logit contributions
oire-0.133
ドラ-0.129
 Sins-0.128
avering-0.119
gerald-0.115
Token zero
Token ID6632
Feature activation-5.11e-01

Pos logit contributions
oire+0.176
ドラ+0.168
 Sins+0.168
avering+0.155
gerald+0.146
Neg logit contributions
adders-0.174
 millenn-0.157
ensical-0.157
��-0.154
udi-0.153
Token-
Token ID12
Feature activation-7.05e-01

Pos logit contributions
adders+0.685
ensical+0.611
udi+0.599
 millenn+0.591
��+0.582
Neg logit contributions
oire-0.778
ドラ-0.757
 Sins-0.747
avering-0.687
gerald-0.664
Tokenout
Token ID448
Feature activation-9.62e-02

Pos logit contributions
adders+1.082
ensical+0.979
udi+0.963
 millenn+0.952
��+0.937
Neg logit contributions
oire-0.939
ドラ-0.903
 Sins-0.891
avering-0.813
gerald-0.779
Token Planned
Token ID18451
Feature activation-2.14e+00

Pos logit contributions
adders+0.146
ensical+0.133
udi+0.131
 millenn+0.130
��+0.128
Neg logit contributions
oire-0.128
 Sins-0.123
ドラ-0.122
avering-0.112
gerald-0.107
Token Parenthood
Token ID18461
Feature activation-4.95e+00

Pos logit contributions
adders+2.477
ensical+2.183
udi+2.129
 millenn+2.103
��+2.053
Neg logit contributions
oire-3.647
ドラ-3.533
 Sins-3.504
avering-3.261
gerald-3.165
Token
Token ID447
Feature activation+2.64e-02

Pos logit contributions
adders+6.252
udi+5.532
ensical+5.453
 millenn+5.340
 awareness+5.208
Neg logit contributions
oire-7.400
ドラ-7.297
 Sins-7.062
avering-6.677
gerald-6.311
Token
Token ID247
Feature activation-1.31e-01

Pos logit contributions
oire+0.044
ドラ+0.043
 Sins+0.043
avering+0.040
gerald+0.039
Neg logit contributions
adders-0.031
ensical-0.028
 millenn-0.027
udi-0.027
��-0.025
Tokens
Token ID82
Feature activation+4.60e-01

Pos logit contributions
adders+0.218
ensical+0.199
udi+0.195
 millenn+0.195
��+0.191
Neg logit contributions
oire-0.160
ドラ-0.152
 Sins-0.151
avering-0.136
gerald-0.129
Token federal
Token ID2717
Feature activation-1.08e-01

Pos logit contributions
oire+0.734
ドラ+0.708
 Sins+0.705
avering+0.648
gerald+0.631
Neg logit contributions
adders-0.585
ensical-0.516
udi-0.512
 millenn-0.498
��-0.496
Token funding
Token ID4918
Feature activation-1.33e+00

Pos logit contributions
adders+0.116
ensical+0.100
udi+0.098
 millenn+0.096
��+0.094
Neg logit contributions
oire-0.194
ドラ-0.188
 Sins-0.187
avering-0.174
gerald-0.169
Token Employees
Token ID30260
Feature activation+1.57e+00

Pos logit contributions
adders+0.559
ensical+0.508
 millenn+0.498
udi+0.498
��+0.488
Neg logit contributions
oire-0.501
 Sins-0.489
ドラ-0.477
avering-0.437
gerald-0.415
Token
Token ID198
Feature activation+6.36e-01

Pos logit contributions
oire+2.071
 Sins+1.999
ドラ+1.948
avering+1.798
gerald+1.712
Neg logit contributions
adders-2.394
ensical-2.192
 millenn-2.159
udi-2.132
��-2.120
Token
Token ID198
Feature activation+5.46e-01

Pos logit contributions
oire+0.829
ドラ+0.797
 Sins+0.789
avering+0.712
gerald+0.684
Neg logit contributions
adders-0.998
ensical-0.946
 millenn-0.945
udi-0.886
��-0.884
TokenEmail
Token ID15333
Feature activation+2.00e-01

Pos logit contributions
oire+0.726
ドラ+0.696
 Sins+0.683
avering+0.628
gerald+0.608
Neg logit contributions
adders-0.837
ensical-0.775
 millenn-0.762
udi-0.743
��-0.734
Token Print
Token ID12578
Feature activation+1.72e+00

Pos logit contributions
oire+0.256
ドラ+0.245
 Sins+0.245
avering+0.218
gerald+0.211
Neg logit contributions
adders-0.318
ensical-0.289
udi-0.282
 millenn-0.281
��-0.278
Token Whats
Token ID28556
Feature activation+3.19e+00

Pos logit contributions
oire+2.157
 Sins+2.129
ドラ+2.064
avering+1.850
gerald+1.784
Neg logit contributions
adders-2.734
ensical-2.481
 millenn-2.446
��-2.405
udi-2.397
Tokenapp
Token ID1324
Feature activation-5.91e-01

Pos logit contributions
oire+4.101
ドラ+3.986
 Sins+3.875
avering+3.600
gerald+3.388
Neg logit contributions
adders-4.807
ensical-4.418
��-4.413
 millenn-4.384
udi-4.285
Token Menu
Token ID21860
Feature activation+1.62e-01

Pos logit contributions
adders+0.913
ensical+0.823
 millenn+0.804
udi+0.804
��+0.796
Neg logit contributions
oire-0.784
 Sins-0.763
ドラ-0.754
avering-0.676
gerald-0.654
Token Whats
Token ID28556
Feature activation+3.03e+00

Pos logit contributions
oire+0.210
 Sins+0.201
ドラ+0.200
avering+0.181
gerald+0.173
Neg logit contributions
adders-0.255
ensical-0.231
udi-0.226
 millenn-0.225
��-0.223
Tokenapp
Token ID1324
Feature activation-2.89e-01

Pos logit contributions
oire+3.692
ドラ+3.549
 Sins+3.473
avering+3.229
gerald+3.045
Neg logit contributions
adders-4.649
 millenn-4.478
ensical-4.413
��-4.325
udi-4.169
Token Google
Token ID3012
Feature activation-8.43e-01

Pos logit contributions
adders+0.446
ensical+0.405
udi+0.398
 millenn+0.394
��+0.388
Neg logit contributions
oire-0.380
ドラ-0.365
 Sins-0.363
avering-0.329
gerald-0.315
Token head
Token ID1182
Feature activation+1.27e+00

Pos logit contributions
oire+0.248
 Sins+0.240
ドラ+0.236
avering+0.220
gerald+0.214
Neg logit contributions
adders-0.241
ensical-0.214
udi-0.208
��-0.207
 millenn-0.206
Token of
Token ID286
Feature activation+2.76e-02

Pos logit contributions
oire+1.864
ドラ+1.777
 Sins+1.698
avering+1.622
gerald+1.572
Neg logit contributions
adders-1.766
ensical-1.638
 millenn-1.629
udi-1.571
��-1.549
Token the
Token ID262
Feature activation+1.21e-01

Pos logit contributions
oire+0.037
 Sins+0.036
ドラ+0.036
avering+0.033
gerald+0.031
Neg logit contributions
adders-0.041
ensical-0.038
udi-0.037
 millenn-0.037
��-0.036
Token division
Token ID7297
Feature activation-2.93e-01

Pos logit contributions
oire+0.153
 Sins+0.146
ドラ+0.141
avering+0.130
gerald+0.125
Neg logit contributions
adders-0.193
ensical-0.177
udi-0.173
 millenn-0.172
��-0.170
Token that
Token ID326
Feature activation-6.94e-01

Pos logit contributions
adders+0.413
ensical+0.371
udi+0.365
 millenn+0.360
��+0.354
Neg logit contributions
oire-0.427
ドラ-0.408
 Sins-0.406
avering-0.373
gerald-0.356
Token scrutin
Token ID30441
Feature activation+3.54e+00

Pos logit contributions
adders+0.984
ensical+0.884
udi+0.865
 millenn+0.859
��+0.845
Neg logit contributions
oire-1.005
ドラ-0.965
 Sins-0.961
avering-0.874
gerald-0.850
Tokenized
Token ID1143
Feature activation+2.54e+00

Pos logit contributions
oire+3.333
ドラ+3.166
 Sins+3.109
avering+2.650
gerald+2.499
Neg logit contributions
adders-6.497
 millenn-6.377
ensical-6.311
��-6.085
udi-5.949
Token the
Token ID262
Feature activation+1.73e-01

Pos logit contributions
oire+3.506
 Sins+3.377
ドラ+3.207
avering+3.032
gerald+2.977
Neg logit contributions
adders-3.569
ensical-3.296
udi-3.164
��-3.143
 millenn-3.136
Token tea
Token ID8887
Feature activation+2.80e-01

Pos logit contributions
oire+0.247
 Sins+0.236
ドラ+0.233
avering+0.214
gerald+0.208
Neg logit contributions
adders-0.248
ensical-0.225
udi-0.219
 millenn-0.216
��-0.214
Token party
Token ID2151
Feature activation-1.91e+00

Pos logit contributions
oire+0.438
ドラ+0.426
 Sins+0.419
avering+0.389
gerald+0.377
Neg logit contributions
adders-0.361
ensical-0.321
udi-0.316
 millenn-0.311
��-0.307
Token applications
Token ID5479
Feature activation-2.58e-01

Pos logit contributions
adders+2.254
ensical+1.991
udi+1.939
 millenn+1.900
��+1.869
Neg logit contributions
oire-3.244
ドラ-3.131
 Sins-3.107
avering-2.884
gerald-2.801
Token,
Token ID11
Feature activation-4.10e-01

Pos logit contributions
adders+1.737
ensical+1.561
udi+1.533
 millenn+1.516
��+1.493
Neg logit contributions
oire-1.706
ドラ-1.638
 Sins-1.623
avering-1.487
gerald-1.425
Token he
Token ID339
Feature activation+5.53e-01

Pos logit contributions
adders+0.570
ensical+0.510
udi+0.496
 millenn+0.495
��+0.487
Neg logit contributions
oire-0.608
 Sins-0.585
ドラ-0.585
avering-0.536
gerald-0.513
Token used
Token ID973
Feature activation-5.75e-01

Pos logit contributions
oire+0.812
 Sins+0.771
ドラ+0.770
avering+0.706
gerald+0.676
Neg logit contributions
adders-0.769
 millenn-0.702
ensical-0.694
��-0.674
udi-0.670
Token the
Token ID262
Feature activation-2.97e-01

Pos logit contributions
adders+0.811
ensical+0.730
udi+0.711
 millenn+0.708
��+0.693
Neg logit contributions
oire-0.842
ドラ-0.809
 Sins-0.804
avering-0.736
gerald-0.705
Token student
Token ID3710
Feature activation-1.24e+00

Pos logit contributions
adders+0.369
ensical+0.327
udi+0.318
 millenn+0.313
��+0.312
Neg logit contributions
oire-0.487
 Sins-0.466
ドラ-0.463
avering-0.427
gerald-0.411
Token dat
Token ID4818
Feature activation+2.94e+00

Pos logit contributions
adders+1.642
ensical+1.461
udi+1.427
 millenn+1.410
��+1.390
Neg logit contributions
oire-1.931
ドラ-1.856
 Sins-1.835
avering-1.696
gerald-1.636
Tokenap
Token ID499
Feature activation-1.24e+00

Pos logit contributions
oire+4.486
ドラ+4.107
 Sins+4.052
avering+3.890
gerald+3.721
Neg logit contributions
adders-3.861
 millenn-3.645
ensical-3.527
��-3.472
udi-3.318
Tokenad
Token ID324
Feature activation-2.67e-01

Pos logit contributions
adders+1.608
ensical+1.436
 millenn+1.414
udi+1.392
��+1.385
Neg logit contributions
oire-1.937
ドラ-1.850
 Sins-1.839
avering-1.715
gerald-1.641
Token's
Token ID338
Feature activation+3.51e-01

Pos logit contributions
adders+0.334
ensical+0.295
udi+0.287
 millenn+0.285
��+0.281
Neg logit contributions
oire-0.436
ドラ-0.417
 Sins-0.414
avering-0.384
gerald-0.369
Token telephone
Token ID11426
Feature activation+1.05e+00

Pos logit contributions
oire+0.555
ドラ+0.528
 Sins+0.528
avering+0.481
gerald+0.464
Neg logit contributions
adders-0.458
ensical-0.407
udi-0.395
 millenn-0.391
��-0.389
Token function
Token ID2163
Feature activation+1.36e+00

Pos logit contributions
oire+1.741
ドラ+1.650
 Sins+1.623
avering+1.496
gerald+1.460
Neg logit contributions
adders-1.320
ensical-1.156
��-1.124
udi-1.122
 millenn-1.111
Token held
Token ID2714
Feature activation+9.99e-01

Pos logit contributions
oire+2.060
 Sins+1.977
ドラ+1.924
avering+1.758
gerald+1.705
Neg logit contributions
adders-1.952
ensical-1.798
��-1.739
udi-1.713
 millenn-1.702
Token its
Token ID663
Feature activation-6.21e-01

Pos logit contributions
oire+1.388
 Sins+1.343
ドラ+1.314
avering+1.202
gerald+1.158
Neg logit contributions
adders-1.463
ensical-1.326
udi-1.286
��-1.284
 millenn-1.264
Token first
Token ID717
Feature activation-1.70e+00

Pos logit contributions
adders+0.840
ensical+0.742
udi+0.727
��+0.710
 millenn+0.707
Neg logit contributions
oire-0.955
 Sins-0.925
ドラ-0.913
avering-0.837
gerald-0.806
Token summit
Token ID14237
Feature activation-4.74e-01

Pos logit contributions
adders+2.336
ensical+2.084
udi+2.047
 millenn+2.008
��+1.982
Neg logit contributions
oire-2.545
ドラ-2.450
 Sins-2.432
avering-2.231
gerald-2.149
Token in
Token ID287
Feature activation+7.71e-01

Pos logit contributions
adders+0.673
ensical+0.597
udi+0.580
��+0.579
 millenn+0.578
Neg logit contributions
oire-0.704
 Sins-0.670
ドラ-0.665
avering-0.607
gerald-0.584
Token Chile
Token ID17456
Feature activation+3.31e+00

Pos logit contributions
oire+0.949
 Sins+0.911
ドラ+0.876
avering+0.803
gerald+0.760
Neg logit contributions
adders-1.264
ensical-1.166
udi-1.129
��-1.119
 millenn-1.113
Token last
Token ID938
Feature activation+4.40e-01

Pos logit contributions
oire+4.488
 Sins+4.325
ドラ+4.116
avering+3.812
gerald+3.617
Neg logit contributions
adders-4.914
ensical-4.465
 millenn-4.402
��-4.335
udi-4.133
Token year
Token ID614
Feature activation+3.97e-01

Pos logit contributions
oire+0.542
 Sins+0.522
ドラ+0.511
avering+0.458
gerald+0.437
Neg logit contributions
adders-0.720
ensical-0.659
udi-0.635
��-0.633
 millenn-0.627
Token,
Token ID11
Feature activation-3.17e-01

Pos logit contributions
oire+0.563
 Sins+0.540
ドラ+0.529
avering+0.488
gerald+0.466
Neg logit contributions
adders-0.580
ensical-0.521
 millenn-0.513
udi-0.508
��-0.504
Token having
Token ID1719
Feature activation-2.81e-01

Pos logit contributions
adders+0.450
ensical+0.404
udi+0.393
 millenn+0.391
��+0.389
Neg logit contributions
oire-0.462
 Sins-0.448
ドラ-0.436
avering-0.403
gerald-0.385
Token been
Token ID587
Feature activation+1.27e+00

Pos logit contributions
adders+0.401
ensical+0.357
udi+0.349
 millenn+0.348
��+0.346
Neg logit contributions
oire-0.407
 Sins-0.392
ドラ-0.389
avering-0.358
gerald-0.342
Token 82
Token ID9415
Feature activation+9.07e-01

Pos logit contributions
oire+2.062
 Sins+1.948
ドラ+1.932
avering+1.756
gerald+1.662
Neg logit contributions
adders-2.435
ensical-2.237
��-2.199
udi-2.185
 millenn-2.165
Token,
Token ID11
Feature activation-2.56e-01

Pos logit contributions
oire+1.206
ドラ+1.161
 Sins+1.153
avering+1.049
gerald+0.993
Neg logit contributions
adders-1.384
ensical-1.277
 millenn-1.238
udi-1.225
��-1.217
Token000
Token ID830
Feature activation-5.15e-01

Pos logit contributions
adders+0.327
ensical+0.289
udi+0.284
 millenn+0.279
��+0.274
Neg logit contributions
oire-0.406
ドラ-0.393
 Sins-0.389
avering-0.360
gerald-0.348
Token who
Token ID508
Feature activation+2.77e-01

Pos logit contributions
adders+0.770
ensical+0.699
udi+0.683
 millenn+0.675
��+0.669
Neg logit contributions
oire-0.705
ドラ-0.677
 Sins-0.673
avering-0.614
gerald-0.587
Token were
Token ID547
Feature activation-2.85e-01

Pos logit contributions
oire+0.377
 Sins+0.359
ドラ+0.357
avering+0.328
gerald+0.312
Neg logit contributions
adders-0.417
ensical-0.380
 millenn-0.372
udi-0.371
��-0.368
Token handed
Token ID10158
Feature activation+3.93e+00

Pos logit contributions
adders+0.403
ensical+0.363
udi+0.353
 millenn+0.351
��+0.346
Neg logit contributions
oire-0.414
ドラ-0.399
 Sins-0.398
avering-0.367
gerald-0.349
Token
Token ID564
Feature activation-8.49e-01

Pos logit contributions
oire+5.822
ドラ+5.683
 Sins+5.652
avering+5.181
gerald+4.921
Neg logit contributions
adders-5.043
ensical-4.595
udi-4.425
��-4.372
 millenn-4.342
Token
Token ID250
Feature activation-7.84e-01

Pos logit contributions
adders+1.303
ensical+1.195
 millenn+1.166
udi+1.164
��+1.111
Neg logit contributions
oire-1.137
ドラ-1.095
 Sins-1.075
avering-0.986
gerald-0.933
Tokensevere
Token ID43070
Feature activation+7.75e-01

Pos logit contributions
adders+1.155
ensical+1.048
udi+1.023
 millenn+1.020
��+0.995
Neg logit contributions
oire-1.090
ドラ-1.050
 Sins-1.035
avering-0.952
gerald-0.914
Token disciplinary
Token ID24737
Feature activation+3.43e+00

Pos logit contributions
oire+1.244
ドラ+1.202
 Sins+1.192
avering+1.105
gerald+1.060
Neg logit contributions
adders-0.971
ensical-0.873
udi-0.847
 millenn-0.833
��-0.822
Token punishments
Token ID35518
Feature activation+2.82e+00

Pos logit contributions
oire+5.126
ドラ+5.030
 Sins+4.937
avering+4.605
gerald+4.306
Neg logit contributions
adders-4.390
ensical-4.077
 millenn-3.905
��-3.890
udi-3.862
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
oire+0.130
 Sins+0.126
ドラ+0.124
avering+0.114
gerald+0.108
Neg logit contributions
adders-0.145
ensical-0.131
udi-0.127
 millenn-0.126
��-0.126
Token Bur
Token ID5481
Feature activation+1.10e+00

Pos logit contributions
oire+0.021
 Sins+0.020
ドラ+0.020
avering+0.018
gerald+0.017
Neg logit contributions
adders-0.021
ensical-0.019
udi-0.019
 millenn-0.018
��-0.018
Tokenm
Token ID76
Feature activation-7.59e-01

Pos logit contributions
oire+0.934
 Sins+0.906
ドラ+0.864
avering+0.797
gerald+0.719
Neg logit contributions
adders-2.102
ensical-1.965
 millenn-1.958
��-1.915
udi-1.877
Tokenese
Token ID2771
Feature activation+1.69e+00

Pos logit contributions
adders+1.380
ensical+1.268
udi+1.250
 millenn+1.240
��+1.224
Neg logit contributions
oire-0.794
ドラ-0.753
 Sins-0.746
avering-0.661
gerald-0.620
Token authorities
Token ID4773
Feature activation-3.32e-01

Pos logit contributions
oire+2.230
 Sins+2.154
ドラ+2.128
avering+1.934
gerald+1.842
Neg logit contributions
adders-2.563
ensical-2.323
udi-2.267
 millenn-2.248
��-2.232
Token said
Token ID531
Feature activation+5.63e-01

Pos logit contributions
adders+0.482
ensical+0.433
udi+0.423
 millenn+0.420
��+0.418
Neg logit contributions
oire-0.472
 Sins-0.449
ドラ-0.449
avering-0.416
gerald-0.395
Token there
Token ID612
Feature activation+1.59e+00

Pos logit contributions
oire+0.731
 Sins+0.706
ドラ+0.697
avering+0.641
gerald+0.605
Neg logit contributions
adders-0.868
ensical-0.796
��-0.769
udi-0.768
 millenn-0.761
Token had
Token ID550
Feature activation+1.05e-01

Pos logit contributions
oire+2.123
ドラ+2.020
 Sins+2.013
avering+1.857
gerald+1.777
Neg logit contributions
adders-2.369
ensical-2.189
 millenn-2.107
��-2.090
udi-2.089
Token been
Token ID587
Feature activation+1.29e+00

Pos logit contributions
oire+0.157
ドラ+0.151
 Sins+0.150
avering+0.139
gerald+0.133
Neg logit contributions
adders-0.143
ensical-0.129
udi-0.125
 millenn-0.124
��-0.122
Token no
Token ID645
Feature activation+1.37e-01

Pos logit contributions
oire+1.700
 Sins+1.631
ドラ+1.591
avering+1.480
gerald+1.409
Neg logit contributions
adders-1.967
ensical-1.784
��-1.742
udi-1.729
 millenn-1.720
Token casualties
Token ID18499
Feature activation+2.35e+00

Pos logit contributions
oire+0.206
ドラ+0.197
 Sins+0.196
avering+0.181
gerald+0.173
Neg logit contributions
adders-0.187
ensical-0.166
udi-0.162
 millenn-0.161
��-0.159
Tokenaring
Token ID1723
Feature activation-1.78e+00

Pos logit contributions
oire+2.046
ドラ+1.927
 Sins+1.906
avering+1.733
gerald+1.586
Neg logit contributions
adders-3.449
ensical-3.173
 millenn-3.121
udi-3.112
��-3.097
Token such
Token ID884
Feature activation-1.05e+00

Pos logit contributions
adders+2.420
ensical+2.160
udi+2.125
 millenn+2.094
��+2.045
Neg logit contributions
oire-2.671
ドラ-2.582
 Sins-2.543
avering-2.351
gerald-2.268
Token different
Token ID1180
Feature activation-2.99e-02

Pos logit contributions
adders+1.368
ensical+1.217
udi+1.190
 millenn+1.169
��+1.156
Neg logit contributions
oire-1.646
ドラ-1.587
 Sins-1.579
avering-1.458
gerald-1.402
Token molecules
Token ID17745
Feature activation-1.08e+00

Pos logit contributions
adders+0.040
ensical+0.036
udi+0.035
 millenn+0.034
��+0.034
Neg logit contributions
oire-0.046
ドラ-0.044
 Sins-0.044
avering-0.040
gerald-0.038
Token as
Token ID355
Feature activation-4.07e-01

Pos logit contributions
adders+1.483
ensical+1.331
 millenn+1.300
udi+1.300
��+1.278
Neg logit contributions
oire-1.616
ドラ-1.555
 Sins-1.554
avering-1.423
gerald-1.367
Token minerals
Token ID21622
Feature activation+4.16e-01

Pos logit contributions
adders+0.567
ensical+0.508
udi+0.498
 millenn+0.489
��+0.488
Neg logit contributions
oire-0.598
 Sins-0.577
ドラ-0.575
avering-0.527
gerald-0.506
Token and
Token ID290
Feature activation+1.99e-01

Pos logit contributions
oire+0.605
 Sins+0.582
ドラ+0.578
avering+0.530
gerald+0.504
Neg logit contributions
adders-0.580
ensical-0.523
 millenn-0.520
udi-0.510
��-0.509
Token organ
Token ID1618
Feature activation+4.45e-01

Pos logit contributions
oire+0.305
 Sins+0.295
ドラ+0.294
avering+0.272
gerald+0.260
Neg logit contributions
adders-0.263
ensical-0.236
udi-0.230
��-0.226
 millenn-0.225
Tokenics
Token ID873
Feature activation+1.23e-01

Pos logit contributions
oire+0.731
ドラ+0.706
 Sins+0.704
avering+0.659
gerald+0.625
Neg logit contributions
adders-0.533
ensical-0.477
udi-0.464
 millenn-0.456
��-0.445
Token from
Token ID422
Feature activation-1.20e-01

Pos logit contributions
oire+0.176
 Sins+0.171
ドラ+0.168
avering+0.154
gerald+0.148
Neg logit contributions
adders-0.175
ensical-0.158
 millenn-0.155
��-0.153
udi-0.153
Token the
Token ID262
Feature activation+7.96e-02

Pos logit contributions
adders+0.163
ensical+0.146
udi+0.142
��+0.140
 millenn+0.139
Neg logit contributions
oire-0.181
 Sins-0.175
ドラ-0.173
avering-0.160
gerald-0.153
Token government
Token ID1230
Feature activation+3.63e-01

Pos logit contributions
adders+0.053
ensical+0.049
udi+0.048
 millenn+0.047
��+0.047
Neg logit contributions
oire-0.041
ドラ-0.039
 Sins-0.039
avering-0.035
gerald-0.033
Token access
Token ID1895
Feature activation-9.95e-01

Pos logit contributions
oire+0.530
ドラ+0.511
 Sins+0.502
avering+0.460
gerald+0.451
Neg logit contributions
adders-0.507
ensical-0.460
 millenn-0.449
udi-0.447
��-0.445
Tokenes
Token ID274
Feature activation+4.17e-01

Pos logit contributions
adders+2.034
ensical+1.884
udi+1.864
 millenn+1.843
��+1.824
Neg logit contributions
oire-0.814
ドラ-0.765
 Sins-0.750
avering-0.639
gerald-0.589
Token AIR
Token ID31600
Feature activation-2.67e-01

Pos logit contributions
oire+0.590
 Sins+0.564
ドラ+0.562
avering+0.514
gerald+0.497
Neg logit contributions
adders-0.602
ensical-0.547
udi-0.536
 millenn-0.534
��-0.523
Token
Token ID447
Feature activation-5.28e-03

Pos logit contributions
adders+0.377
ensical+0.341
udi+0.332
 millenn+0.329
��+0.327
Neg logit contributions
oire-0.391
ドラ-0.376
 Sins-0.372
avering-0.341
gerald-0.331
Token
Token ID247
Feature activation+3.20e-01

Pos logit contributions
adders+0.006
ensical+0.006
 millenn+0.005
udi+0.005
��+0.005
Neg logit contributions
oire-0.009
ドラ-0.009
 Sins-0.009
avering-0.008
gerald-0.008
Tokens
Token ID82
Feature activation+6.98e-01

Pos logit contributions
oire+0.377
ドラ+0.359
 Sins+0.355
avering+0.319
gerald+0.306
Neg logit contributions
adders-0.539
ensical-0.498
 millenn-0.486
udi-0.486
��-0.480
Token information
Token ID1321
Feature activation+8.77e-01

Pos logit contributions
oire+1.110
 Sins+1.076
ドラ+1.073
avering+0.985
gerald+0.958
Neg logit contributions
adders-0.874
ensical-0.787
udi-0.772
 millenn-0.759
��-0.750
Token but
Token ID475
Feature activation-7.77e-01

Pos logit contributions
oire+1.281
 Sins+1.211
ドラ+1.197
avering+1.106
gerald+1.066
Neg logit contributions
adders-1.244
ensical-1.114
 millenn-1.102
��-1.099
udi-1.084
Token the
Token ID262
Feature activation+3.04e-01

Pos logit contributions
adders+1.081
ensical+0.976
udi+0.952
 millenn+0.948
��+0.930
Neg logit contributions
oire-1.148
ドラ-1.105
 Sins-1.099
avering-1.009
gerald-0.976
Token organization
Token ID4009
Feature activation-1.29e+00

Pos logit contributions
oire+0.423
 Sins+0.406
ドラ+0.400
avering+0.368
gerald+0.355
Neg logit contributions
adders-0.447
ensical-0.404
udi-0.397
 millenn-0.392
��-0.391
Token truck
Token ID7779
Feature activation+1.21e+00

Pos logit contributions
adders+0.674
ensical+0.602
udi+0.591
 millenn+0.583
��+0.575
Neg logit contributions
oire-0.688
ドラ-0.661
 Sins-0.654
avering-0.600
gerald-0.580
Token drove
Token ID10357
Feature activation-1.06e+00

Pos logit contributions
oire+1.732
ドラ+1.660
 Sins+1.651
avering+1.513
gerald+1.463
Neg logit contributions
adders-1.737
ensical-1.543
udi-1.534
 millenn-1.529
��-1.491
Token by
Token ID416
Feature activation+3.79e-02

Pos logit contributions
adders+1.540
ensical+1.385
udi+1.359
 millenn+1.344
��+1.316
Neg logit contributions
oire-1.504
 Sins-1.462
ドラ-1.455
avering-1.319
gerald-1.260
Token,
Token ID11
Feature activation-3.81e-01

Pos logit contributions
oire+0.053
 Sins+0.051
ドラ+0.051
avering+0.046
gerald+0.044
Neg logit contributions
adders-0.055
ensical-0.050
 millenn-0.049
udi-0.049
��-0.048
Token and
Token ID290
Feature activation+1.21e-01

Pos logit contributions
adders+0.545
ensical+0.489
udi+0.483
 millenn+0.479
��+0.468
Neg logit contributions
oire-0.547
 Sins-0.528
ドラ-0.527
avering-0.482
gerald-0.457
Token a
Token ID257
Feature activation+2.21e-01

Pos logit contributions
oire+0.166
 Sins+0.162
ドラ+0.161
avering+0.145
gerald+0.138
Neg logit contributions
adders-0.180
ensical-0.161
udi-0.159
 millenn-0.159
��-0.154
Token strange
Token ID6283
Feature activation-1.36e-01

Pos logit contributions
oire+0.346
 Sins+0.335
ドラ+0.334
avering+0.306
gerald+0.296
Neg logit contributions
adders-0.291
ensical-0.256
udi-0.251
 millenn-0.246
��-0.242
Token can
Token ID460
Feature activation-5.30e-01

Pos logit contributions
adders+0.181
ensical+0.161
udi+0.157
 millenn+0.154
��+0.152
Neg logit contributions
oire-0.210
ドラ-0.202
 Sins-0.201
avering-0.184
gerald-0.178
Tokenister
Token ID1694
Feature activation-4.32e-01

Pos logit contributions
adders+0.695
ensical+0.617
udi+0.606
 millenn+0.596
��+0.585
Neg logit contributions
oire-0.824
ドラ-0.797
 Sins-0.787
avering-0.729
gerald-0.702
Token flew
Token ID13112
Feature activation+1.35e-01

Pos logit contributions
adders+0.587
ensical+0.522
udi+0.514
 millenn+0.507
��+0.500
Neg logit contributions
oire-0.656
ドラ-0.632
 Sins-0.627
avering-0.575
gerald-0.553
Token out
Token ID503
Feature activation-1.25e+00

Pos logit contributions
oire+0.195
 Sins+0.190
ドラ+0.188
avering+0.171
gerald+0.165
Neg logit contributions
adders-0.191
ensical-0.172
 millenn-0.168
udi-0.167
��-0.164
Token a
Token ID257
Feature activation+5.47e-02

Pos logit contributions
oire+1.102
 Sins+1.070
ドラ+1.056
avering+0.982
gerald+0.928
Neg logit contributions
adders-1.149
ensical-1.040
udi-1.015
 millenn-1.003
��-1.001
Token &
Token ID1222
Feature activation+1.22e+00

Pos logit contributions
oire+0.079
 Sins+0.076
ドラ+0.076
avering+0.070
gerald+0.067
Neg logit contributions
adders-0.078
ensical-0.070
udi-0.068
��-0.067
 millenn-0.067
Token'
Token ID6
Feature activation+1.63e-01

Pos logit contributions
oire+1.549
 Sins+1.479
ドラ+1.457
avering+1.358
gerald+1.287
Neg logit contributions
adders-1.917
ensical-1.752
��-1.707
 millenn-1.699
udi-1.691
Tokena
Token ID64
Feature activation-3.08e-01

Pos logit contributions
oire+0.202
 Sins+0.187
ドラ+0.186
avering+0.173
gerald+0.163
Neg logit contributions
adders-0.267
ensical-0.250
 millenn-0.247
��-0.239
udi-0.239
Token T
Token ID309
Feature activation-1.79e-01

Pos logit contributions
adders+0.466
ensical+0.421
udi+0.412
 millenn+0.409
��+0.406
Neg logit contributions
oire-0.418
 Sins-0.400
ドラ-0.398
avering-0.366
gerald-0.348
Token.
Token ID764
Feature activation+8.00e-01

Pos logit contributions
adders+0.261
ensical+0.235
udi+0.230
 millenn+0.228
��+0.226
Neg logit contributions
oire-0.253
ドラ-0.242
 Sins-0.241
avering-0.223
gerald-0.214
Token This
Token ID770
Feature activation-4.68e-01

Pos logit contributions
oire+0.982
 Sins+0.952
ドラ+0.917
avering+0.842
gerald+0.793
Neg logit contributions
adders-1.311
ensical-1.205
 millenn-1.190
udi-1.159
��-1.155
Token is
Token ID318
Feature activation-9.31e-03

Pos logit contributions
adders+0.659
ensical+0.589
udi+0.573
 millenn+0.573
��+0.568
Neg logit contributions
oire-0.692
 Sins-0.661
ドラ-0.655
avering-0.602
gerald-0.582
Token a
Token ID257
Feature activation+3.48e-01

Pos logit contributions
adders+0.013
ensical+0.011
 millenn+0.011
udi+0.011
��+0.011
Neg logit contributions
oire-0.014
 Sins-0.013
ドラ-0.013
avering-0.012
gerald-0.012
Token somewhat
Token ID6454
Feature activation-2.94e+00

Pos logit contributions
oire+0.550
 Sins+0.531
ドラ+0.523
avering+0.487
gerald+0.470
Neg logit contributions
adders-0.452
ensical-0.397
udi-0.390
 millenn-0.387
��-0.381
Token hack
Token ID8156
Feature activation+2.60e+00

Pos logit contributions
adders+3.654
ensical+3.234
udi+3.169
 millenn+3.104
��+3.043
Neg logit contributions
oire-4.754
ドラ-4.612
 Sins-4.550
avering-4.225
gerald-4.083
Token).
Token ID737
Feature activation+4.42e-01

Pos logit contributions
adders+0.043
ensical+0.039
udi+0.039
 millenn+0.038
��+0.037
Neg logit contributions
oire-0.032
ドラ-0.031
 Sins-0.031
avering-0.028
gerald-0.026
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
oire+0.572
ドラ+0.547
 Sins+0.547
avering+0.488
gerald+0.472
Neg logit contributions
adders-0.696
ensical-0.637
 millenn-0.634
udi-0.621
��-0.609
Token
Token ID198
Feature activation+1.94e-01

Pos logit contributions
adders+0.165
 millenn+0.157
ensical+0.155
��+0.146
udi+0.146
Neg logit contributions
oire-0.141
ドラ-0.134
 Sins-0.133
avering-0.122
gerald-0.116
TokenFrom
Token ID4863
Feature activation-2.12e-01

Pos logit contributions
oire+0.254
ドラ+0.246
 Sins+0.238
avering+0.218
gerald+0.213
Neg logit contributions
adders-0.301
ensical-0.280
 millenn-0.279
udi-0.269
��-0.264
Token the
Token ID262
Feature activation-4.41e-01

Pos logit contributions
adders+0.308
ensical+0.277
udi+0.272
 millenn+0.271
��+0.264
Neg logit contributions
oire-0.298
 Sins-0.287
ドラ-0.287
avering-0.260
gerald-0.251
Token start
Token ID923
Feature activation-1.41e+00

Pos logit contributions
adders+0.548
ensical+0.477
udi+0.469
 millenn+0.466
��+0.455
Neg logit contributions
oire-0.719
ドラ-0.698
 Sins-0.693
avering-0.640
gerald-0.620
Token,
Token ID11
Feature activation-2.76e-02

Pos logit contributions
adders+1.961
ensical+1.760
udi+1.732
 millenn+1.692
��+1.669
Neg logit contributions
oire-2.056
ドラ-1.984
 Sins-1.959
avering-1.810
gerald-1.738
Token Hitchcock
Token ID49817
Feature activation+1.45e+00

Pos logit contributions
adders+0.045
ensical+0.041
udi+0.040
 millenn+0.040
��+0.039
Neg logit contributions
oire-0.034
ドラ-0.033
 Sins-0.032
avering-0.029
gerald-0.028
Token seemed
Token ID3947
Feature activation-3.73e+00

Pos logit contributions
oire+2.121
ドラ+2.032
 Sins+2.012
avering+1.846
gerald+1.805
Neg logit contributions
adders-2.007
ensical-1.804
 millenn-1.787
udi-1.733
��-1.733
Token intent
Token ID6824
Feature activation-1.21e+00

Pos logit contributions
adders+3.909
ensical+3.365
udi+3.301
 millenn+3.214
��+3.124
Neg logit contributions
oire-6.651
ドラ-6.471
 Sins-6.415
avering-5.942
gerald-5.804
Token on
Token ID319
Feature activation-8.66e-01

Pos logit contributions
adders+1.629
udi+1.447
ensical+1.443
 millenn+1.403
��+1.381
Neg logit contributions
oire-1.813
ドラ-1.765
 Sins-1.750
avering-1.585
gerald-1.548
Tokenhan
Token ID7637
Feature activation+1.27e+00

Pos logit contributions
oire+2.173
 Sins+2.031
ドラ+2.024
avering+1.907
gerald+1.803
Neg logit contributions
adders-2.046
 millenn-1.867
ensical-1.858
��-1.806
udi-1.766
Tokenem
Token ID368
Feature activation-8.33e-01

Pos logit contributions
oire+1.905
 Sins+1.798
ドラ+1.791
avering+1.660
gerald+1.599
Neg logit contributions
adders-1.737
ensical-1.563
 millenn-1.549
��-1.496
udi-1.495
Token warned
Token ID7728
Feature activation-5.63e-01

Pos logit contributions
adders+1.224
ensical+1.102
udi+1.086
 millenn+1.068
��+1.051
Neg logit contributions
oire-1.162
ドラ-1.120
 Sins-1.106
avering-1.013
gerald-0.972
Token an
Token ID281
Feature activation+1.14e-01

Pos logit contributions
adders+0.849
ensical+0.767
udi+0.753
 millenn+0.746
��+0.736
Neg logit contributions
oire-0.763
ドラ-0.733
 Sins-0.726
avering-0.663
gerald-0.634
Token Exxon
Token ID27879
Feature activation+1.35e-02

Pos logit contributions
oire+0.166
ドラ+0.159
 Sins+0.159
avering+0.146
gerald+0.139
Neg logit contributions
adders-0.161
ensical-0.144
udi-0.141
 millenn-0.141
��-0.140
Token Mobil
Token ID45951
Feature activation-1.92e+00

Pos logit contributions
oire+0.019
ドラ+0.019
 Sins+0.018
avering+0.017
gerald+0.016
Neg logit contributions
adders-0.019
ensical-0.017
udi-0.017
 millenn-0.017
��-0.017
Token executive
Token ID4640
Feature activation+1.98e+00

Pos logit contributions
adders+2.577
ensical+2.301
udi+2.262
 millenn+2.225
��+2.171
Neg logit contributions
oire-2.908
ドラ-2.813
 Sins-2.776
avering-2.566
gerald-2.470
Token that
Token ID326
Feature activation-9.74e-01

Pos logit contributions
oire+2.574
 Sins+2.437
ドラ+2.429
avering+2.215
gerald+2.114
Neg logit contributions
adders-3.063
ensical-2.761
��-2.733
udi-2.699
 millenn-2.692
Token Libya
Token ID13465
Feature activation+4.41e-01

Pos logit contributions
adders+1.466
ensical+1.324
udi+1.298
 millenn+1.284
��+1.268
Neg logit contributions
oire-1.326
ドラ-1.275
 Sins-1.265
avering-1.154
gerald-1.102
Token might
Token ID1244
Feature activation-7.47e-01

Pos logit contributions
oire+0.640
ドラ+0.617
 Sins+0.612
avering+0.560
gerald+0.538
Neg logit contributions
adders-0.620
ensical-0.560
udi-0.546
 millenn-0.541
��-0.535
Token
Token ID564
Feature activation+7.88e-02

Pos logit contributions
adders+1.034
ensical+0.927
 millenn+0.913
udi+0.908
��+0.890
Neg logit contributions
oire-1.106
ドラ-1.068
 Sins-1.060
avering-0.979
gerald-0.932
TokenHe
Token ID1544
Feature activation+1.44e+00

Pos logit contributions
oire+0.173
ドラ+0.166
 Sins+0.164
avering+0.151
gerald+0.145
Neg logit contributions
adders-0.178
ensical-0.161
udi-0.157
 millenn-0.157
��-0.153
Token takes
Token ID2753
Feature activation-6.86e-02

Pos logit contributions
oire+2.078
ドラ+1.973
 Sins+1.961
avering+1.812
gerald+1.729
Neg logit contributions
adders-2.016
 millenn-1.868
ensical-1.825
udi-1.777
��-1.777
Token her
Token ID607
Feature activation-1.50e+00

Pos logit contributions
adders+0.102
ensical+0.092
udi+0.091
 millenn+0.090
��+0.090
Neg logit contributions
oire-0.095
 Sins-0.090
ドラ-0.090
avering-0.082
gerald-0.078
Token arm
Token ID3211
Feature activation+4.34e-01

Pos logit contributions
adders+1.951
ensical+1.734
udi+1.695
 millenn+1.670
��+1.660
Neg logit contributions
oire-2.373
ドラ-2.288
 Sins-2.272
avering-2.095
gerald-2.017
Token,
Token ID11
Feature activation-3.47e-01

Pos logit contributions
oire+0.647
ドラ+0.608
 Sins+0.602
avering+0.559
gerald+0.530
Neg logit contributions
adders-0.606
ensical-0.542
 millenn-0.540
udi-0.532
��-0.524
Token brings
Token ID6774
Feature activation-5.90e-01

Pos logit contributions
adders+0.475
ensical+0.424
udi+0.417
 millenn+0.416
��+0.410
Neg logit contributions
oire-0.518
 Sins-0.498
ドラ-0.495
avering-0.458
gerald-0.435
Token her
Token ID607
Feature activation-1.43e+00

Pos logit contributions
adders+0.899
ensical+0.815
udi+0.803
 millenn+0.797
��+0.786
Neg logit contributions
oire-0.799
ドラ-0.761
 Sins-0.756
avering-0.687
gerald-0.657
Token gently
Token ID15165
Feature activation-1.53e+00

Pos logit contributions
adders+1.874
ensical+1.667
udi+1.635
 millenn+1.613
��+1.605
Neg logit contributions
oire-2.247
ドラ-2.152
 Sins-2.143
avering-1.974
gerald-1.901
Token towards
Token ID3371
Feature activation-1.28e+00

Pos logit contributions
adders+1.920
ensical+1.697
udi+1.663
 millenn+1.648
��+1.618
Neg logit contributions
oire-2.484
ドラ-2.396
 Sins-2.377
avering-2.206
gerald-2.126
Token the
Token ID262
Feature activation-3.42e-01

Pos logit contributions
adders+1.729
ensical+1.545
udi+1.516
 millenn+1.496
��+1.472
Neg logit contributions
oire-1.946
ドラ-1.872
 Sins-1.859
avering-1.714
gerald-1.650
Token consulting
Token ID18158
Feature activation+1.45e-01

Pos logit contributions
adders+0.440
ensical+0.394
udi+0.385
 millenn+0.378
��+0.377
Neg logit contributions
oire-0.544
 Sins-0.519
ドラ-0.517
avering-0.481
gerald-0.460
Token may
Token ID743
Feature activation-1.12e+00

Pos logit contributions
adders+0.042
ensical+0.038
 millenn+0.038
udi+0.038
��+0.037
Neg logit contributions
oire-0.045
ドラ-0.042
 Sins-0.042
avering-0.039
gerald-0.037
Token be
Token ID307
Feature activation+1.02e+00

Pos logit contributions
adders+1.513
ensical+1.352
udi+1.327
 millenn+1.318
��+1.288
Neg logit contributions
oire-1.703
ドラ-1.635
 Sins-1.623
avering-1.502
gerald-1.441
Token able
Token ID1498
Feature activation-4.25e-01

Pos logit contributions
oire+1.558
 Sins+1.476
ドラ+1.468
avering+1.384
gerald+1.315
Neg logit contributions
adders-1.346
ensical-1.217
 millenn-1.204
udi-1.187
��-1.171
Token to
Token ID284
Feature activation+2.43e-01

Pos logit contributions
adders+0.474
 millenn+0.432
ensical+0.429
��+0.411
udi+0.406
Neg logit contributions
oire-0.741
ドラ-0.716
 Sins-0.703
avering-0.664
gerald-0.637
Token give
Token ID1577
Feature activation+1.17e+00

Pos logit contributions
oire+0.370
ドラ+0.351
 Sins+0.347
avering+0.324
gerald+0.308
Neg logit contributions
adders-0.326
ensical-0.293
 millenn-0.293
udi-0.286
��-0.285
Token.
Token ID13
Feature activation-3.10e-01

Pos logit contributions
oire+1.648
ドラ+1.556
 Sins+1.555
avering+1.423
gerald+1.346
Neg logit contributions
adders-1.689
ensical-1.528
 millenn-1.508
udi-1.493
��-1.493
Token '
Token ID705
Feature activation+7.63e-01

Pos logit contributions
adders+0.494
ensical+0.454
 millenn+0.445
udi+0.443
��+0.435
Neg logit contributions
oire-0.397
 Sins-0.380
ドラ-0.377
avering-0.340
gerald-0.324
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
oire+1.014
 Sins+0.970
ドラ+0.965
avering+0.875
gerald+0.828
Neg logit contributions
adders-1.170
ensical-1.073
 millenn-1.046
udi-1.040
��-1.024
Token
Token ID198
Feature activation+4.72e-02

Pos logit contributions
adders+0.163
 millenn+0.153
ensical+0.152
udi+0.145
��+0.144
Neg logit contributions
oire-0.143
ドラ-0.136
 Sins-0.135
avering-0.124
gerald-0.118
TokenYou
Token ID1639
Feature activation+7.67e-01

Pos logit contributions
oire+0.062
ドラ+0.060
 Sins+0.058
avering+0.054
gerald+0.051
Neg logit contributions
adders-0.073
ensical-0.067
 millenn-0.066
udi-0.065
��-0.064
Token cast
Token ID3350
Feature activation+6.59e-01

Pos logit contributions
oire+1.088
 Sins+1.031
ドラ+1.026
avering+0.941
gerald+0.892
Neg logit contributions
adders-1.119
ensical-1.006
 millenn-1.005
udi-0.991
��-0.982
Token
Token ID447
Feature activation-6.92e-03

Pos logit contributions
oire+2.154
ドラ+2.048
 Sins+2.020
avering+1.850
gerald+1.789
Neg logit contributions
adders-2.234
ensical-1.994
 millenn-1.961
udi-1.942
��-1.921
Token
Token ID251
Feature activation-2.50e-01

Pos logit contributions
adders+0.008
ensical+0.007
udi+0.007
 millenn+0.007
��+0.006
Neg logit contributions
oire-0.012
ドラ-0.012
 Sins-0.011
avering-0.011
gerald-0.010
Token in
Token ID287
Feature activation+9.95e-01

Pos logit contributions
adders+0.342
ensical+0.305
udi+0.299
 millenn+0.298
��+0.290
Neg logit contributions
oire-0.378
ドラ-0.360
 Sins-0.358
avering-0.329
gerald-0.317
Token public
Token ID1171
Feature activation-3.15e-01

Pos logit contributions
oire+1.581
 Sins+1.529
ドラ+1.498
avering+1.408
gerald+1.338
Neg logit contributions
adders-1.267
ensical-1.121
udi-1.078
 millenn-1.067
��-1.063
Token speeches
Token ID18138
Feature activation-1.10e+00

Pos logit contributions
adders+0.468
ensical+0.421
udi+0.412
 millenn+0.408
��+0.405
Neg logit contributions
oire-0.439
ドラ-0.419
 Sins-0.416
avering-0.379
gerald-0.361
Token.
Token ID13
Feature activation-3.26e-01

Pos logit contributions
adders+1.619
ensical+1.457
udi+1.433
 millenn+1.416
��+1.392
Neg logit contributions
oire-1.535
ドラ-1.475
 Sins-1.460
avering-1.335
gerald-1.279
Token
Token ID198
Feature activation-4.10e-01

Pos logit contributions
adders+0.479
ensical+0.432
udi+0.425
 millenn+0.418
��+0.412
Neg logit contributions
oire-0.454
ドラ-0.437
 Sins-0.431
avering-0.395
gerald-0.379
Token
Token ID198
Feature activation-1.71e-02

Pos logit contributions
adders+0.621
ensical+0.579
 millenn+0.577
udi+0.553
��+0.548
Neg logit contributions
oire-0.554
ドラ-0.529
 Sins-0.526
avering-0.481
gerald-0.458
TokenNewsletter
Token ID33031
Feature activation-2.81e-01

Pos logit contributions
adders+0.025
ensical+0.024
 millenn+0.023
udi+0.023
��+0.022
Neg logit contributions
oire-0.023
ドラ-0.023
 Sins-0.022
avering-0.020
gerald-0.020
Token Sign
Token ID5865
Feature activation+1.49e+00

Pos logit contributions
adders+0.494
ensical+0.458
 millenn+0.449
udi+0.447
��+0.437
Neg logit contributions
oire-0.310
 Sins-0.295
ドラ-0.294
avering-0.261
gerald-0.247
Token Up
Token ID3205
Feature activation-3.64e+00

Pos logit contributions
oire+1.386
ドラ+1.327
 Sins+1.323
avering+1.146
gerald+1.064
Neg logit contributions
adders-2.871
ensical-2.738
 millenn-2.734
udi-2.643
��-2.611
Token the
Token ID262
Feature activation+3.88e-01

Pos logit contributions
oire+0.056
 Sins+0.054
ドラ+0.053
avering+0.049
gerald+0.047
Neg logit contributions
adders-0.059
ensical-0.053
udi-0.052
 millenn-0.052
��-0.051
Token real
Token ID1103
Feature activation+3.25e-01

Pos logit contributions
oire+0.631
 Sins+0.610
ドラ+0.606
avering+0.562
gerald+0.545
Neg logit contributions
adders-0.477
ensical-0.421
udi-0.413
 millenn-0.404
��-0.401
Token-
Token ID12
Feature activation-7.51e-02

Pos logit contributions
oire+0.502
 Sins+0.484
ドラ+0.483
avering+0.440
gerald+0.428
Neg logit contributions
adders-0.425
ensical-0.381
udi-0.373
 millenn-0.368
��-0.366
Tokenestate
Token ID44146
Feature activation-8.05e-02

Pos logit contributions
adders+0.104
ensical+0.093
udi+0.091
 millenn+0.090
��+0.089
Neg logit contributions
oire-0.111
ドラ-0.108
 Sins-0.106
avering-0.098
gerald-0.094
Token boom
Token ID14166
Feature activation+9.95e-01

Pos logit contributions
adders+0.113
ensical+0.101
udi+0.099
 millenn+0.098
��+0.097
Neg logit contributions
oire-0.118
ドラ-0.114
 Sins-0.113
avering-0.103
gerald-0.100
Token raised
Token ID4376
Feature activation-3.27e+00

Pos logit contributions
oire+1.446
ドラ+1.395
 Sins+1.381
avering+1.261
gerald+1.250
Neg logit contributions
adders-1.382
ensical-1.238
 millenn-1.211
udi-1.204
��-1.187
Token rents
Token ID28393
Feature activation-1.24e+00

Pos logit contributions
adders+4.358
ensical+3.884
udi+3.812
 millenn+3.750
��+3.685
Neg logit contributions
oire-5.018
ドラ-4.849
 Sins-4.799
avering-4.433
gerald-4.272
Token beyond
Token ID3675
Feature activation-2.10e+00

Pos logit contributions
adders+1.775
ensical+1.594
udi+1.566
 millenn+1.554
��+1.527
Neg logit contributions
oire-1.790
ドラ-1.719
 Sins-1.704
avering-1.564
gerald-1.503
Token the
Token ID262
Feature activation-2.50e-01

Pos logit contributions
adders+2.974
ensical+2.668
udi+2.622
 millenn+2.583
��+2.540
Neg logit contributions
oire-3.041
ドラ-2.933
 Sins-2.899
avering-2.665
gerald-2.561
Token reach
Token ID3151
Feature activation-2.72e-01

Pos logit contributions
adders+0.313
ensical+0.274
udi+0.273
 millenn+0.269
��+0.265
Neg logit contributions
oire-0.405
 Sins-0.389
ドラ-0.384
avering-0.358
gerald-0.347
Token of
Token ID286
Feature activation+1.12e-01

Pos logit contributions
adders+0.354
ensical+0.331
 millenn+0.325
udi+0.315
��+0.310
Neg logit contributions
oire-0.421
ドラ-0.400
 Sins-0.397
avering-0.368
gerald-0.354
Token,
Token ID11
Feature activation+1.55e-01

Pos logit contributions
oire+3.579
 Sins+3.370
ドラ+3.356
avering+3.056
gerald+2.892
Neg logit contributions
adders-4.155
ensical-3.758
��-3.700
udi-3.681
 millenn-3.664
Token you
Token ID345
Feature activation+1.13e+00

Pos logit contributions
oire+0.207
 Sins+0.199
ドラ+0.197
avering+0.178
gerald+0.169
Neg logit contributions
adders-0.236
ensical-0.215
udi-0.211
 millenn-0.209
��-0.206
Token have
Token ID423
Feature activation+4.58e-01

Pos logit contributions
oire+1.469
 Sins+1.374
ドラ+1.368
avering+1.256
gerald+1.171
Neg logit contributions
adders-1.767
ensical-1.601
 millenn-1.598
udi-1.576
��-1.569
Token finished
Token ID5201
Feature activation+1.69e+00

Pos logit contributions
oire+0.655
 Sins+0.630
ドラ+0.617
avering+0.566
gerald+0.537
Neg logit contributions
adders-0.663
ensical-0.594
udi-0.582
��-0.577
 millenn-0.575
Token the
Token ID262
Feature activation+2.10e-02

Pos logit contributions
oire+2.289
 Sins+2.224
ドラ+2.152
avering+1.990
gerald+1.870
Neg logit contributions
adders-2.514
ensical-2.276
��-2.224
udi-2.183
 millenn-2.176
Token initiation
Token ID30122
Feature activation-3.80e+00

Pos logit contributions
oire+0.033
 Sins+0.032
ドラ+0.032
avering+0.029
gerald+0.028
Neg logit contributions
adders-0.027
ensical-0.024
udi-0.023
��-0.023
 millenn-0.023
Token ceremony
Token ID10542
Feature activation-9.63e-01

Pos logit contributions
adders+4.739
ensical+4.177
udi+4.126
 millenn+4.043
��+3.897
Neg logit contributions
oire-6.001
ドラ-5.902
 Sins-5.762
avering-5.406
gerald-5.211
Token,
Token ID11
Feature activation-1.41e-01

Pos logit contributions
adders+1.392
ensical+1.251
udi+1.228
 millenn+1.214
��+1.198
Neg logit contributions
oire-1.372
ドラ-1.316
 Sins-1.306
avering-1.196
gerald-1.148
Token and
Token ID290
Feature activation+4.99e-01

Pos logit contributions
adders+0.209
ensical+0.191
udi+0.187
 millenn+0.184
��+0.183
Neg logit contributions
oire-0.195
 Sins-0.187
ドラ-0.185
avering-0.168
gerald-0.160
Token you
Token ID345
Feature activation+1.14e+00

Pos logit contributions
oire+0.673
 Sins+0.648
ドラ+0.631
avering+0.580
gerald+0.547
Neg logit contributions
adders-0.758
ensical-0.692
udi-0.675
 millenn-0.673
��-0.668
Token are
Token ID389
Feature activation+2.36e-03

Pos logit contributions
oire+1.561
 Sins+1.477
ドラ+1.445
avering+1.348
gerald+1.261
Neg logit contributions
adders-1.705
 millenn-1.559
ensical-1.543
��-1.523
udi-1.519
Token hearing
Token ID4854
Feature activation-1.07e+00

Pos logit contributions
adders+0.055
ensical+0.048
udi+0.048
 millenn+0.046
��+0.046
Neg logit contributions
oire-0.080
 Sins-0.077
ドラ-0.076
avering-0.071
gerald-0.069
Token on
Token ID319
Feature activation-6.28e-01

Pos logit contributions
adders+1.555
ensical+1.394
udi+1.369
 millenn+1.351
��+1.339
Neg logit contributions
oire-1.516
 Sins-1.450
ドラ-1.441
avering-1.317
gerald-1.270
Token Thursday
Token ID3635
Feature activation+6.29e-01

Pos logit contributions
adders+0.968
ensical+0.882
udi+0.861
 millenn+0.848
��+0.847
Neg logit contributions
oire-0.835
 Sins-0.807
ドラ-0.794
avering-0.724
gerald-0.697
Token,
Token ID11
Feature activation-1.20e-02

Pos logit contributions
oire+0.866
 Sins+0.835
ドラ+0.816
avering+0.749
gerald+0.727
Neg logit contributions
adders-0.939
ensical-0.843
udi-0.828
��-0.814
 millenn-0.814
Token Mel
Token ID5616
Feature activation-1.48e+00

Pos logit contributions
adders+0.017
ensical+0.016
udi+0.015
��+0.015
 millenn+0.015
Neg logit contributions
oire-0.017
 Sins-0.017
ドラ-0.016
avering-0.015
gerald-0.014
Tokeninda
Token ID22261
Feature activation-3.66e+00

Pos logit contributions
adders+1.999
ensical+1.784
udi+1.747
 millenn+1.729
��+1.698
Neg logit contributions
oire-2.250
ドラ-2.166
 Sins-2.149
avering-1.983
gerald-1.908
Token Thompson
Token ID11654
Feature activation-5.92e-01

Pos logit contributions
adders+4.941
ensical+4.393
udi+4.364
 millenn+4.229
��+4.109
Neg logit contributions
oire-5.336
ドラ-5.262
 Sins-5.002
avering-4.713
gerald-4.549
Token,
Token ID11
Feature activation+4.72e-01

Pos logit contributions
adders+0.872
ensical+0.784
udi+0.768
 millenn+0.765
��+0.754
Neg logit contributions
oire-0.829
ドラ-0.794
 Sins-0.792
avering-0.720
gerald-0.694
Token the
Token ID262
Feature activation+5.97e-01

Pos logit contributions
oire+0.676
 Sins+0.655
ドラ+0.647
avering+0.587
gerald+0.564
Neg logit contributions
adders-0.680
ensical-0.618
udi-0.595
 millenn-0.588
��-0.586
Token leading
Token ID3756
Feature activation-1.69e+00

Pos logit contributions
oire+0.914
 Sins+0.891
ドラ+0.874
avering+0.809
gerald+0.783
Neg logit contributions
adders-0.795
ensical-0.708
udi-0.692
 millenn-0.679
��-0.674
Token lawyer
Token ID6853
Feature activation+1.38e-03

Pos logit contributions
adders+2.120
ensical+1.875
udi+1.836
 millenn+1.805
��+1.770
Neg logit contributions
oire-2.728
ドラ-2.634
 Sins-2.620
avering-2.422
gerald-2.339
Token he
Token ID339
Feature activation+8.99e-01

Pos logit contributions
adders+3.057
ensical+2.741
udi+2.694
 millenn+2.656
��+2.610
Neg logit contributions
oire-3.137
ドラ-3.025
 Sins-2.993
avering-2.750
gerald-2.643
Token could
Token ID714
Feature activation-5.51e-01

Pos logit contributions
oire+1.154
 Sins+1.102
ドラ+1.097
avering+0.998
gerald+0.945
Neg logit contributions
adders-1.401
 millenn-1.291
ensical-1.286
��-1.241
udi-1.240
Token easily
Token ID3538
Feature activation-5.75e-01

Pos logit contributions
adders+0.783
 millenn+0.709
ensical+0.705
udi+0.686
��+0.685
Neg logit contributions
oire-0.791
 Sins-0.759
ドラ-0.752
avering-0.698
gerald-0.665
Token be
Token ID307
Feature activation+1.17e+00

Pos logit contributions
adders+0.832
 millenn+0.754
ensical+0.749
udi+0.737
��+0.730
Neg logit contributions
oire-0.812
 Sins-0.774
ドラ-0.773
avering-0.712
gerald-0.680
Token fl
Token ID781
Feature activation-4.94e-01

Pos logit contributions
oire+1.747
 Sins+1.698
ドラ+1.690
avering+1.579
gerald+1.492
Neg logit contributions
adders-1.581
ensical-1.400
 millenn-1.390
udi-1.387
��-1.347
Tokenirting
Token ID35355
Feature activation-3.35e+00

Pos logit contributions
adders+0.675
ensical+0.605
udi+0.593
 millenn+0.589
��+0.578
Neg logit contributions
oire-0.739
ドラ-0.710
 Sins-0.705
avering-0.652
gerald-0.625
Token with
Token ID351
Feature activation-6.96e-01

Pos logit contributions
adders+3.952
ensical+3.491
udi+3.420
 millenn+3.217
��+3.181
Neg logit contributions
oire-5.556
ドラ-5.435
 Sins-5.289
avering-4.946
gerald-4.777
Token one
Token ID530
Feature activation+9.25e-01

Pos logit contributions
adders+0.963
ensical+0.861
udi+0.843
 millenn+0.839
��+0.827
Neg logit contributions
oire-1.032
 Sins-0.992
ドラ-0.992
avering-0.905
gerald-0.873
Token at
Token ID379
Feature activation-6.59e-01

Pos logit contributions
oire+1.280
 Sins+1.223
ドラ+1.215
avering+1.103
gerald+1.060
Neg logit contributions
adders-1.373
ensical-1.223
 millenn-1.199
udi-1.199
��-1.190
Token a
Token ID257
Feature activation-3.16e-01

Pos logit contributions
adders+0.915
ensical+0.819
udi+0.804
 millenn+0.795
��+0.777
Neg logit contributions
oire-0.972
 Sins-0.933
ドラ-0.931
avering-0.861
gerald-0.826
Token real
Token ID1103
Feature activation-1.63e-01

Pos logit contributions
adders+0.505
ensical+0.456
udi+0.446
 millenn+0.442
��+0.436
Neg logit contributions
oire-0.403
 Sins-0.387
ドラ-0.382
avering-0.349
gerald-0.334
Token post
Token ID1281
Feature activation+1.60e+00

Pos logit contributions
adders+1.078
ensical+0.956
udi+0.937
 millenn+0.924
��+0.916
Neg logit contributions
oire-1.172
ドラ-1.127
 Sins-1.123
avering-1.027
gerald-0.990
Token and
Token ID290
Feature activation+3.22e-01

Pos logit contributions
oire+2.300
ドラ+2.184
 Sins+2.182
avering+2.004
gerald+1.927
Neg logit contributions
adders-2.283
ensical-2.026
 millenn-2.011
udi-1.990
��-1.968
Token snap
Token ID11495
Feature activation+8.10e-01

Pos logit contributions
oire+0.453
ドラ+0.437
 Sins+0.435
avering+0.393
gerald+0.379
Neg logit contributions
adders-0.472
ensical-0.422
 millenn-0.418
udi-0.413
��-0.412
Token my
Token ID616
Feature activation-8.57e-01

Pos logit contributions
oire+1.131
ドラ+1.097
 Sins+1.093
avering+0.982
gerald+0.953
Neg logit contributions
adders-1.185
ensical-1.060
 millenn-1.051
udi-1.049
��-1.032
Token idea
Token ID2126
Feature activation-2.13e+00

Pos logit contributions
adders+1.102
ensical+0.973
udi+0.959
 millenn+0.941
��+0.931
Neg logit contributions
oire-1.356
ドラ-1.312
 Sins-1.308
avering-1.203
gerald-1.162
Token up
Token ID510
Feature activation-3.54e+00

Pos logit contributions
adders+2.621
ensical+2.322
udi+2.267
 millenn+2.205
��+2.158
Neg logit contributions
oire-3.464
ドラ-3.365
 Sins-3.328
avering-3.088
gerald-2.994
Token.
Token ID13
Feature activation-6.98e-02

Pos logit contributions
adders+4.597
ensical+4.120
udi+4.045
 millenn+3.872
��+3.793
Neg logit contributions
oire-5.379
ドラ-5.198
 Sins-5.119
avering-4.767
gerald-4.602
Token George
Token ID4502
Feature activation-3.35e-01

Pos logit contributions
adders+0.114
ensical+0.104
 millenn+0.103
udi+0.102
��+0.100
Neg logit contributions
oire-0.087
 Sins-0.083
ドラ-0.083
avering-0.074
gerald-0.070
Token,
Token ID11
Feature activation-1.97e-01

Pos logit contributions
adders+0.472
ensical+0.423
udi+0.413
 millenn+0.410
��+0.403
Neg logit contributions
oire-0.488
ドラ-0.471
 Sins-0.466
avering-0.428
gerald-0.412
Token we
Token ID356
Feature activation-1.09e-01

Pos logit contributions
adders+0.298
ensical+0.271
udi+0.264
 millenn+0.264
��+0.260
Neg logit contributions
oire-0.266
ドラ-0.255
 Sins-0.255
avering-0.231
gerald-0.222
Token could
Token ID714
Feature activation-9.84e-01

Pos logit contributions
adders+0.157
ensical+0.141
 millenn+0.138
udi+0.138
��+0.136
Neg logit contributions
oire-0.156
ドラ-0.149
 Sins-0.148
avering-0.136
gerald-0.130