TOP ACTIVATIONS
MAX = 2.154

said Jere mie Ryder , conserv ator of the Mur to
transparent ly to you . Rest assured , the techn ologists
that be ? Imagine the sp uttering floor speeches from an
c es dro its en rest re ignant l ' acc
you to events like pop - ups , farmers markets and
( New York : St . Martin s Press
for the next generation . Rest raining future debt is a
If vouchers are extended , par och ial schools stand to
Cors aires l oy aux rest ants . Par le pr
ism atic - Rest less , Quick to anger
1980 s partly due to spl inter ing within the group
did little to curb the trade of child pornography in practice
a little more developer hand - holding could be a good
83 Active Elect ron ically Sc anned Array Rad ars and
strike networks on three different Open Systems Inter connection ( OS
, the nonprofit behind the open - source web browser ,
s a narrow , par och ial my
legislation passed by the territory par liam ents .[ 196 ]
the most successful . These trade - offs are different for
had ' F --- ' sc rawled in large letters across

MIN ACTIVATIONS
MIN = -2.050

cost of a gallon of diesel is about $ 4 .
lit re of reduction in diesel consumption per coach per ann
point towards the spacecraft . Deep Space 1 then entered its
referred to as " John Doe " contends he was denied
will be good times and hard times and unexpected turns .
supporters have been been releasing hydrogen balloons carrying bundles of B
actions to take , and answer your biggest market questions .
SS N ), plus some diesel submarines . Russia announced that
, 7 88 litres of diesel in a year . The
- govern ed organisation " del ivers emergency aid to people
ny Gray are in the engine room . Rob
simple : function ob b _ sub div ide
video with # Ask The D ubs , then tune in
ulf ur fuel oil and diesel at current prices .
ak / j t / s gb ( d pa ,
s oil and gas . While it
olan 41 ), Daly ( El worthy 54 ), Scott (
, and chatting . It turns out there are quite a
further reading : Hall ion , Richard P .
really well , as it turns out . The Premiere is

INTERVAL 1.103 - 2.154
CONTAINS 0.585%

lots of money for somebody else and were asked to be
nice hash . com , but will be disabled eventually .
it with a mixture of tre p idation and awe .
tit ular phrase was still fl uttering proudly in the breeze
graduating with a B FA in Musical Theater . He

INTERVAL 0.052 - 1.103
CONTAINS 44.103%

would not im ped e the development and use
stay true to the traditional flavors of Vietnamese cuisine in New
the pending switch to 100 % renewable energy .
It was amazing to watch him bat a thousand after years
M illy had written notes about hating herself , about

INTERVAL -0.999 - 0.052
CONTAINS 54.234%

to do that - it creates a lot of entry points
let s assume for a moment that we
of myth is thus conveniently reflected back to him .
get within a point at 15 - 14 heading into the
, but the biology tr umps a lot of these environmental

INTERVAL -2.050 - -0.999
CONTAINS 1.078%

lit re of reduction in diesel consumption per coach per ann
published . DA IK AI J U
obligation to protect and promote human rights ." "
ure Test truck in the 2010 B aja 1000 , as
. Fl ake , who announced last month that
Token said
Token ID531
Feature activation+5.88e-01

Pos logit contributions
seeing+0.259
irit+0.240
arers+0.238
dating+0.235
effects+0.228
Neg logit contributions
Vs-0.247
 Minutes-0.225
inki-0.220
�醒-0.215
OUS-0.209
Token Jere
Token ID10272
Feature activation+1.51e+00

Pos logit contributions
seeing+3.215
irit+2.907
arers+2.864
dating+2.862
place+2.784
Neg logit contributions
Vs-4.031
inki-3.683
 Minutes-3.640
�醒-3.608
rador-3.499
Tokenmie
Token ID44871
Feature activation+8.01e-01

Pos logit contributions
seeing+7.485
irit+7.085
lier+6.866
place+6.664
arers+6.515
Neg logit contributions
Vs-10.170
OUS-8.845
 Minutes-8.828
�醒-8.809
inki-8.766
Token Ryder
Token ID40238
Feature activation+6.30e-02

Pos logit contributions
seeing+4.720
irit+4.402
arers+4.255
lier+4.250
dating+4.241
Neg logit contributions
Vs-4.987
inki-4.503
 Minutes-4.487
�醒-4.431
rador-4.330
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
seeing+0.314
irit+0.287
arers+0.284
dating+0.279
effects+0.268
Neg logit contributions
Vs-0.468
 Minutes-0.436
inki-0.426
�醒-0.412
OUS-0.408
Token conserv
Token ID4055
Feature activation+2.15e+00

Pos logit contributions
seeing+1.360
irit+1.247
arers+1.239
dating+1.228
place+1.190
Neg logit contributions
Vs-1.383
 Minutes-1.254
inki-1.225
�醒-1.202
OUS-1.168
Tokenator
Token ID1352
Feature activation+1.22e+00

Pos logit contributions
irit+5.837
seeing+5.668
arers+5.399
place+5.203
atri+5.041
Neg logit contributions
Vs-17.991
OME-16.479
 Minutes-16.473
ALSE-16.234
OUS-15.972
Token of
Token ID286
Feature activation+8.93e-01

Pos logit contributions
seeing+6.219
irit+5.650
lier+5.516
dating+5.400
 painter+5.368
Neg logit contributions
Vs-8.415
 Minutes-7.738
inki-7.433
USS-7.350
�醒-7.347
Token the
Token ID262
Feature activation+7.37e-01

Pos logit contributions
seeing+5.157
dating+4.675
arers+4.672
irit+4.661
pain+4.503
Neg logit contributions
Vs-5.658
inki-5.169
�醒-5.144
 Minutes-5.126
OUS-4.864
Token Mur
Token ID5921
Feature activation+6.67e-01

Pos logit contributions
seeing+4.133
irit+3.753
arers+3.715
dating+3.641
place+3.577
Neg logit contributions
Vs-4.877
inki-4.448
�醒-4.401
 Minutes-4.398
rador-4.122
Tokento
Token ID1462
Feature activation+2.98e-01

Pos logit contributions
seeing+3.374
irit+3.094
lier+2.997
arers+2.991
dating+2.923
Neg logit contributions
Vs-4.808
 Minutes-4.422
�醒-4.361
inki-4.306
OUS-4.256
Token transparent
Token ID13245
Feature activation+3.69e-01

Pos logit contributions
seeing+1.303
irit+1.192
arers+1.191
dating+1.175
place+1.153
Neg logit contributions
Vs-1.316
�醒-1.192
 Minutes-1.190
inki-1.180
rador-1.137
Tokenly
Token ID306
Feature activation+6.20e-01

Pos logit contributions
seeing+2.280
dating+2.071
arers+2.060
lier+2.056
irit+2.028
Neg logit contributions
Vs-2.284
 Minutes-2.104
�醒-2.098
inki-2.085
USS-1.971
Token to
Token ID284
Feature activation+4.28e-01

Pos logit contributions
seeing+3.601
dating+3.238
arers+3.230
irit+3.207
place+3.141
Neg logit contributions
Vs-3.993
�醒-3.807
inki-3.656
 Minutes-3.627
rador-3.537
Token you
Token ID345
Feature activation+4.20e-01

Pos logit contributions
seeing+2.653
arers+2.497
irit+2.452
dating+2.397
place+2.386
Neg logit contributions
Vs-2.571
�醒-2.351
inki-2.347
 Minutes-2.304
rador-2.195
Token.
Token ID13
Feature activation+4.34e-01

Pos logit contributions
seeing+2.535
arers+2.333
dating+2.295
irit+2.278
place+2.223
Neg logit contributions
Vs-2.672
�醒-2.443
inki-2.421
 Minutes-2.404
USS-2.294
Token Rest
Token ID8324
Feature activation+2.15e+00

Pos logit contributions
seeing+2.912
arers+2.642
dating+2.616
place+2.604
irit+2.602
Neg logit contributions
Vs-2.422
inki-2.261
�醒-2.232
rador-2.107
 Minutes-2.069
Token assured
Token ID13933
Feature activation+8.28e-01

Pos logit contributions
raint+6.327
arers+5.761
dating+5.658
seeing+5.607
ruct+5.410
Neg logit contributions
�醒-16.926
Vs-16.533
inki-15.706
ahime-15.378
rador-15.228
Token,
Token ID11
Feature activation+4.21e-01

Pos logit contributions
seeing+4.313
dating+3.797
arers+3.772
irit+3.727
place+3.664
Neg logit contributions
Vs-5.805
�醒-5.389
inki-5.386
 Minutes-5.316
rador-5.113
Token the
Token ID262
Feature activation+3.35e-01

Pos logit contributions
seeing+2.552
arers+2.337
dating+2.316
irit+2.302
effects+2.244
Neg logit contributions
Vs-2.656
�醒-2.430
inki-2.421
 Minutes-2.399
rador-2.271
Token techn
Token ID1579
Feature activation-3.55e-01

Pos logit contributions
seeing+2.050
arers+1.891
irit+1.885
dating+1.879
effects+1.812
Neg logit contributions
Vs-2.098
inki-1.897
 Minutes-1.883
�醒-1.860
rador-1.786
Tokenologists
Token ID9251
Feature activation+4.88e-01

Pos logit contributions
Vs+2.806
 Minutes+2.618
inki+2.565
�醒+2.470
OUS+2.468
Neg logit contributions
seeing-1.586
irit-1.429
arers-1.396
dating-1.393
rising-1.320
Token that
Token ID326
Feature activation-3.80e-01

Pos logit contributions
Vs+1.315
 Minutes+1.213
inki+1.180
�醒+1.142
OUS+1.130
Neg logit contributions
seeing-1.067
irit-0.985
arers-0.975
dating-0.961
raint-0.929
Token be
Token ID307
Feature activation+4.59e-01

Pos logit contributions
Vs+2.229
 Minutes+2.022
inki+1.964
�醒+1.887
OUS+1.877
Neg logit contributions
seeing-2.476
irit-2.319
arers-2.300
dating-2.263
effects-2.207
Token?
Token ID30
Feature activation+5.62e-01

Pos logit contributions
seeing+2.371
irit+2.091
arers+2.073
dating+2.072
lier+2.010
Neg logit contributions
Vs-3.310
 Minutes-3.097
inki-3.027
�醒-2.994
rador-2.924
Token Imagine
Token ID18450
Feature activation+9.56e-02

Pos logit contributions
seeing+3.963
irit+3.564
dating+3.554
arers+3.545
place+3.544
Neg logit contributions
Vs-2.907
inki-2.680
�醒-2.582
rador-2.524
 Minutes-2.520
Token the
Token ID262
Feature activation+6.90e-01

Pos logit contributions
seeing+0.571
irit+0.524
arers+0.517
dating+0.513
lier+0.499
Neg logit contributions
Vs-0.622
 Minutes-0.568
inki-0.557
�醒-0.541
rador-0.529
Token sp
Token ID599
Feature activation+2.11e+00

Pos logit contributions
seeing+4.012
irit+3.628
arers+3.559
effects+3.546
dating+3.522
Neg logit contributions
Vs-4.438
 Minutes-4.008
inki-3.995
�醒-3.948
rador-3.859
Tokenuttering
Token ID33598
Feature activation+8.23e-01

Pos logit contributions
seeing+11.236
irit+11.220
arers+10.010
lier+9.952
ourney+9.730
Neg logit contributions
Vs-11.688
�醒-11.545
 Minutes-10.544
��-10.309
��-10.235
Token floor
Token ID4314
Feature activation+4.43e-01

Pos logit contributions
seeing+4.624
irit+4.293
place+4.201
dating+4.180
effects+4.176
Neg logit contributions
Vs-5.319
�醒-4.808
inki-4.804
 Minutes-4.713
rador-4.652
Token speeches
Token ID18138
Feature activation+4.39e-01

Pos logit contributions
seeing+2.633
irit+2.393
dating+2.363
arers+2.346
effects+2.317
Neg logit contributions
Vs-2.860
 Minutes-2.617
inki-2.571
�醒-2.528
rador-2.459
Token from
Token ID422
Feature activation+3.22e-01

Pos logit contributions
seeing+2.502
irit+2.282
arers+2.249
dating+2.215
place+2.181
Neg logit contributions
Vs-2.920
 Minutes-2.654
inki-2.634
�醒-2.578
rador-2.519
Token an
Token ID281
Feature activation+2.53e-01

Pos logit contributions
seeing+2.021
irit+1.864
arers+1.844
dating+1.805
lier+1.778
Neg logit contributions
Vs-1.974
inki-1.780
 Minutes-1.765
�醒-1.733
rador-1.688
Token c
Token ID269
Feature activation+1.28e+00

Pos logit contributions
seeing+0.809
irit+0.745
arers+0.736
dating+0.728
raint+0.705
Neg logit contributions
Vs-0.943
 Minutes-0.865
inki-0.847
�醒-0.820
OUS-0.806
Tokenes
Token ID274
Feature activation-4.09e-01

Pos logit contributions
seeing+5.785
irit+5.707
arers+5.407
ember+5.225
lier+5.174
Neg logit contributions
Vs-8.838
 Minutes-8.524
�醒-8.371
OME-8.167
inki-7.954
Token dro
Token ID3102
Feature activation+7.35e-01

Pos logit contributions
Vs+2.672
 Minutes+2.482
inki+2.398
�醒+2.301
OUS+2.301
Neg logit contributions
seeing-2.387
irit-2.193
arers-2.181
dating-2.157
rising-2.094
Tokenits
Token ID896
Feature activation+8.04e-01

Pos logit contributions
seeing+3.989
irit+3.794
arers+3.706
effects+3.586
dating+3.577
Neg logit contributions
Vs-4.752
 Minutes-4.529
�醒-4.501
inki-4.286
OUS-4.249
Token en
Token ID551
Feature activation+1.80e-01

Pos logit contributions
seeing+4.345
irit+4.015
arers+3.891
dating+3.878
lier+3.812
Neg logit contributions
Vs-5.382
 Minutes-4.956
inki-4.781
�醒-4.741
rador-4.616
Token rest
Token ID1334
Feature activation+2.09e+00

Pos logit contributions
seeing+1.070
irit+0.988
arers+0.975
dating+0.969
place+0.936
Neg logit contributions
Vs-1.171
 Minutes-1.070
inki-1.046
�醒-1.018
rador-0.994
Tokenre
Token ID260
Feature activation+1.07e+00

Pos logit contributions
seeing+9.195
irit+8.836
arers+8.802
lier+8.715
raint+8.530
Neg logit contributions
Vs-12.903
 Minutes-12.614
�醒-11.772
OME-11.717
USS-11.578
Tokenignant
Token ID25114
Feature activation+3.27e-01

Pos logit contributions
seeing+5.865
irit+5.562
arers+5.408
dating+5.356
lier+5.300
Neg logit contributions
Vs-6.792
 Minutes-6.254
inki-6.142
�醒-5.919
rador-5.898
Token l
Token ID300
Feature activation+1.24e+00

Pos logit contributions
seeing+1.806
irit+1.674
arers+1.645
dating+1.624
lier+1.587
Neg logit contributions
Vs-2.223
 Minutes-2.049
inki-2.005
�醒-1.952
rador-1.909
Token'
Token ID6
Feature activation+1.25e+00

Pos logit contributions
seeing+4.712
irit+4.630
arers+4.284
raint+4.208
dating+4.197
Neg logit contributions
Vs-9.605
 Minutes-9.197
�醒-9.045
inki-8.756
USS-8.549
Tokenacc
Token ID4134
Feature activation+3.19e-01

Pos logit contributions
seeing+6.354
irit+6.128
dating+5.918
effects+5.849
arers+5.803
Neg logit contributions
Vs-8.118
 Minutes-7.658
�醒-7.465
inki-7.199
rador-7.097
Token you
Token ID345
Feature activation-2.53e-01

Pos logit contributions
Vs+0.862
 Minutes+0.768
inki+0.741
�醒+0.705
OUS+0.692
Neg logit contributions
seeing-1.341
irit-1.264
arers-1.252
dating-1.240
effects-1.210
Token to
Token ID284
Feature activation-2.11e-01

Pos logit contributions
Vs+1.449
 Minutes+1.313
inki+1.274
�醒+1.218
OUS+1.204
Neg logit contributions
seeing-1.685
irit-1.583
arers-1.562
dating-1.546
effects-1.509
Token events
Token ID2995
Feature activation-1.33e-02

Pos logit contributions
Vs+1.281
 Minutes+1.170
inki+1.137
�醒+1.100
OUS+1.081
Neg logit contributions
seeing-1.340
irit-1.246
arers-1.235
dating-1.219
effects-1.183
Token like
Token ID588
Feature activation+2.21e-01

Pos logit contributions
Vs+0.093
 Minutes+0.086
inki+0.084
�醒+0.082
OUS+0.080
Neg logit contributions
seeing-0.073
irit-0.066
arers-0.066
dating-0.065
effects-0.062
Token pop
Token ID1461
Feature activation+1.23e+00

Pos logit contributions
seeing+1.387
irit+1.274
arers+1.268
dating+1.254
lier+1.218
Neg logit contributions
Vs-1.355
 Minutes-1.238
inki-1.212
�醒-1.188
rador-1.161
Token-
Token ID12
Feature activation+2.08e+00

Pos logit contributions
seeing+6.174
dating+5.581
arers+5.415
irit+5.377
place+5.314
Neg logit contributions
Vs-8.199
 Minutes-7.930
�醒-7.729
rador-7.702
inki-7.677
Tokenups
Token ID4739
Feature activation+5.49e-01

Pos logit contributions
seeing+5.536
dating+4.805
effects+4.457
place+4.418
rising+4.132
Neg logit contributions
�醒-18.220
 Minutes-17.713
Vs-17.442
ALSE-16.993
��-16.849
Token,
Token ID11
Feature activation+2.23e-01

Pos logit contributions
seeing+2.926
irit+2.599
arers+2.589
dating+2.587
place+2.464
Neg logit contributions
Vs-3.866
 Minutes-3.584
inki-3.516
�醒-3.469
OUS-3.374
Token farmers
Token ID9818
Feature activation-9.02e-02

Pos logit contributions
seeing+1.393
arers+1.272
irit+1.272
dating+1.261
place+1.221
Neg logit contributions
Vs-1.371
 Minutes-1.259
inki-1.227
�醒-1.218
rador-1.177
Token markets
Token ID5939
Feature activation+4.89e-01

Pos logit contributions
Vs+0.464
 Minutes+0.417
inki+0.403
�醒+0.388
OUS+0.378
Neg logit contributions
seeing-0.658
irit-0.617
arers-0.611
dating-0.605
effects-0.589
Token and
Token ID290
Feature activation-1.39e-01

Pos logit contributions
seeing+2.355
irit+2.124
arers+2.100
dating+2.069
effects+1.981
Neg logit contributions
Vs-3.712
 Minutes-3.453
inki-3.386
�醒-3.291
OUS-3.253
Token (
Token ID357
Feature activation+1.07e+00

Pos logit contributions
seeing+2.490
irit+2.271
dating+2.209
arers+2.198
place+2.174
Neg logit contributions
Vs-3.052
inki-2.819
 Minutes-2.783
�醒-2.699
rador-2.691
TokenNew
Token ID3791
Feature activation+6.05e-01

Pos logit contributions
seeing+6.510
dating+5.984
place+5.896
rising+5.886
effects+5.751
Neg logit contributions
Vs-6.167
�醒-5.873
 Minutes-5.802
inki-5.776
rador-5.505
Token York
Token ID1971
Feature activation+1.28e+00

Pos logit contributions
seeing+2.324
irit+2.008
arers+2.000
dating+1.937
place+1.909
Neg logit contributions
Vs-5.167
 Minutes-4.760
inki-4.756
�醒-4.737
OUS-4.589
Token:
Token ID25
Feature activation+1.55e+00

Pos logit contributions
seeing+6.523
irit+5.875
arers+5.831
place+5.825
dating+5.792
Neg logit contributions
Vs-8.799
inki-8.069
 Minutes-7.764
�醒-7.668
OUS-7.558
Token St
Token ID520
Feature activation+1.80e+00

Pos logit contributions
seeing+7.751
place+7.283
lier+6.932
arers+6.710
dating+6.686
Neg logit contributions
inki-9.740
Vs-9.737
�醒-9.336
rador-9.166
asionally-8.823
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
seeing+7.640
ember+7.604
irit+7.467
place+7.288
lier+7.172
Neg logit contributions
Vs-12.476
 Minutes-11.617
�醒-11.485
inki-11.296
ahime-10.943
Token Martin
Token ID5780
Feature activation+5.95e-01

Pos logit contributions
seeing+6.314
place+5.986
pain+5.232
rising+5.182
irit+5.181
Neg logit contributions
Vs-16.203
inki-16.023
ahime-15.842
rador-15.510
�醒-15.359
Token
Token ID447
Feature activation+8.83e-01

Pos logit contributions
seeing+3.172
irit+2.931
arers+2.880
dating+2.831
lier+2.758
Neg logit contributions
Vs-4.152
inki-3.812
 Minutes-3.784
rador-3.655
�醒-3.643
Token
Token ID247
Feature activation+8.42e-01

Pos logit contributions
seeing+5.083
irit+4.927
arers+4.720
place+4.660
dating+4.550
Neg logit contributions
Vs-5.380
 Minutes-5.038
inki-5.030
OUS-4.698
rador-4.675
Tokens
Token ID82
Feature activation+4.77e-01

Pos logit contributions
seeing+4.400
irit+4.019
arers+3.971
dating+3.956
place+3.950
Neg logit contributions
Vs-5.753
 Minutes-5.319
inki-5.306
�醒-5.186
rador-5.085
Token Press
Token ID4332
Feature activation+1.48e+00

Pos logit contributions
seeing+2.416
irit+2.229
arers+2.184
dating+2.150
place+2.110
Neg logit contributions
Vs-3.490
 Minutes-3.208
inki-3.168
�醒-3.072
rador-3.036
Token for
Token ID329
Feature activation+6.48e-02

Pos logit contributions
Vs+0.307
 Minutes+0.281
inki+0.273
�醒+0.264
OUS+0.258
Neg logit contributions
seeing-0.319
irit-0.296
arers-0.292
dating-0.289
effects-0.281
Token the
Token ID262
Feature activation+7.44e-02

Pos logit contributions
seeing+0.403
arers+0.372
irit+0.372
dating+0.364
lier+0.356
Neg logit contributions
Vs-0.405
 Minutes-0.368
inki-0.363
�醒-0.355
OUS-0.340
Token next
Token ID1306
Feature activation+6.50e-02

Pos logit contributions
seeing+0.419
irit+0.384
arers+0.383
dating+0.373
place+0.365
Neg logit contributions
Vs-0.507
 Minutes-0.467
inki-0.459
�醒-0.448
OUS-0.435
Token generation
Token ID5270
Feature activation-3.76e-01

Pos logit contributions
seeing+0.394
irit+0.362
arers+0.362
dating+0.356
place+0.350
Neg logit contributions
Vs-0.413
 Minutes-0.380
inki-0.371
�醒-0.361
rador-0.352
Token.
Token ID13
Feature activation+8.16e-02

Pos logit contributions
Vs+2.307
 Minutes+2.120
inki+2.041
�醒+1.972
OUS+1.964
Neg logit contributions
seeing-2.324
irit-2.187
arers-2.153
dating-2.131
effects-2.071
Token Rest
Token ID8324
Feature activation+2.05e+00

Pos logit contributions
seeing+0.576
irit+0.534
arers+0.531
dating+0.522
place+0.515
Neg logit contributions
Vs-0.438
inki-0.390
 Minutes-0.385
�醒-0.373
rador-0.364
Tokenraining
Token ID24674
Feature activation+3.41e-01

Pos logit contributions
raint+9.834
seeing+9.161
ruct+8.839
rising+8.746
dating+8.714
Neg logit contributions
Vs-12.980
�醒-12.491
inki-12.164
 Minutes-12.003
USS-11.583
Token future
Token ID2003
Feature activation-2.73e-01

Pos logit contributions
seeing+2.031
irit+1.870
arers+1.869
dating+1.827
effects+1.809
Neg logit contributions
Vs-2.174
 Minutes-1.985
inki-1.960
�醒-1.925
OUS-1.835
Token debt
Token ID5057
Feature activation+1.83e-01

Pos logit contributions
Vs+1.776
 Minutes+1.630
inki+1.588
�醒+1.533
OUS+1.516
Neg logit contributions
seeing-1.611
irit-1.492
arers-1.472
dating-1.456
effects-1.407
Token is
Token ID318
Feature activation+3.09e-01

Pos logit contributions
seeing+1.019
arers+0.939
irit+0.928
dating+0.906
raint+0.888
Neg logit contributions
Vs-1.251
 Minutes-1.158
inki-1.132
�醒-1.107
USS-1.074
Token a
Token ID257
Feature activation+3.68e-01

Pos logit contributions
seeing+1.937
irit+1.798
arers+1.779
dating+1.753
lier+1.739
Neg logit contributions
Vs-1.902
 Minutes-1.730
inki-1.701
�醒-1.652
OUS-1.593
Token If
Token ID1002
Feature activation-3.10e-01

Pos logit contributions
seeing+0.301
irit+0.281
arers+0.279
dating+0.276
place+0.272
Neg logit contributions
Vs-0.214
inki-0.190
 Minutes-0.189
�醒-0.181
rador-0.177
Token vouchers
Token ID41262
Feature activation+1.12e-01

Pos logit contributions
Vs+1.919
 Minutes+1.753
inki+1.703
�醒+1.643
OUS+1.622
Neg logit contributions
seeing-1.939
irit-1.801
arers-1.780
dating-1.763
effects-1.708
Token are
Token ID389
Feature activation-3.43e-01

Pos logit contributions
seeing+0.691
arers+0.646
irit+0.646
dating+0.627
place+0.618
Neg logit contributions
Vs-0.702
 Minutes-0.640
inki-0.628
�醒-0.614
OUS-0.595
Token extended
Token ID7083
Feature activation+2.53e-01

Pos logit contributions
Vs+2.133
 Minutes+1.951
inki+1.896
�醒+1.830
OUS+1.804
Neg logit contributions
seeing-2.127
irit-1.976
arers-1.952
dating-1.932
effects-1.873
Token,
Token ID11
Feature activation+5.56e-01

Pos logit contributions
seeing+1.666
arers+1.541
irit+1.538
dating+1.526
lier+1.494
Neg logit contributions
Vs-1.474
 Minutes-1.329
inki-1.315
�醒-1.287
rador-1.232
Token par
Token ID1582
Feature activation+2.04e+00

Pos logit contributions
seeing+3.310
irit+3.052
arers+3.010
place+2.976
dating+2.959
Neg logit contributions
Vs-3.483
 Minutes-3.189
inki-3.160
�醒-3.157
rador-3.004
Tokenoch
Token ID5374
Feature activation+3.62e-01

Pos logit contributions
seeing+6.844
irit+6.750
arers+6.681
place+6.469
lier+6.341
Neg logit contributions
Vs-14.975
��-14.236
�醒-14.018
inki-13.894
OME-13.779
Tokenial
Token ID498
Feature activation+5.54e-01

Pos logit contributions
seeing+1.284
irit+1.142
arers+1.119
dating+1.072
lier+1.034
Neg logit contributions
Vs-3.188
 Minutes-3.012
inki-2.966
�醒-2.934
OUS-2.889
Token schools
Token ID4266
Feature activation+1.32e+00

Pos logit contributions
seeing+2.656
irit+2.461
arers+2.427
dating+2.389
place+2.335
Neg logit contributions
Vs-4.068
 Minutes-3.823
inki-3.739
�醒-3.680
USS-3.588
Token stand
Token ID1302
Feature activation+1.02e+00

Pos logit contributions
seeing+6.271
arers+5.735
place+5.583
irit+5.499
lier+5.303
Neg logit contributions
Vs-9.306
�醒-9.002
 Minutes-8.632
inki-8.600
OUS-8.376
Token to
Token ID284
Feature activation+1.79e-01

Pos logit contributions
seeing+6.093
arers+5.680
irit+5.578
place+5.552
lier+5.525
Neg logit contributions
Vs-5.808
�醒-5.678
inki-5.589
 Minutes-5.539
ahime-5.130
Token Cors
Token ID26978
Feature activation+1.57e-01

Pos logit contributions
seeing+2.368
irit+2.205
arers+2.173
dating+2.151
effects+2.085
Neg logit contributions
Vs-2.581
 Minutes-2.346
inki-2.324
�醒-2.280
rador-2.222
Tokenaires
Token ID17693
Feature activation+1.13e+00

Pos logit contributions
seeing+0.795
irit+0.727
arers+0.721
dating+0.704
effects+0.681
Neg logit contributions
Vs-1.155
 Minutes-1.072
inki-1.045
�醒-1.020
OUS-1.004
Token l
Token ID300
Feature activation+1.19e+00

Pos logit contributions
seeing+5.987
irit+5.581
arers+5.518
dating+5.362
lier+5.329
Neg logit contributions
Vs-7.128
�醒-6.555
 Minutes-6.520
inki-6.501
rador-6.250
Tokenoy
Token ID726
Feature activation+7.44e-01

Pos logit contributions
seeing+4.730
irit+4.585
arers+4.222
raint+4.167
lier+4.148
Neg logit contributions
Vs-9.089
 Minutes-8.656
�醒-8.464
inki-8.273
USS-8.054
Tokenaux
Token ID14644
Feature activation+5.40e-01

Pos logit contributions
seeing+4.191
arers+3.865
lier+3.803
irit+3.784
dating+3.735
Neg logit contributions
Vs-4.799
 Minutes-4.522
inki-4.311
�醒-4.306
rador-4.150
Token rest
Token ID1334
Feature activation+2.03e+00

Pos logit contributions
seeing+2.971
irit+2.755
arers+2.713
dating+2.664
lier+2.599
Neg logit contributions
Vs-3.617
 Minutes-3.335
inki-3.283
�醒-3.213
rador-3.134
Tokenants
Token ID1187
Feature activation+8.70e-01

Pos logit contributions
seeing+10.010
arers+9.589
irit+9.494
raint+9.464
lier+9.283
Neg logit contributions
Vs-11.613
 Minutes-11.404
�醒-10.987
USS-10.497
OME-10.478
Token.
Token ID13
Feature activation+5.19e-01

Pos logit contributions
seeing+4.587
irit+4.218
arers+4.161
dating+4.085
lier+4.064
Neg logit contributions
Vs-5.798
 Minutes-5.398
inki-5.272
�醒-5.196
rador-5.062
TokenPar
Token ID10044
Feature activation+1.30e+00

Pos logit contributions
seeing+3.315
irit+3.017
arers+2.996
dating+2.980
place+2.975
Neg logit contributions
Vs-2.898
inki-2.719
 Minutes-2.647
rador-2.618
�醒-2.558
Token le
Token ID443
Feature activation+1.11e+00

Pos logit contributions
seeing+6.977
lier+6.616
irit+6.526
arers+6.378
dating+6.287
Neg logit contributions
Vs-8.111
 Minutes-7.489
inki-7.394
�醒-7.206
OME-7.117
Token pr
Token ID778
Feature activation+9.02e-01

Pos logit contributions
seeing+6.294
irit+5.944
arers+5.805
dating+5.774
lier+5.687
Neg logit contributions
Vs-6.759
inki-6.246
 Minutes-6.065
�醒-6.046
rador-5.867
Tokenism
Token ID1042
Feature activation+7.06e-01

Pos logit contributions
seeing+2.193
irit+2.048
arers+1.990
dating+1.953
lier+1.946
Neg logit contributions
Vs-2.770
 Minutes-2.556
inki-2.504
�醒-2.413
OUS-2.367
Tokenatic
Token ID1512
Feature activation-1.93e-02

Pos logit contributions
seeing+1.947
irit+1.749
arers+1.627
rising+1.574
raint+1.548
Neg logit contributions
Vs-6.705
inki-6.291
 Minutes-6.280
�醒-6.096
USS-5.946
Token
Token ID198
Feature activation+1.63e-01

Pos logit contributions
Vs+0.107
 Minutes+0.097
inki+0.094
�醒+0.090
OUS+0.089
Neg logit contributions
seeing-0.133
irit-0.125
arers-0.123
dating-0.122
effects-0.119
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
seeing+1.263
dating+1.160
irit+1.157
place+1.155
arers+1.149
Neg logit contributions
Vs-0.763
�醒-0.690
inki-0.680
 Minutes-0.676
rador-0.630
Token-
Token ID12
Feature activation-5.13e-02

Pos logit contributions
seeing+0.720
irit+0.671
arers+0.663
dating+0.655
raint+0.635
Neg logit contributions
Vs-0.700
 Minutes-0.638
inki-0.620
�醒-0.597
OUS-0.590
Token Rest
Token ID8324
Feature activation+2.01e+00

Pos logit contributions
Vs+0.332
 Minutes+0.303
inki+0.297
�醒+0.289
rador+0.283
Neg logit contributions
seeing-0.308
irit-0.283
arers-0.280
dating-0.277
effects-0.268
Tokenless
Token ID1203
Feature activation+3.52e-01

Pos logit contributions
raint+8.656
seeing+8.020
arers+7.392
rising+7.305
ruct+7.251
Neg logit contributions
Vs-13.835
�醒-13.499
inki-12.632
 Minutes-12.454
USS-12.411
Token,
Token ID11
Feature activation+2.44e-01

Pos logit contributions
seeing+1.665
irit+1.508
arers+1.485
dating+1.463
effects+1.402
Neg logit contributions
Vs-2.709
 Minutes-2.520
inki-2.466
�醒-2.398
OUS-2.370
Token Quick
Token ID12029
Feature activation-3.18e-02

Pos logit contributions
seeing+1.463
irit+1.336
arers+1.326
dating+1.302
place+1.272
Neg logit contributions
Vs-1.577
 Minutes-1.423
inki-1.411
�醒-1.385
rador-1.348
Token to
Token ID284
Feature activation-8.33e-02

Pos logit contributions
Vs+0.220
 Minutes+0.203
inki+0.198
�醒+0.192
OUS+0.189
Neg logit contributions
seeing-0.175
irit-0.161
arers-0.159
dating-0.157
effects-0.152
Token anger
Token ID8993
Feature activation-2.87e-01

Pos logit contributions
Vs+0.551
 Minutes+0.508
inki+0.494
�醒+0.476
OUS+0.472
Neg logit contributions
seeing-0.485
irit-0.448
arers-0.443
dating-0.438
effects-0.424
Token 1980
Token ID7169
Feature activation-7.17e-01

Pos logit contributions
Vs+1.098
 Minutes+1.006
inki+0.979
�醒+0.948
OUS+0.932
Neg logit contributions
seeing-1.059
irit-0.982
arers-0.971
dating-0.960
effects-0.930
Tokens
Token ID82
Feature activation-5.82e-01

Pos logit contributions
Vs+3.685
 Minutes+3.269
inki+3.099
OUS+2.917
rador+2.874
Neg logit contributions
seeing-5.140
arers-4.883
irit-4.875
dating-4.747
raint-4.692
Token partly
Token ID11476
Feature activation-1.22e-01

Pos logit contributions
Vs+3.669
 Minutes+3.390
inki+3.249
�醒+3.131
OUS+3.116
Neg logit contributions
seeing-3.493
irit-3.256
arers-3.221
dating-3.161
raint-3.064
Token due
Token ID2233
Feature activation+1.05e-01

Pos logit contributions
Vs+0.725
 Minutes+0.661
inki+0.643
�醒+0.621
OUS+0.609
Neg logit contributions
seeing-0.788
irit-0.734
arers-0.726
dating-0.718
effects-0.698
Token to
Token ID284
Feature activation+7.88e-02

Pos logit contributions
seeing+0.741
irit+0.689
dating+0.682
arers+0.678
effects+0.662
Neg logit contributions
Vs-0.565
 Minutes-0.513
inki-0.505
�醒-0.501
OUS-0.462
Token spl
Token ID4328
Feature activation+2.00e+00

Pos logit contributions
seeing+0.491
irit+0.456
arers+0.450
dating+0.444
effects+0.436
Neg logit contributions
Vs-0.488
 Minutes-0.447
inki-0.438
�醒-0.426
OUS-0.413
Tokeninter
Token ID3849
Feature activation+7.90e-01

Pos logit contributions
irit+6.980
arers+6.755
seeing+6.699
ember+6.403
effects+6.171
Neg logit contributions
Vs-14.883
�醒-14.589
��-13.961
OME-13.896
USS-13.859
Tokening
Token ID278
Feature activation-3.84e-02

Pos logit contributions
seeing+3.973
irit+3.759
dating+3.731
effects+3.600
arers+3.570
Neg logit contributions
Vs-5.497
 Minutes-5.131
�醒-5.127
inki-4.985
rador-4.799
Token within
Token ID1626
Feature activation-3.92e-01

Pos logit contributions
Vs+0.245
 Minutes+0.224
inki+0.220
�醒+0.216
OUS+0.208
Neg logit contributions
seeing-0.230
irit-0.215
arers-0.212
dating-0.210
effects-0.205
Token the
Token ID262
Feature activation-4.77e-01

Pos logit contributions
Vs+2.299
 Minutes+2.101
inki+2.022
�醒+1.933
OUS+1.927
Neg logit contributions
seeing-2.564
irit-2.376
arers-2.345
dating-2.325
raint-2.260
Token group
Token ID1448
Feature activation-4.74e-01

Pos logit contributions
Vs+3.022
 Minutes+2.779
inki+2.687
�醒+2.577
OUS+2.563
Neg logit contributions
seeing-2.894
irit-2.672
arers-2.639
dating-2.611
raint-2.533
Token did
Token ID750
Feature activation-1.88e-01

Pos logit contributions
Vs+0.155
 Minutes+0.142
inki+0.138
�醒+0.135
OUS+0.132
Neg logit contributions
seeing-0.150
irit-0.139
arers-0.138
dating-0.136
effects-0.132
Token little
Token ID1310
Feature activation+2.66e-01

Pos logit contributions
Vs+0.951
 Minutes+0.853
inki+0.822
�醒+0.787
OUS+0.773
Neg logit contributions
seeing-1.385
irit-1.299
arers-1.287
dating-1.278
effects-1.242
Token to
Token ID284
Feature activation+3.63e-01

Pos logit contributions
seeing+1.842
irit+1.716
arers+1.685
dating+1.658
place+1.655
Neg logit contributions
Vs-1.482
 Minutes-1.324
inki-1.301
�醒-1.269
rador-1.226
Token curb
Token ID20799
Feature activation+1.41e-01

Pos logit contributions
seeing+1.884
irit+1.766
arers+1.752
dating+1.694
place+1.684
Neg logit contributions
Vs-2.601
 Minutes-2.378
inki-2.319
�醒-2.319
OUS-2.224
Token the
Token ID262
Feature activation+4.94e-01

Pos logit contributions
seeing+0.779
irit+0.718
arers+0.710
dating+0.698
rising+0.676
Neg logit contributions
Vs-0.968
 Minutes-0.899
inki-0.880
�醒-0.860
rador-0.836
Token trade
Token ID3292
Feature activation+2.00e+00

Pos logit contributions
seeing+2.667
irit+2.480
arers+2.431
dating+2.408
effects+2.392
Neg logit contributions
Vs-3.370
 Minutes-3.128
inki-3.053
�醒-3.043
rador-2.938
Token of
Token ID286
Feature activation+6.17e-01

Pos logit contributions
seeing+9.041
place+8.410
irit+8.379
effects+8.364
places+8.021
Neg logit contributions
Vs-12.489
 Minutes-12.007
�醒-11.917
inki-11.419
ahime-11.339
Token child
Token ID1200
Feature activation-5.20e-01

Pos logit contributions
seeing+3.162
irit+2.891
arers+2.881
dating+2.847
place+2.782
Neg logit contributions
Vs-4.359
�醒-4.003
inki-3.989
 Minutes-3.982
USS-3.782
Token pornography
Token ID18384
Feature activation-2.75e-01

Pos logit contributions
Vs+2.740
 Minutes+2.454
inki+2.342
�醒+2.244
OUS+2.242
Neg logit contributions
seeing-3.712
irit-3.482
arers-3.431
dating-3.388
raint-3.330
Token in
Token ID287
Feature activation+2.86e-01

Pos logit contributions
Vs+1.820
 Minutes+1.673
inki+1.628
�醒+1.575
OUS+1.556
Neg logit contributions
seeing-1.605
irit-1.483
arers-1.465
dating-1.447
effects-1.400
Token practice
Token ID3357
Feature activation-1.79e-02

Pos logit contributions
seeing+1.664
irit+1.552
arers+1.531
dating+1.522
place+1.485
Neg logit contributions
Vs-1.857
 Minutes-1.680
inki-1.672
�醒-1.650
OUS-1.590
Token a
Token ID257
Feature activation-8.67e-02

Pos logit contributions
Vs+4.226
 Minutes+3.883
inki+3.665
OUS+3.558
rador+3.479
Neg logit contributions
seeing-4.125
irit-3.881
arers-3.737
dating-3.735
raint-3.644
Token little
Token ID1310
Feature activation+4.71e-01

Pos logit contributions
Vs+0.515
 Minutes+0.468
inki+0.457
�醒+0.443
OUS+0.433
Neg logit contributions
seeing-0.563
irit-0.524
arers-0.519
dating-0.513
lier-0.499
Token more
Token ID517
Feature activation+1.27e-01

Pos logit contributions
seeing+2.802
irit+2.588
arers+2.545
dating+2.539
place+2.510
Neg logit contributions
Vs-3.016
 Minutes-2.737
�醒-2.709
inki-2.702
OUS-2.580
Token developer
Token ID8517
Feature activation+6.77e-01

Pos logit contributions
seeing+0.757
irit+0.699
arers+0.693
dating+0.687
place+0.669
Neg logit contributions
Vs-0.817
 Minutes-0.746
inki-0.734
�醒-0.725
OUS-0.697
Token hand
Token ID1021
Feature activation+1.11e+00

Pos logit contributions
seeing+3.913
irit+3.560
dating+3.508
lier+3.467
arers+3.414
Neg logit contributions
Vs-4.395
inki-4.016
�醒-4.008
 Minutes-3.999
USS-3.807
Token-
Token ID12
Feature activation+1.97e+00

Pos logit contributions
seeing+6.214
irit+5.399
lier+5.361
arers+5.305
dating+5.301
Neg logit contributions
Vs-7.354
�醒-6.897
 Minutes-6.786
inki-6.622
USS-6.578
Tokenholding
Token ID19216
Feature activation+3.35e-01

Pos logit contributions
seeing+8.832
pain+7.627
dating+7.396
place+7.347
lier+6.958
Neg logit contributions
�醒-14.221
Vs-13.828
 Minutes-13.223
OUS-13.104
USS-12.906
Token could
Token ID714
Feature activation-3.68e-02

Pos logit contributions
seeing+2.085
irit+1.896
arers+1.890
dating+1.862
place+1.842
Neg logit contributions
Vs-2.093
inki-1.896
 Minutes-1.886
�醒-1.884
USS-1.803
Token be
Token ID307
Feature activation+9.67e-01

Pos logit contributions
Vs+0.235
 Minutes+0.215
inki+0.209
�醒+0.204
OUS+0.199
Neg logit contributions
seeing-0.224
irit-0.208
arers-0.206
dating-0.202
place-0.196
Token a
Token ID257
Feature activation+3.24e-01

Pos logit contributions
seeing+5.733
arers+5.312
irit+5.268
dating+4.983
lier+4.981
Neg logit contributions
Vs-6.077
�醒-5.620
 Minutes-5.586
inki-5.506
rador-5.257
Token good
Token ID922
Feature activation-8.78e-02

Pos logit contributions
seeing+1.689
irit+1.534
arers+1.509
dating+1.485
place+1.468
Neg logit contributions
Vs-2.353
 Minutes-2.191
inki-2.134
�醒-2.092
OUS-2.049
Token83
Token ID5999
Feature activation-4.59e-01

Pos logit contributions
seeing+1.492
dating+1.370
irit+1.344
arers+1.340
place+1.311
Neg logit contributions
Vs-1.742
 Minutes-1.670
inki-1.643
�醒-1.634
rador-1.575
Token Active
Token ID14199
Feature activation+4.06e-01

Pos logit contributions
Vs+2.808
 Minutes+2.542
inki+2.441
�醒+2.352
OUS+2.343
Neg logit contributions
seeing-2.916
irit-2.711
arers-2.685
dating-2.655
effects-2.582
Token Elect
Token ID5903
Feature activation+3.17e-01

Pos logit contributions
seeing+2.478
irit+2.248
arers+2.242
dating+2.242
place+2.195
Neg logit contributions
Vs-2.494
inki-2.334
 Minutes-2.278
�醒-2.249
OUS-2.180
Tokenron
Token ID1313
Feature activation+2.58e-02

Pos logit contributions
seeing+1.583
irit+1.477
arers+1.445
dating+1.438
raint+1.387
Neg logit contributions
Vs-2.342
 Minutes-2.168
inki-2.129
�醒-2.070
OUS-2.054
Tokenically
Token ID1146
Feature activation+1.79e-01

Pos logit contributions
seeing+0.162
irit+0.150
arers+0.148
dating+0.147
place+0.143
Neg logit contributions
Vs-0.158
 Minutes-0.145
inki-0.143
�醒-0.138
rador-0.135
Token Sc
Token ID1446
Feature activation+1.96e+00

Pos logit contributions
seeing+1.065
irit+0.988
arers+0.972
dating+0.966
place+0.933
Neg logit contributions
Vs-1.159
 Minutes-1.062
inki-1.042
�醒-1.023
rador-1.000
Tokenanned
Token ID3577
Feature activation+3.92e-01

Pos logit contributions
hemat+7.362
seeing+6.906
irit+6.886
arers+6.803
dating+6.467
Neg logit contributions
�醒-14.459
Vs-13.945
 Minutes-13.559
OME-13.147
inki-13.095
Token Array
Token ID15690
Feature activation-4.72e-02

Pos logit contributions
seeing+2.570
arers+2.361
irit+2.358
dating+2.325
place+2.278
Neg logit contributions
Vs-2.245
inki-2.078
�醒-2.053
 Minutes-2.015
rador-1.953
Token Rad
Token ID5325
Feature activation+7.64e-01

Pos logit contributions
Vs+0.264
 Minutes+0.241
inki+0.236
�醒+0.229
rador+0.221
Neg logit contributions
seeing-0.321
irit-0.299
arers-0.297
dating-0.293
effects-0.285
Tokenars
Token ID945
Feature activation-1.13e-01

Pos logit contributions
seeing+2.653
arers+2.416
irit+2.402
dating+2.226
effects+2.103
Neg logit contributions
Vs-6.481
 Minutes-6.357
�醒-6.280
inki-6.173
rador-6.010
Token and
Token ID290
Feature activation-5.17e-02

Pos logit contributions
Vs+0.688
 Minutes+0.629
inki+0.611
�醒+0.592
OUS+0.580
Neg logit contributions
seeing-0.721
irit-0.670
arers-0.664
dating-0.656
effects-0.636
Token strike
Token ID5587
Feature activation-2.17e-01

Pos logit contributions
seeing+0.186
irit+0.173
arers+0.171
dating+0.169
place+0.165
Neg logit contributions
Vs-0.175
 Minutes-0.160
inki-0.156
�醒-0.152
OUS-0.147
Token networks
Token ID7686
Feature activation+5.09e-03

Pos logit contributions
Vs+1.290
 Minutes+1.177
inki+1.144
�醒+1.110
OUS+1.083
Neg logit contributions
seeing-1.405
irit-1.307
arers-1.298
dating-1.281
effects-1.245
Token on
Token ID319
Feature activation-6.38e-02

Pos logit contributions
seeing+0.031
irit+0.028
arers+0.028
dating+0.028
effects+0.027
Neg logit contributions
Vs-0.032
 Minutes-0.030
inki-0.029
�醒-0.028
rador-0.028
Token three
Token ID1115
Feature activation+3.24e-01

Pos logit contributions
Vs+0.379
 Minutes+0.343
inki+0.335
�醒+0.327
rador+0.320
Neg logit contributions
seeing-0.415
irit-0.385
arers-0.382
dating-0.378
effects-0.366
Token different
Token ID1180
Feature activation+9.62e-01

Pos logit contributions
seeing+1.785
irit+1.628
arers+1.601
dating+1.580
effects+1.546
Neg logit contributions
Vs-2.194
 Minutes-2.017
inki-1.993
�醒-1.980
rador-1.913
Token Open
Token ID4946
Feature activation+1.95e+00

Pos logit contributions
seeing+5.512
dating+4.975
arers+4.945
irit+4.932
lier+4.800
Neg logit contributions
Vs-5.964
�醒-5.417
inki-5.334
 Minutes-5.286
rador-5.108
Token Systems
Token ID11998
Feature activation+2.58e-01

Pos logit contributions
seeing+10.283
dating+9.309
raint+8.975
place+8.948
hemat+8.889
Neg logit contributions
Vs-9.999
�醒-9.672
rador-9.452
inki-9.406
asionally-9.316
Token Inter
Token ID4225
Feature activation+1.25e+00

Pos logit contributions
seeing+1.654
irit+1.527
arers+1.512
dating+1.504
place+1.451
Neg logit contributions
Vs-1.527
 Minutes-1.386
inki-1.367
�醒-1.348
OUS-1.291
Tokenconnection
Token ID38659
Feature activation+1.12e-01

Pos logit contributions
seeing+5.782
effects+5.287
place+5.263
dating+5.191
arers+5.105
Neg logit contributions
Vs-8.716
 Minutes-8.350
�醒-8.171
inki-7.998
asionally-7.855
Token (
Token ID357
Feature activation-6.31e-03

Pos logit contributions
seeing+0.754
irit+0.699
arers+0.696
dating+0.688
effects+0.670
Neg logit contributions
Vs-0.627
 Minutes-0.565
inki-0.557
�醒-0.544
rador-0.524
TokenOS
Token ID2640
Feature activation-7.83e-01

Pos logit contributions
Vs+0.031
 Minutes+0.028
inki+0.027
�醒+0.026
rador+0.025
Neg logit contributions
seeing-0.048
irit-0.045
arers-0.044
dating-0.044
effects-0.043
Token,
Token ID11
Feature activation-8.19e-01

Pos logit contributions
seeing+1.460
irit+1.363
arers+1.349
dating+1.328
place+1.285
Neg logit contributions
Vs-1.565
 Minutes-1.408
inki-1.404
�醒-1.399
rador-1.350
Token the
Token ID262
Feature activation-2.36e-04

Pos logit contributions
Vs+4.558
 Minutes+4.237
inki+4.029
OUS+3.815
USS+3.764
Neg logit contributions
seeing-5.411
irit-5.104
arers-5.014
dating-4.927
effects-4.865
Token nonprofit
Token ID15346
Feature activation+6.04e-01

Pos logit contributions
Vs+0.001
 Minutes+0.001
inki+0.001
�醒+0.001
rador+0.001
Neg logit contributions
seeing-0.001
irit-0.001
arers-0.001
dating-0.001
rising-0.001
Token behind
Token ID2157
Feature activation+1.95e-02

Pos logit contributions
seeing+3.631
irit+3.308
arers+3.265
dating+3.265
lier+3.228
Neg logit contributions
Vs-3.862
 Minutes-3.475
�醒-3.452
inki-3.425
OUS-3.266
Token the
Token ID262
Feature activation-2.91e-02

Pos logit contributions
seeing+0.121
irit+0.112
arers+0.111
dating+0.110
effects+0.106
Neg logit contributions
Vs-0.122
 Minutes-0.111
inki-0.108
�醒-0.105
OUS-0.103
Token open
Token ID1280
Feature activation+1.95e+00

Pos logit contributions
Vs+0.186
 Minutes+0.170
inki+0.166
�醒+0.161
OUS+0.158
Neg logit contributions
seeing-0.177
irit-0.164
arers-0.161
dating-0.160
rising-0.155
Token-
Token ID12
Feature activation+1.66e+00

Pos logit contributions
seeing+7.838
dating+7.499
lier+7.094
place+7.033
arers+6.894
Neg logit contributions
Vs-12.981
�醒-12.466
 Minutes-12.222
inki-11.990
veyard-11.788
Tokensource
Token ID10459
Feature activation+8.59e-01

Pos logit contributions
seeing+4.644
dating+3.934
place+3.816
rising+3.672
pain+3.440
Neg logit contributions
Vs-14.156
 Minutes-14.107
�醒-14.005
OUS-13.594
inki-13.557
Token web
Token ID3992
Feature activation+6.91e-01

Pos logit contributions
seeing+4.595
dating+4.206
irit+4.177
arers+4.124
lier+4.046
Neg logit contributions
Vs-5.818
�醒-5.306
inki-5.249
 Minutes-5.204
rador-4.964
Token browser
Token ID6444
Feature activation+1.77e-01

Pos logit contributions
seeing+4.018
dating+3.691
irit+3.681
arers+3.619
lier+3.572
Neg logit contributions
Vs-4.350
 Minutes-4.081
inki-3.941
�醒-3.939
OUS-3.765
Token,
Token ID11
Feature activation-2.43e-01

Pos logit contributions
seeing+0.864
irit+0.782
arers+0.769
dating+0.763
place+0.730
Neg logit contributions
Vs-1.336
 Minutes-1.235
inki-1.217
�醒-1.189
OUS-1.163
Token
Token ID247
Feature activation+3.48e-02

Pos logit contributions
Vs+0.649
 Minutes+0.607
inki+0.594
�醒+0.578
OUS+0.567
Neg logit contributions
seeing-0.479
irit-0.443
arers-0.435
dating-0.426
raint-0.413
Tokens
Token ID82
Feature activation+1.01e-01

Pos logit contributions
seeing+0.230
irit+0.214
arers+0.212
dating+0.209
effects+0.204
Neg logit contributions
Vs-0.203
 Minutes-0.186
inki-0.180
�醒-0.175
OUS-0.171
Token a
Token ID257
Feature activation+3.77e-01

Pos logit contributions
seeing+0.644
irit+0.598
arers+0.596
dating+0.584
place+0.569
Neg logit contributions
Vs-0.616
 Minutes-0.562
inki-0.549
�醒-0.535
OUS-0.519
Token narrow
Token ID7135
Feature activation-9.70e-02

Pos logit contributions
seeing+2.267
irit+2.109
arers+2.083
dating+2.029
place+2.027
Neg logit contributions
Vs-2.399
 Minutes-2.210
inki-2.146
�醒-2.093
OUS-2.030
Token,
Token ID11
Feature activation+1.70e-01

Pos logit contributions
Vs+0.626
 Minutes+0.574
inki+0.559
�醒+0.541
OUS+0.533
Neg logit contributions
seeing-0.583
irit-0.539
arers-0.533
dating-0.527
raint-0.509
Token par
Token ID1582
Feature activation+1.93e+00

Pos logit contributions
seeing+1.044
irit+0.964
arers+0.956
dating+0.938
place+0.914
Neg logit contributions
Vs-1.081
 Minutes-0.993
inki-0.960
�醒-0.936
OUS-0.915
Tokenoch
Token ID5374
Feature activation+3.87e-01

Pos logit contributions
seeing+6.858
irit+6.244
dating+5.819
lier+5.809
arers+5.566
Neg logit contributions
Vs-14.574
 Minutes-13.657
inki-13.293
OME-13.054
USS-12.991
Tokenial
Token ID498
Feature activation+6.78e-01

Pos logit contributions
seeing+1.382
irit+1.233
arers+1.171
dating+1.163
lier+1.108
Neg logit contributions
Vs-3.436
 Minutes-3.230
inki-3.160
�醒-3.114
USS-3.072
Token
Token ID564
Feature activation+1.02e-01

Pos logit contributions
seeing+4.006
irit+3.748
arers+3.684
dating+3.614
place+3.589
Neg logit contributions
Vs-4.297
 Minutes-3.976
inki-3.841
�醒-3.791
rador-3.620
Token
Token ID246
Feature activation-1.54e-01

Pos logit contributions
seeing+0.646
irit+0.604
arers+0.592
dating+0.585
raint+0.570
Neg logit contributions
Vs-0.618
 Minutes-0.565
inki-0.547
�醒-0.526
OUS-0.520
Tokenmy
Token ID1820
Feature activation+2.05e-01

Pos logit contributions
Vs+1.129
 Minutes+1.043
inki+1.021
�醒+0.990
OUS+0.979
Neg logit contributions
seeing-0.778
irit-0.714
arers-0.704
dating-0.692
raint-0.666
Token legislation
Token ID5520
Feature activation-7.69e-01

Pos logit contributions
Vs+1.154
 Minutes+1.053
inki+1.025
�醒+0.991
OUS+0.973
Neg logit contributions
seeing-1.177
irit-1.094
arers-1.083
dating-1.069
effects-1.038
Token passed
Token ID3804
Feature activation-2.11e-01

Pos logit contributions
Vs+4.546
 Minutes+4.260
inki+4.034
OUS+3.892
rador+3.858
Neg logit contributions
seeing-4.776
irit-4.417
dating-4.342
arers-4.316
raint-4.236
Token by
Token ID416
Feature activation-2.35e-01

Pos logit contributions
Vs+1.164
 Minutes+1.053
inki+1.018
�醒+0.977
OUS+0.961
Neg logit contributions
seeing-1.463
irit-1.370
arers-1.357
dating-1.343
effects-1.307
Token the
Token ID262
Feature activation+5.40e-02

Pos logit contributions
Vs+1.378
 Minutes+1.262
inki+1.212
�醒+1.167
OUS+1.156
Neg logit contributions
seeing-1.547
irit-1.438
arers-1.421
dating-1.412
effects-1.373
Token territory
Token ID7674
Feature activation+2.18e-01

Pos logit contributions
seeing+0.308
irit+0.286
arers+0.284
dating+0.277
place+0.270
Neg logit contributions
Vs-0.363
 Minutes-0.330
inki-0.327
�醒-0.317
rador-0.309
Token par
Token ID1582
Feature activation+1.93e+00

Pos logit contributions
seeing+1.327
irit+1.228
arers+1.223
dating+1.196
raint+1.164
Neg logit contributions
Vs-1.390
 Minutes-1.259
inki-1.248
�醒-1.213
OUS-1.168
Tokenliam
Token ID5058
Feature activation+2.57e-01

Pos logit contributions
seeing+7.322
irit+6.670
arers+6.510
dating+6.288
inct+6.151
Neg logit contributions
Vs-14.751
avorite-13.695
OME-13.499
�醒-13.344
��-13.176
Tokenents
Token ID658
Feature activation+2.15e-01

Pos logit contributions
seeing+1.363
irit+1.259
arers+1.258
dating+1.249
place+1.191
Neg logit contributions
Vs-1.822
�醒-1.678
 Minutes-1.658
inki-1.635
OUS-1.606
Token.[
Token ID3693
Feature activation+6.48e-01

Pos logit contributions
seeing+1.227
irit+1.127
arers+1.120
dating+1.109
place+1.072
Neg logit contributions
Vs-1.454
 Minutes-1.322
inki-1.311
�醒-1.292
OUS-1.251
Token196
Token ID25272
Feature activation+1.33e-01

Pos logit contributions
seeing+4.389
dating+4.020
irit+3.996
arers+3.952
place+3.926
Neg logit contributions
Vs-3.588
 Minutes-3.274
inki-3.214
�醒-3.149
rador-3.022
Token]
Token ID60
Feature activation-3.73e-01

Pos logit contributions
seeing+0.767
irit+0.707
arers+0.701
dating+0.692
effects+0.668
Neg logit contributions
Vs-0.888
 Minutes-0.820
inki-0.792
�醒-0.772
OUS-0.763
Token the
Token ID262
Feature activation+9.65e-02

Pos logit contributions
seeing+3.003
arers+2.793
irit+2.756
dating+2.669
lier+2.656
Neg logit contributions
Vs-3.144
 Minutes-2.912
inki-2.863
�醒-2.834
rador-2.712
Token most
Token ID749
Feature activation+2.19e-01

Pos logit contributions
seeing+0.626
irit+0.582
arers+0.579
dating+0.571
place+0.555
Neg logit contributions
Vs-0.571
 Minutes-0.523
inki-0.510
�醒-0.495
OUS-0.482
Token successful
Token ID4388
Feature activation+3.05e-01

Pos logit contributions
seeing+1.321
arers+1.224
irit+1.217
dating+1.177
place+1.165
Neg logit contributions
Vs-1.411
 Minutes-1.288
inki-1.260
�醒-1.244
rador-1.196
Token.
Token ID13
Feature activation+6.34e-02

Pos logit contributions
seeing+1.895
arers+1.772
irit+1.754
dating+1.735
place+1.716
Neg logit contributions
Vs-1.857
 Minutes-1.697
�醒-1.670
inki-1.658
rador-1.590
Token These
Token ID2312
Feature activation-1.07e-01

Pos logit contributions
seeing+0.443
irit+0.406
arers+0.406
dating+0.402
place+0.397
Neg logit contributions
Vs-0.343
inki-0.311
 Minutes-0.304
�醒-0.300
rador-0.291
Token trade
Token ID3292
Feature activation+1.92e+00

Pos logit contributions
Vs+0.637
 Minutes+0.580
inki+0.565
�醒+0.554
rador+0.537
Neg logit contributions
seeing-0.690
irit-0.642
arers-0.640
dating-0.630
place-0.611
Token-
Token ID12
Feature activation+1.56e+00

Pos logit contributions
seeing+9.922
place+9.168
irit+8.899
places+8.844
arers+8.842
Neg logit contributions
Vs-10.813
 Minutes-10.510
�醒-10.394
inki-10.344
ahime-10.058
Tokenoffs
Token ID8210
Feature activation+5.84e-01

Pos logit contributions
seeing+4.207
place+3.557
effects+3.391
dating+3.343
places+3.332
Neg logit contributions
�醒-13.595
Vs-13.469
 Minutes-13.336
OME-12.955
inki-12.728
Token are
Token ID389
Feature activation+8.83e-02

Pos logit contributions
seeing+3.431
irit+3.186
arers+3.134
dating+3.091
place+3.075
Neg logit contributions
Vs-3.751
 Minutes-3.415
inki-3.409
�醒-3.370
USS-3.237
Token different
Token ID1180
Feature activation+5.43e-01

Pos logit contributions
seeing+0.562
irit+0.521
arers+0.520
dating+0.509
effects+0.498
Neg logit contributions
Vs-0.535
 Minutes-0.487
inki-0.479
�醒-0.467
rador-0.451
Token for
Token ID329
Feature activation+4.33e-01

Pos logit contributions
seeing+3.164
irit+2.856
dating+2.841
arers+2.832
lier+2.802
Neg logit contributions
Vs-3.539
 Minutes-3.262
inki-3.179
�醒-3.132
USS-3.066
Token had
Token ID550
Feature activation+6.29e-01

Pos logit contributions
Vs+0.332
 Minutes+0.303
inki+0.295
�醒+0.288
OUS+0.281
Neg logit contributions
seeing-0.337
irit-0.312
arers-0.311
dating-0.305
effects-0.296
Token '
Token ID705
Feature activation+5.58e-01

Pos logit contributions
seeing+3.751
irit+3.392
arers+3.373
dating+3.333
place+3.298
Neg logit contributions
Vs-3.920
inki-3.578
 Minutes-3.556
�醒-3.543
USS-3.364
TokenF
Token ID37
Feature activation-1.12e+00

Pos logit contributions
seeing+3.901
dating+3.573
irit+3.496
effects+3.471
rising+3.460
Neg logit contributions
Vs-2.914
 Minutes-2.772
�醒-2.667
inki-2.633
rador-2.513
Token---
Token ID6329
Feature activation+1.93e-01

Pos logit contributions
Vs+6.693
 Minutes+6.011
inki+5.796
USS+5.737
OUS+5.705
Neg logit contributions
seeing-6.893
arers-6.408
irit-6.404
dating-6.159
raint-6.125
Token'
Token ID6
Feature activation+3.99e-01

Pos logit contributions
seeing+1.509
irit+1.419
arers+1.398
dating+1.394
effects+1.361
Neg logit contributions
Vs-0.882
 Minutes-0.779
inki-0.755
�醒-0.738
rador-0.704
Token sc
Token ID629
Feature activation+1.92e+00

Pos logit contributions
seeing+2.054
irit+1.875
arers+1.845
dating+1.841
effects+1.788
Neg logit contributions
Vs-2.835
 Minutes-2.657
inki-2.589
�醒-2.577
rador-2.489
Tokenrawled
Token ID49263
Feature activation+6.73e-01

Pos logit contributions
arers+5.105
seeing+4.883
irit+4.603
rising+4.544
hemat+4.493
Neg logit contributions
�醒-15.316
Vs-15.156
inki-15.002
 Minutes-14.764
veyard-14.196
Token in
Token ID287
Feature activation+6.99e-01

Pos logit contributions
seeing+3.455
irit+3.124
place+3.072
arers+3.066
dating+3.058
Neg logit contributions
Vs-4.719
inki-4.468
 Minutes-4.403
�醒-4.355
rador-4.173
Token large
Token ID1588
Feature activation+9.96e-02

Pos logit contributions
seeing+3.877
irit+3.596
place+3.514
dating+3.480
arers+3.455
Neg logit contributions
Vs-4.574
inki-4.220
 Minutes-4.186
�醒-4.100
OUS-3.927
Token letters
Token ID7475
Feature activation+5.77e-01

Pos logit contributions
seeing+0.608
irit+0.563
arers+0.555
dating+0.551
place+0.536
Neg logit contributions
Vs-0.627
 Minutes-0.576
inki-0.564
�醒-0.546
OUS-0.533
Token across
Token ID1973
Feature activation+3.88e-01

Pos logit contributions
seeing+3.104
irit+2.812
dating+2.769
arers+2.757
place+2.729
Neg logit contributions
Vs-3.980
 Minutes-3.686
inki-3.664
�醒-3.627
rador-3.499
Token cost
Token ID1575
Feature activation-1.86e-01

Pos logit contributions
Vs+1.168
 Minutes+1.058
inki+1.025
�醒+0.986
OUS+0.970
Neg logit contributions
seeing-1.397
irit-1.306
arers-1.293
dating-1.279
effects-1.245
Token of
Token ID286
Feature activation-4.95e-01

Pos logit contributions
Vs+1.237
 Minutes+1.138
inki+1.108
�醒+1.072
OUS+1.058
Neg logit contributions
seeing-1.083
irit-1.001
arers-0.988
dating-0.976
effects-0.944
Token a
Token ID257
Feature activation-4.27e-01

Pos logit contributions
Vs+3.109
 Minutes+2.850
inki+2.748
OUS+2.645
�醒+2.627
Neg logit contributions
seeing-3.019
irit-2.806
arers-2.775
dating-2.743
raint-2.667
Token gallon
Token ID26860
Feature activation-1.37e-01

Pos logit contributions
Vs+2.174
 Minutes+1.952
inki+1.873
�醒+1.785
OUS+1.771
Neg logit contributions
seeing-3.119
irit-2.931
arers-2.906
dating-2.880
effects-2.813
Token of
Token ID286
Feature activation-4.38e-01

Pos logit contributions
Vs+0.917
 Minutes+0.845
inki+0.822
�醒+0.795
OUS+0.787
Neg logit contributions
seeing-0.786
irit-0.726
arers-0.717
dating-0.708
effects-0.685
Token diesel
Token ID21866
Feature activation-2.05e+00

Pos logit contributions
Vs+2.801
 Minutes+2.577
inki+2.489
�醒+2.385
OUS+2.380
Neg logit contributions
seeing-2.633
irit-2.435
arers-2.414
dating-2.385
effects-2.318
Token is
Token ID318
Feature activation+9.21e-02

Pos logit contributions
Vs+10.979
 Minutes+9.670
OUS+9.169
inki+8.939
 occasionally+8.886
Neg logit contributions
seeing-11.815
irit-11.330
arers-11.263
dating-10.731
effects-10.578
Token about
Token ID546
Feature activation-7.43e-01

Pos logit contributions
seeing+0.582
irit+0.539
arers+0.535
dating+0.530
lier+0.519
Neg logit contributions
Vs-0.560
 Minutes-0.509
inki-0.502
�醒-0.496
rador-0.479
Token $
Token ID720
Feature activation+7.74e-01

Pos logit contributions
Vs+3.816
 Minutes+3.451
inki+3.256
OUS+3.131
USS+3.090
Neg logit contributions
seeing-5.318
irit-5.021
arers-4.949
dating-4.870
raint-4.788
Token4
Token ID19
Feature activation+5.16e-01

Pos logit contributions
seeing+4.156
dating+3.718
place+3.668
irit+3.660
effects+3.633
Neg logit contributions
Vs-5.087
�醒-4.951
inki-4.886
 Minutes-4.798
rador-4.648
Token.
Token ID13
Feature activation+1.13e-01

Pos logit contributions
seeing+2.906
irit+2.683
arers+2.634
dating+2.632
lier+2.607
Neg logit contributions
Vs-3.425
 Minutes-3.136
inki-3.074
�醒-2.979
rador-2.960
Token lit
Token ID6578
Feature activation+6.50e-01

Pos logit contributions
Vs+0.345
 Minutes+0.313
inki+0.307
�醒+0.296
rador+0.289
Neg logit contributions
seeing-0.405
irit-0.379
arers-0.373
dating-0.369
raint-0.359
Tokenre
Token ID260
Feature activation-2.48e-01

Pos logit contributions
seeing+2.495
arers+2.358
irit+2.287
dating+2.137
lier+2.129
Neg logit contributions
Vs-5.347
 Minutes-5.134
inki-5.017
�醒-4.961
OUS-4.853
Token of
Token ID286
Feature activation-3.51e-01

Pos logit contributions
Vs+1.601
 Minutes+1.476
inki+1.438
�醒+1.394
OUS+1.370
Neg logit contributions
seeing-1.482
irit-1.375
arers-1.355
dating-1.338
effects-1.295
Token reduction
Token ID7741
Feature activation-3.68e-01

Pos logit contributions
Vs+2.033
 Minutes+1.847
inki+1.791
�醒+1.724
OUS+1.697
Neg logit contributions
seeing-2.331
irit-2.177
arers-2.154
dating-2.131
effects-2.071
Token in
Token ID287
Feature activation-2.46e-01

Pos logit contributions
Vs+2.265
 Minutes+2.072
inki+2.013
�醒+1.945
OUS+1.910
Neg logit contributions
seeing-2.312
irit-2.151
arers-2.125
dating-2.101
effects-2.040
Token diesel
Token ID21866
Feature activation-2.05e+00

Pos logit contributions
Vs+1.446
 Minutes+1.315
inki+1.277
�醒+1.231
OUS+1.210
Neg logit contributions
seeing-1.618
irit-1.509
arers-1.493
dating-1.475
effects-1.435
Token consumption
Token ID7327
Feature activation-5.60e-01

Pos logit contributions
Vs+10.393
 Minutes+8.969
inki+8.698
OUS+8.426
OME+8.147
Neg logit contributions
seeing-12.887
dating-11.960
arers-11.833
irit-11.689
hemat-11.379
Token per
Token ID583
Feature activation-3.60e-01

Pos logit contributions
Vs+3.384
 Minutes+3.093
inki+2.977
OUS+2.849
�醒+2.815
Neg logit contributions
seeing-3.531
irit-3.299
arers-3.256
dating-3.220
raint-3.121
Token coach
Token ID3985
Feature activation-1.95e-01

Pos logit contributions
Vs+2.015
 Minutes+1.827
inki+1.765
�醒+1.694
OUS+1.672
Neg logit contributions
seeing-2.453
irit-2.297
arers-2.271
dating-2.253
effects-2.192
Token per
Token ID583
Feature activation+1.16e-01

Pos logit contributions
Vs+1.236
 Minutes+1.133
inki+1.108
�醒+1.071
OUS+1.050
Neg logit contributions
seeing-1.187
irit-1.095
arers-1.089
dating-1.070
raint-1.039
Token ann
Token ID1529
Feature activation-1.44e+00

Pos logit contributions
seeing+0.727
arers+0.670
irit+0.669
dating+0.654
lier+0.641
Neg logit contributions
Vs-0.710
 Minutes-0.645
inki-0.634
�醒-0.624
OUS-0.600
Token point
Token ID966
Feature activation+2.99e-01

Pos logit contributions
seeing+0.124
irit+0.115
arers+0.114
dating+0.112
place+0.109
Neg logit contributions
Vs-0.146
 Minutes-0.135
inki-0.130
�醒-0.128
OUS-0.125
Token towards
Token ID3371
Feature activation-2.44e-01

Pos logit contributions
seeing+1.921
arers+1.775
irit+1.751
dating+1.730
lier+1.716
Neg logit contributions
Vs-1.779
 Minutes-1.649
inki-1.609
�醒-1.590
rador-1.529
Token the
Token ID262
Feature activation+5.39e-03

Pos logit contributions
Vs+1.411
 Minutes+1.280
inki+1.238
�醒+1.186
OUS+1.174
Neg logit contributions
seeing-1.618
irit-1.515
arers-1.493
dating-1.480
effects-1.440
Token spacecraft
Token ID16807
Feature activation-4.28e-02

Pos logit contributions
seeing+0.035
irit+0.032
arers+0.032
dating+0.032
effects+0.031
Neg logit contributions
Vs-0.032
 Minutes-0.030
inki-0.029
�醒-0.028
OUS-0.027
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
Vs+0.290
 Minutes+0.268
inki+0.262
�醒+0.255
OUS+0.250
Neg logit contributions
seeing-0.244
irit-0.223
arers-0.222
dating-0.218
effects-0.211
Token Deep
Token ID10766
Feature activation-2.02e+00

Pos logit contributions
Vs+1.465
 Minutes+1.310
inki+1.273
�醒+1.218
OUS+1.193
Neg logit contributions
seeing-2.077
irit-1.948
arers-1.931
dating-1.912
effects-1.865
Token Space
Token ID4687
Feature activation+4.96e-01

Pos logit contributions
Vs+5.670
 Minutes+5.241
OME+4.195
 SPACE+4.112
OUS+4.028
Neg logit contributions
seeing-15.868
irit-15.656
dating-15.168
arers-15.160
accompan-15.069
Token 1
Token ID352
Feature activation-2.56e-01

Pos logit contributions
seeing+2.902
irit+2.654
arers+2.624
dating+2.592
raint+2.528
Neg logit contributions
Vs-3.224
 Minutes-2.919
inki-2.916
�醒-2.840
rador-2.774
Token then
Token ID788
Feature activation-3.90e-01

Pos logit contributions
Vs+1.587
 Minutes+1.452
inki+1.413
�醒+1.367
OUS+1.343
Neg logit contributions
seeing-1.598
irit-1.484
arers-1.469
dating-1.451
effects-1.407
Token entered
Token ID5982
Feature activation-8.86e-01

Pos logit contributions
Vs+2.440
 Minutes+2.235
inki+2.161
�醒+2.072
OUS+2.060
Neg logit contributions
seeing-2.402
irit-2.239
arers-2.196
dating-2.183
raint-2.122
Token its
Token ID663
Feature activation-7.56e-01

Pos logit contributions
Vs+4.822
 Minutes+4.474
inki+4.195
rador+3.995
�醒+3.991
Neg logit contributions
seeing-5.867
irit-5.587
arers-5.532
dating-5.432
raint-5.307
Token referred
Token ID6412
Feature activation-9.21e-01

Pos logit contributions
seeing+2.911
arers+2.694
irit+2.655
dating+2.638
lier+2.577
Neg logit contributions
Vs-3.195
 Minutes-2.932
inki-2.929
�醒-2.892
rador-2.811
Token to
Token ID284
Feature activation-3.89e-01

Pos logit contributions
Vs+4.431
 Minutes+4.149
inki+3.652
�醒+3.618
OUS+3.590
Neg logit contributions
seeing-6.768
irit-6.419
arers-6.264
raint-6.097
effects-6.096
Token as
Token ID355
Feature activation-4.19e-01

Pos logit contributions
Vs+2.257
 Minutes+2.069
inki+1.976
OUS+1.893
�醒+1.884
Neg logit contributions
seeing-2.567
irit-2.398
arers-2.367
dating-2.335
effects-2.284
Token "
Token ID366
Feature activation-5.34e-01

Pos logit contributions
Vs+2.574
 Minutes+2.390
inki+2.280
�醒+2.189
OUS+2.186
Neg logit contributions
seeing-2.594
irit-2.412
arers-2.396
dating-2.346
effects-2.301
TokenJohn
Token ID7554
Feature activation-6.89e-01

Pos logit contributions
Vs+3.166
 Minutes+2.818
inki+2.741
USS+2.628
OUS+2.621
Neg logit contributions
seeing-3.412
arers-3.235
irit-3.228
dating-3.100
raint-3.083
Token Doe
Token ID31780
Feature activation-2.01e+00

Pos logit contributions
Vs+2.771
 Minutes+2.454
inki+2.307
OUS+2.113
�醒+2.110
Neg logit contributions
seeing-5.658
irit-5.381
arers-5.367
dating-5.276
effects-5.188
Token"
Token ID1
Feature activation-5.59e-01

Pos logit contributions
Vs+9.397
 Minutes+8.775
inki+7.753
OUS+7.637
XY+7.473
Neg logit contributions
seeing-12.854
arers-12.272
irit-12.245
effects-11.806
dating-11.757
Token contends
Token ID33799
Feature activation+6.41e-02

Pos logit contributions
Vs+3.265
 Minutes+2.976
inki+2.866
OUS+2.727
�醒+2.701
Neg logit contributions
seeing-3.650
irit-3.427
arers-3.344
dating-3.329
raint-3.253
Token he
Token ID339
Feature activation+1.72e-01

Pos logit contributions
seeing+0.356
irit+0.327
arers+0.323
dating+0.320
effects+0.308
Neg logit contributions
Vs-0.441
 Minutes-0.406
inki-0.398
�醒-0.387
rador-0.380
Token was
Token ID373
Feature activation+1.25e-01

Pos logit contributions
seeing+1.126
arers+1.053
irit+1.051
dating+1.026
lier+0.995
Neg logit contributions
Vs-1.005
 Minutes-0.918
inki-0.898
�醒-0.896
rador-0.858
Token denied
Token ID6699
Feature activation+3.18e-01

Pos logit contributions
seeing+0.766
irit+0.703
arers+0.702
dating+0.695
lier+0.667
Neg logit contributions
Vs-0.791
 Minutes-0.721
inki-0.711
�醒-0.690
rador-0.672
Token will
Token ID481
Feature activation-3.99e-01

Pos logit contributions
seeing+1.695
irit+1.548
arers+1.541
dating+1.530
place+1.479
Neg logit contributions
Vs-1.872
 Minutes-1.726
inki-1.682
�醒-1.665
OUS-1.612
Token be
Token ID307
Feature activation+7.59e-02

Pos logit contributions
Vs+2.135
 Minutes+1.925
inki+1.880
�醒+1.785
OUS+1.773
Neg logit contributions
seeing-2.802
irit-2.639
arers-2.615
dating-2.587
raint-2.520
Token good
Token ID922
Feature activation-1.98e-01

Pos logit contributions
seeing+0.464
irit+0.425
arers+0.423
dating+0.416
place+0.406
Neg logit contributions
Vs-0.481
 Minutes-0.437
inki-0.427
�醒-0.423
rador-0.409
Token times
Token ID1661
Feature activation-3.78e-02

Pos logit contributions
Vs+1.140
 Minutes+1.035
inki+1.004
�醒+0.970
OUS+0.950
Neg logit contributions
seeing-1.323
irit-1.234
arers-1.221
dating-1.208
effects-1.175
Token and
Token ID290
Feature activation-1.65e-01

Pos logit contributions
Vs+0.249
 Minutes+0.229
inki+0.223
�醒+0.216
OUS+0.213
Neg logit contributions
seeing-0.222
irit-0.204
arers-0.202
dating-0.200
effects-0.193
Token hard
Token ID1327
Feature activation-2.00e+00

Pos logit contributions
Vs+1.129
 Minutes+1.041
inki+1.015
�醒+0.984
OUS+0.971
Neg logit contributions
seeing-0.929
irit-0.856
arers-0.845
dating-0.834
effects-0.806
Token times
Token ID1661
Feature activation+3.05e-01

Pos logit contributions
Vs+9.249
 Minutes+8.161
inki+7.648
rador+7.428
OUS+7.122
Neg logit contributions
seeing-13.198
irit-12.915
dating-12.266
raint-12.263
arers-12.178
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
seeing+1.756
irit+1.583
arers+1.570
dating+1.569
rising+1.519
Neg logit contributions
Vs-2.037
 Minutes-1.868
inki-1.825
�醒-1.807
rador-1.762
Token unexpected
Token ID10059
Feature activation-3.05e-01

Pos logit contributions
Vs+1.349
 Minutes+1.235
inki+1.200
�醒+1.154
OUS+1.142
Neg logit contributions
seeing-1.331
irit-1.239
arers-1.223
dating-1.209
effects-1.172
Token turns
Token ID4962
Feature activation-6.34e-01

Pos logit contributions
Vs+1.786
 Minutes+1.624
inki+1.575
�醒+1.513
OUS+1.493
Neg logit contributions
seeing-2.009
irit-1.878
arers-1.856
dating-1.836
effects-1.782
Token.
Token ID13
Feature activation+8.25e-01

Pos logit contributions
Vs+4.630
 Minutes+4.328
inki+4.204
OUS+4.033
�醒+3.995
Neg logit contributions
seeing-3.120
irit-2.902
arers-2.889
dating-2.799
raint-2.713
Token supporters
Token ID5941
Feature activation+6.81e-01

Pos logit contributions
Vs+1.060
 Minutes+0.975
inki+0.950
�醒+0.921
OUS+0.907
Neg logit contributions
seeing-0.926
irit-0.855
arers-0.845
dating-0.835
effects-0.807
Token have
Token ID423
Feature activation+1.71e-01

Pos logit contributions
seeing+3.724
irit+3.443
arers+3.423
dating+3.334
place+3.265
Neg logit contributions
Vs-4.659
 Minutes-4.263
�醒-4.215
inki-4.210
OUS-3.978
Token been
Token ID587
Feature activation+3.50e-01

Pos logit contributions
seeing+1.060
arers+0.988
irit+0.987
dating+0.960
effects+0.936
Neg logit contributions
Vs-1.056
 Minutes-0.968
inki-0.944
�醒-0.929
OUS-0.892
Token been
Token ID587
Feature activation+2.91e-01

Pos logit contributions
seeing+1.980
arers+1.810
irit+1.787
dating+1.764
lier+1.708
Neg logit contributions
Vs-2.371
 Minutes-2.186
inki-2.158
�醒-2.149
OUS-2.037
Token releasing
Token ID13011
Feature activation+4.11e-01

Pos logit contributions
seeing+1.653
arers+1.510
irit+1.494
dating+1.473
lier+1.424
Neg logit contributions
Vs-1.978
 Minutes-1.823
inki-1.797
�醒-1.785
OUS-1.699
Token hydrogen
Token ID17669
Feature activation-1.99e+00

Pos logit contributions
seeing+2.534
irit+2.331
dating+2.325
arers+2.315
place+2.252
Neg logit contributions
Vs-2.494
�醒-2.276
inki-2.260
 Minutes-2.246
rador-2.141
Token balloons
Token ID35485
Feature activation+8.76e-02

Pos logit contributions
Vs+9.061
 Minutes+7.748
OUS+7.385
OME+7.275
inki+7.245
Neg logit contributions
seeing-13.716
arers-12.934
irit-12.892
raint-12.834
rising-12.421
Token carrying
Token ID6872
Feature activation-2.22e-01

Pos logit contributions
seeing+0.542
irit+0.499
arers+0.498
dating+0.495
rising+0.477
Neg logit contributions
Vs-0.548
 Minutes-0.501
inki-0.489
�醒-0.486
OUS-0.464
Token bundles
Token ID36344
Feature activation-1.68e-01

Pos logit contributions
Vs+1.348
 Minutes+1.230
inki+1.200
�醒+1.163
OUS+1.135
Neg logit contributions
seeing-1.413
irit-1.311
arers-1.298
dating-1.284
effects-1.245
Token of
Token ID286
Feature activation+3.97e-02

Pos logit contributions
Vs+1.062
 Minutes+0.980
inki+0.956
OUS+0.914
�醒+0.912
Neg logit contributions
seeing-1.006
irit-0.943
arers-0.920
dating-0.909
effects-0.896
Token B
Token ID347
Feature activation+2.05e-01

Pos logit contributions
seeing+0.250
irit+0.232
arers+0.230
dating+0.228
place+0.222
Neg logit contributions
Vs-0.242
 Minutes-0.220
inki-0.217
�醒-0.214
OUS-0.206
Token actions
Token ID4028
Feature activation-2.48e-01

Pos logit contributions
Vs+3.314
 Minutes+3.076
inki+2.926
OUS+2.792
�醒+2.749
Neg logit contributions
seeing-3.344
irit-3.128
arers-3.088
dating-3.041
effects-2.951
Token to
Token ID284
Feature activation-2.98e-01

Pos logit contributions
Vs+1.623
 Minutes+1.490
inki+1.457
�醒+1.423
OUS+1.391
Neg logit contributions
seeing-1.459
irit-1.346
arers-1.330
dating-1.315
effects-1.271
Token take
Token ID1011
Feature activation+3.54e-01

Pos logit contributions
Vs+1.967
 Minutes+1.811
inki+1.761
�醒+1.699
OUS+1.681
Neg logit contributions
seeing-1.737
irit-1.604
arers-1.584
dating-1.565
effects-1.515
Token,
Token ID11
Feature activation-4.03e-01

Pos logit contributions
seeing+1.939
irit+1.740
arers+1.734
dating+1.725
place+1.692
Neg logit contributions
Vs-2.448
 Minutes-2.232
inki-2.229
�醒-2.211
OUS-2.126
Token and
Token ID290
Feature activation-9.15e-01

Pos logit contributions
Vs+2.484
 Minutes+2.284
inki+2.225
�醒+2.099
OUS+2.095
Neg logit contributions
seeing-2.505
irit-2.323
arers-2.303
dating-2.271
effects-2.208
Token answer
Token ID3280
Feature activation-1.98e+00

Pos logit contributions
Vs+5.312
 Minutes+4.965
inki+4.797
OUS+4.485
USS+4.430
Neg logit contributions
seeing-5.733
irit-5.316
arers-5.249
dating-5.243
raint-5.126
Token your
Token ID534
Feature activation-9.96e-01

Pos logit contributions
Vs+9.172
 Minutes+8.652
 occasionally+8.117
inki+7.877
OUS+7.583
Neg logit contributions
seeing-12.743
irit-12.082
raint-11.999
arers-11.750
dating-11.599
Token biggest
Token ID4094
Feature activation-1.58e+00

Pos logit contributions
Vs+4.701
 Minutes+4.242
inki+3.931
OUS+3.715
USS+3.578
Neg logit contributions
seeing-7.360
irit-7.031
arers-6.841
raint-6.789
dating-6.729
Token market
Token ID1910
Feature activation+1.45e-01

Pos logit contributions
Vs+6.715
 Minutes+6.304
inki+5.618
OUS+5.280
 bunker+5.250
Neg logit contributions
seeing-11.342
irit-11.068
arers-10.888
raint-10.741
dating-10.623
Token questions
Token ID2683
Feature activation-9.04e-01

Pos logit contributions
seeing+1.056
irit+0.989
arers+0.974
dating+0.963
place+0.958
Neg logit contributions
Vs-0.757
 Minutes-0.672
inki-0.662
�醒-0.642
rador-0.615
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
Vs+5.052
 Minutes+4.723
inki+4.435
OUS+4.234
rador+4.191
Neg logit contributions
seeing-5.856
irit-5.546
arers-5.484
dating-5.424
raint-5.304
TokenSS
Token ID5432
Feature activation-8.42e-01

Pos logit contributions
Vs+0.934
 Minutes+0.850
inki+0.821
�醒+0.792
OUS+0.770
Neg logit contributions
seeing-1.236
irit-1.155
arers-1.143
dating-1.136
effects-1.104
TokenN
Token ID45
Feature activation-3.92e-01

Pos logit contributions
Vs+3.510
 Minutes+2.878
inki+2.711
USS+2.693
OUS+2.597
Neg logit contributions
seeing-6.924
irit-6.570
arers-6.545
dating-6.474
effects-6.402
Token),
Token ID828
Feature activation-5.66e-01

Pos logit contributions
Vs+2.269
 Minutes+2.058
inki+1.994
�醒+1.917
OUS+1.890
Neg logit contributions
seeing-2.608
irit-2.436
arers-2.412
dating-2.385
effects-2.319
Token plus
Token ID5556
Feature activation+6.87e-02

Pos logit contributions
Vs+3.310
 Minutes+2.994
inki+2.900
OUS+2.756
�醒+2.740
Neg logit contributions
seeing-3.709
irit-3.481
arers-3.436
dating-3.388
raint-3.305
Token some
Token ID617
Feature activation-4.57e-01

Pos logit contributions
seeing+0.445
irit+0.411
arers+0.407
dating+0.403
lier+0.393
Neg logit contributions
Vs-0.407
 Minutes-0.375
inki-0.367
�醒-0.364
rador-0.345
Token diesel
Token ID21866
Feature activation-1.97e+00

Pos logit contributions
Vs+2.689
 Minutes+2.441
inki+2.361
�醒+2.250
OUS+2.246
Neg logit contributions
seeing-2.991
irit-2.794
arers-2.764
dating-2.735
effects-2.659
Token submarines
Token ID34031
Feature activation-6.92e-01

Pos logit contributions
Vs+10.427
inki+8.715
 Minutes+8.672
USS+8.655
 bunker+8.292
Neg logit contributions
seeing-12.029
irit-11.249
arers-11.199
dating-10.988
raint-10.973
Token.
Token ID13
Feature activation-4.59e-01

Pos logit contributions
Vs+4.124
 Minutes+3.739
inki+3.612
OUS+3.436
USS+3.427
Neg logit contributions
seeing-4.428
irit-4.135
arers-4.076
dating-4.037
effects-3.947
Token Russia
Token ID3284
Feature activation+2.77e-01

Pos logit contributions
Vs+2.387
 Minutes+2.150
inki+2.055
�醒+1.954
OUS+1.942
Neg logit contributions
seeing-3.312
irit-3.120
arers-3.086
dating-3.059
effects-2.976
Token announced
Token ID3414
Feature activation-2.05e-01

Pos logit contributions
seeing+1.764
irit+1.650
arers+1.634
dating+1.603
place+1.564
Neg logit contributions
Vs-1.625
 Minutes-1.509
�醒-1.487
inki-1.465
rador-1.417
Token that
Token ID326
Feature activation-1.01e-01

Pos logit contributions
Vs+1.283
 Minutes+1.174
inki+1.141
�醒+1.103
OUS+1.086
Neg logit contributions
seeing-1.266
irit-1.175
arers-1.162
dating-1.148
effects-1.113
Token,
Token ID11
Feature activation-4.98e-01

Pos logit contributions
Vs+1.367
 Minutes+1.240
inki+1.201
�醒+1.156
OUS+1.138
Neg logit contributions
seeing-1.606
irit-1.500
arers-1.484
dating-1.469
effects-1.429
Token7
Token ID22
Feature activation-2.22e-01

Pos logit contributions
Vs+2.977
 Minutes+2.731
inki+2.639
OUS+2.502
�醒+2.476
Neg logit contributions
seeing-3.168
irit-2.947
arers-2.934
dating-2.860
raint-2.799
Token88
Token ID3459
Feature activation-6.46e-02

Pos logit contributions
Vs+1.512
 Minutes+1.395
inki+1.362
�醒+1.321
OUS+1.300
Neg logit contributions
seeing-1.246
irit-1.147
arers-1.133
dating-1.119
effects-1.080
Token litres
Token ID49622
Feature activation-4.04e-02

Pos logit contributions
Vs+0.358
 Minutes+0.324
inki+0.315
�醒+0.305
OUS+0.296
Neg logit contributions
seeing-0.444
irit-0.417
arers-0.413
dating-0.407
raint-0.397
Token of
Token ID286
Feature activation-5.12e-01

Pos logit contributions
Vs+0.275
 Minutes+0.253
inki+0.247
�醒+0.240
OUS+0.236
Neg logit contributions
seeing-0.229
irit-0.211
arers-0.208
dating-0.205
raint-0.198
Token diesel
Token ID21866
Feature activation-1.97e+00

Pos logit contributions
Vs+2.686
 Minutes+2.419
inki+2.335
�醒+2.192
OUS+2.186
Neg logit contributions
seeing-3.665
irit-3.415
dating-3.383
arers-3.381
effects-3.294
Token in
Token ID287
Feature activation-3.55e-01

Pos logit contributions
Vs+9.975
 Minutes+8.712
inki+8.466
OUS+8.355
XY+7.998
Neg logit contributions
seeing-12.216
dating-11.310
arers-11.310
irit-11.029
accompan-10.842
Token a
Token ID257
Feature activation-7.36e-01

Pos logit contributions
Vs+2.118
 Minutes+1.932
inki+1.868
�醒+1.794
OUS+1.775
Neg logit contributions
seeing-2.295
irit-2.134
arers-2.106
dating-2.097
effects-2.030
Token year
Token ID614
Feature activation-8.26e-02

Pos logit contributions
Vs+4.317
 Minutes+3.934
inki+3.785
�醒+3.637
OUS+3.619
Neg logit contributions
seeing-4.709
irit-4.405
dating-4.359
arers-4.315
effects-4.223
Token.
Token ID13
Feature activation-7.34e-02

Pos logit contributions
Vs+0.518
 Minutes+0.475
inki+0.462
�醒+0.447
OUS+0.439
Neg logit contributions
seeing-0.509
irit-0.472
arers-0.467
dating-0.461
effects-0.447
Token The
Token ID383
Feature activation-8.50e-02

Pos logit contributions
Vs+0.333
 Minutes+0.293
inki+0.285
�醒+0.270
OUS+0.264
Neg logit contributions
seeing-0.580
irit-0.546
arers-0.542
dating-0.536
effects-0.525
Token-
Token ID12
Feature activation+3.25e-01

Pos logit contributions
Vs+2.263
 Minutes+2.067
inki+2.005
OUS+1.901
�醒+1.897
Neg logit contributions
seeing-2.601
irit-2.420
arers-2.383
dating-2.367
effects-2.318
Tokengovern
Token ID47866
Feature activation+2.56e-01

Pos logit contributions
seeing+1.855
irit+1.674
dating+1.673
arers+1.673
rising+1.651
Neg logit contributions
Vs-2.184
 Minutes-2.033
�醒-1.979
inki-1.946
OUS-1.905
Tokened
Token ID276
Feature activation+4.65e-01

Pos logit contributions
seeing+1.102
irit+0.978
arers+0.975
dating+0.955
raint+0.917
Neg logit contributions
Vs-2.100
 Minutes-1.951
inki-1.917
�醒-1.895
USS-1.838
Token organisation
Token ID12684
Feature activation+2.06e-01

Pos logit contributions
seeing+2.216
irit+2.059
arers+2.022
dating+1.982
lier+1.930
Neg logit contributions
Vs-3.481
 Minutes-3.172
�醒-3.151
inki-3.144
USS-3.047
Token "
Token ID366
Feature activation-7.66e-01

Pos logit contributions
seeing+1.332
arers+1.250
irit+1.245
dating+1.220
effects+1.190
Neg logit contributions
Vs-1.206
 Minutes-1.089
inki-1.075
�醒-1.066
OUS-1.018
Tokendel
Token ID12381
Feature activation-1.96e+00

Pos logit contributions
Vs+5.574
 Minutes+5.054
inki+4.979
USS+4.784
OUS+4.765
Neg logit contributions
seeing-3.829
irit-3.590
arers-3.501
dating-3.397
raint-3.325
Tokenivers
Token ID1191
Feature activation+1.36e-01

Pos logit contributions
Vs+11.035
 Minutes+9.780
inki+9.689
rador+8.894
asionally+8.852
Neg logit contributions
seeing-11.250
effects-10.224
dating-10.192
irit-10.066
raint-10.046
Token emergency
Token ID6334
Feature activation+1.23e-01

Pos logit contributions
seeing+0.818
irit+0.763
arers+0.756
dating+0.743
place+0.727
Neg logit contributions
Vs-0.870
 Minutes-0.796
inki-0.776
�醒-0.766
rador-0.738
Token aid
Token ID6133
Feature activation-6.72e-01

Pos logit contributions
seeing+0.754
irit+0.699
arers+0.697
dating+0.685
place+0.667
Neg logit contributions
Vs-0.772
 Minutes-0.707
inki-0.690
�醒-0.678
rador-0.656
Token to
Token ID284
Feature activation-5.55e-01

Pos logit contributions
Vs+3.957
 Minutes+3.591
inki+3.470
OUS+3.281
rador+3.276
Neg logit contributions
seeing-4.320
irit-4.055
arers-3.991
dating-3.967
raint-3.856
Token people
Token ID661
Feature activation-5.73e-01

Pos logit contributions
Vs+3.278
 Minutes+2.981
inki+2.889
OUS+2.760
�醒+2.733
Neg logit contributions
seeing-3.599
irit-3.342
arers-3.301
dating-3.288
effects-3.182
Tokenny
Token ID3281
Feature activation+4.97e-01

Pos logit contributions
seeing+1.382
irit+1.290
arers+1.248
dating+1.222
place+1.211
Neg logit contributions
Vs-1.644
 Minutes-1.486
inki-1.458
�醒-1.431
OUS-1.415
Token Gray
Token ID12723
Feature activation+9.20e-03

Pos logit contributions
seeing+2.859
irit+2.610
arers+2.598
dating+2.555
effects+2.503
Neg logit contributions
Vs-3.291
 Minutes-2.972
inki-2.948
�醒-2.891
USS-2.855
Token are
Token ID389
Feature activation-2.02e-01

Pos logit contributions
seeing+0.054
irit+0.050
arers+0.049
dating+0.048
place+0.047
Neg logit contributions
Vs-0.061
 Minutes-0.056
inki-0.054
�醒-0.054
OUS-0.052
Token in
Token ID287
Feature activation+1.18e-01

Pos logit contributions
Vs+1.239
 Minutes+1.132
inki+1.106
�醒+1.071
OUS+1.047
Neg logit contributions
seeing-1.277
irit-1.186
arers-1.182
dating-1.163
rising-1.126
Token the
Token ID262
Feature activation-7.60e-01

Pos logit contributions
seeing+0.723
irit+0.671
arers+0.669
dating+0.663
place+0.648
Neg logit contributions
Vs-0.725
 Minutes-0.664
inki-0.657
�醒-0.634
rador-0.624
Token engine
Token ID3113
Feature activation-1.96e+00

Pos logit contributions
Vs+4.541
 Minutes+4.137
inki+3.949
OUS+3.771
�醒+3.769
Neg logit contributions
seeing-4.884
irit-4.573
arers-4.446
dating-4.435
effects-4.301
Token room
Token ID2119
Feature activation-5.65e-01

Pos logit contributions
Vs+8.550
 Minutes+7.659
 bunker+7.174
inki+7.120
OUS+6.904
Neg logit contributions
seeing-13.453
irit-13.104
arers-12.705
dating-12.653
effects-12.232
Token.
Token ID13
Feature activation-3.35e-01

Pos logit contributions
Vs+3.377
 Minutes+3.077
inki+2.960
�醒+2.854
OUS+2.829
Neg logit contributions
seeing-3.597
irit-3.389
arers-3.330
dating-3.287
effects-3.201
Token
Token ID198
Feature activation+2.18e-01

Pos logit contributions
Vs+1.296
 Minutes+1.162
inki+1.044
�醒+1.020
OUS+0.961
Neg logit contributions
seeing-2.873
irit-2.729
arers-2.688
dating-2.679
raint-2.622
Token
Token ID198
Feature activation+4.77e-02

Pos logit contributions
seeing+1.681
dating+1.542
irit+1.536
place+1.535
arers+1.525
Neg logit contributions
Vs-1.024
�醒-0.935
inki-0.917
 Minutes-0.908
rador-0.855
TokenRob
Token ID14350
Feature activation+5.54e-02

Pos logit contributions
seeing+0.340
irit+0.315
dating+0.313
arers+0.312
effects+0.305
Neg logit contributions
Vs-0.248
 Minutes-0.229
inki-0.223
�醒-0.215
rador-0.210
Token simple
Token ID2829
Feature activation-5.84e-03

Pos logit contributions
Vs+0.751
 Minutes+0.683
inki+0.664
�醒+0.650
OUS+0.630
Neg logit contributions
seeing-0.803
irit-0.746
arers-0.738
dating-0.729
place-0.709
Token:
Token ID25
Feature activation-8.42e-02

Pos logit contributions
Vs+0.038
 Minutes+0.035
inki+0.034
�醒+0.033
rador+0.032
Neg logit contributions
seeing-0.035
irit-0.032
dating-0.032
arers-0.032
effects-0.031
Token
Token ID198
Feature activation-5.18e-02

Pos logit contributions
Vs+0.457
 Minutes+0.409
inki+0.401
�醒+0.387
OUS+0.376
Neg logit contributions
seeing-0.591
irit-0.550
arers-0.549
dating-0.543
place-0.530
Token
Token ID198
Feature activation-1.63e-01

Pos logit contributions
Vs+0.229
 Minutes+0.202
inki+0.197
�醒+0.193
rador+0.181
Neg logit contributions
seeing-0.417
irit-0.389
arers-0.386
dating-0.386
place-0.379
Tokenfunction
Token ID8818
Feature activation+1.72e-01

Pos logit contributions
Vs+1.012
 Minutes+0.924
inki+0.899
�醒+0.867
OUS+0.855
Neg logit contributions
seeing-1.016
irit-0.945
arers-0.934
dating-0.923
effects-0.895
Token ob
Token ID909
Feature activation-1.95e+00

Pos logit contributions
seeing+1.035
arers+0.953
irit+0.950
dating+0.938
place+0.924
Neg logit contributions
Vs-1.093
 Minutes-0.990
inki-0.985
�醒-0.956
rador-0.928
Tokenb
Token ID65
Feature activation-1.46e+00

Pos logit contributions
Vs+10.816
XY+9.198
rador+9.117
inki+9.084
 Minutes+8.998
Neg logit contributions
seeing-10.643
irit-10.068
arers-9.813
effects-9.749
dating-9.510
Token_
Token ID62
Feature activation-1.41e-01

Pos logit contributions
Vs+8.673
 Minutes+7.738
inki+7.408
OUS+7.275
USS+7.119
Neg logit contributions
seeing-8.578
irit-8.107
arers-7.867
raint-7.697
dating-7.541
Tokensub
Token ID7266
Feature activation-1.48e+00

Pos logit contributions
Vs+1.055
 Minutes+0.978
inki+0.956
�醒+0.927
OUS+0.918
Neg logit contributions
seeing-0.702
irit-0.642
arers-0.632
dating-0.621
raint-0.597
Tokendiv
Token ID7146
Feature activation-1.10e+00

Pos logit contributions
Vs+10.136
 Minutes+9.335
inki+9.123
rador+9.094
asionally+8.781
Neg logit contributions
seeing-6.874
irit-6.467
arers-6.363
dating-5.881
raint-5.701
Tokenide
Token ID485
Feature activation-1.70e+00

Pos logit contributions
Vs+6.750
 Minutes+6.123
inki+5.876
OUS+5.651
USS+5.548
Neg logit contributions
seeing-6.743
irit-6.111
arers-6.057
dating-6.002
effects-5.857
Token video
Token ID2008
Feature activation-9.40e-02

Pos logit contributions
seeing+0.148
irit+0.139
arers+0.137
dating+0.137
rising+0.133
Neg logit contributions
Vs-0.115
 Minutes-0.103
inki-0.100
�醒-0.097
rador-0.095
Token with
Token ID351
Feature activation-1.80e-01

Pos logit contributions
Vs+0.583
 Minutes+0.532
inki+0.518
�醒+0.502
OUS+0.492
Neg logit contributions
seeing-0.587
irit-0.546
arers-0.539
dating-0.534
effects-0.516
Token #
Token ID1303
Feature activation+6.47e-01

Pos logit contributions
Vs+1.041
 Minutes+0.947
inki+0.917
�醒+0.879
OUS+0.868
Neg logit contributions
seeing-1.193
irit-1.114
arers-1.103
dating-1.089
effects-1.059
TokenAsk
Token ID25214
Feature activation-8.65e-01

Pos logit contributions
seeing+4.969
dating+4.711
irit+4.621
arers+4.597
place+4.563
Neg logit contributions
Vs-2.775
 Minutes-2.570
�醒-2.539
inki-2.476
rador-2.363
TokenThe
Token ID464
Feature activation-1.07e+00

Pos logit contributions
Vs+3.576
 Minutes+2.943
inki+2.755
USS+2.641
OUS+2.595
Neg logit contributions
seeing-7.157
irit-6.830
arers-6.769
dating-6.609
raint-6.591
TokenD
Token ID35
Feature activation-1.93e+00

Pos logit contributions
Vs+5.230
 Minutes+4.403
inki+4.166
USS+3.992
OUS+3.945
Neg logit contributions
seeing-7.999
irit-7.616
arers-7.369
raint-7.249
lier-7.134
Tokenubs
Token ID23161
Feature activation+2.58e-01

Pos logit contributions
Vs+12.215
inki+10.514
 Minutes+10.351
OUS+9.807
rador+9.717
Neg logit contributions
seeing-10.754
accompan-9.580
irit-9.500
raint-9.390
 painter-9.262
Token,
Token ID11
Feature activation+1.19e-01

Pos logit contributions
seeing+1.563
irit+1.459
dating+1.436
arers+1.427
rising+1.388
Neg logit contributions
Vs-1.615
 Minutes-1.482
inki-1.456
�醒-1.426
rador-1.381
Token then
Token ID788
Feature activation+2.30e-01

Pos logit contributions
seeing+0.744
irit+0.688
arers+0.687
dating+0.684
place+0.660
Neg logit contributions
Vs-0.731
 Minutes-0.662
inki-0.646
�醒-0.641
OUS-0.621
Token tune
Token ID14009
Feature activation-6.78e-02

Pos logit contributions
seeing+1.376
arers+1.266
irit+1.262
dating+1.258
place+1.226
Neg logit contributions
Vs-1.469
 Minutes-1.323
inki-1.314
�醒-1.306
rador-1.260
Token in
Token ID287
Feature activation+1.94e-01

Pos logit contributions
Vs+0.415
 Minutes+0.379
inki+0.369
�醒+0.358
OUS+0.350
Neg logit contributions
seeing-0.428
irit-0.398
arers-0.394
dating-0.391
effects-0.377
Tokenulf
Token ID4754
Feature activation+1.05e-02

Pos logit contributions
Vs+5.237
 Minutes+4.600
inki+4.503
OUS+4.260
rador+4.228
Neg logit contributions
seeing-7.167
arers-6.674
dating-6.624
irit-6.622
effects-6.444
Tokenur
Token ID333
Feature activation-2.92e-01

Pos logit contributions
seeing+0.050
irit+0.047
arers+0.046
dating+0.045
place+0.043
Neg logit contributions
Vs-0.081
 Minutes-0.075
inki-0.073
�醒-0.072
OUS-0.071
Token fuel
Token ID5252
Feature activation-1.11e+00

Pos logit contributions
Vs+1.827
 Minutes+1.670
inki+1.623
�醒+1.566
OUS+1.547
Neg logit contributions
seeing-1.811
irit-1.680
arers-1.663
dating-1.642
effects-1.593
Token oil
Token ID3056
Feature activation-2.93e-01

Pos logit contributions
Vs+5.593
 Minutes+4.850
inki+4.652
OUS+4.533
rador+4.400
Neg logit contributions
seeing-7.878
irit-7.328
arers-7.324
dating-7.217
raint-7.014
Token and
Token ID290
Feature activation-3.55e-01

Pos logit contributions
Vs+1.819
 Minutes+1.658
inki+1.611
�醒+1.551
OUS+1.535
Neg logit contributions
seeing-1.834
irit-1.703
arers-1.685
dating-1.664
effects-1.616
Token diesel
Token ID21866
Feature activation-1.93e+00

Pos logit contributions
Vs+2.127
 Minutes+1.935
inki+1.876
�醒+1.797
OUS+1.784
Neg logit contributions
seeing-2.287
irit-2.127
arers-2.106
dating-2.082
effects-2.024
Token at
Token ID379
Feature activation-7.19e-01

Pos logit contributions
Vs+10.176
 Minutes+8.652
inki+8.349
OUS+8.249
 bunker+8.011
Neg logit contributions
seeing-12.058
arers-11.218
irit-11.056
dating-10.804
effects-10.664
Token current
Token ID1459
Feature activation-7.07e-01

Pos logit contributions
Vs+4.260
 Minutes+3.857
inki+3.709
�醒+3.559
OUS+3.545
Neg logit contributions
seeing-4.654
irit-4.306
arers-4.267
dating-4.241
effects-4.113
Token prices
Token ID4536
Feature activation+5.33e-01

Pos logit contributions
Vs+4.153
 Minutes+3.782
inki+3.637
OUS+3.493
�醒+3.459
Neg logit contributions
seeing-4.587
irit-4.275
arers-4.233
dating-4.163
effects-4.062
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
seeing+3.068
irit+2.813
dating+2.777
arers+2.772
place+2.725
Neg logit contributions
Vs-3.423
 Minutes-3.191
inki-3.140
�醒-3.055
rador-2.982
Token
Token ID198
Feature activation+1.84e-01

Pos logit contributions
Vs+0.731
 Minutes+0.651
inki+0.647
�醒+0.621
rador+0.605
Neg logit contributions
seeing-0.991
irit-0.926
arers-0.917
dating-0.908
effects-0.888
Tokenak
Token ID461
Feature activation-1.61e+00

Pos logit contributions
Vs+0.375
 Minutes+0.347
inki+0.337
�醒+0.335
rador+0.323
Neg logit contributions
seeing-0.337
irit-0.311
arers-0.307
dating-0.304
rising-0.295
Token/
Token ID14
Feature activation-4.44e-01

Pos logit contributions
Vs+8.769
 Minutes+7.708
inki+7.091
XY+7.016
OUS+6.973
Neg logit contributions
seeing-10.500
arers-9.446
effects-9.326
irit-9.291
dating-9.258
Tokenj
Token ID73
Feature activation-5.65e-02

Pos logit contributions
Vs+2.988
 Minutes+2.745
inki+2.677
�醒+2.584
OUS+2.558
Neg logit contributions
seeing-2.534
irit-2.340
arers-2.312
dating-2.282
effects-2.205
Tokent
Token ID83
Feature activation-7.34e-01

Pos logit contributions
Vs+0.378
 Minutes+0.351
inki+0.340
�醒+0.334
OUS+0.325
Neg logit contributions
seeing-0.323
irit-0.299
arers-0.296
dating-0.291
rising-0.282
Token/
Token ID14
Feature activation-5.38e-01

Pos logit contributions
Vs+3.317
 Minutes+2.893
inki+2.750
�醒+2.576
OUS+2.575
Neg logit contributions
seeing-5.789
irit-5.432
arers-5.412
dating-5.369
effects-5.275
Tokens
Token ID82
Feature activation-1.91e+00

Pos logit contributions
Vs+3.713
 Minutes+3.398
inki+3.324
�醒+3.191
OUS+3.179
Neg logit contributions
seeing-2.976
irit-2.743
arers-2.710
dating-2.674
raint-2.583
Tokengb
Token ID22296
Feature activation-1.17e+00

Pos logit contributions
Vs+11.599
 Minutes+10.156
inki+10.031
USS+9.486
OUS+9.482
Neg logit contributions
seeing-10.667
dating-9.768
arers-9.669
irit-9.594
hemat-9.406
Token (
Token ID357
Feature activation-4.99e-01

Pos logit contributions
Vs+5.486
 Minutes+4.739
inki+4.540
OUS+4.293
USS+4.276
Neg logit contributions
seeing-8.733
arers-8.090
irit-8.083
dating-8.073
raint-8.018
Tokend
Token ID67
Feature activation-1.30e+00

Pos logit contributions
Vs+3.643
 Minutes+3.365
inki+3.286
�醒+3.177
OUS+3.158
Neg logit contributions
seeing-2.567
irit-2.354
arers-2.318
dating-2.281
raint-2.197
Tokenpa
Token ID8957
Feature activation-3.14e-01

Pos logit contributions
Vs+4.454
inki+3.626
 Minutes+3.538
OUS+3.118
OME+2.998
Neg logit contributions
seeing-11.356
irit-10.541
effects-10.507
arers-10.456
dating-10.448
Token,
Token ID11
Feature activation-7.96e-01

Pos logit contributions
Vs+2.302
 Minutes+2.130
inki+2.075
�醒+2.018
OUS+1.998
Neg logit contributions
seeing-1.607
irit-1.453
arers-1.441
dating-1.419
effects-1.369
Token
Token ID447
Feature activation+2.94e-01

Pos logit contributions
Vs+0.644
 Minutes+0.589
inki+0.577
�醒+0.554
OUS+0.549
Neg logit contributions
seeing-0.574
irit-0.536
arers-0.531
dating-0.526
effects-0.510
Token
Token ID247
Feature activation-1.68e-01

Pos logit contributions
seeing+1.537
irit+1.426
arers+1.385
dating+1.341
place+1.325
Neg logit contributions
Vs-2.079
 Minutes-1.940
inki-1.894
OUS-1.812
�醒-1.806
Tokens
Token ID82
Feature activation-6.49e-01

Pos logit contributions
Vs+0.947
 Minutes+0.856
inki+0.830
�醒+0.797
OUS+0.785
Neg logit contributions
seeing-1.142
irit-1.069
arers-1.058
dating-1.047
effects-1.018
Token oil
Token ID3056
Feature activation-1.72e+00

Pos logit contributions
Vs+3.863
 Minutes+3.518
inki+3.393
OUS+3.245
USS+3.188
Neg logit contributions
seeing-4.164
irit-3.889
arers-3.861
dating-3.841
effects-3.721
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
Vs+9.118
 Minutes+8.037
inki+7.825
OUS+7.394
USS+7.366
Neg logit contributions
seeing-10.595
arers-10.283
irit-10.055
dating-10.014
effects-9.910
Token gas
Token ID3623
Feature activation-1.91e+00

Pos logit contributions
Vs+4.889
 Minutes+4.213
inki+4.053
USS+3.737
OUS+3.586
Neg logit contributions
seeing-9.467
arers-8.884
irit-8.863
dating-8.813
effects-8.669
Token.
Token ID13
Feature activation-6.19e-01

Pos logit contributions
Vs+9.772
 Minutes+8.519
inki+8.281
OUS+7.911
USS+7.852
Neg logit contributions
seeing-11.706
arers-11.279
irit-11.241
dating-11.060
effects-10.905
Token
Token ID198
Feature activation-1.85e-01

Pos logit contributions
Vs+2.720
 Minutes+2.475
inki+2.213
OUS+2.139
�醒+2.124
Neg logit contributions
seeing-4.892
irit-4.672
arers-4.628
dating-4.578
effects-4.463
Token
Token ID198
Feature activation-3.20e-01

Pos logit contributions
Vs+0.797
 Minutes+0.700
inki+0.677
�醒+0.650
rador+0.622
Neg logit contributions
seeing-1.505
irit-1.416
arers-1.405
dating-1.397
effects-1.366
TokenWhile
Token ID3633
Feature activation-1.10e-01

Pos logit contributions
Vs+1.629
 Minutes+1.448
inki+1.394
�醒+1.331
OUS+1.319
Neg logit contributions
seeing-2.354
irit-2.220
arers-2.201
dating-2.174
raint-2.120
Token it
Token ID340
Feature activation+6.73e-02

Pos logit contributions
Vs+0.646
 Minutes+0.585
inki+0.572
�醒+0.554
OUS+0.539
Neg logit contributions
seeing-0.728
irit-0.677
arers-0.669
dating-0.663
effects-0.644
Tokenolan
Token ID16617
Feature activation+6.72e-02

Pos logit contributions
Vs+7.579
inki+6.973
 Minutes+6.800
rador+6.501
OUS+6.479
Neg logit contributions
seeing-5.874
dating-5.264
arers-5.166
irit-5.129
effects-4.970
Token 41
Token ID6073
Feature activation-3.10e-01

Pos logit contributions
seeing+0.468
irit+0.436
arers+0.431
dating+0.426
place+0.416
Neg logit contributions
Vs-0.368
 Minutes-0.331
inki-0.323
�醒-0.314
rador-0.304
Token),
Token ID828
Feature activation-2.26e-01

Pos logit contributions
Vs+1.872
 Minutes+1.717
inki+1.645
�醒+1.583
OUS+1.571
Neg logit contributions
seeing-1.969
irit-1.848
arers-1.818
dating-1.811
effects-1.758
Token Daly
Token ID43692
Feature activation-3.66e-01

Pos logit contributions
Vs+1.417
 Minutes+1.302
inki+1.257
�醒+1.211
OUS+1.198
Neg logit contributions
seeing-1.391
irit-1.293
arers-1.277
dating-1.263
effects-1.223
Token (
Token ID357
Feature activation+2.03e-01

Pos logit contributions
Vs+2.564
 Minutes+2.387
inki+2.317
�醒+2.225
OUS+2.211
Neg logit contributions
seeing-1.987
irit-1.826
arers-1.787
dating-1.786
raint-1.717
TokenEl
Token ID9527
Feature activation-1.89e+00

Pos logit contributions
seeing+1.427
irit+1.320
dating+1.310
arers+1.306
effects+1.278
Neg logit contributions
Vs-1.076
 Minutes-0.989
inki-0.963
�醒-0.935
rador-0.909
Tokenworthy
Token ID18275
Feature activation-4.92e-02

Pos logit contributions
Vs+12.238
 Minutes+11.813
inki+11.463
rador+11.076
OUS+10.446
Neg logit contributions
seeing-8.550
dating-8.030
arers-7.860
irit-7.731
effects-7.656
Token 54
Token ID7175
Feature activation-3.85e-01

Pos logit contributions
Vs+0.270
 Minutes+0.243
inki+0.237
�醒+0.230
OUS+0.223
Neg logit contributions
seeing-0.343
irit-0.319
arers-0.316
dating-0.313
effects-0.305
Token),
Token ID828
Feature activation-2.65e-01

Pos logit contributions
Vs+2.276
 Minutes+2.099
inki+1.999
�醒+1.910
OUS+1.905
Neg logit contributions
seeing-2.490
irit-2.336
arers-2.297
dating-2.297
effects-2.232
Token Scott
Token ID4746
Feature activation+2.98e-01

Pos logit contributions
Vs+1.658
 Minutes+1.527
inki+1.468
�醒+1.413
OUS+1.401
Neg logit contributions
seeing-1.631
irit-1.518
arers-1.499
dating-1.482
raint-1.435
Token (
Token ID357
Feature activation+3.33e-02

Pos logit contributions
seeing+1.922
irit+1.787
arers+1.783
dating+1.748
raint+1.706
Neg logit contributions
Vs-1.766
 Minutes-1.592
inki-1.555
�醒-1.507
OUS-1.483
Token,
Token ID11
Feature activation-9.74e-02

Pos logit contributions
Vs+2.903
 Minutes+2.671
inki+2.597
�醒+2.520
OUS+2.490
Neg logit contributions
seeing-2.277
irit-2.098
arers-2.079
dating-2.042
effects-1.967
Token and
Token ID290
Feature activation-3.10e-01

Pos logit contributions
Vs+0.632
 Minutes+0.580
inki+0.568
�醒+0.551
OUS+0.541
Neg logit contributions
seeing-0.582
irit-0.536
arers-0.529
dating-0.526
place-0.506
Token chatting
Token ID32046
Feature activation-4.27e-01

Pos logit contributions
Vs+1.917
 Minutes+1.752
inki+1.702
�醒+1.643
OUS+1.619
Neg logit contributions
seeing-1.936
irit-1.800
arers-1.780
dating-1.759
effects-1.707
Token.
Token ID13
Feature activation-4.74e-01

Pos logit contributions
Vs+2.489
 Minutes+2.270
inki+2.189
�醒+2.103
OUS+2.075
Neg logit contributions
seeing-2.816
irit-2.627
arers-2.601
dating-2.569
effects-2.504
Token It
Token ID632
Feature activation-1.95e-01

Pos logit contributions
Vs+2.616
 Minutes+2.407
inki+2.263
�醒+2.168
OUS+2.159
Neg logit contributions
seeing-3.258
irit-3.075
arers-3.037
dating-3.001
raint-2.924
Token turns
Token ID4962
Feature activation-1.88e+00

Pos logit contributions
Vs+1.168
 Minutes+1.064
inki+1.034
�醒+0.999
OUS+0.982
Neg logit contributions
seeing-1.253
irit-1.166
arers-1.154
dating-1.140
effects-1.108
Token out
Token ID503
Feature activation-3.85e-01

Pos logit contributions
Vs+6.916
 Minutes+6.316
 occasionally+5.949
OUS+5.853
OME+5.572
Neg logit contributions
accompan-14.149
arers-13.647
seeing-13.469
raint-13.391
irit-13.389
Token there
Token ID612
Feature activation-6.60e-01

Pos logit contributions
Vs+2.498
 Minutes+2.320
inki+2.222
�醒+2.140
OUS+2.134
Neg logit contributions
seeing-2.249
irit-2.100
arers-2.075
dating-2.037
effects-1.984
Token are
Token ID389
Feature activation-9.58e-01

Pos logit contributions
Vs+3.759
 Minutes+3.427
inki+3.278
OUS+3.118
rador+3.072
Neg logit contributions
seeing-4.354
irit-4.103
arers-4.025
dating-3.987
effects-3.918
Token quite
Token ID2407
Feature activation-1.06e+00

Pos logit contributions
Vs+5.214
 Minutes+4.757
inki+4.430
OUS+4.326
OME+4.225
Neg logit contributions
seeing-6.448
irit-6.163
arers-5.983
dating-5.929
raint-5.802
Token a
Token ID257
Feature activation-1.32e+00

Pos logit contributions
Vs+5.042
 Minutes+4.551
inki+4.252
OME+4.127
OUS+4.113
Neg logit contributions
seeing-7.679
irit-7.441
dating-7.273
arers-7.189
rising-7.069
Token further
Token ID2252
Feature activation-5.20e-01

Pos logit contributions
Vs+0.183
 Minutes+0.166
inki+0.162
�醒+0.157
OUS+0.153
Neg logit contributions
seeing-0.208
irit-0.194
arers-0.192
dating-0.190
effects-0.185
Token reading
Token ID3555
Feature activation-3.93e-01

Pos logit contributions
Vs+3.708
 Minutes+3.449
inki+3.341
�醒+3.215
OUS+3.215
Neg logit contributions
seeing-2.696
irit-2.493
arers-2.477
dating-2.408
raint-2.348
Token:
Token ID25
Feature activation-3.26e-01

Pos logit contributions
Vs+2.094
 Minutes+1.894
inki+1.820
�醒+1.740
OUS+1.717
Neg logit contributions
seeing-2.784
irit-2.615
arers-2.592
dating-2.558
effects-2.492
Token
Token ID198
Feature activation-9.47e-02

Pos logit contributions
Vs+1.435
 Minutes+1.275
inki+1.206
�醒+1.144
OUS+1.123
Neg logit contributions
seeing-2.605
irit-2.470
arers-2.455
dating-2.428
effects-2.371
Token
Token ID198
Feature activation-4.11e-02

Pos logit contributions
Vs+0.414
 Minutes+0.365
inki+0.355
�醒+0.344
rador+0.325
Neg logit contributions
seeing-0.765
irit-0.718
arers-0.711
dating-0.710
place-0.695
TokenHall
Token ID34194
Feature activation-1.87e+00

Pos logit contributions
Vs+0.261
 Minutes+0.241
inki+0.234
�醒+0.227
OUS+0.222
Neg logit contributions
seeing-0.250
irit-0.231
arers-0.228
dating-0.226
effects-0.219
Tokenion
Token ID295
Feature activation-1.99e-01

Pos logit contributions
Vs+12.091
 Minutes+11.505
inki+11.421
rador+10.855
OUS+10.558
Neg logit contributions
seeing-8.777
irit-8.110
arers-8.076
dating-7.911
rising-7.709
Token,
Token ID11
Feature activation-4.66e-01

Pos logit contributions
Vs+1.394
 Minutes+1.288
inki+1.257
�醒+1.219
OUS+1.203
Neg logit contributions
seeing-1.085
irit-0.998
arers-0.984
dating-0.971
effects-0.937
Token Richard
Token ID6219
Feature activation-3.19e-02

Pos logit contributions
Vs+3.150
 Minutes+2.934
inki+2.809
OUS+2.713
�醒+2.696
Neg logit contributions
seeing-2.614
irit-2.430
arers-2.405
dating-2.356
raint-2.287
Token P
Token ID350
Feature activation+8.35e-01

Pos logit contributions
Vs+0.198
 Minutes+0.180
inki+0.176
�醒+0.170
OUS+0.167
Neg logit contributions
seeing-0.199
irit-0.185
arers-0.183
dating-0.181
effects-0.175
Token.
Token ID13
Feature activation-6.88e-02

Pos logit contributions
seeing+4.401
irit+4.153
dating+4.026
arers+4.002
raint+3.891
Neg logit contributions
Vs-5.680
 Minutes-5.245
inki-5.120
�醒-4.996
OUS-4.962
Token really
Token ID1107
Feature activation-5.94e-01

Pos logit contributions
seeing+0.780
irit+0.728
arers+0.722
dating+0.713
place+0.699
Neg logit contributions
Vs-0.730
 Minutes-0.660
inki-0.647
�醒-0.638
rador-0.609
Token well
Token ID880
Feature activation-5.43e-01

Pos logit contributions
Vs+2.893
 Minutes+2.630
inki+2.488
OUS+2.409
USS+2.323
Neg logit contributions
seeing-4.413
irit-4.138
arers-4.091
dating-4.084
raint-3.979
Token,
Token ID11
Feature activation-5.42e-01

Pos logit contributions
Vs+3.306
 Minutes+3.023
inki+2.932
OUS+2.810
rador+2.782
Neg logit contributions
seeing-3.376
irit-3.170
arers-3.130
dating-3.084
raint-2.996
Token as
Token ID355
Feature activation+1.78e-01

Pos logit contributions
Vs+3.128
 Minutes+2.852
inki+2.770
OUS+2.620
rador+2.595
Neg logit contributions
seeing-3.543
irit-3.340
arers-3.294
dating-3.249
raint-3.175
Token it
Token ID340
Feature activation-3.99e-02

Pos logit contributions
seeing+1.077
irit+0.995
arers+0.986
dating+0.972
lier+0.938
Neg logit contributions
Vs-1.138
 Minutes-1.047
inki-1.024
�醒-1.013
rador-0.971
Token turns
Token ID4962
Feature activation-1.85e+00

Pos logit contributions
Vs+0.226
 Minutes+0.206
inki+0.199
�醒+0.194
OUS+0.189
Neg logit contributions
seeing-0.270
irit-0.252
arers-0.251
dating-0.245
effects-0.240
Token out
Token ID503
Feature activation-2.69e-01

Pos logit contributions
Vs+9.184
 Minutes+8.460
 occasionally+7.493
rador+7.486
inki+7.477
Neg logit contributions
seeing-11.718
irit-11.147
dating-11.122
arers-10.935
accompan-10.661
Token.
Token ID13
Feature activation-4.41e-01

Pos logit contributions
Vs+1.656
 Minutes+1.514
inki+1.470
�醒+1.414
OUS+1.402
Neg logit contributions
seeing-1.675
irit-1.566
arers-1.545
dating-1.527
effects-1.482
Token The
Token ID383
Feature activation-1.66e-02

Pos logit contributions
Vs+2.448
 Minutes+2.231
inki+2.127
�醒+2.034
OUS+2.027
Neg logit contributions
seeing-3.030
irit-2.852
arers-2.811
dating-2.786
raint-2.712
Token Premiere
Token ID44151
Feature activation+2.74e-02

Pos logit contributions
Vs+0.104
 Minutes+0.094
inki+0.093
�醒+0.090
OUS+0.088
Neg logit contributions
seeing-0.102
irit-0.095
arers-0.095
dating-0.093
effects-0.091
Token is
Token ID318
Feature activation-2.93e-01

Pos logit contributions
seeing+0.187
irit+0.175
arers+0.173
dating+0.171
raint+0.167
Neg logit contributions
Vs-0.153
 Minutes-0.138
inki-0.135
�醒-0.132
OUS-0.127
Token lots
Token ID6041
Feature activation+2.59e-02

Pos logit contributions
seeing+0.032
irit+0.030
arers+0.030
dating+0.029
place+0.028
Neg logit contributions
Vs-0.033
 Minutes-0.030
inki-0.030
�醒-0.029
rador-0.028
Token of
Token ID286
Feature activation-2.65e-01

Pos logit contributions
seeing+0.151
irit+0.140
arers+0.139
dating+0.137
effects+0.133
Neg logit contributions
Vs-0.170
 Minutes-0.157
inki-0.152
�醒-0.148
OUS-0.146
Token money
Token ID1637
Feature activation+1.80e-01

Pos logit contributions
Vs+1.637
 Minutes+1.496
inki+1.454
�醒+1.404
OUS+1.383
Neg logit contributions
seeing-1.655
irit-1.539
arers-1.521
dating-1.504
effects-1.459
Token for
Token ID329
Feature activation+3.79e-01

Pos logit contributions
seeing+1.027
irit+0.938
arers+0.923
dating+0.917
lier+0.885
Neg logit contributions
Vs-1.222
 Minutes-1.123
inki-1.105
�醒-1.077
OUS-1.046
Token somebody
Token ID8276
Feature activation+2.14e-01

Pos logit contributions
seeing+2.275
irit+2.094
dating+2.065
arers+2.046
lier+2.004
Neg logit contributions
Vs-2.419
inki-2.199
 Minutes-2.181
�醒-2.133
rador-2.077
Token else
Token ID2073
Feature activation+1.19e+00

Pos logit contributions
seeing+1.240
irit+1.135
arers+1.121
dating+1.108
lier+1.077
Neg logit contributions
Vs-1.424
 Minutes-1.308
inki-1.287
�醒-1.260
rador-1.226
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
seeing+6.305
irit+5.554
arers+5.506
lier+5.481
dating+5.446
Neg logit contributions
Vs-7.701
 Minutes-7.192
�醒-7.135
inki-7.085
USS-6.952
Token were
Token ID547
Feature activation-2.65e-01

Pos logit contributions
seeing+1.767
arers+1.629
irit+1.625
dating+1.590
place+1.567
Neg logit contributions
Vs-1.855
 Minutes-1.687
inki-1.670
�醒-1.669
rador-1.589
Token asked
Token ID1965
Feature activation-3.16e-01

Pos logit contributions
Vs+1.629
 Minutes+1.489
inki+1.443
�醒+1.391
OUS+1.376
Neg logit contributions
seeing-1.664
irit-1.550
arers-1.531
dating-1.514
effects-1.471
Token to
Token ID284
Feature activation+2.47e-01

Pos logit contributions
Vs+1.563
 Minutes+1.401
inki+1.342
�醒+1.284
OUS+1.265
Neg logit contributions
seeing-2.362
irit-2.223
arers-2.207
dating-2.180
effects-2.138
Token be
Token ID307
Feature activation+9.62e-02

Pos logit contributions
seeing+1.364
irit+1.266
arers+1.259
dating+1.241
place+1.227
Neg logit contributions
Vs-1.678
 Minutes-1.531
�醒-1.504
inki-1.502
rador-1.438
Tokennice
Token ID44460
Feature activation-6.19e-01

Pos logit contributions
seeing+2.163
irit+1.946
dating+1.935
arers+1.924
place+1.870
Neg logit contributions
Vs-2.961
 Minutes-2.773
�醒-2.690
inki-2.673
rador-2.569
Tokenhash
Token ID17831
Feature activation+3.25e-01

Pos logit contributions
Vs+3.507
 Minutes+3.127
inki+3.100
rador+2.925
OUS+2.912
Neg logit contributions
seeing-4.015
arers-3.757
irit-3.739
dating-3.660
raint-3.614
Token.
Token ID13
Feature activation+2.61e-01

Pos logit contributions
seeing+1.997
irit+1.853
arers+1.827
dating+1.822
effects+1.762
Neg logit contributions
Vs-2.051
 Minutes-1.879
inki-1.821
�醒-1.768
OUS-1.739
Tokencom
Token ID785
Feature activation-1.02e-01

Pos logit contributions
seeing+1.271
irit+1.146
arers+1.137
dating+1.124
place+1.083
Neg logit contributions
Vs-1.965
 Minutes-1.842
�醒-1.780
inki-1.769
OUS-1.717
Token,
Token ID11
Feature activation+1.38e-01

Pos logit contributions
Vs+0.626
 Minutes+0.570
inki+0.554
�醒+0.539
OUS+0.529
Neg logit contributions
seeing-0.644
irit-0.598
arers-0.594
dating-0.589
effects-0.568
Token but
Token ID475
Feature activation+1.16e+00

Pos logit contributions
seeing+0.897
arers+0.831
dating+0.828
irit+0.824
place+0.800
Neg logit contributions
Vs-0.820
 Minutes-0.740
inki-0.718
�醒-0.717
OUS-0.689
Token will
Token ID481
Feature activation+2.22e-01

Pos logit contributions
seeing+6.333
arers+5.933
dating+5.836
place+5.639
lier+5.572
Neg logit contributions
Vs-7.346
�醒-6.800
 Minutes-6.582
inki-6.570
USS-6.256
Token be
Token ID307
Feature activation+8.36e-01

Pos logit contributions
seeing+1.433
irit+1.342
arers+1.338
dating+1.316
place+1.292
Neg logit contributions
Vs-1.315
 Minutes-1.192
inki-1.144
�醒-1.125
USS-1.088
Token disabled
Token ID10058
Feature activation-1.51e-01

Pos logit contributions
seeing+4.639
arers+4.383
irit+4.267
dating+4.222
place+4.094
Neg logit contributions
Vs-5.488
 Minutes-5.055
�醒-4.938
inki-4.881
rador-4.680
Token eventually
Token ID4191
Feature activation-5.30e-01

Pos logit contributions
Vs+0.883
 Minutes+0.801
inki+0.779
�醒+0.754
OUS+0.737
Neg logit contributions
seeing-0.994
irit-0.925
arers-0.916
dating-0.907
effects-0.881
Token.
Token ID13
Feature activation+6.36e-01

Pos logit contributions
Vs+3.102
 Minutes+2.832
inki+2.744
�醒+2.609
OUS+2.604
Neg logit contributions
seeing-3.451
irit-3.231
arers-3.182
dating-3.147
raint-3.069
Token it
Token ID340
Feature activation+5.30e-01

Pos logit contributions
seeing+0.585
irit+0.542
arers+0.536
dating+0.529
place+0.513
Neg logit contributions
Vs-0.633
 Minutes-0.582
inki-0.565
�醒-0.556
OUS-0.538
Token with
Token ID351
Feature activation+1.73e-02

Pos logit contributions
seeing+3.169
irit+2.891
arers+2.890
lier+2.815
dating+2.804
Neg logit contributions
Vs-3.432
 Minutes-3.126
inki-3.042
�醒-3.027
USS-2.900
Token a
Token ID257
Feature activation+1.12e-01

Pos logit contributions
seeing+0.106
irit+0.098
arers+0.097
dating+0.096
lier+0.093
Neg logit contributions
Vs-0.110
 Minutes-0.100
inki-0.098
�醒-0.095
OUS-0.093
Token mixture
Token ID11710
Feature activation+9.56e-01

Pos logit contributions
seeing+0.680
irit+0.628
arers+0.618
dating+0.610
place+0.598
Neg logit contributions
Vs-0.714
 Minutes-0.654
inki-0.635
�醒-0.620
OUS-0.603
Token of
Token ID286
Feature activation+5.97e-01

Pos logit contributions
seeing+4.288
irit+3.773
dating+3.692
place+3.680
lier+3.640
Neg logit contributions
Vs-7.382
�醒-6.990
 Minutes-6.835
inki-6.697
rador-6.446
Token tre
Token ID2054
Feature activation+1.27e+00

Pos logit contributions
seeing+3.393
irit+3.068
arers+3.011
dating+2.996
place+2.940
Neg logit contributions
Vs-3.958
 Minutes-3.640
�醒-3.598
inki-3.568
OUS-3.422
Tokenp
Token ID79
Feature activation+1.11e+00

Pos logit contributions
seeing+5.481
irit+4.872
place+4.687
arers+4.597
dating+4.445
Neg logit contributions
Vs-9.893
�醒-9.046
 Minutes-9.045
inki-8.841
OME-8.816
Tokenidation
Token ID24765
Feature activation+7.66e-02

Pos logit contributions
seeing+2.964
irit+2.589
arers+2.369
dating+2.239
pain+2.187
Neg logit contributions
Vs-10.364
�醒-9.800
 Minutes-9.775
OME-9.519
inki-9.453
Token and
Token ID290
Feature activation+2.21e-01

Pos logit contributions
seeing+0.449
irit+0.415
arers+0.410
dating+0.405
effects+0.391
Neg logit contributions
Vs-0.505
 Minutes-0.464
inki-0.452
�醒-0.438
OUS-0.431
Token awe
Token ID25030
Feature activation+1.07e-02

Pos logit contributions
seeing+1.251
irit+1.141
arers+1.123
dating+1.109
raint+1.079
Neg logit contributions
Vs-1.508
 Minutes-1.388
inki-1.352
�醒-1.347
OUS-1.291
Token.
Token ID13
Feature activation+5.28e-02

Pos logit contributions
seeing+0.065
irit+0.060
arers+0.059
dating+0.058
raint+0.056
Neg logit contributions
Vs-0.069
 Minutes-0.064
inki-0.062
�醒-0.060
OUS-0.059
Token tit
Token ID5259
Feature activation-2.58e-01

Pos logit contributions
Vs+0.262
 Minutes+0.237
inki+0.232
�醒+0.225
rador+0.219
Neg logit contributions
seeing-0.273
irit-0.254
arers-0.251
dating-0.249
effects-0.241
Tokenular
Token ID934
Feature activation-4.67e-01

Pos logit contributions
Vs+1.571
 Minutes+1.435
inki+1.393
�醒+1.344
OUS+1.324
Neg logit contributions
seeing-1.633
irit-1.519
arers-1.502
dating-1.486
effects-1.442
Token phrase
Token ID9546
Feature activation-2.63e-01

Pos logit contributions
Vs+2.906
 Minutes+2.675
inki+2.572
OUS+2.486
�醒+2.470
Neg logit contributions
seeing-2.876
irit-2.666
arers-2.638
dating-2.604
raint-2.541
Token was
Token ID373
Feature activation+4.05e-01

Pos logit contributions
Vs+1.535
 Minutes+1.399
inki+1.354
�醒+1.303
OUS+1.287
Neg logit contributions
seeing-1.725
irit-1.610
arers-1.593
dating-1.576
effects-1.533
Token still
Token ID991
Feature activation+5.27e-01

Pos logit contributions
seeing+2.544
arers+2.358
irit+2.356
dating+2.300
lier+2.248
Neg logit contributions
Vs-2.480
 Minutes-2.254
inki-2.216
�醒-2.162
rador-2.089
Token fl
Token ID781
Feature activation+1.65e+00

Pos logit contributions
seeing+3.063
arers+2.827
irit+2.801
dating+2.732
rising+2.673
Neg logit contributions
Vs-3.483
 Minutes-3.195
inki-3.113
�醒-3.091
rador-2.972
Tokenuttering
Token ID33598
Feature activation+1.09e+00

Pos logit contributions
arers+6.248
seeing+6.130
irit+6.104
place+5.735
rising+5.631
Neg logit contributions
Vs-12.328
�醒-12.112
 Minutes-11.244
OUS-11.114
OME-11.082
Token proudly
Token ID23017
Feature activation-1.13e-01

Pos logit contributions
seeing+5.733
place+5.291
lier+5.192
arers+5.132
irit+5.056
Neg logit contributions
Vs-7.264
�醒-6.939
inki-6.739
 Minutes-6.509
rador-6.501
Token in
Token ID287
Feature activation+1.73e-01

Pos logit contributions
Vs+0.716
 Minutes+0.655
inki+0.638
�醒+0.619
OUS+0.607
Neg logit contributions
seeing-0.689
irit-0.637
arers-0.631
dating-0.623
effects-0.603
Token the
Token ID262
Feature activation-7.29e-01

Pos logit contributions
seeing+1.059
irit+0.973
arers+0.963
dating+0.955
place+0.934
Neg logit contributions
Vs-1.087
 Minutes-0.989
inki-0.979
�醒-0.947
OUS-0.926
Token breeze
Token ID28633
Feature activation-2.91e-01

Pos logit contributions
Vs+4.233
 Minutes+3.889
inki+3.657
OUS+3.545
rador+3.507
Neg logit contributions
seeing-4.645
irit-4.420
arers-4.330
dating-4.293
raint-4.194
Token graduating
Token ID30024
Feature activation+8.48e-02

Pos logit contributions
seeing+1.547
irit+1.415
dating+1.393
arers+1.391
place+1.341
Neg logit contributions
Vs-1.767
 Minutes-1.650
�醒-1.588
inki-1.576
rador-1.529
Token with
Token ID351
Feature activation+5.99e-01

Pos logit contributions
seeing+0.501
irit+0.463
arers+0.454
dating+0.452
place+0.437
Neg logit contributions
Vs-0.555
 Minutes-0.509
inki-0.495
�醒-0.484
OUS-0.472
Token a
Token ID257
Feature activation+9.76e-02

Pos logit contributions
seeing+3.660
irit+3.341
dating+3.304
arers+3.241
place+3.181
Neg logit contributions
Vs-3.687
 Minutes-3.332
inki-3.296
�醒-3.254
rador-3.127
Token B
Token ID347
Feature activation+3.40e-01

Pos logit contributions
seeing+0.570
irit+0.529
dating+0.515
arers+0.513
place+0.501
Neg logit contributions
Vs-0.639
 Minutes-0.586
inki-0.574
�醒-0.559
rador-0.547
TokenFA
Token ID7708
Feature activation+2.49e-01

Pos logit contributions
seeing+2.071
irit+1.944
dating+1.887
arers+1.870
place+1.817
Neg logit contributions
Vs-2.056
 Minutes-1.932
inki-1.872
�醒-1.842
rador-1.807
Token in
Token ID287
Feature activation+1.14e+00

Pos logit contributions
seeing+1.452
irit+1.346
dating+1.312
arers+1.305
raint+1.261
Neg logit contributions
Vs-1.642
 Minutes-1.511
inki-1.471
�醒-1.444
rador-1.401
Token Musical
Token ID33375
Feature activation+6.85e-01

Pos logit contributions
seeing+5.936
dating+5.514
irit+5.497
pain+5.328
 painter+5.310
Neg logit contributions
Vs-7.401
�醒-6.831
inki-6.737
 Minutes-6.639
rador-6.523
Token Theater
Token ID22207
Feature activation+8.33e-01

Pos logit contributions
seeing+3.334
irit+3.130
dating+3.042
effects+2.960
raint+2.882
Neg logit contributions
Vs-4.876
inki-4.621
 Minutes-4.546
rador-4.403
�醒-4.377
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
seeing+4.565
dating+4.168
irit+4.159
raint+3.957
effects+3.951
Neg logit contributions
Vs-5.554
inki-5.129
�醒-5.043
 Minutes-5.041
rador-4.848
Token He
Token ID679
Feature activation-2.35e-02

Pos logit contributions
seeing+0.822
irit+0.752
arers+0.742
dating+0.741
place+0.723
Neg logit contributions
Vs-0.762
inki-0.690
 Minutes-0.683
�醒-0.674
rador-0.654
Token
Token ID447
Feature activation-3.70e-04

Pos logit contributions
Vs+0.135
 Minutes+0.123
inki+0.119
�醒+0.117
OUS+0.113
Neg logit contributions
seeing-0.157
irit-0.147
arers-0.145
dating-0.143
effects-0.139
Token would
Token ID561
Feature activation-3.44e-01

Pos logit contributions
Vs+0.585
 Minutes+0.536
inki+0.523
�醒+0.505
OUS+0.497
Neg logit contributions
seeing-0.568
irit-0.528
arers-0.522
dating-0.515
effects-0.500
Token not
Token ID407
Feature activation-4.46e-01

Pos logit contributions
Vs+2.164
 Minutes+1.987
inki+1.933
�醒+1.857
OUS+1.839
Neg logit contributions
seeing-2.102
irit-1.943
arers-1.913
dating-1.898
rising-1.842
Token
Token ID564
Feature activation-5.42e-01

Pos logit contributions
Vs+2.826
 Minutes+2.592
inki+2.532
�醒+2.422
OUS+2.406
Neg logit contributions
seeing-2.708
irit-2.489
arers-2.454
dating-2.434
raint-2.369
Token
Token ID250
Feature activation-3.08e-01

Pos logit contributions
Vs+2.077
 Minutes+1.760
inki+1.689
�醒+1.629
OUS+1.549
Neg logit contributions
seeing-4.630
arers-4.382
irit-4.359
dating-4.353
effects-4.245
Tokenim
Token ID320
Feature activation-1.97e-01

Pos logit contributions
Vs+2.378
 Minutes+2.205
inki+2.165
�醒+2.103
OUS+2.078
Neg logit contributions
seeing-1.444
irit-1.307
arers-1.290
dating-1.264
raint-1.218
Tokenped
Token ID9124
Feature activation+8.96e-01

Pos logit contributions
Vs+1.317
 Minutes+1.214
inki+1.180
�醒+1.144
OUS+1.129
Neg logit contributions
seeing-1.129
irit-1.044
arers-1.028
dating-1.018
effects-0.984
Tokene
Token ID68
Feature activation-1.51e-01

Pos logit contributions
irit+1.508
seeing+1.505
dating+1.165
place+1.138
lier+1.036
Neg logit contributions
Vs-9.208
 Minutes-8.761
inki-8.627
�醒-8.517
USS-8.276
Token the
Token ID262
Feature activation-3.68e-02

Pos logit contributions
Vs+0.912
 Minutes+0.834
inki+0.810
�醒+0.784
rador+0.767
Neg logit contributions
seeing-0.966
irit-0.902
arers-0.890
dating-0.879
effects-0.855
Token development
Token ID2478
Feature activation-3.28e-01

Pos logit contributions
Vs+0.217
 Minutes+0.198
inki+0.192
�醒+0.187
rador+0.183
Neg logit contributions
seeing-0.240
irit-0.225
arers-0.222
dating-0.220
effects-0.214
Token and
Token ID290
Feature activation+7.07e-01

Pos logit contributions
Vs+2.093
 Minutes+1.913
inki+1.850
�醒+1.805
OUS+1.790
Neg logit contributions
seeing-1.980
irit-1.822
arers-1.822
dating-1.780
rising-1.732
Token use
Token ID779
Feature activation+6.16e-01

Pos logit contributions
seeing+3.946
irit+3.717
arers+3.691
dating+3.592
place+3.537
Neg logit contributions
Vs-4.549
 Minutes-4.240
�醒-4.180
inki-4.113
OME-3.942
Token stay
Token ID2652
Feature activation-1.72e-02

Pos logit contributions
Vs+2.783
 Minutes+2.469
inki+2.408
�醒+2.280
OUS+2.271
Neg logit contributions
seeing-4.153
arers-3.939
irit-3.938
dating-3.888
effects-3.791
Token true
Token ID2081
Feature activation-4.39e-01

Pos logit contributions
Vs+0.113
 Minutes+0.104
inki+0.100
�醒+0.098
OUS+0.096
Neg logit contributions
seeing-0.102
irit-0.094
arers-0.093
dating-0.092
lier-0.089
Token to
Token ID284
Feature activation-1.28e-01

Pos logit contributions
Vs+1.958
 Minutes+1.718
inki+1.667
�醒+1.595
OUS+1.553
Neg logit contributions
seeing-3.436
irit-3.301
arers-3.258
dating-3.202
effects-3.152
Token the
Token ID262
Feature activation-1.89e-01

Pos logit contributions
Vs+0.731
 Minutes+0.663
inki+0.643
�醒+0.620
OUS+0.608
Neg logit contributions
seeing-0.866
irit-0.808
arers-0.800
dating-0.793
effects-0.770
Token traditional
Token ID4569
Feature activation+2.92e-01

Pos logit contributions
Vs+1.169
 Minutes+1.068
inki+1.039
�醒+1.005
OUS+0.986
Neg logit contributions
seeing-1.177
irit-1.092
arers-1.080
dating-1.069
effects-1.036
Token flavors
Token ID17361
Feature activation+2.87e-01

Pos logit contributions
seeing+2.122
dating+1.967
irit+1.963
arers+1.961
place+1.935
Neg logit contributions
Vs-1.486
 Minutes-1.333
inki-1.293
�醒-1.246
rador-1.219
Token of
Token ID286
Feature activation+1.59e-02

Pos logit contributions
seeing+1.569
irit+1.439
dating+1.398
arers+1.395
place+1.359
Neg logit contributions
Vs-1.976
 Minutes-1.844
inki-1.786
�醒-1.748
rador-1.724
Token Vietnamese
Token ID23618
Feature activation+3.77e-01

Pos logit contributions
seeing+0.121
irit+0.114
arers+0.112
dating+0.112
place+0.110
Neg logit contributions
Vs-0.077
 Minutes-0.068
inki-0.066
�醒-0.063
OUS-0.061
Token cuisine
Token ID33072
Feature activation+3.11e-01

Pos logit contributions
seeing+2.321
dating+2.141
place+2.115
arers+2.103
irit+2.098
Neg logit contributions
Vs-2.289
 Minutes-2.111
inki-2.082
�醒-2.015
rador-1.964
Token in
Token ID287
Feature activation-1.49e-01

Pos logit contributions
seeing+1.756
irit+1.589
arers+1.563
dating+1.562
place+1.519
Neg logit contributions
Vs-2.105
 Minutes-1.951
inki-1.912
�醒-1.862
rador-1.822
Token New
Token ID968
Feature activation+4.28e-01

Pos logit contributions
Vs+0.922
 Minutes+0.842
inki+0.820
�醒+0.792
OUS+0.779
Neg logit contributions
seeing-0.934
irit-0.866
arers-0.857
dating-0.849
effects-0.822
Token the
Token ID262
Feature activation+1.63e-01

Pos logit contributions
Vs+0.235
 Minutes+0.216
inki+0.209
�醒+0.203
rador+0.199
Neg logit contributions
seeing-0.228
irit-0.212
arers-0.209
dating-0.206
effects-0.201
Token pending
Token ID13310
Feature activation-4.51e-02

Pos logit contributions
seeing+0.995
irit+0.925
arers+0.908
dating+0.902
effects+0.884
Neg logit contributions
Vs-1.018
 Minutes-0.936
inki-0.908
�醒-0.897
rador-0.871
Token switch
Token ID5078
Feature activation+3.33e-01

Pos logit contributions
Vs+0.274
 Minutes+0.249
inki+0.243
�醒+0.237
rador+0.232
Neg logit contributions
seeing-0.287
irit-0.268
arers-0.263
dating-0.261
effects-0.254
Token to
Token ID284
Feature activation-2.36e-02

Pos logit contributions
seeing+2.043
irit+1.871
dating+1.853
arers+1.833
effects+1.791
Neg logit contributions
Vs-2.095
 Minutes-1.918
inki-1.866
�醒-1.810
OUS-1.769
Token 100
Token ID1802
Feature activation-2.43e-01

Pos logit contributions
Vs+0.152
 Minutes+0.139
inki+0.136
�醒+0.133
OUS+0.129
Neg logit contributions
seeing-0.141
irit-0.131
arers-0.129
dating-0.128
lier-0.124
Token%
Token ID4
Feature activation+4.75e-01

Pos logit contributions
Vs+1.543
 Minutes+1.414
inki+1.391
�醒+1.343
rador+1.319
Neg logit contributions
seeing-1.471
irit-1.368
arers-1.351
dating-1.336
raint-1.289
Token renewable
Token ID15713
Feature activation-1.01e-01

Pos logit contributions
seeing+2.494
irit+2.315
arers+2.229
dating+2.226
place+2.203
Neg logit contributions
Vs-3.299
 Minutes-3.056
inki-3.042
�醒-3.022
rador-2.925
Token energy
Token ID2568
Feature activation-2.69e-01

Pos logit contributions
Vs+0.656
 Minutes+0.604
inki+0.589
�醒+0.572
OUS+0.559
Neg logit contributions
seeing-0.594
irit-0.551
arers-0.543
dating-0.539
place-0.520
Token.
Token ID13
Feature activation+7.09e-02

Pos logit contributions
Vs+1.628
 Minutes+1.484
inki+1.439
�醒+1.385
OUS+1.370
Neg logit contributions
seeing-1.719
irit-1.600
arers-1.584
dating-1.563
effects-1.519
Token
Token ID198
Feature activation+2.85e-01

Pos logit contributions
seeing+0.518
irit+0.484
arers+0.476
dating+0.475
place+0.466
Neg logit contributions
Vs-0.360
inki-0.322
 Minutes-0.318
�醒-0.306
rador-0.299
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
seeing+2.178
place+1.987
dating+1.982
irit+1.968
arers+1.954
Neg logit contributions
Vs-1.365
�醒-1.280
inki-1.247
 Minutes-1.218
rador-1.156
TokenIt
Token ID1026
Feature activation-4.35e-01

Pos logit contributions
Vs+3.652
 Minutes+3.186
inki+2.922
OUS+2.893
USS+2.856
Neg logit contributions
seeing-5.767
irit-5.544
arers-5.524
dating-5.381
raint-5.323
Token was
Token ID373
Feature activation+1.07e-01

Pos logit contributions
Vs+2.558
 Minutes+2.336
inki+2.244
OUS+2.148
�醒+2.127
Neg logit contributions
seeing-2.818
irit-2.647
arers-2.598
dating-2.589
raint-2.511
Token amazing
Token ID4998
Feature activation-4.49e-01

Pos logit contributions
seeing+0.672
irit+0.622
arers+0.616
dating+0.609
lier+0.593
Neg logit contributions
Vs-0.664
 Minutes-0.601
inki-0.593
�醒-0.586
rador-0.561
Token to
Token ID284
Feature activation-3.00e-01

Pos logit contributions
Vs+2.888
 Minutes+2.657
inki+2.572
OUS+2.472
�醒+2.470
Neg logit contributions
seeing-2.669
irit-2.493
arers-2.468
dating-2.427
raint-2.353
Token watch
Token ID2342
Feature activation+4.12e-01

Pos logit contributions
Vs+2.094
 Minutes+1.936
inki+1.885
�醒+1.822
OUS+1.809
Neg logit contributions
seeing-1.638
irit-1.507
arers-1.487
dating-1.468
effects-1.418
Token him
Token ID683
Feature activation+3.13e-01

Pos logit contributions
seeing+2.375
irit+2.175
arers+2.157
place+2.119
dating+2.117
Neg logit contributions
Vs-2.691
�醒-2.502
inki-2.490
 Minutes-2.422
rador-2.347
Token bat
Token ID7365
Feature activation+1.03e+00

Pos logit contributions
seeing+1.870
irit+1.715
arers+1.706
dating+1.685
place+1.657
Neg logit contributions
Vs-2.027
 Minutes-1.843
�醒-1.834
inki-1.807
USS-1.715
Token a
Token ID257
Feature activation-3.06e-02

Pos logit contributions
seeing+4.412
dating+3.959
arers+3.937
place+3.925
irit+3.885
Neg logit contributions
Vs-7.752
�醒-7.382
inki-7.105
 Minutes-7.025
OUS-6.889
Token thousand
Token ID7319
Feature activation-5.20e-01

Pos logit contributions
Vs+0.181
 Minutes+0.164
inki+0.161
�醒+0.157
OUS+0.150
Neg logit contributions
seeing-0.203
irit-0.187
arers-0.184
dating-0.183
place-0.180
Token after
Token ID706
Feature activation-1.06e+00

Pos logit contributions
Vs+2.668
 Minutes+2.396
inki+2.267
OUS+2.160
�醒+2.145
Neg logit contributions
seeing-3.792
irit-3.567
arers-3.532
dating-3.499
raint-3.428
Token years
Token ID812
Feature activation+5.64e-02

Pos logit contributions
Vs+6.798
 Minutes+6.283
inki+5.929
OUS+5.776
rador+5.713
Neg logit contributions
seeing-5.962
arers-5.617
irit-5.561
raint-5.458
dating-5.368
Token
Token ID250
Feature activation-4.60e-01

Pos logit contributions
Vs+1.992
 Minutes+1.705
�醒+1.644
inki+1.591
OUS+1.471
Neg logit contributions
seeing-4.396
arers-4.202
irit-4.169
dating-4.142
effects-4.061
TokenM
Token ID44
Feature activation-1.13e+00

Pos logit contributions
Vs+2.379
 Minutes+2.129
inki+2.037
�醒+1.951
OUS+1.932
Neg logit contributions
seeing-3.324
irit-3.139
arers-3.115
dating-3.071
raint-3.003
Tokenilly
Token ID6548
Feature activation+3.86e-02

Pos logit contributions
Vs+5.709
inki+5.056
 Minutes+5.046
OUS+4.667
rador+4.624
Neg logit contributions
seeing-8.005
arers-7.499
irit-7.350
dating-7.303
rising-7.190
Token had
Token ID550
Feature activation+4.26e-01

Pos logit contributions
seeing+0.221
irit+0.204
arers+0.202
dating+0.199
effects+0.193
Neg logit contributions
Vs-0.258
 Minutes-0.237
inki-0.232
�醒-0.226
OUS-0.221
Token written
Token ID3194
Feature activation+3.91e-01

Pos logit contributions
seeing+2.616
irit+2.425
arers+2.416
dating+2.355
place+2.312
Neg logit contributions
Vs-2.656
 Minutes-2.390
inki-2.372
�醒-2.355
USS-2.245
Token notes
Token ID4710
Feature activation+5.93e-01

Pos logit contributions
seeing+2.568
irit+2.390
dating+2.344
arers+2.319
lier+2.288
Neg logit contributions
Vs-2.271
 Minutes-2.042
inki-2.038
�醒-1.967
rador-1.909
Token about
Token ID546
Feature activation+2.84e-01

Pos logit contributions
seeing+3.743
irit+3.428
dating+3.418
arers+3.392
lier+3.294
Neg logit contributions
Vs-3.573
 Minutes-3.210
�醒-3.175
inki-3.166
OUS-3.037
Token hating
Token ID45701
Feature activation+3.42e-01

Pos logit contributions
seeing+1.560
irit+1.421
arers+1.388
dating+1.383
effects+1.341
Neg logit contributions
Vs-1.984
 Minutes-1.823
inki-1.799
�醒-1.765
OUS-1.702
Token herself
Token ID5223
Feature activation+2.04e-01

Pos logit contributions
seeing+1.957
irit+1.800
dating+1.761
arers+1.756
lier+1.720
Neg logit contributions
Vs-2.284
 Minutes-2.104
inki-2.076
�醒-2.027
OUS-1.952
Token,
Token ID11
Feature activation+1.81e-01

Pos logit contributions
seeing+1.188
irit+1.084
arers+1.069
dating+1.066
lier+1.027
Neg logit contributions
Vs-1.362
 Minutes-1.247
inki-1.215
�醒-1.190
USS-1.163
Token about
Token ID546
Feature activation+1.30e-01

Pos logit contributions
seeing+1.076
irit+0.990
arers+0.973
dating+0.970
place+0.934
Neg logit contributions
Vs-1.179
 Minutes-1.087
inki-1.053
�醒-1.036
OUS-1.006
Token to
Token ID284
Feature activation+1.56e-01

Pos logit contributions
Vs+2.480
 Minutes+2.243
inki+2.165
OUS+2.051
�醒+2.043
Neg logit contributions
seeing-3.187
irit-3.013
arers-2.979
dating-2.942
effects-2.878
Token do
Token ID466
Feature activation+6.89e-02

Pos logit contributions
seeing+0.886
irit+0.820
arers+0.819
dating+0.800
place+0.786
Neg logit contributions
Vs-1.048
 Minutes-0.962
inki-0.936
�醒-0.931
OUS-0.891
Token that
Token ID326
Feature activation-9.98e-02

Pos logit contributions
seeing+0.410
irit+0.378
arers+0.375
dating+0.371
effects+0.360
Neg logit contributions
Vs-0.448
 Minutes-0.409
inki-0.401
�醒-0.392
rador-0.382
Token -
Token ID532
Feature activation-4.37e-02

Pos logit contributions
Vs+0.626
 Minutes+0.572
inki+0.557
�醒+0.542
OUS+0.528
Neg logit contributions
seeing-0.619
irit-0.573
arers-0.566
dating-0.562
effects-0.544
Token it
Token ID340
Feature activation-2.07e-01

Pos logit contributions
Vs+0.270
 Minutes+0.247
inki+0.240
�醒+0.237
OUS+0.229
Neg logit contributions
seeing-0.276
irit-0.254
arers-0.252
dating-0.251
effects-0.242
Token creates
Token ID8075
Feature activation-5.50e-01

Pos logit contributions
Vs+1.272
 Minutes+1.162
inki+1.128
�醒+1.086
OUS+1.073
Neg logit contributions
seeing-1.299
irit-1.209
arers-1.193
dating-1.182
effects-1.146
Token a
Token ID257
Feature activation-3.43e-01

Pos logit contributions
Vs+3.048
 Minutes+2.783
inki+2.685
OUS+2.539
�醒+2.508
Neg logit contributions
seeing-3.738
irit-3.498
arers-3.459
dating-3.441
raint-3.330
Token lot
Token ID1256
Feature activation+1.41e-01

Pos logit contributions
Vs+2.052
 Minutes+1.873
inki+1.819
�醒+1.747
OUS+1.726
Neg logit contributions
seeing-2.205
irit-2.054
arers-2.032
dating-2.010
effects-1.953
Token of
Token ID286
Feature activation-1.78e-01

Pos logit contributions
seeing+0.820
irit+0.758
arers+0.753
dating+0.743
effects+0.719
Neg logit contributions
Vs-0.933
 Minutes-0.860
inki-0.839
�醒-0.809
OUS-0.801
Token entry
Token ID5726
Feature activation-6.92e-01

Pos logit contributions
Vs+1.118
 Minutes+1.022
inki+0.994
�醒+0.964
OUS+0.947
Neg logit contributions
seeing-1.100
irit-1.021
arers-1.009
dating-0.997
effects-0.967
Token points
Token ID2173
Feature activation-4.24e-01

Pos logit contributions
Vs+3.980
 Minutes+3.596
inki+3.470
OUS+3.303
�醒+3.282
Neg logit contributions
seeing-4.592
irit-4.322
arers-4.289
dating-4.199
raint-4.102
Token let
Token ID1309
Feature activation+3.03e-01

Pos logit contributions
Vs+0.791
 Minutes+0.722
inki+0.702
�醒+0.680
OUS+0.666
Neg logit contributions
seeing-0.831
irit-0.772
arers-0.765
dating-0.756
effects-0.734
Token
Token ID447
Feature activation+1.34e-01

Pos logit contributions
seeing+1.712
irit+1.573
arers+1.552
dating+1.528
place+1.485
Neg logit contributions
Vs-2.038
 Minutes-1.885
inki-1.834
�醒-1.789
rador-1.755
Token
Token ID247
Feature activation-2.61e-01

Pos logit contributions
seeing+0.811
irit+0.763
arers+0.746
dating+0.727
raint+0.716
Neg logit contributions
Vs-0.836
 Minutes-0.775
inki-0.754
OUS-0.714
�醒-0.713
Tokens
Token ID82
Feature activation-9.24e-01

Pos logit contributions
Vs+1.491
 Minutes+1.350
inki+1.310
�醒+1.259
OUS+1.240
Neg logit contributions
seeing-1.756
irit-1.642
arers-1.624
dating-1.608
effects-1.563
Token assume
Token ID7048
Feature activation-8.26e-01

Pos logit contributions
Vs+5.466
inki+5.038
 Minutes+5.032
OUS+4.691
rador+4.583
Neg logit contributions
seeing-5.645
irit-5.251
dating-5.171
raint-5.079
arers-5.078
Token for
Token ID329
Feature activation-6.29e-01

Pos logit contributions
Vs+4.860
 Minutes+4.440
inki+4.295
OUS+4.094
�醒+4.024
Neg logit contributions
seeing-5.150
irit-4.887
arers-4.741
dating-4.715
raint-4.620
Token a
Token ID257
Feature activation-9.61e-01

Pos logit contributions
Vs+3.562
 Minutes+3.282
inki+3.176
�醒+2.989
OUS+2.970
Neg logit contributions
seeing-4.112
irit-3.869
arers-3.815
dating-3.761
place-3.687
Token moment
Token ID2589
Feature activation-7.27e-01

Pos logit contributions
Vs+4.219
 Minutes+3.824
inki+3.600
�醒+3.277
USS+3.248
Neg logit contributions
seeing-7.405
irit-7.017
arers-6.968
dating-6.918
effects-6.815
Token that
Token ID326
Feature activation-9.92e-01

Pos logit contributions
Vs+4.520
 Minutes+4.135
inki+4.007
OUS+3.825
�醒+3.794
Neg logit contributions
seeing-4.333
irit-4.121
arers-4.012
dating-3.980
effects-3.850
Token we
Token ID356
Feature activation-1.02e+00

Pos logit contributions
Vs+5.533
 Minutes+5.045
inki+4.795
OUS+4.617
�醒+4.487
Neg logit contributions
seeing-6.447
irit-6.139
arers-5.916
dating-5.913
raint-5.773
Token
Token ID447
Feature activation-4.34e-01

Pos logit contributions
Vs+5.811
 Minutes+5.139
inki+5.062
OUS+4.841
USS+4.755
Neg logit contributions
seeing-6.509
irit-6.092
dating-6.004
rising-5.902
raint-5.872
Token of
Token ID286
Feature activation+3.62e-01

Pos logit contributions
seeing+0.682
irit+0.629
arers+0.624
dating+0.614
lier+0.595
Neg logit contributions
Vs-0.709
 Minutes-0.647
inki-0.636
�醒-0.619
USS-0.602
Token myth
Token ID7918
Feature activation+7.21e-02

Pos logit contributions
seeing+2.164
irit+1.996
arers+1.971
dating+1.947
lier+1.904
Neg logit contributions
Vs-2.306
 Minutes-2.103
inki-2.093
�醒-2.053
OUS-1.976
Token is
Token ID318
Feature activation+1.89e-01

Pos logit contributions
seeing+0.416
irit+0.383
arers+0.380
dating+0.374
rising+0.363
Neg logit contributions
Vs-0.481
 Minutes-0.445
inki-0.435
�醒-0.422
OUS-0.413
Token thus
Token ID4145
Feature activation+2.18e-01

Pos logit contributions
seeing+1.193
irit+1.109
arers+1.106
dating+1.079
lier+1.055
Neg logit contributions
Vs-1.163
 Minutes-1.057
inki-1.035
�醒-1.011
OUS-0.975
Token conveniently
Token ID29801
Feature activation+1.24e-01

Pos logit contributions
seeing+1.357
irit+1.262
arers+1.260
dating+1.225
lier+1.199
Neg logit contributions
Vs-1.358
 Minutes-1.234
inki-1.206
�醒-1.180
OUS-1.144
Token reflected
Token ID12548
Feature activation-2.41e-01

Pos logit contributions
seeing+0.712
irit+0.658
arers+0.654
dating+0.639
place+0.618
Neg logit contributions
Vs-0.834
 Minutes-0.768
inki-0.749
�醒-0.731
OUS-0.715
Token back
Token ID736
Feature activation+6.13e-01

Pos logit contributions
Vs+1.509
 Minutes+1.382
inki+1.341
�醒+1.295
OUS+1.278
Neg logit contributions
seeing-1.483
irit-1.380
arers-1.362
dating-1.347
effects-1.305
Token to
Token ID284
Feature activation-2.43e-02

Pos logit contributions
seeing+4.020
dating+3.727
arers+3.718
irit+3.657
lier+3.599
Neg logit contributions
Vs-3.540
 Minutes-3.263
inki-3.198
�醒-3.156
OUS-2.990
Token him
Token ID683
Feature activation+1.71e-01

Pos logit contributions
Vs+0.163
 Minutes+0.150
inki+0.147
�醒+0.143
OUS+0.140
Neg logit contributions
seeing-0.140
irit-0.129
arers-0.128
dating-0.125
effects-0.121
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
seeing+1.071
irit+0.978
arers+0.977
dating+0.962
lier+0.940
Neg logit contributions
Vs-1.068
 Minutes-0.968
inki-0.953
�醒-0.936
OUS-0.901
Token
Token ID198
Feature activation-4.31e-02

Pos logit contributions
Vs+0.676
 Minutes+0.604
inki+0.601
�醒+0.577
rador+0.561
Neg logit contributions
seeing-0.883
irit-0.822
arers-0.815
dating-0.807
place-0.788
Token get
Token ID651
Feature activation-4.50e-01

Pos logit contributions
Vs+3.139
 Minutes+2.879
inki+2.792
OUS+2.662
�醒+2.648
Neg logit contributions
seeing-3.135
irit-2.924
arers-2.870
dating-2.860
raint-2.758
Token within
Token ID1626
Feature activation-1.13e+00

Pos logit contributions
Vs+2.634
 Minutes+2.397
inki+2.303
�醒+2.210
OUS+2.207
Neg logit contributions
seeing-2.959
irit-2.770
arers-2.734
dating-2.701
effects-2.625
Token a
Token ID257
Feature activation-7.37e-01

Pos logit contributions
Vs+5.787
 Minutes+5.503
OUS+4.971
inki+4.962
USS+4.780
Neg logit contributions
seeing-7.641
irit-7.395
dating-7.074
arers-7.048
raint-6.932
Token point
Token ID966
Feature activation-7.14e-01

Pos logit contributions
Vs+4.026
 Minutes+3.681
inki+3.499
OUS+3.392
�醒+3.307
Neg logit contributions
seeing-5.006
irit-4.755
arers-4.652
dating-4.644
effects-4.535
Token at
Token ID379
Feature activation-9.18e-01

Pos logit contributions
Vs+4.280
 Minutes+3.920
inki+3.816
OUS+3.696
�醒+3.618
Neg logit contributions
seeing-4.421
irit-4.248
arers-4.177
dating-4.092
effects-4.002
Token 15
Token ID1315
Feature activation-5.83e-01

Pos logit contributions
Vs+5.030
 Minutes+4.589
inki+4.279
OUS+4.156
USS+4.051
Neg logit contributions
seeing-6.240
irit-5.924
arers-5.805
dating-5.719
raint-5.577
Token-
Token ID12
Feature activation+7.09e-02

Pos logit contributions
Vs+3.572
 Minutes+3.278
inki+3.135
�醒+3.014
OUS+2.997
Neg logit contributions
seeing-3.682
irit-3.447
arers-3.379
dating-3.362
effects-3.255
Token14
Token ID1415
Feature activation-1.00e+00

Pos logit contributions
seeing+0.381
irit+0.346
dating+0.341
arers+0.339
place+0.329
Neg logit contributions
Vs-0.501
 Minutes-0.466
inki-0.456
�醒-0.449
OUS-0.437
Token heading
Token ID9087
Feature activation-5.17e-01

Pos logit contributions
Vs+5.925
 Minutes+5.456
inki+5.229
�醒+4.983
OUS+4.961
Neg logit contributions
seeing-6.300
irit-5.991
arers-5.898
dating-5.763
effects-5.645
Token into
Token ID656
Feature activation-8.38e-01

Pos logit contributions
Vs+2.108
 Minutes+1.845
inki+1.742
�醒+1.626
OUS+1.616
Neg logit contributions
seeing-4.283
irit-4.092
arers-4.045
dating-3.995
effects-3.920
Token the
Token ID262
Feature activation-7.36e-01

Pos logit contributions
Vs+4.011
 Minutes+3.584
inki+3.347
�醒+3.197
OUS+3.154
Neg logit contributions
seeing-6.230
irit-5.971
arers-5.883
dating-5.710
rising-5.639
Token,
Token ID11
Feature activation-1.33e-02

Pos logit contributions
seeing+0.566
irit+0.521
arers+0.511
dating+0.511
place+0.504
Neg logit contributions
Vs-0.604
 Minutes-0.555
inki-0.546
�醒-0.536
rador-0.520
Token but
Token ID475
Feature activation+6.61e-02

Pos logit contributions
Vs+0.085
 Minutes+0.078
inki+0.076
�醒+0.075
rador+0.073
Neg logit contributions
seeing-0.082
irit-0.074
arers-0.074
dating-0.074
place-0.071
Token the
Token ID262
Feature activation+2.84e-01

Pos logit contributions
seeing+0.398
irit+0.365
arers+0.364
dating+0.359
place+0.348
Neg logit contributions
Vs-0.427
 Minutes-0.390
inki-0.384
�醒-0.376
OUS-0.363
Token biology
Token ID17219
Feature activation-1.38e-01

Pos logit contributions
seeing+1.691
irit+1.547
arers+1.547
dating+1.537
place+1.508
Neg logit contributions
Vs-1.838
 Minutes-1.665
inki-1.652
�醒-1.613
rador-1.583
Token tr
Token ID491
Feature activation+2.03e-01

Pos logit contributions
Vs+0.864
 Minutes+0.790
inki+0.772
�醒+0.748
OUS+0.731
Neg logit contributions
seeing-0.854
irit-0.792
arers-0.783
dating-0.775
effects-0.753
Tokenumps
Token ID8142
Feature activation-1.55e-01

Pos logit contributions
seeing+1.049
irit+1.001
arers+0.961
dating+0.935
lier+0.934
Neg logit contributions
Vs-1.432
 Minutes-1.324
�醒-1.295
inki-1.288
OME-1.263
Token a
Token ID257
Feature activation-3.53e-01

Pos logit contributions
Vs+0.976
 Minutes+0.893
inki+0.869
�醒+0.841
OUS+0.827
Neg logit contributions
seeing-0.958
irit-0.889
arers-0.879
dating-0.870
effects-0.843
Token lot
Token ID1256
Feature activation+2.27e-01

Pos logit contributions
Vs+1.970
 Minutes+1.780
inki+1.725
�醒+1.659
OUS+1.629
Neg logit contributions
seeing-2.425
irit-2.268
arers-2.243
dating-2.221
effects-2.159
Token of
Token ID286
Feature activation-3.02e-01

Pos logit contributions
seeing+1.285
irit+1.185
arers+1.170
dating+1.155
effects+1.116
Neg logit contributions
Vs-1.544
 Minutes-1.423
inki-1.387
�醒-1.344
OUS-1.326
Token these
Token ID777
Feature activation-2.58e-01

Pos logit contributions
Vs+1.869
 Minutes+1.714
inki+1.657
�醒+1.592
OUS+1.582
Neg logit contributions
seeing-1.878
irit-1.750
arers-1.730
dating-1.708
raint-1.659
Token environmental
Token ID6142
Feature activation-2.22e-01

Pos logit contributions
Vs+1.552
 Minutes+1.414
inki+1.378
�醒+1.333
OUS+1.306
Neg logit contributions
seeing-1.653
irit-1.538
arers-1.522
dating-1.508
effects-1.466
Token lit
Token ID6578
Feature activation+6.50e-01

Pos logit contributions
Vs+0.345
 Minutes+0.313
inki+0.307
�醒+0.296
rador+0.289
Neg logit contributions
seeing-0.405
irit-0.379
arers-0.373
dating-0.369
raint-0.359
Tokenre
Token ID260
Feature activation-2.48e-01

Pos logit contributions
seeing+2.495
arers+2.358
irit+2.287
dating+2.137
lier+2.129
Neg logit contributions
Vs-5.347
 Minutes-5.134
inki-5.017
�醒-4.961
OUS-4.853
Token of
Token ID286
Feature activation-3.51e-01

Pos logit contributions
Vs+1.601
 Minutes+1.476
inki+1.438
�醒+1.394
OUS+1.370
Neg logit contributions
seeing-1.482
irit-1.375
arers-1.355
dating-1.338
effects-1.295
Token reduction
Token ID7741
Feature activation-3.68e-01

Pos logit contributions
Vs+2.033
 Minutes+1.847
inki+1.791
�醒+1.724
OUS+1.697
Neg logit contributions
seeing-2.331
irit-2.177
arers-2.154
dating-2.131
effects-2.071
Token in
Token ID287
Feature activation-2.46e-01

Pos logit contributions
Vs+2.265
 Minutes+2.072
inki+2.013
�醒+1.945
OUS+1.910
Neg logit contributions
seeing-2.312
irit-2.151
arers-2.125
dating-2.101
effects-2.040
Token diesel
Token ID21866
Feature activation-2.05e+00

Pos logit contributions
Vs+1.446
 Minutes+1.315
inki+1.277
�醒+1.231
OUS+1.210
Neg logit contributions
seeing-1.618
irit-1.509
arers-1.493
dating-1.475
effects-1.435
Token consumption
Token ID7327
Feature activation-5.60e-01

Pos logit contributions
Vs+10.393
 Minutes+8.969
inki+8.698
OUS+8.426
OME+8.147
Neg logit contributions
seeing-12.887
dating-11.960
arers-11.833
irit-11.689
hemat-11.379
Token per
Token ID583
Feature activation-3.60e-01

Pos logit contributions
Vs+3.384
 Minutes+3.093
inki+2.977
OUS+2.849
�醒+2.815
Neg logit contributions
seeing-3.531
irit-3.299
arers-3.256
dating-3.220
raint-3.121
Token coach
Token ID3985
Feature activation-1.95e-01

Pos logit contributions
Vs+2.015
 Minutes+1.827
inki+1.765
�醒+1.694
OUS+1.672
Neg logit contributions
seeing-2.453
irit-2.297
arers-2.271
dating-2.253
effects-2.192
Token per
Token ID583
Feature activation+1.16e-01

Pos logit contributions
Vs+1.236
 Minutes+1.133
inki+1.108
�醒+1.071
OUS+1.050
Neg logit contributions
seeing-1.187
irit-1.095
arers-1.089
dating-1.070
raint-1.039
Token ann
Token ID1529
Feature activation-1.44e+00

Pos logit contributions
seeing+0.727
arers+0.670
irit+0.669
dating+0.654
lier+0.641
Neg logit contributions
Vs-0.710
 Minutes-0.645
inki-0.634
�醒-0.624
OUS-0.600
Token published
Token ID3199
Feature activation-1.63e-01

Pos logit contributions
Vs+1.940
 Minutes+1.792
inki+1.742
�醒+1.671
OUS+1.668
Neg logit contributions
seeing-1.595
irit-1.470
arers-1.445
dating-1.435
raint-1.388
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
Vs+1.024
 Minutes+0.938
inki+0.912
�醒+0.881
OUS+0.868
Neg logit contributions
seeing-0.997
irit-0.926
arers-0.915
dating-0.904
effects-0.877
Token
Token ID198
Feature activation+9.41e-02

Pos logit contributions
Vs+0.089
 Minutes+0.079
inki+0.073
�醒+0.070
OUS+0.068
Neg logit contributions
seeing-0.175
irit-0.166
arers-0.165
dating-0.163
raint-0.160
Token
Token ID198
Feature activation+2.33e-01

Pos logit contributions
seeing+0.737
irit+0.678
dating+0.677
arers+0.672
place+0.672
Neg logit contributions
Vs-0.435
�醒-0.387
inki-0.386
 Minutes-0.386
rador-0.358
TokenDA
Token ID5631
Feature activation-8.20e-01

Pos logit contributions
seeing+1.607
irit+1.475
dating+1.466
arers+1.455
place+1.434
Neg logit contributions
Vs-1.272
 Minutes-1.181
inki-1.153
�醒-1.126
rador-1.089
TokenIK
Token ID18694
Feature activation-1.32e+00

Pos logit contributions
Vs+3.921
 Minutes+3.446
inki+3.210
OUS+3.154
USS+3.121
Neg logit contributions
seeing-6.303
irit-5.931
arers-5.910
dating-5.860
raint-5.759
TokenAI
Token ID20185
Feature activation-8.89e-01

Pos logit contributions
Vs+6.823
 Minutes+6.171
OUS+6.034
inki+5.818
USS+5.786
Neg logit contributions
seeing-9.149
arers-8.624
dating-8.615
irit-8.529
raint-8.369
TokenJ
Token ID41
Feature activation+1.17e-01

Pos logit contributions
Vs+4.941
 Minutes+4.476
OUS+4.261
inki+4.165
USS+4.109
Neg logit contributions
seeing-5.970
irit-5.641
arers-5.601
dating-5.537
raint-5.459
TokenU
Token ID52
Feature activation-6.86e-01

Pos logit contributions
seeing+0.799
irit+0.751
arers+0.735
dating+0.730
effects+0.709
Neg logit contributions
Vs-0.642
 Minutes-0.587
inki-0.569
�醒-0.565
rador-0.552
Token
Token ID198
Feature activation-2.24e-01

Pos logit contributions
Vs+3.799
 Minutes+3.430
inki+3.255
OUS+3.210
USS+3.126
Neg logit contributions
seeing-4.720
irit-4.423
arers-4.398
dating-4.334
raint-4.251
Token
Token ID198
Feature activation-1.72e-01

Pos logit contributions
Vs+0.946
 Minutes+0.827
inki+0.790
�醒+0.747
OUS+0.731
Neg logit contributions
seeing-1.841
irit-1.743
arers-1.728
dating-1.713
effects-1.675
Token obligation
Token ID12990
Feature activation-6.50e-01

Pos logit contributions
seeing+2.206
irit+2.047
arers+2.013
dating+1.975
place+1.954
Neg logit contributions
Vs-2.881
 Minutes-2.656
inki-2.614
�醒-2.607
rador-2.499
Token to
Token ID284
Feature activation+1.89e-02

Pos logit contributions
Vs+2.757
 Minutes+2.506
inki+2.347
OUS+2.233
�醒+2.185
Neg logit contributions
seeing-5.057
irit-4.855
arers-4.799
dating-4.716
effects-4.692
Token protect
Token ID1805
Feature activation+2.58e-01

Pos logit contributions
seeing+0.106
irit+0.098
arers+0.097
dating+0.095
place+0.092
Neg logit contributions
Vs-0.130
 Minutes-0.119
inki-0.117
�醒-0.114
OUS-0.111
Token and
Token ID290
Feature activation+3.29e-01

Pos logit contributions
seeing+1.591
irit+1.465
arers+1.463
dating+1.425
place+1.413
Neg logit contributions
Vs-1.619
 Minutes-1.471
inki-1.449
�醒-1.435
rador-1.373
Token promote
Token ID7719
Feature activation+2.86e-01

Pos logit contributions
seeing+1.838
irit+1.707
arers+1.688
dating+1.651
place+1.632
Neg logit contributions
Vs-2.239
 Minutes-2.050
�醒-2.008
inki-1.994
OUS-1.912
Token human
Token ID1692
Feature activation-1.03e+00

Pos logit contributions
seeing+1.698
irit+1.564
arers+1.558
place+1.524
dating+1.521
Neg logit contributions
Vs-1.862
 Minutes-1.679
inki-1.669
�醒-1.658
rador-1.573
Token rights
Token ID2489
Feature activation+1.48e-01

Pos logit contributions
Vs+5.368
 Minutes+4.907
inki+4.692
OUS+4.487
OME+4.327
Neg logit contributions
seeing-6.876
irit-6.502
dating-6.456
arers-6.369
rising-6.303
Token."
Token ID526
Feature activation+2.93e-02

Pos logit contributions
seeing+0.852
irit+0.782
arers+0.771
dating+0.771
place+0.749
Neg logit contributions
Vs-0.992
 Minutes-0.909
inki-0.895
�醒-0.881
OUS-0.848
Token
Token ID198
Feature activation+2.64e-04

Pos logit contributions
seeing+0.235
irit+0.220
arers+0.218
dating+0.217
effects+0.213
Neg logit contributions
Vs-0.129
 Minutes-0.112
inki-0.111
�醒-0.105
rador-0.101
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
seeing+0.002
irit+0.002
dating+0.002
arers+0.002
place+0.002
Neg logit contributions
Vs-0.001
 Minutes-0.001
inki-0.001
�醒-0.001
rador-0.001
Token"
Token ID1
Feature activation-5.90e-01

Pos logit contributions
seeing+0.905
irit+0.831
dating+0.824
arers+0.815
place+0.807
Neg logit contributions
Vs-0.647
 Minutes-0.602
inki-0.589
�醒-0.573
rador-0.557
Tokenure
Token ID495
Feature activation+5.21e-02

Pos logit contributions
seeing+6.306
irit+5.850
arers+5.599
raint+5.529
lier+5.434
Neg logit contributions
Vs-8.482
 Minutes-8.181
�醒-8.001
inki-7.895
OUS-7.520
Token Test
Token ID6208
Feature activation+4.18e-01

Pos logit contributions
seeing+0.351
irit+0.325
arers+0.322
dating+0.319
lier+0.312
Neg logit contributions
Vs-0.296
 Minutes-0.270
inki-0.264
�醒-0.258
rador-0.249
Token truck
Token ID7779
Feature activation-5.76e-01

Pos logit contributions
seeing+2.674
arers+2.459
irit+2.439
dating+2.415
place+2.384
Neg logit contributions
Vs-2.434
inki-2.237
 Minutes-2.226
�醒-2.212
OUS-2.107
Token in
Token ID287
Feature activation+2.70e-01

Pos logit contributions
Vs+3.619
 Minutes+3.301
inki+3.208
�醒+3.069
OUS+3.045
Neg logit contributions
seeing-3.511
irit-3.291
arers-3.236
dating-3.202
effects-3.113
Token the
Token ID262
Feature activation-3.07e-01

Pos logit contributions
seeing+1.489
irit+1.353
arers+1.340
dating+1.339
place+1.307
Neg logit contributions
Vs-1.847
inki-1.686
�醒-1.682
 Minutes-1.677
OUS-1.615
Token 2010
Token ID3050
Feature activation-1.13e+00

Pos logit contributions
Vs+1.875
 Minutes+1.718
inki+1.659
�醒+1.596
OUS+1.581
Neg logit contributions
seeing-1.943
irit-1.809
arers-1.788
dating-1.769
effects-1.713
Token B
Token ID347
Feature activation+3.92e-01

Pos logit contributions
Vs+6.799
 Minutes+6.201
inki+5.816
USS+5.578
OUS+5.578
Neg logit contributions
seeing-6.947
irit-6.515
arers-6.437
dating-6.379
rising-6.207
Tokenaja
Token ID27792
Feature activation-6.54e-01

Pos logit contributions
seeing+1.412
irit+1.300
arers+1.256
dating+1.225
place+1.189
Neg logit contributions
Vs-3.292
 Minutes-3.159
�醒-3.112
inki-3.066
rador-3.019
Token 1000
Token ID8576
Feature activation-1.17e+00

Pos logit contributions
Vs+3.559
 Minutes+3.241
inki+3.091
OUS+2.953
�醒+2.936
Neg logit contributions
seeing-4.483
irit-4.217
arers-4.212
dating-4.150
rising-4.039
Token,
Token ID11
Feature activation-5.11e-01

Pos logit contributions
Vs+7.065
 Minutes+6.439
inki+6.104
OUS+5.915
USS+5.823
Neg logit contributions
seeing-7.010
irit-6.711
arers-6.605
dating-6.485
effects-6.330
Token as
Token ID355
Feature activation-4.99e-02

Pos logit contributions
Vs+3.117
 Minutes+2.847
inki+2.756
�醒+2.630
OUS+2.616
Neg logit contributions
seeing-3.215
irit-3.003
arers-2.962
dating-2.920
raint-2.847
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
Vs+3.142
 Minutes+2.889
inki+2.807
�醒+2.711
OUS+2.660
Neg logit contributions
seeing-2.935
irit-2.734
arers-2.708
dating-2.658
effects-2.583
Token
Token ID198
Feature activation+2.00e-01

Pos logit contributions
Vs+1.475
 Minutes+1.313
inki+1.233
�醒+1.186
OUS+1.159
Neg logit contributions
seeing-2.557
irit-2.429
arers-2.408
dating-2.384
raint-2.324
Token
Token ID198
Feature activation+2.60e-02

Pos logit contributions
seeing+1.546
dating+1.412
irit+1.408
place+1.407
arers+1.395
Neg logit contributions
Vs-0.941
�醒-0.861
inki-0.852
 Minutes-0.837
rador-0.786
TokenFl
Token ID7414
Feature activation+3.76e-01

Pos logit contributions
seeing+0.176
irit+0.163
arers+0.161
dating+0.160
effects+0.156
Neg logit contributions
Vs-0.147
 Minutes-0.136
inki-0.131
�醒-0.127
rador-0.124
Tokenake
Token ID539
Feature activation+2.54e-01

Pos logit contributions
seeing+1.979
irit+1.882
arers+1.834
lier+1.796
dating+1.780
Neg logit contributions
Vs-2.682
 Minutes-2.493
�醒-2.445
inki-2.320
OUS-2.317
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
seeing+1.488
irit+1.387
arers+1.373
dating+1.346
lier+1.309
Neg logit contributions
Vs-1.632
 Minutes-1.501
inki-1.470
�醒-1.436
rador-1.401
Token who
Token ID508
Feature activation-5.46e-01

Pos logit contributions
Vs+5.739
 Minutes+5.287
inki+5.050
OUS+4.781
�醒+4.709
Neg logit contributions
seeing-6.748
irit-6.385
dating-6.192
arers-6.188
raint-6.107
Token announced
Token ID3414
Feature activation-4.02e-01

Pos logit contributions
Vs+3.009
 Minutes+2.702
inki+2.598
OUS+2.469
�醒+2.467
Neg logit contributions
seeing-3.792
irit-3.535
arers-3.488
dating-3.473
effects-3.383
Token last
Token ID938
Feature activation+1.34e-01

Pos logit contributions
Vs+2.739
 Minutes+2.535
inki+2.443
�醒+2.363
OUS+2.352
Neg logit contributions
seeing-2.241
irit-2.079
arers-2.063
dating-2.015
effects-1.957
Token month
Token ID1227
Feature activation-6.29e-01

Pos logit contributions
seeing+0.857
irit+0.797
arers+0.785
dating+0.781
place+0.762
Neg logit contributions
Vs-0.801
 Minutes-0.727
inki-0.716
�醒-0.697
rador-0.678
Token that
Token ID326
Feature activation-2.97e-01

Pos logit contributions
Vs+4.139
 Minutes+3.777
inki+3.639
�醒+3.522
OUS+3.501
Neg logit contributions
seeing-3.631
irit-3.384
arers-3.378
dating-3.263
raint-3.197