TOP ACTIVATIONS
MAX = 2.286

how to fix the broken civil - rights office . One
The EPA s civil - rights office has long
um acher 's Mercedes was third ahead of McLaren 's Lewis
the agency to finish investigating civil - rights claims that have
the EPA s civil - rights office : develop
' The Guard ia Civil ( police force ) seized
was less surprised by the long incub ation times , noting
protect Montana s public lands , fight for affordable
last time Gilbert and the civil rights group were on the
bell rang while I was preparing our dinner . " I
that may mean they will only carry the 32 GB model
the EPA s civil - rights office will boost
king had ordered the train traffic to continue , fearing that
Since you re being kicked out , I
sure the lines will be just as long . Knowing what
in their climbing trying to steal some time before Will ung
O VD is breaking the civil society mould here . <|endoftext|>
. ACLU of Oregon (@ AC LU _ OR )
of looking after the public interest and making recommendations ,
face off against the current second place Democratic presidential candidate ,

MIN ACTIVATIONS
MIN = -2.294

the situation have confirmed to Murphy s Hockey
. 1 defender , later told a Toronto Star reporter that
operating in Haiti , has told the Government of Haiti that
new - object system . draw ing . rect angle f
years old and once I turn AA on things slow down
image = [ System . Draw ing . Graphics ] ::
a knob or flip a switch , just leave everything alone
ignment = [ system . draw ing . String Al ignment
developer Yosh io Sak amoto told me in 2010 .
old and once I turn AA on things slow down quite
what the American ambassador himself told me , said
way of doing the light switch skip . Hop ing to
; the D ugg ars told Fox News in June that
ignment = [ system . draw ing . String Al ignment
new - object system . draw ing . bit map ($
) and par asy mp athetic ( rest and
ackers , with an agent telling a reporter he attends DIY
explained to NBC s Matt L auer that he
ask Players with " No Move " clauses whether they want
26 . Thanks , Wii UM aster Race <|endoftext|>

INTERVAL 1.141 - 2.286
CONTAINS 0.414%

migration will be pre - cond itions of any
traditional startup and operating as a financial institution . Even then
Apparently she was originally inspired by her love of
lawsuit in the US against Airbus and aerospace company North rop
- operation . Just five years later , in

INTERVAL -0.004 - 1.141
CONTAINS 49.830%

T ravis & Protect ing Your Girl In Public - Writ
Cup Qual ifiers , which would also offer a great incentive
fed state . Typically , the fed state starts
how can it be expanding ? Like what
City Council Six Years In Search of

INTERVAL -1.149 - -0.004
CONTAINS 49.380%

. Seriously . You know how Batman uses bats to
full v ) { // Time component const float TA U
Post explained how , in early 2008 , US Senator Frank
regarded as separate species , said Dr Potter ,
the other night . A specialist in Czech small ness and

INTERVAL -2.294 - -1.149
CONTAINS 0.375%

gra ze sheep on any golf course owned by Donald Trump
Res a ech says , ' With the information we
Lor re ) does something to him , but we 're
ty - vers us - sw orn - en m ity
officers have the huge responsibility of looking
Token how
Token ID703
Feature activation-2.79e-01

Pos logit contributions
arius+0.264
�醒+0.261
asar+0.259
+0.257
mess+0.252
Neg logit contributions
ggles-0.206
geoning-0.181
 Oprah-0.174
pees-0.172
undo-0.172
Token to
Token ID284
Feature activation-4.18e-01

Pos logit contributions
ggles+1.987
geoning+1.755
 Oprah+1.720
 toggle+1.687
pees+1.682
Neg logit contributions
arius-2.464
�醒-2.431
asar-2.413
-2.390
mess-2.328
Token fix
Token ID4259
Feature activation+5.68e-01

Pos logit contributions
ggles+2.707
geoning+2.273
 toggle+2.259
 voluntarily+2.222
undo+2.213
Neg logit contributions
�醒-4.059
arius-4.015
-3.940
asar-3.915
mess-3.839
Token the
Token ID262
Feature activation+2.05e-01

Pos logit contributions
arius+4.376
asar+4.267
�醒+4.237
+4.213
mess+4.189
Neg logit contributions
ggles-4.563
geoning-4.117
roxy-3.980
pees-3.972
undo-3.928
Token broken
Token ID5445
Feature activation+1.76e-01

Pos logit contributions
arius+1.699
�醒+1.666
asar+1.661
+1.651
mess+1.619
Neg logit contributions
ggles-1.583
geoning-1.411
 Oprah-1.355
pees-1.354
roxy-1.351
Token civil
Token ID3026
Feature activation+2.29e+00

Pos logit contributions
arius+1.388
�醒+1.358
asar+1.354
+1.346
mess+1.317
Neg logit contributions
ggles-1.436
geoning-1.289
 Oprah-1.241
pees-1.239
roxy-1.237
Token-
Token ID12
Feature activation+9.44e-01

Pos logit contributions
asar+10.577
mess+10.403
arius+9.992
 SERV+9.876
+9.746
Neg logit contributions
ggles-16.990
pees-15.936
undo-15.912
roxy-15.850
edit-15.238
Tokenrights
Token ID28046
Feature activation+1.03e+00

Pos logit contributions
mess+8.390
asar+8.315
arius+8.224
+7.966
 SERV+7.505
Neg logit contributions
ggles-6.169
geoning-5.526
 Oprah-5.357
 toggle-5.349
roxy-5.149
Token office
Token ID2607
Feature activation+1.33e-01

Pos logit contributions
mess+6.669
asar+6.560
arius+6.546
+6.518
 SERV+6.223
Neg logit contributions
ggles-8.912
pees-8.020
geoning-8.012
roxy-7.812
edit-7.780
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
arius+1.095
�醒+1.076
asar+1.070
+1.063
mess+1.041
Neg logit contributions
ggles-1.037
geoning-0.927
 Oprah-0.891
pees-0.888
roxy-0.886
Token One
Token ID1881
Feature activation-4.57e-01

Pos logit contributions
ggles+1.675
geoning+1.499
 Oprah+1.457
undo+1.440
pees+1.435
Neg logit contributions
arius-1.672
�醒-1.642
asar-1.636
-1.624
mess-1.583
TokenThe
Token ID464
Feature activation+1.66e-01

Pos logit contributions
ggles+0.566
geoning+0.501
 Oprah+0.480
pees+0.479
undo+0.478
Neg logit contributions
arius-0.673
�醒-0.662
asar-0.659
-0.654
mess-0.640
Token EPA
Token ID10193
Feature activation-9.94e-02

Pos logit contributions
arius+1.425
�醒+1.399
asar+1.393
+1.385
mess+1.358
Neg logit contributions
ggles-1.243
geoning-1.104
 Oprah-1.059
pees-1.056
roxy-1.054
Token
Token ID447
Feature activation+2.73e-01

Pos logit contributions
ggles+0.772
geoning+0.692
undo+0.664
 Oprah+0.663
pees+0.662
Neg logit contributions
arius-0.818
�醒-0.806
asar-0.801
-0.796
mess-0.778
Token
Token ID247
Feature activation-3.23e-01

Pos logit contributions
arius+0.951
�醒+0.912
asar+0.905
+0.878
mess+0.846
Neg logit contributions
ggles-3.438
geoning-3.199
 Oprah-3.132
undo-3.128
roxy-3.127
Tokens
Token ID82
Feature activation+4.92e-01

Pos logit contributions
ggles+1.861
geoning+1.577
undo+1.551
roxy+1.522
 voluntarily+1.512
Neg logit contributions
�醒-3.359
arius-3.281
asar-3.185
-3.184
 SERV-3.092
Token civil
Token ID3026
Feature activation+2.21e+00

Pos logit contributions
arius+3.874
asar+3.804
+3.738
�醒+3.727
mess+3.719
Neg logit contributions
ggles-3.926
geoning-3.503
pees-3.421
 Oprah-3.371
undo-3.359
Token-
Token ID12
Feature activation+8.89e-01

Pos logit contributions
asar+9.329
mess+9.042
arius+8.899
+8.101
 SERV+7.944
Neg logit contributions
ggles-19.431
geoning-18.112
undo-17.859
pees-17.551
roxy-17.429
Tokenrights
Token ID28046
Feature activation+4.12e-01

Pos logit contributions
mess+7.974
asar+7.915
arius+7.881
+7.615
 SERV+7.057
Neg logit contributions
ggles-5.807
geoning-5.220
 toggle-5.017
 Oprah-5.004
pees-4.794
Token office
Token ID2607
Feature activation+6.92e-02

Pos logit contributions
arius+3.298
asar+3.257
+3.192
�醒+3.180
mess+3.174
Neg logit contributions
ggles-3.257
geoning-2.894
pees-2.809
 Oprah-2.759
undo-2.751
Token has
Token ID468
Feature activation-1.90e-01

Pos logit contributions
arius+0.599
�醒+0.589
asar+0.586
+0.583
mess+0.571
Neg logit contributions
ggles-0.512
geoning-0.454
 Oprah-0.436
pees-0.434
roxy-0.434
Token long
Token ID890
Feature activation+3.70e-01

Pos logit contributions
ggles+1.430
geoning+1.267
 voluntarily+1.245
 Oprah+1.241
 toggle+1.222
Neg logit contributions
�醒-1.601
arius-1.594
-1.583
asar-1.563
mess-1.518
Tokenum
Token ID388
Feature activation+4.71e-01

Pos logit contributions
arius+3.602
asar+3.528
mess+3.436
+3.355
�醒+3.339
Neg logit contributions
ggles-5.521
geoning-5.108
 Oprah-4.902
pees-4.862
 toggle-4.849
Tokenacher
Token ID3493
Feature activation+4.05e-01

Pos logit contributions
arius+1.690
asar+1.586
mess+1.501
�醒+1.472
+1.418
Neg logit contributions
ggles-5.908
geoning-5.509
 Oprah-5.423
 toggle-5.366
roxy-5.357
Token's
Token ID338
Feature activation+4.43e-01

Pos logit contributions
arius+3.228
asar+3.150
+3.116
mess+3.083
�醒+3.079
Neg logit contributions
ggles-3.228
geoning-2.926
roxy-2.802
pees-2.792
undo-2.753
Token Mercedes
Token ID21279
Feature activation+9.38e-01

Pos logit contributions
arius+3.583
asar+3.510
+3.449
�醒+3.442
mess+3.433
Neg logit contributions
ggles-3.473
geoning-3.141
pees-3.031
roxy-2.989
undo-2.949
Token was
Token ID373
Feature activation+4.58e-01

Pos logit contributions
arius+6.869
mess+6.692
asar+6.658
+6.513
 SERV+6.407
Neg logit contributions
ggles-7.500
geoning-7.153
pees-6.789
roxy-6.677
laim-6.533
Token third
Token ID2368
Feature activation+2.16e+00

Pos logit contributions
arius+3.693
asar+3.603
�醒+3.547
+3.534
mess+3.490
Neg logit contributions
ggles-3.597
geoning-3.259
pees-3.123
roxy-3.121
 Oprah-3.061
Token ahead
Token ID4058
Feature activation+9.23e-01

Pos logit contributions
asar+11.449
arius+11.430
 quickest+11.337
mess+10.930
 SERV+10.646
Neg logit contributions
ggles-17.278
geoning-15.567
roxy-15.307
pees-15.274
embed-14.899
Token of
Token ID286
Feature activation+7.39e-01

Pos logit contributions
arius+5.910
asar+5.887
mess+5.520
+5.457
�醒+5.338
Neg logit contributions
ggles-8.496
geoning-7.668
pees-7.502
roxy-7.321
 Oprah-7.063
Token McLaren
Token ID33017
Feature activation+3.17e-01

Pos logit contributions
arius+5.205
asar+5.129
mess+4.972
+4.965
�醒+4.788
Neg logit contributions
ggles-6.198
geoning-5.779
roxy-5.559
pees-5.557
laim-5.403
Token's
Token ID338
Feature activation-2.64e-02

Pos logit contributions
arius+2.631
asar+2.574
+2.557
�醒+2.556
mess+2.507
Neg logit contributions
ggles-2.453
geoning-2.197
pees-2.090
roxy-2.080
 Oprah-2.079
Token Lewis
Token ID10174
Feature activation+8.20e-01

Pos logit contributions
ggles+0.199
geoning+0.176
 Oprah+0.172
roxy+0.169
pees+0.169
Neg logit contributions
arius-0.224
�醒-0.223
asar-0.220
-0.219
mess-0.213
Token the
Token ID262
Feature activation-2.16e-01

Pos logit contributions
ggles+0.229
geoning+0.203
 Oprah+0.194
pees+0.194
roxy+0.194
Neg logit contributions
arius-0.279
�醒-0.275
asar-0.273
-0.272
mess-0.267
Token agency
Token ID4086
Feature activation-8.46e-02

Pos logit contributions
ggles+1.445
geoning+1.290
 Oprah+1.236
 toggle+1.231
undo+1.220
Neg logit contributions
arius-2.015
�醒-1.997
asar-1.971
-1.959
mess-1.902
Token to
Token ID284
Feature activation-3.45e-01

Pos logit contributions
ggles+0.616
geoning+0.541
roxy+0.519
pees+0.518
 Oprah+0.517
Neg logit contributions
arius-0.747
�醒-0.734
asar-0.731
-0.728
mess-0.714
Token finish
Token ID5461
Feature activation+3.33e-01

Pos logit contributions
ggles+2.345
 voluntarily+2.003
geoning+1.975
 toggle+1.967
 Oprah+1.936
Neg logit contributions
arius-3.199
�醒-3.192
-3.164
asar-3.093
mess-3.055
Token investigating
Token ID10240
Feature activation+4.41e-01

Pos logit contributions
arius+3.085
asar+3.026
�醒+3.012
+2.984
mess+2.963
Neg logit contributions
ggles-2.255
geoning-1.962
pees-1.876
roxy-1.871
undo-1.867
Token civil
Token ID3026
Feature activation+2.12e+00

Pos logit contributions
arius+3.619
asar+3.567
�醒+3.512
+3.501
mess+3.481
Neg logit contributions
ggles-3.400
geoning-3.020
roxy-2.916
pees-2.885
undo-2.885
Token-
Token ID12
Feature activation+9.39e-01

Pos logit contributions
asar+9.460
mess+9.292
+8.977
 SERV+8.933
arius+8.758
Neg logit contributions
ggles-17.306
roxy-16.007
undo-15.983
geoning-15.823
pees-15.323
Tokenrights
Token ID28046
Feature activation+7.12e-01

Pos logit contributions
mess+8.443
asar+8.346
arius+8.279
+8.055
 SERV+7.451
Neg logit contributions
ggles-6.056
geoning-5.456
 toggle-5.232
 Oprah-5.227
roxy-5.045
Token claims
Token ID3667
Feature activation+5.53e-01

Pos logit contributions
arius+4.939
asar+4.938
mess+4.901
+4.856
�醒+4.707
Neg logit contributions
ggles-6.107
geoning-5.464
roxy-5.314
undo-5.292
pees-5.235
Token that
Token ID326
Feature activation+6.35e-01

Pos logit contributions
arius+4.170
asar+4.050
+3.979
mess+3.960
�醒+3.959
Neg logit contributions
ggles-4.657
geoning-4.185
roxy-4.026
pees-3.967
undo-3.951
Token have
Token ID423
Feature activation+5.36e-01

Pos logit contributions
arius+5.121
asar+4.976
+4.929
�醒+4.894
mess+4.876
Neg logit contributions
ggles-4.932
geoning-4.380
roxy-4.247
undo-4.145
pees-4.114
Token the
Token ID262
Feature activation+6.45e-02

Pos logit contributions
arius+2.334
asar+2.279
�醒+2.256
+2.254
mess+2.224
Neg logit contributions
ggles-2.203
geoning-1.969
roxy-1.894
pees-1.892
 Oprah-1.881
Token EPA
Token ID10193
Feature activation-1.96e-01

Pos logit contributions
arius+0.563
�醒+0.557
asar+0.550
+0.548
mess+0.535
Neg logit contributions
ggles-0.473
geoning-0.420
 Oprah-0.404
roxy-0.401
pees-0.400
Token
Token ID447
Feature activation+4.94e-01

Pos logit contributions
ggles+1.458
geoning+1.303
undo+1.248
 Oprah+1.247
roxy+1.246
Neg logit contributions
arius-1.669
�醒-1.651
asar-1.634
-1.613
mess-1.585
Token
Token ID247
Feature activation-4.11e-01

Pos logit contributions
arius+1.601
�醒+1.521
+1.507
asar+1.493
mess+1.386
Neg logit contributions
ggles-6.293
geoning-5.899
 Oprah-5.756
pees-5.749
roxy-5.733
Tokens
Token ID82
Feature activation+4.52e-01

Pos logit contributions
ggles+2.336
geoning+1.964
undo+1.929
pees+1.880
/_+1.870
Neg logit contributions
�醒-4.342
arius-4.166
asar-4.065
-4.042
 SERV-3.980
Token civil
Token ID3026
Feature activation+2.06e+00

Pos logit contributions
arius+3.645
asar+3.588
+3.521
mess+3.494
�醒+3.474
Neg logit contributions
ggles-3.533
geoning-3.146
pees-3.062
undo-3.022
 Oprah-3.022
Token-
Token ID12
Feature activation+8.44e-01

Pos logit contributions
asar+9.777
arius+9.321
mess+9.319
 SERV+8.670
+8.535
Neg logit contributions
ggles-17.631
geoning-16.523
undo-16.182
pees-15.885
roxy-15.710
Tokenrights
Token ID28046
Feature activation+5.25e-01

Pos logit contributions
mess+7.628
asar+7.610
arius+7.579
+7.307
 SERV+6.848
Neg logit contributions
ggles-5.466
geoning-4.895
 toggle-4.700
 Oprah-4.700
pees-4.508
Token office
Token ID2607
Feature activation-3.28e-02

Pos logit contributions
arius+4.131
asar+4.108
mess+4.010
+4.005
�醒+3.938
Neg logit contributions
ggles-4.155
geoning-3.693
pees-3.591
undo-3.525
 Oprah-3.524
Token:
Token ID25
Feature activation-3.28e-01

Pos logit contributions
ggles+0.253
geoning+0.225
 Oprah+0.217
pees+0.216
roxy+0.216
Neg logit contributions
arius-0.273
�醒-0.268
asar-0.267
-0.265
mess-0.260
Token develop
Token ID1205
Feature activation-3.20e-01

Pos logit contributions
ggles+2.376
geoning+2.116
 Oprah+2.024
 toggle+2.010
undo+2.010
Neg logit contributions
arius-2.857
�醒-2.801
asar-2.785
-2.769
mess-2.711
Token
Token ID198
Feature activation-8.31e-02

Pos logit contributions
arius+4.163
asar+4.065
mess+3.966
+3.940
 SERV+3.665
Neg logit contributions
ggles-5.749
geoning-5.245
pees-5.120
roxy-5.082
undo-4.973
Token'
Token ID6
Feature activation+1.68e-01

Pos logit contributions
ggles+0.603
geoning+0.534
undo+0.514
 Oprah+0.512
pees+0.511
Neg logit contributions
arius-0.727
�醒-0.721
asar-0.708
-0.706
mess-0.689
TokenThe
Token ID464
Feature activation-6.80e-02

Pos logit contributions
arius+1.423
asar+1.394
�醒+1.392
+1.387
mess+1.361
Neg logit contributions
ggles-1.275
geoning-1.137
 Oprah-1.090
roxy-1.089
pees-1.088
Token Guard
Token ID4932
Feature activation+1.57e-01

Pos logit contributions
ggles+0.482
geoning+0.424
 Oprah+0.407
undo+0.406
pees+0.404
Neg logit contributions
arius-0.609
�醒-0.603
asar-0.594
-0.593
mess-0.577
Tokenia
Token ID544
Feature activation+1.29e-01

Pos logit contributions
arius+1.062
�醒+1.032
asar+1.031
+1.021
mess+0.995
Neg logit contributions
ggles-1.468
geoning-1.337
 Oprah-1.294
pees-1.290
roxy-1.290
Token Civil
Token ID7511
Feature activation+2.05e+00

Pos logit contributions
arius+0.854
asar+0.851
�醒+0.826
+0.825
mess+0.811
Neg logit contributions
ggles-1.214
geoning-1.116
roxy-1.074
pees-1.072
 Oprah-1.067
Token (
Token ID357
Feature activation+2.41e-01

Pos logit contributions
asar+14.038
arius+13.966
mess+13.443
 SERV+13.110
�醒+12.741
Neg logit contributions
ggles-13.053
geoning-12.438
pees-11.091
 Oprah-10.938
roxy-10.676
Tokenpolice
Token ID38191
Feature activation-4.59e-01

Pos logit contributions
arius+1.985
asar+1.958
+1.935
mess+1.914
�醒+1.909
Neg logit contributions
ggles-1.870
geoning-1.669
 Oprah-1.644
 toggle-1.602
pees-1.596
Token force
Token ID2700
Feature activation+2.03e-01

Pos logit contributions
ggles+2.996
geoning+2.656
undo+2.570
roxy+2.531
 toggle+2.527
Neg logit contributions
�醒-4.283
arius-4.215
-4.144
asar-4.081
mess-4.025
Token)
Token ID8
Feature activation+1.28e-01

Pos logit contributions
arius+1.709
asar+1.673
+1.638
�醒+1.626
mess+1.609
Neg logit contributions
ggles-1.563
geoning-1.369
 Oprah-1.329
pees-1.316
roxy-1.309
Token seized
Token ID12000
Feature activation-3.36e-01

Pos logit contributions
arius+1.086
asar+1.063
�醒+1.060
+1.053
mess+1.034
Neg logit contributions
ggles-0.972
geoning-0.867
 Oprah-0.830
pees-0.829
roxy-0.828
Token was
Token ID373
Feature activation+3.67e-01

Pos logit contributions
arius+2.725
asar+2.651
�醒+2.631
+2.627
mess+2.572
Neg logit contributions
ggles-3.402
geoning-3.098
roxy-2.988
 Oprah-2.984
pees-2.973
Token less
Token ID1342
Feature activation-1.50e-01

Pos logit contributions
arius+3.241
asar+3.167
�醒+3.108
+3.108
mess+3.077
Neg logit contributions
ggles-2.639
geoning-2.320
roxy-2.233
pees-2.214
undo-2.212
Token surprised
Token ID6655
Feature activation-2.29e-01

Pos logit contributions
ggles+0.966
geoning+0.844
 Oprah+0.816
pees+0.802
undo+0.796
Neg logit contributions
arius-1.433
�醒-1.429
-1.411
asar-1.403
mess-1.366
Token by
Token ID416
Feature activation+1.05e-01

Pos logit contributions
ggles+1.718
geoning+1.545
 Oprah+1.499
undo+1.473
pees+1.467
Neg logit contributions
arius-1.931
�醒-1.900
-1.877
asar-1.875
mess-1.839
Token the
Token ID262
Feature activation+4.15e-01

Pos logit contributions
arius+0.835
asar+0.818
�醒+0.817
+0.810
mess+0.792
Neg logit contributions
ggles-0.849
geoning-0.761
pees-0.733
roxy-0.730
undo-0.729
Token long
Token ID890
Feature activation+2.05e+00

Pos logit contributions
arius+3.363
asar+3.315
+3.273
�醒+3.268
mess+3.236
Neg logit contributions
ggles-3.227
geoning-2.891
pees-2.787
roxy-2.760
 Oprah-2.746
Token incub
Token ID25543
Feature activation+7.33e-01

Pos logit contributions
mess+10.993
asar+10.768
arius+10.672
 SERV+10.486
�醒+10.445
Neg logit contributions
ggles-16.042
geoning-14.300
pees-14.029
roxy-13.767
League-13.638
Tokenation
Token ID341
Feature activation+9.12e-01

Pos logit contributions
arius+4.145
asar+3.960
mess+3.819
+3.718
�醒+3.710
Neg logit contributions
ggles-7.131
geoning-6.642
 Oprah-6.591
roxy-6.528
pees-6.500
Token times
Token ID1661
Feature activation+8.24e-01

Pos logit contributions
asar+5.949
arius+5.926
mess+5.742
+5.665
�醒+5.526
Neg logit contributions
ggles-8.101
geoning-7.294
pees-7.103
roxy-6.980
undo-6.945
Token,
Token ID11
Feature activation+2.65e-01

Pos logit contributions
arius+5.968
asar+5.857
+5.657
�醒+5.654
mess+5.643
Neg logit contributions
ggles-6.970
geoning-6.226
roxy-5.952
pees-5.928
undo-5.816
Token noting
Token ID10820
Feature activation+7.11e-03

Pos logit contributions
arius+2.167
asar+2.123
�醒+2.113
+2.098
mess+2.059
Neg logit contributions
ggles-2.085
geoning-1.860
roxy-1.790
 Oprah-1.785
pees-1.781
Token protect
Token ID1805
Feature activation+1.06e+00

Pos logit contributions
arius+2.518
asar+2.472
+2.434
�醒+2.431
mess+2.405
Neg logit contributions
ggles-2.409
geoning-2.187
roxy-2.116
pees-2.097
 Oprah-2.071
Token Montana
Token ID18311
Feature activation+6.15e-01

Pos logit contributions
arius+7.719
asar+7.570
+7.439
mess+7.436
�醒+7.430
Neg logit contributions
ggles-8.164
geoning-7.528
roxy-7.435
pees-7.317
undo-7.059
Token
Token ID447
Feature activation+4.79e-01

Pos logit contributions
arius+5.017
+4.897
asar+4.894
�醒+4.865
mess+4.766
Neg logit contributions
ggles-4.730
geoning-4.167
roxy-4.132
 Oprah-4.060
pees-4.017
Token
Token ID247
Feature activation+3.42e-01

Pos logit contributions
�醒+2.485
arius+2.433
+2.407
asar+2.326
mess+2.223
Neg logit contributions
ggles-5.077
geoning-4.789
 Oprah-4.585
roxy-4.579
pees-4.549
Tokens
Token ID82
Feature activation+8.28e-01

Pos logit contributions
arius+3.002
asar+2.948
+2.943
�醒+2.934
mess+2.890
Neg logit contributions
ggles-2.430
geoning-2.163
roxy-2.089
 Oprah-2.079
 toggle-2.022
Token public
Token ID1171
Feature activation+2.01e+00

Pos logit contributions
arius+6.860
asar+6.721
+6.642
mess+6.590
�醒+6.469
Neg logit contributions
ggles-6.023
geoning-5.271
pees-5.172
roxy-5.167
 Oprah-5.077
Token lands
Token ID8604
Feature activation+9.44e-01

Pos logit contributions
mess+12.169
 SERV+12.027
asar+11.988
arius+11.558
+11.213
Neg logit contributions
ggles-15.032
roxy-13.639
pees-13.176
undo-13.077
geoning-13.010
Token,
Token ID11
Feature activation-1.99e-01

Pos logit contributions
arius+6.170
+6.059
asar+5.993
�醒+5.901
mess+5.811
Neg logit contributions
ggles-8.452
geoning-7.408
roxy-7.292
 Oprah-7.228
pees-7.115
Token fight
Token ID1907
Feature activation+1.57e-01

Pos logit contributions
ggles+1.534
geoning+1.368
 Oprah+1.339
pees+1.323
undo+1.322
Neg logit contributions
arius-1.644
�醒-1.624
asar-1.598
-1.577
mess-1.555
Token for
Token ID329
Feature activation+1.89e-01

Pos logit contributions
arius+1.290
asar+1.264
�醒+1.258
+1.248
mess+1.225
Neg logit contributions
ggles-1.232
geoning-1.103
pees-1.062
roxy-1.062
undo-1.053
Token affordable
Token ID10935
Feature activation+4.27e-01

Pos logit contributions
arius+1.630
asar+1.597
�醒+1.595
+1.583
mess+1.555
Neg logit contributions
ggles-1.405
geoning-1.245
roxy-1.200
pees-1.197
undo-1.187
Token last
Token ID938
Feature activation+7.16e-01

Pos logit contributions
arius+1.547
�醒+1.506
asar+1.496
+1.477
mess+1.451
Neg logit contributions
ggles-2.650
geoning-2.442
pees-2.366
roxy-2.359
 Oprah-2.354
Token time
Token ID640
Feature activation+3.80e-01

Pos logit contributions
arius+5.381
�醒+5.251
asar+5.206
+5.148
mess+5.122
Neg logit contributions
ggles-5.790
geoning-5.361
pees-5.224
roxy-5.185
undo-4.922
Token Gilbert
Token ID24023
Feature activation+5.96e-01

Pos logit contributions
arius+3.025
asar+2.960
�醒+2.946
+2.926
mess+2.905
Neg logit contributions
ggles-3.035
geoning-2.721
pees-2.626
roxy-2.618
undo-2.587
Token and
Token ID290
Feature activation+6.06e-01

Pos logit contributions
arius+4.536
asar+4.410
+4.369
mess+4.341
�醒+4.288
Neg logit contributions
ggles-4.958
geoning-4.463
roxy-4.318
pees-4.259
undo-4.194
Token the
Token ID262
Feature activation+7.26e-01

Pos logit contributions
asar+4.130
arius+4.079
�醒+4.001
+3.959
mess+3.957
Neg logit contributions
ggles-5.219
geoning-4.788
pees-4.717
roxy-4.625
undo-4.575
Token civil
Token ID3026
Feature activation+1.98e+00

Pos logit contributions
asar+4.204
�醒+4.060
arius+4.058
+4.020
mess+4.017
Neg logit contributions
ggles-7.016
geoning-6.511
pees-6.360
roxy-6.282
undo-6.217
Token rights
Token ID2489
Feature activation+1.07e+00

Pos logit contributions
asar+10.199
mess+9.498
 SERV+9.271
 messenger+8.989
arius+8.988
Neg logit contributions
ggles-16.427
pees-15.035
undo-14.689
geoning-14.672
roxy-14.497
Token group
Token ID1448
Feature activation+1.00e+00

Pos logit contributions
asar+7.609
mess+7.509
+7.455
arius+7.405
�醒+7.217
Neg logit contributions
ggles-8.168
geoning-7.404
pees-7.240
roxy-7.147
undo-7.106
Token were
Token ID547
Feature activation+9.08e-01

Pos logit contributions
arius+7.679
mess+7.505
asar+7.422
 SERV+7.307
+7.300
Neg logit contributions
ggles-7.587
geoning-6.891
roxy-6.658
pees-6.541
undo-6.348
Token on
Token ID319
Feature activation+1.68e-01

Pos logit contributions
arius+7.326
asar+7.108
mess+6.998
+6.844
 SERV+6.783
Neg logit contributions
ggles-6.765
geoning-6.016
roxy-5.851
undo-5.693
pees-5.636
Token the
Token ID262
Feature activation+1.24e-01

Pos logit contributions
arius+1.369
asar+1.344
�醒+1.338
+1.324
mess+1.311
Neg logit contributions
ggles-1.308
geoning-1.178
pees-1.131
roxy-1.127
undo-1.122
Tokenbell
Token ID7923
Feature activation+1.25e+00

Pos logit contributions
arius+5.243
asar+5.138
mess+5.005
+5.000
�醒+4.909
Neg logit contributions
ggles-5.319
geoning-4.941
roxy-4.708
undo-4.636
 Oprah-4.628
Token rang
Token ID28077
Feature activation+2.54e-01

Pos logit contributions
arius+8.256
asar+8.001
mess+7.917
+7.680
 SERV+7.508
Neg logit contributions
ggles-9.736
geoning-9.521
roxy-9.122
pees-9.103
undo-9.053
Token while
Token ID981
Feature activation+3.38e-01

Pos logit contributions
arius+1.986
asar+1.934
�醒+1.925
+1.909
mess+1.883
Neg logit contributions
ggles-2.082
geoning-1.873
pees-1.808
roxy-1.804
undo-1.798
Token I
Token ID314
Feature activation+6.64e-01

Pos logit contributions
arius+2.836
asar+2.765
�醒+2.748
+2.735
mess+2.728
Neg logit contributions
ggles-2.556
geoning-2.292
pees-2.187
roxy-2.175
undo-2.154
Token was
Token ID373
Feature activation+4.56e-01

Pos logit contributions
arius+4.577
mess+4.303
asar+4.246
+4.178
 SERV+4.115
Neg logit contributions
ggles-5.797
geoning-5.592
pees-5.228
roxy-5.185
undo-5.092
Token preparing
Token ID10629
Feature activation+1.95e+00

Pos logit contributions
arius+3.881
asar+3.733
�醒+3.709
mess+3.697
+3.658
Neg logit contributions
ggles-3.410
geoning-3.043
roxy-2.890
pees-2.877
undo-2.817
Token our
Token ID674
Feature activation+1.22e+00

Pos logit contributions
mess+12.208
 SERV+12.046
arius+11.756
�醒+11.256
asar+11.109
Neg logit contributions
ggles-13.893
geoning-12.835
undo-12.127
pees-12.038
laim-11.853
Token dinner
Token ID8073
Feature activation+7.77e-01

Pos logit contributions
arius+7.823
mess+7.437
asar+7.265
 SERV+7.253
�醒+7.245
Neg logit contributions
ggles-10.397
pees-9.455
geoning-9.445
League-9.089
roxy-9.070
Token.
Token ID13
Feature activation+4.36e-02

Pos logit contributions
arius+5.212
asar+5.034
mess+5.019
�醒+5.002
+4.897
Neg logit contributions
ggles-6.879
geoning-6.213
roxy-6.065
pees-6.044
undo-5.931
Token "
Token ID366
Feature activation+3.82e-01

Pos logit contributions
arius+0.394
�醒+0.388
asar+0.386
+0.384
mess+0.376
Neg logit contributions
ggles-0.306
geoning-0.269
 Oprah-0.260
pees-0.257
roxy-0.256
TokenI
Token ID40
Feature activation+1.97e-01

Pos logit contributions
arius+3.182
+3.114
asar+3.105
mess+3.079
�醒+3.052
Neg logit contributions
ggles-2.940
geoning-2.628
pees-2.489
roxy-2.488
 Oprah-2.474
Token that
Token ID326
Feature activation+6.47e-01

Pos logit contributions
arius+0.595
�醒+0.594
asar+0.584
+0.582
mess+0.568
Neg logit contributions
ggles-0.489
geoning-0.427
 Oprah-0.416
roxy-0.411
 toggle-0.408
Token may
Token ID743
Feature activation+6.94e-01

Pos logit contributions
arius+5.324
asar+5.169
+5.054
mess+5.018
�醒+4.968
Neg logit contributions
ggles-4.798
geoning-4.475
pees-4.283
undo-4.185
roxy-4.180
Token mean
Token ID1612
Feature activation+1.13e+00

Pos logit contributions
arius+4.619
asar+4.576
mess+4.494
+4.382
�醒+4.284
Neg logit contributions
ggles-5.971
geoning-5.642
pees-5.515
 Oprah-5.386
roxy-5.358
Token they
Token ID484
Feature activation+1.06e+00

Pos logit contributions
arius+8.422
asar+8.069
mess+8.019
+7.859
 SERV+7.801
Neg logit contributions
ggles-8.339
geoning-7.936
pees-7.761
roxy-7.409
undo-7.400
Token will
Token ID481
Feature activation+1.43e+00

Pos logit contributions
arius+7.486
asar+7.248
mess+7.139
+7.026
 SERV+6.918
Neg logit contributions
ggles-8.225
geoning-7.842
pees-7.472
roxy-7.464
orney-7.213
Token only
Token ID691
Feature activation+1.95e+00

Pos logit contributions
arius+9.562
mess+9.268
asar+9.212
+8.897
 SERV+8.835
Neg logit contributions
ggles-10.135
geoning-10.034
pees-9.663
roxy-9.525
binding-9.286
Token carry
Token ID3283
Feature activation+1.52e+00

Pos logit contributions
arius+11.543
mess+11.337
asar+11.114
 SERV+11.006
+10.688
Neg logit contributions
geoning-13.231
ggles-12.883
roxy-12.331
orney-12.320
pees-12.290
Token the
Token ID262
Feature activation+9.88e-01

Pos logit contributions
arius+10.048
asar+9.411
 SERV+9.392
mess+9.283
+9.270
Neg logit contributions
ggles-11.112
geoning-10.784
undo-10.602
orney-10.469
pees-10.256
Token 32
Token ID3933
Feature activation+1.14e-01

Pos logit contributions
arius+7.344
asar+7.134
+6.963
mess+6.865
 SERV+6.813
Neg logit contributions
ggles-7.559
geoning-7.042
pees-6.852
undo-6.821
orney-6.763
TokenGB
Token ID4579
Feature activation+7.38e-02

Pos logit contributions
arius+0.717
asar+0.697
+0.691
�醒+0.688
mess+0.675
Neg logit contributions
ggles-1.099
geoning-1.009
 Oprah-0.981
pees-0.978
roxy-0.977
Token model
Token ID2746
Feature activation+4.71e-01

Pos logit contributions
arius+0.647
�醒+0.638
asar+0.632
+0.630
mess+0.616
Neg logit contributions
ggles-0.537
geoning-0.476
 Oprah-0.456
roxy-0.454
pees-0.454
Token the
Token ID262
Feature activation-2.44e-02

Pos logit contributions
arius+0.390
�醒+0.386
asar+0.382
+0.379
mess+0.371
Neg logit contributions
ggles-0.354
geoning-0.316
 Oprah-0.305
pees-0.303
roxy-0.302
Token EPA
Token ID10193
Feature activation-5.08e-01

Pos logit contributions
ggles+0.180
geoning+0.159
 Oprah+0.154
 toggle+0.152
pees+0.152
Neg logit contributions
arius-0.213
�醒-0.212
asar-0.208
-0.207
mess-0.202
Token
Token ID447
Feature activation+3.68e-01

Pos logit contributions
ggles+3.359
geoning+2.946
 voluntarily+2.891
 toggle+2.844
pees+2.836
Neg logit contributions
�醒-4.628
arius-4.609
asar-4.528
-4.466
mess-4.412
Token
Token ID247
Feature activation-5.35e-01

Pos logit contributions
arius+1.310
�醒+1.254
asar+1.240
+1.222
mess+1.160
Neg logit contributions
ggles-4.601
geoning-4.293
 Oprah-4.194
pees-4.188
roxy-4.183
Tokens
Token ID82
Feature activation+3.05e-02

Pos logit contributions
ggles+2.933
undo+2.430
geoning+2.417
roxy+2.371
pees+2.355
Neg logit contributions
�醒-5.726
arius-5.466
asar-5.337
-5.312
 SERV-5.195
Token civil
Token ID3026
Feature activation+1.94e+00

Pos logit contributions
arius+0.243
�醒+0.241
asar+0.237
+0.236
mess+0.230
Neg logit contributions
ggles-0.247
geoning-0.221
 Oprah-0.214
roxy-0.213
pees-0.212
Token-
Token ID12
Feature activation+6.95e-01

Pos logit contributions
asar+9.256
mess+8.976
arius+8.945
 SERV+8.839
+8.456
Neg logit contributions
ggles-15.906
undo-14.450
geoning-14.341
pees-14.277
roxy-14.010
Tokenrights
Token ID28046
Feature activation+4.33e-01

Pos logit contributions
arius+6.477
asar+6.471
mess+6.460
+6.281
�醒+5.984
Neg logit contributions
ggles-4.402
geoning-3.901
 Oprah-3.728
 toggle-3.713
pees-3.599
Token office
Token ID2607
Feature activation-7.90e-02

Pos logit contributions
arius+3.538
asar+3.489
+3.441
mess+3.429
�醒+3.394
Neg logit contributions
ggles-3.329
geoning-2.964
pees-2.861
undo-2.817
 Oprah-2.813
Token will
Token ID481
Feature activation+3.31e-02

Pos logit contributions
ggles+0.568
geoning+0.503
 Oprah+0.483
pees+0.482
roxy+0.481
Neg logit contributions
arius-0.696
�醒-0.689
asar-0.683
-0.679
mess-0.665
Token boost
Token ID5750
Feature activation-2.39e-01

Pos logit contributions
arius+0.285
�醒+0.280
asar+0.279
+0.278
mess+0.272
Neg logit contributions
ggles-0.245
geoning-0.218
 Oprah-0.209
pees-0.208
roxy-0.208
Token king
Token ID5822
Feature activation-9.09e-02

Pos logit contributions
arius+5.033
asar+4.998
+4.864
�醒+4.831
mess+4.814
Neg logit contributions
ggles-5.635
geoning-5.053
pees-4.935
roxy-4.863
undo-4.811
Token had
Token ID550
Feature activation+1.67e-01

Pos logit contributions
ggles+0.725
geoning+0.642
 Oprah+0.636
 voluntarily+0.635
 toggle+0.629
Neg logit contributions
�醒-0.733
arius-0.719
-0.710
asar-0.705
mess-0.688
Token ordered
Token ID6149
Feature activation+1.17e+00

Pos logit contributions
arius+1.372
�醒+1.352
asar+1.340
+1.334
mess+1.304
Neg logit contributions
ggles-1.314
geoning-1.171
 Oprah-1.132
pees-1.125
roxy-1.123
Token the
Token ID262
Feature activation+9.88e-01

Pos logit contributions
arius+8.086
asar+7.682
mess+7.551
+7.536
 SERV+7.436
Neg logit contributions
ggles-9.666
geoning-9.127
pees-8.530
roxy-8.463
undo-8.357
Token train
Token ID4512
Feature activation+1.27e+00

Pos logit contributions
arius+7.578
asar+7.521
+7.264
mess+7.168
 SERV+7.095
Neg logit contributions
ggles-7.672
geoning-6.938
pees-6.825
roxy-6.649
undo-6.484
Token traffic
Token ID4979
Feature activation+1.91e+00

Pos logit contributions
arius+9.230
asar+8.964
mess+8.579
+8.553
�醒+8.454
Neg logit contributions
ggles-9.941
geoning-8.972
pees-8.683
roxy-8.621
 Oprah-8.306
Token to
Token ID284
Feature activation+1.30e+00

Pos logit contributions
arius+12.903
asar+12.435
mess+12.238
+12.000
 messenger+11.395
Neg logit contributions
ggles-13.391
pees-12.022
geoning-11.819
roxy-11.593
League-11.546
Token continue
Token ID2555
Feature activation+1.68e+00

Pos logit contributions
arius+9.340
asar+9.119
+8.850
mess+8.639
�醒+8.369
Neg logit contributions
geoning-9.208
ggles-9.164
pees-8.718
roxy-8.624
binding-8.511
Token,
Token ID11
Feature activation+8.17e-01

Pos logit contributions
arius+10.706
asar+10.504
mess+9.923
 SERV+9.680
 LINE+9.513
Neg logit contributions
ggles-13.463
geoning-12.129
roxy-11.876
pees-11.787
undo-11.181
Token fearing
Token ID33188
Feature activation+1.28e+00

Pos logit contributions
arius+5.847
asar+5.806
+5.594
mess+5.561
�醒+5.457
Neg logit contributions
ggles-6.946
geoning-6.274
roxy-6.089
pees-5.941
 Oprah-5.841
Token that
Token ID326
Feature activation+1.45e+00

Pos logit contributions
arius+8.252
asar+8.142
mess+8.011
+7.701
 SERV+7.595
Neg logit contributions
ggles-10.947
geoning-9.986
pees-9.855
roxy-9.586
undo-9.293
Token Since
Token ID4619
Feature activation+2.75e-01

Pos logit contributions
arius+4.359
+4.342
�醒+4.243
mess+4.232
asar+4.214
Neg logit contributions
ggles-5.242
geoning-4.759
roxy-4.657
pees-4.532
undo-4.504
Token you
Token ID345
Feature activation+3.57e-01

Pos logit contributions
arius+2.625
�醒+2.570
asar+2.570
+2.556
mess+2.517
Neg logit contributions
ggles-1.782
geoning-1.555
roxy-1.474
pees-1.474
 Oprah-1.465
Token
Token ID447
Feature activation+3.40e-02

Pos logit contributions
arius+2.907
asar+2.829
+2.812
�醒+2.811
mess+2.772
Neg logit contributions
ggles-2.817
geoning-2.532
roxy-2.427
 Oprah-2.414
pees-2.412
Token
Token ID247
Feature activation-5.09e-02

Pos logit contributions
arius+0.135
asar+0.131
�醒+0.130
+0.124
mess+0.122
Neg logit contributions
ggles-0.414
geoning-0.380
 Oprah-0.377
undo-0.376
pees-0.374
Tokenre
Token ID260
Feature activation+6.56e-01

Pos logit contributions
ggles+0.245
geoning+0.202
 Oprah+0.189
pees+0.188
undo+0.187
Neg logit contributions
arius-0.572
�醒-0.566
asar-0.562
-0.560
mess-0.549
Token being
Token ID852
Feature activation+1.88e+00

Pos logit contributions
arius+5.316
asar+5.113
�醒+5.108
+5.054
mess+5.010
Neg logit contributions
ggles-5.130
geoning-4.556
roxy-4.376
undo-4.341
pees-4.335
Token kicked
Token ID12165
Feature activation+1.50e-01

Pos logit contributions
arius+14.021
asar+13.248
 SERV+12.843
mess+12.775
�醒+12.622
Neg logit contributions
ggles-12.774
geoning-11.720
roxy-10.924
undo-10.724
pees-10.671
Token out
Token ID503
Feature activation+3.68e-01

Pos logit contributions
arius+1.301
�醒+1.276
asar+1.273
+1.265
mess+1.241
Neg logit contributions
ggles-1.115
geoning-0.988
roxy-0.945
 Oprah-0.944
pees-0.944
Token,
Token ID11
Feature activation+2.67e-01

Pos logit contributions
arius+2.708
asar+2.649
+2.631
�醒+2.628
mess+2.586
Neg logit contributions
ggles-3.181
geoning-2.863
roxy-2.778
pees-2.758
undo-2.738
Token I
Token ID314
Feature activation+6.28e-01

Pos logit contributions
arius+2.360
asar+2.310
�醒+2.302
+2.299
mess+2.258
Neg logit contributions
ggles-1.915
geoning-1.704
roxy-1.622
pees-1.615
 Oprah-1.612
Token
Token ID447
Feature activation-1.15e-02

Pos logit contributions
arius+5.186
+5.035
asar+5.016
mess+4.962
�醒+4.888
Neg logit contributions
ggles-4.684
geoning-4.308
roxy-4.090
 Oprah-4.050
undo-4.012
Token sure
Token ID1654
Feature activation+4.36e-01

Pos logit contributions
arius+2.627
asar+2.544
�醒+2.505
+2.489
mess+2.475
Neg logit contributions
ggles-2.999
geoning-2.707
roxy-2.594
pees-2.586
undo-2.584
Token the
Token ID262
Feature activation+2.77e-01

Pos logit contributions
arius+3.488
asar+3.384
+3.357
�醒+3.286
mess+3.278
Neg logit contributions
ggles-3.482
geoning-3.123
pees-2.994
roxy-2.980
undo-2.978
Token lines
Token ID3951
Feature activation+1.05e+00

Pos logit contributions
arius+2.335
asar+2.285
�醒+2.282
+2.268
mess+2.225
Neg logit contributions
ggles-2.095
geoning-1.870
pees-1.787
roxy-1.787
 Oprah-1.785
Token will
Token ID481
Feature activation+1.56e+00

Pos logit contributions
arius+7.566
mess+7.250
+7.142
asar+7.109
 SERV+6.806
Neg logit contributions
ggles-8.538
geoning-7.848
roxy-7.736
pees-7.396
orney-7.376
Token be
Token ID307
Feature activation+8.15e-01

Pos logit contributions
arius+9.958
mess+9.309
+9.161
asar+9.131
 LINE+8.790
Neg logit contributions
ggles-11.779
pees-11.200
geoning-11.177
roxy-11.103
League-11.004
Token just
Token ID655
Feature activation+1.87e+00

Pos logit contributions
arius+6.045
asar+5.845
mess+5.755
+5.677
 SERV+5.606
Neg logit contributions
ggles-6.671
geoning-5.937
roxy-5.847
pees-5.731
undo-5.710
Token as
Token ID355
Feature activation+6.73e-01

Pos logit contributions
arius+10.928
mess+10.838
asar+10.788
 SERV+10.277
+9.964
Neg logit contributions
ggles-14.363
undo-13.056
orney-12.714
roxy-12.662
geoning-12.519
Token long
Token ID890
Feature activation+9.41e-01

Pos logit contributions
arius+4.307
asar+4.114
mess+4.082
+4.053
�醒+4.028
Neg logit contributions
ggles-6.287
geoning-5.685
roxy-5.610
pees-5.499
undo-5.466
Token.
Token ID13
Feature activation+6.01e-01

Pos logit contributions
arius+6.471
asar+6.265
mess+6.236
+6.181
 SERV+5.879
Neg logit contributions
ggles-8.081
geoning-7.064
roxy-7.007
undo-6.862
pees-6.847
Token Knowing
Token ID29154
Feature activation+1.04e+00

Pos logit contributions
arius+4.717
asar+4.555
+4.529
�醒+4.468
mess+4.447
Neg logit contributions
ggles-4.810
geoning-4.380
roxy-4.216
pees-4.198
undo-4.163
Token what
Token ID644
Feature activation+3.16e-01

Pos logit contributions
arius+7.184
asar+6.950
+6.748
mess+6.697
 SERV+6.450
Neg logit contributions
ggles-8.493
geoning-8.142
roxy-7.694
undo-7.670
pees-7.487
Token in
Token ID287
Feature activation+9.18e-01

Pos logit contributions
arius+0.012
�醒+0.011
asar+0.011
+0.011
mess+0.011
Neg logit contributions
ggles-0.009
geoning-0.008
 Oprah-0.008
pees-0.008
undo-0.008
Token their
Token ID511
Feature activation+9.79e-01

Pos logit contributions
arius+7.148
asar+6.953
mess+6.757
+6.727
�醒+6.722
Neg logit contributions
ggles-6.864
geoning-6.418
roxy-6.191
pees-6.001
orney-5.885
Token climbing
Token ID14281
Feature activation+6.95e-01

Pos logit contributions
arius+7.556
asar+7.273
mess+7.157
+7.008
�醒+6.965
Neg logit contributions
ggles-7.329
geoning-6.762
roxy-6.552
pees-6.478
embed-6.397
Token trying
Token ID2111
Feature activation+9.55e-01

Pos logit contributions
arius+5.529
asar+5.418
�醒+5.274
+5.269
mess+5.228
Neg logit contributions
ggles-5.398
geoning-4.808
roxy-4.738
undo-4.666
 Oprah-4.665
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
�醒+6.616
arius+6.536
mess+6.387
asar+6.384
+6.247
Neg logit contributions
ggles-7.549
geoning-6.838
roxy-6.831
undo-6.753
pees-6.747
Token steal
Token ID8711
Feature activation+1.85e+00

Pos logit contributions
asar+9.495
arius+9.381
mess+9.113
+8.662
 Andromeda+8.600
Neg logit contributions
ggles-8.513
geoning-8.421
roxy-8.198
pees-7.895
orney-7.816
Token some
Token ID617
Feature activation+1.18e+00

Pos logit contributions
arius+11.457
asar+10.938
 Dickinson+10.489
 Andromeda+10.445
mess+10.262
Neg logit contributions
ggles-13.690
geoning-13.547
embed-12.751
roxy-12.726
orney-12.486
Token time
Token ID640
Feature activation+7.93e-01

Pos logit contributions
arius+9.074
asar+8.630
mess+8.460
+8.331
 Andromeda+8.017
Neg logit contributions
ggles-8.875
geoning-8.201
roxy-7.929
pees-7.727
orney-7.595
Token before
Token ID878
Feature activation+1.00e+00

Pos logit contributions
arius+5.955
asar+5.754
mess+5.597
+5.519
�醒+5.471
Neg logit contributions
ggles-6.644
geoning-5.925
roxy-5.781
pees-5.587
 Oprah-5.585
Token Will
Token ID2561
Feature activation+5.41e-01

Pos logit contributions
arius+7.335
asar+7.151
mess+7.053
+6.858
 SERV+6.758
Neg logit contributions
ggles-7.845
geoning-7.145
roxy-6.947
pees-6.905
orney-6.690
Tokenung
Token ID2150
Feature activation+1.47e-01

Pos logit contributions
arius+3.876
asar+3.868
+3.760
mess+3.688
�醒+3.681
Neg logit contributions
ggles-4.672
geoning-4.246
 Oprah-4.093
pees-4.071
roxy-4.071
Token O
Token ID440
Feature activation-5.46e-01

Pos logit contributions
ggles+1.735
geoning+1.522
 Oprah+1.488
undo+1.466
roxy+1.463
Neg logit contributions
arius-2.259
�醒-2.238
asar-2.208
-2.206
mess-2.148
TokenVD
Token ID8898
Feature activation+6.54e-02

Pos logit contributions
ggles+4.039
undo+3.562
geoning+3.478
edit+3.457
roxy+3.437
Neg logit contributions
�醒-4.841
arius-4.502
asar-4.479
-4.360
 SERV-4.272
Token is
Token ID318
Feature activation-6.04e-02

Pos logit contributions
arius+0.522
�醒+0.510
asar+0.508
+0.506
mess+0.497
Neg logit contributions
ggles-0.528
geoning-0.475
 Oprah-0.456
pees-0.455
roxy-0.455
Token breaking
Token ID7163
Feature activation-3.15e-02

Pos logit contributions
ggles+0.407
geoning+0.356
 Oprah+0.342
pees+0.339
undo+0.339
Neg logit contributions
arius-0.561
�醒-0.555
asar-0.549
-0.549
mess-0.536
Token the
Token ID262
Feature activation+1.98e-01

Pos logit contributions
ggles+0.232
geoning+0.205
 Oprah+0.198
pees+0.196
undo+0.196
Neg logit contributions
arius-0.273
�醒-0.270
asar-0.267
-0.267
mess-0.261
Token civil
Token ID3026
Feature activation+1.84e+00

Pos logit contributions
arius+1.408
asar+1.376
�醒+1.366
+1.364
mess+1.338
Neg logit contributions
ggles-1.762
geoning-1.597
pees-1.542
roxy-1.539
undo-1.531
Token society
Token ID3592
Feature activation+6.31e-01

Pos logit contributions
asar+11.195
mess+10.911
arius+10.610
 SERV+10.466
+10.212
Neg logit contributions
ggles-14.291
pees-13.162
undo-12.890
geoning-12.884
roxy-12.440
Token mould
Token ID35370
Feature activation-1.29e-01

Pos logit contributions
arius+4.958
asar+4.949
+4.830
mess+4.794
�醒+4.647
Neg logit contributions
ggles-4.974
geoning-4.414
pees-4.288
roxy-4.207
 Oprah-4.161
Token here
Token ID994
Feature activation+6.40e-02

Pos logit contributions
ggles+0.991
geoning+0.888
 Oprah+0.854
roxy+0.850
undo+0.849
Neg logit contributions
arius-1.067
�醒-1.054
asar-1.045
-1.038
mess-1.015
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
arius+0.511
�醒+0.500
asar+0.499
+0.497
mess+0.486
Neg logit contributions
ggles-0.518
geoning-0.462
pees-0.445
 Oprah-0.445
roxy-0.443
Token<|endoftext|>
Token ID50256
Feature activation+1.84e-02

Pos logit contributions
arius+2.368
�醒+2.291
asar+2.289
+2.263
mess+2.231
Neg logit contributions
ggles-2.985
geoning-2.712
pees-2.614
roxy-2.608
undo-2.571
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
arius+1.147
�醒+1.125
asar+1.120
+1.116
mess+1.092
Neg logit contributions
ggles-1.131
geoning-1.008
roxy-0.970
 Oprah-0.970
pees-0.969
Token
Token ID851
Feature activation+5.29e-01

Pos logit contributions
arius+2.851
�醒+2.792
+2.786
asar+2.784
mess+2.719
Neg logit contributions
ggles-2.838
geoning-2.556
roxy-2.466
pees-2.446
undo-2.422
Token ACLU
Token ID24381
Feature activation+3.79e-01

Pos logit contributions
�醒+3.110
arius+3.060
asar+3.037
+3.007
mess+2.903
Neg logit contributions
ggles-5.174
geoning-4.745
pees-4.641
roxy-4.623
undo-4.599
Token of
Token ID286
Feature activation-2.93e-01

Pos logit contributions
arius+2.752
asar+2.676
+2.660
�醒+2.655
mess+2.590
Neg logit contributions
ggles-3.333
geoning-3.015
pees-2.916
roxy-2.911
 Oprah-2.892
Token Oregon
Token ID8819
Feature activation+1.50e-01

Pos logit contributions
ggles+1.901
 Oprah+1.727
 toggle+1.629
geoning+1.621
 voluntarily+1.582
Neg logit contributions
�醒-2.842
arius-2.745
-2.685
asar-2.669
mess-2.616
Token (@
Token ID4275
Feature activation+1.84e+00

Pos logit contributions
arius+1.271
�醒+1.256
asar+1.245
+1.233
mess+1.210
Neg logit contributions
ggles-1.144
geoning-1.017
 Oprah-0.983
pees-0.974
undo-0.971
TokenAC
Token ID2246
Feature activation+6.41e-02

Pos logit contributions
+9.037
arius+8.705
asar+8.364
mess+7.911
ICES+7.401
Neg logit contributions
ggles-17.422
roxy-15.887
geoning-15.638
ewitness-15.196
 occas-15.100
TokenLU
Token ID41596
Feature activation+3.73e-01

Pos logit contributions
�醒+0.381
arius+0.377
asar+0.363
+0.358
mess+0.354
Neg logit contributions
ggles-0.652
geoning-0.594
 Oprah-0.587
undo-0.585
pees-0.579
Token_
Token ID62
Feature activation+5.34e-01

Pos logit contributions
arius+2.739
asar+2.652
+2.639
mess+2.583
�醒+2.566
Neg logit contributions
ggles-3.254
geoning-2.948
pees-2.842
roxy-2.834
 Oprah-2.821
TokenOR
Token ID1581
Feature activation+7.24e-01

Pos logit contributions
arius+3.597
+3.469
asar+3.396
mess+3.307
�醒+3.135
Neg logit contributions
ggles-5.025
geoning-4.641
pees-4.440
roxy-4.392
undo-4.321
Token)
Token ID8
Feature activation-5.31e-01

Pos logit contributions
arius+5.672
+5.557
asar+5.531
mess+5.395
 SERV+5.103
Neg logit contributions
ggles-5.793
geoning-5.220
pees-5.030
roxy-5.004
undo-4.925
Token
Token ID251
Feature activation-3.17e-01

Pos logit contributions
arius+1.556
�醒+1.536
+1.510
asar+1.494
mess+1.432
Neg logit contributions
ggles-2.913
geoning-2.693
 Oprah-2.606
pees-2.605
roxy-2.598
Token of
Token ID286
Feature activation-2.15e-01

Pos logit contributions
ggles+2.260
geoning+2.022
undo+1.961
 Oprah+1.949
pees+1.936
Neg logit contributions
arius-2.762
�醒-2.739
asar-2.703
-2.683
mess-2.617
Token looking
Token ID2045
Feature activation+1.52e-01

Pos logit contributions
ggles+1.412
geoning+1.229
 Oprah+1.197
 toggle+1.182
 voluntarily+1.178
Neg logit contributions
arius-2.025
�醒-2.021
-1.988
asar-1.987
mess-1.930
Token after
Token ID706
Feature activation+5.09e-01

Pos logit contributions
arius+1.270
�醒+1.245
asar+1.235
+1.234
mess+1.214
Neg logit contributions
ggles-1.166
geoning-1.035
undo-0.995
roxy-0.990
pees-0.985
Token the
Token ID262
Feature activation+6.77e-01

Pos logit contributions
arius+4.204
asar+4.112
+4.095
�醒+4.085
mess+4.079
Neg logit contributions
ggles-3.879
geoning-3.483
undo-3.302
roxy-3.301
pees-3.277
Token public
Token ID1171
Feature activation+1.83e+00

Pos logit contributions
arius+5.632
+5.517
asar+5.512
mess+5.506
�醒+5.469
Neg logit contributions
ggles-4.992
geoning-4.497
roxy-4.255
undo-4.247
pees-4.211
Token interest
Token ID1393
Feature activation+8.21e-01

Pos logit contributions
mess+12.955
 SERV+12.209
asar+12.084
+11.900
arius+11.824
Neg logit contributions
ggles-13.235
geoning-11.630
undo-11.231
pees-11.013
roxy-10.836
Token and
Token ID290
Feature activation+2.21e-02

Pos logit contributions
arius+5.817
�醒+5.725
+5.719
asar+5.702
mess+5.654
Neg logit contributions
ggles-6.937
geoning-6.140
 Oprah-5.944
pees-5.912
roxy-5.902
Token making
Token ID1642
Feature activation-1.22e-01

Pos logit contributions
arius+0.200
�醒+0.197
asar+0.196
+0.195
mess+0.191
Neg logit contributions
ggles-0.155
geoning-0.136
 Oprah-0.130
pees-0.130
roxy-0.129
Token recommendations
Token ID10763
Feature activation-3.09e-01

Pos logit contributions
ggles+0.948
geoning+0.839
 Oprah+0.820
pees+0.809
undo+0.809
Neg logit contributions
arius-1.009
�醒-1.000
asar-0.987
-0.978
mess-0.953
Token,
Token ID11
Feature activation+3.32e-01

Pos logit contributions
ggles+2.146
geoning+1.908
pees+1.844
 Oprah+1.840
 voluntarily+1.840
Neg logit contributions
arius-2.751
�醒-2.714
asar-2.710
-2.659
mess-2.611
Token face
Token ID1986
Feature activation+2.72e-01

Pos logit contributions
arius+0.399
�醒+0.393
asar+0.390
+0.389
mess+0.380
Neg logit contributions
ggles-0.326
geoning-0.287
 Oprah-0.276
undo-0.274
pees-0.274
Token off
Token ID572
Feature activation+1.85e-01

Pos logit contributions
arius+2.307
asar+2.252
�醒+2.234
+2.221
mess+2.203
Neg logit contributions
ggles-2.032
geoning-1.809
pees-1.746
roxy-1.730
undo-1.718
Token against
Token ID1028
Feature activation+3.14e-01

Pos logit contributions
arius+1.548
asar+1.499
�醒+1.492
+1.486
mess+1.468
Neg logit contributions
ggles-1.423
geoning-1.262
pees-1.228
undo-1.202
roxy-1.202
Token the
Token ID262
Feature activation+5.74e-01

Pos logit contributions
arius+2.529
asar+2.469
�醒+2.438
+2.421
mess+2.420
Neg logit contributions
ggles-2.468
geoning-2.225
pees-2.148
roxy-2.124
undo-2.114
Token current
Token ID1459
Feature activation+5.55e-01

Pos logit contributions
arius+4.515
mess+4.359
asar+4.342
�醒+4.235
+4.213
Neg logit contributions
ggles-4.488
geoning-4.084
pees-3.975
undo-3.923
roxy-3.871
Token second
Token ID1218
Feature activation+1.82e+00

Pos logit contributions
arius+4.173
mess+4.019
asar+3.982
+3.937
�醒+3.936
Neg logit contributions
ggles-4.600
geoning-4.204
pees-4.049
undo-4.040
roxy-4.005
Token place
Token ID1295
Feature activation+1.20e+00

Pos logit contributions
arius+9.981
mess+9.616
asar+9.158
 messenger+9.110
�醒+8.787
Neg logit contributions
ggles-14.666
undo-14.079
pees-13.837
laim-13.481
geoning-13.448
Token Democratic
Token ID4390
Feature activation+9.01e-01

Pos logit contributions
arius+6.835
mess+6.736
asar+6.413
�醒+6.121
 SERV+5.917
Neg logit contributions
ggles-10.775
geoning-10.069
undo-9.862
League-9.850
pees-9.832
Token presidential
Token ID4787
Feature activation+9.81e-01

Pos logit contributions
arius+6.182
mess+6.018
asar+5.842
�醒+5.655
+5.630
Neg logit contributions
ggles-7.740
geoning-7.155
pees-6.908
undo-6.834
roxy-6.780
Token candidate
Token ID4540
Feature activation+3.43e-01

Pos logit contributions
arius+5.674
mess+5.439
asar+5.363
+4.979
�醒+4.848
Neg logit contributions
ggles-9.343
geoning-8.671
undo-8.569
pees-8.482
edit-8.218
Token,
Token ID11
Feature activation-2.18e-01

Pos logit contributions
arius+2.728
asar+2.675
�醒+2.628
+2.621
mess+2.613
Neg logit contributions
ggles-2.733
geoning-2.460
pees-2.350
roxy-2.338
undo-2.337
Token the
Token ID262
Feature activation-8.54e-01

Pos logit contributions
ggles+8.643
 Oprah+7.775
geoning+7.703
 toggle+7.617
 voluntarily+7.462
Neg logit contributions
�醒-5.901
arius-5.558
-5.536
asar-5.465
mess-5.071
Token situation
Token ID3074
Feature activation-3.53e-01

Pos logit contributions
ggles+6.461
 Oprah+5.622
 toggle+5.567
geoning+5.537
undo+5.384
Neg logit contributions
�醒-7.046
arius-6.935
-6.776
asar-6.695
 SERV-6.415
Token
Token ID784
Feature activation-4.47e-01

Pos logit contributions
ggles+2.743
geoning+2.461
 Oprah+2.370
undo+2.349
 voluntarily+2.348
Neg logit contributions
�醒-2.879
arius-2.877
asar-2.828
-2.820
mess-2.729
Token have
Token ID423
Feature activation-1.22e+00

Pos logit contributions
ggles+3.610
 voluntarily+3.156
geoning+3.142
 Oprah+3.122
 toggle+3.121
Neg logit contributions
�醒-3.592
arius-3.494
-3.419
asar-3.394
mess-3.211
Token confirmed
Token ID4999
Feature activation-8.19e-01

Pos logit contributions
ggles+8.575
 voluntarily+8.161
 exclusively+7.838
 Oprah+7.833
 enthusiastically+7.528
Neg logit contributions
�醒-9.821
-9.343
arius-9.304
asar-9.131
mess-8.347
Token to
Token ID284
Feature activation-2.29e+00

Pos logit contributions
ggles+6.791
 Oprah+6.206
 voluntarily+6.086
 toggle+6.041
geoning+6.039
Neg logit contributions
�醒-6.081
arius-5.803
-5.801
asar-5.660
mess-5.422
Token Murphy
Token ID14424
Feature activation-7.06e-01

Pos logit contributions
ggles+15.360
 Oprah+13.860
 Seymour+13.472
 toggle+12.953
 voluntarily+12.820
Neg logit contributions
�醒-15.942
-15.318
arius-14.329
asar-13.977
 SERV-13.060
Token
Token ID447
Feature activation+4.18e-01

Pos logit contributions
ggles+4.706
geoning+4.125
 Oprah+4.124
 voluntarily+4.064
undo+4.024
Neg logit contributions
�醒-6.310
arius-6.266
asar-6.118
-6.088
mess-5.819
Token
Token ID247
Feature activation-4.32e-01

Pos logit contributions
arius+2.612
�醒+2.547
asar+2.530
+2.514
mess+2.437
Neg logit contributions
ggles-4.085
geoning-3.749
 Oprah-3.624
pees-3.624
roxy-3.614
Tokens
Token ID82
Feature activation-8.87e-01

Pos logit contributions
ggles+3.145
geoning+2.763
 Oprah+2.677
undo+2.667
roxy+2.641
Neg logit contributions
�醒-3.735
arius-3.728
asar-3.664
-3.612
mess-3.496
Token Hockey
Token ID17277
Feature activation-5.06e-01

Pos logit contributions
ggles+6.590
 Oprah+5.864
geoning+5.816
undo+5.760
roxy+5.677
Neg logit contributions
�醒-7.194
arius-7.065
-7.035
asar-6.900
mess-6.514
Token.
Token ID13
Feature activation-5.32e-01

Pos logit contributions
arius+0.142
�醒+0.139
asar+0.138
+0.137
mess+0.134
Neg logit contributions
ggles-0.142
geoning-0.127
 Oprah-0.122
pees-0.122
roxy-0.122
Token 1
Token ID352
Feature activation+4.03e-02

Pos logit contributions
ggles+3.479
 Oprah+3.022
geoning+2.930
 toggle+2.892
edit+2.863
Neg logit contributions
�醒-4.865
arius-4.823
asar-4.760
-4.686
mess-4.570
Token defender
Token ID13191
Feature activation+4.38e-01

Pos logit contributions
arius+0.342
�醒+0.335
asar+0.334
+0.332
mess+0.325
Neg logit contributions
ggles-0.306
geoning-0.272
 Oprah-0.261
pees-0.261
roxy-0.260
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
arius+3.222
�醒+3.139
asar+3.093
+3.084
mess+3.010
Neg logit contributions
ggles-3.754
geoning-3.371
roxy-3.296
pees-3.258
undo-3.181
Token later
Token ID1568
Feature activation+3.80e-01

Pos logit contributions
arius+1.667
�醒+1.632
asar+1.625
+1.615
mess+1.578
Neg logit contributions
ggles-1.841
geoning-1.659
 Oprah-1.598
pees-1.595
roxy-1.594
Token told
Token ID1297
Feature activation-2.14e+00

Pos logit contributions
arius+2.800
asar+2.720
�醒+2.701
+2.695
mess+2.647
Neg logit contributions
ggles-3.292
geoning-2.981
roxy-2.863
pees-2.857
 Oprah-2.848
Token a
Token ID257
Feature activation-1.44e+00

Pos logit contributions
ggles+13.441
 Oprah+13.224
undo+12.111
 civic+11.864
 Seymour+11.643
Neg logit contributions
�醒-15.483
arius-14.911
-14.458
asar-13.900
urer-12.972
Token Toronto
Token ID6586
Feature activation-8.06e-01

Pos logit contributions
ggles+9.498
undo+8.500
 Oprah+8.432
 civic+8.300
geoning+8.274
Neg logit contributions
arius-11.183
-11.165
�醒-10.968
asar-10.700
mess-10.338
Token Star
Token ID2907
Feature activation-8.02e-01

Pos logit contributions
ggles+5.708
 Oprah+5.111
geoning+4.924
orney+4.841
 voluntarily+4.821
Neg logit contributions
�醒-6.725
arius-6.707
-6.584
asar-6.416
mess-6.358
Token reporter
Token ID9095
Feature activation-3.84e-01

Pos logit contributions
ggles+4.441
 Oprah+3.890
undo+3.856
geoning+3.728
 toggle+3.723
Neg logit contributions
arius-7.895
�醒-7.725
asar-7.710
-7.628
mess-7.317
Token that
Token ID326
Feature activation-3.82e-01

Pos logit contributions
ggles+2.768
geoning+2.452
 Oprah+2.448
undo+2.382
roxy+2.368
Neg logit contributions
arius-3.316
�醒-3.247
asar-3.223
-3.175
mess-3.108
Token operating
Token ID5361
Feature activation-5.18e-01

Pos logit contributions
ggles+3.512
 Oprah+3.130
 voluntarily+3.129
geoning+3.106
undo+3.055
Neg logit contributions
�醒-5.155
arius-5.123
asar-5.079
-5.068
mess-4.915
Token in
Token ID287
Feature activation-6.45e-01

Pos logit contributions
ggles+3.451
geoning+3.034
 Oprah+2.965
 voluntarily+2.921
pees+2.916
Neg logit contributions
arius-4.750
�醒-4.715
asar-4.678
-4.657
mess-4.550
Token Haiti
Token ID25051
Feature activation-6.91e-01

Pos logit contributions
ggles+3.707
geoning+3.148
 Oprah+3.142
pees+3.029
undo+3.013
Neg logit contributions
arius-6.449
�醒-6.371
-6.305
asar-6.272
mess-6.120
Token,
Token ID11
Feature activation-3.26e-01

Pos logit contributions
ggles+4.517
 voluntarily+4.124
geoning+4.065
 Oprah+3.982
undo+3.963
Neg logit contributions
�醒-6.097
arius-6.075
asar-6.005
-5.978
mess-5.890
Token has
Token ID468
Feature activation-1.02e+00

Pos logit contributions
ggles+2.404
geoning+2.112
 voluntarily+2.063
 Oprah+2.052
undo+2.051
Neg logit contributions
arius-2.778
�醒-2.767
asar-2.711
-2.692
mess-2.634
Token told
Token ID1297
Feature activation-2.11e+00

Pos logit contributions
ggles+7.048
 voluntarily+6.721
 Oprah+6.283
undo+6.209
 toggle+6.178
Neg logit contributions
�醒-8.294
arius-8.189
-8.157
asar-7.962
mess-7.721
Token the
Token ID262
Feature activation-1.24e+00

Pos logit contributions
ggles+13.074
 Oprah+12.749
undo+11.966
 voluntarily+11.925
 Seymour+11.706
Neg logit contributions
�醒-14.601
arius-14.575
asar-14.143
-14.046
mess-12.898
Token Government
Token ID5070
Feature activation-3.19e-01

Pos logit contributions
ggles+8.031
 Oprah+7.290
undo+7.147
 toggle+7.138
 voluntarily+6.997
Neg logit contributions
�醒-10.388
arius-10.335
asar-10.000
-9.850
mess-9.441
Token of
Token ID286
Feature activation-7.94e-01

Pos logit contributions
ggles+2.487
geoning+2.229
undo+2.169
 Oprah+2.160
roxy+2.141
Neg logit contributions
arius-2.597
�醒-2.545
asar-2.535
-2.492
mess-2.446
Token Haiti
Token ID25051
Feature activation-1.11e+00

Pos logit contributions
ggles+5.424
 Oprah+4.699
undo+4.647
geoning+4.613
 toggle+4.530
Neg logit contributions
arius-7.023
-6.794
�醒-6.774
asar-6.747
mess-6.609
Token that
Token ID326
Feature activation-1.09e+00

Pos logit contributions
ggles+6.502
 voluntarily+5.997
undo+5.873
 Oprah+5.813
 toggle+5.725
Neg logit contributions
arius-9.933
asar-9.777
�醒-9.697
-9.301
mess-9.190
Tokennew
Token ID3605
Feature activation+1.60e-01

Pos logit contributions
ggles+1.749
geoning+1.545
undo+1.481
 Oprah+1.464
roxy+1.457
Neg logit contributions
arius-2.474
�醒-2.444
-2.428
asar-2.420
mess-2.356
Token-
Token ID12
Feature activation-9.34e-02

Pos logit contributions
arius+1.100
�醒+1.074
asar+1.073
+1.046
mess+1.044
Neg logit contributions
ggles-1.468
geoning-1.324
pees-1.291
roxy-1.278
 Oprah-1.277
Tokenobject
Token ID15252
Feature activation-1.93e-01

Pos logit contributions
ggles+0.416
geoning+0.340
undo+0.318
pees+0.315
roxy+0.314
Neg logit contributions
arius-1.080
�醒-1.075
asar-1.061
-1.061
mess-1.034
Token system
Token ID1080
Feature activation+2.77e-01

Pos logit contributions
ggles+1.741
geoning+1.587
 Oprah+1.532
pees+1.525
undo+1.525
Neg logit contributions
arius-1.347
�醒-1.319
asar-1.310
-1.305
mess-1.266
Token.
Token ID13
Feature activation-3.85e-01

Pos logit contributions
arius+2.229
asar+2.177
+2.162
�醒+2.160
mess+2.121
Neg logit contributions
ggles-2.221
geoning-1.986
pees-1.910
roxy-1.907
 Oprah-1.904
Tokendraw
Token ID19334
Feature activation-2.00e+00

Pos logit contributions
ggles+2.320
geoning+2.005
pees+1.891
 Oprah+1.889
undo+1.888
Neg logit contributions
arius-3.846
�醒-3.795
asar-3.776
-3.758
mess-3.683
Tokening
Token ID278
Feature activation-5.03e-01

Pos logit contributions
edit+3.000
ggles+2.853
geoning+2.726
 ing+2.326
/_+2.306
Neg logit contributions
�醒-26.910
 mosqu-24.676
 exting-24.173
 eleph-24.047
 SERV-23.728
Token.
Token ID13
Feature activation+1.30e-01

Pos logit contributions
ggles+3.001
/_+2.683
edit+2.674
geoning+2.658
 Oprah+2.636
Neg logit contributions
�醒-5.259
arius-4.785
-4.694
asar-4.645
 SERV-4.540
Tokenrect
Token ID2554
Feature activation-4.44e-01

Pos logit contributions
arius+1.199
asar+1.182
mess+1.168
�醒+1.162
+1.161
Neg logit contributions
ggles-0.888
geoning-0.763
 Oprah-0.756
 toggle-0.736
roxy-0.735
Tokenangle
Token ID9248
Feature activation-3.51e-01

Pos logit contributions
ggles+3.185
geoning+2.870
undo+2.699
 Oprah+2.690
roxy+2.689
Neg logit contributions
arius-3.872
�醒-3.834
-3.817
asar-3.788
mess-3.663
Tokenf
Token ID69
Feature activation-7.17e-01

Pos logit contributions
ggles+2.354
geoning+2.114
 Oprah+2.007
undo+1.986
roxy+1.982
Neg logit contributions
arius-3.228
�醒-3.190
-3.169
asar-3.166
mess-3.071
Token years
Token ID812
Feature activation+1.80e-01

Pos logit contributions
ggles+2.030
geoning+1.789
 toggle+1.723
pees+1.713
 Oprah+1.705
Neg logit contributions
�醒-2.523
arius-2.519
-2.469
asar-2.442
mess-2.377
Token old
Token ID1468
Feature activation-1.65e-01

Pos logit contributions
arius+1.449
asar+1.412
�醒+1.411
+1.400
mess+1.373
Neg logit contributions
ggles-1.445
geoning-1.294
 Oprah-1.243
pees-1.243
roxy-1.241
Token and
Token ID290
Feature activation-5.98e-01

Pos logit contributions
ggles+1.290
geoning+1.154
 Oprah+1.107
roxy+1.104
pees+1.102
Neg logit contributions
arius-1.355
�醒-1.339
asar-1.322
-1.316
mess-1.287
Token once
Token ID1752
Feature activation-5.07e-01

Pos logit contributions
ggles+4.078
 toggle+3.482
 Oprah+3.385
geoning+3.336
 voluntarily+3.334
Neg logit contributions
�醒-5.734
arius-5.405
-5.401
asar-5.323
mess-5.185
Token I
Token ID314
Feature activation-1.07e+00

Pos logit contributions
ggles+3.379
 toggle+2.821
geoning+2.770
 Oprah+2.739
roxy+2.729
Neg logit contributions
�醒-4.914
-4.729
arius-4.685
asar-4.650
mess-4.506
Token turn
Token ID1210
Feature activation-1.98e+00

Pos logit contributions
ggles+7.382
 toggle+6.542
 voluntarily+6.058
edit+5.800
roxy+5.691
Neg logit contributions
�醒-10.451
-9.470
asar-9.193
arius-9.138
mess-8.867
Token AA
Token ID15923
Feature activation-1.94e+00

Pos logit contributions
ggles+11.411
 toggle+10.331
edit+9.523
 OpenGL+9.338
 myself+9.126
Neg logit contributions
�醒-16.921
-16.425
asar-15.455
arius-14.842
mess-14.678
Token on
Token ID319
Feature activation-1.74e+00

Pos logit contributions
ggles+8.777
 toggle+8.280
 voluntarily+6.926
edit+6.801
 exclusively+6.568
Neg logit contributions
�醒-19.453
-17.875
arius-17.225
asar-17.068
mess-16.629
Token things
Token ID1243
Feature activation+2.33e-01

Pos logit contributions
ggles+9.893
 toggle+8.803
 voluntarily+8.147
edit+7.833
 OpenGL+7.756
Neg logit contributions
�醒-16.452
-15.231
arius-14.518
asar-14.364
mess-13.929
Token slow
Token ID3105
Feature activation-1.68e-01

Pos logit contributions
arius+2.009
asar+1.965
�醒+1.937
+1.937
mess+1.918
Neg logit contributions
ggles-1.703
geoning-1.536
pees-1.470
roxy-1.463
 Oprah-1.459
Token down
Token ID866
Feature activation-4.47e-01

Pos logit contributions
ggles+1.249
geoning+1.102
 Oprah+1.058
pees+1.054
roxy+1.052
Neg logit contributions
arius-1.452
�醒-1.441
-1.425
asar-1.421
mess-1.382
Tokenimage
Token ID9060
Feature activation+2.37e-01

Pos logit contributions
arius+5.217
asar+5.191
mess+5.137
+5.056
�醒+4.902
Neg logit contributions
ggles-4.590
geoning-4.024
 Oprah-3.993
 toggle-3.908
pees-3.842
Token =
Token ID796
Feature activation+2.72e-01

Pos logit contributions
arius+2.202
asar+2.149
�醒+2.129
+2.122
mess+2.118
Neg logit contributions
ggles-1.612
geoning-1.374
pees-1.333
 Oprah-1.319
roxy-1.318
Token [
Token ID685
Feature activation+4.18e-01

Pos logit contributions
arius+2.369
asar+2.333
�醒+2.307
+2.268
mess+2.260
Neg logit contributions
ggles-1.965
geoning-1.747
pees-1.692
roxy-1.675
 Oprah-1.652
TokenSystem
Token ID11964
Feature activation+5.70e-01

Pos logit contributions
arius+2.490
mess+2.403
�醒+2.386
asar+2.386
+2.342
Neg logit contributions
ggles-4.168
geoning-3.727
pees-3.708
 Oprah-3.689
 toggle-3.658
Token.
Token ID13
Feature activation-2.84e-02

Pos logit contributions
arius+4.167
asar+4.069
+4.024
mess+4.000
�醒+3.773
Neg logit contributions
ggles-4.938
geoning-4.425
pees-4.320
roxy-4.293
 Oprah-4.197
TokenDraw
Token ID25302
Feature activation-1.98e+00

Pos logit contributions
ggles+0.193
geoning+0.165
roxy+0.163
 Oprah+0.162
pees+0.160
Neg logit contributions
arius-0.264
�醒-0.259
asar-0.257
-0.256
mess-0.254
Tokening
Token ID278
Feature activation-1.54e-01

Pos logit contributions
edit+2.625
ggles+2.362
geoning+2.076
/_+1.820
League+1.800
Neg logit contributions
�醒-27.019
 mosqu-24.356
 eleph-24.274
 exting-23.907
 SERV-23.722
Token.
Token ID13
Feature activation+5.62e-01

Pos logit contributions
ggles+1.053
geoning+0.945
 Oprah+0.915
/_+0.908
edit+0.906
Neg logit contributions
�醒-1.455
arius-1.396
asar-1.361
-1.348
mess-1.312
TokenGraphics
Token ID18172
Feature activation+4.03e-01

Pos logit contributions
arius+3.668
mess+3.660
asar+3.585
+3.544
�醒+3.501
Neg logit contributions
ggles-5.071
 Oprah-4.613
 toggle-4.520
 voluntarily-4.509
geoning-4.432
Token]
Token ID60
Feature activation-2.82e-01

Pos logit contributions
arius+2.602
�醒+2.566
mess+2.542
asar+2.531
+2.478
Neg logit contributions
ggles-3.776
pees-3.340
geoning-3.304
 Oprah-3.266
roxy-3.254
Token::
Token ID3712
Feature activation+5.30e-01

Pos logit contributions
ggles+1.339
geoning+1.117
/_+1.088
edit+1.084
 Oprah+1.080
Neg logit contributions
�醒-3.181
arius-3.101
asar-3.054
-3.029
mess-2.965
Token a
Token ID257
Feature activation+4.35e-02

Pos logit contributions
ggles+8.007
 toggle+7.094
 voluntarily+7.033
 Oprah+6.862
edit+6.614
Neg logit contributions
�醒-13.210
-12.774
arius-12.245
asar-12.186
mess-11.637
Token knob
Token ID36842
Feature activation-1.63e+00

Pos logit contributions
arius+0.394
�醒+0.388
asar+0.386
+0.385
mess+0.376
Neg logit contributions
ggles-0.306
geoning-0.268
 Oprah-0.257
pees-0.256
undo-0.255
Token or
Token ID393
Feature activation+5.01e-01

Pos logit contributions
ggles+8.561
 voluntarily+8.069
 toggle+7.875
edit+7.331
undo+7.329
Neg logit contributions
�醒-14.350
-13.989
arius-13.533
asar-13.216
 SERV-12.765
Token flip
Token ID14283
Feature activation-9.32e-01

Pos logit contributions
arius+4.093
asar+4.049
mess+3.968
+3.928
�醒+3.915
Neg logit contributions
ggles-3.731
geoning-3.504
roxy-3.300
pees-3.293
undo-3.224
Token a
Token ID257
Feature activation-2.50e-01

Pos logit contributions
ggles+6.362
 toggle+5.554
 Oprah+5.355
 voluntarily+5.271
undo+5.226
Neg logit contributions
-8.150
�醒-8.088
arius-8.082
asar-7.882
mess-7.440
Token switch
Token ID5078
Feature activation-1.98e+00

Pos logit contributions
ggles+1.301
 toggle+1.053
geoning+1.013
undo+1.005
 Oprah+0.992
Neg logit contributions
arius-2.740
�醒-2.721
-2.708
asar-2.684
mess-2.619
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
ggles+9.520
 voluntarily+9.460
 toggle+8.984
/_+8.649
undo+8.603
Neg logit contributions
�醒-16.548
-16.085
arius-15.297
asar-14.977
 SERV-14.580
Token just
Token ID655
Feature activation+8.06e-01

Pos logit contributions
ggles+0.781
geoning+0.664
 toggle+0.660
 Oprah+0.658
undo+0.649
Neg logit contributions
�醒-0.963
arius-0.957
-0.942
asar-0.934
mess-0.908
Token leave
Token ID2666
Feature activation-9.98e-01

Pos logit contributions
arius+6.516
asar+6.374
mess+6.268
+6.118
 SERV+6.088
Neg logit contributions
ggles-5.671
geoning-5.513
roxy-5.190
orney-5.070
pees-5.064
Token everything
Token ID2279
Feature activation-6.13e-01

Pos logit contributions
ggles+5.674
 toggle+4.738
 voluntarily+4.645
 Oprah+4.542
undo+4.296
Neg logit contributions
�醒-10.087
-9.834
arius-9.785
asar-9.598
mess-9.194
Token alone
Token ID3436
Feature activation-4.47e-01

Pos logit contributions
ggles+4.247
 toggle+3.728
 voluntarily+3.651
 Oprah+3.614
pees+3.480
Neg logit contributions
�醒-5.693
arius-5.424
-5.410
asar-5.300
mess-5.184
Tokenignment
Token ID16747
Feature activation+2.72e-01

Pos logit contributions
arius+4.520
asar+4.231
mess+4.140
+3.965
�醒+3.821
Neg logit contributions
ggles-6.491
geoning-5.859
roxy-5.820
 Oprah-5.780
pees-5.768
Token =
Token ID796
Feature activation-3.19e-01

Pos logit contributions
arius+2.557
�醒+2.483
asar+2.470
+2.454
mess+2.444
Neg logit contributions
ggles-1.793
geoning-1.554
pees-1.519
roxy-1.496
 Oprah-1.481
Token [
Token ID685
Feature activation+3.96e-01

Pos logit contributions
ggles+2.242
geoning+1.965
 Oprah+1.887
undo+1.883
 toggle+1.877
Neg logit contributions
arius-2.853
�醒-2.830
-2.811
asar-2.805
mess-2.731
Tokensystem
Token ID10057
Feature activation+2.82e-01

Pos logit contributions
arius+2.220
asar+2.106
mess+2.103
�醒+2.045
+2.009
Neg logit contributions
ggles-4.141
geoning-3.759
pees-3.718
 Oprah-3.691
roxy-3.650
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
arius+2.247
asar+2.195
+2.172
�醒+2.157
mess+2.139
Neg logit contributions
ggles-2.280
geoning-2.042
pees-1.967
roxy-1.958
 Oprah-1.957
Tokendraw
Token ID19334
Feature activation-1.94e+00

Pos logit contributions
ggles+1.692
geoning+1.436
undo+1.374
pees+1.334
roxy+1.300
Neg logit contributions
�醒-3.639
arius-3.599
-3.556
asar-3.544
 SERV-3.441
Tokening
Token ID278
Feature activation-2.50e-01

Pos logit contributions
edit+2.743
ggles+2.567
geoning+2.552
undo+2.092
/_+2.089
Neg logit contributions
�醒-26.448
 mosqu-24.460
 eleph-24.127
 exting-23.995
 SERV-23.426
Token.
Token ID13
Feature activation+8.33e-01

Pos logit contributions
ggles+1.660
geoning+1.471
 Oprah+1.450
/_+1.441
edit+1.441
Neg logit contributions
�醒-2.433
arius-2.278
-2.242
asar-2.222
 SERV-2.147
TokenString
Token ID10100
Feature activation+4.32e-01

Pos logit contributions
arius+6.179
asar+6.029
mess+5.907
+5.855
�醒+5.792
Neg logit contributions
ggles-6.713
geoning-5.893
 Oprah-5.880
roxy-5.854
pees-5.792
TokenAl
Token ID2348
Feature activation+7.82e-01

Pos logit contributions
arius+3.333
�醒+3.219
asar+3.218
+3.171
mess+3.169
Neg logit contributions
ggles-3.566
roxy-3.122
geoning-3.119
pees-3.105
 Oprah-3.060
Tokenignment
Token ID16747
Feature activation+7.60e-01

Pos logit contributions
arius+5.110
asar+4.721
mess+4.558
+4.369
�醒+4.238
Neg logit contributions
ggles-7.394
geoning-6.633
 toggle-6.630
 Oprah-6.630
roxy-6.625
Token developer
Token ID8517
Feature activation-7.62e-01

Pos logit contributions
ggles+0.285
geoning+0.251
 Oprah+0.242
undo+0.241
roxy+0.239
Neg logit contributions
arius-0.351
�醒-0.348
asar-0.344
-0.344
mess-0.334
Token Yosh
Token ID28563
Feature activation-3.28e-01

Pos logit contributions
ggles+5.586
 Oprah+5.080
 toggle+4.887
geoning+4.884
undo+4.843
Neg logit contributions
�醒-6.423
arius-6.113
-6.074
asar-5.985
mess-5.706
Tokenio
Token ID952
Feature activation-7.53e-01

Pos logit contributions
ggles+2.510
geoning+2.200
undo+2.188
 Oprah+2.157
pees+2.152
Neg logit contributions
�醒-2.718
arius-2.661
-2.654
asar-2.598
mess-2.555
Token Sak
Token ID13231
Feature activation-7.35e-01

Pos logit contributions
ggles+4.922
 Oprah+4.248
undo+4.225
 toggle+4.181
geoning+4.118
Neg logit contributions
�醒-6.964
-6.822
arius-6.758
asar-6.617
mess-6.420
Tokenamoto
Token ID25384
Feature activation+5.17e-01

Pos logit contributions
ggles+4.767
undo+4.256
pees+4.101
orney+4.089
geoning+4.089
Neg logit contributions
�醒-6.633
-6.575
 SERV-6.246
asar-6.227
mess-6.127
Token told
Token ID1297
Feature activation-1.94e+00

Pos logit contributions
arius+3.446
asar+3.395
+3.354
mess+3.286
�醒+3.231
Neg logit contributions
ggles-4.718
geoning-4.351
roxy-4.179
pees-4.153
 Oprah-4.122
Token me
Token ID502
Feature activation-5.14e-01

Pos logit contributions
ggles+11.441
 Oprah+10.554
 toggle+9.643
undo+9.538
 Seymour+9.501
Neg logit contributions
�醒-16.378
-15.256
arius-14.699
asar-14.263
mess-13.501
Token in
Token ID287
Feature activation-3.53e-01

Pos logit contributions
ggles+3.554
geoning+3.139
 Oprah+3.089
 voluntarily+3.041
undo+3.025
Neg logit contributions
�醒-4.584
arius-4.582
-4.457
asar-4.417
mess-4.321
Token 2010
Token ID3050
Feature activation-1.34e-01

Pos logit contributions
ggles+2.450
 Oprah+2.118
geoning+2.089
 toggle+2.053
 voluntarily+2.027
Neg logit contributions
�醒-3.284
arius-3.184
-3.177
asar-3.113
mess-3.030
Token.
Token ID13
Feature activation-9.27e-01

Pos logit contributions
ggles+1.069
geoning+0.954
 Oprah+0.917
pees+0.916
roxy+0.914
Neg logit contributions
arius-1.091
�醒-1.070
asar-1.067
-1.060
mess-1.037
Token
Token ID198
Feature activation+2.35e-01

Pos logit contributions
ggles+6.700
geoning+5.993
undo+5.822
 Oprah+5.775
 toggle+5.678
Neg logit contributions
arius-7.617
�醒-7.559
-7.513
asar-7.459
mess-7.283
Token old
Token ID1468
Feature activation-1.65e-01

Pos logit contributions
arius+1.449
asar+1.412
�醒+1.411
+1.400
mess+1.373
Neg logit contributions
ggles-1.445
geoning-1.294
 Oprah-1.243
pees-1.243
roxy-1.241
Token and
Token ID290
Feature activation-5.98e-01

Pos logit contributions
ggles+1.290
geoning+1.154
 Oprah+1.107
roxy+1.104
pees+1.102
Neg logit contributions
arius-1.355
�醒-1.339
asar-1.322
-1.316
mess-1.287
Token once
Token ID1752
Feature activation-5.07e-01

Pos logit contributions
ggles+4.078
 toggle+3.482
 Oprah+3.385
geoning+3.336
 voluntarily+3.334
Neg logit contributions
�醒-5.734
arius-5.405
-5.401
asar-5.323
mess-5.185
Token I
Token ID314
Feature activation-1.07e+00

Pos logit contributions
ggles+3.379
 toggle+2.821
geoning+2.770
 Oprah+2.739
roxy+2.729
Neg logit contributions
�醒-4.914
-4.729
arius-4.685
asar-4.650
mess-4.506
Token turn
Token ID1210
Feature activation-1.98e+00

Pos logit contributions
ggles+7.382
 toggle+6.542
 voluntarily+6.058
edit+5.800
roxy+5.691
Neg logit contributions
�醒-10.451
-9.470
asar-9.193
arius-9.138
mess-8.867
Token AA
Token ID15923
Feature activation-1.94e+00

Pos logit contributions
ggles+11.411
 toggle+10.331
edit+9.523
 OpenGL+9.338
 myself+9.126
Neg logit contributions
�醒-16.921
-16.425
asar-15.455
arius-14.842
mess-14.678
Token on
Token ID319
Feature activation-1.74e+00

Pos logit contributions
ggles+8.777
 toggle+8.280
 voluntarily+6.926
edit+6.801
 exclusively+6.568
Neg logit contributions
�醒-19.453
-17.875
arius-17.225
asar-17.068
mess-16.629
Token things
Token ID1243
Feature activation+2.33e-01

Pos logit contributions
ggles+9.893
 toggle+8.803
 voluntarily+8.147
edit+7.833
 OpenGL+7.756
Neg logit contributions
�醒-16.452
-15.231
arius-14.518
asar-14.364
mess-13.929
Token slow
Token ID3105
Feature activation-1.68e-01

Pos logit contributions
arius+2.009
asar+1.965
�醒+1.937
+1.937
mess+1.918
Neg logit contributions
ggles-1.703
geoning-1.536
pees-1.470
roxy-1.463
 Oprah-1.459
Token down
Token ID866
Feature activation-4.47e-01

Pos logit contributions
ggles+1.249
geoning+1.102
 Oprah+1.058
pees+1.054
roxy+1.052
Neg logit contributions
arius-1.452
�醒-1.441
-1.425
asar-1.421
mess-1.382
Token quite
Token ID2407
Feature activation+3.53e-01

Pos logit contributions
ggles+3.072
geoning+2.663
 toggle+2.607
 Oprah+2.602
 voluntarily+2.593
Neg logit contributions
�醒-4.106
-4.025
arius-4.013
asar-3.915
mess-3.804
Token what
Token ID644
Feature activation-9.73e-01

Pos logit contributions
ggles+3.388
 Oprah+2.980
 voluntarily+2.952
geoning+2.909
 toggle+2.907
Neg logit contributions
�醒-4.010
arius-3.908
-3.844
asar-3.832
mess-3.696
Token the
Token ID262
Feature activation-7.20e-01

Pos logit contributions
ggles+6.505
 Oprah+5.894
pees+5.776
 voluntarily+5.774
 toggle+5.711
Neg logit contributions
�醒-8.369
arius-8.187
asar-7.895
-7.784
mess-7.517
Token American
Token ID1605
Feature activation-7.23e-01

Pos logit contributions
ggles+5.098
 Oprah+4.523
geoning+4.503
 toggle+4.469
roxy+4.452
Neg logit contributions
�醒-6.160
arius-6.130
asar-5.933
-5.866
mess-5.514
Token ambassador
Token ID14791
Feature activation-6.51e-01

Pos logit contributions
ggles+5.627
 voluntarily+4.998
 toggle+4.934
 Oprah+4.920
pees+4.914
Neg logit contributions
�醒-5.773
arius-5.639
asar-5.468
-5.382
mess-5.124
Token himself
Token ID2241
Feature activation-9.62e-02

Pos logit contributions
ggles+5.471
 Oprah+4.990
 voluntarily+4.987
pees+4.960
undo+4.940
Neg logit contributions
�醒-4.657
arius-4.526
asar-4.383
-4.368
mess-4.205
Token told
Token ID1297
Feature activation-1.94e+00

Pos logit contributions
ggles+0.928
geoning+0.846
 Oprah+0.828
pees+0.828
undo+0.821
Neg logit contributions
arius-0.613
�醒-0.610
asar-0.590
-0.589
mess-0.569
Token me
Token ID502
Feature activation-5.41e-01

Pos logit contributions
ggles+11.686
 Oprah+11.097
 voluntarily+10.544
undo+10.448
 annually+10.110
Neg logit contributions
�醒-14.523
arius-14.298
-13.739
asar-13.363
mess-12.578
Token,
Token ID11
Feature activation-5.26e-01

Pos logit contributions
ggles+3.759
geoning+3.315
 voluntarily+3.282
 Oprah+3.271
undo+3.227
Neg logit contributions
arius-4.759
�醒-4.684
asar-4.567
-4.547
mess-4.433
Token
Token ID447
Feature activation+1.12e-02

Pos logit contributions
ggles+3.795
geoning+3.396
 voluntarily+3.384
 Oprah+3.371
pees+3.360
Neg logit contributions
arius-4.445
�醒-4.374
-4.272
asar-4.258
mess-4.138
Token
Token ID251
Feature activation-6.18e-03

Pos logit contributions
arius+0.067
asar+0.065
�醒+0.065
+0.064
mess+0.062
Neg logit contributions
ggles-0.113
geoning-0.103
 Oprah-0.101
undo-0.101
pees-0.100
Token said
Token ID531
Feature activation-8.92e-01

Pos logit contributions
ggles+0.049
geoning+0.043
 Oprah+0.042
pees+0.042
roxy+0.041
Neg logit contributions
arius-0.051
�醒-0.050
asar-0.050
-0.049
mess-0.048
Token way
Token ID835
Feature activation-1.18e-01

Pos logit contributions
ggles+2.892
 Oprah+2.538
 toggle+2.515
geoning+2.497
undo+2.473
Neg logit contributions
�醒-3.501
arius-3.435
-3.404
asar-3.339
mess-3.268
Token of
Token ID286
Feature activation-3.72e-01

Pos logit contributions
ggles+0.897
geoning+0.799
 Oprah+0.768
pees+0.765
undo+0.763
Neg logit contributions
arius-0.993
�醒-0.981
asar-0.972
-0.968
mess-0.945
Token doing
Token ID1804
Feature activation-3.72e-01

Pos logit contributions
ggles+2.475
geoning+2.098
 Oprah+2.095
 toggle+2.077
undo+2.061
Neg logit contributions
�醒-3.535
arius-3.448
-3.447
asar-3.393
mess-3.285
Token the
Token ID262
Feature activation-6.11e-01

Pos logit contributions
ggles+2.929
geoning+2.546
pees+2.513
 Oprah+2.501
undo+2.496
Neg logit contributions
�醒-3.047
arius-3.024
-2.994
asar-2.955
mess-2.878
Token light
Token ID1657
Feature activation-8.39e-01

Pos logit contributions
ggles+4.547
 Oprah+3.897
 toggle+3.896
geoning+3.894
pees+3.840
Neg logit contributions
�醒-5.188
arius-5.146
-5.092
asar-4.988
mess-4.878
Token switch
Token ID5078
Feature activation-1.93e+00

Pos logit contributions
ggles+5.907
geoning+5.391
 toggle+5.269
undo+5.203
pees+5.167
Neg logit contributions
�醒-7.127
arius-6.979
-6.942
asar-6.661
mess-6.371
Token skip
Token ID14267
Feature activation-9.96e-01

Pos logit contributions
ggles+12.716
 toggle+11.612
edit+10.859
undo+10.821
pees+10.783
Neg logit contributions
�醒-14.341
-13.831
arius-13.205
asar-12.650
 SERV-12.344
Token.
Token ID13
Feature activation-4.44e-01

Pos logit contributions
ggles+6.916
 toggle+6.152
pees+6.050
geoning+6.046
 Oprah+5.920
Neg logit contributions
�醒-8.546
-8.228
arius-8.140
asar-7.845
mess-7.602
Token Hop
Token ID9996
Feature activation-5.62e-01

Pos logit contributions
ggles+3.037
geoning+2.676
 Oprah+2.627
edit+2.609
 toggle+2.575
Neg logit contributions
�醒-3.959
arius-3.916
asar-3.893
-3.883
mess-3.740
Tokening
Token ID278
Feature activation+1.07e-01

Pos logit contributions
ggles+3.515
geoning+3.122
roxy+2.981
pees+2.972
 Oprah+2.954
Neg logit contributions
�醒-5.346
arius-5.283
-5.218
asar-5.126
mess-4.992
Token to
Token ID284
Feature activation+1.98e-02

Pos logit contributions
arius+0.920
asar+0.895
�醒+0.893
+0.889
mess+0.877
Neg logit contributions
ggles-0.803
geoning-0.715
pees-0.684
undo-0.680
 Oprah-0.680
Token;
Token ID26
Feature activation-2.89e-01

Pos logit contributions
ggles+5.609
geoning+5.263
 Oprah+5.168
 voluntarily+5.155
 toggle+5.001
Neg logit contributions
arius-8.842
�醒-8.804
-8.630
asar-8.626
mess-8.373
Token the
Token ID262
Feature activation-2.08e-01

Pos logit contributions
ggles+2.075
 Oprah+1.822
geoning+1.799
 toggle+1.755
 voluntarily+1.731
Neg logit contributions
�醒-2.584
arius-2.572
-2.530
asar-2.491
mess-2.405
Token D
Token ID360
Feature activation-5.88e-01

Pos logit contributions
ggles+1.606
geoning+1.437
 Oprah+1.412
roxy+1.378
pees+1.376
Neg logit contributions
arius-1.733
�醒-1.716
-1.688
asar-1.678
mess-1.619
Tokenugg
Token ID6837
Feature activation-4.25e-01

Pos logit contributions
ggles+3.123
undo+2.449
pees+2.437
ovan+2.418
geoning+2.415
Neg logit contributions
�醒-6.522
arius-6.139
-6.097
mess-5.973
asar-5.958
Tokenars
Token ID945
Feature activation-7.75e-02

Pos logit contributions
ggles+3.478
geoning+3.071
roxy+2.953
pees+2.952
undo+2.951
Neg logit contributions
�醒-3.381
-3.265
arius-3.230
asar-3.180
mess-3.157
Token told
Token ID1297
Feature activation-1.87e+00

Pos logit contributions
ggles+0.629
geoning+0.558
 Oprah+0.547
 voluntarily+0.543
pees+0.541
Neg logit contributions
�醒-0.619
arius-0.617
-0.601
asar-0.601
mess-0.581
Token Fox
Token ID5426
Feature activation-9.37e-01

Pos logit contributions
 Oprah+11.826
ggles+11.778
 toggle+10.401
 voluntarily+10.251
 Seymour+10.069
Neg logit contributions
�醒-14.837
-14.414
arius-14.238
asar-13.416
mess-12.116
Token News
Token ID3000
Feature activation-7.19e-01

Pos logit contributions
ggles+4.778
 Oprah+4.166
geoning+4.133
 toggle+3.964
 voluntarily+3.888
Neg logit contributions
�醒-9.606
arius-9.491
-9.462
asar-9.399
mess-8.992
Token in
Token ID287
Feature activation-4.29e-01

Pos logit contributions
ggles+4.801
 Oprah+4.302
geoning+4.216
 toggle+4.171
 voluntarily+4.169
Neg logit contributions
�醒-6.389
arius-6.380
-6.283
asar-6.245
mess-5.973
Token June
Token ID2795
Feature activation-7.09e-01

Pos logit contributions
ggles+2.846
 Oprah+2.470
geoning+2.429
 toggle+2.378
 voluntarily+2.374
Neg logit contributions
�醒-4.003
-3.973
arius-3.962
asar-3.895
mess-3.756
Token that
Token ID326
Feature activation-1.79e-01

Pos logit contributions
ggles+4.407
 Oprah+3.984
 voluntarily+3.848
 toggle+3.836
geoning+3.827
Neg logit contributions
�醒-6.553
arius-6.526
-6.339
asar-6.295
mess-6.149
Tokenignment
Token ID16747
Feature activation+3.62e-01

Pos logit contributions
arius+3.639
asar+3.449
mess+3.354
+3.286
�醒+3.261
Neg logit contributions
ggles-4.828
geoning-4.367
roxy-4.307
 Oprah-4.295
pees-4.267
Token =
Token ID796
Feature activation-1.24e-01

Pos logit contributions
arius+3.290
�醒+3.192
asar+3.173
+3.153
mess+3.136
Neg logit contributions
ggles-2.498
geoning-2.164
pees-2.122
roxy-2.099
 Oprah-2.057
Token [
Token ID685
Feature activation+6.28e-01

Pos logit contributions
ggles+0.861
geoning+0.759
pees+0.729
roxy+0.726
 Oprah+0.721
Neg logit contributions
arius-1.141
�醒-1.117
asar-1.112
-1.100
mess-1.088
Tokensystem
Token ID10057
Feature activation+3.67e-01

Pos logit contributions
arius+3.090
mess+2.968
asar+2.848
�醒+2.751
+2.667
Neg logit contributions
ggles-6.883
pees-6.234
geoning-6.206
 Oprah-6.146
 voluntarily-6.117
Token.
Token ID13
Feature activation-3.18e-01

Pos logit contributions
arius+2.852
asar+2.785
+2.747
mess+2.721
�醒+2.676
Neg logit contributions
ggles-3.048
geoning-2.735
pees-2.645
roxy-2.628
 Oprah-2.611
Tokendraw
Token ID19334
Feature activation-1.86e+00

Pos logit contributions
ggles+1.507
geoning+1.267
undo+1.239
pees+1.178
edit+1.169
Neg logit contributions
�醒-3.653
arius-3.514
-3.497
asar-3.458
 SERV-3.416
Tokening
Token ID278
Feature activation-1.11e-01

Pos logit contributions
edit+2.921
ggles+2.648
geoning+2.564
/_+2.218
undo+2.122
Neg logit contributions
�醒-25.409
 mosqu-23.379
 eleph-23.004
 exting-22.869
 SERV-22.441
Token.
Token ID13
Feature activation+9.50e-01

Pos logit contributions
ggles+0.782
geoning+0.695
 Oprah+0.679
undo+0.671
 toggle+0.668
Neg logit contributions
�醒-1.027
arius-0.985
-0.964
asar-0.960
mess-0.927
TokenString
Token ID10100
Feature activation+4.75e-01

Pos logit contributions
arius+6.643
asar+6.515
mess+6.393
+6.292
�醒+6.200
Neg logit contributions
ggles-7.824
 Oprah-6.895
geoning-6.881
roxy-6.878
pees-6.797
TokenAl
Token ID2348
Feature activation+1.00e+00

Pos logit contributions
arius+3.628
asar+3.532
mess+3.480
+3.467
�醒+3.457
Neg logit contributions
ggles-3.975
roxy-3.475
pees-3.449
geoning-3.441
 Oprah-3.393
Tokenignment
Token ID16747
Feature activation+8.46e-01

Pos logit contributions
arius+6.148
asar+5.578
mess+5.322
+4.953
�醒+4.681
Neg logit contributions
ggles-9.676
 toggle-8.764
 Oprah-8.756
roxy-8.749
geoning-8.698
Token new
Token ID649
Feature activation+1.11e+00

Pos logit contributions
arius+2.214
asar+2.182
�醒+2.163
+2.136
mess+2.114
Neg logit contributions
ggles-1.881
geoning-1.682
pees-1.618
roxy-1.593
 Oprah-1.590
Token-
Token ID12
Feature activation-7.34e-02

Pos logit contributions
arius+5.931
asar+5.830
mess+5.639
+5.478
 SERV+5.059
Neg logit contributions
ggles-11.004
geoning-10.047
pees-9.877
roxy-9.704
undo-9.703
Tokenobject
Token ID15252
Feature activation+3.28e-01

Pos logit contributions
ggles+0.199
geoning+0.143
undo+0.133
roxy+0.127
 Oprah+0.122
Neg logit contributions
�醒-0.992
arius-0.971
-0.963
asar-0.955
 SERV-0.937
Token system
Token ID1080
Feature activation+6.50e-01

Pos logit contributions
arius+1.996
asar+1.934
�醒+1.928
mess+1.894
+1.872
Neg logit contributions
ggles-3.235
geoning-2.933
pees-2.892
roxy-2.845
 Oprah-2.837
Token.
Token ID13
Feature activation-2.53e-01

Pos logit contributions
arius+4.643
asar+4.521
+4.461
mess+4.458
 SERV+4.091
Neg logit contributions
ggles-5.713
geoning-5.130
pees-4.999
roxy-4.978
 Oprah-4.854
Tokendraw
Token ID19334
Feature activation-1.86e+00

Pos logit contributions
ggles+1.516
geoning+1.298
 Oprah+1.243
roxy+1.230
pees+1.229
Neg logit contributions
arius-2.548
�醒-2.501
asar-2.497
-2.484
mess-2.452
Tokening
Token ID278
Feature activation-1.09e-01

Pos logit contributions
edit+3.011
ggles+2.921
geoning+2.779
 ing+2.417
/_+2.265
Neg logit contributions
�醒-25.204
 mosqu-22.820
 SERV-22.313
 eleph-22.222
 exting-22.204
Token.
Token ID13
Feature activation+3.44e-01

Pos logit contributions
ggles+0.762
geoning+0.679
 Oprah+0.656
undo+0.656
 toggle+0.650
Neg logit contributions
�醒-1.012
arius-0.970
-0.947
asar-0.944
mess-0.910
Tokenbit
Token ID2545
Feature activation-8.09e-01

Pos logit contributions
arius+3.209
asar+3.188
mess+3.176
+3.098
�醒+3.093
Neg logit contributions
ggles-2.244
 Oprah-1.911
geoning-1.897
 toggle-1.850
pees-1.834
Tokenmap
Token ID8899
Feature activation-1.98e-01

Pos logit contributions
ggles+3.204
geoning+2.845
undo+2.798
edit+2.640
embed+2.626
Neg logit contributions
�醒-9.963
-9.240
 SERV-9.102
arius-8.921
asar-8.542
Token($
Token ID16763
Feature activation+6.31e-01

Pos logit contributions
ggles+1.453
geoning+1.296
 Oprah+1.239
roxy+1.234
undo+1.233
Neg logit contributions
arius-1.723
�醒-1.696
asar-1.685
-1.682
mess-1.639
Token)
Token ID8
Feature activation-4.41e-01

Pos logit contributions
ggles+3.359
geoning+2.901
 toggle+2.762
roxy+2.757
 Oprah+2.751
Neg logit contributions
arius-4.871
�醒-4.868
asar-4.826
-4.764
mess-4.629
Token and
Token ID290
Feature activation-4.78e-01

Pos logit contributions
ggles+3.578
geoning+3.199
 toggle+3.083
 Oprah+3.064
undo+3.054
Neg logit contributions
arius-3.419
asar-3.405
�醒-3.395
-3.360
mess-3.240
Token par
Token ID1582
Feature activation-3.37e-01

Pos logit contributions
ggles+2.987
geoning+2.533
 toggle+2.500
 voluntarily+2.419
 Oprah+2.408
Neg logit contributions
asar-4.575
arius-4.569
-4.554
�醒-4.531
mess-4.406
Tokenasy
Token ID4107
Feature activation-8.97e-01

Pos logit contributions
ggles+3.000
geoning+2.715
roxy+2.636
 Oprah+2.627
pees+2.617
Neg logit contributions
arius-2.399
�醒-2.354
asar-2.332
-2.329
mess-2.267
Tokenmp
Token ID3149
Feature activation-7.55e-01

Pos logit contributions
ggles+5.659
geoning+4.862
roxy+4.615
 Oprah+4.469
undo+4.391
Neg logit contributions
�醒-8.651
-8.568
arius-8.507
asar-8.137
 SERV-8.052
Tokenathetic
Token ID18874
Feature activation-1.85e+00

Pos logit contributions
ggles+3.537
roxy+2.896
geoning+2.825
 voluntarily+2.707
 Oprah+2.638
Neg logit contributions
�醒-8.136
-8.134
arius-8.104
asar-8.061
mess-7.907
Token (
Token ID357
Feature activation-2.85e-01

Pos logit contributions
ggles+10.046
 toggle+8.935
geoning+8.490
 toes+8.334
 sympathetic+8.259
Neg logit contributions
asar-15.472
�醒-15.336
arius-15.070
-14.326
mess-13.998
Token
Token ID447
Feature activation-7.88e-01

Pos logit contributions
ggles+2.310
geoning+2.070
pees+1.994
undo+1.990
roxy+1.988
Neg logit contributions
arius-2.256
�醒-2.215
asar-2.208
-2.182
mess-2.132
Token
Token ID250
Feature activation-7.98e-01

Pos logit contributions
ggles+7.177
geoning+6.389
roxy+6.274
 Oprah+6.225
undo+6.211
Neg logit contributions
asar-5.227
arius-5.196
mess-4.993
-4.941
�醒-4.827
Tokenrest
Token ID2118
Feature activation-1.08e+00

Pos logit contributions
ggles+5.923
geoning+5.231
undo+5.147
pees+5.108
roxy+5.075
Neg logit contributions
�醒-6.407
arius-6.272
asar-6.260
-6.246
 SERV-5.950
Token and
Token ID290
Feature activation-6.90e-01

Pos logit contributions
ggles+6.882
 voluntarily+6.209
 toggle+6.119
geoning+6.101
roxy+6.033
Neg logit contributions
-9.395
�醒-9.312
arius-9.104
asar-9.062
mess-8.768
Tokenackers
Token ID28874
Feature activation-3.50e-01

Pos logit contributions
ggles+3.124
geoning+2.809
roxy+2.686
undo+2.684
pees+2.658
Neg logit contributions
�醒-3.291
arius-3.266
-3.237
asar-3.173
mess-3.089
Token,
Token ID11
Feature activation-1.79e-01

Pos logit contributions
ggles+2.390
geoning+2.151
pees+2.071
 Oprah+2.070
 voluntarily+2.060
Neg logit contributions
arius-3.117
�醒-3.111
asar-3.061
-3.051
mess-2.980
Token with
Token ID351
Feature activation-4.09e-01

Pos logit contributions
ggles+1.418
geoning+1.275
 Oprah+1.237
pees+1.235
undo+1.222
Neg logit contributions
arius-1.451
�醒-1.431
asar-1.402
-1.396
mess-1.371
Token an
Token ID281
Feature activation-8.83e-01

Pos logit contributions
ggles+3.010
geoning+2.703
 Oprah+2.630
roxy+2.594
 toggle+2.590
Neg logit contributions
arius-3.492
�醒-3.467
asar-3.400
-3.391
mess-3.285
Token agent
Token ID5797
Feature activation-4.90e-01

Pos logit contributions
ggles+5.505
geoning+4.877
 Oprah+4.867
 toggle+4.781
undo+4.707
Neg logit contributions
�醒-8.223
arius-8.010
-8.009
asar-7.715
mess-7.641
Token telling
Token ID5149
Feature activation-1.82e+00

Pos logit contributions
ggles+3.403
geoning+3.068
 voluntarily+3.046
 Oprah+3.043
 toggle+3.013
Neg logit contributions
�醒-4.272
arius-4.185
-4.135
asar-4.113
mess-4.019
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
ggles+12.172
 Oprah+11.793
 toggle+10.863
 voluntarily+10.751
 cocaine+10.389
Neg logit contributions
�醒-13.119
arius-13.107
-12.599
asar-12.445
mess-11.596
Token reporter
Token ID9095
Feature activation-6.82e-01

Pos logit contributions
ggles+7.154
 Oprah+6.349
geoning+6.282
 toggle+6.271
 voluntarily+6.253
Neg logit contributions
arius-8.109
-8.077
�醒-8.018
asar-7.874
mess-7.638
Token he
Token ID339
Feature activation-7.32e-01

Pos logit contributions
ggles+4.531
 Oprah+4.054
geoning+4.019
 voluntarily+3.971
 toggle+3.910
Neg logit contributions
arius-6.073
�醒-5.929
asar-5.844
-5.762
mess-5.647
Token attends
Token ID32743
Feature activation-1.70e-03

Pos logit contributions
ggles+5.072
geoning+4.461
 voluntarily+4.427
pees+4.319
 toggle+4.275
Neg logit contributions
�醒-6.378
arius-6.251
asar-6.129
-6.065
mess-5.877
Token DIY
Token ID25134
Feature activation-2.86e-01

Pos logit contributions
ggles+0.013
geoning+0.012
 Oprah+0.011
pees+0.011
roxy+0.011
Neg logit contributions
arius-0.014
�醒-0.014
asar-0.014
-0.014
mess-0.014
Token explained
Token ID4893
Feature activation-2.46e-01

Pos logit contributions
arius+0.897
�醒+0.880
asar+0.874
+0.869
mess+0.848
Neg logit contributions
ggles-1.006
geoning-0.907
 Oprah-0.875
pees-0.873
roxy-0.871
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
ggles+1.866
geoning+1.658
 Oprah+1.612
undo+1.587
roxy+1.586
Neg logit contributions
arius-2.071
�醒-2.032
asar-2.024
-2.008
mess-1.963
Token NBC
Token ID9716
Feature activation-8.38e-01

Pos logit contributions
ggles+7.461
 Oprah+6.839
 toggle+6.310
geoning+6.289
 Seymour+6.229
Neg logit contributions
�醒-8.553
arius-8.413
-8.222
asar-8.035
mess-7.720
Token
Token ID447
Feature activation+3.27e-01

Pos logit contributions
ggles+4.874
 Oprah+4.174
geoning+4.151
 toggle+4.140
undo+4.003
Neg logit contributions
�醒-8.185
arius-8.044
asar-7.856
-7.776
mess-7.633
Token
Token ID247
Feature activation-4.20e-01

Pos logit contributions
arius+1.545
�醒+1.513
asar+1.485
+1.480
mess+1.412
Neg logit contributions
ggles-3.685
geoning-3.429
 Oprah-3.331
pees-3.328
roxy-3.322
Tokens
Token ID82
Feature activation-1.81e+00

Pos logit contributions
ggles+2.657
geoning+2.288
undo+2.199
 Oprah+2.199
roxy+2.165
Neg logit contributions
arius-4.071
�醒-4.023
asar-3.980
-3.946
mess-3.864
Token Matt
Token ID4705
Feature activation-2.70e-01

Pos logit contributions
 Oprah+11.519
ggles+11.205
 Seymour+10.596
 toggle+10.353
 voluntarily+10.243
Neg logit contributions
�醒-14.817
-13.570
arius-13.247
asar-12.867
mess-12.340
Token L
Token ID406
Feature activation+2.20e-01

Pos logit contributions
ggles+2.079
geoning+1.846
 Oprah+1.841
pees+1.793
 toggle+1.781
Neg logit contributions
�醒-2.219
arius-2.207
-2.159
asar-2.149
mess-2.087
Tokenauer
Token ID16261
Feature activation-6.73e-02

Pos logit contributions
arius+0.908
asar+0.877
�醒+0.840
+0.828
mess+0.809
Neg logit contributions
ggles-2.603
geoning-2.470
pees-2.401
 Oprah-2.389
roxy-2.364
Token that
Token ID326
Feature activation-3.14e-01

Pos logit contributions
ggles+0.532
geoning+0.475
 Oprah+0.457
pees+0.456
roxy+0.455
Neg logit contributions
arius-0.550
�醒-0.539
asar-0.536
-0.533
mess-0.522
Token he
Token ID339
Feature activation-4.60e-01

Pos logit contributions
ggles+2.396
geoning+2.142
 Oprah+2.123
 toggle+2.082
roxy+2.060
Neg logit contributions
arius-2.593
�醒-2.570
asar-2.534
-2.519
mess-2.437
Token ask
Token ID1265
Feature activation-2.85e-01

Pos logit contributions
ggles+2.190
 voluntarily+1.845
geoning+1.844
 toggle+1.841
undo+1.829
Neg logit contributions
arius-2.569
-2.526
�醒-2.515
asar-2.502
mess-2.439
Token Players
Token ID13094
Feature activation-5.12e-01

Pos logit contributions
ggles+2.361
geoning+2.084
 Oprah+2.068
 toggle+2.040
 voluntarily+2.028
Neg logit contributions
arius-2.204
�醒-2.187
-2.169
asar-2.168
mess-2.095
Token with
Token ID351
Feature activation-7.16e-01

Pos logit contributions
ggles+3.414
geoning+2.946
 voluntarily+2.940
undo+2.911
 Oprah+2.909
Neg logit contributions
arius-4.679
asar-4.614
�醒-4.590
-4.552
mess-4.417
Token "
Token ID366
Feature activation-6.09e-01

Pos logit contributions
ggles+5.010
 toggle+4.237
geoning+4.191
 voluntarily+4.170
 Oprah+4.123
Neg logit contributions
arius-6.220
-6.180
asar-6.146
�醒-6.121
mess-5.898
TokenNo
Token ID2949
Feature activation-6.61e-01

Pos logit contributions
ggles+3.838
undo+3.306
geoning+3.230
pees+3.173
roxy+3.167
Neg logit contributions
arius-5.751
�醒-5.663
-5.630
asar-5.590
 SERV-5.458
Token Move
Token ID10028
Feature activation-1.81e+00

Pos logit contributions
ggles+5.026
geoning+4.358
pees+4.231
 toggle+4.207
undo+4.200
Neg logit contributions
arius-5.465
asar-5.380
-5.359
�醒-5.317
mess-5.194
Token"
Token ID1
Feature activation-2.18e-01

Pos logit contributions
ggles+11.783
 toggle+9.912
 voluntarily+9.863
/_+9.773
binding+9.696
Neg logit contributions
-13.781
asar-13.748
�醒-13.223
arius-13.222
 SERV-12.538
Token clauses
Token ID31485
Feature activation-2.47e-01

Pos logit contributions
ggles+1.438
geoning+1.230
 Oprah+1.187
pees+1.186
 toggle+1.182
Neg logit contributions
arius-2.057
�醒-2.031
asar-2.028
-2.014
mess-1.963
Token whether
Token ID1771
Feature activation-4.49e-01

Pos logit contributions
ggles+1.783
geoning+1.553
 Oprah+1.530
 toggle+1.516
undo+1.510
Neg logit contributions
arius-2.151
�醒-2.136
asar-2.124
-2.112
mess-2.048
Token they
Token ID484
Feature activation-1.07e+00

Pos logit contributions
ggles+3.207
geoning+2.821
 Oprah+2.794
 voluntarily+2.782
 toggle+2.774
Neg logit contributions
�醒-3.883
arius-3.880
asar-3.812
-3.809
mess-3.691
Token want
Token ID765
Feature activation-2.69e-01

Pos logit contributions
ggles+6.519
 voluntarily+5.893
 toggle+5.617
undo+5.199
 annually+5.172
Neg logit contributions
�醒-10.476
-9.799
arius-9.732
asar-9.637
mess-9.272
Token 26
Token ID2608
Feature activation+4.54e-03

Pos logit contributions
ggles+5.550
 Oprah+5.124
edit+4.737
/_+4.714
 toggle+4.647
Neg logit contributions
�醒-12.056
-10.499
asar-10.353
arius-10.324
mess-10.123
Token.
Token ID13
Feature activation-1.55e-01

Pos logit contributions
arius+0.037
�醒+0.037
asar+0.036
+0.036
mess+0.035
Neg logit contributions
ggles-0.036
geoning-0.032
 Oprah-0.030
pees-0.030
roxy-0.030
Token
Token ID198
Feature activation-1.49e-02

Pos logit contributions
ggles+1.159
geoning+1.025
 Oprah+1.014
 toggle+0.986
pees+0.982
Neg logit contributions
arius-1.308
�醒-1.305
asar-1.288
-1.275
mess-1.244
Token
Token ID198
Feature activation-3.63e-01

Pos logit contributions
ggles+0.115
geoning+0.102
 Oprah+0.101
undo+0.099
 toggle+0.099
Neg logit contributions
�醒-0.127
arius-0.123
-0.120
asar-0.120
mess-0.116
TokenThanks
Token ID9690
Feature activation-1.11e+00

Pos logit contributions
ggles+2.497
geoning+2.186
undo+2.128
pees+2.105
League+2.103
Neg logit contributions
�醒-3.271
arius-3.256
asar-3.210
-3.162
mess-3.069
Token,
Token ID11
Feature activation-1.81e+00

Pos logit contributions
ggles+6.696
geoning+6.030
 Oprah+5.995
 toggle+5.968
roxy+5.695
Neg logit contributions
arius-9.979
asar-9.946
-9.914
�醒-9.885
mess-9.430
Token Wii
Token ID16591
Feature activation-1.42e+00

Pos logit contributions
ggles+12.067
 Oprah+11.307
 toggle+11.073
 voluntarily+10.411
 exclusively+10.401
Neg logit contributions
�醒-13.473
arius-12.475
asar-12.409
-12.367
mess-11.387
TokenUM
Token ID5883
Feature activation-3.21e-01

Pos logit contributions
ggles+9.723
undo+8.811
League+8.751
geoning+8.608
pees+8.594
Neg logit contributions
�醒-10.551
-10.513
asar-10.322
arius-10.292
 SERV-9.767
Tokenaster
Token ID1603
Feature activation-2.52e-01

Pos logit contributions
ggles+2.506
geoning+2.222
undo+2.174
 Oprah+2.151
roxy+2.149
Neg logit contributions
arius-2.602
�醒-2.593
-2.571
asar-2.539
mess-2.470
TokenRace
Token ID35157
Feature activation-6.94e-02

Pos logit contributions
ggles+1.935
geoning+1.730
 Oprah+1.670
roxy+1.661
pees+1.659
Neg logit contributions
arius-2.089
�醒-2.073
asar-2.048
-2.041
mess-1.981
Token<|endoftext|>
Token ID50256
Feature activation+2.59e-02

Pos logit contributions
ggles+0.552
geoning+0.494
 Oprah+0.475
pees+0.474
roxy+0.473
Neg logit contributions
arius-0.562
�醒-0.553
asar-0.549
-0.546
mess-0.534
Token migration
Token ID13472
Feature activation+1.59e-01

Pos logit contributions
ggles+3.580
 toggle+3.111
geoning+3.050
roxy+3.038
edit+3.027
Neg logit contributions
�醒-5.337
asar-5.317
arius-5.205
-5.105
 SERV-5.033
Token
Token ID447
Feature activation+4.95e-01

Pos logit contributions
arius+1.316
�醒+1.282
+1.281
asar+1.280
mess+1.257
Neg logit contributions
ggles-1.237
geoning-1.106
pees-1.060
 Oprah-1.060
roxy-1.059
Token
Token ID247
Feature activation+4.39e-01

Pos logit contributions
arius+2.261
�醒+2.226
+2.178
asar+2.131
mess+2.049
Neg logit contributions
ggles-5.577
geoning-5.266
 Oprah-5.077
roxy-5.075
pees-5.057
Token will
Token ID481
Feature activation+2.01e-01

Pos logit contributions
arius+3.615
+3.510
asar+3.506
�醒+3.463
mess+3.459
Neg logit contributions
ggles-3.406
geoning-3.027
roxy-2.912
 Oprah-2.903
pees-2.901
Token be
Token ID307
Feature activation-4.08e-01

Pos logit contributions
arius+1.502
asar+1.461
�醒+1.459
+1.451
mess+1.420
Neg logit contributions
ggles-1.698
geoning-1.546
pees-1.494
roxy-1.490
 Oprah-1.488
Token pre
Token ID662
Feature activation+1.33e+00

Pos logit contributions
ggles+2.948
 voluntarily+2.582
 Oprah+2.573
 toggle+2.547
geoning+2.546
Neg logit contributions
�醒-3.504
arius-3.494
-3.480
asar-3.416
mess-3.375
Token-
Token ID12
Feature activation+4.20e-01

Pos logit contributions
arius+8.217
mess+8.202
asar+7.608
+7.250
�醒+7.149
Neg logit contributions
ggles-10.822
geoning-9.986
roxy-9.724
ovan-9.701
 Oprah-9.674
Tokencond
Token ID17561
Feature activation+5.91e-02

Pos logit contributions
arius+3.551
mess+3.511
asar+3.478
+3.425
�醒+3.388
Neg logit contributions
ggles-3.127
geoning-2.757
roxy-2.662
 Oprah-2.662
 toggle-2.586
Tokenitions
Token ID1756
Feature activation+3.25e-02

Pos logit contributions
arius+0.480
�醒+0.470
asar+0.469
+0.464
mess+0.458
Neg logit contributions
ggles-0.468
geoning-0.418
 Oprah-0.404
pees-0.402
roxy-0.401
Token of
Token ID286
Feature activation+2.70e-01

Pos logit contributions
arius+0.267
�醒+0.262
asar+0.261
+0.259
mess+0.254
Neg logit contributions
ggles-0.254
geoning-0.227
 Oprah-0.219
pees-0.218
undo-0.217
Token any
Token ID597
Feature activation+3.73e-01

Pos logit contributions
arius+2.249
�醒+2.201
asar+2.192
+2.171
mess+2.130
Neg logit contributions
ggles-2.083
geoning-1.856
pees-1.789
roxy-1.787
 Oprah-1.783
Token traditional
Token ID4569
Feature activation+7.65e-01

Pos logit contributions
arius+8.296
asar+7.896
mess+7.697
+7.489
 SERV+7.422
Neg logit contributions
ggles-7.862
geoning-7.344
pees-7.103
roxy-7.079
undo-6.786
Token startup
Token ID13693
Feature activation+5.67e-01

Pos logit contributions
arius+6.037
asar+5.821
mess+5.741
�醒+5.663
+5.605
Neg logit contributions
ggles-5.932
geoning-5.461
pees-5.227
roxy-5.170
undo-5.115
Token and
Token ID290
Feature activation+6.70e-01

Pos logit contributions
arius+4.367
asar+4.220
mess+4.127
+4.082
�醒+4.054
Neg logit contributions
ggles-4.628
geoning-4.178
roxy-4.107
pees-4.035
undo-3.972
Token operating
Token ID5361
Feature activation+1.06e+00

Pos logit contributions
arius+5.642
asar+5.511
mess+5.409
+5.272
 SERV+5.261
Neg logit contributions
ggles-4.879
geoning-4.425
roxy-4.252
pees-4.188
undo-4.094
Token as
Token ID355
Feature activation+1.48e+00

Pos logit contributions
arius+8.357
mess+7.994
asar+7.993
 SERV+7.632
+7.584
Neg logit contributions
ggles-7.772
geoning-7.255
pees-6.794
roxy-6.762
undo-6.647
Token a
Token ID257
Feature activation+1.31e+00

Pos logit contributions
arius+9.879
asar+9.517
mess+9.388
 SERV+9.063
 messenger+8.787
Neg logit contributions
ggles-11.174
geoning-10.872
roxy-10.283
pees-10.195
undo-9.911
Token financial
Token ID3176
Feature activation+9.82e-01

Pos logit contributions
arius+10.226
mess+9.778
asar+9.713
 SERV+9.566
 messenger+9.392
Neg logit contributions
ggles-9.003
geoning-8.473
pees-8.173
edit-7.958
roxy-7.939
Token institution
Token ID9901
Feature activation+6.03e-01

Pos logit contributions
arius+6.506
mess+6.390
asar+6.389
 SERV+6.159
+6.101
Neg logit contributions
ggles-8.373
pees-7.770
geoning-7.705
roxy-7.402
undo-7.368
Token.
Token ID13
Feature activation+5.62e-01

Pos logit contributions
arius+4.463
asar+4.307
mess+4.215
+4.175
�醒+4.134
Neg logit contributions
ggles-5.121
geoning-4.572
roxy-4.465
pees-4.434
undo-4.376
Token Even
Token ID3412
Feature activation+8.47e-01

Pos logit contributions
arius+4.713
asar+4.590
+4.538
�醒+4.487
mess+4.479
Neg logit contributions
ggles-4.254
geoning-3.864
roxy-3.701
pees-3.661
undo-3.625
Token then
Token ID788
Feature activation+4.86e-01

Pos logit contributions
arius+6.312
asar+6.191
mess+6.000
+5.925
 SERV+5.857
Neg logit contributions
ggles-6.855
geoning-6.316
roxy-6.006
pees-5.992
undo-5.840
Token
Token ID198
Feature activation+5.29e-02

Pos logit contributions
arius+2.641
asar+2.564
+2.538
mess+2.513
�醒+2.493
Neg logit contributions
ggles-2.897
geoning-2.630
pees-2.537
roxy-2.524
undo-2.486
Token
Token ID198
Feature activation-8.72e-02

Pos logit contributions
�醒+0.440
arius+0.430
+0.420
asar+0.420
mess+0.406
Neg logit contributions
ggles-0.416
geoning-0.371
 Oprah-0.365
undo-0.359
 toggle-0.357
TokenApparently
Token ID30402
Feature activation+3.13e-01

Pos logit contributions
ggles+0.606
geoning+0.531
 Oprah+0.514
undo+0.510
pees+0.507
Neg logit contributions
arius-0.790
�醒-0.786
asar-0.775
-0.768
mess-0.751
Token she
Token ID673
Feature activation+3.10e-01

Pos logit contributions
arius+2.462
asar+2.378
+2.354
mess+2.342
�醒+2.335
Neg logit contributions
ggles-2.564
geoning-2.296
pees-2.205
roxy-2.182
undo-2.180
Token was
Token ID373
Feature activation+7.04e-01

Pos logit contributions
arius+2.663
asar+2.596
+2.571
�醒+2.563
mess+2.549
Neg logit contributions
ggles-2.273
geoning-2.049
roxy-1.948
pees-1.936
undo-1.935
Token originally
Token ID6198
Feature activation+1.49e+00

Pos logit contributions
arius+6.105
asar+5.895
mess+5.774
+5.703
�醒+5.663
Neg logit contributions
ggles-5.072
geoning-4.477
roxy-4.236
pees-4.196
undo-4.180
Token inspired
Token ID7867
Feature activation+8.96e-01

Pos logit contributions
arius+11.576
asar+10.944
mess+10.778
+10.427
 SERV+10.406
Neg logit contributions
ggles-10.588
geoning-9.398
undo-8.723
roxy-8.543
laim-8.424
Token by
Token ID416
Feature activation+2.01e-01

Pos logit contributions
arius+5.436
asar+5.238
+5.134
mess+5.070
�醒+4.945
Neg logit contributions
ggles-8.409
geoning-7.469
roxy-7.290
undo-7.237
pees-7.212
Token her
Token ID607
Feature activation-8.45e-02

Pos logit contributions
arius+1.607
asar+1.565
�醒+1.565
+1.559
mess+1.538
Neg logit contributions
ggles-1.601
geoning-1.439
pees-1.374
undo-1.374
roxy-1.366
Token love
Token ID1842
Feature activation-1.10e-01

Pos logit contributions
ggles+0.622
geoning+0.551
 Oprah+0.534
roxy+0.527
pees+0.526
Neg logit contributions
arius-0.734
�醒-0.725
asar-0.718
-0.715
mess-0.698
Token of
Token ID286
Feature activation-1.17e-01

Pos logit contributions
ggles+0.956
geoning+0.862
undo+0.832
 Oprah+0.829
pees+0.828
Neg logit contributions
arius-0.804
�醒-0.792
asar-0.782
-0.779
mess-0.766
Token lawsuit
Token ID8087
Feature activation+4.63e-01

Pos logit contributions
arius+1.136
�醒+1.116
asar+1.111
+1.104
mess+1.082
Neg logit contributions
ggles-1.017
geoning-0.905
 Oprah-0.869
pees-0.867
roxy-0.865
Token in
Token ID287
Feature activation-5.71e-02

Pos logit contributions
arius+3.570
asar+3.513
�醒+3.451
+3.450
mess+3.411
Neg logit contributions
ggles-3.814
geoning-3.433
roxy-3.276
pees-3.261
undo-3.249
Token the
Token ID262
Feature activation-1.46e-01

Pos logit contributions
ggles+0.438
geoning+0.388
 Oprah+0.377
pees+0.372
undo+0.371
Neg logit contributions
arius-0.483
�醒-0.475
-0.469
asar-0.469
mess-0.455
Token US
Token ID1294
Feature activation+2.76e-02

Pos logit contributions
ggles+1.145
geoning+1.022
 Oprah+0.999
 toggle+0.978
undo+0.974
Neg logit contributions
arius-1.215
�醒-1.199
-1.177
asar-1.170
mess-1.135
Token against
Token ID1028
Feature activation+4.09e-01

Pos logit contributions
arius+0.232
�醒+0.228
asar+0.226
+0.225
mess+0.220
Neg logit contributions
ggles-0.211
geoning-0.189
 Oprah-0.181
pees-0.181
undo-0.180
Token Airbus
Token ID39173
Feature activation+1.37e+00

Pos logit contributions
arius+3.137
asar+3.092
�醒+3.056
mess+3.024
+3.019
Neg logit contributions
ggles-3.346
geoning-3.023
roxy-2.909
pees-2.907
undo-2.887
Token and
Token ID290
Feature activation+3.85e-01

Pos logit contributions
arius+8.705
asar+8.659
 SERV+8.613
mess+8.489
�醒+8.357
Neg logit contributions
ggles-11.590
geoning-10.402
pees-10.230
roxy-10.004
undo-9.705
Token aerospace
Token ID40439
Feature activation+7.53e-01

Pos logit contributions
arius+3.207
asar+3.136
�醒+3.103
mess+3.077
+3.066
Neg logit contributions
ggles-2.925
geoning-2.646
pees-2.516
roxy-2.509
undo-2.492
Token company
Token ID1664
Feature activation+9.91e-01

Pos logit contributions
arius+4.924
asar+4.905
mess+4.879
�醒+4.791
+4.773
Neg logit contributions
ggles-6.783
geoning-6.044
pees-6.044
roxy-5.883
undo-5.871
Token North
Token ID2258
Feature activation+1.03e+00

Pos logit contributions
arius+6.778
asar+6.632
mess+6.591
 SERV+6.516
�醒+6.378
Neg logit contributions
ggles-8.256
geoning-7.606
pees-7.393
roxy-7.331
undo-7.181
Tokenrop
Token ID1773
Feature activation+3.95e-02

Pos logit contributions
arius+6.570
asar+6.419
mess+6.292
�醒+6.048
+5.953
Neg logit contributions
ggles-9.087
geoning-8.742
pees-8.372
undo-7.993
edit-7.985
Token-
Token ID12
Feature activation-1.04e+00

Pos logit contributions
ggles+4.922
geoning+4.352
pees+4.297
 Oprah+4.212
undo+4.177
Neg logit contributions
arius-4.767
�醒-4.691
asar-4.659
-4.575
mess-4.484
Tokenoperation
Token ID27184
Feature activation-5.23e-01

Pos logit contributions
ggles+5.705
binding+4.713
geoning+4.705
ilateral+4.700
pees+4.614
Neg logit contributions
�醒-10.464
arius-9.945
asar-9.908
-9.560
 SERV-9.429
Token.
Token ID13
Feature activation-5.33e-01

Pos logit contributions
ggles+3.646
geoning+3.231
 voluntarily+3.196
pees+3.121
 toggle+3.114
Neg logit contributions
�醒-4.607
arius-4.526
asar-4.514
-4.396
mess-4.324
Token
Token ID198
Feature activation-1.09e-02

Pos logit contributions
ggles+3.578
 Oprah+3.215
geoning+3.080
 toggle+3.040
League+3.021
Neg logit contributions
�醒-4.779
arius-4.666
asar-4.570
-4.569
mess-4.535
Token
Token ID198
Feature activation-4.75e-01

Pos logit contributions
ggles+0.084
geoning+0.074
 Oprah+0.074
 toggle+0.072
undo+0.072
Neg logit contributions
�醒-0.094
arius-0.091
-0.089
asar-0.088
mess-0.086
TokenJust
Token ID5703
Feature activation+1.19e+00

Pos logit contributions
ggles+3.169
geoning+2.774
edit+2.742
League+2.741
undo+2.723
Neg logit contributions
�醒-4.332
arius-4.281
asar-4.180
-4.126
 SERV-4.050
Token five
Token ID1936
Feature activation+9.26e-02

Pos logit contributions
asar+8.247
arius+8.164
mess+7.723
�醒+7.656
+7.505
Neg logit contributions
ggles-9.739
geoning-8.832
pees-8.446
undo-8.425
 Oprah-8.400
Token years
Token ID812
Feature activation+5.62e-01

Pos logit contributions
arius+0.697
asar+0.679
�醒+0.678
+0.674
mess+0.657
Neg logit contributions
ggles-0.791
geoning-0.713
 Oprah-0.687
roxy-0.686
pees-0.686
Token later
Token ID1568
Feature activation-3.74e-01

Pos logit contributions
arius+4.097
asar+4.043
�醒+3.918
+3.914
mess+3.898
Neg logit contributions
ggles-4.858
geoning-4.346
roxy-4.209
pees-4.138
 Oprah-4.102
Token,
Token ID11
Feature activation-4.67e-01

Pos logit contributions
ggles+2.742
geoning+2.440
 Oprah+2.380
roxy+2.330
pees+2.327
Neg logit contributions
arius-3.200
�醒-3.163
asar-3.118
-3.108
mess-3.051
Token in
Token ID287
Feature activation-5.22e-01

Pos logit contributions
ggles+3.333
 Oprah+2.926
geoning+2.909
 voluntarily+2.904
 toggle+2.846
Neg logit contributions
�醒-4.091
arius-4.013
-3.932
asar-3.858
mess-3.829
TokenT
Token ID51
Feature activation-3.03e-03

Pos logit contributions
arius+2.472
asar+2.419
+2.408
�醒+2.397
mess+2.367
Neg logit contributions
ggles-2.209
geoning-1.966
roxy-1.880
pees-1.878
 toggle-1.854
Tokenravis
Token ID16956
Feature activation-1.48e-01

Pos logit contributions
ggles+0.024
geoning+0.021
 Oprah+0.021
pees+0.021
roxy+0.020
Neg logit contributions
arius-0.025
�醒-0.024
asar-0.024
-0.024
mess-0.024
Token &
Token ID1222
Feature activation-1.43e-01

Pos logit contributions
ggles+1.187
geoning+1.057
 Oprah+1.036
pees+1.022
undo+1.019
Neg logit contributions
arius-1.178
�醒-1.164
-1.156
asar-1.154
mess-1.119
Token Protect
Token ID21916
Feature activation+1.19e+00

Pos logit contributions
ggles+1.114
geoning+0.990
 Oprah+0.971
undo+0.952
pees+0.951
Neg logit contributions
arius-1.170
�醒-1.159
asar-1.144
-1.142
mess-1.113
Tokening
Token ID278
Feature activation+6.88e-01

Pos logit contributions
arius+9.592
asar+9.208
mess+8.942
�醒+8.710
+8.679
Neg logit contributions
ggles-8.076
geoning-7.681
pees-7.235
roxy-7.213
 voluntarily-7.123
Token Your
Token ID3406
Feature activation+6.72e-01

Pos logit contributions
arius+5.508
asar+5.335
+5.219
�醒+5.200
mess+5.190
Neg logit contributions
ggles-5.199
geoning-4.833
roxy-4.598
pees-4.560
undo-4.488
Token Girl
Token ID7430
Feature activation+1.88e-01

Pos logit contributions
arius+5.134
asar+4.989
+4.931
�醒+4.906
mess+4.873
Neg logit contributions
ggles-5.366
geoning-4.907
pees-4.707
roxy-4.703
 toggle-4.655
Token In
Token ID554
Feature activation+5.40e-01

Pos logit contributions
arius+1.429
asar+1.402
�醒+1.394
+1.384
mess+1.363
Neg logit contributions
ggles-1.571
geoning-1.421
roxy-1.366
undo-1.365
pees-1.363
Token Public
Token ID5094
Feature activation+1.36e+00

Pos logit contributions
arius+4.412
asar+4.266
�醒+4.242
+4.151
mess+4.126
Neg logit contributions
ggles-4.137
geoning-3.825
roxy-3.645
undo-3.599
 toggle-3.586
Token-
Token ID12
Feature activation+3.51e-01

Pos logit contributions
arius+9.244
mess+9.241
asar+9.180
�醒+8.783
+8.716
Neg logit contributions
ggles-9.789
geoning-9.414
roxy-8.955
undo-8.909
pees-8.746
TokenWrit
Token ID20257
Feature activation+3.04e-01

Pos logit contributions
arius+2.956
asar+2.889
+2.878
�醒+2.852
mess+2.832
Neg logit contributions
ggles-2.657
geoning-2.367
roxy-2.263
pees-2.252
 toggle-2.228
Token Cup
Token ID5454
Feature activation-6.31e-01

Pos logit contributions
arius+1.437
asar+1.411
�醒+1.406
+1.400
mess+1.370
Neg logit contributions
ggles-1.245
geoning-1.113
pees-1.066
undo-1.056
 Oprah-1.056
Token Qual
Token ID9537
Feature activation-1.66e-01

Pos logit contributions
ggles+4.377
geoning+3.871
roxy+3.792
 Oprah+3.767
League+3.746
Neg logit contributions
�醒-5.540
arius-5.386
-5.237
mess-5.157
asar-5.155
Tokenifiers
Token ID13350
Feature activation-1.31e-01

Pos logit contributions
ggles+1.578
geoning+1.437
pees+1.387
roxy+1.386
 Oprah+1.385
Neg logit contributions
arius-1.090
�醒-1.081
-1.066
asar-1.056
mess-1.024
Token,
Token ID11
Feature activation+2.95e-01

Pos logit contributions
ggles+0.997
geoning+0.889
 Oprah+0.854
roxy+0.849
pees+0.849
Neg logit contributions
arius-1.098
�醒-1.094
asar-1.070
-1.069
mess-1.048
Token which
Token ID543
Feature activation+7.39e-02

Pos logit contributions
arius+2.390
asar+2.346
�醒+2.319
+2.318
mess+2.269
Neg logit contributions
ggles-2.338
geoning-2.102
pees-2.010
roxy-2.010
 Oprah-2.005
Token would
Token ID561
Feature activation+4.71e-01

Pos logit contributions
arius+0.637
�醒+0.624
asar+0.624
+0.620
mess+0.607
Neg logit contributions
ggles-0.548
geoning-0.488
pees-0.467
 Oprah-0.466
roxy-0.465
Token also
Token ID635
Feature activation+4.64e-01

Pos logit contributions
arius+3.719
asar+3.622
+3.597
�醒+3.523
mess+3.508
Neg logit contributions
ggles-3.685
geoning-3.394
pees-3.293
 Oprah-3.232
roxy-3.222
Token offer
Token ID2897
Feature activation-5.38e-01

Pos logit contributions
arius+3.785
asar+3.686
+3.657
�醒+3.600
mess+3.575
Neg logit contributions
ggles-3.539
geoning-3.235
pees-3.118
roxy-3.080
 Oprah-3.059
Token a
Token ID257
Feature activation-4.68e-02

Pos logit contributions
ggles+3.778
 Oprah+3.258
geoning+3.175
 toggle+3.164
undo+3.130
Neg logit contributions
�醒-4.829
arius-4.710
-4.625
asar-4.552
mess-4.488
Token great
Token ID1049
Feature activation+2.02e-02

Pos logit contributions
ggles+0.349
geoning+0.307
 Oprah+0.297
 toggle+0.295
roxy+0.294
Neg logit contributions
arius-0.402
�醒-0.400
asar-0.394
-0.393
mess-0.384
Token incentive
Token ID15660
Feature activation-3.87e-01

Pos logit contributions
arius+0.198
�醒+0.196
asar+0.194
+0.194
mess+0.190
Neg logit contributions
ggles-0.126
geoning-0.109
 Oprah-0.104
roxy-0.103
pees-0.103
Token fed
Token ID11672
Feature activation-6.73e-01

Pos logit contributions
arius+0.326
�醒+0.320
asar+0.318
+0.316
mess+0.310
Neg logit contributions
ggles-0.307
geoning-0.274
 Oprah-0.263
pees-0.263
roxy-0.262
Token state
Token ID1181
Feature activation-4.79e-01

Pos logit contributions
ggles+4.107
geoning+3.571
 toggle+3.427
undo+3.403
 Oprah+3.331
Neg logit contributions
arius-6.475
asar-6.402
-6.353
�醒-6.249
mess-6.122
Token.
Token ID13
Feature activation-1.61e-01

Pos logit contributions
ggles+3.421
geoning+3.026
undo+2.914
 Oprah+2.913
 toggle+2.901
Neg logit contributions
arius-4.199
�醒-4.108
asar-4.084
-4.070
mess-3.979
Token
Token ID198
Feature activation-1.02e-01

Pos logit contributions
ggles+1.233
 Oprah+1.088
geoning+1.070
 toggle+1.053
undo+1.049
Neg logit contributions
arius-1.354
�醒-1.335
asar-1.327
-1.308
mess-1.285
Token
Token ID198
Feature activation-2.70e-01

Pos logit contributions
ggles+0.763
geoning+0.674
 Oprah+0.673
 toggle+0.659
undo+0.656
Neg logit contributions
�醒-0.909
arius-0.866
-0.849
asar-0.844
mess-0.814
TokenTypically
Token ID49321
Feature activation+2.07e-01

Pos logit contributions
ggles+1.932
geoning+1.682
undo+1.669
 Oprah+1.647
pees+1.638
Neg logit contributions
arius-2.375
�醒-2.353
asar-2.332
-2.288
mess-2.228
Token,
Token ID11
Feature activation+1.93e-02

Pos logit contributions
arius+1.688
�醒+1.652
asar+1.644
+1.633
mess+1.605
Neg logit contributions
ggles-1.623
geoning-1.459
roxy-1.390
pees-1.384
undo-1.383
Token the
Token ID262
Feature activation+1.92e-01

Pos logit contributions
arius+0.163
�醒+0.160
asar+0.159
+0.158
mess+0.154
Neg logit contributions
ggles-0.148
geoning-0.131
 Oprah-0.128
pees-0.127
undo-0.126
Token fed
Token ID11672
Feature activation-6.24e-01

Pos logit contributions
arius+1.557
�醒+1.530
asar+1.518
+1.511
mess+1.479
Neg logit contributions
ggles-1.526
geoning-1.368
pees-1.310
 Oprah-1.309
roxy-1.308
Token state
Token ID1181
Feature activation-3.40e-01

Pos logit contributions
ggles+4.124
geoning+3.577
 toggle+3.488
undo+3.458
 Oprah+3.419
Neg logit contributions
arius-5.746
asar-5.656
-5.603
�醒-5.570
mess-5.423
Token starts
Token ID4940
Feature activation+2.97e-02

Pos logit contributions
ggles+2.275
geoning+1.945
 Oprah+1.903
 toggle+1.888
 voluntarily+1.874
Neg logit contributions
arius-3.199
�醒-3.156
asar-3.126
-3.100
mess-3.041
Token how
Token ID703
Feature activation-1.86e-01

Pos logit contributions
arius+1.825
�醒+1.789
asar+1.786
+1.775
mess+1.740
Neg logit contributions
ggles-1.575
geoning-1.401
 Oprah-1.338
pees-1.337
roxy-1.336
Token can
Token ID460
Feature activation+6.54e-01

Pos logit contributions
ggles+1.217
geoning+1.040
 Oprah+1.023
pees+1.010
 toggle+1.001
Neg logit contributions
�醒-1.764
arius-1.761
asar-1.726
-1.715
mess-1.680
Token it
Token ID340
Feature activation+8.96e-01

Pos logit contributions
arius+5.426
+5.272
asar+5.254
�醒+5.181
mess+5.178
Neg logit contributions
ggles-4.931
geoning-4.465
roxy-4.239
pees-4.158
undo-4.157
Token be
Token ID307
Feature activation+5.74e-01

Pos logit contributions
arius+6.545
mess+6.337
asar+6.293
+6.274
�醒+6.042
Neg logit contributions
ggles-7.273
geoning-6.607
pees-6.430
roxy-6.393
 Oprah-6.316
Token expanding
Token ID11581
Feature activation+4.57e-01

Pos logit contributions
arius+4.602
asar+4.492
+4.411
mess+4.401
�醒+4.390
Neg logit contributions
ggles-4.535
geoning-4.105
pees-3.910
roxy-3.891
undo-3.862
Token?
Token ID30
Feature activation+3.26e-01

Pos logit contributions
arius+3.643
asar+3.566
+3.491
�醒+3.488
mess+3.478
Neg logit contributions
ggles-3.663
geoning-3.271
pees-3.156
roxy-3.127
undo-3.096
Token Like
Token ID4525
Feature activation+2.33e-01

Pos logit contributions
arius+2.748
asar+2.695
+2.673
�醒+2.671
mess+2.627
Neg logit contributions
ggles-2.455
geoning-2.207
roxy-2.108
pees-2.095
undo-2.083
Token
Token ID1399
Feature activation+2.65e-02

Pos logit contributions
arius+1.959
asar+1.916
�醒+1.914
+1.901
mess+1.866
Neg logit contributions
ggles-1.773
geoning-1.584
pees-1.510
roxy-1.508
undo-1.504
Token what
Token ID644
Feature activation-6.65e-02

Pos logit contributions
arius+0.224
�醒+0.221
asar+0.219
+0.218
mess+0.213
Neg logit contributions
ggles-0.201
geoning-0.178
 Oprah-0.173
pees-0.171
undo-0.171
Token
Token ID447
Feature activation-1.81e-01

Pos logit contributions
ggles+0.490
geoning+0.429
 Oprah+0.421
pees+0.414
undo+0.413
Neg logit contributions
arius-0.578
�醒-0.575
asar-0.563
-0.562
mess-0.550
Token
Token ID247
Feature activation-3.17e-01

Pos logit contributions
ggles+1.768
undo+1.594
geoning+1.579
 Oprah+1.579
pees+1.558
Neg logit contributions
arius-1.113
asar-1.101
�醒-1.100
-1.062
mess-1.048
Token
Token ID251
Feature activation+4.41e-01

Pos logit contributions
arius+0.310
�醒+0.304
asar+0.299
+0.298
mess+0.288
Neg logit contributions
ggles-0.544
geoning-0.501
 Oprah-0.486
pees-0.485
roxy-0.484
Token
Token ID198
Feature activation+8.70e-02

Pos logit contributions
arius+3.542
+3.476
�醒+3.457
asar+3.442
mess+3.381
Neg logit contributions
ggles-3.490
geoning-3.141
roxy-3.025
pees-3.018
 Oprah-2.934
Token
Token ID198
Feature activation-5.73e-02

Pos logit contributions
�醒+0.716
arius+0.705
asar+0.688
+0.686
mess+0.666
Neg logit contributions
ggles-0.688
geoning-0.615
 Oprah-0.603
undo-0.593
 toggle-0.592
TokenCity
Token ID14941
Feature activation-3.91e-02

Pos logit contributions
ggles+0.417
geoning+0.368
undo+0.354
 Oprah+0.353
pees+0.353
Neg logit contributions
arius-0.502
�醒-0.496
asar-0.491
-0.486
mess-0.476
Token Council
Token ID4281
Feature activation+3.80e-02

Pos logit contributions
ggles+0.313
geoning+0.281
 Oprah+0.271
undo+0.270
pees+0.270
Neg logit contributions
arius-0.314
�醒-0.310
asar-0.307
-0.304
mess-0.297
Token
Token ID960
Feature activation+4.38e-02

Pos logit contributions
arius+0.323
�醒+0.318
asar+0.316
+0.315
mess+0.308
Neg logit contributions
ggles-0.287
geoning-0.255
 Oprah-0.245
pees-0.244
undo-0.244
TokenSix
Token ID21447
Feature activation-4.05e-01

Pos logit contributions
arius+0.374
�醒+0.367
asar+0.366
+0.364
mess+0.358
Neg logit contributions
ggles-0.329
geoning-0.293
 Oprah-0.281
pees-0.280
roxy-0.279
Token Years
Token ID13212
Feature activation-4.83e-02

Pos logit contributions
ggles+3.107
geoning+2.772
 Oprah+2.659
pees+2.656
undo+2.653
Neg logit contributions
arius-3.339
�醒-3.317
asar-3.253
-3.243
mess-3.139
Token In
Token ID554
Feature activation+2.20e-01

Pos logit contributions
ggles+0.382
geoning+0.342
 Oprah+0.329
pees+0.328
undo+0.327
Neg logit contributions
arius-0.392
�醒-0.386
asar-0.383
-0.381
mess-0.373
Token Search
Token ID11140
Feature activation+5.62e-01

Pos logit contributions
arius+1.685
asar+1.634
�醒+1.631
+1.608
mess+1.582
Neg logit contributions
ggles-1.853
geoning-1.669
roxy-1.627
 Oprah-1.602
undo-1.599
Token of
Token ID286
Feature activation+2.85e-01

Pos logit contributions
arius+3.920
�醒+3.838
mess+3.749
asar+3.720
+3.681
Neg logit contributions
ggles-4.941
geoning-4.332
roxy-4.303
pees-4.260
undo-4.211
Token.
Token ID13
Feature activation+4.31e-02

Pos logit contributions
ggles+1.028
geoning+0.915
 Oprah+0.885
pees+0.878
undo+0.877
Neg logit contributions
arius-1.182
�醒-1.170
asar-1.158
-1.154
mess-1.130
Token Seriously
Token ID27777
Feature activation-3.10e-01

Pos logit contributions
arius+0.367
�醒+0.363
asar+0.359
+0.357
mess+0.350
Neg logit contributions
ggles-0.325
geoning-0.287
 Oprah-0.280
pees-0.275
undo-0.275
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
ggles+2.290
geoning+2.049
 Oprah+1.988
undo+1.950
pees+1.949
Neg logit contributions
arius-2.652
�醒-2.630
asar-2.603
-2.579
mess-2.523
Token You
Token ID921
Feature activation-3.65e-01

Pos logit contributions
ggles+0.149
geoning+0.132
 Oprah+0.130
undo+0.126
pees+0.126
Neg logit contributions
arius-0.170
�醒-0.169
asar-0.166
-0.165
mess-0.162
Token know
Token ID760
Feature activation+1.68e-01

Pos logit contributions
ggles+2.548
geoning+2.207
 Oprah+2.171
 toggle+2.154
pees+2.142
Neg logit contributions
�醒-3.298
arius-3.228
asar-3.185
-3.157
mess-3.059
Token how
Token ID703
Feature activation-5.76e-01

Pos logit contributions
arius+1.304
asar+1.274
�醒+1.270
+1.263
mess+1.240
Neg logit contributions
ggles-1.393
geoning-1.253
roxy-1.205
pees-1.203
undo-1.199
Token Batman
Token ID9827
Feature activation-6.34e-01

Pos logit contributions
ggles+4.030
 Oprah+3.673
geoning+3.508
 toggle+3.502
 voluntarily+3.454
Neg logit contributions
�醒-5.051
arius-4.967
-4.877
asar-4.821
mess-4.722
Token uses
Token ID3544
Feature activation-2.01e-01

Pos logit contributions
ggles+4.416
geoning+3.760
pees+3.758
 voluntarily+3.731
 Oprah+3.718
Neg logit contributions
�醒-5.662
arius-5.541
asar-5.436
-5.309
mess-5.256
Token bats
Token ID19553
Feature activation+1.31e-02

Pos logit contributions
ggles+1.576
geoning+1.395
 Oprah+1.363
pees+1.352
 toggle+1.342
Neg logit contributions
arius-1.652
�醒-1.624
asar-1.607
-1.605
mess-1.562
Token to
Token ID284
Feature activation-7.83e-02

Pos logit contributions
arius+0.107
�醒+0.105
asar+0.105
+0.104
mess+0.102
Neg logit contributions
ggles-0.103
geoning-0.092
 Oprah-0.088
pees-0.088
roxy-0.088
Token
Token ID564
Feature activation-3.06e-01

Pos logit contributions
ggles+0.586
geoning+0.510
 Oprah+0.496
 toggle+0.494
undo+0.494
Neg logit contributions
arius-0.676
�醒-0.670
-0.660
asar-0.658
mess-0.642
Tokenfull
Token ID12853
Feature activation-5.66e-01

Pos logit contributions
ggles+0.365
geoning+0.326
 Oprah+0.318
pees+0.309
roxy+0.308
Neg logit contributions
arius-0.366
asar-0.360
�醒-0.357
mess-0.353
-0.351
Token v
Token ID410
Feature activation+3.62e-03

Pos logit contributions
ggles+3.701
geoning+3.221
 toggle+3.148
roxy+3.094
 Oprah+3.079
Neg logit contributions
arius-5.245
-5.177
�醒-5.170
asar-5.163
mess-4.990
Token)
Token ID8
Feature activation-3.47e-01

Pos logit contributions
arius+0.031
asar+0.030
�醒+0.030
+0.030
mess+0.030
Neg logit contributions
ggles-0.027
geoning-0.024
 Oprah-0.023
pees-0.023
roxy-0.023
Token {
Token ID1391
Feature activation-2.05e-01

Pos logit contributions
ggles+2.626
geoning+2.330
 Oprah+2.236
roxy+2.228
pees+2.227
Neg logit contributions
arius-2.935
�醒-2.882
asar-2.867
-2.858
mess-2.794
Token //
Token ID3373
Feature activation-1.68e-01

Pos logit contributions
ggles+1.466
geoning+1.288
 Oprah+1.237
 toggle+1.233
undo+1.233
Neg logit contributions
arius-1.822
�醒-1.794
asar-1.787
-1.786
mess-1.734
Token Time
Token ID3862
Feature activation-7.49e-02

Pos logit contributions
ggles+1.266
geoning+1.106
undo+1.067
 toggle+1.065
 Oprah+1.060
Neg logit contributions
arius-1.439
�醒-1.425
-1.414
asar-1.407
mess-1.359
Token component
Token ID7515
Feature activation-3.70e-01

Pos logit contributions
ggles+0.549
geoning+0.485
 Oprah+0.466
pees+0.464
undo+0.464
Neg logit contributions
arius-0.652
�醒-0.645
-0.639
asar-0.639
mess-0.622
Token const
Token ID1500
Feature activation+2.78e-01

Pos logit contributions
ggles+2.722
geoning+2.406
 toggle+2.365
 Oprah+2.318
undo+2.315
Neg logit contributions
�醒-3.144
arius-3.128
-3.123
asar-3.087
mess-2.987
Token float
Token ID12178
Feature activation-1.22e-01

Pos logit contributions
arius+2.481
asar+2.423
�醒+2.418
+2.391
mess+2.373
Neg logit contributions
ggles-1.967
geoning-1.755
 Oprah-1.668
pees-1.668
roxy-1.664
Token TA
Token ID21664
Feature activation-4.72e-02

Pos logit contributions
ggles+0.924
geoning+0.815
 Oprah+0.789
 toggle+0.787
undo+0.784
Neg logit contributions
arius-1.027
�醒-1.016
-1.010
asar-1.003
mess-0.974
TokenU
Token ID52
Feature activation+3.20e-01

Pos logit contributions
ggles+0.344
geoning+0.305
 Oprah+0.293
roxy+0.292
pees+0.291
Neg logit contributions
arius-0.413
�醒-0.405
asar-0.404
-0.401
mess-0.394
Token Post
Token ID2947
Feature activation-3.29e-01

Pos logit contributions
ggles+0.584
geoning+0.519
 Oprah+0.498
roxy+0.494
pees+0.494
Neg logit contributions
arius-0.686
�醒-0.676
asar-0.671
-0.667
mess-0.652
Token explained
Token ID4893
Feature activation-1.58e-01

Pos logit contributions
ggles+2.515
geoning+2.246
 toggle+2.162
undo+2.160
 Oprah+2.154
Neg logit contributions
arius-2.698
�醒-2.671
asar-2.660
-2.651
mess-2.581
Token how
Token ID703
Feature activation-2.76e-01

Pos logit contributions
ggles+1.212
geoning+1.079
 Oprah+1.041
undo+1.032
roxy+1.032
Neg logit contributions
arius-1.323
�醒-1.299
asar-1.293
-1.287
mess-1.258
Token,
Token ID11
Feature activation-3.78e-01

Pos logit contributions
ggles+2.099
geoning+1.869
 Oprah+1.824
 toggle+1.795
undo+1.791
Neg logit contributions
arius-2.306
�醒-2.284
asar-2.257
-2.256
mess-2.187
Token in
Token ID287
Feature activation-2.85e-01

Pos logit contributions
ggles+2.707
geoning+2.382
 Oprah+2.363
 voluntarily+2.361
 toggle+2.340
Neg logit contributions
�醒-3.263
arius-3.239
-3.197
asar-3.149
mess-3.074
Token early
Token ID1903
Feature activation-1.10e-01

Pos logit contributions
ggles+2.121
geoning+1.842
 Oprah+1.831
 toggle+1.803
 voluntarily+1.787
Neg logit contributions
�醒-2.460
-2.429
arius-2.420
asar-2.375
mess-2.313
Token 2008
Token ID3648
Feature activation-9.81e-02

Pos logit contributions
ggles+0.734
geoning+0.637
 Oprah+0.627
 toggle+0.615
undo+0.605
Neg logit contributions
�醒-1.034
arius-1.022
-1.009
asar-1.005
mess-0.974
Token,
Token ID11
Feature activation-3.94e-01

Pos logit contributions
ggles+0.784
geoning+0.696
pees+0.673
roxy+0.668
 Oprah+0.667
Neg logit contributions
arius-0.800
�醒-0.788
asar-0.782
-0.777
mess-0.762
Token US
Token ID1294
Feature activation-5.60e-01

Pos logit contributions
ggles+2.953
geoning+2.623
 Oprah+2.586
 toggle+2.538
 voluntarily+2.509
Neg logit contributions
arius-3.296
�醒-3.267
-3.223
asar-3.199
mess-3.098
Token Senator
Token ID8962
Feature activation-1.58e-01

Pos logit contributions
ggles+4.252
geoning+3.879
 toggle+3.740
 Oprah+3.728
undo+3.690
Neg logit contributions
�醒-4.459
arius-4.438
asar-4.411
-4.298
mess-4.191
Token Frank
Token ID5278
Feature activation-1.69e-02

Pos logit contributions
ggles+1.082
geoning+0.947
 Oprah+0.937
 toggle+0.902
pees+0.898
Neg logit contributions
arius-1.444
�醒-1.429
-1.412
asar-1.410
mess-1.369
Token regarded
Token ID11987
Feature activation+4.94e-01

Pos logit contributions
arius+0.112
�醒+0.111
+0.109
asar+0.109
mess+0.107
Neg logit contributions
ggles-0.103
geoning-0.091
 Oprah-0.088
roxy-0.088
undo-0.087
Token as
Token ID355
Feature activation+1.84e-01

Pos logit contributions
arius+2.983
asar+2.919
+2.881
mess+2.798
�醒+2.793
Neg logit contributions
ggles-4.929
pees-4.376
geoning-4.365
roxy-4.252
undo-4.242
Token separate
Token ID4553
Feature activation+1.97e-01

Pos logit contributions
arius+1.592
�醒+1.561
asar+1.558
+1.547
mess+1.517
Neg logit contributions
ggles-1.357
geoning-1.205
pees-1.155
 Oprah-1.152
roxy-1.150
Token species
Token ID4693
Feature activation+4.08e-01

Pos logit contributions
arius+1.448
�醒+1.418
asar+1.410
+1.401
mess+1.367
Neg logit contributions
ggles-1.723
geoning-1.557
 Oprah-1.504
pees-1.501
roxy-1.498
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
arius+3.334
asar+3.244
�醒+3.219
+3.214
mess+3.153
Neg logit contributions
ggles-3.240
geoning-2.869
pees-2.777
roxy-2.754
 Oprah-2.724
Token
Token ID447
Feature activation-7.77e-02

Pos logit contributions
arius+1.380
�醒+1.356
asar+1.349
+1.341
mess+1.313
Neg logit contributions
ggles-1.232
geoning-1.096
 Oprah-1.054
pees-1.050
roxy-1.048
Token
Token ID251
Feature activation+4.37e-01

Pos logit contributions
ggles+0.636
 Oprah+0.567
geoning+0.562
undo+0.560
roxy+0.553
Neg logit contributions
arius-0.605
�醒-0.599
asar-0.597
-0.584
mess-0.578
Token said
Token ID531
Feature activation-4.54e-02

Pos logit contributions
arius+3.496
+3.431
asar+3.425
�醒+3.397
mess+3.359
Neg logit contributions
ggles-3.494
geoning-3.119
pees-2.993
roxy-2.948
 Oprah-2.921
Token Dr
Token ID1583
Feature activation-2.08e-01

Pos logit contributions
ggles+0.343
geoning+0.305
 Oprah+0.293
pees+0.292
roxy+0.291
Neg logit contributions
arius-0.385
�醒-0.379
asar-0.376
-0.374
mess-0.367
Token Potter
Token ID14179
Feature activation+6.18e-01

Pos logit contributions
ggles+1.676
geoning+1.503
 Oprah+1.453
pees+1.443
undo+1.441
Neg logit contributions
arius-1.656
�醒-1.629
asar-1.616
-1.611
mess-1.574
Token,
Token ID11
Feature activation+8.70e-01

Pos logit contributions
asar+4.236
�醒+4.233
arius+4.224
+4.174
mess+4.032
Neg logit contributions
ggles-5.531
geoning-4.889
pees-4.810
roxy-4.695
 Oprah-4.690
Token the
Token ID262
Feature activation+1.62e-01

Pos logit contributions
arius+3.765
asar+3.686
+3.656
�醒+3.606
mess+3.596
Neg logit contributions
ggles-3.914
geoning-3.499
pees-3.369
roxy-3.344
undo-3.338
Token other
Token ID584
Feature activation-4.06e-01

Pos logit contributions
arius+1.290
asar+1.261
�醒+1.261
+1.253
mess+1.229
Neg logit contributions
ggles-1.302
geoning-1.170
pees-1.124
 Oprah-1.118
roxy-1.118
Token night
Token ID1755
Feature activation+1.83e-01

Pos logit contributions
ggles+2.807
geoning+2.494
 Oprah+2.462
 toggle+2.425
undo+2.382
Neg logit contributions
�醒-3.627
arius-3.606
asar-3.508
-3.507
mess-3.353
Token.
Token ID13
Feature activation+2.95e-03

Pos logit contributions
arius+1.474
asar+1.447
+1.439
�醒+1.438
mess+1.409
Neg logit contributions
ggles-1.469
geoning-1.303
roxy-1.252
pees-1.250
undo-1.249
Token A
Token ID317
Feature activation+2.05e-01

Pos logit contributions
arius+0.024
�醒+0.024
asar+0.024
+0.024
mess+0.023
Neg logit contributions
ggles-0.023
geoning-0.020
 Oprah-0.020
pees-0.020
roxy-0.020
Token specialist
Token ID15670
Feature activation-2.79e-01

Pos logit contributions
arius+1.665
�醒+1.626
asar+1.625
+1.613
mess+1.584
Neg logit contributions
ggles-1.624
geoning-1.451
pees-1.400
 Oprah-1.396
roxy-1.392
Token in
Token ID287
Feature activation-3.47e-01

Pos logit contributions
ggles+1.963
geoning+1.741
 Oprah+1.704
 toggle+1.683
undo+1.680
Neg logit contributions
�醒-2.474
arius-2.457
-2.426
asar-2.412
mess-2.349
Token Czech
Token ID16639
Feature activation+4.96e-01

Pos logit contributions
ggles+2.626
geoning+2.331
 Oprah+2.289
 toggle+2.252
roxy+2.238
Neg logit contributions
arius-2.928
�醒-2.909
-2.842
asar-2.836
mess-2.734
Token small
Token ID1402
Feature activation+2.96e-01

Pos logit contributions
arius+3.974
asar+3.912
+3.878
mess+3.832
�醒+3.814
Neg logit contributions
ggles-3.900
geoning-3.490
pees-3.331
 Oprah-3.323
undo-3.317
Tokenness
Token ID1108
Feature activation+1.81e-01

Pos logit contributions
arius+2.467
asar+2.424
+2.370
mess+2.362
�醒+2.345
Neg logit contributions
ggles-2.260
geoning-2.014
 Oprah-1.945
roxy-1.942
undo-1.924
Token and
Token ID290
Feature activation-1.03e-02

Pos logit contributions
arius+1.434
asar+1.398
�醒+1.394
+1.387
mess+1.358
Neg logit contributions
ggles-1.478
geoning-1.324
roxy-1.272
pees-1.271
 Oprah-1.268
Token gra
Token ID7933
Feature activation+4.81e-02

Pos logit contributions
ggles+3.796
 voluntarily+3.252
 toggle+3.244
geoning+3.134
 Oprah+3.097
Neg logit contributions
�醒-5.226
-5.129
arius-5.128
mess-4.940
asar-4.924
Tokenze
Token ID2736
Feature activation-4.82e-01

Pos logit contributions
arius+0.443
�醒+0.437
asar+0.434
+0.433
mess+0.423
Neg logit contributions
ggles-0.330
geoning-0.289
 Oprah-0.276
pees-0.275
roxy-0.274
Token sheep
Token ID15900
Feature activation-8.07e-02

Pos logit contributions
ggles+3.312
 voluntarily+2.952
geoning+2.908
 Oprah+2.907
 toggle+2.867
Neg logit contributions
�醒-4.330
arius-4.270
asar-4.209
-4.198
mess-4.054
Token on
Token ID319
Feature activation-9.20e-01

Pos logit contributions
ggles+0.626
geoning+0.559
 Oprah+0.538
pees+0.537
roxy+0.535
Neg logit contributions
arius-0.668
�醒-0.657
asar-0.652
-0.649
mess-0.635
Token any
Token ID597
Feature activation-6.33e-01

Pos logit contributions
ggles+6.037
 Oprah+5.395
 voluntarily+5.380
 toggle+5.316
geoning+5.200
Neg logit contributions
�醒-8.254
arius-8.089
asar-7.954
-7.849
mess-7.593
Token golf
Token ID13126
Feature activation-1.43e+00

Pos logit contributions
ggles+4.241
 voluntarily+3.729
 toggle+3.691
geoning+3.691
 Oprah+3.682
Neg logit contributions
�醒-5.717
arius-5.641
asar-5.585
-5.456
mess-5.385
Token course
Token ID1781
Feature activation-5.39e-01

Pos logit contributions
ggles+7.958
geoning+7.312
/_+7.015
 Oprah+6.982
 voluntarily+6.939
Neg logit contributions
�醒-12.812
arius-12.186
asar-12.079
-12.042
 SERV-11.672
Token owned
Token ID6898
Feature activation-2.11e-02

Pos logit contributions
ggles+3.612
 Oprah+3.287
 voluntarily+3.266
geoning+3.248
 toggle+3.178
Neg logit contributions
�醒-4.879
arius-4.755
-4.688
asar-4.675
mess-4.522
Token by
Token ID416
Feature activation-4.12e-01

Pos logit contributions
ggles+0.188
geoning+0.168
roxy+0.165
pees+0.163
undo+0.163
Neg logit contributions
arius-0.153
�醒-0.148
asar-0.148
-0.147
mess-0.143
Token Donald
Token ID3759
Feature activation-1.16e+00

Pos logit contributions
ggles+3.014
geoning+2.672
 Oprah+2.652
 toggle+2.569
 voluntarily+2.564
Neg logit contributions
arius-3.548
�醒-3.518
asar-3.449
-3.417
mess-3.326
Token Trump
Token ID1301
Feature activation-6.42e-01

Pos logit contributions
ggles+7.774
 Oprah+7.113
geoning+7.032
 toggle+6.771
 voluntarily+6.749
Neg logit contributions
�醒-9.470
asar-9.358
arius-9.348
-9.226
mess-9.033
Token
Token ID198
Feature activation-2.48e-01

Pos logit contributions
ggles+2.667
 Oprah+2.396
edit+2.351
/_+2.333
geoning+2.313
Neg logit contributions
�醒-4.003
arius-3.629
-3.603
asar-3.538
 SERV-3.431
TokenRes
Token ID4965
Feature activation+5.00e-01

Pos logit contributions
ggles+1.754
geoning+1.550
undo+1.489
roxy+1.481
pees+1.480
Neg logit contributions
�醒-2.195
arius-2.192
asar-2.152
-2.146
mess-2.068
Tokena
Token ID64
Feature activation+6.91e-01

Pos logit contributions
arius+2.583
asar+2.517
+2.415
mess+2.395
�醒+2.393
Neg logit contributions
ggles-5.420
geoning-5.010
 Oprah-4.861
pees-4.856
roxy-4.830
Tokenech
Token ID3055
Feature activation+1.39e-01

Pos logit contributions
arius+4.635
asar+4.518
+4.257
mess+4.205
�醒+4.111
Neg logit contributions
ggles-6.304
geoning-5.790
roxy-5.594
 Oprah-5.591
 toggle-5.578
Token says
Token ID1139
Feature activation-7.07e-01

Pos logit contributions
arius+1.043
asar+1.015
�醒+1.011
+1.005
mess+0.986
Neg logit contributions
ggles-1.178
geoning-1.070
 Oprah-1.026
roxy-1.026
pees-1.025
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
ggles+4.993
geoning+4.499
 Oprah+4.329
undo+4.277
 toggle+4.270
Neg logit contributions
arius-6.123
�醒-6.020
asar-6.012
-5.985
mess-5.812
Token '
Token ID705
Feature activation+3.09e-01

Pos logit contributions
ggles+5.974
geoning+5.299
 toggle+4.915
 Oprah+4.896
undo+4.850
Neg logit contributions
-13.765
arius-13.748
�醒-13.634
asar-13.614
mess-13.123
Token With
Token ID2080
Feature activation+2.78e-01

Pos logit contributions
arius+2.607
asar+2.549
�醒+2.537
+2.524
mess+2.485
Neg logit contributions
ggles-2.337
geoning-2.082
pees-2.000
roxy-1.992
 Oprah-1.990
Token the
Token ID262
Feature activation+4.96e-01

Pos logit contributions
arius+2.261
asar+2.206
�醒+2.189
+2.177
mess+2.149
Neg logit contributions
ggles-2.198
geoning-1.968
pees-1.893
 Oprah-1.886
roxy-1.881
Token information
Token ID1321
Feature activation-2.42e-02

Pos logit contributions
arius+4.136
asar+4.021
mess+3.959
+3.948
�醒+3.900
Neg logit contributions
ggles-3.736
geoning-3.336
pees-3.257
roxy-3.199
undo-3.181
Token we
Token ID356
Feature activation+5.59e-01

Pos logit contributions
ggles+0.178
geoning+0.157
 Oprah+0.152
pees+0.151
roxy+0.151
Neg logit contributions
arius-0.209
�醒-0.208
-0.205
asar-0.204
mess-0.200
Token Lor
Token ID26068
Feature activation+3.59e-01

Pos logit contributions
ggles+2.650
geoning+2.393
 Oprah+2.334
pees+2.283
 toggle+2.281
Neg logit contributions
�醒-2.793
arius-2.765
-2.751
asar-2.703
mess-2.601
Tokenre
Token ID260
Feature activation+4.87e-02

Pos logit contributions
arius+4.167
asar+4.083
mess+4.004
+3.999
�醒+3.974
Neg logit contributions
ggles-1.601
geoning-1.333
 Oprah-1.263
 toggle-1.217
pees-1.201
Token)
Token ID8
Feature activation-7.12e-01

Pos logit contributions
arius+0.435
asar+0.427
�醒+0.426
+0.424
mess+0.416
Neg logit contributions
ggles-0.347
geoning-0.306
 Oprah-0.292
pees-0.292
roxy-0.292
Token does
Token ID857
Feature activation-1.06e+00

Pos logit contributions
ggles+4.874
 voluntarily+4.303
 Oprah+4.265
geoning+4.263
pees+4.229
Neg logit contributions
�醒-6.265
arius-6.132
-6.029
asar-5.973
mess-5.826
Token something
Token ID1223
Feature activation-9.56e-01

Pos logit contributions
ggles+7.369
 Oprah+6.292
geoning+6.281
roxy+6.209
 voluntarily+6.200
Neg logit contributions
arius-9.007
-8.915
�醒-8.752
asar-8.530
mess-8.407
Token to
Token ID284
Feature activation-1.17e+00

Pos logit contributions
ggles+6.081
 Oprah+5.604
 voluntarily+5.528
 toggle+5.402
geoning+5.241
Neg logit contributions
�醒-8.720
arius-8.410
-8.276
asar-8.050
mess-7.866
Token him
Token ID683
Feature activation-7.87e-01

Pos logit contributions
ggles+8.259
 Oprah+7.171
 toggle+7.150
undo+6.702
 Seymour+6.670
Neg logit contributions
�醒-9.664
arius-9.472
-9.418
asar-9.070
 SERV-8.939
Token,
Token ID11
Feature activation-8.76e-01

Pos logit contributions
ggles+5.203
geoning+4.715
 voluntarily+4.669
 Oprah+4.655
 toggle+4.558
Neg logit contributions
arius-6.790
�醒-6.760
-6.631
asar-6.560
mess-6.360
Token but
Token ID475
Feature activation-6.78e-01

Pos logit contributions
ggles+5.960
 Oprah+5.392
 voluntarily+5.380
 toggle+5.265
geoning+5.191
Neg logit contributions
�醒-7.497
arius-7.377
-7.290
asar-7.033
mess-6.926
Token we
Token ID356
Feature activation-8.10e-01

Pos logit contributions
ggles+4.984
 Oprah+4.406
 voluntarily+4.289
 toggle+4.232
roxy+4.203
Neg logit contributions
�醒-5.859
arius-5.720
-5.693
asar-5.524
mess-5.369
Token're
Token ID821
Feature activation-1.17e-01

Pos logit contributions
ggles+5.243
 toggle+4.391
 voluntarily+4.297
geoning+4.193
 Oprah+4.186
Neg logit contributions
�醒-7.881
-7.453
arius-7.333
asar-7.225
mess-7.063
Tokenty
Token ID774
Feature activation-1.28e-01

Pos logit contributions
ggles+0.430
geoning+0.376
 Oprah+0.362
pees+0.358
roxy+0.357
Neg logit contributions
arius-0.590
�醒-0.584
asar-0.579
-0.575
mess-0.566
Token-
Token ID12
Feature activation-1.89e-01

Pos logit contributions
ggles+0.955
geoning+0.853
 Oprah+0.830
 toggle+0.820
roxy+0.816
Neg logit contributions
arius-1.092
�醒-1.083
asar-1.071
-1.060
mess-1.034
Tokenvers
Token ID690
Feature activation+3.87e-01

Pos logit contributions
ggles+1.304
geoning+1.149
pees+1.108
undo+1.106
roxy+1.099
Neg logit contributions
�醒-1.703
arius-1.703
asar-1.670
-1.667
mess-1.610
Tokenus
Token ID385
Feature activation+2.58e-01

Pos logit contributions
arius+2.600
asar+2.472
�醒+2.421
+2.400
mess+2.371
Neg logit contributions
ggles-3.623
geoning-3.292
 Oprah-3.229
roxy-3.227
pees-3.224
Token-
Token ID12
Feature activation-5.95e-01

Pos logit contributions
arius+1.661
�醒+1.659
+1.627
mess+1.581
asar+1.572
Neg logit contributions
ggles-2.440
geoning-2.197
roxy-2.168
pees-2.164
undo-2.162
Tokensw
Token ID2032
Feature activation-1.39e+00

Pos logit contributions
ggles+4.365
undo+3.876
geoning+3.854
pees+3.797
edit+3.747
Neg logit contributions
�醒-5.029
arius-4.873
-4.829
asar-4.813
 SERV-4.669
Tokenorn
Token ID1211
Feature activation+4.99e-01

Pos logit contributions
ggles+10.327
geoning+9.153
edit+9.056
roxy+9.014
ilateral+8.961
Neg logit contributions
�醒-10.609
-10.135
 SERV-9.581
arius-9.382
asar-9.208
Token-
Token ID12
Feature activation-4.46e-01

Pos logit contributions
arius+3.795
+3.712
�醒+3.694
asar+3.682
mess+3.634
Neg logit contributions
ggles-4.152
geoning-3.715
pees-3.610
roxy-3.587
undo-3.570
Tokenen
Token ID268
Feature activation-4.52e-02

Pos logit contributions
ggles+3.123
geoning+2.733
undo+2.701
pees+2.684
roxy+2.635
Neg logit contributions
�醒-3.949
arius-3.869
asar-3.824
-3.821
 SERV-3.670
Tokenm
Token ID76
Feature activation-9.86e-02

Pos logit contributions
ggles+0.380
geoning+0.342
 Oprah+0.329
pees+0.329
roxy+0.328
Neg logit contributions
arius-0.348
�醒-0.339
asar-0.338
-0.335
mess-0.329
Tokenity
Token ID414
Feature activation-1.23e-01

Pos logit contributions
ggles+0.931
geoning+0.849
 Oprah+0.823
pees+0.821
roxy+0.820
Neg logit contributions
arius-0.653
�醒-0.635
asar-0.633
-0.627
mess-0.611
Token officers
Token ID3790
Feature activation-4.88e-02

Pos logit contributions
arius+5.260
+5.155
asar+5.132
�醒+5.115
mess+5.077
Neg logit contributions
ggles-5.120
geoning-4.624
pees-4.395
undo-4.334
 Oprah-4.317
Token have
Token ID423
Feature activation+4.00e-03

Pos logit contributions
ggles+0.352
geoning+0.312
 Oprah+0.302
pees+0.299
undo+0.298
Neg logit contributions
arius-0.428
�醒-0.425
asar-0.419
-0.418
mess-0.409
Token the
Token ID262
Feature activation-3.82e-01

Pos logit contributions
arius+0.033
�醒+0.033
asar+0.032
+0.032
mess+0.031
Neg logit contributions
ggles-0.031
geoning-0.028
 Oprah-0.027
pees-0.027
roxy-0.027
Token
Token ID564
Feature activation-4.61e-01

Pos logit contributions
ggles+2.508
geoning+2.149
 toggle+2.102
 Oprah+2.090
 voluntarily+2.063
Neg logit contributions
arius-3.617
�醒-3.588
-3.530
asar-3.526
mess-3.436
Token
Token ID250
Feature activation-1.07e+00

Pos logit contributions
ggles+4.431
geoning+4.010
undo+3.945
pees+3.930
 Oprah+3.920
Neg logit contributions
arius-2.885
asar-2.840
-2.747
mess-2.739
�醒-2.715
Tokenhuge
Token ID40878
Feature activation-1.31e+00

Pos logit contributions
ggles+7.802
undo+7.092
edit+6.875
League+6.874
pees+6.858
Neg logit contributions
�醒-8.329
arius-8.221
-8.018
asar-7.977
 SERV-7.916
Token responsibility
Token ID5798
Feature activation-5.78e-01

Pos logit contributions
ggles+8.333
geoning+7.397
undo+7.380
 toggle+7.307
 Oprah+7.092
Neg logit contributions
arius-10.421
�醒-10.191
asar-10.145
-10.114
 SERV-9.868
Token
Token ID447
Feature activation+2.80e-01

Pos logit contributions
ggles+4.211
geoning+3.799
undo+3.668
pees+3.626
 toggle+3.619
Neg logit contributions
arius-4.883
�醒-4.809
asar-4.792
-4.725
mess-4.625
Token
Token ID251
Feature activation-3.17e-01

Pos logit contributions
arius+1.556
�醒+1.536
+1.510
asar+1.494
mess+1.432
Neg logit contributions
ggles-2.913
geoning-2.693
 Oprah-2.606
pees-2.605
roxy-2.598
Token of
Token ID286
Feature activation-2.15e-01

Pos logit contributions
ggles+2.260
geoning+2.022
undo+1.961
 Oprah+1.949
pees+1.936
Neg logit contributions
arius-2.762
�醒-2.739
asar-2.703
-2.683
mess-2.617
Token looking
Token ID2045
Feature activation+1.52e-01

Pos logit contributions
ggles+1.412
geoning+1.229
 Oprah+1.197
 toggle+1.182
 voluntarily+1.178
Neg logit contributions
arius-2.025
�醒-2.021
-1.988
asar-1.987
mess-1.930