TOP ACTIVATIONS
MAX = 0.637

were s omew hat satisfied .
Mer ri am - Web ster 's third definition of value
Comm ander of the On mits uk
, showed trustees were reimb ursed for hand lot ion ,
it was dec ided ly average and
Luxembourg premier Jean - Cla ude Jun cker , she hoped
the process of reim plement ing our message bus with this
first on . <|endoftext|> Mon ths of anticipation ( mostly from
GO AL ! pic . twitter . com / E q
- Lou CK & Mar riages - E arl Lu vs
t complain , one supp oses . And so
pattern of tax - avoid ance . My own
start developing in AWS Lamb da with server less framework .
put Brock way , Og den ville and North H aver
P oul iot and Mat hew Dumb a has made his
if they could be reprim anded . J SO said since
that holds a " nom inal interest " in White Marsh
What is CAT Adjust ing ? Cat ast
many friends , who affection ately called him " Short y
8 b ET pic . twitter . com / W 8

MIN ACTIVATIONS
MIN = -0.494

a mob of unnecessary dupl icates . . ."
with Love conveyed through lyr ical themes of pregnancy and the
Gay Away ser mons , the man who once
door . Ident ifying tags and Hebrew logos have
conditions added at his arra ignment in December 2016 .
Ke pler - 10 c didn
to wave away policy sill iness with a boys - will
density to a grid simpl ifies computing spatial derivatives . So
set of background presupp os itions insofar as each is reacting
Startup India programme is deb atable , but there is a
some availability on its cru ises ( 08 71 231 4
that changes occur in rehears als and things that are suggested
EL VE ) quarterback hur ries . He is a very
- 10 planetary system revis ited by HAR PS - N
, Egypt , and cru ises via A q aba ,
and golden on the bott oms . The tops of the
7 ) K och companies letter to Sen .
split first - team repet itions . Lyn ch
rage . polit ically correct o omp a lo
creatures in place while ampl ifying damage from other weapons ,

INTERVAL 0.354 - 0.637
CONTAINS 0.014%

ph ians use a neutral ized short a is special to
OPS on AWS Cloud Inst ance . This is
Be safe pic . twitter . com / 74 E
, the R uling Stew ard of the land of G
be hurt by accidentally bump ing the shif ter into reverse

INTERVAL 0.072 - 0.354
CONTAINS 6.806%

, who flee into the " forest " of Hu orns
the accused for causing destruction to the environment . Later on
I put out a little shed in a parking lot and
new settlement is devised to Juda ise the city . The
Carly Fiorina 's political career Fiorina , right , poses for

INTERVAL -0.211 - 0.072
CONTAINS 93.020%

in large red letters on the structure and also on the
iva ). 3 . Xen H VM 3 .
percent may not be a res ounding endorsement , it topped
in his stories , nor does he sugar coat the realities
Make a break for it ( 3 )

INTERVAL -0.494 - -0.211
CONTAINS 0.160%

be in a fren z ied state as he began shouting
Law - abiding , Ded icated , Log ical
of toxic al gal blo oms in the North Atlantic and
campaign . Instead of Techn icol or , Hitchcock shot it
Lever age your natural overnight fast by skipping breakfast ( dr
Token were
Token ID547
Feature activation+6.33e-02

Pos logit contributions
��+0.007
eteen+0.006
yip+0.006
��+0.005
earchers+0.005
Neg logit contributions
-0.011
-0.009
-0.009
Reviewer-0.009
-0.009
Token
Token ID564
Feature activation+1.21e-02

Pos logit contributions
��+0.161
eteen+0.131
yip+0.128
��+0.121
earchers+0.116
Neg logit contributions
-0.239
-0.206
-0.193
Reviewer-0.188
-0.185
Token
Token ID250
Feature activation-2.89e-02

Pos logit contributions
��+0.023
eteen+0.017
yip+0.017
��+0.016
earchers+0.014
Neg logit contributions
-0.053
-0.047
-0.045
Reviewer-0.044
-0.043
Tokens
Token ID82
Feature activation-3.49e-02

Pos logit contributions
+0.091
+0.076
+0.070
Reviewer+0.068
+0.067
Neg logit contributions
��-0.091
eteen-0.078
yip-0.077
��-0.073
earchers-0.071
Tokenomew
Token ID28030
Feature activation-7.11e-02

Pos logit contributions
+0.136
+0.117
+0.110
Reviewer+0.108
+0.106
Neg logit contributions
��-0.085
eteen-0.069
yip-0.067
��-0.063
earchers-0.061
Tokenhat
Token ID5183
Feature activation+6.37e-01

Pos logit contributions
+0.267
+0.230
+0.218
Reviewer+0.210
+0.209
Neg logit contributions
��-0.179
eteen-0.143
yip-0.143
��-0.136
maxwell-0.128
Token satisfied
Token ID11378
Feature activation-5.84e-02

Pos logit contributions
��+1.619
eteen+1.323
yip+1.269
��+1.214
earchers+1.163
Neg logit contributions
-2.401
-2.074
-1.933
Reviewer-1.895
-1.851
Token.
Token ID13
Feature activation+5.55e-03

Pos logit contributions
+0.192
+0.161
+0.149
Reviewer+0.145
+0.142
Neg logit contributions
��-0.177
eteen-0.150
yip-0.147
��-0.140
earchers-0.136
Token
Token ID447
Feature activation+1.67e-02

Pos logit contributions
��+0.015
eteen+0.012
yip+0.012
��+0.011
earchers+0.011
Neg logit contributions
-0.020
-0.018
-0.016
Reviewer-0.016
-0.016
Token
Token ID251
Feature activation+1.46e-03

Pos logit contributions
��+0.028
eteen+0.020
yip+0.019
��+0.017
earchers+0.016
Neg logit contributions
-0.078
-0.069
-0.066
Reviewer-0.064
-0.064
Token
Token ID198
Feature activation-2.56e-02

Pos logit contributions
��+0.004
eteen+0.003
yip+0.003
��+0.003
earchers+0.003
Neg logit contributions
-0.005
-0.005
-0.004
Reviewer-0.004
-0.004
Token Mer
Token ID4638
Feature activation+2.80e-02

Pos logit contributions
��+0.066
eteen+0.055
yip+0.054
��+0.051
earchers+0.049
Neg logit contributions
-0.084
-0.071
-0.066
Reviewer-0.065
-0.063
Tokenri
Token ID380
Feature activation-2.67e-02

Pos logit contributions
��+0.093
eteen+0.081
yip+0.079
��+0.075
earchers+0.074
Neg logit contributions
-0.084
-0.069
-0.064
Reviewer-0.061
-0.060
Tokenam
Token ID321
Feature activation+1.03e-01

Pos logit contributions
+0.075
+0.061
+0.056
Reviewer+0.054
+0.053
Neg logit contributions
��-0.093
eteen-0.081
yip-0.079
��-0.076
earchers-0.074
Token-
Token ID12
Feature activation+9.00e-02

Pos logit contributions
��+0.282
eteen+0.235
yip+0.229
��+0.217
earchers+0.210
Neg logit contributions
-0.365
-0.311
-0.291
Reviewer-0.283
-0.279
TokenWeb
Token ID13908
Feature activation-1.14e-03

Pos logit contributions
��+0.340
eteen+0.301
yip+0.294
��+0.284
earchers+0.278
Neg logit contributions
-0.227
-0.180
-0.158
Reviewer-0.150
wcs-0.146
Tokenster
Token ID1706
Feature activation+5.70e-01

Pos logit contributions
+0.004
+0.003
+0.003
Reviewer+0.003
+0.003
Neg logit contributions
��-0.003
eteen-0.003
yip-0.003
��-0.003
earchers-0.003
Token's
Token ID338
Feature activation+4.72e-02

Pos logit contributions
��+1.470
eteen+1.224
yip+1.171
��+1.103
earchers+1.075
Neg logit contributions
-2.086
-1.798
-1.679
Reviewer-1.653
-1.611
Token third
Token ID2368
Feature activation+6.65e-02

Pos logit contributions
��+0.131
eteen+0.110
yip+0.107
��+0.101
earchers+0.098
Neg logit contributions
-0.166
-0.141
-0.132
Reviewer-0.129
-0.126
Token definition
Token ID6770
Feature activation+8.67e-02

Pos logit contributions
��+0.173
eteen+0.144
yip+0.140
��+0.132
earchers+0.127
Neg logit contributions
-0.245
-0.211
-0.197
Reviewer-0.192
-0.189
Token of
Token ID286
Feature activation+1.18e-01

Pos logit contributions
��+0.228
eteen+0.189
yip+0.185
��+0.173
earchers+0.167
Neg logit contributions
-0.318
-0.273
-0.255
Reviewer-0.249
-0.245
Token value
Token ID1988
Feature activation+7.44e-02

Pos logit contributions
��+0.290
eteen+0.237
yip+0.231
��+0.216
earchers+0.208
Neg logit contributions
-0.450
-0.388
-0.365
Reviewer-0.356
-0.351
Token
Token ID447
Feature activation+6.29e-04

Pos logit contributions
��+0.109
eteen+0.091
yip+0.089
��+0.084
earchers+0.081
Neg logit contributions
-0.143
-0.122
-0.114
Reviewer-0.111
-0.109
Token
Token ID251
Feature activation+1.27e-02

Pos logit contributions
��+0.001
eteen+0.001
yip+0.001
��+0.001
earchers+0.001
Neg logit contributions
-0.003
-0.002
-0.002
Reviewer-0.002
-0.002
Token
Token ID564
Feature activation+1.97e-02

Pos logit contributions
��+0.034
eteen+0.028
yip+0.028
��+0.026
earchers+0.025
Neg logit contributions
-0.046
-0.039
-0.037
Reviewer-0.036
-0.035
Token
Token ID250
Feature activation+3.35e-02

Pos logit contributions
��+0.030
eteen+0.021
yip+0.021
��+0.018
earchers+0.017
Neg logit contributions
-0.094
-0.084
-0.080
Reviewer-0.079
-0.078
TokenComm
Token ID6935
Feature activation+4.92e-02

Pos logit contributions
��+0.110
eteen+0.095
yip+0.093
��+0.089
earchers+0.086
Neg logit contributions
-0.101
-0.083
-0.077
Reviewer-0.074
-0.072
Tokenander
Token ID4066
Feature activation+5.55e-01

Pos logit contributions
��+0.083
eteen+0.058
yip+0.057
��+0.051
earchers+0.047
Neg logit contributions
-0.229
-0.203
-0.193
Reviewer-0.189
-0.188
Token of
Token ID286
Feature activation+5.28e-02

Pos logit contributions
��+1.494
eteen+1.249
yip+1.226
��+1.150
earchers+1.106
Neg logit contributions
-1.989
-1.692
-1.590
Reviewer-1.547
-1.514
Token the
Token ID262
Feature activation+2.96e-02

Pos logit contributions
��+0.135
eteen+0.110
yip+0.108
��+0.102
earchers+0.098
Neg logit contributions
-0.199
-0.171
-0.160
Reviewer-0.156
-0.154
Token On
Token ID1550
Feature activation+5.51e-02

Pos logit contributions
��+0.073
eteen+0.060
yip+0.058
��+0.055
earchers+0.052
Neg logit contributions
-0.114
-0.098
-0.092
Reviewer-0.090
-0.088
Tokenmits
Token ID24883
Feature activation-2.37e-02

Pos logit contributions
��+0.168
eteen+0.144
yip+0.141
��+0.134
earchers+0.130
Neg logit contributions
-0.180
-0.150
-0.140
Reviewer-0.135
-0.133
Tokenuk
Token ID2724
Feature activation+8.94e-02

Pos logit contributions
+0.083
+0.071
+0.066
Reviewer+0.064
+0.063
Neg logit contributions
��-0.066
eteen-0.055
yip-0.054
��-0.051
earchers-0.049
Token,
Token ID11
Feature activation-1.35e-02

Pos logit contributions
+0.226
+0.195
+0.183
Reviewer+0.177
+0.175
Neg logit contributions
��-0.155
eteen-0.128
yip-0.124
��-0.117
earchers-0.112
Token showed
Token ID3751
Feature activation+4.42e-02

Pos logit contributions
+0.048
+0.041
+0.038
Reviewer+0.037
+0.036
Neg logit contributions
��-0.038
eteen-0.032
yip-0.031
��-0.029
earchers-0.028
Token trustees
Token ID36636
Feature activation+1.20e-02

Pos logit contributions
��+0.116
eteen+0.096
yip+0.094
��+0.088
earchers+0.085
Neg logit contributions
-0.163
-0.140
-0.131
Reviewer-0.127
-0.126
Token were
Token ID547
Feature activation+3.38e-02

Pos logit contributions
��+0.030
eteen+0.024
yip+0.024
��+0.022
earchers+0.021
Neg logit contributions
-0.046
-0.039
-0.037
Reviewer-0.036
-0.036
Token reimb
Token ID46496
Feature activation-6.16e-03

Pos logit contributions
��+0.093
eteen+0.077
yip+0.076
��+0.071
earchers+0.069
Neg logit contributions
-0.121
-0.103
-0.096
Reviewer-0.093
-0.092
Tokenursed
Token ID17539
Feature activation+5.16e-01

Pos logit contributions
+0.028
+0.025
+0.024
Reviewer+0.023
+0.023
Neg logit contributions
��-0.011
eteen-0.008
yip-0.008
��-0.007
earchers-0.006
Token for
Token ID329
Feature activation+1.38e-01

Pos logit contributions
��+1.275
eteen+1.049
yip+1.022
��+0.944
earchers+0.918
Neg logit contributions
-1.973
-1.704
-1.594
Reviewer-1.558
-1.528
Token hand
Token ID1021
Feature activation+5.80e-02

Pos logit contributions
��+0.376
eteen+0.313
yip+0.306
��+0.287
earchers+0.278
Neg logit contributions
-0.497
-0.422
-0.394
Reviewer-0.385
-0.376
Token lot
Token ID1256
Feature activation-5.40e-02

Pos logit contributions
��+0.162
eteen+0.141
yip+0.134
��+0.127
earchers+0.127
Neg logit contributions
-0.200
-0.168
-0.156
Reviewer-0.152
-0.148
Tokenion
Token ID295
Feature activation-1.96e-02

Pos logit contributions
+0.189
+0.160
+0.149
Reviewer+0.143
+0.141
Neg logit contributions
��-0.153
eteen-0.129
yip-0.125
��-0.119
earchers-0.115
Token,
Token ID11
Feature activation-4.85e-02

Pos logit contributions
+0.072
+0.061
+0.057
Reviewer+0.056
+0.054
Neg logit contributions
��-0.052
eteen-0.043
yip-0.042
��-0.040
earchers-0.039
Token it
Token ID340
Feature activation+1.28e-02

Pos logit contributions
+0.060
+0.052
+0.049
Reviewer+0.047
+0.046
Neg logit contributions
��-0.043
eteen-0.035
yip-0.035
��-0.033
earchers-0.031
Token was
Token ID373
Feature activation+1.78e-02

Pos logit contributions
��+0.034
eteen+0.028
yip+0.027
��+0.026
earchers+0.025
Neg logit contributions
-0.047
-0.040
-0.037
Reviewer-0.036
-0.036
Token
Token ID564
Feature activation+4.20e-02

Pos logit contributions
��+0.047
eteen+0.039
yip+0.038
��+0.036
earchers+0.035
Neg logit contributions
-0.065
-0.056
-0.052
Reviewer-0.051
-0.050
Token
Token ID250
Feature activation-2.53e-02

Pos logit contributions
��+0.081
eteen+0.062
yip+0.061
��+0.055
earchers+0.052
Neg logit contributions
-0.183
-0.161
-0.153
Reviewer-0.150
-0.148
Tokendec
Token ID12501
Feature activation+4.70e-02

Pos logit contributions
+0.079
+0.066
+0.061
Reviewer+0.058
+0.057
Neg logit contributions
��-0.081
eteen-0.069
yip-0.068
��-0.065
earchers-0.063
Tokenided
Token ID1384
Feature activation+5.11e-01

Pos logit contributions
��+0.125
eteen+0.103
yip+0.101
��+0.095
earchers+0.092
Neg logit contributions
-0.172
-0.147
-0.138
Reviewer-0.134
-0.132
Tokenly
Token ID306
Feature activation-4.15e-02

Pos logit contributions
��+1.712
eteen+1.483
yip+1.453
��+1.384
earchers+1.351
Neg logit contributions
-1.503
-1.235
-1.133
Reviewer-1.092
-1.063
Token average
Token ID2811
Feature activation+8.72e-03

Pos logit contributions
+0.139
+0.117
+0.109
Reviewer+0.106
+0.104
Neg logit contributions
��-0.123
eteen-0.104
yip-0.102
��-0.097
earchers-0.094
Token
Token ID447
Feature activation+2.80e-03

Pos logit contributions
��+0.025
eteen+0.021
yip+0.021
��+0.020
maxwell+0.019
Neg logit contributions
-0.030
-0.025
-0.023
Reviewer-0.023
-0.022
Token
Token ID251
Feature activation-4.58e-02

Pos logit contributions
��+0.004
eteen+0.003
yip+0.003
��+0.003
earchers+0.002
Neg logit contributions
-0.013
-0.012
-0.011
Reviewer-0.011
-0.011
Token and
Token ID290
Feature activation+1.94e-02

Pos logit contributions
+0.166
+0.142
+0.133
Reviewer+0.129
+0.128
Neg logit contributions
��-0.123
eteen-0.102
yip-0.100
��-0.094
earchers-0.091
Token Luxembourg
Token ID31864
Feature activation-8.25e-02

Pos logit contributions
��+0.120
eteen+0.101
yip+0.099
��+0.094
earchers+0.091
Neg logit contributions
-0.139
-0.117
-0.109
Reviewer-0.106
-0.103
Token premier
Token ID18256
Feature activation+7.04e-03

Pos logit contributions
+0.276
+0.233
+0.215
Reviewer+0.209
+0.205
Neg logit contributions
��-0.244
eteen-0.206
yip-0.203
��-0.192
earchers-0.187
Token Jean
Token ID11320
Feature activation+1.40e-02

Pos logit contributions
��+0.020
eteen+0.017
yip+0.016
��+0.015
earchers+0.015
Neg logit contributions
-0.025
-0.021
-0.019
Reviewer-0.019
-0.019
Token-
Token ID12
Feature activation-1.17e-02

Pos logit contributions
��+0.040
eteen+0.034
yip+0.034
��+0.032
earchers+0.031
Neg logit contributions
-0.048
-0.041
-0.038
Reviewer-0.037
-0.036
TokenCla
Token ID47404
Feature activation+4.43e-02

Pos logit contributions
+0.038
+0.032
+0.029
Reviewer+0.029
ghai+0.028
Neg logit contributions
��-0.035
eteen-0.031
yip-0.030
��-0.028
maxwell-0.028
Tokenude
Token ID2507
Feature activation+4.98e-01

Pos logit contributions
��+0.145
eteen+0.124
yip+0.122
��+0.117
earchers+0.113
Neg logit contributions
-0.135
-0.112
-0.103
Reviewer-0.099
-0.098
Token Jun
Token ID7653
Feature activation+1.69e-03

Pos logit contributions
��+1.401
eteen+1.196
yip+1.142
ords+1.059
cipline+1.056
Neg logit contributions
-1.635
-1.401
-1.300
Reviewer-1.281
wcs-1.244
Tokencker
Token ID15280
Feature activation+9.49e-02

Pos logit contributions
��+0.006
eteen+0.006
yip+0.006
��+0.005
earchers+0.005
Neg logit contributions
-0.004
-0.003
-0.003
Reviewer-0.003
-0.003
Token,
Token ID11
Feature activation-5.79e-03

Pos logit contributions
��+0.246
eteen+0.203
yip+0.197
��+0.186
earchers+0.177
Neg logit contributions
-0.352
-0.302
-0.282
Reviewer-0.277
-0.271
Token she
Token ID673
Feature activation-8.78e-04

Pos logit contributions
+0.021
+0.018
+0.017
Reviewer+0.017
+0.016
Neg logit contributions
��-0.015
eteen-0.013
yip-0.012
��-0.012
earchers-0.011
Token hoped
Token ID10719
Feature activation+3.57e-02

Pos logit contributions
+0.003
+0.003
+0.003
Reviewer+0.003
+0.003
Neg logit contributions
��-0.002
eteen-0.002
yip-0.002
��-0.002
earchers-0.002
Token the
Token ID262
Feature activation+6.44e-03

Pos logit contributions
��+0.058
eteen+0.048
yip+0.047
��+0.044
earchers+0.042
Neg logit contributions
-0.089
-0.076
-0.072
Reviewer-0.070
-0.069
Token process
Token ID1429
Feature activation-8.19e-04

Pos logit contributions
��+0.017
eteen+0.014
yip+0.013
��+0.013
earchers+0.012
Neg logit contributions
-0.024
-0.021
-0.019
Reviewer-0.019
-0.019
Token of
Token ID286
Feature activation-3.49e-02

Pos logit contributions
+0.003
+0.002
+0.002
Reviewer+0.002
+0.002
Neg logit contributions
��-0.002
eteen-0.002
yip-0.002
��-0.002
earchers-0.002
Token reim
Token ID21123
Feature activation-4.59e-02

Pos logit contributions
+0.122
+0.104
+0.097
Reviewer+0.094
+0.093
Neg logit contributions
��-0.098
eteen-0.082
yip-0.080
��-0.076
earchers-0.073
Tokenplement
Token ID26908
Feature activation-2.99e-01

Pos logit contributions
+0.148
+0.124
+0.115
Reviewer+0.112
+0.110
Neg logit contributions
��-0.141
eteen-0.121
yip-0.118
��-0.113
earchers-0.109
Tokening
Token ID278
Feature activation+4.66e-01

Pos logit contributions
+1.037
+0.880
+0.821
Reviewer+0.797
+0.784
Neg logit contributions
��-0.851
eteen-0.714
yip-0.697
��-0.663
earchers-0.640
Token our
Token ID674
Feature activation+6.73e-02

Pos logit contributions
��+1.238
eteen+1.025
yip+1.001
��+0.944
earchers+0.913
Neg logit contributions
-1.696
-1.457
-1.357
Reviewer-1.329
-1.300
Token message
Token ID3275
Feature activation-1.11e-02

Pos logit contributions
��+0.180
eteen+0.149
yip+0.146
��+0.138
earchers+0.133
Neg logit contributions
-0.244
-0.209
-0.195
Reviewer-0.191
-0.187
Token bus
Token ID1323
Feature activation+1.09e-02

Pos logit contributions
+0.038
+0.032
+0.030
Reviewer+0.029
+0.029
Neg logit contributions
��-0.032
eteen-0.027
yip-0.026
��-0.025
earchers-0.024
Token with
Token ID351
Feature activation+3.48e-02

Pos logit contributions
��+0.029
eteen+0.024
yip+0.023
��+0.022
earchers+0.021
Neg logit contributions
-0.040
-0.034
-0.032
Reviewer-0.031
-0.030
Token this
Token ID428
Feature activation+1.95e-02

Pos logit contributions
��+0.088
eteen+0.071
yip+0.070
��+0.066
earchers+0.063
Neg logit contributions
-0.132
-0.114
-0.107
Reviewer-0.104
-0.102
Token first
Token ID717
Feature activation+3.50e-02

Pos logit contributions
��+0.087
eteen+0.072
yip+0.071
��+0.067
earchers+0.064
Neg logit contributions
-0.118
-0.101
-0.095
Reviewer-0.092
-0.090
Token on
Token ID319
Feature activation+1.55e-02

Pos logit contributions
��+0.079
eteen+0.064
yip+0.063
��+0.056
maxwell+0.054
Neg logit contributions
-0.140
-0.122
-0.115
Reviewer-0.112
-0.110
Token.
Token ID764
Feature activation-4.80e-02

Pos logit contributions
��+0.043
eteen+0.036
yip+0.035
��+0.033
earchers+0.032
Neg logit contributions
-0.055
-0.047
-0.044
Reviewer-0.042
-0.041
Token<|endoftext|>
Token ID50256
Feature activation+3.71e-02

Pos logit contributions
+0.180
+0.155
+0.146
Reviewer+0.141
+0.139
Neg logit contributions
��-0.123
eteen-0.101
yip-0.099
��-0.093
earchers-0.089
TokenMon
Token ID9069
Feature activation+1.07e-01

Pos logit contributions
��+0.123
eteen+0.107
yip+0.105
��+0.100
maxwell+0.098
Neg logit contributions
-0.110
-0.091
-0.083
Reviewer-0.079
-0.078
Tokenths
Token ID9998
Feature activation+4.64e-01

Pos logit contributions
��+0.313
eteen+0.264
yip+0.259
��+0.245
earchers+0.237
Neg logit contributions
-0.366
-0.309
-0.288
Reviewer-0.280
-0.275
Token of
Token ID286
Feature activation+2.58e-02

Pos logit contributions
��+1.208
eteen+1.002
yip+0.976
��+0.920
earchers+0.888
Neg logit contributions
-1.711
-1.471
-1.379
Reviewer-1.342
-1.317
Token anticipation
Token ID24054
Feature activation+1.07e-01

Pos logit contributions
��+0.070
eteen+0.058
yip+0.057
��+0.054
earchers+0.052
Neg logit contributions
-0.092
-0.079
-0.074
Reviewer-0.072
-0.070
Token (
Token ID357
Feature activation+1.17e-02

Pos logit contributions
��+0.265
eteen+0.218
yip+0.211
��+0.198
earchers+0.191
Neg logit contributions
-0.406
-0.351
-0.329
Reviewer-0.321
-0.315
Tokenmostly
Token ID29471
Feature activation-4.57e-02

Pos logit contributions
��+0.036
eteen+0.031
yip+0.031
��+0.029
earchers+0.028
Neg logit contributions
-0.038
-0.031
-0.029
Reviewer-0.028
-0.027
Token from
Token ID422
Feature activation-1.16e-02

Pos logit contributions
+0.168
+0.144
+0.135
Reviewer+0.132
+0.130
Neg logit contributions
��-0.120
eteen-0.099
yip-0.097
��-0.091
earchers-0.088
Token GO
Token ID10351
Feature activation+6.80e-02

Pos logit contributions
��+0.207
eteen+0.173
yip+0.169
��+0.160
earchers+0.154
Neg logit contributions
-0.259
-0.220
-0.206
Reviewer-0.200
-0.196
TokenAL
Token ID1847
Feature activation-5.72e-02

Pos logit contributions
��+0.207
eteen+0.176
yip+0.172
��+0.164
earchers+0.159
Neg logit contributions
-0.221
-0.186
-0.173
Reviewer-0.168
-0.164
Token!
Token ID0
Feature activation+7.64e-02

Pos logit contributions
+0.194
+0.164
+0.153
Reviewer+0.149
+0.146
Neg logit contributions
��-0.166
eteen-0.140
yip-0.138
��-0.130
earchers-0.126
Token pic
Token ID8301
Feature activation+9.76e-02

Pos logit contributions
��+0.208
eteen+0.173
yip+0.169
��+0.160
earchers+0.154
Neg logit contributions
-0.273
-0.233
-0.218
Reviewer-0.212
-0.208
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
��+0.258
eteen+0.210
yip+0.209
��+0.200
earchers+0.189
Neg logit contributions
-0.361
-0.310
-0.291
Reviewer-0.283
-0.281
Tokentwitter
Token ID6956
Feature activation+4.51e-01

Pos logit contributions
��+0.494
eteen+0.437
yip+0.430
��+0.414
maxwell+0.406
Neg logit contributions
-0.295
-0.229
-0.204
Reviewer-0.194
-0.188
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
��+1.161
eteen+0.947
yip+0.934
��+0.885
earchers+0.844
Neg logit contributions
-1.692
-1.453
-1.370
Reviewer-1.331
-1.315
Tokencom
Token ID785
Feature activation-3.69e-03

Pos logit contributions
��+0.486
eteen+0.420
yip+0.414
��+0.396
earchers+0.385
Neg logit contributions
-0.411
-0.336
-0.308
Reviewer-0.297
-0.291
Token/
Token ID14
Feature activation+2.25e-01

Pos logit contributions
+0.014
+0.012
+0.011
+0.011
Reviewer+0.011
Neg logit contributions
��-0.010
eteen-0.008
yip-0.008
��-0.008
earchers-0.007
TokenE
Token ID36
Feature activation+7.48e-02

Pos logit contributions
��+0.720
eteen+0.623
yip+0.614
��+0.582
earchers+0.567
Neg logit contributions
-0.693
-0.575
-0.531
Reviewer-0.511
-0.500
Tokenq
Token ID80
Feature activation+3.97e-02

Pos logit contributions
��+0.246
eteen+0.214
yip+0.211
��+0.200
earchers+0.196
Neg logit contributions
-0.224
-0.185
-0.170
Reviewer-0.164
-0.160
Token-
Token ID12
Feature activation+1.04e-01

Pos logit contributions
��+0.292
eteen+0.244
yip+0.236
��+0.224
maxwell+0.214
Neg logit contributions
-0.400
-0.343
-0.320
Reviewer-0.313
-0.306
TokenLou
Token ID24016
Feature activation+1.86e-01

Pos logit contributions
��+0.341
eteen+0.296
yip+0.289
��+0.277
maxwell+0.270
Neg logit contributions
-0.312
-0.257
-0.237
Reviewer-0.226
-0.221
Token CK
Token ID45233
Feature activation+5.83e-02

Pos logit contributions
��+0.535
eteen+0.455
yip+0.445
��+0.419
earchers+0.406
Neg logit contributions
-0.631
-0.535
-0.500
Reviewer-0.482
-0.477
Token&
Token ID5
Feature activation+7.79e-03

Pos logit contributions
��+0.159
eteen+0.133
yip+0.130
��+0.123
earchers+0.118
Neg logit contributions
-0.207
-0.177
-0.165
Reviewer-0.161
-0.158
TokenMar
Token ID7676
Feature activation+4.77e-04

Pos logit contributions
��+0.025
eteen+0.021
yip+0.021
��+0.020
maxwell+0.019
Neg logit contributions
-0.024
-0.020
-0.019
Reviewer-0.018
-0.018
Tokenriages
Token ID16451
Feature activation+4.50e-01

Pos logit contributions
��+0.001
eteen+0.001
yip+0.001
��+0.001
earchers+0.001
Neg logit contributions
-0.002
-0.001
-0.001
Reviewer-0.001
-0.001
Token-
Token ID12
Feature activation+1.25e-01

Pos logit contributions
��+1.228
eteen+1.034
yip+1.005
��+0.952
earchers+0.920
Neg logit contributions
-1.593
-1.360
-1.267
Reviewer-1.239
-1.209
TokenE
Token ID36
Feature activation+6.62e-02

Pos logit contributions
��+0.410
eteen+0.357
yip+0.347
��+0.333
maxwell+0.326
Neg logit contributions
-0.374
-0.309
-0.284
Reviewer-0.270
-0.265
Tokenarl
Token ID7063
Feature activation+2.53e-02

Pos logit contributions
��+0.220
eteen+0.191
yip+0.188
��+0.179
earchers+0.175
Neg logit contributions
-0.197
-0.162
-0.149
Reviewer-0.142
-0.141
Token Lu
Token ID6026
Feature activation+6.59e-02

Pos logit contributions
��+0.069
eteen+0.058
yip+0.056
��+0.053
earchers+0.052
Neg logit contributions
-0.090
-0.077
-0.072
Reviewer-0.070
-0.069
Tokenvs
Token ID14259
Feature activation+1.12e-02

Pos logit contributions
��+0.199
eteen+0.171
yip+0.166
��+0.157
earchers+0.153
Neg logit contributions
-0.217
-0.182
-0.169
Reviewer-0.163
-0.160
Tokent
Token ID83
Feature activation+8.70e-03

Pos logit contributions
��+0.140
eteen+0.118
yip+0.115
��+0.109
earchers+0.106
Neg logit contributions
-0.167
-0.142
-0.132
Reviewer-0.128
-0.126
Token complain
Token ID13121
Feature activation-2.50e-02

Pos logit contributions
��+0.023
eteen+0.019
yip+0.019
��+0.018
earchers+0.017
Neg logit contributions
-0.032
-0.027
-0.026
Reviewer-0.025
-0.025
Token,
Token ID11
Feature activation+5.00e-02

Pos logit contributions
+0.094
+0.081
+0.076
Reviewer+0.074
+0.073
Neg logit contributions
��-0.063
eteen-0.052
yip-0.051
��-0.048
earchers-0.046
Token one
Token ID530
Feature activation+2.82e-02

Pos logit contributions
��+0.130
eteen+0.107
yip+0.105
��+0.098
earchers+0.094
Neg logit contributions
-0.186
-0.160
-0.150
Reviewer-0.146
-0.144
Token supp
Token ID802
Feature activation+5.10e-02

Pos logit contributions
��+0.076
eteen+0.064
yip+0.062
��+0.058
earchers+0.056
Neg logit contributions
-0.102
-0.087
-0.081
Reviewer-0.079
-0.078
Tokenoses
Token ID4629
Feature activation+4.41e-01

Pos logit contributions
��+0.144
eteen+0.122
yip+0.120
��+0.112
earchers+0.109
Neg logit contributions
-0.177
-0.151
-0.140
Reviewer-0.137
-0.134
Token.
Token ID13
Feature activation+5.83e-02

Pos logit contributions
��+1.100
eteen+0.900
yip+0.880
��+0.821
earchers+0.782
Neg logit contributions
-1.679
-1.449
-1.358
Reviewer-1.318
-1.303
Token
Token ID198
Feature activation+4.59e-02

Pos logit contributions
��+0.156
eteen+0.129
yip+0.126
��+0.119
earchers+0.114
Neg logit contributions
-0.211
-0.181
-0.169
Reviewer-0.165
-0.162
Token
Token ID198
Feature activation+4.22e-02

Pos logit contributions
��+0.127
eteen+0.104
yip+0.104
��+0.100
earchers+0.095
Neg logit contributions
-0.164
-0.140
-0.132
Reviewer-0.128
-0.127
TokenAnd
Token ID1870
Feature activation-1.36e-02

Pos logit contributions
��+0.143
eteen+0.125
yip+0.122
��+0.117
maxwell+0.114
Neg logit contributions
-0.121
-0.100
-0.090
Reviewer-0.086
-0.084
Token so
Token ID523
Feature activation+2.20e-02

Pos logit contributions
+0.051
+0.044
+0.041
Reviewer+0.040
+0.040
Neg logit contributions
��-0.035
eteen-0.028
yip-0.028
��-0.026
earchers-0.025
Token pattern
Token ID3912
Feature activation-4.83e-02

Pos logit contributions
+0.139
+0.119
+0.111
Reviewer+0.108
+0.107
Neg logit contributions
��-0.101
eteen-0.084
yip-0.082
��-0.077
earchers-0.075
Token of
Token ID286
Feature activation-8.45e-02

Pos logit contributions
+0.172
+0.147
+0.137
Reviewer+0.134
+0.131
Neg logit contributions
��-0.132
eteen-0.110
yip-0.108
��-0.102
earchers-0.098
Token tax
Token ID1687
Feature activation-4.93e-02

Pos logit contributions
+0.291
+0.246
+0.228
Reviewer+0.222
+0.217
Neg logit contributions
��-0.242
eteen-0.204
yip-0.201
��-0.188
earchers-0.183
Token-
Token ID12
Feature activation+2.48e-02

Pos logit contributions
+0.159
+0.133
+0.123
Reviewer+0.119
+0.116
Neg logit contributions
��-0.152
eteen-0.130
yip-0.127
��-0.121
earchers-0.117
Tokenavoid
Token ID27080
Feature activation+3.23e-02

Pos logit contributions
��+0.085
eteen+0.075
yip+0.073
��+0.070
maxwell+0.068
Neg logit contributions
-0.071
-0.057
-0.053
Reviewer-0.050
-0.049
Tokenance
Token ID590
Feature activation+4.35e-01

Pos logit contributions
��+0.072
eteen+0.057
yip+0.056
��+0.052
earchers+0.049
Neg logit contributions
-0.132
-0.115
-0.109
Reviewer-0.106
-0.105
Token.
Token ID13
Feature activation-3.90e-03

Pos logit contributions
��+1.107
eteen+0.915
yip+0.887
��+0.819
earchers+0.802
Neg logit contributions
-1.630
-1.409
-1.308
Reviewer-1.282
-1.252
Token
Token ID198
Feature activation-4.36e-02

Pos logit contributions
+0.014
+0.012
+0.011
Reviewer+0.011
+0.011
Neg logit contributions
��-0.010
eteen-0.009
yip-0.008
��-0.008
earchers-0.008
Token
Token ID198
Feature activation-5.06e-02

Pos logit contributions
+0.155
+0.132
+0.125
Reviewer+0.121
+0.120
Neg logit contributions
��-0.121
eteen-0.099
yip-0.099
��-0.095
earchers-0.090
TokenMy
Token ID3666
Feature activation-7.94e-02

Pos logit contributions
+0.148
+0.123
+0.110
Reviewer+0.106
+0.103
Neg logit contributions
��-0.168
eteen-0.147
yip-0.142
��-0.136
maxwell-0.134
Token own
Token ID898
Feature activation-6.72e-02

Pos logit contributions
+0.282
+0.242
+0.223
Reviewer+0.219
+0.215
Neg logit contributions
��-0.218
eteen-0.183
yip-0.178
��-0.169
earchers-0.163
Token start
Token ID923
Feature activation+2.35e-02

Pos logit contributions
��+0.034
eteen+0.028
yip+0.027
��+0.025
earchers+0.024
Neg logit contributions
-0.052
-0.044
-0.042
Reviewer-0.041
-0.040
Token developing
Token ID5922
Feature activation+7.87e-04

Pos logit contributions
��+0.064
eteen+0.053
yip+0.052
��+0.049
earchers+0.047
Neg logit contributions
-0.085
-0.072
-0.068
Reviewer-0.066
-0.065
Token in
Token ID287
Feature activation+2.43e-02

Pos logit contributions
��+0.002
eteen+0.002
yip+0.002
��+0.002
earchers+0.001
Neg logit contributions
-0.003
-0.003
-0.002
Reviewer-0.002
-0.002
Token AWS
Token ID30865
Feature activation+2.94e-02

Pos logit contributions
��+0.066
eteen+0.055
yip+0.054
��+0.051
earchers+0.049
Neg logit contributions
-0.087
-0.074
-0.070
Reviewer-0.068
-0.067
Token Lamb
Token ID21114
Feature activation-1.04e-01

Pos logit contributions
��+0.077
eteen+0.064
yip+0.062
��+0.059
earchers+0.056
Neg logit contributions
-0.108
-0.093
-0.087
Reviewer-0.085
-0.083
Tokenda
Token ID6814
Feature activation+4.35e-01

Pos logit contributions
+0.318
+0.263
+0.243
Reviewer+0.236
+0.232
Neg logit contributions
��-0.340
eteen-0.289
yip-0.287
��-0.276
earchers-0.265
Token with
Token ID351
Feature activation+4.44e-02

Pos logit contributions
��+1.134
eteen+0.936
yip+0.908
��+0.857
earchers+0.822
Neg logit contributions
-1.608
-1.381
-1.294
Reviewer-1.256
-1.237
Token server
Token ID4382
Feature activation-5.63e-02

Pos logit contributions
��+0.120
eteen+0.100
yip+0.098
��+0.093
earchers+0.089
Neg logit contributions
-0.160
-0.136
-0.128
Reviewer-0.124
-0.122
Tokenless
Token ID1203
Feature activation+4.86e-02

Pos logit contributions
+0.155
+0.125
+0.115
Reviewer+0.110
+0.108
Neg logit contributions
��-0.201
eteen-0.174
yip-0.172
��-0.166
earchers-0.161
Token framework
Token ID9355
Feature activation+5.42e-02

Pos logit contributions
��+0.125
eteen+0.103
yip+0.101
��+0.095
earchers+0.091
Neg logit contributions
-0.181
-0.156
-0.146
Reviewer-0.142
-0.140
Token.
Token ID13
Feature activation+2.75e-02

Pos logit contributions
��+0.137
eteen+0.112
yip+0.110
��+0.103
earchers+0.098
Neg logit contributions
-0.205
-0.176
-0.166
Reviewer-0.162
-0.159
Token put
Token ID1234
Feature activation-7.49e-02

Pos logit contributions
��+0.068
eteen+0.057
yip+0.056
��+0.053
earchers+0.051
Neg logit contributions
-0.086
-0.073
-0.068
Reviewer-0.066
-0.065
Token Brock
Token ID20501
Feature activation+5.63e-02

Pos logit contributions
+0.285
+0.245
+0.231
Reviewer+0.225
+0.221
Neg logit contributions
��-0.188
eteen-0.153
yip-0.150
��-0.141
earchers-0.135
Tokenway
Token ID1014
Feature activation-1.11e-02

Pos logit contributions
��+0.154
eteen+0.129
yip+0.126
��+0.119
earchers+0.114
Neg logit contributions
-0.201
-0.172
-0.160
Reviewer-0.156
-0.153
Token,
Token ID11
Feature activation+1.86e-03

Pos logit contributions
+0.041
+0.035
+0.033
Reviewer+0.032
+0.031
Neg logit contributions
��-0.030
eteen-0.024
yip-0.024
��-0.023
earchers-0.022
Token Og
Token ID25686
Feature activation-6.79e-02

Pos logit contributions
��+0.005
eteen+0.004
yip+0.004
��+0.004
earchers+0.004
Neg logit contributions
-0.007
-0.006
-0.005
Reviewer-0.005
-0.005
Tokenden
Token ID6559
Feature activation+4.34e-01

Pos logit contributions
+0.210
+0.174
+0.161
Reviewer+0.156
+0.153
Neg logit contributions
��-0.219
eteen-0.186
yip-0.184
��-0.176
earchers-0.170
Tokenville
Token ID4244
Feature activation+1.95e-02

Pos logit contributions
��+1.143
eteen+0.945
yip+0.916
��+0.868
earchers+0.829
Neg logit contributions
-1.590
-1.362
-1.272
Reviewer-1.251
-1.223
Token and
Token ID290
Feature activation+2.30e-02

Pos logit contributions
��+0.052
eteen+0.042
yip+0.042
��+0.039
earchers+0.038
Neg logit contributions
-0.071
-0.061
-0.057
Reviewer-0.056
-0.055
Token North
Token ID2258
Feature activation+4.52e-02

Pos logit contributions
��+0.062
eteen+0.051
yip+0.050
��+0.047
earchers+0.045
Neg logit contributions
-0.083
-0.071
-0.067
Reviewer-0.065
-0.064
Token H
Token ID367
Feature activation-1.43e-02

Pos logit contributions
��+0.128
eteen+0.107
yip+0.105
��+0.100
earchers+0.096
Neg logit contributions
-0.157
-0.133
-0.124
Reviewer-0.121
-0.119
Tokenaver
Token ID8770
Feature activation+9.38e-02

Pos logit contributions
+0.052
+0.045
+0.042
Reviewer+0.041
+0.040
Neg logit contributions
��-0.038
eteen-0.031
yip-0.030
��-0.029
earchers-0.028
Token P
Token ID350
Feature activation+4.13e-02

Pos logit contributions
��+0.115
eteen+0.094
yip+0.092
��+0.087
earchers+0.083
Neg logit contributions
-0.169
-0.145
-0.137
Reviewer-0.133
-0.131
Tokenoul
Token ID2852
Feature activation+5.97e-02

Pos logit contributions
��+0.093
eteen+0.074
yip+0.072
��+0.067
earchers+0.064
Neg logit contributions
-0.168
-0.146
-0.138
Reviewer-0.135
-0.133
Tokeniot
Token ID5151
Feature activation-5.16e-02

Pos logit contributions
��+0.193
eteen+0.167
yip+0.163
��+0.156
earchers+0.152
Neg logit contributions
-0.183
-0.152
-0.140
Reviewer-0.136
-0.132
Token and
Token ID290
Feature activation+4.93e-02

Pos logit contributions
+0.189
+0.162
+0.152
Reviewer+0.148
+0.145
Neg logit contributions
��-0.136
eteen-0.113
yip-0.110
��-0.104
earchers-0.100
Token Mat
Token ID6550
Feature activation+4.46e-02

Pos logit contributions
��+0.137
eteen+0.115
yip+0.112
��+0.106
earchers+0.102
Neg logit contributions
-0.174
-0.148
-0.138
Reviewer-0.135
-0.132
Tokenhew
Token ID6391
Feature activation+4.33e-01

Pos logit contributions
��+0.116
eteen+0.096
yip+0.094
��+0.089
earchers+0.085
Neg logit contributions
-0.165
-0.141
-0.133
Reviewer-0.129
-0.127
Token Dumb
Token ID42989
Feature activation+5.56e-02

Pos logit contributions
��+1.215
eteen+1.016
yip+0.996
��+0.942
earchers+0.907
Neg logit contributions
-1.520
-1.291
-1.210
Reviewer-1.172
-1.154
Tokena
Token ID64
Feature activation+2.11e-02

Pos logit contributions
��+0.190
eteen+0.164
yip+0.162
��+0.156
earchers+0.151
Neg logit contributions
-0.161
-0.131
-0.121
Reviewer-0.116
-0.114
Token has
Token ID468
Feature activation-9.24e-03

Pos logit contributions
��+0.055
eteen+0.045
yip+0.044
��+0.042
earchers+0.040
Neg logit contributions
-0.078
-0.067
-0.063
Reviewer-0.061
-0.060
Token made
Token ID925
Feature activation+9.21e-03

Pos logit contributions
+0.034
+0.029
+0.027
Reviewer+0.027
+0.026
Neg logit contributions
��-0.024
eteen-0.020
yip-0.020
��-0.019
earchers-0.018
Token his
Token ID465
Feature activation+3.81e-02

Pos logit contributions
��+0.023
eteen+0.019
yip+0.019
��+0.018
earchers+0.017
Neg logit contributions
-0.035
-0.030
-0.028
Reviewer-0.027
-0.027
Token if
Token ID611
Feature activation+1.27e-02

Pos logit contributions
��+0.039
eteen+0.032
yip+0.031
��+0.030
earchers+0.028
Neg logit contributions
-0.053
-0.045
-0.043
Reviewer-0.042
-0.041
Token they
Token ID484
Feature activation+3.04e-03

Pos logit contributions
��+0.034
eteen+0.028
yip+0.027
��+0.026
earchers+0.024
Neg logit contributions
-0.047
-0.040
-0.038
Reviewer-0.037
-0.036
Token could
Token ID714
Feature activation-4.75e-02

Pos logit contributions
��+0.008
eteen+0.006
yip+0.006
��+0.006
earchers+0.005
Neg logit contributions
-0.012
-0.010
-0.010
Reviewer-0.009
-0.009
Token be
Token ID307
Feature activation-2.80e-02

Pos logit contributions
+0.183
+0.158
+0.149
Reviewer+0.145
+0.143
Neg logit contributions
��-0.117
eteen-0.095
yip-0.093
��-0.087
earchers-0.083
Token reprim
Token ID41929
Feature activation-1.27e-02

Pos logit contributions
+0.097
+0.082
+0.076
Reviewer+0.075
+0.073
Neg logit contributions
��-0.080
eteen-0.067
yip-0.065
��-0.063
earchers-0.060
Tokenanded
Token ID12249
Feature activation+4.31e-01

Pos logit contributions
+0.048
+0.041
+0.039
Reviewer+0.038
+0.038
Neg logit contributions
��-0.032
yip-0.025
eteen-0.025
��-0.024
maxwell-0.022
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
��+1.074
eteen+0.878
yip+0.846
��+0.797
earchers+0.764
Neg logit contributions
-1.646
-1.419
-1.333
Reviewer-1.299
-1.276
Token J
Token ID449
Feature activation+1.46e-01

Pos logit contributions
��+0.034
eteen+0.028
yip+0.028
��+0.026
earchers+0.025
Neg logit contributions
-0.051
-0.043
-0.041
Reviewer-0.040
-0.039
TokenSO
Token ID15821
Feature activation+1.12e-01

Pos logit contributions
��+0.471
eteen+0.403
yip+0.397
��+0.379
earchers+0.367
Neg logit contributions
-0.451
-0.374
-0.346
Reviewer-0.335
-0.329
Token said
Token ID531
Feature activation+4.60e-02

Pos logit contributions
��+0.341
eteen+0.291
yip+0.285
��+0.273
earchers+0.264
Neg logit contributions
-0.363
-0.303
-0.281
Reviewer-0.272
-0.267
Token since
Token ID1201
Feature activation+5.09e-02

Pos logit contributions
��+0.118
eteen+0.097
yip+0.094
��+0.089
earchers+0.085
Neg logit contributions
-0.172
-0.148
-0.138
Reviewer-0.135
-0.133
Token that
Token ID326
Feature activation+1.51e-02

Pos logit contributions
+0.065
+0.055
+0.052
Reviewer+0.051
+0.050
Neg logit contributions
��-0.047
eteen-0.039
yip-0.038
��-0.035
earchers-0.034
Token holds
Token ID6622
Feature activation+9.19e-02

Pos logit contributions
��+0.040
eteen+0.033
yip+0.033
��+0.031
earchers+0.030
Neg logit contributions
-0.055
-0.047
-0.044
Reviewer-0.043
-0.042
Token a
Token ID257
Feature activation+3.27e-02

Pos logit contributions
��+0.237
eteen+0.195
yip+0.191
��+0.179
earchers+0.172
Neg logit contributions
-0.343
-0.295
-0.277
Reviewer-0.269
-0.265
Token "
Token ID366
Feature activation+1.05e-01

Pos logit contributions
��+0.088
eteen+0.073
yip+0.071
��+0.067
earchers+0.065
Neg logit contributions
-0.118
-0.101
-0.095
Reviewer-0.092
-0.091
Tokennom
Token ID26601
Feature activation+1.06e-01

Pos logit contributions
��+0.345
eteen+0.299
yip+0.296
��+0.280
maxwell+0.274
Neg logit contributions
-0.315
-0.261
-0.238
Reviewer-0.228
-0.225
Tokeninal
Token ID1292
Feature activation+4.29e-01

Pos logit contributions
��+0.301
eteen+0.252
yip+0.247
��+0.235
earchers+0.226
Neg logit contributions
-0.368
-0.312
-0.291
Reviewer-0.282
-0.278
Token interest
Token ID1393
Feature activation-5.20e-02

Pos logit contributions
��+1.209
eteen+1.018
yip+1.004
��+0.937
earchers+0.911
Neg logit contributions
-1.488
-1.270
-1.172
Reviewer-1.145
-1.126
Token"
Token ID1
Feature activation+5.42e-02

Pos logit contributions
+0.155
+0.129
+0.117
Reviewer+0.113
+0.111
Neg logit contributions
��-0.172
eteen-0.149
yip-0.147
��-0.139
earchers-0.136
Token in
Token ID287
Feature activation+7.36e-02

Pos logit contributions
��+0.149
eteen+0.125
yip+0.123
��+0.115
earchers+0.112
Neg logit contributions
-0.192
-0.164
-0.153
Reviewer-0.149
-0.146
Token White
Token ID2635
Feature activation+7.48e-02

Pos logit contributions
��+0.185
eteen+0.152
yip+0.148
��+0.139
earchers+0.133
Neg logit contributions
-0.279
-0.240
-0.226
Reviewer-0.220
-0.217
Token Marsh
Token ID9786
Feature activation+2.76e-02

Pos logit contributions
��+0.216
eteen+0.183
yip+0.180
��+0.170
earchers+0.164
Neg logit contributions
-0.255
-0.216
-0.202
Reviewer-0.196
-0.192
Token
Token ID198
Feature activation+4.24e-02

Pos logit contributions
��+0.105
eteen+0.086
yip+0.085
��+0.080
earchers+0.077
Neg logit contributions
-0.144
-0.123
-0.116
Reviewer-0.113
-0.111
TokenWhat
Token ID2061
Feature activation+2.97e-03

Pos logit contributions
��+0.145
eteen+0.126
yip+0.123
��+0.118
maxwell+0.116
Neg logit contributions
-0.122
-0.100
-0.091
Reviewer-0.086
-0.085
Token is
Token ID318
Feature activation-2.43e-02

Pos logit contributions
��+0.007
eteen+0.006
yip+0.006
��+0.006
earchers+0.005
Neg logit contributions
-0.011
-0.010
-0.009
Reviewer-0.009
-0.009
Token CAT
Token ID38348
Feature activation-3.96e-02

Pos logit contributions
+0.086
+0.073
+0.068
Reviewer+0.067
+0.066
Neg logit contributions
��-0.067
eteen-0.056
yip-0.055
��-0.052
earchers-0.050
Token Adjust
Token ID20292
Feature activation-4.60e-02

Pos logit contributions
+0.145
+0.124
+0.116
Reviewer+0.113
+0.111
Neg logit contributions
��-0.105
eteen-0.087
yip-0.085
��-0.080
earchers-0.077
Tokening
Token ID278
Feature activation+4.29e-01

Pos logit contributions
+0.147
+0.123
+0.114
Reviewer+0.110
+0.108
Neg logit contributions
��-0.143
eteen-0.122
yip-0.120
��-0.115
earchers-0.111
Token?
Token ID30
Feature activation+2.54e-02

Pos logit contributions
��+1.087
eteen+0.885
yip+0.864
��+0.811
maxwell+0.779
Neg logit contributions
-1.626
-1.395
-1.309
Reviewer-1.274
-1.261
Token
Token ID198
Feature activation+2.99e-02

Pos logit contributions
��+0.068
eteen+0.057
yip+0.055
��+0.052
earchers+0.051
Neg logit contributions
-0.092
-0.078
-0.073
Reviewer-0.071
-0.070
Token
Token ID198
Feature activation+3.13e-02

Pos logit contributions
��+0.084
eteen+0.069
yip+0.069
��+0.066
earchers+0.063
Neg logit contributions
-0.106
-0.090
-0.085
Reviewer-0.083
-0.082
TokenCat
Token ID21979
Feature activation-6.17e-03

Pos logit contributions
��+0.103
eteen+0.089
yip+0.087
��+0.083
maxwell+0.082
Neg logit contributions
-0.094
-0.077
-0.070
Reviewer-0.067
-0.066
Tokenast
Token ID459
Feature activation+6.60e-02

Pos logit contributions
+0.021
+0.018
+0.017
Reviewer+0.016
+0.016
Neg logit contributions
��-0.018
eteen-0.015
yip-0.014
��-0.014
earchers-0.013
Token many
Token ID867
Feature activation+4.68e-02

Pos logit contributions
��+0.092
eteen+0.076
yip+0.074
��+0.070
earchers+0.067
Neg logit contributions
-0.127
-0.109
-0.102
Reviewer-0.099
-0.098
Token friends
Token ID2460
Feature activation+6.55e-02

Pos logit contributions
��+0.123
eteen+0.102
yip+0.098
��+0.093
earchers+0.090
Neg logit contributions
-0.172
-0.148
-0.138
Reviewer-0.135
-0.133
Token,
Token ID11
Feature activation+1.25e-02

Pos logit contributions
��+0.171
eteen+0.142
yip+0.137
��+0.130
earchers+0.125
Neg logit contributions
-0.242
-0.208
-0.195
Reviewer-0.189
-0.186
Token who
Token ID508
Feature activation+9.47e-02

Pos logit contributions
��+0.033
eteen+0.027
yip+0.027
��+0.025
earchers+0.024
Neg logit contributions
-0.046
-0.040
-0.037
Reviewer-0.036
-0.036
Token affection
Token ID17696
Feature activation+5.23e-02

Pos logit contributions
��+0.248
eteen+0.205
yip+0.200
��+0.189
earchers+0.182
Neg logit contributions
-0.349
-0.299
-0.281
Reviewer-0.273
-0.269
Tokenately
Token ID1286
Feature activation+4.27e-01

Pos logit contributions
��+0.090
eteen+0.066
yip+0.064
��+0.058
earchers+0.054
Neg logit contributions
-0.239
-0.212
-0.202
Reviewer-0.197
-0.195
Token called
Token ID1444
Feature activation+7.05e-02

Pos logit contributions
��+1.106
eteen+0.934
yip+0.904
��+0.840
earchers+0.819
Neg logit contributions
-1.569
-1.347
-1.257
Reviewer-1.231
-1.205
Token him
Token ID683
Feature activation+2.87e-04

Pos logit contributions
��+0.182
eteen+0.149
yip+0.146
��+0.137
earchers+0.132
Neg logit contributions
-0.264
-0.226
-0.213
Reviewer-0.207
-0.204
Token "
Token ID366
Feature activation+5.71e-02

Pos logit contributions
��+0.001
eteen+0.001
yip+0.001
��+0.001
earchers+0.001
Neg logit contributions
-0.001
-0.001
-0.001
Reviewer-0.001
-0.001
TokenShort
Token ID16438
Feature activation+5.38e-02

Pos logit contributions
��+0.189
eteen+0.164
yip+0.161
��+0.154
maxwell+0.150
Neg logit contributions
-0.171
-0.141
-0.129
Reviewer-0.124
-0.122
Tokeny
Token ID88
Feature activation+7.64e-02

Pos logit contributions
��+0.147
eteen+0.121
yip+0.119
��+0.113
earchers+0.109
Neg logit contributions
-0.193
-0.164
-0.154
Reviewer-0.150
-0.148
Token8
Token ID23
Feature activation-4.60e-02

Pos logit contributions
+0.123
+0.102
+0.094
Reviewer+0.090
+0.089
Neg logit contributions
��-0.130
eteen-0.112
yip-0.110
��-0.105
earchers-0.102
Tokenb
Token ID65
Feature activation+5.82e-02

Pos logit contributions
+0.144
+0.120
+0.111
Reviewer+0.107
+0.105
Neg logit contributions
��-0.146
eteen-0.125
yip-0.124
��-0.118
earchers-0.114
TokenET
Token ID2767
Feature activation+7.18e-03

Pos logit contributions
��+0.188
eteen+0.162
yip+0.160
��+0.152
earchers+0.147
Neg logit contributions
-0.179
-0.148
-0.137
Reviewer-0.132
-0.129
Token pic
Token ID8301
Feature activation+8.97e-02

Pos logit contributions
��+0.023
eteen+0.019
yip+0.019
��+0.018
earchers+0.018
Neg logit contributions
-0.023
-0.019
-0.017
Reviewer-0.017
-0.017
Token.
Token ID13
Feature activation+9.40e-02

Pos logit contributions
��+0.239
eteen+0.195
yip+0.193
��+0.186
earchers+0.176
Neg logit contributions
-0.330
-0.283
-0.267
Reviewer-0.259
-0.257
Tokentwitter
Token ID6956
Feature activation+4.24e-01

Pos logit contributions
��+0.377
eteen+0.334
yip+0.330
��+0.319
earchers+0.311
Neg logit contributions
-0.216
-0.167
-0.149
Reviewer-0.141
-0.137
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
��+1.099
eteen+0.896
yip+0.885
��+0.843
earchers+0.802
Neg logit contributions
-1.582
-1.358
-1.281
Reviewer-1.243
-1.230
Tokencom
Token ID785
Feature activation-1.14e-03

Pos logit contributions
��+0.432
eteen+0.373
yip+0.368
��+0.353
earchers+0.343
Neg logit contributions
-0.362
-0.296
-0.272
Reviewer-0.261
-0.256
Token/
Token ID14
Feature activation+1.88e-01

Pos logit contributions
+0.004
+0.004
+0.003
+0.003
Reviewer+0.003
Neg logit contributions
��-0.003
eteen-0.002
yip-0.002
��-0.002
earchers-0.002
TokenW
Token ID54
Feature activation+6.51e-03

Pos logit contributions
��+0.601
eteen+0.519
yip+0.512
��+0.485
earchers+0.472
Neg logit contributions
-0.582
-0.482
-0.446
Reviewer-0.429
-0.421
Token8
Token ID23
Feature activation-6.23e-02

Pos logit contributions
��+0.021
eteen+0.019
yip+0.018
��+0.017
earchers+0.017
Neg logit contributions
-0.020
-0.016
-0.015
Reviewer-0.014
-0.014
Token a
Token ID257
Feature activation+3.36e-02

Pos logit contributions
+0.067
+0.057
+0.054
Reviewer+0.053
+0.052
Neg logit contributions
��-0.043
eteen-0.035
yip-0.034
��-0.032
earchers-0.031
Token mob
Token ID7251
Feature activation-9.59e-04

Pos logit contributions
��+0.086
eteen+0.071
yip+0.070
��+0.065
earchers+0.063
Neg logit contributions
-0.125
-0.108
-0.101
Reviewer-0.098
-0.097
Token of
Token ID286
Feature activation+6.08e-02

Pos logit contributions
+0.003
+0.003
+0.003
Reviewer+0.003
+0.003
Neg logit contributions
��-0.003
eteen-0.002
yip-0.002
��-0.002
earchers-0.002
Token unnecessary
Token ID13114
Feature activation-1.02e-01

Pos logit contributions
��+0.155
eteen+0.128
yip+0.125
��+0.118
earchers+0.113
Neg logit contributions
-0.228
-0.196
-0.184
Reviewer-0.179
-0.176
Token dupl
Token ID14184
Feature activation+2.38e-02

Pos logit contributions
+0.368
+0.314
+0.294
Reviewer+0.286
+0.281
Neg logit contributions
��-0.276
eteen-0.229
yip-0.224
��-0.212
earchers-0.204
Tokenicates
Token ID16856
Feature activation-4.94e-01

Pos logit contributions
��+0.090
eteen+0.080
yip+0.079
��+0.075
earchers+0.075
Neg logit contributions
-0.060
-0.047
-0.042
Reviewer-0.041
-0.039
Token.
Token ID764
Feature activation-9.26e-02

Pos logit contributions
+1.800
+1.539
+1.442
Reviewer+1.403
+1.382
Neg logit contributions
��-1.317
eteen-1.090
yip-1.067
��-1.008
earchers-0.969
Token.
Token ID764
Feature activation-7.89e-02

Pos logit contributions
+0.327
+0.278
+0.261
Reviewer+0.253
+0.250
Neg logit contributions
��-0.257
eteen-0.214
yip-0.210
��-0.200
earchers-0.192
Token."
Token ID22135
Feature activation-4.29e-02

Pos logit contributions
+0.279
+0.237
+0.222
Reviewer+0.215
+0.212
Neg logit contributions
��-0.219
eteen-0.182
yip-0.179
��-0.170
earchers-0.163
Token
Token ID198
Feature activation+9.44e-03

Pos logit contributions
+0.158
+0.135
+0.127
Reviewer+0.123
+0.121
Neg logit contributions
��-0.112
eteen-0.093
yip-0.091
��-0.085
earchers-0.082
Token
Token ID198
Feature activation-3.04e-03

Pos logit contributions
��+0.026
eteen+0.021
yip+0.021
��+0.020
earchers+0.019
Neg logit contributions
-0.034
-0.029
-0.027
Reviewer-0.026
-0.026
Token with
Token ID351
Feature activation-2.54e-02

Pos logit contributions
��+0.098
eteen+0.082
yip+0.079
��+0.076
earchers+0.073
Neg logit contributions
-0.130
-0.111
-0.103
Reviewer-0.101
-0.099
Token Love
Token ID5896
Feature activation-2.52e-02

Pos logit contributions
+0.090
+0.077
+0.072
Reviewer+0.070
+0.069
Neg logit contributions
��-0.070
eteen-0.058
yip-0.057
��-0.054
earchers-0.052
Token conveyed
Token ID33348
Feature activation-6.42e-02

Pos logit contributions
+0.091
+0.078
+0.073
Reviewer+0.071
+0.069
Neg logit contributions
��-0.068
eteen-0.056
yip-0.055
��-0.052
earchers-0.050
Token through
Token ID832
Feature activation+1.47e-02

Pos logit contributions
+0.246
+0.213
+0.200
Reviewer+0.194
+0.192
Neg logit contributions
��-0.159
eteen-0.129
yip-0.126
��-0.118
earchers-0.114
Token lyr
Token ID47188
Feature activation-3.07e-02

Pos logit contributions
��+0.039
eteen+0.032
yip+0.031
��+0.029
earchers+0.028
Neg logit contributions
-0.054
-0.047
-0.044
Reviewer-0.042
-0.042
Tokenical
Token ID605
Feature activation-4.80e-01

Pos logit contributions
+0.084
+0.068
+0.062
Reviewer+0.059
+0.058
Neg logit contributions
��-0.109
eteen-0.096
yip-0.094
��-0.090
earchers-0.088
Token themes
Token ID13460
Feature activation+1.11e-02

Pos logit contributions
+1.575
+1.322
+1.232
Reviewer+1.190
+1.172
Neg logit contributions
��-1.456
eteen-1.230
yip-1.213
��-1.156
earchers-1.115
Token of
Token ID286
Feature activation-1.03e-01

Pos logit contributions
��+0.030
eteen+0.025
yip+0.024
��+0.023
earchers+0.022
Neg logit contributions
-0.040
-0.035
-0.032
Reviewer-0.032
-0.031
Token pregnancy
Token ID10241
Feature activation-1.03e-01

Pos logit contributions
+0.378
+0.324
+0.304
Reviewer+0.296
+0.291
Neg logit contributions
��-0.270
eteen-0.223
yip-0.218
��-0.206
earchers-0.198
Token and
Token ID290
Feature activation+9.90e-03

Pos logit contributions
+0.378
+0.323
+0.303
Reviewer+0.295
+0.290
Neg logit contributions
��-0.274
eteen-0.227
yip-0.222
��-0.209
earchers-0.201
Token the
Token ID262
Feature activation-3.94e-02

Pos logit contributions
��+0.026
eteen+0.021
yip+0.021
��+0.020
earchers+0.019
Neg logit contributions
-0.037
-0.031
-0.029
Reviewer-0.029
-0.028
Token Gay
Token ID16013
Feature activation-8.49e-02

Pos logit contributions
+0.205
+0.175
+0.164
Reviewer+0.159
+0.156
Neg logit contributions
��-0.156
eteen-0.130
yip-0.127
��-0.120
earchers-0.116
Token Away
Token ID21986
Feature activation-7.48e-02

Pos logit contributions
+0.280
+0.235
+0.219
Reviewer+0.212
+0.208
Neg logit contributions
��-0.256
eteen-0.217
yip-0.213
��-0.203
earchers-0.196
Token
Token ID447
Feature activation-4.00e-02

Pos logit contributions
+0.209
+0.170
+0.155
Reviewer+0.149
+0.145
Neg logit contributions
��-0.263
eteen-0.229
yip-0.226
��-0.216
earchers-0.210
Token
Token ID247
Feature activation-4.28e-02

Pos logit contributions
+0.191
+0.170
+0.162
Reviewer+0.159
+0.158
Neg logit contributions
��-0.061
eteen-0.043
yip-0.041
��-0.036
earchers-0.033
Token ser
Token ID1055
Feature activation-8.36e-02

Pos logit contributions
+0.146
+0.124
+0.115
Reviewer+0.112
+0.110
Neg logit contributions
��-0.124
eteen-0.105
yip-0.102
��-0.097
earchers-0.094
Tokenmons
Token ID11567
Feature activation-4.75e-01

Pos logit contributions
+0.273
+0.229
+0.213
Reviewer+0.206
+0.202
Neg logit contributions
��-0.254
eteen-0.218
yip-0.212
��-0.201
earchers-0.196
Token,
Token ID11
Feature activation-3.09e-02

Pos logit contributions
+1.699
+1.449
+1.357
Reviewer+1.318
+1.299
Neg logit contributions
��-1.300
eteen-1.079
yip-1.058
��-1.002
earchers-0.965
Token the
Token ID262
Feature activation-2.17e-02

Pos logit contributions
+0.115
+0.099
+0.093
Reviewer+0.090
+0.089
Neg logit contributions
��-0.080
eteen-0.066
yip-0.064
��-0.060
earchers-0.058
Token man
Token ID582
Feature activation+2.13e-02

Pos logit contributions
+0.079
+0.068
+0.064
Reviewer+0.062
+0.061
Neg logit contributions
��-0.058
eteen-0.048
yip-0.047
��-0.044
earchers-0.042
Token who
Token ID508
Feature activation-2.13e-02

Pos logit contributions
��+0.060
eteen+0.050
yip+0.049
��+0.046
earchers+0.045
Neg logit contributions
-0.075
-0.064
-0.059
Reviewer-0.058
-0.057
Token once
Token ID1752
Feature activation-1.37e-02

Pos logit contributions
+0.076
+0.065
+0.061
Reviewer+0.059
+0.058
Neg logit contributions
��-0.058
eteen-0.048
yip-0.047
��-0.044
earchers-0.043
Token door
Token ID3420
Feature activation+4.57e-02

Pos logit contributions
��+0.045
eteen+0.037
yip+0.037
��+0.035
earchers+0.034
Neg logit contributions
-0.056
-0.047
-0.044
Reviewer-0.043
-0.042
Token.
Token ID13
Feature activation-2.70e-04

Pos logit contributions
��+0.116
eteen+0.095
yip+0.093
��+0.088
earchers+0.084
Neg logit contributions
-0.172
-0.148
-0.139
Reviewer-0.135
-0.133
Token
Token ID198
Feature activation-2.71e-02

Pos logit contributions
+0.001
+0.001
+0.001
Reviewer+0.001
+0.001
Neg logit contributions
��-0.001
eteen-0.001
yip-0.001
��-0.001
earchers-0.001
Token
Token ID198
Feature activation-4.12e-02

Pos logit contributions
+0.098
+0.083
+0.079
Reviewer+0.076
+0.075
Neg logit contributions
��-0.074
eteen-0.061
yip-0.060
��-0.058
earchers-0.055
TokenIdent
Token ID33234
Feature activation-7.36e-03

Pos logit contributions
+0.119
+0.098
+0.088
Reviewer+0.084
+0.082
Neg logit contributions
��-0.139
eteen-0.123
yip-0.119
��-0.113
earchers-0.111
Tokenifying
Token ID4035
Feature activation-4.74e-01

Pos logit contributions
+0.020
+0.016
+0.014
Reviewer+0.014
+0.013
Neg logit contributions
��-0.027
eteen-0.024
yip-0.023
��-0.022
earchers-0.022
Token tags
Token ID15940
Feature activation+3.26e-03

Pos logit contributions
+1.763
+1.512
+1.421
Reviewer+1.382
+1.363
Neg logit contributions
��-1.231
eteen-1.009
yip-0.989
��-0.935
earchers-0.895
Token and
Token ID290
Feature activation-5.90e-03

Pos logit contributions
��+0.008
eteen+0.007
yip+0.007
��+0.006
earchers+0.006
Neg logit contributions
-0.012
-0.010
-0.010
Reviewer-0.010
-0.009
Token Hebrew
Token ID16505
Feature activation-1.02e-01

Pos logit contributions
+0.020
+0.017
+0.016
Reviewer+0.015
+0.015
Neg logit contributions
��-0.017
eteen-0.014
yip-0.014
��-0.013
earchers-0.013
Token logos
Token ID29645
Feature activation-1.55e-02

Pos logit contributions
+0.314
+0.264
+0.242
Reviewer+0.237
+0.229
Neg logit contributions
��-0.329
eteen-0.284
yip-0.278
��-0.263
earchers-0.258
Token have
Token ID423
Feature activation-1.07e-02

Pos logit contributions
+0.060
+0.052
+0.049
Reviewer+0.047
+0.047
Neg logit contributions
��-0.038
eteen-0.031
yip-0.030
��-0.028
earchers-0.027
Token conditions
Token ID3403
Feature activation-1.16e-01

Pos logit contributions
+0.048
+0.041
+0.039
Reviewer+0.038
+0.037
Neg logit contributions
��-0.034
eteen-0.028
yip-0.028
��-0.026
earchers-0.025
Token added
Token ID2087
Feature activation+1.66e-02

Pos logit contributions
+0.426
+0.365
+0.343
Reviewer+0.333
+0.329
Neg logit contributions
��-0.303
eteen-0.249
yip-0.245
��-0.231
earchers-0.221
Token at
Token ID379
Feature activation+2.07e-02

Pos logit contributions
��+0.042
eteen+0.034
yip+0.033
��+0.031
earchers+0.030
Neg logit contributions
-0.063
-0.054
-0.051
Reviewer-0.050
-0.049
Token his
Token ID465
Feature activation+1.23e-02

Pos logit contributions
��+0.052
eteen+0.043
yip+0.042
��+0.039
earchers+0.038
Neg logit contributions
-0.078
-0.067
-0.063
Reviewer-0.062
-0.061
Token arra
Token ID44723
Feature activation-3.95e-02

Pos logit contributions
��+0.034
eteen+0.028
yip+0.027
��+0.026
earchers+0.025
Neg logit contributions
-0.044
-0.037
-0.035
Reviewer-0.034
-0.033
Tokenignment
Token ID16747
Feature activation-4.74e-01

Pos logit contributions
+0.141
+0.120
+0.112
Reviewer+0.109
+0.107
Neg logit contributions
��-0.108
eteen-0.091
yip-0.089
��-0.083
earchers-0.081
Token in
Token ID287
Feature activation-1.46e-02

Pos logit contributions
+1.753
+1.502
+1.412
Reviewer+1.372
+1.353
Neg logit contributions
��-1.241
eteen-1.021
yip-1.000
��-0.944
earchers-0.906
Token December
Token ID3426
Feature activation-1.24e-02

Pos logit contributions
+0.052
+0.044
+0.041
Reviewer+0.040
+0.039
Neg logit contributions
��-0.040
eteen-0.034
yip-0.033
��-0.031
earchers-0.030
Token 2016
Token ID1584
Feature activation+2.43e-03

Pos logit contributions
+0.046
+0.040
+0.037
Reviewer+0.036
+0.036
Neg logit contributions
��-0.032
eteen-0.026
yip-0.026
��-0.024
earchers-0.023
Token.
Token ID13
Feature activation+1.08e-03

Pos logit contributions
��+0.006
eteen+0.005
yip+0.005
��+0.005
earchers+0.004
Neg logit contributions
-0.009
-0.008
-0.007
Reviewer-0.007
-0.007
Token
Token ID198
Feature activation-6.50e-02

Pos logit contributions
��+0.003
eteen+0.002
yip+0.002
��+0.002
earchers+0.002
Neg logit contributions
-0.004
-0.003
-0.003
Reviewer-0.003
-0.003
Token
Token ID198
Feature activation+6.47e-02

Pos logit contributions
��+0.038
eteen+0.032
yip+0.031
��+0.030
earchers+0.029
Neg logit contributions
-0.046
-0.039
-0.036
Reviewer-0.035
-0.035
Token
Token ID198
Feature activation+6.75e-02

Pos logit contributions
��+0.180
eteen+0.148
yip+0.147
��+0.142
earchers+0.134
Neg logit contributions
-0.230
-0.196
-0.185
Reviewer-0.179
-0.178
Token
Token ID447
Feature activation+3.55e-02

Pos logit contributions
��+0.238
eteen+0.209
yip+0.204
��+0.195
maxwell+0.193
Neg logit contributions
-0.184
-0.149
-0.134
Reviewer-0.127
-0.125
Token
Token ID250
Feature activation+9.14e-02

Pos logit contributions
��+0.056
eteen+0.040
yip+0.039
��+0.034
earchers+0.032
Neg logit contributions
-0.167
-0.148
-0.142
Reviewer-0.139
-0.137
TokenKe
Token ID8896
Feature activation-5.61e-02

Pos logit contributions
��+0.308
eteen+0.268
yip+0.262
��+0.251
earchers+0.247
Neg logit contributions
-0.267
-0.219
-0.199
Reviewer-0.190
-0.187
Tokenpler
Token ID20053
Feature activation-4.71e-01

Pos logit contributions
+0.129
+0.099
+0.089
Reviewer+0.084
+0.082
Neg logit contributions
��-0.225
eteen-0.199
yip-0.197
��-0.190
earchers-0.185
Token-
Token ID12
Feature activation+7.53e-02

Pos logit contributions
+1.679
+1.429
+1.341
Reviewer+1.301
+1.282
Neg logit contributions
��-1.297
eteen-1.076
yip-1.058
��-1.001
earchers-0.960
Token10
Token ID940
Feature activation+1.05e-02

Pos logit contributions
��+0.258
eteen+0.226
yip+0.221
��+0.214
maxwell+0.208
Neg logit contributions
-0.214
-0.175
-0.158
Reviewer-0.152
-0.149
Tokenc
Token ID66
Feature activation+3.96e-02

Pos logit contributions
��+0.037
eteen+0.032
yip+0.032
��+0.030
earchers+0.030
Neg logit contributions
-0.029
-0.024
-0.022
Reviewer-0.021
-0.021
Token didn
Token ID1422
Feature activation+6.20e-02

Pos logit contributions
��+0.100
eteen+0.082
yip+0.080
��+0.075
earchers+0.072
Neg logit contributions
-0.150
-0.129
-0.121
Reviewer-0.118
-0.116
Token
Token ID447
Feature activation+3.13e-02

Pos logit contributions
��+0.157
eteen+0.126
yip+0.125
��+0.119
earchers+0.113
Neg logit contributions
-0.236
-0.204
-0.192
Reviewer-0.187
-0.185
Token to
Token ID284
Feature activation-3.83e-02

Pos logit contributions
+0.054
+0.047
+0.044
Reviewer+0.043
+0.042
Neg logit contributions
��-0.035
eteen-0.028
yip-0.028
��-0.026
earchers-0.025
Token wave
Token ID6769
Feature activation+9.61e-03

Pos logit contributions
+0.140
+0.119
+0.112
Reviewer+0.109
+0.107
Neg logit contributions
��-0.102
eteen-0.084
yip-0.083
��-0.078
earchers-0.075
Token away
Token ID1497
Feature activation-8.08e-02

Pos logit contributions
��+0.024
eteen+0.019
yip+0.019
��+0.018
earchers+0.017
Neg logit contributions
-0.037
-0.032
-0.030
Reviewer-0.029
-0.029
Token policy
Token ID2450
Feature activation-5.58e-02

Pos logit contributions
+0.300
+0.257
+0.241
Reviewer+0.235
+0.231
Neg logit contributions
��-0.211
eteen-0.174
yip-0.170
��-0.160
earchers-0.154
Token sill
Token ID49276
Feature activation+1.52e-02

Pos logit contributions
+0.171
+0.142
+0.131
Reviewer+0.127
+0.124
Neg logit contributions
��-0.180
eteen-0.155
yip-0.152
��-0.146
earchers-0.141
Tokeniness
Token ID1272
Feature activation-4.66e-01

Pos logit contributions
��+0.049
eteen+0.043
yip+0.042
��+0.040
earchers+0.039
Neg logit contributions
-0.047
-0.038
-0.035
Reviewer-0.034
-0.034
Token with
Token ID351
Feature activation-1.84e-02

Pos logit contributions
+1.730
+1.484
+1.398
Reviewer+1.356
+1.341
Neg logit contributions
��-1.212
eteen-0.992
yip-0.974
��-0.922
earchers-0.881
Token a
Token ID257
Feature activation-1.27e-02

Pos logit contributions
+0.069
+0.059
+0.055
Reviewer+0.054
+0.053
Neg logit contributions
��-0.047
eteen-0.039
yip-0.038
��-0.036
earchers-0.034
Token boys
Token ID6510
Feature activation-7.88e-02

Pos logit contributions
+0.045
+0.039
+0.036
Reviewer+0.035
+0.035
Neg logit contributions
��-0.034
eteen-0.029
yip-0.028
��-0.027
earchers-0.026
Token-
Token ID12
Feature activation-4.95e-02

Pos logit contributions
+0.272
+0.230
+0.215
Reviewer+0.209
+0.205
Neg logit contributions
��-0.225
eteen-0.189
yip-0.185
��-0.176
earchers-0.170
Tokenwill
Token ID10594
Feature activation+3.90e-04

Pos logit contributions
+0.167
+0.141
+0.131
Reviewer+0.127
+0.125
Neg logit contributions
��-0.145
eteen-0.123
yip-0.121
��-0.114
maxwell-0.111
Token density
Token ID12109
Feature activation-3.62e-02

Pos logit contributions
+0.250
+0.216
+0.204
Reviewer+0.198
+0.196
Neg logit contributions
��-0.153
eteen-0.123
yip-0.121
��-0.113
earchers-0.108
Token to
Token ID284
Feature activation-5.92e-02

Pos logit contributions
+0.132
+0.113
+0.106
Reviewer+0.103
+0.101
Neg logit contributions
��-0.096
eteen-0.080
yip-0.078
��-0.073
earchers-0.071
Token a
Token ID257
Feature activation-1.17e-02

Pos logit contributions
+0.229
+0.198
+0.186
Reviewer+0.181
+0.179
Neg logit contributions
��-0.145
eteen-0.117
yip-0.115
��-0.108
earchers-0.103
Token grid
Token ID10706
Feature activation-8.68e-03

Pos logit contributions
+0.044
+0.038
+0.036
Reviewer+0.035
+0.034
Neg logit contributions
��-0.029
eteen-0.024
yip-0.024
��-0.022
earchers-0.021
Token simpl
Token ID7106
Feature activation-1.38e-02

Pos logit contributions
+0.034
+0.029
+0.027
Reviewer+0.027
+0.026
Neg logit contributions
��-0.021
eteen-0.017
yip-0.017
��-0.016
earchers-0.015
Tokenifies
Token ID6945
Feature activation-4.55e-01

Pos logit contributions
+0.041
+0.034
+0.031
Reviewer+0.030
+0.030
Neg logit contributions
��-0.046
eteen-0.040
yip-0.039
��-0.037
earchers-0.036
Token computing
Token ID14492
Feature activation-1.29e-02

Pos logit contributions
+1.674
+1.433
+1.344
Reviewer+1.307
+1.289
Neg logit contributions
��-1.202
eteen-0.990
yip-0.969
��-0.914
earchers-0.879
Token spatial
Token ID21739
Feature activation-1.70e-02

Pos logit contributions
+0.048
+0.041
+0.038
Reviewer+0.037
+0.037
Neg logit contributions
��-0.034
eteen-0.028
yip-0.027
��-0.026
earchers-0.025
Token derivatives
Token ID28486
Feature activation+1.95e-03

Pos logit contributions
+0.059
+0.050
+0.047
Reviewer+0.046
+0.045
Neg logit contributions
��-0.048
eteen-0.040
yip-0.039
��-0.037
earchers-0.036
Token.
Token ID13
Feature activation-2.34e-02

Pos logit contributions
��+0.005
eteen+0.004
yip+0.004
��+0.004
earchers+0.004
Neg logit contributions
-0.007
-0.006
-0.006
Reviewer-0.006
-0.006
Token So
Token ID1406
Feature activation-2.01e-02

Pos logit contributions
+0.082
+0.070
+0.065
Reviewer+0.063
+0.062
Neg logit contributions
��-0.065
eteen-0.054
yip-0.053
��-0.051
earchers-0.049
Token set
Token ID900
Feature activation-3.63e-02

Pos logit contributions
+0.345
+0.293
+0.273
Reviewer+0.265
+0.261
Neg logit contributions
��-0.276
eteen-0.231
yip-0.227
��-0.215
earchers-0.207
Token of
Token ID286
Feature activation-8.70e-02

Pos logit contributions
+0.128
+0.108
+0.101
Reviewer+0.098
+0.097
Neg logit contributions
��-0.101
eteen-0.085
yip-0.083
��-0.079
earchers-0.076
Token background
Token ID4469
Feature activation-8.62e-02

Pos logit contributions
+0.289
+0.244
+0.226
Reviewer+0.219
+0.215
Neg logit contributions
��-0.260
eteen-0.220
yip-0.216
��-0.205
earchers-0.199
Token presupp
Token ID45077
Feature activation-7.95e-02

Pos logit contributions
+0.300
+0.255
+0.237
Reviewer+0.230
+0.227
Neg logit contributions
��-0.243
eteen-0.205
yip-0.200
��-0.189
earchers-0.183
Tokenos
Token ID418
Feature activation-8.64e-02

Pos logit contributions
+0.245
+0.205
+0.188
Reviewer+0.182
+0.178
Neg logit contributions
��-0.257
eteen-0.221
yip-0.216
��-0.206
earchers-0.203
Tokenitions
Token ID1756
Feature activation-4.53e-01

Pos logit contributions
+0.278
+0.232
+0.216
Reviewer+0.208
+0.204
Neg logit contributions
��-0.266
eteen-0.230
yip-0.223
��-0.212
earchers-0.208
Token insofar
Token ID44061
Feature activation-4.82e-02

Pos logit contributions
+1.655
+1.414
+1.331
Reviewer+1.291
+1.274
Neg logit contributions
��-1.205
eteen-0.992
yip-0.974
��-0.922
earchers-0.883
Token as
Token ID355
Feature activation-1.13e-02

Pos logit contributions
+0.192
+0.166
+0.157
Reviewer+0.153
+0.151
Neg logit contributions
��-0.113
eteen-0.091
yip-0.089
��-0.082
earchers-0.079
Token each
Token ID1123
Feature activation-1.08e-02

Pos logit contributions
+0.042
+0.036
+0.034
Reviewer+0.033
+0.032
Neg logit contributions
��-0.029
eteen-0.024
yip-0.024
��-0.022
earchers-0.022
Token is
Token ID318
Feature activation-2.35e-02

Pos logit contributions
+0.039
+0.034
+0.031
Reviewer+0.030
+0.030
Neg logit contributions
��-0.029
eteen-0.024
yip-0.023
��-0.022
earchers-0.021
Token reacting
Token ID33413
Feature activation+2.64e-02

Pos logit contributions
+0.087
+0.075
+0.070
Reviewer+0.069
+0.068
Neg logit contributions
��-0.061
eteen-0.050
yip-0.049
��-0.046
earchers-0.044
Token Startup
Token ID40472
Feature activation+1.15e-01

Pos logit contributions
+0.077
+0.065
+0.061
Reviewer+0.060
+0.058
Neg logit contributions
��-0.060
eteen-0.050
yip-0.049
��-0.047
earchers-0.045
Token India
Token ID3794
Feature activation-9.78e-02

Pos logit contributions
��+0.339
eteen+0.286
yip+0.282
��+0.267
earchers+0.258
Neg logit contributions
-0.387
-0.325
-0.303
Reviewer-0.295
-0.289
Token programme
Token ID11383
Feature activation-1.55e-01

Pos logit contributions
+0.326
+0.275
+0.256
Reviewer+0.248
+0.244
Neg logit contributions
��-0.290
eteen-0.245
yip-0.241
��-0.229
earchers-0.221
Token is
Token ID318
Feature activation-6.98e-02

Pos logit contributions
+0.583
+0.500
+0.470
Reviewer+0.457
+0.451
Neg logit contributions
��-0.397
eteen-0.325
yip-0.319
��-0.299
earchers-0.287
Token deb
Token ID1915
Feature activation+8.37e-02

Pos logit contributions
+0.275
+0.238
+0.227
Reviewer+0.221
+0.218
Neg logit contributions
��-0.164
eteen-0.131
yip-0.130
��-0.121
maxwell-0.115
Tokenatable
Token ID21156
Feature activation-4.50e-01

Pos logit contributions
��+0.215
eteen+0.176
yip+0.172
��+0.162
earchers+0.156
Neg logit contributions
-0.314
-0.269
-0.253
Reviewer-0.247
-0.243
Token,
Token ID11
Feature activation-3.89e-02

Pos logit contributions
+1.635
+1.397
+1.321
Reviewer+1.279
+1.264
Neg logit contributions
��-1.196
eteen-0.981
yip-0.970
��-0.921
earchers-0.879
Token but
Token ID475
Feature activation-6.23e-02

Pos logit contributions
+0.142
+0.122
+0.114
Reviewer+0.111
+0.109
Neg logit contributions
��-0.103
eteen-0.085
yip-0.083
��-0.079
earchers-0.076
Token there
Token ID612
Feature activation-2.12e-02

Pos logit contributions
+0.235
+0.202
+0.190
Reviewer+0.185
+0.182
Neg logit contributions
��-0.159
eteen-0.130
yip-0.127
��-0.119
earchers-0.115
Token is
Token ID318
Feature activation-9.59e-03

Pos logit contributions
+0.084
+0.072
+0.068
Reviewer+0.066
+0.066
Neg logit contributions
��-0.051
eteen-0.041
yip-0.040
��-0.037
earchers-0.036
Token a
Token ID257
Feature activation-2.20e-02

Pos logit contributions
+0.037
+0.032
+0.030
Reviewer+0.029
+0.029
Neg logit contributions
��-0.023
eteen-0.019
yip-0.019
��-0.017
earchers-0.017
Token some
Token ID617
Feature activation+4.12e-02

Pos logit contributions
+0.064
+0.055
+0.052
Reviewer+0.050
+0.050
Neg logit contributions
��-0.043
eteen-0.036
yip-0.035
��-0.033
earchers-0.031
Token availability
Token ID11500
Feature activation+7.58e-02

Pos logit contributions
��+0.111
eteen+0.093
yip+0.091
��+0.085
earchers+0.083
Neg logit contributions
-0.148
-0.126
-0.118
Reviewer-0.115
-0.113
Token on
Token ID319
Feature activation+4.07e-02

Pos logit contributions
��+0.205
eteen+0.171
yip+0.167
��+0.157
earchers+0.152
Neg logit contributions
-0.271
-0.232
-0.217
Reviewer-0.212
-0.207
Token its
Token ID663
Feature activation+1.41e-02

Pos logit contributions
��+0.109
eteen+0.090
yip+0.088
��+0.083
earchers+0.080
Neg logit contributions
-0.148
-0.126
-0.118
Reviewer-0.116
-0.113
Token cru
Token ID4630
Feature activation-1.26e-02

Pos logit contributions
��+0.041
eteen+0.035
yip+0.034
��+0.032
earchers+0.031
Neg logit contributions
-0.048
-0.040
-0.037
Reviewer-0.036
-0.036
Tokenises
Token ID2696
Feature activation-4.45e-01

Pos logit contributions
+0.032
+0.025
+0.022
Reviewer+0.021
+0.020
Neg logit contributions
��-0.047
eteen-0.042
yip-0.042
��-0.040
earchers-0.039
Token (
Token ID357
Feature activation+2.00e-02

Pos logit contributions
+1.618
+1.382
+1.297
Reviewer+1.261
+1.242
Neg logit contributions
��-1.189
eteen-0.984
yip-0.965
��-0.909
earchers-0.876
Token08
Token ID2919
Feature activation+3.47e-02

Pos logit contributions
��+0.067
eteen+0.059
yip+0.058
��+0.055
maxwell+0.054
Neg logit contributions
-0.058
-0.047
-0.044
Reviewer-0.042
-0.041
Token71
Token ID4869
Feature activation-3.36e-02

Pos logit contributions
��+0.136
eteen+0.120
yip+0.118
��+0.114
earchers+0.111
Neg logit contributions
-0.083
-0.065
-0.058
Reviewer-0.055
-0.053
Token 231
Token ID34598
Feature activation-1.59e-02

Pos logit contributions
+0.126
+0.108
+0.101
Reviewer+0.099
+0.097
Neg logit contributions
��-0.086
eteen-0.071
yip-0.069
��-0.065
earchers-0.063
Token 4
Token ID604
Feature activation+1.87e-02

Pos logit contributions
+0.061
+0.052
+0.049
Reviewer+0.048
+0.047
Neg logit contributions
��-0.040
eteen-0.032
yip-0.032
��-0.030
earchers-0.028
Token that
Token ID326
Feature activation-9.53e-02

Pos logit contributions
+0.030
+0.026
+0.024
Reviewer+0.023
+0.023
Neg logit contributions
��-0.021
eteen-0.017
yip-0.017
��-0.016
earchers-0.015
Token changes
Token ID2458
Feature activation+2.49e-02

Pos logit contributions
+0.349
+0.299
+0.280
Reviewer+0.272
+0.268
Neg logit contributions
��-0.252
eteen-0.208
yip-0.204
��-0.192
earchers-0.185
Token occur
Token ID3051
Feature activation-2.74e-02

Pos logit contributions
��+0.060
eteen+0.048
yip+0.047
��+0.044
earchers+0.042
Neg logit contributions
-0.097
-0.084
-0.079
Reviewer-0.077
-0.076
Token in
Token ID287
Feature activation+8.51e-02

Pos logit contributions
+0.101
+0.087
+0.081
Reviewer+0.079
+0.078
Neg logit contributions
��-0.072
eteen-0.059
yip-0.058
��-0.055
earchers-0.053
Token rehears
Token ID28779
Feature activation+3.92e-02

Pos logit contributions
��+0.222
eteen+0.183
yip+0.179
��+0.169
earchers+0.162
Neg logit contributions
-0.315
-0.270
-0.253
Reviewer-0.246
-0.243
Tokenals
Token ID874
Feature activation-4.30e-01

Pos logit contributions
��+0.122
eteen+0.106
yip+0.103
earchers+0.096
��+0.096
Neg logit contributions
-0.126
-0.103
-0.096
Reviewer-0.093
-0.091
Token and
Token ID290
Feature activation+9.69e-05

Pos logit contributions
+1.570
+1.346
+1.270
Reviewer+1.230
+1.217
Neg logit contributions
��-1.140
eteen-0.933
yip-0.918
��-0.875
maxwell-0.833
Token things
Token ID1243
Feature activation-3.12e-03

Pos logit contributions
��+0.000
eteen+0.000
yip+0.000
��+0.000
earchers+0.000
Neg logit contributions
-0.000
-0.000
-0.000
Reviewer-0.000
-0.000
Token that
Token ID326
Feature activation-2.41e-02

Pos logit contributions
+0.012
+0.010
+0.010
Reviewer+0.009
+0.009
Neg logit contributions
��-0.008
eteen-0.006
yip-0.006
��-0.006
earchers-0.006
Token are
Token ID389
Feature activation-2.14e-02

Pos logit contributions
+0.097
+0.084
+0.080
Reviewer+0.077
+0.077
Neg logit contributions
��-0.056
eteen-0.044
yip-0.043
��-0.040
maxwell-0.038
Token suggested
Token ID5220
Feature activation+2.51e-02

Pos logit contributions
+0.079
+0.067
+0.063
Reviewer+0.061
+0.061
Neg logit contributions
��-0.056
eteen-0.047
yip-0.046
��-0.043
earchers-0.041
TokenEL
Token ID3698
Feature activation-5.12e-02

Pos logit contributions
+0.365
+0.302
+0.280
Reviewer+0.270
+0.265
Neg logit contributions
��-0.384
eteen-0.329
yip-0.324
��-0.310
earchers-0.300
TokenVE
Token ID6089
Feature activation+1.97e-02

Pos logit contributions
+0.150
+0.123
+0.113
Reviewer+0.109
+0.107
Neg logit contributions
��-0.173
eteen-0.149
yip-0.147
��-0.140
earchers-0.137
Token)
Token ID8
Feature activation-5.23e-03

Pos logit contributions
��+0.056
eteen+0.047
yip+0.046
��+0.044
earchers+0.042
Neg logit contributions
-0.068
-0.057
-0.054
Reviewer-0.052
-0.051
Token quarterback
Token ID7880
Feature activation-7.99e-03

Pos logit contributions
+0.017
+0.015
+0.013
Reviewer+0.013
+0.013
Neg logit contributions
��-0.016
eteen-0.013
yip-0.013
��-0.012
earchers-0.012
Token hur
Token ID6990
Feature activation-5.14e-02

Pos logit contributions
+0.026
+0.022
+0.020
Reviewer+0.020
+0.020
Neg logit contributions
��-0.024
eteen-0.020
yip-0.020
��-0.019
earchers-0.018
Tokenries
Token ID1678
Feature activation-4.27e-01

Pos logit contributions
+0.189
+0.163
+0.150
Reviewer+0.147
+0.143
Neg logit contributions
��-0.137
eteen-0.115
yip-0.112
��-0.103
earchers-0.101
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
+1.567
+1.343
+1.265
Reviewer+1.223
+1.212
Neg logit contributions
��-1.125
eteen-0.922
yip-0.910
��-0.859
earchers-0.825
Token He
Token ID679
Feature activation-1.78e-02

Pos logit contributions
+0.079
+0.068
+0.063
Reviewer+0.062
+0.061
Neg logit contributions
��-0.057
eteen-0.048
yip-0.046
��-0.044
earchers-0.042
Token is
Token ID318
Feature activation-4.52e-03

Pos logit contributions
+0.065
+0.056
+0.053
Reviewer+0.051
+0.051
Neg logit contributions
��-0.047
eteen-0.038
yip-0.038
��-0.036
earchers-0.034
Token a
Token ID257
Feature activation-2.93e-02

Pos logit contributions
+0.017
+0.015
+0.014
Reviewer+0.014
+0.013
Neg logit contributions
��-0.011
eteen-0.009
yip-0.009
��-0.009
earchers-0.008
Token very
Token ID845
Feature activation+2.94e-03

Pos logit contributions
+0.112
+0.097
+0.091
Reviewer+0.089
+0.087
Neg logit contributions
��-0.073
eteen-0.059
yip-0.058
��-0.055
earchers-0.052
Token-
Token ID12
Feature activation+4.19e-02

Pos logit contributions
+0.079
+0.067
+0.063
Reviewer+0.061
+0.060
Neg logit contributions
��-0.063
eteen-0.053
yip-0.052
��-0.050
earchers-0.048
Token10
Token ID940
Feature activation+1.12e-02

Pos logit contributions
��+0.148
eteen+0.129
yip+0.127
��+0.122
maxwell+0.119
Neg logit contributions
-0.116
-0.094
-0.085
Reviewer-0.082
-0.080
Token planetary
Token ID27047
Feature activation+1.55e-03

Pos logit contributions
��+0.034
eteen+0.029
yip+0.028
��+0.027
earchers+0.026
Neg logit contributions
-0.037
-0.031
-0.029
Reviewer-0.028
-0.027
Token system
Token ID1080
Feature activation-5.31e-02

Pos logit contributions
��+0.005
eteen+0.004
yip+0.004
��+0.004
earchers+0.004
Neg logit contributions
-0.005
-0.004
-0.004
Reviewer-0.004
-0.004
Token revis
Token ID22124
Feature activation-2.92e-02

Pos logit contributions
+0.194
+0.166
+0.156
Reviewer+0.152
+0.149
Neg logit contributions
��-0.140
eteen-0.116
yip-0.113
��-0.107
earchers-0.103
Tokenited
Token ID863
Feature activation-4.27e-01

Pos logit contributions
+0.084
+0.069
+0.063
Reviewer+0.061
+0.059
Neg logit contributions
��-0.100
eteen-0.087
yip-0.085
��-0.081
earchers-0.080
Token by
Token ID416
Feature activation+2.38e-02

Pos logit contributions
+1.571
+1.345
+1.264
Reviewer+1.229
+1.211
Neg logit contributions
��-1.123
eteen-0.923
yip-0.906
��-0.855
earchers-0.820
Token HAR
Token ID43638
Feature activation-4.54e-02

Pos logit contributions
��+0.061
eteen+0.050
yip+0.049
��+0.047
earchers+0.045
Neg logit contributions
-0.089
-0.076
-0.071
Reviewer-0.070
-0.069
TokenPS
Token ID3705
Feature activation+6.25e-02

Pos logit contributions
+0.153
+0.129
+0.120
Reviewer+0.117
+0.114
Neg logit contributions
��-0.133
eteen-0.113
yip-0.110
��-0.105
earchers-0.102
Token-
Token ID12
Feature activation+2.10e-02

Pos logit contributions
��+0.172
eteen+0.143
yip+0.140
��+0.133
earchers+0.129
Neg logit contributions
-0.222
-0.189
-0.177
Reviewer-0.172
-0.169
TokenN
Token ID45
Feature activation+9.52e-02

Pos logit contributions
��+0.066
eteen+0.057
yip+0.056
��+0.053
earchers+0.052
Neg logit contributions
-0.066
-0.055
-0.051
Reviewer-0.049
-0.048
Token,
Token ID11
Feature activation-4.58e-03

Pos logit contributions
��+0.156
eteen+0.129
yip+0.126
��+0.119
earchers+0.114
Neg logit contributions
-0.218
-0.187
-0.175
Reviewer-0.171
-0.168
Token Egypt
Token ID6365
Feature activation+3.27e-02

Pos logit contributions
+0.016
+0.014
+0.013
Reviewer+0.013
+0.013
Neg logit contributions
��-0.012
eteen-0.010
yip-0.010
��-0.010
earchers-0.009
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
��+0.083
eteen+0.068
yip+0.066
��+0.062
earchers+0.060
Neg logit contributions
-0.123
-0.106
-0.099
Reviewer-0.097
-0.096
Token and
Token ID290
Feature activation+4.55e-02

Pos logit contributions
��+0.041
eteen+0.034
yip+0.033
��+0.031
earchers+0.030
Neg logit contributions
-0.060
-0.052
-0.048
Reviewer-0.047
-0.046
Token cru
Token ID4630
Feature activation-4.81e-02

Pos logit contributions
��+0.121
eteen+0.100
yip+0.098
��+0.092
earchers+0.089
Neg logit contributions
-0.167
-0.142
-0.134
Reviewer-0.130
-0.128
Tokenises
Token ID2696
Feature activation-4.26e-01

Pos logit contributions
+0.117
+0.091
+0.082
Reviewer+0.077
+0.076
Neg logit contributions
��-0.186
eteen-0.166
yip-0.163
��-0.156
earchers-0.153
Token via
Token ID2884
Feature activation+5.15e-02

Pos logit contributions
+1.605
+1.382
+1.302
Reviewer+1.262
+1.250
Neg logit contributions
��-1.085
eteen-0.883
yip-0.867
��-0.825
maxwell-0.782
Token A
Token ID317
Feature activation+7.64e-02

Pos logit contributions
��+0.141
eteen+0.117
yip+0.115
��+0.109
earchers+0.105
Neg logit contributions
-0.184
-0.157
-0.147
Reviewer-0.143
-0.140
Tokenq
Token ID80
Feature activation+3.19e-02

Pos logit contributions
��+0.242
eteen+0.208
yip+0.205
��+0.194
earchers+0.189
Neg logit contributions
-0.241
-0.199
-0.184
Reviewer-0.177
-0.174
Tokenaba
Token ID15498
Feature activation-4.94e-02

Pos logit contributions
��+0.116
eteen+0.101
yip+0.099
��+0.096
earchers+0.093
Neg logit contributions
-0.086
-0.069
-0.063
Reviewer-0.060
-0.059
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
+0.181
+0.155
+0.146
Reviewer+0.142
+0.139
Neg logit contributions
��-0.131
eteen-0.108
yip-0.106
��-0.100
earchers-0.096
Token and
Token ID290
Feature activation+2.74e-02

Pos logit contributions
��+0.121
eteen+0.099
yip+0.097
��+0.091
earchers+0.087
Neg logit contributions
-0.175
-0.150
-0.141
Reviewer-0.137
-0.135
Token golden
Token ID10861
Feature activation+3.84e-02

Pos logit contributions
��+0.065
eteen+0.052
yip+0.051
��+0.048
earchers+0.046
Neg logit contributions
-0.108
-0.094
-0.089
Reviewer-0.086
-0.085
Token on
Token ID319
Feature activation+4.70e-02

Pos logit contributions
��+0.100
eteen+0.082
yip+0.080
��+0.076
earchers+0.072
Neg logit contributions
-0.143
-0.123
-0.115
Reviewer-0.112
-0.110
Token the
Token ID262
Feature activation+1.83e-02

Pos logit contributions
��+0.120
eteen+0.098
yip+0.096
��+0.090
earchers+0.087
Neg logit contributions
-0.177
-0.152
-0.143
Reviewer-0.139
-0.137
Token bott
Token ID3005
Feature activation-1.11e-01

Pos logit contributions
��+0.050
eteen+0.042
yip+0.041
��+0.038
earchers+0.037
Neg logit contributions
-0.066
-0.056
-0.052
Reviewer-0.051
-0.050
Tokenoms
Token ID3150
Feature activation-4.21e-01

Pos logit contributions
+0.386
+0.328
+0.307
Reviewer+0.297
+0.293
Neg logit contributions
��-0.312
eteen-0.261
yip-0.256
��-0.242
earchers-0.234
Token.
Token ID13
Feature activation+6.23e-02

Pos logit contributions
+1.539
+1.315
+1.237
Reviewer+1.203
+1.187
Neg logit contributions
��-1.118
eteen-0.918
yip-0.903
��-0.855
earchers-0.819
Token The
Token ID383
Feature activation+1.78e-02

Pos logit contributions
��+0.166
eteen+0.137
yip+0.134
��+0.126
earchers+0.121
Neg logit contributions
-0.227
-0.194
-0.182
Reviewer-0.177
-0.174
Token tops
Token ID21246
Feature activation-6.91e-02

Pos logit contributions
��+0.047
eteen+0.039
yip+0.038
��+0.036
earchers+0.035
Neg logit contributions
-0.065
-0.056
-0.053
Reviewer-0.051
-0.050
Token of
Token ID286
Feature activation-2.65e-02

Pos logit contributions
+0.275
+0.239
+0.226
Reviewer+0.220
+0.218
Neg logit contributions
��-0.161
eteen-0.128
yip-0.125
��-0.117
earchers-0.111
Token the
Token ID262
Feature activation-9.81e-03

Pos logit contributions
+0.100
+0.086
+0.081
Reviewer+0.078
+0.077
Neg logit contributions
��-0.067
eteen-0.055
yip-0.054
��-0.051
earchers-0.048
Token 7
Token ID767
Feature activation+1.70e-02

Pos logit contributions
��+0.003
eteen+0.002
yip+0.002
��+0.002
earchers+0.002
Neg logit contributions
-0.005
-0.004
-0.004
Reviewer-0.004
-0.004
Token)
Token ID8
Feature activation-1.49e-03

Pos logit contributions
��+0.046
eteen+0.038
yip+0.037
��+0.035
maxwell+0.034
Neg logit contributions
-0.061
-0.052
-0.049
Reviewer-0.048
-0.047
Token
Token ID198
Feature activation-7.17e-02

Pos logit contributions
+0.005
+0.004
+0.004
Reviewer+0.004
+0.004
Neg logit contributions
��-0.004
eteen-0.004
yip-0.003
��-0.003
earchers-0.003
Token
Token ID198
Feature activation-5.31e-02

Pos logit contributions
+0.254
+0.216
+0.204
Reviewer+0.197
+0.196
Neg logit contributions
��-0.201
yip-0.165
eteen-0.165
��-0.159
earchers-0.151
TokenK
Token ID42
Feature activation-7.12e-03

Pos logit contributions
+0.151
+0.125
+0.112
Reviewer+0.107
+0.105
Neg logit contributions
��-0.181
eteen-0.159
yip-0.155
��-0.148
maxwell-0.145
Tokenoch
Token ID5374
Feature activation-4.21e-01

Pos logit contributions
+0.021
+0.018
+0.016
Reviewer+0.016
+0.015
Neg logit contributions
��-0.023
eteen-0.020
yip-0.020
��-0.019
earchers-0.018
Token companies
Token ID2706
Feature activation-1.85e-02

Pos logit contributions
+1.334
+1.113
+1.032
Reviewer+0.999
+0.979
Neg logit contributions
��-1.322
eteen-1.129
yip-1.110
��-1.059
earchers-1.027
Token letter
Token ID3850
Feature activation-4.30e-02

Pos logit contributions
+0.072
+0.062
+0.059
Reviewer+0.057
+0.056
Neg logit contributions
��-0.045
eteen-0.036
yip-0.036
��-0.033
earchers-0.032
Token to
Token ID284
Feature activation+4.06e-02

Pos logit contributions
+0.157
+0.134
+0.125
Reviewer+0.122
+0.120
Neg logit contributions
��-0.114
eteen-0.095
yip-0.093
��-0.087
earchers-0.084
Token Sen
Token ID2311
Feature activation+1.53e-02

Pos logit contributions
��+0.120
eteen+0.101
yip+0.099
��+0.094
earchers+0.091
Neg logit contributions
-0.136
-0.115
-0.107
Reviewer-0.104
-0.102
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
��+0.039
eteen+0.032
yip+0.031
��+0.029
earchers+0.028
Neg logit contributions
-0.058
-0.050
-0.047
Reviewer-0.046
-0.045
Token split
Token ID6626
Feature activation-1.09e-01

Pos logit contributions
+0.268
+0.230
+0.216
Reviewer+0.210
+0.207
Neg logit contributions
��-0.186
eteen-0.152
yip-0.149
��-0.140
earchers-0.135
Token first
Token ID717
Feature activation-2.17e-02

Pos logit contributions
+0.394
+0.337
+0.317
Reviewer+0.306
+0.303
Neg logit contributions
��-0.292
eteen-0.241
yip-0.237
��-0.224
maxwell-0.215
Token-
Token ID12
Feature activation-7.29e-02

Pos logit contributions
+0.076
+0.064
+0.060
Reviewer+0.058
+0.057
Neg logit contributions
��-0.061
eteen-0.051
yip-0.050
��-0.048
earchers-0.046
Tokenteam
Token ID15097
Feature activation-7.91e-02

Pos logit contributions
+0.254
+0.215
+0.201
Reviewer+0.195
+0.192
Neg logit contributions
��-0.207
eteen-0.173
yip-0.170
��-0.161
earchers-0.156
Token repet
Token ID46152
Feature activation-6.98e-02

Pos logit contributions
+0.281
+0.239
+0.225
Reviewer+0.218
+0.215
Neg logit contributions
��-0.218
eteen-0.180
yip-0.179
��-0.168
maxwell-0.162
Tokenitions
Token ID1756
Feature activation-4.20e-01

Pos logit contributions
+0.219
+0.182
+0.168
Reviewer+0.162
+0.159
Neg logit contributions
��-0.222
eteen-0.190
yip-0.187
��-0.178
earchers-0.173
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
+1.514
+1.295
+1.219
Reviewer+1.176
+1.167
Neg logit contributions
��-1.133
eteen-0.932
yip-0.921
��-0.873
earchers-0.836
Token
Token ID198
Feature activation-4.17e-02

Pos logit contributions
+0.063
+0.054
+0.050
Reviewer+0.049
+0.048
Neg logit contributions
��-0.048
eteen-0.040
yip-0.039
��-0.037
earchers-0.036
Token
Token ID198
Feature activation-3.87e-02

Pos logit contributions
+0.148
+0.126
+0.119
Reviewer+0.115
+0.115
Neg logit contributions
��-0.116
eteen-0.095
yip-0.095
��-0.091
earchers-0.087
TokenLyn
Token ID37207
Feature activation+4.40e-02

Pos logit contributions
+0.109
+0.089
+0.080
Reviewer+0.077
+0.075
Neg logit contributions
��-0.134
eteen-0.117
yip-0.114
��-0.110
maxwell-0.107
Tokench
Token ID354
Feature activation-6.33e-02

Pos logit contributions
��+0.139
yip+0.116
eteen+0.116
��+0.111
maxwell+0.107
Neg logit contributions
-0.139
-0.116
-0.109
-0.104
Reviewer-0.103
Token rage
Token ID14404
Feature activation+4.39e-02

Pos logit contributions
+0.166
+0.142
+0.133
Reviewer+0.130
+0.128
Neg logit contributions
��-0.118
eteen-0.097
yip-0.094
��-0.089
earchers-0.086
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
��+0.111
eteen+0.091
yip+0.089
��+0.083
earchers+0.080
Neg logit contributions
-0.166
-0.143
-0.134
Reviewer-0.130
-0.129
Token
Token ID198
Feature activation+1.14e-02

Pos logit contributions
��+0.031
eteen+0.026
yip+0.025
��+0.024
earchers+0.023
Neg logit contributions
-0.043
-0.036
-0.034
Reviewer-0.033
-0.033
Token
Token ID198
Feature activation+1.56e-02

Pos logit contributions
��+0.032
eteen+0.026
yip+0.026
��+0.025
earchers+0.024
Neg logit contributions
-0.041
-0.035
-0.033
Reviewer-0.032
-0.031
Tokenpolit
Token ID34470
Feature activation+4.07e-02

Pos logit contributions
��+0.051
eteen+0.045
yip+0.044
��+0.042
maxwell+0.041
Neg logit contributions
-0.047
-0.038
-0.035
Reviewer-0.033
-0.033
Tokenically
Token ID1146
Feature activation-4.16e-01

Pos logit contributions
��+0.134
eteen+0.117
yip+0.115
��+0.109
earchers+0.107
Neg logit contributions
-0.122
-0.100
-0.093
Reviewer-0.089
-0.087
Token correct
Token ID3376
Feature activation-2.76e-02

Pos logit contributions
+1.511
+1.292
+1.213
Reviewer+1.179
+1.162
Neg logit contributions
��-1.113
eteen-0.919
yip-0.901
��-0.852
earchers-0.819
Token o
Token ID267
Feature activation-4.69e-02

Pos logit contributions
+0.094
+0.079
+0.074
Reviewer+0.072
+0.070
Neg logit contributions
��-0.080
eteen-0.068
yip-0.066
��-0.063
earchers-0.061
Tokenomp
Token ID3361
Feature activation+4.97e-02

Pos logit contributions
+0.147
+0.122
+0.113
Reviewer+0.108
+0.107
Neg logit contributions
��-0.149
eteen-0.128
yip-0.125
��-0.119
earchers-0.116
Tokena
Token ID64
Feature activation+3.79e-02

Pos logit contributions
��+0.152
eteen+0.129
yip+0.126
��+0.120
earchers+0.117
Neg logit contributions
-0.163
-0.136
-0.126
Reviewer-0.123
-0.120
Token lo
Token ID2376
Feature activation-8.16e-03

Pos logit contributions
��+0.106
eteen+0.088
yip+0.086
��+0.081
maxwell+0.079
Neg logit contributions
-0.134
-0.113
-0.106
Reviewer-0.103
-0.101
Token creatures
Token ID8109
Feature activation-5.07e-03

Pos logit contributions
+0.302
+0.258
+0.242
Reviewer+0.235
+0.232
Neg logit contributions
��-0.221
eteen-0.182
yip-0.179
��-0.169
earchers-0.162
Token in
Token ID287
Feature activation+3.63e-02

Pos logit contributions
+0.019
+0.016
+0.015
Reviewer+0.015
+0.014
Neg logit contributions
��-0.013
eteen-0.011
yip-0.011
��-0.010
earchers-0.010
Token place
Token ID1295
Feature activation+2.27e-02

Pos logit contributions
��+0.093
eteen+0.076
yip+0.075
��+0.070
earchers+0.067
Neg logit contributions
-0.136
-0.117
-0.110
Reviewer-0.107
-0.105
Token while
Token ID981
Feature activation+3.77e-02

Pos logit contributions
��+0.063
eteen+0.052
yip+0.051
��+0.048
earchers+0.046
Neg logit contributions
-0.081
-0.069
-0.064
Reviewer-0.063
-0.061
Token ampl
Token ID12306
Feature activation-3.32e-02

Pos logit contributions
��+0.102
eteen+0.084
yip+0.083
��+0.078
earchers+0.075
Neg logit contributions
-0.136
-0.116
-0.109
Reviewer-0.106
-0.104
Tokenifying
Token ID4035
Feature activation-4.15e-01

Pos logit contributions
+0.084
+0.066
+0.059
Reviewer+0.057
+0.055
Neg logit contributions
��-0.126
eteen-0.112
yip-0.109
��-0.105
maxwell-0.104
Token damage
Token ID2465
Feature activation+4.20e-02

Pos logit contributions
+1.568
+1.353
+1.273
Reviewer+1.238
+1.222
Neg logit contributions
��-1.049
eteen-0.858
yip-0.836
��-0.790
earchers-0.757
Token from
Token ID422
Feature activation+8.29e-03

Pos logit contributions
��+0.107
eteen+0.088
yip+0.086
��+0.081
earchers+0.078
Neg logit contributions
-0.158
-0.136
-0.127
Reviewer-0.124
-0.122
Token other
Token ID584
Feature activation-1.01e-02

Pos logit contributions
��+0.022
eteen+0.018
yip+0.017
��+0.016
earchers+0.016
Neg logit contributions
-0.031
-0.026
-0.025
Reviewer-0.024
-0.024
Token weapons
Token ID3777
Feature activation+4.79e-03

Pos logit contributions
+0.035
+0.030
+0.028
Reviewer+0.027
+0.027
Neg logit contributions
��-0.028
eteen-0.024
yip-0.023
��-0.022
earchers-0.021
Token,
Token ID11
Feature activation-1.20e-03

Pos logit contributions
��+0.012
eteen+0.010
yip+0.010
��+0.009
earchers+0.009
Neg logit contributions
-0.018
-0.015
-0.014
Reviewer-0.014
-0.014
Tokenph
Token ID746
Feature activation+7.54e-02

Pos logit contributions
��+0.146
eteen+0.129
yip+0.127
��+0.121
earchers+0.119
Neg logit contributions
-0.102
-0.081
-0.073
Reviewer-0.070
-0.068
Tokenians
Token ID1547
Feature activation+3.56e-02

Pos logit contributions
��+0.173
eteen+0.138
yip+0.134
��+0.125
earchers+0.120
Neg logit contributions
-0.303
-0.264
-0.249
Reviewer-0.243
-0.239
Token use
Token ID779
Feature activation+4.71e-02

Pos logit contributions
��+0.102
eteen+0.085
yip+0.083
��+0.079
earchers+0.076
Neg logit contributions
-0.124
-0.105
-0.098
Reviewer-0.095
-0.093
Token a
Token ID257
Feature activation-4.19e-02

Pos logit contributions
��+0.117
eteen+0.095
yip+0.093
��+0.087
earchers+0.083
Neg logit contributions
-0.180
-0.156
-0.146
Reviewer-0.143
-0.140
Token neutral
Token ID8500
Feature activation-3.31e-02

Pos logit contributions
+0.137
+0.115
+0.107
Reviewer+0.104
+0.102
Neg logit contributions
��-0.127
eteen-0.108
yip-0.106
��-0.101
earchers-0.097
Tokenized
Token ID1143
Feature activation+3.89e-01

Pos logit contributions
+0.107
+0.090
+0.083
Reviewer+0.081
+0.079
Neg logit contributions
��-0.101
eteen-0.086
yip-0.085
��-0.081
earchers-0.078
Token short
Token ID1790
Feature activation+4.37e-02

Pos logit contributions
��+1.168
eteen+0.991
yip+0.972
��+0.925
earchers+0.890
Neg logit contributions
-1.288
-1.082
-1.003
Reviewer-0.978
-0.956
Token a
Token ID257
Feature activation+1.76e-03

Pos logit contributions
��+0.124
eteen+0.104
yip+0.102
��+0.097
earchers+0.093
Neg logit contributions
-0.151
-0.128
-0.120
Reviewer-0.116
-0.114
Token is
Token ID318
Feature activation-2.37e-03

Pos logit contributions
��+0.005
eteen+0.004
yip+0.004
��+0.004
earchers+0.004
Neg logit contributions
-0.006
-0.005
-0.005
Reviewer-0.004
-0.004
Token special
Token ID2041
Feature activation+1.40e-01

Pos logit contributions
+0.009
+0.008
+0.007
Reviewer+0.007
+0.007
Neg logit contributions
��-0.006
eteen-0.005
yip-0.005
��-0.004
earchers-0.004
Token to
Token ID284
Feature activation-2.62e-02

Pos logit contributions
��+0.356
eteen+0.294
yip+0.287
��+0.267
earchers+0.259
Neg logit contributions
-0.525
-0.449
-0.422
Reviewer-0.409
-0.404
TokenOPS
Token ID30737
Feature activation-7.10e-02

Pos logit contributions
��+0.044
eteen+0.037
yip+0.037
��+0.035
earchers+0.033
Neg logit contributions
-0.054
-0.046
-0.043
Reviewer-0.042
-0.041
Token on
Token ID319
Feature activation+3.13e-02

Pos logit contributions
+0.269
+0.231
+0.218
Reviewer+0.212
+0.209
Neg logit contributions
��-0.179
eteen-0.146
yip-0.143
��-0.135
earchers-0.129
Token AWS
Token ID30865
Feature activation+5.87e-02

Pos logit contributions
��+0.079
eteen+0.064
yip+0.063
��+0.059
earchers+0.056
Neg logit contributions
-0.119
-0.102
-0.096
Reviewer-0.094
-0.093
Token Cloud
Token ID10130
Feature activation+1.16e-02

Pos logit contributions
��+0.161
eteen+0.133
yip+0.131
��+0.124
earchers+0.119
Neg logit contributions
-0.210
-0.179
-0.168
Reviewer-0.163
-0.160
Token Inst
Token ID2262
Feature activation+9.62e-02

Pos logit contributions
��+0.032
eteen+0.026
yip+0.026
��+0.025
earchers+0.024
Neg logit contributions
-0.041
-0.035
-0.033
Reviewer-0.032
-0.031
Tokenance
Token ID590
Feature activation+3.61e-01

Pos logit contributions
��+0.243
eteen+0.198
yip+0.194
��+0.182
earchers+0.174
Neg logit contributions
-0.365
-0.314
-0.296
Reviewer-0.288
-0.284
Token.
Token ID13
Feature activation+3.77e-02

Pos logit contributions
��+0.949
eteen+0.787
yip+0.765
��+0.722
earchers+0.694
Neg logit contributions
-1.319
-1.131
-1.060
Reviewer-1.035
-1.015
Token
Token ID198
Feature activation+5.53e-02

Pos logit contributions
��+0.101
eteen+0.084
yip+0.082
��+0.078
earchers+0.075
Neg logit contributions
-0.137
-0.117
-0.109
Reviewer-0.106
-0.105
Token
Token ID198
Feature activation+5.96e-02

Pos logit contributions
��+0.154
eteen+0.127
yip+0.126
��+0.122
earchers+0.115
Neg logit contributions
-0.196
-0.167
-0.158
Reviewer-0.153
-0.152
TokenThis
Token ID1212
Feature activation-2.24e-02

Pos logit contributions
��+0.205
eteen+0.180
yip+0.175
��+0.169
maxwell+0.165
Neg logit contributions
-0.168
-0.137
-0.125
Reviewer-0.119
-0.117
Token is
Token ID318
Feature activation+4.89e-03

Pos logit contributions
+0.082
+0.070
+0.066
Reviewer+0.064
+0.063
Neg logit contributions
��-0.059
eteen-0.049
yip-0.048
��-0.045
earchers-0.044
Token
Token ID37929
Feature activation-2.01e-02

Pos logit contributions
��+0.308
eteen+0.262
yip+0.258
��+0.245
earchers+0.237
Neg logit contributions
-0.323
-0.270
-0.251
Reviewer-0.243
-0.238
Token Be
Token ID1355
Feature activation-4.72e-02

Pos logit contributions
+0.068
+0.057
+0.053
Reviewer+0.052
+0.051
Neg logit contributions
��-0.059
eteen-0.050
yip-0.049
��-0.046
earchers-0.045
Token safe
Token ID3338
Feature activation+6.22e-02

Pos logit contributions
+0.173
+0.149
+0.139
Reviewer+0.135
+0.134
Neg logit contributions
��-0.124
eteen-0.103
yip-0.100
��-0.095
earchers-0.091
Token pic
Token ID8301
Feature activation+7.39e-02

Pos logit contributions
��+0.172
eteen+0.143
yip+0.141
��+0.133
earchers+0.128
Neg logit contributions
-0.219
-0.187
-0.176
Reviewer-0.170
-0.167
Token.
Token ID13
Feature activation+7.54e-02

Pos logit contributions
��+0.196
eteen+0.160
yip+0.158
��+0.152
earchers+0.144
Neg logit contributions
-0.272
-0.234
-0.220
Reviewer-0.214
-0.212
Tokentwitter
Token ID6956
Feature activation+4.18e-01

Pos logit contributions
��+0.302
eteen+0.267
yip+0.263
��+0.254
earchers+0.248
Neg logit contributions
-0.174
-0.135
-0.120
Reviewer-0.114
-0.111
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
��+1.084
eteen+0.884
yip+0.873
��+0.831
earchers+0.790
Neg logit contributions
-1.561
-1.339
-1.264
Reviewer-1.226
-1.213
Tokencom
Token ID785
Feature activation-3.94e-02

Pos logit contributions
��+0.349
eteen+0.302
yip+0.298
��+0.286
earchers+0.278
Neg logit contributions
-0.291
-0.238
-0.218
Reviewer-0.210
-0.206
Token/
Token ID14
Feature activation+1.80e-01

Pos logit contributions
+0.147
+0.126
+0.119
+0.115
+0.115
Neg logit contributions
��-0.103
eteen-0.084
yip-0.083
��-0.081
earchers-0.076
Token74
Token ID4524
Feature activation-2.07e-02

Pos logit contributions
��+0.577
eteen+0.498
yip+0.491
��+0.466
earchers+0.453
Neg logit contributions
-0.555
-0.460
-0.425
Reviewer-0.409
-0.401
TokenE
Token ID36
Feature activation+1.02e-02

Pos logit contributions
+0.065
+0.054
+0.050
Reviewer+0.048
+0.047
Neg logit contributions
��-0.066
eteen-0.056
yip-0.056
��-0.053
earchers-0.051
Token,
Token ID11
Feature activation+2.55e-02

Pos logit contributions
��+0.005
eteen+0.005
yip+0.004
��+0.004
earchers+0.004
Neg logit contributions
-0.008
-0.007
-0.006
Reviewer-0.006
-0.006
Token the
Token ID262
Feature activation+5.05e-02

Pos logit contributions
��+0.068
eteen+0.056
yip+0.055
��+0.051
earchers+0.049
Neg logit contributions
-0.094
-0.080
-0.075
Reviewer-0.073
-0.072
Token R
Token ID371
Feature activation+7.55e-02

Pos logit contributions
��+0.137
eteen+0.113
yip+0.111
��+0.105
earchers+0.101
Neg logit contributions
-0.182
-0.155
-0.146
Reviewer-0.141
-0.139
Tokenuling
Token ID16619
Feature activation-1.17e-01

Pos logit contributions
��+0.161
eteen+0.125
yip+0.122
��+0.114
earchers+0.106
Neg logit contributions
-0.316
-0.276
-0.262
Reviewer-0.256
-0.253
Token Stew
Token ID12194
Feature activation+2.02e-02

Pos logit contributions
+0.428
+0.366
+0.345
Reviewer+0.334
+0.330
Neg logit contributions
��-0.311
eteen-0.256
yip-0.252
��-0.238
earchers-0.229
Tokenard
Token ID446
Feature activation+3.70e-01

Pos logit contributions
��+0.046
eteen+0.037
yip+0.036
��+0.034
earchers+0.032
Neg logit contributions
-0.081
-0.071
-0.067
Reviewer-0.065
-0.064
Token of
Token ID286
Feature activation+1.17e-02

Pos logit contributions
��+0.962
eteen+0.799
yip+0.774
��+0.731
earchers+0.698
Neg logit contributions
-1.371
-1.173
-1.095
Reviewer-1.073
-1.057
Token the
Token ID262
Feature activation+4.59e-02

Pos logit contributions
��+0.031
eteen+0.026
yip+0.025
��+0.024
earchers+0.023
Neg logit contributions
-0.043
-0.036
-0.034
Reviewer-0.033
-0.033
Token land
Token ID1956
Feature activation-2.12e-02

Pos logit contributions
��+0.126
eteen+0.105
yip+0.103
��+0.097
earchers+0.094
Neg logit contributions
-0.164
-0.140
-0.131
Reviewer-0.127
-0.125
Token of
Token ID286
Feature activation+6.66e-02

Pos logit contributions
+0.078
+0.067
+0.063
Reviewer+0.061
+0.061
Neg logit contributions
��-0.055
eteen-0.046
yip-0.045
��-0.042
earchers-0.040
Token G
Token ID402
Feature activation+9.91e-02

Pos logit contributions
��+0.177
eteen+0.146
yip+0.143
��+0.135
earchers+0.129
Neg logit contributions
-0.243
-0.208
-0.195
Reviewer-0.190
-0.187
Token be
Token ID307
Feature activation+5.20e-02

Pos logit contributions
��+0.100
eteen+0.081
yip+0.080
��+0.075
earchers+0.072
Neg logit contributions
-0.157
-0.135
-0.127
Reviewer-0.124
-0.122
Token hurt
Token ID5938
Feature activation-3.97e-02

Pos logit contributions
��+0.134
eteen+0.109
yip+0.107
��+0.101
earchers+0.097
Neg logit contributions
-0.195
-0.167
-0.157
Reviewer-0.153
-0.150
Token by
Token ID416
Feature activation+7.88e-02

Pos logit contributions
+0.153
+0.133
+0.125
Reviewer+0.121
+0.120
Neg logit contributions
��-0.097
eteen-0.079
yip-0.077
��-0.072
earchers-0.069
Token accidentally
Token ID14716
Feature activation+1.91e-02

Pos logit contributions
��+0.212
eteen+0.175
yip+0.172
��+0.162
earchers+0.157
Neg logit contributions
-0.286
-0.243
-0.229
Reviewer-0.222
-0.218
Token bump
Token ID13852
Feature activation-7.41e-02

Pos logit contributions
��+0.059
eteen+0.050
yip+0.049
��+0.046
earchers+0.045
Neg logit contributions
-0.062
-0.052
-0.048
Reviewer-0.047
-0.046
Tokening
Token ID278
Feature activation+3.67e-01

Pos logit contributions
+0.259
+0.219
+0.205
Reviewer+0.199
+0.196
Neg logit contributions
��-0.208
eteen-0.175
yip-0.172
��-0.162
maxwell-0.156
Token the
Token ID262
Feature activation+4.21e-02

Pos logit contributions
��+0.853
eteen+0.682
yip+0.669
��+0.621
earchers+0.593
Neg logit contributions
-1.462
-1.264
-1.197
Reviewer-1.165
-1.149
Token shif
Token ID7143
Feature activation-5.14e-02

Pos logit contributions
��+0.128
eteen+0.108
yip+0.106
��+0.101
earchers+0.098
Neg logit contributions
-0.138
-0.116
-0.108
Reviewer-0.104
-0.102
Tokenter
Token ID353
Feature activation+1.41e-01

Pos logit contributions
+0.173
+0.146
+0.136
Reviewer+0.132
+0.130
Neg logit contributions
��-0.152
eteen-0.127
yip-0.125
��-0.120
earchers-0.115
Token into
Token ID656
Feature activation+7.99e-02

Pos logit contributions
��+0.354
eteen+0.289
yip+0.283
��+0.264
earchers+0.254
Neg logit contributions
-0.538
-0.462
-0.435
Reviewer-0.424
-0.417
Token reverse
Token ID9575
Feature activation-9.77e-02

Pos logit contributions
��+0.192
eteen+0.155
yip+0.152
��+0.142
earchers+0.136
Neg logit contributions
-0.312
-0.269
-0.254
Reviewer-0.248
-0.244
Token,
Token ID11
Feature activation+2.31e-02

Pos logit contributions
+0.154
+0.132
+0.124
Reviewer+0.121
+0.119
Neg logit contributions
��-0.107
eteen-0.088
yip-0.086
��-0.081
earchers-0.078
Token who
Token ID508
Feature activation+4.06e-02

Pos logit contributions
��+0.065
eteen+0.054
yip+0.053
��+0.051
earchers+0.049
Neg logit contributions
-0.081
-0.069
-0.064
Reviewer-0.063
-0.062
Token flee
Token ID11562
Feature activation+8.48e-02

Pos logit contributions
��+0.100
eteen+0.081
yip+0.079
��+0.075
earchers+0.071
Neg logit contributions
-0.156
-0.135
-0.127
Reviewer-0.123
-0.122
Token into
Token ID656
Feature activation+1.77e-02

Pos logit contributions
��+0.215
eteen+0.176
yip+0.172
��+0.162
earchers+0.155
Neg logit contributions
-0.320
-0.275
-0.259
Reviewer-0.252
-0.249
Token the
Token ID262
Feature activation-1.12e-03

Pos logit contributions
��+0.046
eteen+0.038
yip+0.037
��+0.035
earchers+0.034
Neg logit contributions
-0.066
-0.056
-0.053
Reviewer-0.051
-0.051
Token "
Token ID366
Feature activation+8.09e-02

Pos logit contributions
+0.004
+0.004
+0.003
Reviewer+0.003
+0.003
Neg logit contributions
��-0.003
eteen-0.002
yip-0.002
��-0.002
maxwell-0.002
Tokenforest
Token ID29623
Feature activation-1.22e-02

Pos logit contributions
��+0.255
eteen+0.219
yip+0.215
��+0.205
earchers+0.199
Neg logit contributions
-0.255
-0.212
-0.196
Reviewer-0.189
-0.186
Token"
Token ID1
Feature activation+2.60e-02

Pos logit contributions
+0.041
+0.035
+0.032
Reviewer+0.031
+0.031
Neg logit contributions
��-0.036
eteen-0.031
yip-0.030
��-0.029
earchers-0.028
Token of
Token ID286
Feature activation+1.05e-02

Pos logit contributions
��+0.070
eteen+0.058
yip+0.057
��+0.054
earchers+0.052
Neg logit contributions
-0.094
-0.080
-0.075
Reviewer-0.073
-0.072
Token Hu
Token ID11256
Feature activation+1.33e-01

Pos logit contributions
��+0.028
eteen+0.023
yip+0.023
��+0.021
earchers+0.021
Neg logit contributions
-0.038
-0.033
-0.031
Reviewer-0.030
-0.030
Tokenorns
Token ID19942
Feature activation-2.72e-02

Pos logit contributions
��+0.333
eteen+0.270
yip+0.266
��+0.251
earchers+0.239
Neg logit contributions
-0.504
-0.434
-0.409
Reviewer-0.398
-0.392
Token the
Token ID262
Feature activation+3.84e-03

Pos logit contributions
��+0.243
eteen+0.198
yip+0.194
��+0.183
earchers+0.175
Neg logit contributions
-0.361
-0.311
-0.292
Reviewer-0.284
-0.280
Token accused
Token ID5371
Feature activation+1.83e-02

Pos logit contributions
��+0.011
eteen+0.009
yip+0.009
��+0.008
earchers+0.008
Neg logit contributions
-0.014
-0.012
-0.011
Reviewer-0.011
-0.010
Token for
Token ID329
Feature activation-1.69e-02

Pos logit contributions
��+0.047
eteen+0.038
yip+0.037
��+0.035
earchers+0.034
Neg logit contributions
-0.069
-0.059
-0.056
Reviewer-0.054
-0.053
Token causing
Token ID6666
Feature activation+8.86e-02

Pos logit contributions
+0.062
+0.053
+0.049
Reviewer+0.048
+0.047
Neg logit contributions
��-0.045
eteen-0.037
yip-0.037
��-0.035
earchers-0.033
Token destruction
Token ID8166
Feature activation+9.94e-02

Pos logit contributions
��+0.223
eteen+0.182
yip+0.179
��+0.168
earchers+0.160
Neg logit contributions
-0.336
-0.289
-0.272
Reviewer-0.266
-0.261
Token to
Token ID284
Feature activation+9.79e-02

Pos logit contributions
��+0.272
eteen+0.227
yip+0.221
��+0.210
earchers+0.202
Neg logit contributions
-0.355
-0.303
-0.283
Reviewer-0.276
-0.271
Token the
Token ID262
Feature activation+8.21e-03

Pos logit contributions
��+0.265
eteen+0.220
yip+0.216
��+0.204
earchers+0.195
Neg logit contributions
-0.353
-0.301
-0.283
Reviewer-0.275
-0.270
Token environment
Token ID2858
Feature activation+2.50e-02

Pos logit contributions
��+0.021
eteen+0.017
yip+0.016
��+0.015
earchers+0.015
Neg logit contributions
-0.031
-0.027
-0.025
Reviewer-0.025
-0.024
Token.
Token ID13
Feature activation+3.04e-02

Pos logit contributions
��+0.064
eteen+0.052
yip+0.051
��+0.048
earchers+0.046
Neg logit contributions
-0.094
-0.080
-0.076
Reviewer-0.074
-0.073
Token Later
Token ID11450
Feature activation+7.13e-02

Pos logit contributions
��+0.082
eteen+0.068
yip+0.066
��+0.063
earchers+0.060
Neg logit contributions
-0.110
-0.094
-0.088
Reviewer-0.085
-0.084
Token on
Token ID319
Feature activation+2.48e-02

Pos logit contributions
��+0.189
eteen+0.157
yip+0.153
��+0.145
earchers+0.139
Neg logit contributions
-0.260
-0.223
-0.209
Reviewer-0.203
-0.201
Token I
Token ID314
Feature activation-3.77e-03

Pos logit contributions
��+0.243
eteen+0.201
yip+0.197
��+0.186
earchers+0.178
Neg logit contributions
-0.343
-0.294
-0.276
Reviewer-0.269
-0.264
Token put
Token ID1234
Feature activation-4.88e-02

Pos logit contributions
+0.014
+0.012
+0.011
Reviewer+0.011
+0.011
Neg logit contributions
��-0.010
eteen-0.008
yip-0.008
��-0.008
earchers-0.007
Token out
Token ID503
Feature activation-8.40e-03

Pos logit contributions
+0.186
+0.161
+0.151
Reviewer+0.147
+0.145
Neg logit contributions
��-0.121
eteen-0.099
yip-0.097
��-0.091
earchers-0.087
Token a
Token ID257
Feature activation+2.32e-03

Pos logit contributions
+0.032
+0.028
+0.026
Reviewer+0.025
+0.025
Neg logit contributions
��-0.021
eteen-0.017
yip-0.017
��-0.016
earchers-0.015
Token little
Token ID1310
Feature activation+4.97e-02

Pos logit contributions
��+0.006
eteen+0.005
yip+0.005
��+0.005
earchers+0.005
Neg logit contributions
-0.008
-0.007
-0.007
Reviewer-0.006
-0.006
Token shed
Token ID14999
Feature activation+9.09e-02

Pos logit contributions
��+0.128
eteen+0.105
yip+0.103
��+0.097
earchers+0.093
Neg logit contributions
-0.185
-0.159
-0.149
Reviewer-0.145
-0.143
Token in
Token ID287
Feature activation-2.02e-02

Pos logit contributions
��+0.231
eteen+0.190
yip+0.186
��+0.174
earchers+0.168
Neg logit contributions
-0.342
-0.294
-0.276
Reviewer-0.269
-0.264
Token a
Token ID257
Feature activation-2.30e-02

Pos logit contributions
+0.074
+0.063
+0.059
Reviewer+0.058
+0.057
Neg logit contributions
��-0.054
eteen-0.044
yip-0.043
��-0.041
earchers-0.039
Token parking
Token ID7647
Feature activation+4.98e-02

Pos logit contributions
+0.083
+0.071
+0.067
Reviewer+0.065
+0.064
Neg logit contributions
��-0.062
eteen-0.051
yip-0.050
��-0.047
earchers-0.045
Token lot
Token ID1256
Feature activation+3.76e-03

Pos logit contributions
��+0.131
eteen+0.108
yip+0.106
��+0.100
earchers+0.096
Neg logit contributions
-0.183
-0.157
-0.147
Reviewer-0.143
-0.141
Token and
Token ID290
Feature activation-4.15e-02

Pos logit contributions
��+0.010
eteen+0.008
yip+0.008
��+0.007
earchers+0.007
Neg logit contributions
-0.014
-0.012
-0.011
Reviewer-0.011
-0.011
Token new
Token ID649
Feature activation-8.47e-02

Pos logit contributions
+0.076
+0.065
+0.061
Reviewer+0.059
+0.058
Neg logit contributions
��-0.058
eteen-0.048
yip-0.047
��-0.045
earchers-0.043
Token settlement
Token ID9443
Feature activation-7.14e-02

Pos logit contributions
+0.294
+0.249
+0.233
Reviewer+0.226
+0.223
Neg logit contributions
��-0.240
eteen-0.201
yip-0.197
��-0.187
earchers-0.180
Token is
Token ID318
Feature activation-1.41e-02

Pos logit contributions
+0.259
+0.221
+0.206
Reviewer+0.201
+0.197
Neg logit contributions
��-0.192
eteen-0.159
yip-0.156
��-0.146
earchers-0.142
Token devised
Token ID28964
Feature activation-1.47e-02

Pos logit contributions
+0.053
+0.046
+0.043
Reviewer+0.042
+0.041
Neg logit contributions
��-0.036
eteen-0.029
yip-0.028
��-0.027
earchers-0.026
Token to
Token ID284
Feature activation-5.28e-02

Pos logit contributions
+0.056
+0.048
+0.045
Reviewer+0.044
+0.043
Neg logit contributions
��-0.037
eteen-0.030
yip-0.030
��-0.028
earchers-0.027
Token Juda
Token ID24771
Feature activation+8.25e-02

Pos logit contributions
+0.192
+0.165
+0.155
Reviewer+0.150
+0.148
Neg logit contributions
��-0.141
eteen-0.116
yip-0.114
��-0.108
earchers-0.104
Tokenise
Token ID786
Feature activation+1.81e-02

Pos logit contributions
��+0.171
eteen+0.134
yip+0.129
��+0.119
earchers+0.113
Neg logit contributions
-0.349
-0.305
-0.289
Reviewer-0.283
-0.279
Token the
Token ID262
Feature activation-5.35e-02

Pos logit contributions
��+0.047
eteen+0.038
yip+0.038
��+0.036
earchers+0.034
Neg logit contributions
-0.068
-0.058
-0.055
Reviewer-0.053
-0.052
Token city
Token ID1748
Feature activation-7.43e-02

Pos logit contributions
+0.206
+0.178
+0.168
Reviewer+0.163
+0.161
Neg logit contributions
��-0.132
eteen-0.107
yip-0.105
��-0.099
earchers-0.094
Token.
Token ID13
Feature activation-3.25e-02

Pos logit contributions
+0.271
+0.232
+0.217
Reviewer+0.211
+0.208
Neg logit contributions
��-0.198
eteen-0.164
yip-0.160
��-0.151
earchers-0.146
Token The
Token ID383
Feature activation-3.81e-02

Pos logit contributions
+0.116
+0.099
+0.093
Reviewer+0.090
+0.089
Neg logit contributions
��-0.088
eteen-0.073
yip-0.072
��-0.068
earchers-0.065
Token Carly
Token ID35982
Feature activation+2.83e-02

Pos logit contributions
��+0.235
eteen+0.202
yip+0.197
��+0.187
earchers+0.183
Neg logit contributions
-0.243
-0.203
-0.188
Reviewer-0.184
-0.179
Token Fiorina
Token ID49028
Feature activation+9.74e-02

Pos logit contributions
��+0.090
eteen+0.077
yip+0.075
��+0.072
earchers+0.070
Neg logit contributions
-0.089
-0.074
-0.068
Reviewer-0.066
-0.065
Token's
Token ID338
Feature activation-1.08e-02

Pos logit contributions
��+0.265
eteen+0.221
yip+0.216
��+0.204
earchers+0.198
Neg logit contributions
-0.349
-0.298
-0.278
Reviewer-0.272
-0.266
Token political
Token ID1964
Feature activation+6.61e-02

Pos logit contributions
+0.039
+0.033
+0.031
Reviewer+0.031
+0.030
Neg logit contributions
��-0.029
eteen-0.024
yip-0.024
��-0.022
earchers-0.022
Token career
Token ID3451
Feature activation+5.88e-02

Pos logit contributions
��+0.176
eteen+0.148
yip+0.144
��+0.135
earchers+0.131
Neg logit contributions
-0.239
-0.205
-0.192
Reviewer-0.187
-0.183
Token Fiorina
Token ID49028
Feature activation+1.09e-01

Pos logit contributions
��+0.190
eteen+0.165
yip+0.161
��+0.154
earchers+0.151
Neg logit contributions
-0.179
-0.148
-0.137
Reviewer-0.133
-0.129
Token,
Token ID11
Feature activation+2.25e-02

Pos logit contributions
��+0.296
eteen+0.250
yip+0.242
��+0.229
earchers+0.222
Neg logit contributions
-0.386
-0.329
-0.306
Reviewer-0.302
-0.293
Token right
Token ID826
Feature activation+5.61e-02

Pos logit contributions
��+0.059
eteen+0.049
yip+0.047
��+0.044
earchers+0.043
Neg logit contributions
-0.083
-0.071
-0.066
Reviewer-0.065
-0.064
Token,
Token ID11
Feature activation+2.49e-02

Pos logit contributions
��+0.146
eteen+0.121
yip+0.118
��+0.110
earchers+0.106
Neg logit contributions
-0.209
-0.178
-0.167
Reviewer-0.163
-0.160
Token poses
Token ID17313
Feature activation+1.44e-01

Pos logit contributions
��+0.071
eteen+0.061
yip+0.059
��+0.056
earchers+0.054
Neg logit contributions
-0.085
-0.072
-0.066
Reviewer-0.066
-0.063
Token for
Token ID329
Feature activation+2.24e-02

Pos logit contributions
��+0.369
eteen+0.307
yip+0.296
��+0.276
earchers+0.271
Neg logit contributions
-0.535
-0.460
-0.428
Reviewer-0.423
-0.409
Token in
Token ID287
Feature activation-5.50e-02

Pos logit contributions
��+0.324
eteen+0.266
yip+0.260
��+0.242
earchers+0.230
Neg logit contributions
-0.511
-0.445
-0.416
Reviewer-0.413
-0.400
Token large
Token ID1588
Feature activation-1.83e-04

Pos logit contributions
+0.203
+0.174
+0.163
Reviewer+0.160
+0.156
Neg logit contributions
��-0.144
eteen-0.120
yip-0.116
��-0.110
earchers-0.106
Token red
Token ID2266
Feature activation+8.65e-03

Pos logit contributions
+0.001
+0.001
+0.000
Reviewer+0.000
+0.000
Neg logit contributions
��-0.001
eteen-0.000
yip-0.000
��-0.000
earchers-0.000
Token letters
Token ID7475
Feature activation-8.25e-03

Pos logit contributions
��+0.025
eteen+0.021
yip+0.021
��+0.020
earchers+0.019
Neg logit contributions
-0.029
-0.025
-0.023
Reviewer-0.023
-0.022
Token on
Token ID319
Feature activation+4.07e-02

Pos logit contributions
+0.031
+0.027
+0.025
Reviewer+0.025
+0.024
Neg logit contributions
��-0.021
eteen-0.017
yip-0.017
��-0.015
earchers-0.015
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
��+0.104
eteen+0.085
yip+0.083
��+0.078
earchers+0.075
Neg logit contributions
-0.154
-0.132
-0.124
Reviewer-0.121
-0.119
Token structure
Token ID4645
Feature activation-5.48e-02

Pos logit contributions
��+0.063
eteen+0.053
yip+0.051
��+0.049
earchers+0.047
Neg logit contributions
-0.074
-0.063
-0.059
Reviewer-0.057
-0.056
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
+0.195
+0.166
+0.156
Reviewer+0.152
+0.148
Neg logit contributions
��-0.150
eteen-0.126
yip-0.123
��-0.116
earchers-0.112
Token also
Token ID635
Feature activation+5.01e-03

Pos logit contributions
+0.068
+0.058
+0.054
Reviewer+0.053
+0.052
Neg logit contributions
��-0.050
eteen-0.042
yip-0.041
��-0.038
earchers-0.037
Token on
Token ID319
Feature activation+1.10e-02

Pos logit contributions
��+0.013
eteen+0.011
yip+0.011
��+0.010
earchers+0.010
Neg logit contributions
-0.018
-0.016
-0.015
Reviewer-0.014
-0.014
Token the
Token ID262
Feature activation+1.95e-03

Pos logit contributions
��+0.028
eteen+0.023
yip+0.023
��+0.021
earchers+0.021
Neg logit contributions
-0.041
-0.036
-0.033
Reviewer-0.033
-0.032
Tokeniva
Token ID12151
Feature activation-3.84e-02

Pos logit contributions
+0.059
+0.049
+0.046
Reviewer+0.044
+0.044
Neg logit contributions
��-0.055
eteen-0.047
yip-0.046
��-0.043
maxwell-0.042
Token).
Token ID737
Feature activation+6.78e-02

Pos logit contributions
+0.134
+0.114
+0.107
Reviewer+0.103
+0.102
Neg logit contributions
��-0.108
eteen-0.091
yip-0.089
��-0.084
maxwell-0.081
Token
Token ID198
Feature activation+1.74e-02

Pos logit contributions
��+0.186
eteen+0.156
yip+0.153
��+0.144
maxwell+0.139
Neg logit contributions
-0.240
-0.204
-0.192
Reviewer-0.186
-0.183
Token
Token ID198
Feature activation+1.33e-02

Pos logit contributions
��+0.048
eteen+0.039
yip+0.039
��+0.037
earchers+0.036
Neg logit contributions
-0.062
-0.053
-0.050
Reviewer-0.048
-0.048
Token3
Token ID18
Feature activation-1.53e-02

Pos logit contributions
��+0.044
eteen+0.038
yip+0.038
��+0.036
maxwell+0.035
Neg logit contributions
-0.039
-0.032
-0.030
Reviewer-0.028
-0.028
Token.
Token ID13
Feature activation-6.61e-02

Pos logit contributions
+0.054
+0.047
+0.043
Reviewer+0.042
+0.042
Neg logit contributions
��-0.042
eteen-0.035
yip-0.034
��-0.032
earchers-0.031
Token Xen
Token ID21173
Feature activation-7.06e-02

Pos logit contributions
+0.217
+0.182
+0.169
Reviewer+0.164
+0.161
Neg logit contributions
��-0.200
eteen-0.170
yip-0.167
��-0.159
maxwell-0.153
Token H
Token ID367
Feature activation-3.56e-02

Pos logit contributions
+0.239
+0.202
+0.188
Reviewer+0.183
+0.179
Neg logit contributions
��-0.206
eteen-0.174
yip-0.171
��-0.162
earchers-0.156
TokenVM
Token ID15996
Feature activation+4.18e-02

Pos logit contributions
+0.113
+0.095
+0.087
Reviewer+0.084
+0.083
Neg logit contributions
��-0.111
eteen-0.096
yip-0.094
��-0.090
earchers-0.087
Token 3
Token ID513
Feature activation-2.13e-02

Pos logit contributions
��+0.114
eteen+0.095
yip+0.093
��+0.088
earchers+0.085
Neg logit contributions
-0.149
-0.127
-0.119
Reviewer-0.116
-0.114
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
+0.080
+0.069
+0.065
Reviewer+0.064
+0.062
Neg logit contributions
��-0.054
eteen-0.044
yip-0.043
��-0.041
earchers-0.039
Token percent
Token ID1411
Feature activation-3.23e-02

Pos logit contributions
��+0.293
eteen+0.250
yip+0.244
��+0.233
earchers+0.225
Neg logit contributions
-0.317
-0.266
-0.248
Reviewer-0.240
-0.235
Token may
Token ID743
Feature activation+4.81e-03

Pos logit contributions
+0.117
+0.100
+0.093
Reviewer+0.091
+0.089
Neg logit contributions
��-0.087
eteen-0.072
yip-0.071
��-0.067
earchers-0.064
Token not
Token ID407
Feature activation+7.52e-03

Pos logit contributions
��+0.013
eteen+0.011
yip+0.010
��+0.010
earchers+0.009
Neg logit contributions
-0.018
-0.015
-0.014
Reviewer-0.014
-0.014
Token be
Token ID307
Feature activation-2.01e-02

Pos logit contributions
��+0.020
eteen+0.016
yip+0.016
��+0.015
earchers+0.014
Neg logit contributions
-0.028
-0.024
-0.022
Reviewer-0.022
-0.021
Token a
Token ID257
Feature activation+2.63e-02

Pos logit contributions
+0.078
+0.067
+0.063
Reviewer+0.062
+0.061
Neg logit contributions
��-0.049
eteen-0.040
yip-0.039
��-0.037
earchers-0.035
Token res
Token ID581
Feature activation-8.94e-03

Pos logit contributions
��+0.067
eteen+0.054
yip+0.053
��+0.050
earchers+0.048
Neg logit contributions
-0.099
-0.086
-0.081
Reviewer-0.078
-0.077
Tokenounding
Token ID9969
Feature activation-3.80e-02

Pos logit contributions
+0.030
+0.026
+0.024
Reviewer+0.023
+0.023
Neg logit contributions
��-0.026
eteen-0.022
yip-0.022
��-0.020
earchers-0.020
Token endorsement
Token ID17819
Feature activation+2.69e-02

Pos logit contributions
+0.132
+0.112
+0.104
Reviewer+0.101
+0.100
Neg logit contributions
��-0.108
eteen-0.090
yip-0.089
��-0.084
earchers-0.081
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
��+0.074
eteen+0.061
yip+0.060
��+0.057
earchers+0.055
Neg logit contributions
-0.096
-0.082
-0.077
Reviewer-0.075
-0.074
Token it
Token ID340
Feature activation+1.66e-02

Pos logit contributions
��+0.029
eteen+0.024
yip+0.024
��+0.022
earchers+0.021
Neg logit contributions
-0.044
-0.038
-0.036
Reviewer-0.035
-0.034
Token topped
Token ID20633
Feature activation-1.25e-02

Pos logit contributions
��+0.043
eteen+0.035
yip+0.034
��+0.032
earchers+0.031
Neg logit contributions
-0.062
-0.053
-0.050
Reviewer-0.049
-0.048
Token in
Token ID287
Feature activation+2.32e-02

Pos logit contributions
��+0.004
eteen+0.003
yip+0.003
��+0.003
earchers+0.003
Neg logit contributions
-0.005
-0.005
-0.004
Reviewer-0.004
-0.004
Token his
Token ID465
Feature activation+9.15e-03

Pos logit contributions
��+0.058
eteen+0.048
yip+0.046
��+0.044
earchers+0.042
Neg logit contributions
-0.088
-0.076
-0.072
Reviewer-0.070
-0.069
Token stories
Token ID3923
Feature activation-9.22e-02

Pos logit contributions
��+0.027
eteen+0.023
yip+0.023
��+0.022
earchers+0.021
Neg logit contributions
-0.030
-0.026
-0.024
Reviewer-0.023
-0.023
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
+0.343
+0.294
+0.276
Reviewer+0.269
+0.264
Neg logit contributions
��-0.239
eteen-0.197
yip-0.192
��-0.181
earchers-0.174
Token nor
Token ID4249
Feature activation-4.80e-02

Pos logit contributions
��+0.044
eteen+0.036
yip+0.035
��+0.033
earchers+0.032
Neg logit contributions
-0.061
-0.052
-0.049
Reviewer-0.048
-0.047
Token does
Token ID857
Feature activation-2.22e-02

Pos logit contributions
+0.176
+0.151
+0.141
Reviewer+0.138
+0.136
Neg logit contributions
��-0.126
eteen-0.104
yip-0.102
��-0.096
earchers-0.092
Token he
Token ID339
Feature activation+4.20e-02

Pos logit contributions
+0.076
+0.064
+0.060
Reviewer+0.058
+0.057
Neg logit contributions
��-0.064
eteen-0.054
yip-0.053
��-0.050
earchers-0.048
Token sugar
Token ID7543
Feature activation-3.38e-02

Pos logit contributions
��+0.114
eteen+0.095
yip+0.093
��+0.088
earchers+0.084
Neg logit contributions
-0.151
-0.129
-0.121
Reviewer-0.118
-0.116
Tokencoat
Token ID31434
Feature activation+1.02e-01

Pos logit contributions
+0.122
+0.103
+0.097
Reviewer+0.094
+0.093
Neg logit contributions
��-0.092
eteen-0.076
yip-0.075
��-0.070
earchers-0.068
Token the
Token ID262
Feature activation+3.51e-02

Pos logit contributions
��+0.253
eteen+0.206
yip+0.201
��+0.189
earchers+0.181
Neg logit contributions
-0.388
-0.333
-0.314
Reviewer-0.307
-0.301
Token realities
Token ID22072
Feature activation-2.84e-03

Pos logit contributions
��+0.093
eteen+0.077
yip+0.075
��+0.071
earchers+0.068
Neg logit contributions
-0.128
-0.109
-0.103
Reviewer-0.100
-0.098
Token
Token ID198
Feature activation-1.17e-02

Pos logit contributions
+0.124
+0.106
+0.100
Reviewer+0.097
+0.096
Neg logit contributions
��-0.099
eteen-0.082
yip-0.082
��-0.079
earchers-0.075
TokenMake
Token ID12050
Feature activation+5.74e-02

Pos logit contributions
+0.036
+0.030
+0.028
Reviewer+0.027
+0.026
Neg logit contributions
��-0.037
eteen-0.032
yip-0.031
��-0.030
maxwell-0.029
Token a
Token ID257
Feature activation-2.62e-03

Pos logit contributions
��+0.140
eteen+0.114
yip+0.111
��+0.104
earchers+0.100
Neg logit contributions
-0.222
-0.192
-0.181
Reviewer-0.176
-0.174
Token break
Token ID2270
Feature activation+6.79e-02

Pos logit contributions
+0.010
+0.008
+0.008
Reviewer+0.008
+0.007
Neg logit contributions
��-0.007
eteen-0.006
yip-0.006
��-0.005
earchers-0.005
Token for
Token ID329
Feature activation+1.09e-03

Pos logit contributions
��+0.177
eteen+0.145
yip+0.142
��+0.134
earchers+0.129
Neg logit contributions
-0.252
-0.216
-0.203
Reviewer-0.197
-0.195
Token it
Token ID340
Feature activation-1.08e-02

Pos logit contributions
��+0.003
eteen+0.002
yip+0.002
��+0.002
earchers+0.002
Neg logit contributions
-0.004
-0.004
-0.003
Reviewer-0.003
-0.003
Token (
Token ID357
Feature activation+1.38e-02

Pos logit contributions
+0.040
+0.034
+0.032
Reviewer+0.031
+0.031
Neg logit contributions
��-0.029
eteen-0.024
yip-0.023
��-0.022
earchers-0.021
Token3
Token ID18
Feature activation-4.74e-02

Pos logit contributions
��+0.047
eteen+0.041
yip+0.041
��+0.039
maxwell+0.038
Neg logit contributions
-0.039
-0.032
-0.029
Reviewer-0.028
-0.027
Token)
Token ID8
Feature activation-3.47e-03

Pos logit contributions
+0.159
+0.134
+0.125
Reviewer+0.121
+0.120
Neg logit contributions
��-0.141
eteen-0.118
yip-0.116
��-0.111
earchers-0.107
Token
Token ID198
Feature activation-3.37e-02

Pos logit contributions
+0.013
+0.011
+0.010
Reviewer+0.010
+0.010
Neg logit contributions
��-0.009
eteen-0.008
yip-0.007
��-0.007
earchers-0.007
Token
Token ID198
Feature activation-5.79e-03

Pos logit contributions
+0.118
+0.101
+0.095
Reviewer+0.092
+0.092
Neg logit contributions
��-0.095
eteen-0.078
yip-0.078
��-0.075
earchers-0.071
Token be
Token ID307
Feature activation-4.73e-02

Pos logit contributions
+0.147
+0.126
+0.118
Reviewer+0.115
+0.113
Neg logit contributions
��-0.106
eteen-0.088
yip-0.086
��-0.081
earchers-0.078
Token in
Token ID287
Feature activation-3.09e-02

Pos logit contributions
+0.171
+0.145
+0.135
Reviewer+0.133
+0.130
Neg logit contributions
��-0.129
eteen-0.107
yip-0.104
��-0.099
earchers-0.095
Token a
Token ID257
Feature activation-3.51e-02

Pos logit contributions
+0.120
+0.103
+0.097
Reviewer+0.095
+0.093
Neg logit contributions
��-0.075
eteen-0.061
yip-0.060
��-0.056
earchers-0.054
Token fren
Token ID22832
Feature activation-6.23e-02

Pos logit contributions
+0.129
+0.111
+0.104
Reviewer+0.101
+0.099
Neg logit contributions
��-0.092
eteen-0.076
yip-0.074
��-0.070
earchers-0.068
Tokenz
Token ID89
Feature activation-3.80e-02

Pos logit contributions
+0.209
+0.176
+0.163
Reviewer+0.158
+0.155
Neg logit contributions
��-0.184
eteen-0.159
yip-0.155
��-0.146
earchers-0.142
Tokenied
Token ID798
Feature activation-2.29e-01

Pos logit contributions
+0.139
+0.119
+0.111
Reviewer+0.108
+0.107
Neg logit contributions
��-0.102
eteen-0.083
yip-0.082
��-0.078
earchers-0.075
Token state
Token ID1181
Feature activation+2.63e-02

Pos logit contributions
+0.854
+0.732
+0.689
Reviewer+0.670
+0.660
Neg logit contributions
��-0.593
eteen-0.486
yip-0.477
��-0.449
earchers-0.431
Token as
Token ID355
Feature activation-4.49e-02

Pos logit contributions
��+0.065
eteen+0.054
yip+0.052
��+0.049
earchers+0.047
Neg logit contributions
-0.101
-0.087
-0.081
Reviewer-0.080
-0.078
Token he
Token ID339
Feature activation-4.43e-02

Pos logit contributions
+0.158
+0.134
+0.124
Reviewer+0.122
+0.119
Neg logit contributions
��-0.126
eteen-0.106
yip-0.104
��-0.099
earchers-0.095
Token began
Token ID2540
Feature activation-8.22e-02

Pos logit contributions
+0.156
+0.133
+0.123
Reviewer+0.122
+0.118
Neg logit contributions
��-0.123
eteen-0.104
yip-0.101
��-0.095
earchers-0.092
Token shouting
Token ID19642
Feature activation-7.03e-02

Pos logit contributions
+0.312
+0.267
+0.248
Reviewer+0.246
+0.240
Neg logit contributions
��-0.208
eteen-0.172
yip-0.166
��-0.157
earchers-0.150
Token Law
Token ID3854
Feature activation+3.11e-02

Pos logit contributions
+0.094
+0.080
+0.075
Reviewer+0.073
+0.072
Neg logit contributions
��-0.071
eteen-0.059
yip-0.057
��-0.054
earchers-0.052
Token-
Token ID12
Feature activation+6.54e-02

Pos logit contributions
��+0.101
eteen+0.086
yip+0.085
��+0.081
earchers+0.079
Neg logit contributions
-0.096
-0.079
-0.073
Reviewer-0.071
-0.070
Tokenabiding
Token ID43056
Feature activation-5.11e-02

Pos logit contributions
��+0.179
eteen+0.149
yip+0.146
��+0.138
earchers+0.133
Neg logit contributions
-0.234
-0.199
-0.187
Reviewer-0.181
-0.179
Token,
Token ID11
Feature activation+2.13e-02

Pos logit contributions
+0.186
+0.158
+0.148
Reviewer+0.144
+0.142
Neg logit contributions
��-0.138
eteen-0.114
yip-0.111
��-0.105
earchers-0.101
Token Ded
Token ID35023
Feature activation-5.90e-04

Pos logit contributions
��+0.058
eteen+0.048
yip+0.047
��+0.044
earchers+0.043
Neg logit contributions
-0.077
-0.065
-0.061
Reviewer-0.059
-0.058
Tokenicated
Token ID3474
Feature activation-2.37e-01

Pos logit contributions
+0.002
+0.001
+0.001
Reviewer+0.001
+0.001
Neg logit contributions
��-0.002
eteen-0.002
yip-0.002
��-0.002
earchers-0.002
Token,
Token ID11
Feature activation-7.20e-03

Pos logit contributions
+0.863
+0.737
+0.691
Reviewer+0.671
+0.661
Neg logit contributions
��-0.637
eteen-0.526
yip-0.515
��-0.487
earchers-0.469
Token Log
Token ID5972
Feature activation-2.31e-02

Pos logit contributions
+0.026
+0.022
+0.021
Reviewer+0.020
+0.020
Neg logit contributions
��-0.020
eteen-0.016
yip-0.016
��-0.015
earchers-0.015
Tokenical
Token ID605
Feature activation-1.34e-01

Pos logit contributions
+0.063
+0.051
+0.047
Reviewer+0.045
+0.044
Neg logit contributions
��-0.082
eteen-0.072
yip-0.071
��-0.068
earchers-0.066
Token
Token ID198
Feature activation+8.99e-03

Pos logit contributions
+0.491
+0.419
+0.393
Reviewer+0.382
+0.376
Neg logit contributions
��-0.356
eteen-0.294
yip-0.287
��-0.271
earchers-0.261
Token
Token ID198
Feature activation+1.33e-02

Pos logit contributions
��+0.025
eteen+0.021
yip+0.021
��+0.020
earchers+0.019
Neg logit contributions
-0.032
-0.027
-0.026
Reviewer-0.025
-0.025
Token of
Token ID286
Feature activation-6.39e-02

Pos logit contributions
��+0.038
eteen+0.032
yip+0.031
��+0.029
earchers+0.028
Neg logit contributions
-0.050
-0.042
-0.040
Reviewer-0.038
-0.038
Token toxic
Token ID11422
Feature activation-1.20e-03

Pos logit contributions
+0.244
+0.210
+0.198
Reviewer+0.193
+0.190
Neg logit contributions
��-0.159
eteen-0.130
yip-0.127
��-0.119
earchers-0.115
Token al
Token ID435
Feature activation-5.66e-02

Pos logit contributions
+0.004
+0.004
+0.003
Reviewer+0.003
+0.003
Neg logit contributions
��-0.003
eteen-0.003
yip-0.003
��-0.002
earchers-0.002
Tokengal
Token ID13528
Feature activation-4.38e-02

Pos logit contributions
+0.139
+0.109
+0.098
Reviewer+0.093
+0.091
Neg logit contributions
��-0.218
eteen-0.193
yip-0.190
��-0.182
earchers-0.178
Token blo
Token ID24924
Feature activation-5.94e-02

Pos logit contributions
+0.170
+0.147
+0.139
Reviewer+0.134
+0.133
Neg logit contributions
��-0.106
eteen-0.086
yip-0.084
��-0.079
earchers-0.076
Tokenoms
Token ID3150
Feature activation-2.20e-01

Pos logit contributions
+0.203
+0.171
+0.160
Reviewer+0.156
+0.153
Neg logit contributions
��-0.172
eteen-0.144
yip-0.142
��-0.136
earchers-0.130
Token in
Token ID287
Feature activation+2.86e-02

Pos logit contributions
+0.816
+0.700
+0.658
Reviewer+0.639
+0.630
Neg logit contributions
��-0.572
eteen-0.470
yip-0.460
��-0.433
earchers-0.417
Token the
Token ID262
Feature activation+1.64e-02

Pos logit contributions
��+0.072
eteen+0.059
yip+0.058
��+0.054
earchers+0.052
Neg logit contributions
-0.108
-0.093
-0.087
Reviewer-0.085
-0.084
Token North
Token ID2258
Feature activation+1.06e-01

Pos logit contributions
��+0.042
eteen+0.034
yip+0.033
��+0.031
earchers+0.030
Neg logit contributions
-0.062
-0.053
-0.050
Reviewer-0.049
-0.048
Token Atlantic
Token ID10596
Feature activation+1.16e-01

Pos logit contributions
��+0.251
eteen+0.201
yip+0.197
��+0.184
earchers+0.176
Neg logit contributions
-0.419
-0.363
-0.343
Reviewer-0.334
-0.330
Token and
Token ID290
Feature activation+4.79e-02

Pos logit contributions
��+0.305
eteen+0.251
yip+0.246
��+0.231
earchers+0.223
Neg logit contributions
-0.430
-0.369
-0.346
Reviewer-0.336
-0.332
Token campaign
Token ID1923
Feature activation-1.49e-02

Pos logit contributions
+0.274
+0.232
+0.216
Reviewer+0.210
+0.206
Neg logit contributions
��-0.231
eteen-0.195
yip-0.191
��-0.181
earchers-0.176
Token.
Token ID13
Feature activation-7.30e-02

Pos logit contributions
+0.055
+0.048
+0.044
Reviewer+0.043
+0.043
Neg logit contributions
��-0.038
eteen-0.032
yip-0.031
��-0.029
earchers-0.028
Token Instead
Token ID5455
Feature activation-6.66e-02

Pos logit contributions
+0.258
+0.220
+0.206
Reviewer+0.200
+0.197
Neg logit contributions
��-0.201
eteen-0.168
yip-0.163
��-0.154
earchers-0.150
Token of
Token ID286
Feature activation-7.09e-02

Pos logit contributions
+0.232
+0.197
+0.184
Reviewer+0.178
+0.176
Neg logit contributions
��-0.189
eteen-0.158
yip-0.155
��-0.147
earchers-0.142
TokenTechn
Token ID25574
Feature activation+4.68e-02

Pos logit contributions
+0.247
+0.210
+0.196
Reviewer+0.191
+0.187
Neg logit contributions
��-0.200
eteen-0.167
yip-0.164
��-0.155
earchers-0.150
Tokenicol
Token ID27045
Feature activation-2.35e-01

Pos logit contributions
��+0.134
eteen+0.112
yip+0.110
��+0.104
earchers+0.100
Neg logit contributions
-0.162
-0.137
-0.128
Reviewer-0.125
-0.123
Tokenor
Token ID273
Feature activation+1.48e-01

Pos logit contributions
+0.796
+0.672
+0.629
Reviewer+0.609
+0.602
Neg logit contributions
��-0.685
eteen-0.568
yip-0.561
��-0.545
earchers-0.517
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
��+0.405
eteen+0.338
yip+0.329
��+0.312
earchers+0.303
Neg logit contributions
-0.524
-0.449
-0.419
Reviewer-0.409
-0.401
Token Hitchcock
Token ID49817
Feature activation+5.53e-03

Pos logit contributions
��+0.043
eteen+0.036
yip+0.035
��+0.033
earchers+0.032
Neg logit contributions
-0.059
-0.051
-0.048
Reviewer-0.046
-0.046
Token shot
Token ID2823
Feature activation-3.96e-02

Pos logit contributions
��+0.015
eteen+0.012
yip+0.012
��+0.011
earchers+0.011
Neg logit contributions
-0.020
-0.017
-0.016
Reviewer-0.016
-0.015
Token it
Token ID340
Feature activation-2.49e-02

Pos logit contributions
+0.147
+0.126
+0.119
Reviewer+0.116
+0.114
Neg logit contributions
��-0.102
eteen-0.084
yip-0.082
��-0.078
earchers-0.074
Token Lever
Token ID42365
Feature activation+3.64e-03

Pos logit contributions
��+0.041
eteen+0.034
yip+0.033
��+0.032
earchers+0.030
Neg logit contributions
-0.060
-0.051
-0.048
Reviewer-0.047
-0.046
Tokenage
Token ID496
Feature activation-1.74e-01

Pos logit contributions
��+0.010
eteen+0.008
yip+0.008
��+0.007
earchers+0.007
Neg logit contributions
-0.013
-0.012
-0.011
Reviewer-0.011
-0.010
Token your
Token ID534
Feature activation-5.56e-02

Pos logit contributions
+0.680
+0.588
+0.556
Reviewer+0.541
+0.535
Neg logit contributions
��-0.418
eteen-0.337
yip-0.330
��-0.310
earchers-0.294
Token natural
Token ID3288
Feature activation-7.23e-02

Pos logit contributions
+0.214
+0.185
+0.174
Reviewer+0.170
+0.168
Neg logit contributions
��-0.137
eteen-0.111
yip-0.108
��-0.102
earchers-0.097
Token overnight
Token ID13417
Feature activation-5.20e-02

Pos logit contributions
+0.274
+0.236
+0.222
Reviewer+0.216
+0.213
Neg logit contributions
��-0.182
eteen-0.148
yip-0.146
��-0.137
earchers-0.131
Token fast
Token ID3049
Feature activation-2.25e-01

Pos logit contributions
+0.205
+0.177
+0.167
Reviewer+0.163
+0.161
Neg logit contributions
��-0.124
eteen-0.100
yip-0.097
��-0.090
earchers-0.087
Token by
Token ID416
Feature activation+1.24e-02

Pos logit contributions
+0.808
+0.687
+0.643
Reviewer+0.625
+0.615
Neg logit contributions
��-0.614
eteen-0.511
yip-0.499
��-0.470
earchers-0.455
Token skipping
Token ID31017
Feature activation-7.23e-02

Pos logit contributions
��+0.035
eteen+0.029
yip+0.028
��+0.027
earchers+0.026
Neg logit contributions
-0.044
-0.037
-0.035
Reviewer-0.034
-0.033
Token breakfast
Token ID12607
Feature activation-2.78e-02

Pos logit contributions
+0.262
+0.224
+0.210
Reviewer+0.204
+0.201
Neg logit contributions
��-0.194
eteen-0.161
yip-0.157
��-0.149
earchers-0.143
Token (
Token ID357
Feature activation-1.16e-02

Pos logit contributions
+0.104
+0.089
+0.084
Reviewer+0.082
+0.080
Neg logit contributions
��-0.072
eteen-0.059
yip-0.057
��-0.054
earchers-0.052
Tokendr
Token ID7109
Feature activation-7.26e-02

Pos logit contributions
+0.037
+0.031
+0.028
Reviewer+0.027
+0.027
Neg logit contributions
��-0.036
eteen-0.031
yip-0.030
��-0.029
earchers-0.028