TOP ACTIVATIONS
MAX = 0.477

130 , were the C rows ' playing additions , with
accomplished . The Hill c rows : The study
had committed to the C rows .
have not yet discussed Bur rows s art work
. But Bur rows s sterile style
Bryce Gibbs to the C rows and Jake Lever to the
fog machines or WWE page antry . S igh .
/ File photo TP X IMAGES OF THE DAY ) Chris
Why is Mont pel ier s city
Vermont . Rem oving Mont pel ier Bever age would remove
, is inaccurate . Stat istically speaking , the ' average
Ab road in Israel , Pants uit Nation and Green Course
one element of a tri pty ch called Echo that Indigenous
FS 1 =$ FS 2 =$ FontSize } } Try {
Fat so ." Phil ip Mont oro writes about
. Worse yet , Mont pel ier is supposedly the
the team . Phil K essel isn
Is Richard Nixon in a Pants uit appeared first on Money
seemed every bit as sensational istic as its marketing campaign .
Is Richard Nixon in a Pants uit was originally published at

MIN ACTIVATIONS
MIN = -0.522

with d uk ( K orean rice cakes ), bitter mel
you have . Does your cousins half - brother s uncle
including " Jack ie Chan Adventures ", " The Big Guy
wetlands , Rams ar envis aged action by both national governments
come and visit the whore house <|endoftext|> Steven Gerrard scored
K hal isha told NBC news that
Catalan resolution is in contra vention of a previous court ruling
asked artist Mary Anne Bark house why the squirrel s would
Gand alf reports that Sar uman has betrayed them and is
profit generated by an underlying asset , such as government buildings
4 . Expl ained Murd y ,
es ") is an autos omal dominant ker at oder ma
es ") is an autos omal dominant ker at oder ma
iving / su icide pre vention / doc uments / older
problem . Expl ained Marin , When
gon . gov / pre vention well ness / saf el
/ any Mode : Man aged Access Point : Not -
/ any Mode : Man aged Access Point : Not -
. gov / violence pre vention / su icide / stat
reform the state 's civil asset forfeiture laws , which allow

INTERVAL 0.227 - 0.477
CONTAINS 0.488%

the unc ertain ty of change .
your email , they can root through your bank records ,
. And the big men fly . And the players go
All the an ach ron istic elements , the d our
penalties for those who " def raud or abuse federal programs

INTERVAL -0.022 - 0.227
CONTAINS 69.903%

keep reading , c uz you too , can
fired . The city funded 10 , 000 new leases for
That s a major systemic weakness that
surveyed in Ohio and 59 percent of voters in Wisconsin say
6 { ( 0 s )} 3 . g 3 {

INTERVAL -0.272 - -0.022
CONTAINS 29.409%

until the accumulated debt becomes unsustainable and the government
Today we re excited to announce Shop ify POS
{ Write - Warning - Message " F ailed cannot find
' first preseason game , against Washington , he totaled seven
, M imi writes with palpable delight .

INTERVAL -0.522 - -0.272
CONTAINS 0.200%

at ollah R uh ollah Kh ome ini 's radical Islamists
the 64 - bit Hy rule Fi end , O car
unnoticed . This was down graded in October 2015 after three
{ ( 90 s )} Q c 7 { ( 108
World War II veteran to death . Del bert
Token 130
Token ID11323
Feature activation-4.89e-02

Pos logit contributions
iped+0.033
pel+0.028
 HOR+0.026
rows+0.026
Py+0.026
Neg logit contributions
aturdays-0.034
forcer-0.033
usra-0.032
oteric-0.032
umatic-0.032
Token,
Token ID11
Feature activation-2.05e-03

Pos logit contributions
aturdays+0.035
forcer+0.034
usra+0.033
oteric+0.033
umatic+0.033
Neg logit contributions
iped-0.034
pel-0.029
 HOR-0.028
rows-0.027
Py-0.027
Token were
Token ID547
Feature activation-2.98e-02

Pos logit contributions
aturdays+0.001
forcer+0.001
oteric+0.001
usra+0.001
umatic+0.001
Neg logit contributions
iped-0.002
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
aturdays+0.020
forcer+0.020
usra+0.019
oteric+0.019
umatic+0.019
Neg logit contributions
iped-0.022
pel-0.019
 HOR-0.018
rows-0.018
Py-0.017
Token C
Token ID327
Feature activation-6.90e-02

Pos logit contributions
iped+0.016
pel+0.014
 HOR+0.013
rows+0.013
Py+0.013
Neg logit contributions
aturdays-0.015
forcer-0.014
usra-0.014
oteric-0.014
umatic-0.014
Tokenrows
Token ID8516
Feature activation+4.77e-01

Pos logit contributions
aturdays+0.086
forcer+0.085
usra+0.084
oteric+0.084
orean+0.084
Neg logit contributions
iped-0.011
pel-0.004
 HOR-0.002
Py-0.001
 squid-0.001
Token'
Token ID6
Feature activation+4.30e-02

Pos logit contributions
iped+0.328
pel+0.283
 HOR+0.267
rows+0.266
Py+0.261
Neg logit contributions
aturdays-0.346
forcer-0.332
oteric-0.330
usra-0.329
umatic-0.324
Token playing
Token ID2712
Feature activation+3.89e-02

Pos logit contributions
iped+0.031
pel+0.027
 HOR+0.026
rows+0.025
Py+0.025
Neg logit contributions
aturdays-0.029
forcer-0.028
usra-0.028
oteric-0.028
umatic-0.027
Token additions
Token ID19885
Feature activation+1.73e-02

Pos logit contributions
iped+0.027
pel+0.023
 HOR+0.022
rows+0.022
Py+0.021
Neg logit contributions
aturdays-0.028
forcer-0.027
usra-0.027
oteric-0.027
umatic-0.026
Token,
Token ID11
Feature activation+1.99e-02

Pos logit contributions
iped+0.012
pel+0.010
 HOR+0.009
rows+0.009
Py+0.009
Neg logit contributions
aturdays-0.013
forcer-0.012
oteric-0.012
usra-0.012
umatic-0.012
Token with
Token ID351
Feature activation+1.23e-03

Pos logit contributions
iped+0.014
pel+0.012
 HOR+0.011
rows+0.011
Py+0.011
Neg logit contributions
aturdays-0.014
forcer-0.014
usra-0.013
oteric-0.013
umatic-0.013
Token accomplished
Token ID13013
Feature activation+3.45e-02

Pos logit contributions
iped+0.073
pel+0.063
 HOR+0.060
rows+0.059
Py+0.059
Neg logit contributions
aturdays-0.069
forcer-0.067
usra-0.066
oteric-0.066
umatic-0.065
Token.
Token ID13
Feature activation-2.15e-02

Pos logit contributions
iped+0.023
pel+0.019
 HOR+0.018
rows+0.018
Py+0.018
Neg logit contributions
aturdays-0.026
forcer-0.025
oteric-0.025
usra-0.025
umatic-0.024
Token The
Token ID383
Feature activation-2.46e-02

Pos logit contributions
aturdays+0.015
forcer+0.014
usra+0.014
oteric+0.014
umatic+0.014
Neg logit contributions
iped-0.015
pel-0.013
 HOR-0.013
rows-0.013
Py-0.012
Token Hill
Token ID3327
Feature activation+7.16e-02

Pos logit contributions
aturdays+0.017
forcer+0.016
usra+0.016
oteric+0.016
umatic+0.016
Neg logit contributions
iped-0.018
pel-0.015
 HOR-0.014
rows-0.014
Py-0.014
Token c
Token ID269
Feature activation-2.22e-03

Pos logit contributions
iped+0.054
pel+0.047
 HOR+0.045
rows+0.044
Py+0.044
Neg logit contributions
aturdays-0.047
forcer-0.045
usra-0.045
oteric-0.045
umatic-0.044
Tokenrows
Token ID8516
Feature activation+4.73e-01

Pos logit contributions
aturdays+0.001
forcer+0.001
usra+0.001
oteric+0.001
orean+0.001
Neg logit contributions
iped-0.002
pel-0.001
 HOR-0.001
Py-0.001
rows-0.001
Token:
Token ID25
Feature activation+1.87e-02

Pos logit contributions
iped+0.315
pel+0.270
 HOR+0.253
rows+0.252
Py+0.249
Neg logit contributions
aturdays-0.352
forcer-0.340
usra-0.336
oteric-0.336
umatic-0.331
Token
Token ID198
Feature activation-5.01e-02

Pos logit contributions
iped+0.014
pel+0.012
 HOR+0.011
rows+0.011
Py+0.011
Neg logit contributions
aturdays-0.013
forcer-0.012
usra-0.012
oteric-0.012
umatic-0.012
Token
Token ID198
Feature activation-5.36e-02

Pos logit contributions
aturdays+0.034
forcer+0.033
usra+0.032
oteric+0.032
orean+0.032
Neg logit contributions
iped-0.037
pel-0.032
 HOR-0.030
rows-0.030
 squid-0.029
TokenThe
Token ID464
Feature activation-5.45e-02

Pos logit contributions
aturdays+0.034
forcer+0.033
usra+0.033
oteric+0.032
umatic+0.032
Neg logit contributions
iped-0.041
pel-0.036
 HOR-0.034
rows-0.034
Py-0.033
Token study
Token ID2050
Feature activation+5.93e-05

Pos logit contributions
aturdays+0.037
forcer+0.035
usra+0.035
oteric+0.035
umatic+0.034
Neg logit contributions
iped-0.040
pel-0.035
 HOR-0.033
rows-0.033
Py-0.032
Token had
Token ID550
Feature activation-7.85e-03

Pos logit contributions
aturdays+0.037
forcer+0.036
usra+0.036
oteric+0.036
umatic+0.035
Neg logit contributions
iped-0.039
pel-0.034
 HOR-0.032
rows-0.032
Py-0.032
Token committed
Token ID5364
Feature activation+7.03e-02

Pos logit contributions
aturdays+0.005
forcer+0.005
oteric+0.005
usra+0.005
umatic+0.005
Neg logit contributions
iped-0.006
pel-0.005
 HOR-0.005
rows-0.005
Py-0.005
Token to
Token ID284
Feature activation-1.77e-02

Pos logit contributions
iped+0.047
pel+0.040
 HOR+0.038
rows+0.037
Py+0.037
Neg logit contributions
aturdays-0.052
forcer-0.051
usra-0.050
oteric-0.050
umatic-0.049
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
aturdays+0.012
forcer+0.011
usra+0.011
oteric+0.011
umatic+0.011
Neg logit contributions
iped-0.013
pel-0.011
 HOR-0.011
rows-0.011
Py-0.011
Token C
Token ID327
Feature activation-7.38e-02

Pos logit contributions
iped+0.007
pel+0.006
 HOR+0.006
rows+0.006
Py+0.006
Neg logit contributions
aturdays-0.007
forcer-0.007
usra-0.007
oteric-0.007
umatic-0.007
Tokenrows
Token ID8516
Feature activation+4.65e-01

Pos logit contributions
aturdays+0.058
forcer+0.056
usra+0.056
oteric+0.056
umatic+0.055
Neg logit contributions
iped-0.046
pel-0.039
 HOR-0.036
rows-0.035
Py-0.035
Token.
Token ID13
Feature activation+2.85e-02

Pos logit contributions
iped+0.313
pel+0.269
 HOR+0.253
rows+0.252
Py+0.248
Neg logit contributions
aturdays-0.344
forcer-0.331
usra-0.329
oteric-0.328
umatic-0.322
Token
Token ID198
Feature activation+9.21e-03

Pos logit contributions
iped+0.020
pel+0.018
 HOR+0.017
rows+0.017
Py+0.016
Neg logit contributions
aturdays-0.020
forcer-0.019
usra-0.019
oteric-0.019
umatic-0.019
Token
Token ID198
Feature activation+8.51e-03

Pos logit contributions
iped+0.007
pel+0.006
 HOR+0.006
rows+0.005
Py+0.005
Neg logit contributions
aturdays-0.006
forcer-0.006
usra-0.006
oteric-0.006
orean-0.006
Token
Token ID447
Feature activation+1.44e-02

Pos logit contributions
iped+0.006
pel+0.005
 HOR+0.005
rows+0.005
Py+0.005
Neg logit contributions
aturdays-0.006
forcer-0.006
usra-0.006
oteric-0.006
umatic-0.006
Token
Token ID250
Feature activation+6.35e-03

Pos logit contributions
iped+0.010
pel+0.009
 HOR+0.008
rows+0.008
Py+0.008
Neg logit contributions
aturdays-0.010
forcer-0.010
usra-0.010
oteric-0.010
umatic-0.010
Token have
Token ID423
Feature activation-2.10e-02

Pos logit contributions
aturdays+0.023
forcer+0.022
oteric+0.022
usra+0.022
umatic+0.021
Neg logit contributions
iped-0.025
pel-0.022
 HOR-0.021
rows-0.021
Py-0.021
Token not
Token ID407
Feature activation-3.57e-02

Pos logit contributions
aturdays+0.014
forcer+0.014
usra+0.013
oteric+0.013
umatic+0.013
Neg logit contributions
iped-0.015
pel-0.013
 HOR-0.013
rows-0.013
Py-0.012
Token yet
Token ID1865
Feature activation+4.75e-02

Pos logit contributions
aturdays+0.025
forcer+0.024
oteric+0.024
usra+0.024
umatic+0.023
Neg logit contributions
iped-0.025
pel-0.022
 HOR-0.021
rows-0.021
Py-0.020
Token discussed
Token ID6693
Feature activation-1.34e-01

Pos logit contributions
iped+0.034
pel+0.030
 HOR+0.028
rows+0.028
Py+0.028
Neg logit contributions
aturdays-0.033
forcer-0.032
oteric-0.031
usra-0.031
umatic-0.031
Token Bur
Token ID5481
Feature activation+4.56e-03

Pos logit contributions
aturdays+0.094
forcer+0.091
usra+0.090
oteric+0.090
umatic+0.088
Neg logit contributions
iped-0.094
pel-0.081
 HOR-0.076
rows-0.076
Py-0.075
Tokenrows
Token ID8516
Feature activation+4.54e-01

Pos logit contributions
iped+0.004
pel+0.003
 HOR+0.003
Py+0.003
Phil+0.003
Neg logit contributions
aturdays-0.002
forcer-0.002
usra-0.002
oteric-0.002
umatic-0.002
Token
Token ID447
Feature activation-8.28e-03

Pos logit contributions
iped+0.307
pel+0.263
 HOR+0.247
rows+0.245
Py+0.242
Neg logit contributions
aturdays-0.334
forcer-0.322
usra-0.319
oteric-0.318
umatic-0.314
Token
Token ID247
Feature activation-4.44e-03

Pos logit contributions
aturdays+0.006
forcer+0.006
usra+0.006
oteric+0.006
umatic+0.006
Neg logit contributions
iped-0.006
pel-0.005
 HOR-0.005
rows-0.004
Py-0.004
Tokens
Token ID82
Feature activation+9.64e-03

Pos logit contributions
aturdays+0.003
forcer+0.003
usra+0.003
oteric+0.003
orean+0.003
Neg logit contributions
iped-0.003
pel-0.003
 HOR-0.003
rows-0.003
Py-0.003
Token art
Token ID1242
Feature activation-2.55e-02

Pos logit contributions
iped+0.007
pel+0.006
 HOR+0.006
rows+0.006
Py+0.005
Neg logit contributions
aturdays-0.007
forcer-0.007
usra-0.006
oteric-0.006
umatic-0.006
Token work
Token ID670
Feature activation-9.61e-02

Pos logit contributions
aturdays+0.019
forcer+0.018
oteric+0.018
usra+0.018
orean+0.017
Neg logit contributions
iped-0.017
pel-0.015
 HOR-0.014
rows-0.014
Py-0.014
Token.
Token ID13
Feature activation+8.86e-03

Pos logit contributions
aturdays+0.066
forcer+0.064
oteric+0.063
usra+0.063
umatic+0.063
Neg logit contributions
iped-0.060
pel-0.052
 HOR-0.048
rows-0.048
Py-0.048
Token
Token ID198
Feature activation-7.39e-03

Pos logit contributions
iped+0.006
pel+0.006
rows+0.005
 HOR+0.005
Py+0.005
Neg logit contributions
aturdays-0.006
forcer-0.006
usra-0.006
oteric-0.006
orean-0.006
Token
Token ID198
Feature activation-4.63e-03

Pos logit contributions
aturdays+0.005
forcer+0.005
usra+0.005
oteric+0.005
orean+0.005
Neg logit contributions
iped-0.005
pel-0.005
 HOR-0.004
rows-0.004
Py-0.004
TokenBut
Token ID1537
Feature activation+3.90e-02

Pos logit contributions
aturdays+0.003
forcer+0.003
usra+0.003
oteric+0.003
orean+0.003
Neg logit contributions
iped-0.003
pel-0.003
 HOR-0.003
rows-0.003
 squid-0.003
Token Bur
Token ID5481
Feature activation+1.82e-02

Pos logit contributions
iped+0.027
pel+0.023
 HOR+0.022
rows+0.022
Py+0.021
Neg logit contributions
aturdays-0.028
forcer-0.027
usra-0.027
oteric-0.027
umatic-0.026
Tokenrows
Token ID8516
Feature activation+4.48e-01

Pos logit contributions
iped+0.003
pel+0.001
 HOR+0.000
Py+0.000
Phil+0.000
Neg logit contributions
aturdays-0.023
forcer-0.022
usra-0.022
oteric-0.022
orean-0.022
Token
Token ID447
Feature activation-2.55e-03

Pos logit contributions
iped+0.313
pel+0.270
 HOR+0.254
rows+0.253
Py+0.250
Neg logit contributions
aturdays-0.320
forcer-0.308
usra-0.304
oteric-0.304
umatic-0.300
Token
Token ID247
Feature activation-1.93e-03

Pos logit contributions
aturdays+0.002
forcer+0.002
usra+0.002
oteric+0.002
umatic+0.002
Neg logit contributions
iped-0.002
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Tokens
Token ID82
Feature activation-1.50e-02

Pos logit contributions
aturdays+0.001
forcer+0.001
usra+0.001
oteric+0.001
umatic+0.001
Neg logit contributions
iped-0.001
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Token sterile
Token ID38697
Feature activation+1.47e-01

Pos logit contributions
aturdays+0.011
forcer+0.010
usra+0.010
oteric+0.010
umatic+0.010
Neg logit contributions
iped-0.011
pel-0.009
 HOR-0.008
rows-0.008
Py-0.008
Token style
Token ID3918
Feature activation-6.46e-02

Pos logit contributions
iped+0.099
pel+0.085
 HOR+0.080
rows+0.080
Py+0.079
Neg logit contributions
aturdays-0.109
forcer-0.105
usra-0.104
oteric-0.104
umatic-0.102
Token Bryce
Token ID39560
Feature activation+1.36e-02

Pos logit contributions
aturdays+0.003
forcer+0.003
usra+0.003
oteric+0.003
umatic+0.003
Neg logit contributions
iped-0.003
pel-0.003
 HOR-0.002
rows-0.002
Py-0.002
Token Gibbs
Token ID41071
Feature activation+1.07e-01

Pos logit contributions
iped+0.009
pel+0.008
 HOR+0.008
rows+0.008
Py+0.007
Neg logit contributions
aturdays-0.010
forcer-0.009
usra-0.009
oteric-0.009
umatic-0.009
Token to
Token ID284
Feature activation-2.93e-02

Pos logit contributions
iped+0.071
pel+0.060
 HOR+0.057
rows+0.056
Py+0.055
Neg logit contributions
aturdays-0.080
forcer-0.077
usra-0.077
oteric-0.076
umatic-0.075
Token the
Token ID262
Feature activation+4.46e-03

Pos logit contributions
aturdays+0.020
forcer+0.019
usra+0.019
oteric+0.019
umatic+0.019
Neg logit contributions
iped-0.021
pel-0.019
 HOR-0.017
rows-0.017
Py-0.017
Token C
Token ID327
Feature activation-6.39e-02

Pos logit contributions
iped+0.003
pel+0.003
 HOR+0.003
rows+0.003
Py+0.003
Neg logit contributions
aturdays-0.003
forcer-0.003
usra-0.003
oteric-0.003
umatic-0.003
Tokenrows
Token ID8516
Feature activation+4.45e-01

Pos logit contributions
aturdays+0.077
forcer+0.075
usra+0.075
oteric+0.074
orean+0.074
Neg logit contributions
iped-0.013
pel-0.007
 HOR-0.005
Py-0.004
pants-0.004
Token and
Token ID290
Feature activation+6.23e-03

Pos logit contributions
iped+0.321
pel+0.278
 HOR+0.264
rows+0.262
Py+0.258
Neg logit contributions
aturdays-0.308
forcer-0.295
oteric-0.293
usra-0.293
orean-0.287
Token Jake
Token ID14757
Feature activation-1.98e-02

Pos logit contributions
iped+0.005
pel+0.004
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.004
forcer-0.004
usra-0.004
oteric-0.004
umatic-0.004
Token Lever
Token ID42365
Feature activation-7.23e-02

Pos logit contributions
aturdays+0.014
forcer+0.013
usra+0.013
oteric+0.013
umatic+0.013
Neg logit contributions
iped-0.014
pel-0.012
 HOR-0.012
rows-0.012
Py-0.012
Token to
Token ID284
Feature activation-2.15e-02

Pos logit contributions
aturdays+0.053
usra+0.051
forcer+0.051
oteric+0.050
orean+0.049
Neg logit contributions
iped-0.050
pel-0.043
 HOR-0.040
rows-0.040
Py-0.039
Token the
Token ID262
Feature activation+8.24e-03

Pos logit contributions
aturdays+0.015
forcer+0.015
usra+0.015
oteric+0.015
umatic+0.014
Neg logit contributions
iped-0.015
pel-0.013
 HOR-0.012
rows-0.012
Py-0.012
Token fog
Token ID19558
Feature activation+1.56e-01

Pos logit contributions
iped+0.028
pel+0.025
 HOR+0.023
rows+0.023
Py+0.023
Neg logit contributions
aturdays-0.027
forcer-0.026
usra-0.026
oteric-0.026
orean-0.025
Token machines
Token ID8217
Feature activation-1.55e-02

Pos logit contributions
iped+0.106
pel+0.090
 HOR+0.086
rows+0.084
Py+0.084
Neg logit contributions
aturdays-0.114
forcer-0.111
usra-0.110
oteric-0.109
umatic-0.109
Token or
Token ID393
Feature activation-2.22e-03

Pos logit contributions
aturdays+0.012
forcer+0.011
usra+0.011
oteric+0.011
umatic+0.011
Neg logit contributions
iped-0.010
pel-0.009
 HOR-0.008
rows-0.008
Py-0.008
Token WWE
Token ID15247
Feature activation+2.75e-02

Pos logit contributions
aturdays+0.002
forcer+0.002
usra+0.002
oteric+0.001
orean+0.001
Neg logit contributions
iped-0.002
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Token page
Token ID2443
Feature activation-1.95e-02

Pos logit contributions
iped+0.019
pel+0.017
 HOR+0.016
rows+0.015
Py+0.015
Neg logit contributions
aturdays-0.020
forcer-0.019
usra-0.019
oteric-0.019
umatic-0.018
Tokenantry
Token ID21238
Feature activation+4.37e-01

Pos logit contributions
aturdays+0.019
forcer+0.018
usra+0.018
oteric+0.018
umatic+0.018
Neg logit contributions
iped-0.009
pel-0.007
 HOR-0.006
Py-0.006
rows-0.006
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
iped+0.292
pel+0.250
 HOR+0.234
rows+0.233
Py+0.230
Neg logit contributions
aturdays-0.325
forcer-0.314
usra-0.310
oteric-0.309
umatic-0.306
Token S
Token ID311
Feature activation+4.88e-02

Pos logit contributions
iped+0.013
pel+0.011
rows+0.011
 HOR+0.011
Py+0.010
Neg logit contributions
aturdays-0.012
forcer-0.012
oteric-0.012
usra-0.012
orean-0.011
Tokenigh
Token ID394
Feature activation-1.79e-01

Pos logit contributions
iped+0.042
pel+0.038
 HOR+0.036
rows+0.036
Py+0.035
Neg logit contributions
aturdays-0.027
forcer-0.025
usra-0.025
oteric-0.025
umatic-0.024
Token.
Token ID13
Feature activation+9.19e-03

Pos logit contributions
aturdays+0.126
forcer+0.122
usra+0.121
oteric+0.119
umatic+0.118
Neg logit contributions
iped-0.127
pel-0.109
 HOR-0.103
rows-0.102
Py-0.101
Token
Token ID198
Feature activation+1.37e-02

Pos logit contributions
iped+0.007
pel+0.006
rows+0.005
 HOR+0.005
Py+0.005
Neg logit contributions
aturdays-0.006
forcer-0.006
oteric-0.006
usra-0.006
orean-0.006
Token/
Token ID14
Feature activation+5.51e-02

Pos logit contributions
aturdays+0.011
forcer+0.010
oteric+0.010
usra+0.010
umatic+0.010
Neg logit contributions
iped-0.010
pel-0.009
 HOR-0.008
rows-0.008
Py-0.008
TokenFile
Token ID8979
Feature activation-5.60e-02

Pos logit contributions
iped+0.043
pel+0.038
 HOR+0.036
rows+0.036
Py+0.035
Neg logit contributions
aturdays-0.035
forcer-0.033
usra-0.033
oteric-0.033
orean-0.032
Token photo
Token ID4590
Feature activation+1.44e-01

Pos logit contributions
aturdays+0.051
forcer+0.050
oteric+0.049
usra+0.049
umatic+0.048
Neg logit contributions
iped-0.028
pel-0.023
 HOR-0.021
rows-0.021
Py-0.020
Token TP
Token ID24525
Feature activation+8.92e-02

Pos logit contributions
iped+0.105
pel+0.091
rows+0.086
 HOR+0.085
Py+0.085
Neg logit contributions
aturdays-0.098
forcer-0.095
usra-0.093
oteric-0.093
umatic-0.092
TokenX
Token ID55
Feature activation+4.36e-02

Pos logit contributions
iped+0.073
pel+0.064
 HOR+0.061
rows+0.061
Py+0.060
Neg logit contributions
aturdays-0.053
forcer-0.051
oteric-0.050
usra-0.050
umatic-0.049
Token IMAGES
Token ID45325
Feature activation+4.25e-01

Pos logit contributions
iped+0.011
pel+0.007
rows+0.006
 HOR+0.005
Py+0.005
Neg logit contributions
aturdays-0.050
forcer-0.049
oteric-0.049
usra-0.049
umatic-0.049
Token OF
Token ID3963
Feature activation-4.23e-02

Pos logit contributions
iped+0.352
pel+0.312
 HOR+0.296
rows+0.295
Py+0.292
Neg logit contributions
aturdays-0.248
forcer-0.238
usra-0.234
oteric-0.234
umatic-0.229
Token THE
Token ID3336
Feature activation+5.75e-02

Pos logit contributions
aturdays+0.025
forcer+0.025
usra+0.024
oteric+0.024
orean+0.024
Neg logit contributions
iped-0.035
pel-0.030
rows-0.029
Py-0.028
Phil-0.028
Token DAY
Token ID24644
Feature activation+9.62e-02

Pos logit contributions
iped+0.047
pel+0.041
rows+0.038
 squid+0.037
Py+0.036
Neg logit contributions
aturdays-0.034
forcer-0.034
usra-0.033
oteric-0.033
orean-0.033
Token)
Token ID8
Feature activation+4.86e-03

Pos logit contributions
iped+0.070
pel+0.060
rows+0.057
 HOR+0.056
Py+0.056
Neg logit contributions
aturdays-0.065
forcer-0.064
oteric-0.063
usra-0.063
umatic-0.062
Token Chris
Token ID5180
Feature activation-3.79e-02

Pos logit contributions
iped+0.004
pel+0.003
rows+0.003
 HOR+0.003
Py+0.003
Neg logit contributions
aturdays-0.003
forcer-0.003
usra-0.003
oteric-0.003
orean-0.003
Token
Token ID198
Feature activation+2.41e-02

Pos logit contributions
iped+0.026
pel+0.022
 HOR+0.021
rows+0.021
Py+0.021
Neg logit contributions
aturdays-0.024
forcer-0.024
usra-0.023
oteric-0.023
umatic-0.023
Token
Token ID198
Feature activation+3.25e-02

Pos logit contributions
iped+0.017
pel+0.015
 HOR+0.014
rows+0.014
Py+0.014
Neg logit contributions
aturdays-0.016
forcer-0.016
usra-0.016
oteric-0.016
orean-0.015
TokenWhy
Token ID5195
Feature activation+9.65e-02

Pos logit contributions
iped+0.025
pel+0.022
 HOR+0.020
rows+0.020
Py+0.020
Neg logit contributions
aturdays-0.021
forcer-0.020
usra-0.020
oteric-0.020
umatic-0.020
Token is
Token ID318
Feature activation+2.21e-02

Pos logit contributions
iped+0.075
pel+0.065
 HOR+0.062
rows+0.062
Py+0.061
Neg logit contributions
aturdays-0.062
forcer-0.059
oteric-0.059
usra-0.059
umatic-0.057
Token Mont
Token ID5575
Feature activation+4.66e-02

Pos logit contributions
iped+0.015
pel+0.013
 HOR+0.013
rows+0.012
Py+0.012
Neg logit contributions
aturdays-0.016
forcer-0.015
oteric-0.015
usra-0.015
umatic-0.015
Tokenpel
Token ID30242
Feature activation+4.23e-01

Pos logit contributions
iped+0.024
pel+0.019
 HOR+0.018
rows+0.018
Py+0.018
Neg logit contributions
aturdays-0.042
forcer-0.040
usra-0.040
oteric-0.040
umatic-0.039
Tokenier
Token ID959
Feature activation-2.18e-01

Pos logit contributions
iped+0.372
pel+0.332
 HOR+0.317
rows+0.315
Py+0.313
Neg logit contributions
aturdays-0.225
forcer-0.214
usra-0.210
oteric-0.210
umatic-0.206
Token
Token ID447
Feature activation-2.14e-02

Pos logit contributions
aturdays+0.152
forcer+0.147
usra+0.145
oteric+0.145
umatic+0.143
Neg logit contributions
iped-0.155
pel-0.134
 HOR-0.126
rows-0.126
Py-0.124
Token
Token ID247
Feature activation-3.22e-02

Pos logit contributions
aturdays+0.015
forcer+0.014
oteric+0.014
usra+0.014
umatic+0.014
Neg logit contributions
iped-0.016
pel-0.014
 HOR-0.013
rows-0.013
Py-0.013
Tokens
Token ID82
Feature activation-4.18e-03

Pos logit contributions
aturdays+0.022
forcer+0.021
oteric+0.021
usra+0.020
umatic+0.020
Neg logit contributions
iped-0.024
pel-0.021
 HOR-0.020
rows-0.020
Py-0.019
Token city
Token ID1748
Feature activation+8.84e-02

Pos logit contributions
aturdays+0.003
forcer+0.003
usra+0.003
oteric+0.003
umatic+0.003
Neg logit contributions
iped-0.003
pel-0.003
 HOR-0.002
rows-0.002
Py-0.002
Token Vermont
Token ID16033
Feature activation-5.79e-02

Pos logit contributions
aturdays+0.009
forcer+0.009
usra+0.009
oteric+0.009
umatic+0.008
Neg logit contributions
iped-0.009
pel-0.008
 HOR-0.008
rows-0.008
Py-0.007
Token.
Token ID13
Feature activation+2.60e-03

Pos logit contributions
aturdays+0.043
forcer+0.041
usra+0.041
oteric+0.041
umatic+0.040
Neg logit contributions
iped-0.039
pel-0.033
 HOR-0.031
rows-0.031
Py-0.031
Token Rem
Token ID3982
Feature activation-1.68e-02

Pos logit contributions
iped+0.002
pel+0.002
 HOR+0.001
rows+0.001
Py+0.001
Neg logit contributions
aturdays-0.002
forcer-0.002
usra-0.002
oteric-0.002
umatic-0.002
Tokenoving
Token ID5165
Feature activation+1.39e-01

Pos logit contributions
aturdays+0.013
forcer+0.012
usra+0.012
oteric+0.012
umatic+0.012
Neg logit contributions
iped-0.011
pel-0.009
 HOR-0.009
rows-0.009
Py-0.009
Token Mont
Token ID5575
Feature activation+4.00e-02

Pos logit contributions
iped+0.097
pel+0.083
 HOR+0.078
rows+0.078
Py+0.077
Neg logit contributions
aturdays-0.100
forcer-0.096
usra-0.095
oteric-0.095
umatic-0.094
Tokenpel
Token ID30242
Feature activation+4.22e-01

Pos logit contributions
iped+0.004
pel+0.000
 HOR-0.001
rows-0.001
Py-0.001
Neg logit contributions
aturdays-0.052
forcer-0.051
usra-0.051
oteric-0.051
umatic-0.050
Tokenier
Token ID959
Feature activation-2.13e-01

Pos logit contributions
iped+0.373
pel+0.332
 HOR+0.317
rows+0.316
Py+0.313
Neg logit contributions
aturdays-0.224
forcer-0.213
usra-0.210
oteric-0.209
umatic-0.205
Token Bever
Token ID22185
Feature activation-9.94e-02

Pos logit contributions
aturdays+0.144
forcer+0.138
usra+0.137
oteric+0.136
umatic+0.135
Neg logit contributions
iped-0.157
pel-0.136
 HOR-0.129
rows-0.128
Py-0.126
Tokenage
Token ID496
Feature activation-2.05e-01

Pos logit contributions
aturdays+0.048
forcer+0.045
usra+0.045
oteric+0.045
umatic+0.043
Neg logit contributions
iped-0.092
pel-0.083
 HOR-0.079
rows-0.079
Py-0.078
Token would
Token ID561
Feature activation-5.91e-03

Pos logit contributions
aturdays+0.134
forcer+0.129
usra+0.127
oteric+0.127
umatic+0.126
Neg logit contributions
iped-0.155
pel-0.134
 HOR-0.127
rows-0.127
Py-0.125
Token remove
Token ID4781
Feature activation+3.33e-02

Pos logit contributions
aturdays+0.004
forcer+0.004
oteric+0.004
usra+0.004
umatic+0.003
Neg logit contributions
iped-0.005
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
Token,
Token ID11
Feature activation-7.39e-03

Pos logit contributions
iped+0.019
pel+0.016
 HOR+0.015
rows+0.015
Py+0.015
Neg logit contributions
aturdays-0.023
forcer-0.022
usra-0.022
oteric-0.022
umatic-0.022
Token is
Token ID318
Feature activation+8.92e-03

Pos logit contributions
aturdays+0.005
forcer+0.005
usra+0.005
oteric+0.005
umatic+0.005
Neg logit contributions
iped-0.005
pel-0.005
 HOR-0.004
rows-0.004
Py-0.004
Token inaccurate
Token ID21873
Feature activation+2.75e-01

Pos logit contributions
iped+0.006
pel+0.005
rows+0.005
 HOR+0.005
Py+0.005
Neg logit contributions
aturdays-0.006
forcer-0.006
oteric-0.006
usra-0.006
umatic-0.006
Token.
Token ID13
Feature activation+9.55e-03

Pos logit contributions
iped+0.186
pel+0.159
rows+0.149
 HOR+0.148
Py+0.146
Neg logit contributions
aturdays-0.202
forcer-0.195
oteric-0.193
usra-0.193
umatic-0.190
Token Stat
Token ID5133
Feature activation+1.40e-01

Pos logit contributions
iped+0.007
pel+0.006
rows+0.006
 HOR+0.006
Py+0.006
Neg logit contributions
aturdays-0.007
forcer-0.006
oteric-0.006
usra-0.006
orean-0.006
Tokenistically
Token ID16772
Feature activation+4.22e-01

Pos logit contributions
iped+0.054
pel+0.041
 HOR+0.036
rows+0.035
Phil+0.034
Neg logit contributions
aturdays-0.144
forcer-0.140
usra-0.139
oteric-0.139
umatic-0.138
Token speaking
Token ID5486
Feature activation+8.69e-02

Pos logit contributions
iped+0.264
pel+0.223
rows+0.207
 HOR+0.207
Py+0.204
Neg logit contributions
aturdays-0.332
forcer-0.321
oteric-0.318
usra-0.317
umatic-0.313
Token,
Token ID11
Feature activation-4.63e-03

Pos logit contributions
iped+0.057
pel+0.049
 HOR+0.045
rows+0.045
Py+0.045
Neg logit contributions
aturdays-0.066
forcer-0.064
oteric-0.063
usra-0.063
umatic-0.062
Token the
Token ID262
Feature activation-1.88e-04

Pos logit contributions
aturdays+0.003
forcer+0.003
oteric+0.003
usra+0.003
umatic+0.003
Neg logit contributions
iped-0.003
pel-0.003
rows-0.003
 HOR-0.003
Py-0.003
Token '
Token ID705
Feature activation+2.14e-01

Pos logit contributions
aturdays+0.000
forcer+0.000
usra+0.000
oteric+0.000
umatic+0.000
Neg logit contributions
iped-0.000
pel-0.000
rows-0.000
 HOR-0.000
Py-0.000
Tokenaverage
Token ID23913
Feature activation-4.45e-02

Pos logit contributions
iped+0.164
pel+0.143
 HOR+0.136
rows+0.135
Py+0.134
Neg logit contributions
aturdays-0.138
forcer-0.132
usra-0.130
oteric-0.130
umatic-0.128
Token Ab
Token ID2275
Feature activation-1.61e-02

Pos logit contributions
aturdays+0.053
forcer+0.051
usra+0.051
oteric+0.051
umatic+0.050
Neg logit contributions
iped-0.050
pel-0.043
 HOR-0.040
rows-0.040
Py-0.040
Tokenroad
Token ID6344
Feature activation+3.00e-02

Pos logit contributions
aturdays+0.012
forcer+0.012
oteric+0.012
usra+0.012
umatic+0.012
Neg logit contributions
iped-0.010
pel-0.009
 HOR-0.008
rows-0.008
Py-0.008
Token in
Token ID287
Feature activation+6.40e-03

Pos logit contributions
iped+0.020
pel+0.017
 HOR+0.016
rows+0.016
Py+0.016
Neg logit contributions
aturdays-0.023
forcer-0.022
usra-0.022
oteric-0.021
umatic-0.021
Token Israel
Token ID2692
Feature activation+2.13e-01

Pos logit contributions
iped+0.005
pel+0.004
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.004
forcer-0.004
usra-0.004
oteric-0.004
umatic-0.004
Token,
Token ID11
Feature activation-1.23e-02

Pos logit contributions
iped+0.141
pel+0.121
 HOR+0.113
rows+0.112
Py+0.111
Neg logit contributions
aturdays-0.159
forcer-0.154
usra-0.152
oteric-0.151
umatic-0.150
Token Pants
Token ID41689
Feature activation+4.22e-01

Pos logit contributions
aturdays+0.008
forcer+0.008
usra+0.008
oteric+0.008
umatic+0.008
Neg logit contributions
iped-0.009
pel-0.008
 HOR-0.007
rows-0.007
Py-0.007
Tokenuit
Token ID5013
Feature activation-1.53e-01

Pos logit contributions
iped+0.267
pel+0.225
 HOR+0.211
rows+0.210
Py+0.206
Neg logit contributions
aturdays-0.329
forcer-0.318
usra-0.315
oteric-0.314
umatic-0.309
Token Nation
Token ID8741
Feature activation+1.77e-01

Pos logit contributions
aturdays+0.109
forcer+0.105
usra+0.104
oteric+0.103
umatic+0.102
Neg logit contributions
iped-0.107
pel-0.093
 HOR-0.087
rows-0.086
Py-0.085
Token and
Token ID290
Feature activation+4.96e-03

Pos logit contributions
iped+0.118
pel+0.101
 HOR+0.094
rows+0.094
Py+0.093
Neg logit contributions
aturdays-0.132
forcer-0.127
usra-0.126
oteric-0.125
umatic-0.124
Token Green
Token ID3469
Feature activation+1.12e-02

Pos logit contributions
iped+0.003
pel+0.003
 HOR+0.003
rows+0.003
Py+0.003
Neg logit contributions
aturdays-0.003
forcer-0.003
usra-0.003
oteric-0.003
umatic-0.003
Token Course
Token ID20537
Feature activation-4.07e-02

Pos logit contributions
iped+0.009
pel+0.008
 HOR+0.008
rows+0.008
Py+0.008
Neg logit contributions
aturdays-0.007
forcer-0.006
usra-0.006
oteric-0.006
umatic-0.006
Token one
Token ID530
Feature activation-3.94e-02

Pos logit contributions
iped+0.021
pel+0.018
 HOR+0.017
rows+0.017
Py+0.017
Neg logit contributions
aturdays-0.020
forcer-0.020
usra-0.019
oteric-0.019
umatic-0.019
Token element
Token ID5002
Feature activation+3.95e-02

Pos logit contributions
aturdays+0.028
forcer+0.027
usra+0.027
oteric+0.027
umatic+0.027
Neg logit contributions
iped-0.027
pel-0.023
 HOR-0.022
rows-0.022
Py-0.022
Token of
Token ID286
Feature activation+2.34e-02

Pos logit contributions
iped+0.027
pel+0.024
 HOR+0.022
rows+0.022
Py+0.022
Neg logit contributions
aturdays-0.028
forcer-0.027
usra-0.027
oteric-0.027
umatic-0.027
Token a
Token ID257
Feature activation+4.89e-03

Pos logit contributions
iped+0.016
pel+0.014
 HOR+0.013
rows+0.013
Py+0.013
Neg logit contributions
aturdays-0.017
forcer-0.016
usra-0.016
oteric-0.016
umatic-0.016
Token tri
Token ID1333
Feature activation+7.34e-02

Pos logit contributions
iped+0.003
pel+0.003
rows+0.003
 HOR+0.003
Py+0.003
Neg logit contributions
aturdays-0.004
forcer-0.003
usra-0.003
oteric-0.003
umatic-0.003
Tokenpty
Token ID5835
Feature activation+4.21e-01

Pos logit contributions
iped+0.055
pel+0.048
 HOR+0.045
rows+0.045
Py+0.044
Neg logit contributions
aturdays-0.049
forcer-0.047
usra-0.047
oteric-0.047
orean-0.046
Tokench
Token ID354
Feature activation+5.82e-02

Pos logit contributions
iped+0.317
pel+0.276
 HOR+0.262
rows+0.260
Py+0.257
Neg logit contributions
aturdays-0.277
forcer-0.267
usra-0.264
oteric-0.263
umatic-0.259
Token called
Token ID1444
Feature activation+5.41e-02

Pos logit contributions
iped+0.040
pel+0.035
 HOR+0.033
rows+0.033
Py+0.032
Neg logit contributions
aturdays-0.041
forcer-0.040
usra-0.040
oteric-0.040
umatic-0.039
Token Echo
Token ID21455
Feature activation+2.01e-01

Pos logit contributions
iped+0.038
pel+0.032
 HOR+0.030
rows+0.030
Py+0.030
Neg logit contributions
aturdays-0.039
forcer-0.038
usra-0.037
oteric-0.037
umatic-0.036
Token that
Token ID326
Feature activation+4.21e-02

Pos logit contributions
iped+0.147
pel+0.128
 HOR+0.120
rows+0.120
Py+0.118
Neg logit contributions
aturdays-0.138
forcer-0.132
usra-0.131
oteric-0.131
umatic-0.128
Token Indigenous
Token ID24033
Feature activation+5.66e-02

Pos logit contributions
iped+0.032
pel+0.028
 HOR+0.026
rows+0.026
Py+0.026
Neg logit contributions
aturdays-0.028
forcer-0.027
usra-0.026
oteric-0.026
umatic-0.026
TokenFS
Token ID10652
Feature activation-5.14e-02

Pos logit contributions
iped+0.056
pel+0.049
 HOR+0.046
rows+0.046
Phil+0.045
Neg logit contributions
aturdays-0.051
forcer-0.049
oteric-0.048
usra-0.048
orean-0.047
Token1
Token ID16
Feature activation+6.13e-02

Pos logit contributions
aturdays+0.035
forcer+0.033
usra+0.033
oteric+0.033
umatic+0.032
Neg logit contributions
iped-0.038
pel-0.033
 HOR-0.031
rows-0.031
Py-0.030
Token=$
Token ID43641
Feature activation+3.96e-01

Pos logit contributions
iped+0.042
pel+0.036
 HOR+0.034
rows+0.034
Py+0.033
Neg logit contributions
aturdays-0.044
forcer-0.043
oteric-0.042
usra-0.042
orean-0.041
TokenFS
Token ID10652
Feature activation-5.50e-02

Pos logit contributions
iped+0.328
pel+0.290
 HOR+0.275
rows+0.274
Py+0.272
Neg logit contributions
aturdays-0.232
forcer-0.222
usra-0.218
oteric-0.218
umatic-0.214
Token2
Token ID17
Feature activation+1.20e-01

Pos logit contributions
aturdays+0.042
forcer+0.040
usra+0.040
oteric+0.040
umatic+0.039
Neg logit contributions
iped-0.036
pel-0.031
 HOR-0.029
rows-0.028
Py-0.028
Token=$
Token ID43641
Feature activation+4.18e-01

Pos logit contributions
iped+0.082
pel+0.070
 HOR+0.066
rows+0.065
Py+0.064
Neg logit contributions
aturdays-0.087
forcer-0.084
oteric-0.083
usra-0.083
orean-0.081
TokenFontSize
Token ID38160
Feature activation+1.12e-01

Pos logit contributions
iped+0.362
pel+0.322
 HOR+0.307
rows+0.305
Py+0.302
Neg logit contributions
aturdays-0.229
forcer-0.218
usra-0.215
oteric-0.214
umatic-0.210
Token }
Token ID1782
Feature activation-1.61e-02

Pos logit contributions
iped+0.071
pel+0.060
 HOR+0.056
rows+0.056
Py+0.055
Neg logit contributions
aturdays-0.087
forcer-0.084
oteric-0.083
usra-0.083
orean-0.082
Token }
Token ID1782
Feature activation-6.60e-03

Pos logit contributions
aturdays+0.011
forcer+0.011
usra+0.011
oteric+0.011
orean+0.010
Neg logit contributions
iped-0.011
pel-0.010
 HOR-0.009
rows-0.009
Py-0.009
Token Try
Token ID9993
Feature activation+7.00e-02

Pos logit contributions
aturdays+0.005
forcer+0.004
usra+0.004
oteric+0.004
orean+0.004
Neg logit contributions
iped-0.005
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
Token {
Token ID1391
Feature activation-1.70e-01

Pos logit contributions
iped+0.050
pel+0.044
rows+0.041
 HOR+0.041
Py+0.040
Neg logit contributions
aturdays-0.048
forcer-0.047
usra-0.046
oteric-0.046
umatic-0.045
Token Fat
Token ID12301
Feature activation-2.62e-02

Pos logit contributions
aturdays+0.011
forcer+0.010
usra+0.010
oteric+0.010
umatic+0.010
Neg logit contributions
iped-0.010
pel-0.009
rows-0.008
 HOR-0.008
Py-0.008
Tokenso
Token ID568
Feature activation-8.60e-02

Pos logit contributions
aturdays+0.019
forcer+0.019
usra+0.019
oteric+0.019
umatic+0.018
Neg logit contributions
iped-0.018
pel-0.015
rows-0.014
 HOR-0.014
Py-0.014
Token."
Token ID526
Feature activation-1.87e-02

Pos logit contributions
aturdays+0.056
forcer+0.054
usra+0.053
oteric+0.053
umatic+0.053
Neg logit contributions
iped-0.065
pel-0.057
 HOR-0.054
rows-0.053
Py-0.053
Token
Token ID198
Feature activation-1.33e-02

Pos logit contributions
aturdays+0.013
forcer+0.012
oteric+0.012
usra+0.012
umatic+0.012
Neg logit contributions
iped-0.013
pel-0.012
 HOR-0.011
rows-0.011
Py-0.011
Token
Token ID198
Feature activation-1.54e-02

Pos logit contributions
aturdays+0.009
forcer+0.009
usra+0.009
oteric+0.009
orean+0.008
Neg logit contributions
iped-0.010
pel-0.008
 HOR-0.008
rows-0.008
Py-0.008
TokenPhil
Token ID18673
Feature activation+4.16e-01

Pos logit contributions
aturdays+0.010
forcer+0.010
usra+0.009
oteric+0.009
umatic+0.009
Neg logit contributions
iped-0.012
pel-0.010
 HOR-0.010
rows-0.010
Py-0.010
Tokenip
Token ID541
Feature activation+7.63e-02

Pos logit contributions
iped+0.293
pel+0.253
 HOR+0.238
rows+0.237
Py+0.234
Neg logit contributions
aturdays-0.295
forcer-0.284
oteric-0.280
usra-0.280
umatic-0.275
Token Mont
Token ID5575
Feature activation+2.99e-02

Pos logit contributions
iped+0.056
pel+0.048
 HOR+0.045
rows+0.045
Py+0.045
Neg logit contributions
aturdays-0.052
forcer-0.050
usra-0.050
oteric-0.050
umatic-0.049
Tokenoro
Token ID16522
Feature activation+2.73e-02

Pos logit contributions
iped+0.023
pel+0.020
 HOR+0.019
rows+0.019
Py+0.019
Neg logit contributions
aturdays-0.019
forcer-0.018
usra-0.018
oteric-0.018
umatic-0.018
Token writes
Token ID6797
Feature activation-1.37e-01

Pos logit contributions
iped+0.020
pel+0.017
 HOR+0.016
rows+0.016
Py+0.016
Neg logit contributions
aturdays-0.019
forcer-0.018
usra-0.018
oteric-0.018
umatic-0.018
Token about
Token ID546
Feature activation-6.71e-03

Pos logit contributions
aturdays+0.102
forcer+0.099
oteric+0.097
usra+0.097
umatic+0.096
Neg logit contributions
iped-0.091
pel-0.078
rows-0.073
 HOR-0.072
Py-0.072
Token.
Token ID13
Feature activation-2.23e-02

Pos logit contributions
aturdays+0.066
forcer+0.064
usra+0.063
oteric+0.063
umatic+0.062
Neg logit contributions
iped-0.060
pel-0.052
 HOR-0.048
rows-0.048
Py-0.047
Token Worse
Token ID36855
Feature activation+3.63e-02

Pos logit contributions
aturdays+0.016
forcer+0.015
usra+0.015
oteric+0.015
umatic+0.014
Neg logit contributions
iped-0.016
pel-0.014
rows-0.013
 HOR-0.013
Py-0.013
Token yet
Token ID1865
Feature activation+7.29e-02

Pos logit contributions
iped+0.022
pel+0.019
 HOR+0.017
rows+0.017
Py+0.017
Neg logit contributions
aturdays-0.029
forcer-0.028
usra-0.028
oteric-0.028
umatic-0.027
Token,
Token ID11
Feature activation-1.06e-02

Pos logit contributions
iped+0.047
pel+0.040
 HOR+0.038
rows+0.038
Py+0.037
Neg logit contributions
aturdays-0.055
forcer-0.054
usra-0.053
oteric-0.053
umatic-0.052
Token Mont
Token ID5575
Feature activation+4.30e-02

Pos logit contributions
aturdays+0.008
forcer+0.007
usra+0.007
oteric+0.007
umatic+0.007
Neg logit contributions
iped-0.007
pel-0.006
 HOR-0.006
rows-0.006
Py-0.006
Tokenpel
Token ID30242
Feature activation+4.15e-01

Pos logit contributions
iped+0.005
pel+0.000
 HOR-0.001
rows-0.001
Py-0.001
Neg logit contributions
aturdays-0.056
forcer-0.055
usra-0.055
oteric-0.054
umatic-0.054
Tokenier
Token ID959
Feature activation-2.11e-01

Pos logit contributions
iped+0.366
pel+0.326
 HOR+0.312
rows+0.311
Py+0.308
Neg logit contributions
aturdays-0.219
forcer-0.209
usra-0.205
oteric-0.205
umatic-0.200
Token is
Token ID318
Feature activation-1.35e-03

Pos logit contributions
aturdays+0.145
forcer+0.140
usra+0.138
oteric+0.137
umatic+0.135
Neg logit contributions
iped-0.153
pel-0.132
 HOR-0.125
rows-0.124
Py-0.123
Token supposedly
Token ID13519
Feature activation-2.55e-02

Pos logit contributions
aturdays+0.001
forcer+0.001
usra+0.001
oteric+0.001
umatic+0.001
Neg logit contributions
iped-0.001
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
aturdays+0.018
forcer+0.017
usra+0.017
oteric+0.017
umatic+0.017
Neg logit contributions
iped-0.018
pel-0.016
 HOR-0.015
rows-0.015
Py-0.015
Token
Token ID564
Feature activation+4.51e-03

Pos logit contributions
iped+0.008
pel+0.007
 HOR+0.006
rows+0.006
Py+0.006
Neg logit contributions
aturdays-0.007
forcer-0.007
usra-0.007
oteric-0.007
umatic-0.007
Token the
Token ID262
Feature activation+3.55e-02

Pos logit contributions
iped+0.027
pel+0.024
 HOR+0.022
rows+0.022
Py+0.022
Neg logit contributions
aturdays-0.026
forcer-0.026
usra-0.025
oteric-0.025
umatic-0.025
Token team
Token ID1074
Feature activation-5.51e-02

Pos logit contributions
iped+0.027
pel+0.024
 HOR+0.022
rows+0.022
Py+0.022
Neg logit contributions
aturdays-0.023
forcer-0.022
usra-0.022
oteric-0.022
umatic-0.021
Token.
Token ID13
Feature activation+4.91e-02

Pos logit contributions
aturdays+0.041
forcer+0.039
oteric+0.039
usra+0.039
umatic+0.038
Neg logit contributions
iped-0.037
pel-0.032
 HOR-0.030
rows-0.030
Py-0.029
Token
Token ID198
Feature activation+3.16e-02

Pos logit contributions
iped+0.035
pel+0.031
rows+0.029
 HOR+0.029
Py+0.028
Neg logit contributions
aturdays-0.034
forcer-0.033
oteric-0.032
usra-0.032
orean-0.032
Token
Token ID198
Feature activation+3.18e-02

Pos logit contributions
iped+0.023
pel+0.020
 HOR+0.019
rows+0.019
Py+0.018
Neg logit contributions
aturdays-0.022
forcer-0.021
usra-0.021
oteric-0.021
orean-0.020
TokenPhil
Token ID18673
Feature activation+4.14e-01

Pos logit contributions
iped+0.024
pel+0.021
 HOR+0.020
rows+0.020
Py+0.020
Neg logit contributions
aturdays-0.021
forcer-0.020
usra-0.020
oteric-0.020
umatic-0.019
Token K
Token ID509
Feature activation-5.18e-02

Pos logit contributions
iped+0.323
pel+0.284
 HOR+0.269
rows+0.268
Py+0.265
Neg logit contributions
aturdays-0.261
forcer-0.251
usra-0.247
oteric-0.247
umatic-0.243
Tokenessel
Token ID7878
Feature activation+5.85e-02

Pos logit contributions
aturdays+0.037
forcer+0.036
usra+0.036
oteric+0.036
orean+0.035
Neg logit contributions
iped-0.035
pel-0.030
 HOR-0.029
rows-0.028
Py-0.028
Token isn
Token ID2125
Feature activation+1.00e-01

Pos logit contributions
iped+0.043
pel+0.037
 HOR+0.035
rows+0.035
Py+0.035
Neg logit contributions
aturdays-0.040
forcer-0.038
usra-0.038
oteric-0.038
umatic-0.037
Token
Token ID447
Feature activation+1.44e-02

Pos logit contributions
iped+0.077
pel+0.067
 HOR+0.064
rows+0.063
Py+0.062
Neg logit contributions
aturdays-0.065
forcer-0.062
usra-0.062
oteric-0.061
orean-0.060
Token
Token ID247
Feature activation-1.51e-02

Pos logit contributions
iped+0.010
pel+0.009
 HOR+0.009
rows+0.008
Py+0.008
Neg logit contributions
aturdays-0.010
forcer-0.010
usra-0.009
oteric-0.009
umatic-0.009
Token Is
Token ID1148
Feature activation-3.66e-02

Pos logit contributions
aturdays+0.025
forcer+0.024
usra+0.024
oteric+0.023
umatic+0.023
Neg logit contributions
iped-0.029
pel-0.025
 HOR-0.024
rows-0.023
Py-0.023
Token Richard
Token ID6219
Feature activation+1.81e-01

Pos logit contributions
aturdays+0.025
forcer+0.024
usra+0.024
oteric+0.024
umatic+0.024
Neg logit contributions
iped-0.026
pel-0.023
rows-0.021
 HOR-0.021
Py-0.021
Token Nixon
Token ID16642
Feature activation+1.58e-01

Pos logit contributions
iped+0.094
pel+0.077
 HOR+0.070
rows+0.070
Py+0.068
Neg logit contributions
aturdays-0.161
forcer-0.158
usra-0.156
oteric-0.156
umatic-0.154
Token in
Token ID287
Feature activation-2.64e-02

Pos logit contributions
iped+0.119
pel+0.105
 HOR+0.098
rows+0.098
Py+0.097
Neg logit contributions
aturdays-0.104
forcer-0.100
usra-0.099
oteric-0.098
umatic-0.097
Token a
Token ID257
Feature activation-6.72e-03

Pos logit contributions
aturdays+0.018
forcer+0.017
usra+0.017
oteric+0.017
umatic+0.017
Neg logit contributions
iped-0.019
pel-0.017
 HOR-0.016
rows-0.016
Py-0.015
Token Pants
Token ID41689
Feature activation+4.10e-01

Pos logit contributions
aturdays+0.005
forcer+0.004
usra+0.004
oteric+0.004
umatic+0.004
Neg logit contributions
iped-0.005
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
Tokenuit
Token ID5013
Feature activation-1.85e-01

Pos logit contributions
iped+0.406
pel+0.365
 HOR+0.351
rows+0.350
Py+0.347
Neg logit contributions
aturdays-0.173
forcer-0.162
oteric-0.159
usra-0.159
orean-0.155
Token appeared
Token ID4120
Feature activation+7.14e-02

Pos logit contributions
aturdays+0.129
forcer+0.124
usra+0.122
oteric+0.122
umatic+0.121
Neg logit contributions
iped-0.133
pel-0.115
 HOR-0.108
rows-0.107
Py-0.106
Token first
Token ID717
Feature activation-2.48e-02

Pos logit contributions
iped+0.049
pel+0.042
 HOR+0.040
rows+0.040
Py+0.039
Neg logit contributions
aturdays-0.052
forcer-0.050
oteric-0.049
usra-0.049
umatic-0.048
Token on
Token ID319
Feature activation+3.05e-02

Pos logit contributions
aturdays+0.020
forcer+0.019
oteric+0.019
usra+0.019
umatic+0.019
Neg logit contributions
iped-0.015
pel-0.013
 HOR-0.012
rows-0.012
Py-0.012
Token Money
Token ID12911
Feature activation-3.26e-02

Pos logit contributions
iped+0.021
pel+0.018
 HOR+0.017
rows+0.017
Py+0.017
Neg logit contributions
aturdays-0.022
forcer-0.021
usra-0.021
oteric-0.021
umatic-0.020
Token seemed
Token ID3947
Feature activation+7.30e-02

Pos logit contributions
aturdays+0.006
forcer+0.006
usra+0.006
oteric+0.006
umatic+0.006
Neg logit contributions
iped-0.007
pel-0.006
 HOR-0.006
rows-0.006
Py-0.005
Token every
Token ID790
Feature activation+2.87e-02

Pos logit contributions
iped+0.050
pel+0.043
rows+0.040
 HOR+0.040
Py+0.040
Neg logit contributions
aturdays-0.052
forcer-0.051
oteric-0.050
usra-0.050
orean-0.050
Token bit
Token ID1643
Feature activation+1.72e-01

Pos logit contributions
iped+0.015
pel+0.013
rows+0.012
 HOR+0.012
Py+0.011
Neg logit contributions
aturdays-0.025
forcer-0.024
oteric-0.024
usra-0.024
umatic-0.024
Token as
Token ID355
Feature activation+2.83e-02

Pos logit contributions
iped+0.119
pel+0.102
rows+0.096
 HOR+0.095
Py+0.094
Neg logit contributions
aturdays-0.123
forcer-0.120
oteric-0.118
usra-0.117
orean-0.116
Token sensational
Token ID31826
Feature activation-1.99e-02

Pos logit contributions
iped+0.020
pel+0.017
rows+0.016
 HOR+0.016
Py+0.016
Neg logit contributions
aturdays-0.020
forcer-0.020
oteric-0.019
usra-0.019
orean-0.019
Tokenistic
Token ID2569
Feature activation+4.08e-01

Pos logit contributions
aturdays+0.014
forcer+0.014
oteric+0.014
usra+0.014
umatic+0.014
Neg logit contributions
iped-0.014
pel-0.012
 HOR-0.011
rows-0.011
Py-0.011
Token as
Token ID355
Feature activation+2.80e-02

Pos logit contributions
iped+0.277
pel+0.237
 HOR+0.223
rows+0.222
Py+0.219
Neg logit contributions
aturdays-0.300
forcer-0.290
oteric-0.286
usra-0.286
umatic-0.282
Token its
Token ID663
Feature activation-3.33e-02

Pos logit contributions
iped+0.019
pel+0.016
rows+0.015
 HOR+0.015
Py+0.015
Neg logit contributions
aturdays-0.021
forcer-0.020
oteric-0.020
usra-0.020
umatic-0.020
Token marketing
Token ID7124
Feature activation+1.10e-02

Pos logit contributions
aturdays+0.023
forcer+0.023
oteric+0.022
usra+0.022
orean+0.022
Neg logit contributions
iped-0.023
pel-0.020
rows-0.019
 HOR-0.019
Py-0.019
Token campaign
Token ID1923
Feature activation-1.12e-01

Pos logit contributions
iped+0.008
pel+0.007
 HOR+0.006
rows+0.006
Py+0.006
Neg logit contributions
aturdays-0.008
forcer-0.007
oteric-0.007
usra-0.007
umatic-0.007
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
aturdays+0.081
forcer+0.079
oteric+0.078
usra+0.077
umatic+0.076
Neg logit contributions
iped-0.077
pel-0.066
 HOR-0.062
rows-0.062
Py-0.061
Token Is
Token ID1148
Feature activation-1.62e-02

Pos logit contributions
aturdays+0.028
forcer+0.027
usra+0.027
oteric+0.027
umatic+0.026
Neg logit contributions
iped-0.032
pel-0.028
 HOR-0.027
rows-0.027
Py-0.026
Token Richard
Token ID6219
Feature activation+1.86e-01

Pos logit contributions
aturdays+0.014
forcer+0.014
usra+0.014
oteric+0.014
umatic+0.014
Neg logit contributions
iped-0.008
pel-0.007
 HOR-0.006
rows-0.006
Py-0.006
Token Nixon
Token ID16642
Feature activation+1.53e-01

Pos logit contributions
iped+0.098
pel+0.081
rows+0.073
 HOR+0.073
 squid+0.071
Neg logit contributions
aturdays-0.163
forcer-0.161
usra-0.159
oteric-0.158
orean-0.157
Token in
Token ID287
Feature activation-1.39e-02

Pos logit contributions
iped+0.107
pel+0.093
 HOR+0.087
rows+0.086
Py+0.085
Neg logit contributions
aturdays-0.109
forcer-0.106
usra-0.104
oteric-0.104
umatic-0.102
Token a
Token ID257
Feature activation-3.52e-04

Pos logit contributions
aturdays+0.010
forcer+0.010
usra+0.009
oteric+0.009
umatic+0.009
Neg logit contributions
iped-0.010
pel-0.008
 HOR-0.008
rows-0.008
Py-0.008
Token Pants
Token ID41689
Feature activation+4.08e-01

Pos logit contributions
aturdays+0.000
forcer+0.000
oteric+0.000
usra+0.000
umatic+0.000
Neg logit contributions
iped-0.000
pel-0.000
rows-0.000
 HOR-0.000
Py-0.000
Tokenuit
Token ID5013
Feature activation-1.70e-01

Pos logit contributions
iped+0.403
pel+0.364
 HOR+0.351
rows+0.349
Py+0.346
Neg logit contributions
aturdays-0.172
forcer-0.162
usra-0.159
oteric-0.159
umatic-0.155
Token was
Token ID373
Feature activation+2.66e-03

Pos logit contributions
aturdays+0.117
forcer+0.113
usra+0.111
oteric+0.111
umatic+0.109
Neg logit contributions
iped-0.123
pel-0.107
 HOR-0.101
rows-0.100
Py-0.099
Token originally
Token ID6198
Feature activation-7.78e-02

Pos logit contributions
iped+0.002
pel+0.002
 HOR+0.002
rows+0.002
Py+0.002
Neg logit contributions
aturdays-0.002
forcer-0.002
oteric-0.002
usra-0.002
umatic-0.002
Token published
Token ID3199
Feature activation+7.55e-03

Pos logit contributions
aturdays+0.052
forcer+0.050
oteric+0.050
usra+0.049
umatic+0.049
Neg logit contributions
iped-0.057
pel-0.050
 HOR-0.047
rows-0.047
Py-0.046
Token at
Token ID379
Feature activation+3.42e-03

Pos logit contributions
iped+0.005
pel+0.004
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.006
forcer-0.006
oteric-0.006
usra-0.006
umatic-0.005
Token with
Token ID351
Feature activation-7.84e-03

Pos logit contributions
aturdays+0.036
forcer+0.035
usra+0.034
oteric+0.034
orean+0.034
Neg logit contributions
iped-0.035
pel-0.030
 HOR-0.029
rows-0.028
Phil-0.028
Token d
Token ID288
Feature activation-3.34e-02

Pos logit contributions
aturdays+0.006
forcer+0.006
usra+0.006
oteric+0.005
orean+0.005
Neg logit contributions
iped-0.005
pel-0.005
rows-0.004
 HOR-0.004
Py-0.004
Tokenuk
Token ID2724
Feature activation-8.73e-02

Pos logit contributions
aturdays+0.020
usra+0.019
forcer+0.019
oteric+0.019
hal+0.019
Neg logit contributions
iped-0.027
pel-0.024
 HOR-0.023
Py-0.022
rows-0.022
Token (
Token ID357
Feature activation+2.12e-02

Pos logit contributions
aturdays+0.059
forcer+0.057
usra+0.056
oteric+0.056
umatic+0.055
Neg logit contributions
iped-0.064
pel-0.056
 HOR-0.053
rows-0.052
Py-0.052
TokenK
Token ID42
Feature activation-1.55e-02

Pos logit contributions
iped+0.017
pel+0.015
 HOR+0.014
rows+0.014
Py+0.014
Neg logit contributions
aturdays-0.013
forcer-0.013
usra-0.012
oteric-0.012
umatic-0.012
Tokenorean
Token ID29456
Feature activation-5.22e-01

Pos logit contributions
aturdays+0.009
forcer+0.008
oteric+0.008
usra+0.008
umatic+0.008
Neg logit contributions
iped-0.013
pel-0.012
 HOR-0.011
rows-0.011
Py-0.011
Token rice
Token ID11464
Feature activation+2.42e-02

Pos logit contributions
aturdays+0.389
forcer+0.376
usra+0.371
oteric+0.371
orean+0.366
Neg logit contributions
iped-0.348
pel-0.298
 HOR-0.279
rows-0.278
Py-0.274
Token cakes
Token ID30849
Feature activation-7.59e-02

Pos logit contributions
iped+0.016
pel+0.013
 HOR+0.012
rows+0.012
Py+0.012
Neg logit contributions
aturdays-0.019
forcer-0.018
usra-0.018
oteric-0.018
umatic-0.017
Token),
Token ID828
Feature activation+3.37e-02

Pos logit contributions
aturdays+0.055
forcer+0.053
usra+0.053
oteric+0.053
umatic+0.052
Neg logit contributions
iped-0.052
pel-0.045
 HOR-0.042
rows-0.042
Py-0.041
Token bitter
Token ID12922
Feature activation+1.01e-02

Pos logit contributions
iped+0.023
pel+0.019
 HOR+0.018
rows+0.018
Py+0.018
Neg logit contributions
aturdays-0.025
forcer-0.024
usra-0.024
oteric-0.024
orean-0.023
Token mel
Token ID7758
Feature activation+7.50e-02

Pos logit contributions
iped+0.007
pel+0.006
 HOR+0.005
rows+0.005
Py+0.005
Neg logit contributions
aturdays-0.007
forcer-0.007
usra-0.007
oteric-0.007
orean-0.007
Token you
Token ID345
Feature activation+6.02e-02

Pos logit contributions
aturdays+0.014
forcer+0.014
oteric+0.014
usra+0.014
umatic+0.014
Neg logit contributions
iped-0.014
pel-0.012
 HOR-0.011
rows-0.011
Py-0.011
Token have
Token ID423
Feature activation+8.18e-03

Pos logit contributions
iped+0.049
pel+0.043
 HOR+0.041
rows+0.041
Py+0.041
Neg logit contributions
aturdays-0.036
forcer-0.034
oteric-0.034
usra-0.034
umatic-0.033
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
iped+0.006
pel+0.005
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.006
forcer-0.006
oteric-0.006
usra-0.006
umatic-0.006
Token Does
Token ID8314
Feature activation-3.88e-02

Pos logit contributions
iped+0.016
pel+0.014
 HOR+0.013
rows+0.013
Py+0.013
Neg logit contributions
aturdays-0.015
forcer-0.015
usra-0.015
oteric-0.015
umatic-0.014
Token your
Token ID534
Feature activation-1.71e-02

Pos logit contributions
aturdays+0.026
forcer+0.025
usra+0.025
oteric+0.025
umatic+0.025
Neg logit contributions
iped-0.028
pel-0.025
 HOR-0.023
rows-0.023
Py-0.023
Token cousins
Token ID28706
Feature activation-4.57e-01

Pos logit contributions
aturdays+0.012
forcer+0.012
usra+0.012
oteric+0.012
umatic+0.011
Neg logit contributions
iped-0.012
pel-0.010
 HOR-0.010
rows-0.010
Py-0.010
Token half
Token ID2063
Feature activation-1.13e-01

Pos logit contributions
aturdays+0.300
forcer+0.289
oteric+0.285
usra+0.285
orean+0.280
Neg logit contributions
iped-0.344
pel-0.300
 HOR-0.284
rows-0.282
Py-0.280
Token-
Token ID12
Feature activation-4.31e-02

Pos logit contributions
aturdays+0.079
forcer+0.076
oteric+0.075
usra+0.075
orean+0.074
Neg logit contributions
iped-0.080
pel-0.069
 HOR-0.065
rows-0.065
Py-0.064
Tokenbrother
Token ID37343
Feature activation-1.88e-01

Pos logit contributions
aturdays+0.028
forcer+0.027
usra+0.027
oteric+0.027
umatic+0.026
Neg logit contributions
iped-0.033
pel-0.028
 HOR-0.027
rows-0.027
Py-0.027
Tokens
Token ID82
Feature activation+6.97e-02

Pos logit contributions
aturdays+0.128
forcer+0.123
usra+0.122
oteric+0.122
umatic+0.120
Neg logit contributions
iped-0.137
pel-0.119
 HOR-0.112
rows-0.112
Py-0.110
Token uncle
Token ID7711
Feature activation-8.45e-02

Pos logit contributions
iped+0.051
pel+0.044
 HOR+0.042
rows+0.042
Py+0.041
Neg logit contributions
aturdays-0.047
forcer-0.046
usra-0.045
oteric-0.045
umatic-0.044
Token including
Token ID1390
Feature activation+8.98e-02

Pos logit contributions
iped+0.008
pel+0.007
 HOR+0.007
rows+0.007
Py+0.006
Neg logit contributions
aturdays-0.009
forcer-0.009
oteric-0.009
usra-0.009
umatic-0.009
Token "
Token ID366
Feature activation-5.65e-02

Pos logit contributions
iped+0.065
pel+0.057
 HOR+0.053
rows+0.053
Py+0.053
Neg logit contributions
aturdays-0.062
forcer-0.059
usra-0.059
oteric-0.059
umatic-0.058
TokenJack
Token ID14295
Feature activation+1.24e-01

Pos logit contributions
aturdays+0.036
forcer+0.035
oteric+0.035
usra+0.034
orean+0.034
Neg logit contributions
iped-0.043
pel-0.038
 HOR-0.036
rows-0.036
Py-0.035
Tokenie
Token ID494
Feature activation-1.95e-02

Pos logit contributions
iped+0.088
pel+0.076
 HOR+0.072
rows+0.071
Py+0.070
Neg logit contributions
aturdays-0.087
forcer-0.084
usra-0.083
oteric-0.083
umatic-0.082
Token Chan
Token ID18704
Feature activation-4.56e-02

Pos logit contributions
aturdays+0.013
forcer+0.013
usra+0.012
oteric+0.012
umatic+0.012
Neg logit contributions
iped-0.014
pel-0.013
 HOR-0.012
rows-0.012
Py-0.012
Token Adventures
Token ID15640
Feature activation-4.52e-01

Pos logit contributions
aturdays+0.030
forcer+0.029
usra+0.029
oteric+0.029
umatic+0.028
Neg logit contributions
iped-0.034
pel-0.030
 HOR-0.028
rows-0.028
Py-0.027
Token",
Token ID1600
Feature activation-1.27e-01

Pos logit contributions
aturdays+0.294
forcer+0.282
usra+0.279
oteric+0.278
umatic+0.274
Neg logit contributions
iped-0.345
pel-0.301
 HOR-0.285
rows-0.284
Py-0.281
Token "
Token ID366
Feature activation-6.49e-02

Pos logit contributions
aturdays+0.078
forcer+0.074
usra+0.073
oteric+0.073
umatic+0.072
Neg logit contributions
iped-0.101
pel-0.089
 HOR-0.085
rows-0.084
Py-0.084
TokenThe
Token ID464
Feature activation-4.10e-02

Pos logit contributions
aturdays+0.042
forcer+0.040
oteric+0.040
usra+0.040
orean+0.039
Neg logit contributions
iped-0.050
pel-0.043
 HOR-0.041
rows-0.041
Py-0.040
Token Big
Token ID4403
Feature activation-2.42e-02

Pos logit contributions
aturdays+0.027
forcer+0.026
usra+0.026
oteric+0.026
umatic+0.026
Neg logit contributions
iped-0.030
pel-0.027
 HOR-0.025
rows-0.025
Py-0.025
Token Guy
Token ID13145
Feature activation+1.22e-01

Pos logit contributions
aturdays+0.019
forcer+0.018
usra+0.018
oteric+0.018
umatic+0.018
Neg logit contributions
iped-0.015
pel-0.013
 HOR-0.012
rows-0.012
Py-0.012
Token wetlands
Token ID49297
Feature activation-4.23e-02

Pos logit contributions
aturdays+0.034
forcer+0.033
oteric+0.033
usra+0.033
umatic+0.032
Neg logit contributions
iped-0.029
pel-0.025
rows-0.023
 HOR-0.023
Py-0.023
Token,
Token ID11
Feature activation-4.49e-02

Pos logit contributions
aturdays+0.031
forcer+0.030
usra+0.030
oteric+0.030
umatic+0.029
Neg logit contributions
iped-0.028
pel-0.024
 HOR-0.023
rows-0.023
Py-0.022
Token Rams
Token ID16077
Feature activation+8.49e-02

Pos logit contributions
aturdays+0.032
forcer+0.031
usra+0.031
oteric+0.031
umatic+0.031
Neg logit contributions
iped-0.031
pel-0.027
 HOR-0.025
rows-0.025
Py-0.024
Tokenar
Token ID283
Feature activation-6.84e-02

Pos logit contributions
iped+0.073
pel+0.065
 HOR+0.061
rows+0.061
Py+0.061
Neg logit contributions
aturdays-0.047
forcer-0.045
usra-0.044
oteric-0.044
umatic-0.044
Token envis
Token ID45633
Feature activation+3.97e-03

Pos logit contributions
aturdays+0.046
forcer+0.044
usra+0.044
oteric+0.044
umatic+0.043
Neg logit contributions
iped-0.050
pel-0.044
 HOR-0.041
rows-0.041
Py-0.041
Tokenaged
Token ID1886
Feature activation-4.50e-01

Pos logit contributions
iped+0.005
pel+0.004
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.001
forcer-0.001
oteric-0.001
usra-0.001
umatic-0.001
Token action
Token ID2223
Feature activation+1.98e-02

Pos logit contributions
aturdays+0.318
forcer+0.308
usra+0.304
oteric+0.304
umatic+0.299
Neg logit contributions
iped-0.316
pel-0.273
 HOR-0.256
rows-0.255
Py-0.252
Token by
Token ID416
Feature activation+3.02e-03

Pos logit contributions
iped+0.013
pel+0.011
 HOR+0.011
rows+0.011
Py+0.011
Neg logit contributions
aturdays-0.015
forcer-0.014
oteric-0.014
usra-0.014
umatic-0.014
Token both
Token ID1111
Feature activation+1.05e-02

Pos logit contributions
iped+0.002
pel+0.002
 HOR+0.002
rows+0.002
Py+0.002
Neg logit contributions
aturdays-0.002
forcer-0.002
usra-0.002
oteric-0.002
umatic-0.002
Token national
Token ID2260
Feature activation-6.64e-02

Pos logit contributions
iped+0.007
pel+0.006
 HOR+0.006
rows+0.006
Py+0.006
Neg logit contributions
aturdays-0.007
forcer-0.007
usra-0.007
oteric-0.007
umatic-0.007
Token governments
Token ID6905
Feature activation-4.07e-02

Pos logit contributions
aturdays+0.047
forcer+0.045
usra+0.045
oteric+0.045
umatic+0.044
Neg logit contributions
iped-0.047
pel-0.040
 HOR-0.038
rows-0.038
Py-0.037
Token come
Token ID1282
Feature activation-8.63e-02

Pos logit contributions
iped+0.013
pel+0.011
 HOR+0.011
rows+0.011
Py+0.010
Neg logit contributions
aturdays-0.010
forcer-0.010
usra-0.010
oteric-0.010
umatic-0.009
Token and
Token ID290
Feature activation+1.93e-02

Pos logit contributions
aturdays+0.062
forcer+0.060
usra+0.059
oteric+0.059
umatic+0.058
Neg logit contributions
iped-0.060
pel-0.051
 HOR-0.048
rows-0.048
Py-0.047
Token visit
Token ID3187
Feature activation-4.30e-02

Pos logit contributions
iped+0.014
pel+0.013
 HOR+0.012
rows+0.012
Py+0.012
Neg logit contributions
aturdays-0.013
forcer-0.012
usra-0.012
oteric-0.012
umatic-0.012
Token the
Token ID262
Feature activation+1.86e-02

Pos logit contributions
aturdays+0.031
forcer+0.030
usra+0.029
oteric+0.029
umatic+0.029
Neg logit contributions
iped-0.030
pel-0.026
 HOR-0.024
rows-0.024
Py-0.024
Token whore
Token ID40107
Feature activation+1.69e-01

Pos logit contributions
iped+0.013
pel+0.011
rows+0.010
 HOR+0.010
Py+0.010
Neg logit contributions
aturdays-0.014
forcer-0.013
usra-0.013
oteric-0.013
umatic-0.013
Tokenhouse
Token ID4803
Feature activation-4.42e-01

Pos logit contributions
iped+0.160
pel+0.143
 HOR+0.137
rows+0.137
Py+0.136
Neg logit contributions
aturdays-0.078
forcer-0.074
usra-0.073
oteric-0.072
umatic-0.071
Token
Token ID1399
Feature activation+1.89e-02

Pos logit contributions
aturdays+0.320
forcer+0.310
usra+0.305
oteric+0.305
umatic+0.301
Neg logit contributions
iped-0.304
pel-0.260
rows-0.243
 HOR-0.243
Py-0.241
Token<|endoftext|>
Token ID50256
Feature activation+4.47e-02

Pos logit contributions
iped+0.014
pel+0.012
 HOR+0.011
rows+0.011
Py+0.011
Neg logit contributions
aturdays-0.013
forcer-0.013
usra-0.012
oteric-0.012
umatic-0.012
TokenSteven
Token ID28292
Feature activation+9.33e-02

Pos logit contributions
iped+0.034
pel+0.030
 HOR+0.028
rows+0.028
 squid+0.028
Neg logit contributions
aturdays-0.029
forcer-0.028
usra-0.028
oteric-0.027
umatic-0.027
Token Gerrard
Token ID49314
Feature activation+3.80e-02

Pos logit contributions
iped+0.068
pel+0.059
 HOR+0.056
rows+0.055
Py+0.055
Neg logit contributions
aturdays-0.064
forcer-0.062
usra-0.061
oteric-0.061
umatic-0.060
Token scored
Token ID7781
Feature activation-5.22e-02

Pos logit contributions
iped+0.028
pel+0.025
 HOR+0.023
rows+0.023
Py+0.023
Neg logit contributions
aturdays-0.025
forcer-0.024
usra-0.024
oteric-0.024
umatic-0.024
Token
Token ID447
Feature activation+6.36e-03

Pos logit contributions
aturdays+0.007
forcer+0.006
usra+0.006
oteric+0.006
umatic+0.006
Neg logit contributions
iped-0.006
pel-0.005
 HOR-0.005
rows-0.005
Py-0.005
Token
Token ID251
Feature activation-7.37e-03

Pos logit contributions
iped+0.005
pel+0.004
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.004
forcer-0.004
oteric-0.004
usra-0.004
umatic-0.004
Token
Token ID198
Feature activation-6.66e-03

Pos logit contributions
aturdays+0.005
forcer+0.005
usra+0.005
oteric+0.005
umatic+0.004
Neg logit contributions
iped-0.006
pel-0.005
 HOR-0.005
rows-0.005
Py-0.005
Token
Token ID198
Feature activation-7.44e-03

Pos logit contributions
aturdays+0.004
forcer+0.004
usra+0.004
oteric+0.004
orean+0.004
Neg logit contributions
iped-0.005
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
TokenK
Token ID42
Feature activation+6.59e-04

Pos logit contributions
aturdays+0.005
forcer+0.005
usra+0.005
oteric+0.005
umatic+0.005
Neg logit contributions
iped-0.005
pel-0.005
 HOR-0.004
rows-0.004
Py-0.004
Tokenhal
Token ID14201
Feature activation-4.40e-01

Pos logit contributions
iped+0.001
pel+0.001
rows+0.001
 HOR+0.001
Phil+0.001
Neg logit contributions
aturdays-0.000
forcer-0.000
oteric-0.000
usra-0.000
umatic-0.000
Tokenisha
Token ID19388
Feature activation-1.31e-01

Pos logit contributions
aturdays+0.201
forcer+0.192
usra+0.189
oteric+0.187
umatic+0.184
Neg logit contributions
iped-0.417
pel-0.375
 HOR-0.361
rows-0.359
Py-0.356
Token told
Token ID1297
Feature activation+5.80e-02

Pos logit contributions
aturdays+0.088
forcer+0.085
usra+0.083
oteric+0.083
umatic+0.082
Neg logit contributions
iped-0.097
pel-0.084
 HOR-0.080
rows-0.079
Py-0.078
Token NBC
Token ID9716
Feature activation+1.42e-03

Pos logit contributions
iped+0.043
pel+0.037
 HOR+0.035
rows+0.035
Py+0.034
Neg logit contributions
aturdays-0.039
forcer-0.038
oteric-0.037
usra-0.037
umatic-0.037
Token news
Token ID1705
Feature activation+3.37e-02

Pos logit contributions
iped+0.001
pel+0.001
 HOR+0.001
rows+0.001
Py+0.001
Neg logit contributions
aturdays-0.001
forcer-0.001
usra-0.001
oteric-0.001
umatic-0.001
Token that
Token ID326
Feature activation-4.99e-02

Pos logit contributions
iped+0.024
pel+0.020
 HOR+0.019
rows+0.019
Py+0.019
Neg logit contributions
aturdays-0.024
forcer-0.023
usra-0.023
oteric-0.023
umatic-0.022
Token Catalan
Token ID31066
Feature activation-5.25e-02

Pos logit contributions
aturdays+0.012
forcer+0.012
usra+0.012
oteric+0.012
umatic+0.011
Neg logit contributions
iped-0.013
pel-0.012
 HOR-0.011
rows-0.011
Py-0.011
Token resolution
Token ID6323
Feature activation-6.79e-02

Pos logit contributions
aturdays+0.036
forcer+0.035
usra+0.034
oteric+0.034
orean+0.034
Neg logit contributions
iped-0.038
pel-0.033
 HOR-0.031
rows-0.031
Py-0.031
Token is
Token ID318
Feature activation+8.84e-03

Pos logit contributions
aturdays+0.045
forcer+0.044
usra+0.043
oteric+0.043
umatic+0.042
Neg logit contributions
iped-0.051
pel-0.044
 HOR-0.042
rows-0.041
Py-0.041
Token in
Token ID287
Feature activation-2.40e-03

Pos logit contributions
iped+0.006
pel+0.005
 HOR+0.005
rows+0.005
Py+0.005
Neg logit contributions
aturdays-0.006
forcer-0.006
usra-0.006
oteric-0.006
umatic-0.006
Token contra
Token ID25736
Feature activation-4.60e-03

Pos logit contributions
aturdays+0.001
forcer+0.001
usra+0.001
oteric+0.001
umatic+0.001
Neg logit contributions
iped-0.002
pel-0.002
 HOR-0.002
rows-0.002
Py-0.002
Tokenvention
Token ID4018
Feature activation-4.39e-01

Pos logit contributions
aturdays+0.001
forcer+0.001
oteric+0.001
usra+0.001
umatic+0.001
Neg logit contributions
iped-0.005
pel-0.005
 HOR-0.005
rows-0.005
Py-0.005
Token of
Token ID286
Feature activation-3.12e-02

Pos logit contributions
aturdays+0.315
forcer+0.304
usra+0.300
oteric+0.300
umatic+0.296
Neg logit contributions
iped-0.305
pel-0.262
 HOR-0.247
rows-0.246
Py-0.242
Token a
Token ID257
Feature activation-3.17e-02

Pos logit contributions
aturdays+0.021
forcer+0.020
usra+0.020
oteric+0.020
umatic+0.020
Neg logit contributions
iped-0.023
pel-0.020
 HOR-0.019
rows-0.019
Py-0.019
Token previous
Token ID2180
Feature activation-1.22e-01

Pos logit contributions
aturdays+0.022
forcer+0.021
usra+0.021
oteric+0.021
umatic+0.021
Neg logit contributions
iped-0.023
pel-0.020
rows-0.018
 HOR-0.018
Py-0.018
Token court
Token ID2184
Feature activation-3.90e-02

Pos logit contributions
aturdays+0.083
forcer+0.080
usra+0.079
oteric+0.079
umatic+0.078
Neg logit contributions
iped-0.090
pel-0.078
 HOR-0.073
rows-0.073
Py-0.072
Token ruling
Token ID6922
Feature activation-3.80e-02

Pos logit contributions
aturdays+0.026
forcer+0.025
usra+0.024
oteric+0.024
umatic+0.024
Neg logit contributions
iped-0.029
pel-0.026
 HOR-0.024
rows-0.024
Py-0.024
Token asked
Token ID1965
Feature activation-7.73e-02

Pos logit contributions
iped+0.009
pel+0.008
 HOR+0.008
rows+0.007
Py+0.007
Neg logit contributions
aturdays-0.008
forcer-0.008
usra-0.008
oteric-0.008
umatic-0.008
Token artist
Token ID6802
Feature activation+3.93e-02

Pos logit contributions
aturdays+0.055
forcer+0.053
usra+0.053
oteric+0.052
umatic+0.052
Neg logit contributions
iped-0.054
pel-0.046
 HOR-0.043
rows-0.043
Py-0.043
Token Mary
Token ID5335
Feature activation-8.25e-03

Pos logit contributions
iped+0.026
pel+0.023
 HOR+0.021
rows+0.021
Py+0.021
Neg logit contributions
aturdays-0.029
forcer-0.028
usra-0.028
oteric-0.028
umatic-0.027
Token Anne
Token ID15397
Feature activation-1.51e-03

Pos logit contributions
aturdays+0.006
forcer+0.005
usra+0.005
oteric+0.005
umatic+0.005
Neg logit contributions
iped-0.006
pel-0.005
 HOR-0.005
rows-0.005
Py-0.005
Token Bark
Token ID20511
Feature activation-1.12e-01

Pos logit contributions
aturdays+0.001
forcer+0.001
usra+0.001
oteric+0.001
umatic+0.001
Neg logit contributions
iped-0.001
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Tokenhouse
Token ID4803
Feature activation-4.39e-01

Pos logit contributions
aturdays+0.074
forcer+0.071
usra+0.070
oteric+0.070
umatic+0.069
Neg logit contributions
iped-0.085
pel-0.074
 HOR-0.070
rows-0.069
Py-0.069
Token why
Token ID1521
Feature activation-4.53e-03

Pos logit contributions
aturdays+0.326
forcer+0.316
usra+0.312
oteric+0.312
umatic+0.308
Neg logit contributions
iped-0.292
pel-0.250
 HOR-0.234
rows-0.233
Py-0.230
Token the
Token ID262
Feature activation-2.81e-02

Pos logit contributions
aturdays+0.003
forcer+0.003
usra+0.003
oteric+0.003
umatic+0.003
Neg logit contributions
iped-0.003
pel-0.003
 HOR-0.003
rows-0.003
Py-0.003
Token squirrel
Token ID33039
Feature activation-1.90e-01

Pos logit contributions
aturdays+0.020
forcer+0.019
usra+0.019
oteric+0.019
umatic+0.018
Neg logit contributions
iped-0.020
pel-0.017
 HOR-0.016
rows-0.016
Py-0.016
Tokens
Token ID82
Feature activation+5.00e-02

Pos logit contributions
aturdays+0.128
forcer+0.123
usra+0.122
oteric+0.122
umatic+0.120
Neg logit contributions
iped-0.140
pel-0.122
 HOR-0.115
rows-0.114
Py-0.113
Token would
Token ID561
Feature activation+3.63e-02

Pos logit contributions
iped+0.038
pel+0.033
 HOR+0.031
rows+0.031
Py+0.031
Neg logit contributions
aturdays-0.033
forcer-0.031
usra-0.031
oteric-0.031
umatic-0.030
Token Gand
Token ID17727
Feature activation-6.04e-02

Pos logit contributions
aturdays+0.030
forcer+0.028
usra+0.028
oteric+0.028
umatic+0.028
Neg logit contributions
iped-0.031
pel-0.027
 HOR-0.025
rows-0.025
Py-0.025
Tokenalf
Token ID1604
Feature activation-2.89e-01

Pos logit contributions
aturdays+0.040
forcer+0.039
usra+0.038
oteric+0.038
umatic+0.038
Neg logit contributions
iped-0.045
pel-0.039
 HOR-0.037
rows-0.037
Py-0.036
Token reports
Token ID3136
Feature activation-5.83e-02

Pos logit contributions
aturdays+0.204
forcer+0.197
usra+0.195
oteric+0.194
orean+0.193
Neg logit contributions
iped-0.204
pel-0.176
 HOR-0.164
rows-0.164
Py-0.163
Token that
Token ID326
Feature activation+8.65e-03

Pos logit contributions
aturdays+0.043
forcer+0.041
usra+0.041
oteric+0.041
umatic+0.040
Neg logit contributions
iped-0.039
pel-0.034
 HOR-0.032
rows-0.032
Py-0.031
Token Sar
Token ID6866
Feature activation-1.46e-01

Pos logit contributions
iped+0.006
pel+0.005
 HOR+0.005
rows+0.005
Py+0.005
Neg logit contributions
aturdays-0.006
forcer-0.006
usra-0.006
oteric-0.006
umatic-0.006
Tokenuman
Token ID3778
Feature activation-4.32e-01

Pos logit contributions
aturdays+0.026
forcer+0.023
oteric+0.022
usra+0.022
umatic+0.020
Neg logit contributions
iped-0.180
pel-0.165
 HOR-0.161
rows-0.160
Py-0.159
Token has
Token ID468
Feature activation-2.42e-02

Pos logit contributions
aturdays+0.287
forcer+0.279
usra+0.275
oteric+0.274
umatic+0.270
Neg logit contributions
iped-0.321
pel-0.279
 HOR-0.263
rows-0.262
Py-0.259
Token betrayed
Token ID26281
Feature activation+1.33e-01

Pos logit contributions
aturdays+0.016
forcer+0.016
usra+0.016
oteric+0.016
umatic+0.015
Neg logit contributions
iped-0.018
pel-0.015
 HOR-0.014
rows-0.014
Py-0.014
Token them
Token ID606
Feature activation-5.12e-02

Pos logit contributions
iped+0.098
pel+0.085
 HOR+0.080
rows+0.080
Py+0.079
Neg logit contributions
aturdays-0.091
forcer-0.087
oteric-0.086
usra-0.086
umatic-0.084
Token and
Token ID290
Feature activation+7.17e-03

Pos logit contributions
aturdays+0.038
forcer+0.037
oteric+0.036
usra+0.036
umatic+0.036
Neg logit contributions
iped-0.035
pel-0.030
 HOR-0.028
rows-0.028
Py-0.027
Token is
Token ID318
Feature activation+8.03e-03

Pos logit contributions
iped+0.005
pel+0.005
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.005
forcer-0.005
usra-0.005
oteric-0.005
umatic-0.005
Token profit
Token ID7630
Feature activation-7.03e-02

Pos logit contributions
iped+0.007
pel+0.006
 HOR+0.005
rows+0.005
Py+0.005
Neg logit contributions
aturdays-0.007
forcer-0.007
usra-0.007
oteric-0.007
umatic-0.006
Token generated
Token ID7560
Feature activation-4.15e-02

Pos logit contributions
aturdays+0.049
forcer+0.047
oteric+0.047
usra+0.047
umatic+0.046
Neg logit contributions
iped-0.050
pel-0.044
 HOR-0.041
rows-0.040
Py-0.040
Token by
Token ID416
Feature activation+2.12e-02

Pos logit contributions
aturdays+0.031
forcer+0.030
oteric+0.029
usra+0.029
umatic+0.029
Neg logit contributions
iped-0.028
pel-0.024
 HOR-0.022
rows-0.022
Py-0.022
Token an
Token ID281
Feature activation-1.60e-02

Pos logit contributions
iped+0.015
pel+0.013
 HOR+0.012
rows+0.012
Py+0.012
Neg logit contributions
aturdays-0.015
forcer-0.015
usra-0.014
oteric-0.014
umatic-0.014
Token underlying
Token ID10238
Feature activation-6.12e-02

Pos logit contributions
aturdays+0.011
forcer+0.011
usra+0.011
oteric+0.011
umatic+0.011
Neg logit contributions
iped-0.011
pel-0.010
 HOR-0.009
rows-0.009
Py-0.009
Token asset
Token ID11171
Feature activation-4.31e-01

Pos logit contributions
aturdays+0.043
forcer+0.042
usra+0.041
oteric+0.041
umatic+0.040
Neg logit contributions
iped-0.043
pel-0.038
 HOR-0.035
rows-0.035
Py-0.035
Token,
Token ID11
Feature activation-2.12e-02

Pos logit contributions
aturdays+0.312
forcer+0.304
usra+0.299
oteric+0.298
umatic+0.294
Neg logit contributions
iped-0.294
pel-0.254
 HOR-0.237
rows-0.235
Py-0.234
Token such
Token ID884
Feature activation+1.34e-02

Pos logit contributions
aturdays+0.016
forcer+0.015
usra+0.015
oteric+0.015
umatic+0.015
Neg logit contributions
iped-0.014
pel-0.012
 HOR-0.011
rows-0.011
Py-0.011
Token as
Token ID355
Feature activation+5.24e-03

Pos logit contributions
iped+0.009
pel+0.008
 HOR+0.007
rows+0.007
Py+0.007
Neg logit contributions
aturdays-0.010
forcer-0.009
usra-0.009
oteric-0.009
umatic-0.009
Token government
Token ID1230
Feature activation+3.22e-02

Pos logit contributions
iped+0.004
pel+0.003
 HOR+0.003
rows+0.003
Py+0.003
Neg logit contributions
aturdays-0.004
forcer-0.004
usra-0.004
oteric-0.004
umatic-0.003
Token buildings
Token ID6832
Feature activation+3.09e-02

Pos logit contributions
iped+0.028
pel+0.025
 HOR+0.024
rows+0.024
Py+0.024
Neg logit contributions
aturdays-0.017
forcer-0.016
usra-0.016
oteric-0.016
umatic-0.016
Token 4
Token ID604
Feature activation+5.10e-02

Pos logit contributions
iped+0.010
pel+0.009
 HOR+0.008
rows+0.008
Py+0.008
Neg logit contributions
aturdays-0.011
forcer-0.011
usra-0.010
oteric-0.010
umatic-0.010
Token.
Token ID13
Feature activation+2.70e-02

Pos logit contributions
iped+0.035
pel+0.030
 HOR+0.028
rows+0.028
Py+0.027
Neg logit contributions
aturdays-0.037
forcer-0.036
usra-0.036
oteric-0.035
umatic-0.035
Token
Token ID198
Feature activation+2.70e-02

Pos logit contributions
iped+0.019
pel+0.016
rows+0.015
 HOR+0.015
Py+0.015
Neg logit contributions
aturdays-0.019
forcer-0.018
usra-0.018
oteric-0.018
umatic-0.018
Token
Token ID198
Feature activation+2.44e-02

Pos logit contributions
iped+0.020
pel+0.017
 HOR+0.016
rows+0.016
Py+0.016
Neg logit contributions
aturdays-0.018
forcer-0.018
usra-0.018
oteric-0.018
orean-0.017
TokenExpl
Token ID18438
Feature activation+6.53e-02

Pos logit contributions
iped+0.019
pel+0.016
 HOR+0.016
rows+0.015
 squid+0.015
Neg logit contributions
aturdays-0.016
forcer-0.015
usra-0.015
oteric-0.015
umatic-0.015
Tokenained
Token ID1328
Feature activation-4.26e-01

Pos logit contributions
iped+0.054
pel+0.048
rows+0.045
 HOR+0.045
Py+0.045
Neg logit contributions
aturdays-0.038
forcer-0.037
oteric-0.036
usra-0.036
umatic-0.035
Token Murd
Token ID20305
Feature activation-7.95e-03

Pos logit contributions
aturdays+0.295
forcer+0.287
usra+0.282
oteric+0.282
umatic+0.278
Neg logit contributions
iped-0.304
pel-0.264
rows-0.248
 HOR-0.246
Py-0.244
Tokeny
Token ID88
Feature activation-2.50e-02

Pos logit contributions
aturdays+0.005
forcer+0.005
usra+0.005
oteric+0.005
umatic+0.005
Neg logit contributions
iped-0.006
pel-0.006
 HOR-0.005
rows-0.005
Py-0.005
Token,
Token ID11
Feature activation+7.75e-03

Pos logit contributions
aturdays+0.019
forcer+0.018
oteric+0.018
usra+0.018
umatic+0.018
Neg logit contributions
iped-0.016
pel-0.014
 HOR-0.013
rows-0.013
Py-0.013
Token
Token ID564
Feature activation-3.34e-03

Pos logit contributions
iped+0.006
pel+0.005
 HOR+0.005
rows+0.005
Py+0.005
Neg logit contributions
aturdays-0.005
forcer-0.005
usra-0.005
oteric-0.005
umatic-0.005
Token
Token ID250
Feature activation-1.58e-02

Pos logit contributions
aturdays+0.002
forcer+0.002
usra+0.002
oteric+0.002
umatic+0.002
Neg logit contributions
iped-0.002
pel-0.002
 HOR-0.002
rows-0.002
Py-0.002
Tokenes
Token ID274
Feature activation-5.33e-02

Pos logit contributions
aturdays+0.031
forcer+0.030
usra+0.030
oteric+0.030
umatic+0.029
Neg logit contributions
iped-0.036
pel-0.031
 HOR-0.030
rows-0.030
Py-0.029
Token")
Token ID4943
Feature activation+4.58e-02

Pos logit contributions
aturdays+0.035
forcer+0.034
usra+0.033
oteric+0.033
orean+0.033
Neg logit contributions
iped-0.040
pel-0.035
 HOR-0.033
rows-0.033
Py-0.032
Token is
Token ID318
Feature activation-2.58e-02

Pos logit contributions
iped+0.035
pel+0.030
 HOR+0.028
rows+0.028
Py+0.028
Neg logit contributions
aturdays-0.030
forcer-0.029
usra-0.029
oteric-0.029
orean-0.028
Token an
Token ID281
Feature activation-1.72e-02

Pos logit contributions
aturdays+0.017
forcer+0.017
oteric+0.017
usra+0.017
orean+0.017
Neg logit contributions
iped-0.018
pel-0.016
rows-0.015
 HOR-0.015
Py-0.015
Token autos
Token ID44619
Feature activation-2.73e-01

Pos logit contributions
aturdays+0.007
forcer+0.007
usra+0.007
oteric+0.007
umatic+0.006
Neg logit contributions
iped-0.017
pel-0.015
 HOR-0.015
rows-0.015
Py-0.015
Tokenomal
Token ID18048
Feature activation-4.26e-01

Pos logit contributions
aturdays+0.034
forcer+0.026
usra+0.023
oteric+0.022
umatic+0.019
Neg logit contributions
iped-0.353
pel-0.328
rows-0.317
 HOR-0.315
Py-0.315
Token dominant
Token ID11410
Feature activation+1.10e-01

Pos logit contributions
aturdays+0.368
forcer+0.365
usra+0.361
oteric+0.360
orean+0.356
Neg logit contributions
iped-0.225
pel-0.186
 HOR-0.172
rows-0.171
Py-0.167
Token ker
Token ID41927
Feature activation+5.19e-02

Pos logit contributions
iped+0.060
pel+0.049
 HOR+0.046
rows+0.045
Py+0.044
Neg logit contributions
aturdays-0.095
forcer-0.093
usra-0.092
oteric-0.092
umatic-0.091
Tokenat
Token ID265
Feature activation-1.02e-01

Pos logit contributions
iped+0.048
pel+0.043
rows+0.041
 HOR+0.041
Py+0.041
Neg logit contributions
aturdays-0.026
forcer-0.024
usra-0.024
oteric-0.024
umatic-0.023
Tokenoder
Token ID12342
Feature activation-1.74e-01

Pos logit contributions
aturdays+0.045
forcer+0.044
oteric+0.043
usra+0.042
orean+0.042
Neg logit contributions
iped-0.098
pel-0.089
 HOR-0.085
rows-0.084
Py-0.084
Tokenma
Token ID2611
Feature activation-3.94e-02

Pos logit contributions
aturdays+0.115
forcer+0.111
usra+0.109
oteric+0.109
umatic+0.108
Neg logit contributions
iped-0.130
pel-0.113
 HOR-0.107
rows-0.107
Py-0.105
Tokenes
Token ID274
Feature activation-5.59e-02

Pos logit contributions
aturdays+0.026
forcer+0.025
oteric+0.025
usra+0.025
umatic+0.024
Neg logit contributions
iped-0.030
pel-0.026
 HOR-0.025
rows-0.025
Py-0.025
Token")
Token ID4943
Feature activation+2.77e-02

Pos logit contributions
aturdays+0.039
forcer+0.038
usra+0.037
oteric+0.037
umatic+0.037
Neg logit contributions
iped-0.039
pel-0.034
 HOR-0.032
rows-0.032
Py-0.032
Token is
Token ID318
Feature activation-3.72e-02

Pos logit contributions
iped+0.021
pel+0.018
 HOR+0.017
rows+0.017
Py+0.017
Neg logit contributions
aturdays-0.019
forcer-0.018
usra-0.018
oteric-0.018
umatic-0.017
Token an
Token ID281
Feature activation-2.11e-02

Pos logit contributions
aturdays+0.025
forcer+0.024
oteric+0.023
usra+0.023
umatic+0.023
Neg logit contributions
iped-0.028
pel-0.024
 HOR-0.023
rows-0.023
Py-0.022
Token autos
Token ID44619
Feature activation-2.68e-01

Pos logit contributions
aturdays+0.011
forcer+0.010
oteric+0.010
usra+0.010
umatic+0.010
Neg logit contributions
iped-0.019
pel-0.017
 HOR-0.016
rows-0.016
Py-0.016
Tokenomal
Token ID18048
Feature activation-4.25e-01

Pos logit contributions
aturdays+0.030
forcer+0.023
usra+0.020
oteric+0.020
umatic+0.017
Neg logit contributions
iped-0.350
pel-0.324
rows-0.314
 HOR-0.313
Py-0.312
Token dominant
Token ID11410
Feature activation+1.10e-01

Pos logit contributions
aturdays+0.374
forcer+0.365
oteric+0.362
usra+0.361
umatic+0.358
Neg logit contributions
iped-0.220
pel-0.181
rows-0.167
 HOR-0.166
Py-0.163
Token ker
Token ID41927
Feature activation+5.82e-02

Pos logit contributions
iped+0.077
pel+0.066
 HOR+0.063
rows+0.063
Py+0.062
Neg logit contributions
aturdays-0.077
forcer-0.075
usra-0.074
oteric-0.074
umatic-0.073
Tokenat
Token ID265
Feature activation-9.68e-02

Pos logit contributions
iped+0.053
pel+0.048
rows+0.045
 HOR+0.045
Py+0.045
Neg logit contributions
aturdays-0.029
forcer-0.028
usra-0.027
oteric-0.027
umatic-0.026
Tokenoder
Token ID12342
Feature activation-1.72e-01

Pos logit contributions
aturdays+0.047
forcer+0.045
oteric+0.044
usra+0.044
umatic+0.043
Neg logit contributions
iped-0.089
pel-0.080
 HOR-0.077
rows-0.076
Py-0.076
Tokenma
Token ID2611
Feature activation-2.99e-02

Pos logit contributions
aturdays+0.114
forcer+0.110
usra+0.108
oteric+0.108
umatic+0.106
Neg logit contributions
iped-0.129
pel-0.113
 HOR-0.107
rows-0.107
Py-0.105
Tokeniving
Token ID1412
Feature activation+4.78e-02

Pos logit contributions
iped+0.040
pel+0.035
 HOR+0.033
rows+0.033
Py+0.032
Neg logit contributions
aturdays-0.034
forcer-0.033
oteric-0.032
usra-0.032
umatic-0.032
Token/
Token ID14
Feature activation+7.84e-02

Pos logit contributions
iped+0.032
pel+0.028
 HOR+0.026
rows+0.026
Py+0.026
Neg logit contributions
aturdays-0.035
forcer-0.034
usra-0.034
oteric-0.034
umatic-0.033
Tokensu
Token ID2385
Feature activation-4.96e-03

Pos logit contributions
iped+0.060
pel+0.052
 HOR+0.050
rows+0.049
Py+0.048
Neg logit contributions
aturdays-0.051
forcer-0.049
usra-0.048
oteric-0.048
umatic-0.048
Tokenicide
Token ID5285
Feature activation-1.24e-01

Pos logit contributions
aturdays+0.002
forcer+0.002
usra+0.002
oteric+0.002
umatic+0.002
Neg logit contributions
iped-0.005
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
Tokenpre
Token ID3866
Feature activation+1.17e-01

Pos logit contributions
aturdays+0.090
forcer+0.087
usra+0.086
oteric+0.086
umatic+0.084
Neg logit contributions
iped-0.084
pel-0.072
 HOR-0.069
rows-0.068
Py-0.067
Tokenvention
Token ID4018
Feature activation-4.23e-01

Pos logit contributions
iped+0.136
pel+0.125
 HOR+0.121
rows+0.121
Py+0.120
Neg logit contributions
aturdays-0.029
forcer-0.026
oteric-0.025
usra-0.025
umatic-0.024
Token/
Token ID14
Feature activation+6.35e-02

Pos logit contributions
aturdays+0.306
forcer+0.297
usra+0.292
oteric+0.292
umatic+0.288
Neg logit contributions
iped-0.289
pel-0.248
 HOR-0.236
rows-0.232
Py-0.229
Tokendoc
Token ID15390
Feature activation+1.65e-01

Pos logit contributions
iped+0.049
pel+0.043
 HOR+0.041
rows+0.040
Py+0.040
Neg logit contributions
aturdays-0.040
forcer-0.039
usra-0.038
oteric-0.038
umatic-0.038
Tokenuments
Token ID2886
Feature activation+1.11e-01

Pos logit contributions
iped+0.115
pel+0.099
 HOR+0.094
rows+0.092
 squid+0.091
Neg logit contributions
aturdays-0.118
forcer-0.114
usra-0.113
oteric-0.113
umatic-0.111
Token/
Token ID14
Feature activation+8.81e-02

Pos logit contributions
iped+0.070
pel+0.060
 HOR+0.056
rows+0.055
Py+0.054
Neg logit contributions
aturdays-0.087
forcer-0.084
usra-0.083
oteric-0.083
umatic-0.082
Tokenolder
Token ID19892
Feature activation+1.24e-01

Pos logit contributions
iped+0.067
pel+0.059
 HOR+0.056
rows+0.055
Py+0.055
Neg logit contributions
aturdays-0.057
forcer-0.055
usra-0.054
oteric-0.054
umatic-0.053
Token problem
Token ID1917
Feature activation+1.29e-01

Pos logit contributions
aturdays+0.007
forcer+0.007
usra+0.007
oteric+0.007
umatic+0.007
Neg logit contributions
iped-0.006
pel-0.005
 HOR-0.005
rows-0.005
Py-0.005
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
iped+0.086
pel+0.074
 HOR+0.069
rows+0.069
Py+0.068
Neg logit contributions
aturdays-0.096
forcer-0.094
usra-0.092
oteric-0.092
umatic-0.091
Token
Token ID447
Feature activation+1.37e-02

Pos logit contributions
aturdays+0.007
forcer+0.007
usra+0.007
oteric+0.007
orean+0.007
Neg logit contributions
iped-0.008
pel-0.007
 HOR-0.006
rows-0.006
Py-0.006
Token
Token ID251
Feature activation-1.38e-04

Pos logit contributions
iped+0.010
pel+0.009
 HOR+0.008
rows+0.008
Py+0.008
Neg logit contributions
aturdays-0.009
forcer-0.009
usra-0.009
oteric-0.009
umatic-0.009
Token Expl
Token ID5905
Feature activation-8.31e-02

Pos logit contributions
aturdays+0.000
forcer+0.000
usra+0.000
oteric+0.000
orean+0.000
Neg logit contributions
iped-0.000
pel-0.000
 HOR-0.000
rows-0.000
Py-0.000
Tokenained
Token ID1328
Feature activation-4.22e-01

Pos logit contributions
aturdays+0.049
forcer+0.047
usra+0.046
oteric+0.046
umatic+0.045
Neg logit contributions
iped-0.069
pel-0.061
 HOR-0.058
rows-0.058
Py-0.057
Token Marin
Token ID36384
Feature activation+3.30e-02

Pos logit contributions
aturdays+0.299
forcer+0.290
usra+0.285
oteric+0.285
orean+0.280
Neg logit contributions
iped-0.296
pel-0.256
 HOR-0.240
rows-0.239
Py-0.237
Token,
Token ID11
Feature activation+2.21e-03

Pos logit contributions
iped+0.022
pel+0.018
 HOR+0.017
rows+0.017
Py+0.017
Neg logit contributions
aturdays-0.025
forcer-0.024
usra-0.024
oteric-0.024
umatic-0.024
Token
Token ID564
Feature activation+2.72e-02

Pos logit contributions
iped+0.002
pel+0.001
 HOR+0.001
rows+0.001
Py+0.001
Neg logit contributions
aturdays-0.001
forcer-0.001
usra-0.001
oteric-0.001
umatic-0.001
Token
Token ID250
Feature activation+4.17e-05

Pos logit contributions
iped+0.019
pel+0.017
 HOR+0.016
rows+0.015
Py+0.015
Neg logit contributions
aturdays-0.019
forcer-0.019
oteric-0.018
usra-0.018
umatic-0.018
TokenWhen
Token ID2215
Feature activation-3.83e-02

Pos logit contributions
iped+0.000
pel+0.000
 HOR+0.000
rows+0.000
Py+0.000
Neg logit contributions
aturdays-0.000
forcer-0.000
oteric-0.000
usra-0.000
umatic-0.000
Tokengon
Token ID14520
Feature activation+1.32e-01

Pos logit contributions
aturdays+0.095
forcer+0.093
usra+0.092
oteric+0.091
umatic+0.090
Neg logit contributions
iped-0.062
pel-0.052
 HOR-0.049
rows-0.047
Py-0.047
Token.
Token ID13
Feature activation-9.18e-03

Pos logit contributions
iped+0.089
pel+0.077
 HOR+0.075
rows+0.073
Py+0.073
Neg logit contributions
aturdays-0.095
forcer-0.093
usra-0.092
oteric-0.091
umatic-0.091
Tokengov
Token ID9567
Feature activation-1.20e-01

Pos logit contributions
aturdays+0.005
forcer+0.004
usra+0.004
oteric+0.004
umatic+0.004
Neg logit contributions
iped-0.008
pel-0.007
 HOR-0.007
rows-0.007
Py-0.007
Token/
Token ID14
Feature activation+8.60e-02

Pos logit contributions
aturdays+0.091
forcer+0.089
usra+0.088
oteric+0.088
umatic+0.086
Neg logit contributions
iped-0.077
pel-0.066
 HOR-0.062
rows-0.061
Py-0.061
Tokenpre
Token ID3866
Feature activation+1.36e-01

Pos logit contributions
iped+0.062
pel+0.054
 HOR+0.051
rows+0.051
Py+0.050
Neg logit contributions
aturdays-0.059
forcer-0.057
usra-0.056
oteric-0.056
umatic-0.055
Tokenvention
Token ID4018
Feature activation-4.21e-01

Pos logit contributions
iped+0.138
pel+0.125
 HOR+0.120
rows+0.120
Py+0.119
Neg logit contributions
aturdays-0.055
forcer-0.051
oteric-0.050
usra-0.050
orean-0.048
Tokenwell
Token ID4053
Feature activation-1.13e-01

Pos logit contributions
aturdays+0.321
forcer+0.312
usra+0.308
oteric+0.307
umatic+0.304
Neg logit contributions
iped-0.271
pel-0.231
 HOR-0.218
rows-0.215
Py-0.211
Tokenness
Token ID1108
Feature activation-1.46e-02

Pos logit contributions
aturdays+0.065
forcer+0.062
usra+0.061
oteric+0.061
umatic+0.059
Neg logit contributions
iped-0.095
pel-0.085
 HOR-0.081
rows-0.080
Py-0.079
Token/
Token ID14
Feature activation+6.84e-02

Pos logit contributions
aturdays+0.011
forcer+0.011
usra+0.011
oteric+0.011
umatic+0.011
Neg logit contributions
iped-0.009
pel-0.008
 HOR-0.008
rows-0.007
Py-0.007
Tokensaf
Token ID49585
Feature activation-1.45e-01

Pos logit contributions
iped+0.050
pel+0.044
 HOR+0.042
rows+0.041
Py+0.041
Neg logit contributions
aturdays-0.046
forcer-0.044
usra-0.044
oteric-0.044
umatic-0.043
Tokenel
Token ID417
Feature activation+5.22e-02

Pos logit contributions
aturdays+0.092
forcer+0.089
oteric+0.088
usra+0.088
umatic+0.086
Neg logit contributions
iped-0.111
pel-0.097
 HOR-0.093
rows-0.092
Py-0.091
Token/
Token ID14
Feature activation+3.41e-02

Pos logit contributions
iped+0.095
pel+0.080
 HOR+0.075
rows+0.075
Py+0.074
Neg logit contributions
aturdays-0.116
forcer-0.112
oteric-0.111
usra-0.111
umatic-0.109
Tokenany
Token ID1092
Feature activation-3.58e-03

Pos logit contributions
iped+0.025
pel+0.022
 HOR+0.021
rows+0.021
Py+0.020
Neg logit contributions
aturdays-0.023
forcer-0.022
usra-0.022
oteric-0.022
umatic-0.021
Token Mode
Token ID10363
Feature activation-2.57e-01

Pos logit contributions
aturdays+0.002
forcer+0.002
oteric+0.002
usra+0.002
umatic+0.002
Neg logit contributions
iped-0.003
pel-0.003
 HOR-0.003
rows-0.003
Py-0.003
Token:
Token ID25
Feature activation-2.26e-02

Pos logit contributions
aturdays+0.183
forcer+0.182
usra+0.178
oteric+0.177
hal+0.175
Neg logit contributions
iped-0.175
pel-0.150
 HOR-0.141
rows-0.140
 squid-0.137
TokenMan
Token ID5124
Feature activation+3.10e-02

Pos logit contributions
aturdays+0.017
forcer+0.017
oteric+0.017
usra+0.017
umatic+0.016
Neg logit contributions
iped-0.014
pel-0.012
 HOR-0.012
rows-0.012
Py-0.011
Tokenaged
Token ID1886
Feature activation-4.21e-01

Pos logit contributions
iped+0.037
pel+0.034
 HOR+0.032
rows+0.032
Py+0.032
Neg logit contributions
aturdays-0.007
forcer-0.006
usra-0.006
oteric-0.006
umatic-0.006
Token Access
Token ID8798
Feature activation+6.69e-02

Pos logit contributions
aturdays+0.310
forcer+0.308
oteric+0.303
usra+0.302
orean+0.301
Neg logit contributions
iped-0.275
pel-0.234
rows-0.217
 HOR-0.215
 squid-0.214
Token Point
Token ID6252
Feature activation-4.92e-02

Pos logit contributions
iped+0.043
pel+0.036
rows+0.034
 HOR+0.034
 squid+0.033
Neg logit contributions
aturdays-0.050
forcer-0.050
oteric-0.049
usra-0.048
orean-0.048
Token:
Token ID25
Feature activation+3.63e-03

Pos logit contributions
aturdays+0.036
forcer+0.035
usra+0.035
oteric+0.034
orean+0.034
Neg logit contributions
iped-0.033
pel-0.028
 HOR-0.027
rows-0.026
Py-0.026
Token Not
Token ID1892
Feature activation-6.02e-02

Pos logit contributions
iped+0.003
pel+0.003
rows+0.002
 HOR+0.002
Py+0.002
Neg logit contributions
aturdays-0.002
forcer-0.002
oteric-0.002
usra-0.002
umatic-0.002
Token-
Token ID12
Feature activation-3.65e-02

Pos logit contributions
aturdays+0.041
forcer+0.040
oteric+0.039
usra+0.039
orean+0.039
Neg logit contributions
iped-0.043
pel-0.038
 HOR-0.036
rows-0.035
Py-0.035
Token/
Token ID14
Feature activation+3.55e-02

Pos logit contributions
iped+0.106
pel+0.091
 HOR+0.086
rows+0.086
Py+0.085
Neg logit contributions
aturdays-0.104
forcer-0.101
oteric-0.100
usra-0.099
orean-0.098
Tokenany
Token ID1092
Feature activation-1.84e-03

Pos logit contributions
iped+0.023
pel+0.020
 HOR+0.019
rows+0.019
Py+0.019
Neg logit contributions
aturdays-0.027
forcer-0.026
usra-0.025
oteric-0.025
hal-0.025
Token Mode
Token ID10363
Feature activation-2.57e-01

Pos logit contributions
aturdays+0.001
forcer+0.001
oteric+0.001
usra+0.001
hal+0.001
Neg logit contributions
iped-0.001
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Token:
Token ID25
Feature activation+7.87e-03

Pos logit contributions
aturdays+0.184
forcer+0.179
oteric+0.176
usra+0.176
umatic+0.174
Neg logit contributions
iped-0.175
pel-0.152
rows-0.141
 HOR-0.141
Py-0.140
TokenMan
Token ID5124
Feature activation+4.09e-02

Pos logit contributions
iped+0.005
pel+0.005
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.006
forcer-0.005
oteric-0.005
usra-0.005
orean-0.005
Tokenaged
Token ID1886
Feature activation-4.18e-01

Pos logit contributions
iped+0.037
pel+0.033
 HOR+0.031
rows+0.031
Py+0.031
Neg logit contributions
aturdays-0.021
forcer-0.020
usra-0.020
oteric-0.020
umatic-0.019
Token Access
Token ID8798
Feature activation+7.83e-02

Pos logit contributions
aturdays+0.292
forcer+0.283
oteric+0.279
usra+0.277
umatic+0.273
Neg logit contributions
iped-0.296
pel-0.255
rows-0.239
 HOR-0.239
Py-0.236
Token Point
Token ID6252
Feature activation-3.51e-02

Pos logit contributions
iped+0.055
pel+0.048
 HOR+0.045
rows+0.045
Py+0.045
Neg logit contributions
aturdays-0.054
forcer-0.053
oteric-0.052
usra-0.052
orean-0.052
Token:
Token ID25
Feature activation+2.15e-02

Pos logit contributions
aturdays+0.025
forcer+0.025
oteric+0.024
usra+0.024
orean+0.024
Neg logit contributions
iped-0.024
pel-0.021
rows-0.019
 HOR-0.019
Py-0.019
Token Not
Token ID1892
Feature activation-5.91e-02

Pos logit contributions
iped+0.016
pel+0.014
 HOR+0.013
rows+0.013
Py+0.013
Neg logit contributions
aturdays-0.014
forcer-0.014
usra-0.013
oteric-0.013
orean-0.013
Token-
Token ID12
Feature activation-3.43e-02

Pos logit contributions
aturdays+0.043
forcer+0.041
oteric+0.041
usra+0.041
orean+0.040
Neg logit contributions
iped-0.040
pel-0.035
 HOR-0.033
rows-0.033
Py-0.032
Token.
Token ID13
Feature activation+3.58e-02

Pos logit contributions
iped+0.007
pel+0.006
 HOR+0.005
rows+0.005
 squid+0.005
Neg logit contributions
aturdays-0.007
forcer-0.007
usra-0.007
oteric-0.007
orean-0.007
Tokengov
Token ID9567
Feature activation-8.92e-02

Pos logit contributions
iped+0.034
pel+0.030
 HOR+0.029
rows+0.029
Py+0.029
Neg logit contributions
aturdays-0.016
forcer-0.016
usra-0.016
oteric-0.015
orean-0.015
Token/
Token ID14
Feature activation+9.16e-02

Pos logit contributions
aturdays+0.067
forcer+0.066
usra+0.065
oteric+0.065
orean+0.064
Neg logit contributions
iped-0.058
pel-0.049
 HOR-0.046
rows-0.045
 squid-0.045
Tokenviolence
Token ID37502
Feature activation+9.41e-02

Pos logit contributions
iped+0.069
pel+0.060
 HOR+0.057
rows+0.056
Py+0.056
Neg logit contributions
aturdays-0.060
forcer-0.058
usra-0.058
oteric-0.057
umatic-0.056
Tokenpre
Token ID3866
Feature activation+1.38e-01

Pos logit contributions
iped+0.059
pel+0.050
 HOR+0.047
rows+0.046
Py+0.046
Neg logit contributions
aturdays-0.074
forcer-0.072
usra-0.071
oteric-0.071
umatic-0.070
Tokenvention
Token ID4018
Feature activation-4.17e-01

Pos logit contributions
iped+0.162
pel+0.149
 HOR+0.143
rows+0.143
Py+0.142
Neg logit contributions
aturdays-0.033
forcer-0.030
oteric-0.029
usra-0.029
umatic-0.027
Token/
Token ID14
Feature activation+7.57e-02

Pos logit contributions
aturdays+0.326
forcer+0.316
usra+0.313
oteric+0.312
umatic+0.308
Neg logit contributions
iped-0.262
pel-0.223
 HOR-0.209
rows-0.206
Py-0.204
Tokensu
Token ID2385
Feature activation-1.13e-02

Pos logit contributions
iped+0.055
pel+0.047
 HOR+0.045
rows+0.044
Py+0.044
Neg logit contributions
aturdays-0.052
forcer-0.050
usra-0.050
oteric-0.050
umatic-0.049
Tokenicide
Token ID5285
Feature activation-1.15e-01

Pos logit contributions
aturdays+0.006
forcer+0.006
usra+0.005
oteric+0.005
umatic+0.005
Neg logit contributions
iped-0.010
pel-0.009
 HOR-0.009
 squid-0.009
rows-0.009
Token/
Token ID14
Feature activation+6.95e-02

Pos logit contributions
aturdays+0.086
forcer+0.083
usra+0.082
oteric+0.082
umatic+0.080
Neg logit contributions
iped-0.077
pel-0.066
 HOR-0.062
rows-0.061
Py-0.060
Tokenstat
Token ID14269
Feature activation+9.23e-02

Pos logit contributions
iped+0.051
pel+0.044
 HOR+0.042
rows+0.041
Py+0.041
Neg logit contributions
aturdays-0.047
forcer-0.046
usra-0.045
oteric-0.045
umatic-0.044
Token reform
Token ID4975
Feature activation+3.29e-02

Pos logit contributions
iped+0.012
pel+0.010
 HOR+0.010
rows+0.010
Py+0.010
Neg logit contributions
aturdays-0.010
forcer-0.010
oteric-0.010
usra-0.010
umatic-0.010
Token the
Token ID262
Feature activation+3.75e-02

Pos logit contributions
iped+0.023
pel+0.020
 HOR+0.019
rows+0.019
Py+0.018
Neg logit contributions
aturdays-0.023
forcer-0.022
oteric-0.022
usra-0.022
umatic-0.022
Token state
Token ID1181
Feature activation-1.20e-02

Pos logit contributions
iped+0.027
pel+0.023
 HOR+0.022
rows+0.022
Py+0.021
Neg logit contributions
aturdays-0.026
forcer-0.025
usra-0.025
oteric-0.025
umatic-0.025
Token's
Token ID338
Feature activation+2.35e-02

Pos logit contributions
aturdays+0.008
forcer+0.008
usra+0.007
oteric+0.007
umatic+0.007
Neg logit contributions
iped-0.009
pel-0.008
 HOR-0.008
rows-0.008
Py-0.007
Token civil
Token ID3026
Feature activation-1.19e-01

Pos logit contributions
iped+0.016
pel+0.014
 HOR+0.013
rows+0.013
Py+0.013
Neg logit contributions
aturdays-0.017
forcer-0.016
usra-0.016
oteric-0.016
umatic-0.016
Token asset
Token ID11171
Feature activation-4.15e-01

Pos logit contributions
aturdays+0.080
forcer+0.077
usra+0.076
oteric+0.076
umatic+0.075
Neg logit contributions
iped-0.088
pel-0.077
 HOR-0.072
rows-0.072
Py-0.071
Token forfeiture
Token ID32075
Feature activation-5.39e-02

Pos logit contributions
aturdays+0.221
forcer+0.215
usra+0.210
orean+0.205
oteric+0.205
Neg logit contributions
iped-0.363
pel-0.325
rows-0.306
 HOR-0.306
Py-0.305
Token laws
Token ID3657
Feature activation-1.52e-02

Pos logit contributions
aturdays+0.036
forcer+0.036
usra+0.035
oteric+0.035
umatic+0.034
Neg logit contributions
iped-0.039
pel-0.034
 HOR-0.032
rows-0.032
Py-0.032
Token,
Token ID11
Feature activation+3.39e-02

Pos logit contributions
aturdays+0.011
forcer+0.011
oteric+0.011
usra+0.011
umatic+0.011
Neg logit contributions
iped-0.010
pel-0.009
 HOR-0.008
rows-0.008
Py-0.008
Token which
Token ID543
Feature activation+6.72e-02

Pos logit contributions
iped+0.023
pel+0.020
 HOR+0.018
rows+0.018
Py+0.018
Neg logit contributions
aturdays-0.025
forcer-0.024
oteric-0.024
usra-0.024
umatic-0.023
Token allow
Token ID1249
Feature activation-2.08e-02

Pos logit contributions
iped+0.052
pel+0.045
 HOR+0.043
rows+0.043
Py+0.042
Neg logit contributions
aturdays-0.043
forcer-0.041
usra-0.041
oteric-0.041
umatic-0.040
Token the
Token ID262
Feature activation-6.16e-03

Pos logit contributions
aturdays+0.015
forcer+0.015
oteric+0.014
usra+0.014
umatic+0.014
Neg logit contributions
iped-0.015
pel-0.013
 HOR-0.012
rows-0.012
Py-0.012
Token
Token ID564
Feature activation+1.93e-02

Pos logit contributions
aturdays+0.004
forcer+0.004
usra+0.004
oteric+0.004
umatic+0.004
Neg logit contributions
iped-0.004
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
Token
Token ID246
Feature activation-3.26e-02

Pos logit contributions
iped+0.016
pel+0.014
 HOR+0.013
rows+0.013
Py+0.013
Neg logit contributions
aturdays-0.012
forcer-0.011
oteric-0.011
usra-0.011
umatic-0.011
Tokenunc
Token ID19524
Feature activation-9.52e-02

Pos logit contributions
aturdays+0.022
forcer+0.021
oteric+0.021
usra+0.021
umatic+0.020
Neg logit contributions
iped-0.024
pel-0.021
 HOR-0.020
rows-0.020
Py-0.020
Tokenertain
Token ID1425
Feature activation+8.48e-02

Pos logit contributions
aturdays+0.057
forcer+0.054
usra+0.053
oteric+0.053
umatic+0.052
Neg logit contributions
iped-0.078
pel-0.069
 HOR-0.065
rows-0.065
Py-0.064
Tokenty
Token ID774
Feature activation+3.18e-01

Pos logit contributions
iped+0.043
pel+0.034
 HOR+0.032
rows+0.031
Py+0.031
Neg logit contributions
aturdays-0.077
forcer-0.075
usra-0.074
oteric-0.074
umatic-0.073
Token
Token ID447
Feature activation+3.40e-02

Pos logit contributions
iped+0.218
pel+0.187
 HOR+0.176
rows+0.174
Py+0.173
Neg logit contributions
aturdays-0.231
forcer-0.224
usra-0.221
oteric-0.220
umatic-0.218
Token
Token ID247
Feature activation+5.35e-03

Pos logit contributions
iped+0.025
pel+0.022
rows+0.021
 HOR+0.021
Py+0.020
Neg logit contributions
aturdays-0.023
forcer-0.022
oteric-0.022
usra-0.022
umatic-0.021
Token of
Token ID286
Feature activation-6.78e-03

Pos logit contributions
iped+0.004
pel+0.003
 HOR+0.003
rows+0.003
Py+0.003
Neg logit contributions
aturdays-0.004
forcer-0.004
usra-0.004
oteric-0.004
umatic-0.004
Token change
Token ID1487
Feature activation-1.74e-02

Pos logit contributions
aturdays+0.005
forcer+0.005
usra+0.005
oteric+0.005
umatic+0.005
Neg logit contributions
iped-0.005
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
Token.
Token ID13
Feature activation-7.23e-03

Pos logit contributions
aturdays+0.013
forcer+0.012
usra+0.012
oteric+0.012
umatic+0.012
Neg logit contributions
iped-0.012
pel-0.010
 HOR-0.010
rows-0.009
Py-0.009
Token your
Token ID534
Feature activation-4.66e-02

Pos logit contributions
aturdays+0.061
forcer+0.059
usra+0.058
oteric+0.058
umatic+0.057
Neg logit contributions
iped-0.060
pel-0.052
 HOR-0.049
rows-0.048
Py-0.048
Token email
Token ID3053
Feature activation-1.68e-01

Pos logit contributions
aturdays+0.028
forcer+0.027
oteric+0.026
usra+0.026
umatic+0.026
Neg logit contributions
iped-0.038
pel-0.033
 HOR-0.032
rows-0.032
Py-0.031
Token,
Token ID11
Feature activation+1.60e-02

Pos logit contributions
aturdays+0.124
forcer+0.120
usra+0.118
oteric+0.118
umatic+0.116
Neg logit contributions
iped-0.114
pel-0.098
 HOR-0.091
rows-0.091
Py-0.090
Token they
Token ID484
Feature activation+2.15e-03

Pos logit contributions
iped+0.012
pel+0.010
 HOR+0.010
rows+0.009
Py+0.009
Neg logit contributions
aturdays-0.011
forcer-0.011
usra-0.010
oteric-0.010
umatic-0.010
Token can
Token ID460
Feature activation+2.31e-02

Pos logit contributions
iped+0.002
pel+0.001
 HOR+0.001
rows+0.001
Py+0.001
Neg logit contributions
aturdays-0.001
forcer-0.001
usra-0.001
oteric-0.001
umatic-0.001
Token root
Token ID6808
Feature activation+3.34e-01

Pos logit contributions
iped+0.017
pel+0.015
 HOR+0.014
rows+0.014
Py+0.014
Neg logit contributions
aturdays-0.015
forcer-0.015
usra-0.015
oteric-0.015
umatic-0.014
Token through
Token ID832
Feature activation-3.93e-04

Pos logit contributions
iped+0.225
pel+0.193
 HOR+0.181
rows+0.180
Py+0.178
Neg logit contributions
aturdays-0.246
forcer-0.238
usra-0.235
oteric-0.235
umatic-0.231
Token your
Token ID534
Feature activation-6.40e-02

Pos logit contributions
aturdays+0.000
forcer+0.000
usra+0.000
oteric+0.000
umatic+0.000
Neg logit contributions
iped-0.000
pel-0.000
 HOR-0.000
rows-0.000
Py-0.000
Token bank
Token ID3331
Feature activation-3.71e-02

Pos logit contributions
aturdays+0.042
forcer+0.041
oteric+0.040
usra+0.040
umatic+0.040
Neg logit contributions
iped-0.048
pel-0.042
 HOR-0.040
rows-0.040
Py-0.039
Token records
Token ID4406
Feature activation-6.13e-02

Pos logit contributions
aturdays+0.025
forcer+0.024
oteric+0.024
usra+0.024
umatic+0.024
Neg logit contributions
iped-0.027
pel-0.024
 HOR-0.022
rows-0.022
Py-0.022
Token,
Token ID11
Feature activation+2.05e-02

Pos logit contributions
aturdays+0.046
forcer+0.044
usra+0.044
oteric+0.044
umatic+0.043
Neg logit contributions
iped-0.040
pel-0.035
 HOR-0.032
rows-0.032
Py-0.032
Token.
Token ID13
Feature activation-3.45e-03

Pos logit contributions
aturdays+0.096
forcer+0.093
oteric+0.091
usra+0.091
umatic+0.091
Neg logit contributions
iped-0.088
pel-0.075
 HOR-0.071
rows-0.070
Py-0.070
Token And
Token ID843
Feature activation+1.70e-02

Pos logit contributions
aturdays+0.002
forcer+0.002
usra+0.002
oteric+0.002
orean+0.002
Neg logit contributions
iped-0.002
pel-0.002
rows-0.002
 HOR-0.002
Py-0.002
Token the
Token ID262
Feature activation-1.07e-02

Pos logit contributions
iped+0.011
pel+0.009
 HOR+0.009
rows+0.008
Py+0.008
Neg logit contributions
aturdays-0.013
forcer-0.013
usra-0.013
oteric-0.013
umatic-0.012
Token big
Token ID1263
Feature activation+1.26e-02

Pos logit contributions
aturdays+0.008
forcer+0.007
usra+0.007
oteric+0.007
umatic+0.007
Neg logit contributions
iped-0.007
pel-0.006
 HOR-0.006
rows-0.006
Py-0.006
Token men
Token ID1450
Feature activation+1.62e-01

Pos logit contributions
iped+0.009
pel+0.007
 HOR+0.007
rows+0.007
Py+0.007
Neg logit contributions
aturdays-0.009
forcer-0.009
usra-0.009
oteric-0.009
umatic-0.009
Token fly
Token ID6129
Feature activation+2.36e-01

Pos logit contributions
iped+0.116
pel+0.100
 HOR+0.094
rows+0.094
Py+0.093
Neg logit contributions
aturdays-0.113
forcer-0.109
usra-0.107
oteric-0.107
umatic-0.105
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
iped+0.160
pel+0.137
 HOR+0.128
rows+0.128
Py+0.126
Neg logit contributions
aturdays-0.173
forcer-0.167
usra-0.166
oteric-0.165
umatic-0.163
Token And
Token ID843
Feature activation+1.72e-02

Pos logit contributions
aturdays+0.012
forcer+0.012
usra+0.011
oteric+0.011
umatic+0.011
Neg logit contributions
iped-0.012
pel-0.011
 HOR-0.010
rows-0.010
Py-0.010
Token the
Token ID262
Feature activation-3.68e-04

Pos logit contributions
iped+0.011
pel+0.010
 HOR+0.009
rows+0.009
Py+0.009
Neg logit contributions
aturdays-0.013
forcer-0.012
usra-0.012
oteric-0.012
umatic-0.012
Token players
Token ID1938
Feature activation+9.46e-02

Pos logit contributions
aturdays+0.000
forcer+0.000
usra+0.000
oteric+0.000
umatic+0.000
Neg logit contributions
iped-0.000
pel-0.000
 HOR-0.000
rows-0.000
Py-0.000
Token go
Token ID467
Feature activation+2.32e-02

Pos logit contributions
iped+0.066
pel+0.057
 HOR+0.054
rows+0.054
Py+0.053
Neg logit contributions
aturdays-0.067
forcer-0.065
usra-0.064
oteric-0.064
umatic-0.063
Token All
Token ID1439
Feature activation-6.38e-02

Pos logit contributions
iped+0.009
pel+0.008
rows+0.007
 HOR+0.007
Py+0.007
Neg logit contributions
aturdays-0.009
forcer-0.008
oteric-0.008
usra-0.008
orean-0.008
Token the
Token ID262
Feature activation+1.45e-02

Pos logit contributions
aturdays+0.046
forcer+0.044
usra+0.043
oteric+0.043
umatic+0.043
Neg logit contributions
iped-0.045
pel-0.039
 HOR-0.036
rows-0.036
Py-0.036
Token an
Token ID281
Feature activation-3.64e-03

Pos logit contributions
iped+0.010
pel+0.009
 HOR+0.008
rows+0.008
Py+0.008
Neg logit contributions
aturdays-0.010
forcer-0.010
usra-0.010
oteric-0.010
umatic-0.010
Tokenach
Token ID620
Feature activation-5.41e-03

Pos logit contributions
aturdays+0.002
forcer+0.002
usra+0.002
oteric+0.002
umatic+0.002
Neg logit contributions
iped-0.003
pel-0.003
 HOR-0.002
rows-0.002
Py-0.002
Tokenron
Token ID1313
Feature activation-4.03e-02

Pos logit contributions
aturdays+0.003
forcer+0.003
oteric+0.003
usra+0.003
orean+0.003
Neg logit contributions
iped-0.004
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
Tokenistic
Token ID2569
Feature activation+3.08e-01

Pos logit contributions
aturdays+0.029
forcer+0.028
usra+0.028
oteric+0.028
umatic+0.028
Neg logit contributions
iped-0.028
pel-0.024
 HOR-0.022
rows-0.022
Py-0.022
Token elements
Token ID4847
Feature activation-8.63e-02

Pos logit contributions
iped+0.214
pel+0.184
 HOR+0.173
rows+0.172
Py+0.170
Neg logit contributions
aturdays-0.221
forcer-0.214
usra-0.212
oteric-0.211
umatic-0.208
Token,
Token ID11
Feature activation-8.45e-03

Pos logit contributions
aturdays+0.059
forcer+0.057
usra+0.056
oteric+0.056
umatic+0.055
Neg logit contributions
iped-0.063
pel-0.054
 HOR-0.051
rows-0.051
Py-0.051
Token the
Token ID262
Feature activation-6.92e-03

Pos logit contributions
aturdays+0.006
forcer+0.006
usra+0.006
oteric+0.006
orean+0.006
Neg logit contributions
iped-0.006
pel-0.005
 HOR-0.005
rows-0.005
Py-0.005
Token d
Token ID288
Feature activation-1.19e-02

Pos logit contributions
aturdays+0.005
forcer+0.005
usra+0.005
oteric+0.005
umatic+0.005
Neg logit contributions
iped-0.005
pel-0.004
 HOR-0.004
rows-0.004
Py-0.004
Tokenour
Token ID454
Feature activation-1.32e-01

Pos logit contributions
aturdays+0.007
forcer+0.007
usra+0.007
oteric+0.007
umatic+0.006
Neg logit contributions
iped-0.010
pel-0.009
 HOR-0.008
rows-0.008
Py-0.008
Token penalties
Token ID12970
Feature activation+1.08e-01

Pos logit contributions
iped+0.017
pel+0.014
 HOR+0.013
rows+0.013
Py+0.013
Neg logit contributions
aturdays-0.018
forcer-0.017
usra-0.017
oteric-0.017
orean-0.017
Token for
Token ID329
Feature activation+3.60e-02

Pos logit contributions
iped+0.074
pel+0.064
 HOR+0.060
rows+0.060
Py+0.059
Neg logit contributions
aturdays-0.078
forcer-0.075
oteric-0.074
usra-0.074
umatic-0.073
Token those
Token ID883
Feature activation-2.51e-02

Pos logit contributions
iped+0.025
pel+0.021
 HOR+0.020
rows+0.020
Py+0.020
Neg logit contributions
aturdays-0.026
forcer-0.025
usra-0.025
oteric-0.025
umatic-0.025
Token who
Token ID508
Feature activation+1.01e-02

Pos logit contributions
aturdays+0.017
forcer+0.016
usra+0.016
oteric+0.016
umatic+0.016
Neg logit contributions
iped-0.019
pel-0.016
 HOR-0.015
rows-0.015
Py-0.015
Token "
Token ID366
Feature activation-5.97e-02

Pos logit contributions
iped+0.008
pel+0.007
 HOR+0.006
rows+0.006
Py+0.006
Neg logit contributions
aturdays-0.007
forcer-0.006
usra-0.006
oteric-0.006
umatic-0.006
Tokendef
Token ID4299
Feature activation+2.55e-01

Pos logit contributions
aturdays+0.038
forcer+0.037
usra+0.036
oteric+0.036
umatic+0.035
Neg logit contributions
iped-0.046
pel-0.040
 HOR-0.038
rows-0.038
Py-0.038
Tokenraud
Token ID22863
Feature activation+5.61e-02

Pos logit contributions
iped+0.196
pel+0.172
 HOR+0.163
rows+0.162
Py+0.161
Neg logit contributions
aturdays-0.163
forcer-0.157
usra-0.155
oteric-0.155
umatic-0.152
Token or
Token ID393
Feature activation+1.80e-02

Pos logit contributions
iped+0.039
pel+0.033
 HOR+0.031
rows+0.031
Py+0.031
Neg logit contributions
aturdays-0.040
forcer-0.039
usra-0.038
oteric-0.038
umatic-0.038
Token abuse
Token ID5076
Feature activation-4.71e-02

Pos logit contributions
iped+0.013
pel+0.011
 HOR+0.011
rows+0.011
Py+0.011
Neg logit contributions
aturdays-0.012
forcer-0.012
usra-0.012
oteric-0.012
orean-0.012
Token federal
Token ID2717
Feature activation-1.13e-02

Pos logit contributions
aturdays+0.033
forcer+0.032
usra+0.032
oteric+0.032
umatic+0.031
Neg logit contributions
iped-0.033
pel-0.029
 HOR-0.027
rows-0.027
Py-0.027
Token programs
Token ID4056
Feature activation+1.04e-03

Pos logit contributions
aturdays+0.008
forcer+0.008
usra+0.008
oteric+0.008
umatic+0.007
Neg logit contributions
iped-0.008
pel-0.007
 HOR-0.007
rows-0.006
Py-0.006
Token keep
Token ID1394
Feature activation-5.13e-02

Pos logit contributions
aturdays+0.008
forcer+0.008
usra+0.008
oteric+0.008
umatic+0.007
Neg logit contributions
iped-0.007
pel-0.006
 HOR-0.006
rows-0.006
Py-0.006
Token reading
Token ID3555
Feature activation-5.55e-03

Pos logit contributions
aturdays+0.035
forcer+0.034
usra+0.033
oteric+0.033
umatic+0.033
Neg logit contributions
iped-0.037
pel-0.032
 HOR-0.031
rows-0.030
Py-0.030
Token,
Token ID11
Feature activation+7.75e-03

Pos logit contributions
aturdays+0.004
forcer+0.004
oteric+0.004
usra+0.004
umatic+0.004
Neg logit contributions
iped-0.004
pel-0.003
 HOR-0.003
rows-0.003
Py-0.003
Token c
Token ID269
Feature activation+6.86e-03

Pos logit contributions
iped+0.005
pel+0.005
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.006
forcer-0.005
usra-0.005
oteric-0.005
umatic-0.005
Tokenuz
Token ID10277
Feature activation+1.08e-01

Pos logit contributions
iped+0.005
pel+0.004
 HOR+0.004
Py+0.004
rows+0.004
Neg logit contributions
aturdays-0.005
forcer-0.004
usra-0.004
oteric-0.004
umatic-0.004
Token
Token ID447
Feature activation+2.70e-02

Pos logit contributions
iped+0.075
pel+0.064
 HOR+0.061
rows+0.060
Py+0.060
Neg logit contributions
aturdays-0.077
forcer-0.074
usra-0.073
oteric-0.073
umatic-0.072
Token
Token ID247
Feature activation+1.79e-02

Pos logit contributions
iped+0.018
pel+0.016
 HOR+0.015
rows+0.015
Py+0.015
Neg logit contributions
aturdays-0.020
forcer-0.019
usra-0.019
oteric-0.019
umatic-0.019
Token you
Token ID345
Feature activation+1.30e-02

Pos logit contributions
iped+0.013
pel+0.012
 HOR+0.011
rows+0.011
Py+0.011
Neg logit contributions
aturdays-0.012
forcer-0.012
oteric-0.011
usra-0.011
umatic-0.011
Token too
Token ID1165
Feature activation+4.20e-02

Pos logit contributions
iped+0.010
pel+0.009
 HOR+0.008
rows+0.008
Py+0.008
Neg logit contributions
aturdays-0.008
forcer-0.008
usra-0.008
oteric-0.008
umatic-0.008
Token,
Token ID11
Feature activation+1.50e-02

Pos logit contributions
iped+0.033
pel+0.029
 HOR+0.027
rows+0.027
Py+0.027
Neg logit contributions
aturdays-0.026
forcer-0.025
usra-0.025
oteric-0.025
umatic-0.025
Token can
Token ID460
Feature activation+4.82e-03

Pos logit contributions
iped+0.012
pel+0.010
 HOR+0.010
rows+0.010
Py+0.009
Neg logit contributions
aturdays-0.010
forcer-0.009
usra-0.009
oteric-0.009
umatic-0.009
Tokenfired
Token ID26803
Feature activation-3.31e-02

Pos logit contributions
iped+0.066
pel+0.058
 HOR+0.056
rows+0.055
Py+0.054
Neg logit contributions
aturdays-0.049
forcer-0.047
usra-0.047
oteric-0.046
umatic-0.045
Token.
Token ID13
Feature activation+4.62e-04

Pos logit contributions
aturdays+0.024
forcer+0.024
oteric+0.023
usra+0.023
umatic+0.023
Neg logit contributions
iped-0.022
pel-0.019
 HOR-0.018
rows-0.018
Py-0.018
Token The
Token ID383
Feature activation-2.11e-02

Pos logit contributions
iped+0.000
pel+0.000
rows+0.000
 HOR+0.000
Py+0.000
Neg logit contributions
aturdays-0.000
forcer-0.000
usra-0.000
oteric-0.000
orean-0.000
Token city
Token ID1748
Feature activation+8.02e-02

Pos logit contributions
aturdays+0.015
forcer+0.015
usra+0.015
oteric+0.015
umatic+0.015
Neg logit contributions
iped-0.014
pel-0.012
 HOR-0.011
rows-0.011
Py-0.011
Token funded
Token ID10588
Feature activation-1.36e-01

Pos logit contributions
iped+0.059
pel+0.051
 HOR+0.048
rows+0.048
Py+0.047
Neg logit contributions
aturdays-0.054
forcer-0.052
usra-0.052
oteric-0.051
umatic-0.051
Token 10
Token ID838
Feature activation+4.41e-02

Pos logit contributions
aturdays+0.094
forcer+0.091
usra+0.090
oteric+0.090
umatic+0.088
Neg logit contributions
iped-0.098
pel-0.085
rows-0.079
 HOR-0.079
Py-0.079
Token,
Token ID11
Feature activation-1.75e-02

Pos logit contributions
iped+0.030
pel+0.026
 HOR+0.024
rows+0.024
Py+0.024
Neg logit contributions
aturdays-0.033
forcer-0.031
usra-0.031
oteric-0.031
umatic-0.030
Token000
Token ID830
Feature activation-7.33e-02

Pos logit contributions
aturdays+0.011
forcer+0.010
usra+0.010
oteric+0.010
umatic+0.010
Neg logit contributions
iped-0.014
pel-0.012
 HOR-0.012
rows-0.012
Py-0.012
Token new
Token ID649
Feature activation-5.86e-03

Pos logit contributions
aturdays+0.055
forcer+0.053
usra+0.053
oteric+0.053
umatic+0.052
Neg logit contributions
iped-0.048
pel-0.041
rows-0.038
 HOR-0.038
Py-0.038
Token leases
Token ID38522
Feature activation-6.78e-02

Pos logit contributions
aturdays+0.005
forcer+0.005
usra+0.005
oteric+0.005
umatic+0.004
Neg logit contributions
iped-0.004
pel-0.003
rows-0.003
 HOR-0.003
Py-0.003
Token for
Token ID329
Feature activation-1.65e-02

Pos logit contributions
aturdays+0.050
forcer+0.048
oteric+0.047
usra+0.047
umatic+0.047
Neg logit contributions
iped-0.046
pel-0.039
 HOR-0.037
rows-0.037
Py-0.036
Token
Token ID198
Feature activation-3.30e-03

Pos logit contributions
aturdays+0.006
forcer+0.005
usra+0.005
oteric+0.005
orean+0.005
Neg logit contributions
iped-0.006
pel-0.005
 HOR-0.005
rows-0.005
 squid-0.005
TokenThat
Token ID2504
Feature activation-3.83e-03

Pos logit contributions
aturdays+0.002
forcer+0.002
usra+0.002
oteric+0.002
hal+0.002
Neg logit contributions
iped-0.003
pel-0.002
 HOR-0.002
 squid-0.002
rows-0.002
Token
Token ID447
Feature activation-2.73e-03

Pos logit contributions
aturdays+0.003
forcer+0.002
usra+0.002
oteric+0.002
umatic+0.002
Neg logit contributions
iped-0.003
pel-0.003
rows-0.002
 HOR-0.002
Py-0.002
Token
Token ID247
Feature activation-1.09e-02

Pos logit contributions
aturdays+0.002
forcer+0.002
oteric+0.002
usra+0.002
umatic+0.002
Neg logit contributions
iped-0.002
pel-0.002
 HOR-0.001
rows-0.001
Py-0.001
Tokens
Token ID82
Feature activation-1.52e-02

Pos logit contributions
aturdays+0.007
forcer+0.007
usra+0.007
oteric+0.007
umatic+0.007
Neg logit contributions
iped-0.008
pel-0.007
 HOR-0.007
rows-0.007
Py-0.007
Token a
Token ID257
Feature activation-2.05e-02

Pos logit contributions
aturdays+0.011
forcer+0.010
usra+0.010
oteric+0.010
umatic+0.010
Neg logit contributions
iped-0.011
pel-0.009
rows-0.009
 HOR-0.009
Py-0.008
Token major
Token ID1688
Feature activation+4.92e-02

Pos logit contributions
aturdays+0.015
forcer+0.015
usra+0.015
oteric+0.015
umatic+0.014
Neg logit contributions
iped-0.014
pel-0.012
rows-0.011
 HOR-0.011
Py-0.011
Token systemic
Token ID21971
Feature activation+9.77e-02

Pos logit contributions
iped+0.033
pel+0.028
rows+0.026
 HOR+0.026
Py+0.026
Neg logit contributions
aturdays-0.037
forcer-0.036
usra-0.035
oteric-0.035
umatic-0.035
Token weakness
Token ID10453
Feature activation-3.04e-02

Pos logit contributions
iped+0.061
pel+0.052
rows+0.048
 HOR+0.048
Py+0.047
Neg logit contributions
aturdays-0.076
forcer-0.074
usra-0.073
oteric-0.072
hal-0.072
Token that
Token ID326
Feature activation+2.37e-03

Pos logit contributions
aturdays+0.022
forcer+0.021
usra+0.021
oteric+0.021
umatic+0.020
Neg logit contributions
iped-0.021
pel-0.018
 HOR-0.017
rows-0.017
Py-0.017
Token
Token ID447
Feature activation-1.01e-02

Pos logit contributions
iped+0.002
pel+0.002
 HOR+0.001
rows+0.001
Py+0.001
Neg logit contributions
aturdays-0.002
forcer-0.001
usra-0.001
oteric-0.001
umatic-0.001
Token surveyed
Token ID20840
Feature activation+1.23e-01

Pos logit contributions
aturdays+0.035
forcer+0.034
usra+0.034
oteric+0.034
umatic+0.033
Neg logit contributions
iped-0.033
pel-0.028
 HOR-0.026
rows-0.026
Py-0.026
Token in
Token ID287
Feature activation-3.87e-02

Pos logit contributions
iped+0.085
pel+0.072
 HOR+0.068
rows+0.068
Py+0.067
Neg logit contributions
aturdays-0.089
forcer-0.086
usra-0.085
oteric-0.085
umatic-0.084
Token Ohio
Token ID6835
Feature activation-6.00e-02

Pos logit contributions
aturdays+0.021
forcer+0.021
usra+0.020
oteric+0.020
umatic+0.020
Neg logit contributions
iped-0.033
pel-0.029
 HOR-0.028
rows-0.028
Py-0.027
Token and
Token ID290
Feature activation-4.94e-03

Pos logit contributions
aturdays+0.043
forcer+0.041
usra+0.041
oteric+0.041
umatic+0.040
Neg logit contributions
iped-0.042
pel-0.036
 HOR-0.034
rows-0.034
Py-0.033
Token 59
Token ID7863
Feature activation+7.61e-02

Pos logit contributions
aturdays+0.003
forcer+0.003
usra+0.003
oteric+0.003
umatic+0.003
Neg logit contributions
iped-0.004
pel-0.003
 HOR-0.003
rows-0.003
Py-0.003
Token percent
Token ID1411
Feature activation-1.85e-02

Pos logit contributions
iped+0.053
pel+0.045
 HOR+0.043
rows+0.042
Py+0.042
Neg logit contributions
aturdays-0.054
forcer-0.053
usra-0.052
oteric-0.052
orean-0.051
Token of
Token ID286
Feature activation-6.91e-02

Pos logit contributions
aturdays+0.014
forcer+0.013
usra+0.013
oteric+0.013
umatic+0.013
Neg logit contributions
iped-0.013
pel-0.011
 HOR-0.010
rows-0.010
Py-0.010
Token voters
Token ID4446
Feature activation+9.26e-02

Pos logit contributions
aturdays+0.047
forcer+0.045
usra+0.045
oteric+0.045
umatic+0.044
Neg logit contributions
iped-0.050
pel-0.044
 HOR-0.041
rows-0.041
Py-0.041
Token in
Token ID287
Feature activation-4.81e-02

Pos logit contributions
iped+0.064
pel+0.055
 HOR+0.052
rows+0.051
Py+0.051
Neg logit contributions
aturdays-0.066
forcer-0.064
usra-0.063
oteric-0.063
umatic-0.062
Token Wisconsin
Token ID9279
Feature activation-3.01e-02

Pos logit contributions
aturdays+0.027
forcer+0.026
usra+0.025
oteric+0.025
umatic+0.025
Neg logit contributions
iped-0.041
pel-0.036
 HOR-0.035
rows-0.034
Py-0.034
Token say
Token ID910
Feature activation+3.72e-02

Pos logit contributions
aturdays+0.022
forcer+0.021
usra+0.021
oteric+0.021
umatic+0.020
Neg logit contributions
iped-0.021
pel-0.018
 HOR-0.017
rows-0.016
Py-0.016
Token6
Token ID21
Feature activation+3.52e-03

Pos logit contributions
aturdays+0.017
forcer+0.016
usra+0.016
oteric+0.016
umatic+0.016
Neg logit contributions
iped-0.021
pel-0.019
 HOR-0.018
rows-0.018
Py-0.017
Token {
Token ID1391
Feature activation-2.05e-01

Pos logit contributions
iped+0.003
pel+0.003
 HOR+0.003
rows+0.003
Py+0.003
Neg logit contributions
aturdays-0.002
forcer-0.002
oteric-0.002
usra-0.002
umatic-0.002
Token(
Token ID7
Feature activation-7.78e-02

Pos logit contributions
aturdays+0.130
forcer+0.126
oteric+0.124
usra+0.124
orean+0.122
Neg logit contributions
iped-0.158
pel-0.138
 HOR-0.131
rows-0.130
 squid-0.128
Token0
Token ID15
Feature activation-5.51e-02

Pos logit contributions
aturdays+0.048
forcer+0.046
usra+0.045
oteric+0.045
umatic+0.045
Neg logit contributions
iped-0.062
pel-0.054
 HOR-0.052
rows-0.051
Py-0.051
Tokens
Token ID82
Feature activation-3.46e-02

Pos logit contributions
aturdays+0.036
forcer+0.035
usra+0.034
oteric+0.034
orean+0.033
Neg logit contributions
iped-0.042
pel-0.036
 HOR-0.035
rows-0.034
Py-0.034
Token)}
Token ID38165
Feature activation+5.80e-02

Pos logit contributions
aturdays+0.026
forcer+0.025
usra+0.025
oteric+0.025
umatic+0.024
Neg logit contributions
iped-0.023
pel-0.020
 HOR-0.018
rows-0.018
Py-0.018
Token 3
Token ID513
Feature activation-1.13e-04

Pos logit contributions
iped+0.042
pel+0.036
 HOR+0.034
rows+0.034
Py+0.034
Neg logit contributions
aturdays-0.040
forcer-0.038
usra-0.038
oteric-0.038
umatic-0.037
Token.
Token ID13
Feature activation+6.95e-03

Pos logit contributions
aturdays+0.000
forcer+0.000
usra+0.000
oteric+0.000
orean+0.000
Neg logit contributions
iped-0.000
pel-0.000
 HOR-0.000
rows-0.000
 squid-0.000
Token g
Token ID308
Feature activation-1.66e-03

Pos logit contributions
iped+0.005
pel+0.004
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.005
forcer-0.004
usra-0.004
oteric-0.004
umatic-0.004
Token3
Token ID18
Feature activation+2.69e-02

Pos logit contributions
aturdays+0.001
forcer+0.001
usra+0.001
oteric+0.001
umatic+0.001
Neg logit contributions
iped-0.001
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Token {
Token ID1391
Feature activation-2.27e-01

Pos logit contributions
iped+0.024
pel+0.021
 HOR+0.020
rows+0.020
Py+0.020
Neg logit contributions
aturdays-0.014
forcer-0.014
usra-0.013
oteric-0.013
umatic-0.013
Token until
Token ID1566
Feature activation+1.15e-01

Pos logit contributions
iped+0.042
pel+0.036
 HOR+0.034
rows+0.033
Py+0.033
Neg logit contributions
aturdays-0.045
forcer-0.044
oteric-0.043
usra-0.043
umatic-0.043
Token the
Token ID262
Feature activation+3.67e-02

Pos logit contributions
iped+0.080
pel+0.069
 HOR+0.065
rows+0.065
Py+0.064
Neg logit contributions
aturdays-0.082
forcer-0.079
usra-0.078
oteric-0.078
umatic-0.077
Token accumulated
Token ID22425
Feature activation+6.79e-02

Pos logit contributions
iped+0.026
pel+0.022
 HOR+0.021
rows+0.021
Py+0.021
Neg logit contributions
aturdays-0.026
forcer-0.025
usra-0.025
oteric-0.025
umatic-0.024
Token debt
Token ID5057
Feature activation-1.63e-02

Pos logit contributions
iped+0.049
pel+0.043
 HOR+0.040
rows+0.040
Py+0.040
Neg logit contributions
aturdays-0.046
forcer-0.045
usra-0.044
oteric-0.044
umatic-0.043
Token becomes
Token ID4329
Feature activation-1.45e-01

Pos logit contributions
aturdays+0.010
forcer+0.010
usra+0.010
oteric+0.010
umatic+0.010
Neg logit contributions
iped-0.013
pel-0.011
 HOR-0.010
rows-0.010
Py-0.010
Token unsustainable
Token ID39554
Feature activation-3.80e-02

Pos logit contributions
aturdays+0.108
forcer+0.106
oteric+0.104
usra+0.104
umatic+0.103
Neg logit contributions
iped-0.095
pel-0.082
rows-0.076
 HOR-0.076
Py-0.075
Token and
Token ID290
Feature activation+3.76e-02

Pos logit contributions
aturdays+0.028
forcer+0.027
oteric+0.027
usra+0.027
umatic+0.026
Neg logit contributions
iped-0.026
pel-0.022
 HOR-0.021
rows-0.020
Py-0.020
Token the
Token ID262
Feature activation+2.89e-02

Pos logit contributions
iped+0.027
pel+0.023
 HOR+0.022
rows+0.022
Py+0.021
Neg logit contributions
aturdays-0.027
forcer-0.026
usra-0.025
oteric-0.025
umatic-0.025
Token government
Token ID1230
Feature activation+2.11e-02

Pos logit contributions
iped+0.020
pel+0.017
 HOR+0.016
rows+0.016
Py+0.016
Neg logit contributions
aturdays-0.021
forcer-0.020
usra-0.020
oteric-0.020
umatic-0.020
Token
Token ID447
Feature activation+1.75e-02

Pos logit contributions
iped+0.017
pel+0.015
 HOR+0.014
rows+0.014
Py+0.014
Neg logit contributions
aturdays-0.013
forcer-0.012
oteric-0.012
usra-0.012
umatic-0.012
Token
Token ID247
Feature activation+1.64e-02

Pos logit contributions
iped+0.012
pel+0.010
 HOR+0.009
rows+0.009
Py+0.009
Neg logit contributions
aturdays-0.013
forcer-0.013
usra-0.012
oteric-0.012
umatic-0.012
TokenToday
Token ID8888
Feature activation+8.33e-02

Pos logit contributions
iped+0.025
pel+0.021
 HOR+0.020
rows+0.020
 squid+0.020
Neg logit contributions
aturdays-0.021
forcer-0.020
usra-0.020
oteric-0.020
umatic-0.020
Token we
Token ID356
Feature activation-8.10e-04

Pos logit contributions
iped+0.059
pel+0.051
 HOR+0.048
rows+0.048
Py+0.047
Neg logit contributions
aturdays-0.058
forcer-0.056
usra-0.056
oteric-0.056
umatic-0.055
Token
Token ID447
Feature activation-1.62e-03

Pos logit contributions
aturdays+0.000
forcer+0.000
oteric+0.000
usra+0.000
umatic+0.000
Neg logit contributions
iped-0.001
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Token
Token ID247
Feature activation-2.80e-02

Pos logit contributions
aturdays+0.001
forcer+0.001
oteric+0.001
usra+0.001
umatic+0.001
Neg logit contributions
iped-0.001
pel-0.001
rows-0.001
 HOR-0.001
Py-0.001
Tokenre
Token ID260
Feature activation-1.12e-01

Pos logit contributions
aturdays+0.018
forcer+0.017
usra+0.017
oteric+0.017
umatic+0.017
Neg logit contributions
iped-0.021
pel-0.019
 HOR-0.018
rows-0.018
Py-0.018
Token excited
Token ID6568
Feature activation-1.13e-01

Pos logit contributions
aturdays+0.076
forcer+0.073
oteric+0.072
usra+0.072
umatic+0.071
Neg logit contributions
iped-0.082
pel-0.071
 HOR-0.067
rows-0.067
Py-0.066
Token to
Token ID284
Feature activation-6.18e-02

Pos logit contributions
aturdays+0.084
forcer+0.081
usra+0.080
oteric+0.080
umatic+0.079
Neg logit contributions
iped-0.075
pel-0.064
 HOR-0.060
rows-0.060
Py-0.059
Token announce
Token ID5453
Feature activation-5.85e-02

Pos logit contributions
aturdays+0.039
forcer+0.038
oteric+0.037
usra+0.037
umatic+0.037
Neg logit contributions
iped-0.048
pel-0.042
 HOR-0.040
rows-0.040
Py-0.039
Token Shop
Token ID13705
Feature activation-3.62e-02

Pos logit contributions
aturdays+0.042
forcer+0.041
oteric+0.040
usra+0.040
umatic+0.039
Neg logit contributions
iped-0.040
pel-0.035
 HOR-0.033
rows-0.033
Py-0.032
Tokenify
Token ID1958
Feature activation-4.61e-03

Pos logit contributions
aturdays+0.024
forcer+0.023
oteric+0.023
usra+0.023
umatic+0.022
Neg logit contributions
iped-0.027
pel-0.023
 HOR-0.022
rows-0.022
Py-0.022
Token POS
Token ID28069
Feature activation-1.77e-01

Pos logit contributions
aturdays+0.003
forcer+0.003
usra+0.003
oteric+0.003
umatic+0.003
Neg logit contributions
iped-0.003
pel-0.003
 HOR-0.003
rows-0.003
Py-0.003
Token {
Token ID1391
Feature activation-1.56e-01

Pos logit contributions
iped+0.050
pel+0.044
 HOR+0.042
rows+0.042
Py+0.042
Neg logit contributions
aturdays-0.028
forcer-0.027
oteric-0.027
usra-0.027
umatic-0.026
Token Write
Token ID19430
Feature activation-5.51e-02

Pos logit contributions
aturdays+0.120
forcer+0.116
usra+0.115
oteric+0.115
orean+0.113
Neg logit contributions
iped-0.100
pel-0.085
 HOR-0.079
rows-0.079
Py-0.078
Token-
Token ID12
Feature activation+3.12e-02

Pos logit contributions
aturdays+0.038
forcer+0.037
usra+0.036
oteric+0.036
umatic+0.036
Neg logit contributions
iped-0.040
pel-0.034
 HOR-0.032
rows-0.032
Py-0.032
TokenWarning
Token ID20361
Feature activation+1.83e-01

Pos logit contributions
iped+0.020
pel+0.017
 HOR+0.016
rows+0.015
Py+0.015
Neg logit contributions
aturdays-0.024
forcer-0.024
usra-0.023
oteric-0.023
umatic-0.023
Token -
Token ID532
Feature activation-7.24e-02

Pos logit contributions
iped+0.139
pel+0.121
 HOR+0.114
rows+0.114
Py+0.113
Neg logit contributions
aturdays-0.120
forcer-0.115
usra-0.114
oteric-0.114
umatic-0.111
TokenMessage
Token ID12837
Feature activation-2.83e-02

Pos logit contributions
aturdays+0.052
forcer+0.050
oteric+0.050
usra+0.049
umatic+0.049
Neg logit contributions
iped-0.050
pel-0.043
 HOR-0.041
rows-0.041
Py-0.040
Token "
Token ID366
Feature activation-2.22e-03

Pos logit contributions
aturdays+0.018
forcer+0.018
usra+0.017
oteric+0.017
umatic+0.017
Neg logit contributions
iped-0.022
pel-0.019
 HOR-0.018
rows-0.018
Py-0.018
TokenF
Token ID37
Feature activation-1.28e-02

Pos logit contributions
aturdays+0.002
forcer+0.001
usra+0.001
oteric+0.001
umatic+0.001
Neg logit contributions
iped-0.002
pel-0.001
 HOR-0.001
rows-0.001
Py-0.001
Tokenailed
Token ID6255
Feature activation-1.33e-01

Pos logit contributions
aturdays+0.007
forcer+0.006
usra+0.006
oteric+0.006
umatic+0.006
Neg logit contributions
iped-0.011
pel-0.010
 HOR-0.010
rows-0.010
Py-0.010
Token cannot
Token ID2314
Feature activation+4.22e-02

Pos logit contributions
aturdays+0.098
forcer+0.095
oteric+0.093
usra+0.093
orean+0.092
Neg logit contributions
iped-0.090
pel-0.077
rows-0.072
 HOR-0.072
Py-0.071
Token find
Token ID1064
Feature activation+4.16e-02

Pos logit contributions
iped+0.031
pel+0.027
 HOR+0.025
rows+0.025
Py+0.025
Neg logit contributions
aturdays-0.028
forcer-0.027
usra-0.027
oteric-0.027
orean-0.027
Token'
Token ID6
Feature activation+7.39e-02

Pos logit contributions
aturdays+0.025
forcer+0.024
oteric+0.024
usra+0.024
umatic+0.023
Neg logit contributions
iped-0.024
pel-0.021
 HOR-0.019
rows-0.019
Py-0.019
Token first
Token ID717
Feature activation+1.49e-02

Pos logit contributions
iped+0.051
pel+0.044
 HOR+0.042
rows+0.041
Py+0.041
Neg logit contributions
aturdays-0.053
forcer-0.051
usra-0.051
oteric-0.051
umatic-0.050
Token preseason
Token ID18913
Feature activation-6.49e-02

Pos logit contributions
iped+0.011
pel+0.010
 HOR+0.009
rows+0.009
Py+0.009
Neg logit contributions
aturdays-0.010
forcer-0.010
usra-0.010
oteric-0.010
umatic-0.009
Token game
Token ID983
Feature activation+2.79e-02

Pos logit contributions
aturdays+0.046
forcer+0.044
usra+0.043
oteric+0.043
umatic+0.043
Neg logit contributions
iped-0.046
pel-0.040
 HOR-0.038
rows-0.037
Py-0.037
Token,
Token ID11
Feature activation+2.03e-02

Pos logit contributions
iped+0.019
pel+0.017
 HOR+0.015
rows+0.015
Py+0.015
Neg logit contributions
aturdays-0.020
forcer-0.020
usra-0.019
oteric-0.019
umatic-0.019
Token against
Token ID1028
Feature activation-2.82e-02

Pos logit contributions
iped+0.015
pel+0.013
 HOR+0.013
rows+0.013
Py+0.012
Neg logit contributions
aturdays-0.013
forcer-0.013
usra-0.013
oteric-0.013
umatic-0.013
Token Washington
Token ID2669
Feature activation+2.03e-02

Pos logit contributions
aturdays+0.020
forcer+0.019
usra+0.019
oteric+0.019
orean+0.019
Neg logit contributions
iped-0.020
pel-0.017
rows-0.016
 HOR-0.016
Py-0.016
Token,
Token ID11
Feature activation+2.55e-04

Pos logit contributions
iped+0.013
pel+0.012
 HOR+0.011
rows+0.011
Py+0.011
Neg logit contributions
aturdays-0.015
forcer-0.015
usra-0.014
oteric-0.014
umatic-0.014
Token he
Token ID339
Feature activation+1.46e-02

Pos logit contributions
iped+0.000
pel+0.000
 HOR+0.000
rows+0.000
Py+0.000
Neg logit contributions
aturdays-0.000
forcer-0.000
usra-0.000
oteric-0.000
umatic-0.000
Token totaled
Token ID39398
Feature activation+9.42e-02

Pos logit contributions
iped+0.011
pel+0.010
 HOR+0.009
rows+0.009
Py+0.009
Neg logit contributions
aturdays-0.010
forcer-0.009
oteric-0.009
usra-0.009
umatic-0.009
Token seven
Token ID3598
Feature activation+3.81e-02

Pos logit contributions
iped+0.064
pel+0.055
 HOR+0.051
rows+0.051
Py+0.050
Neg logit contributions
aturdays-0.070
forcer-0.067
oteric-0.066
usra-0.066
umatic-0.065
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
iped+0.030
pel+0.026
 HOR+0.024
rows+0.024
Py+0.024
Neg logit contributions
aturdays-0.031
forcer-0.030
usra-0.030
oteric-0.030
umatic-0.029
Token
Token ID447
Feature activation+1.81e-04

Pos logit contributions
aturdays+0.008
forcer+0.008
usra+0.008
oteric+0.008
umatic+0.008
Neg logit contributions
iped-0.008
pel-0.007
 HOR-0.006
rows-0.006
Py-0.006
Token
Token ID251
Feature activation-2.74e-03

Pos logit contributions
iped+0.000
pel+0.000
 HOR+0.000
rows+0.000
Py+0.000
Neg logit contributions
aturdays-0.000
forcer-0.000
usra-0.000
oteric-0.000
umatic-0.000
Token M
Token ID337
Feature activation-7.18e-03

Pos logit contributions
aturdays+0.002
forcer+0.002
usra+0.002
oteric+0.002
umatic+0.002
Neg logit contributions
iped-0.002
pel-0.002
 HOR-0.002
rows-0.002
Py-0.002
Tokenimi
Token ID25236
Feature activation-3.01e-02

Pos logit contributions
aturdays+0.004
forcer+0.004
usra+0.004
oteric+0.004
orean+0.004
Neg logit contributions
iped-0.006
pel-0.006
 HOR-0.005
rows-0.005
Py-0.005
Token writes
Token ID6797
Feature activation-9.99e-02

Pos logit contributions
aturdays+0.021
forcer+0.021
usra+0.020
oteric+0.020
umatic+0.020
Neg logit contributions
iped-0.021
pel-0.018
 HOR-0.017
rows-0.017
Py-0.017
Token with
Token ID351
Feature activation-3.27e-02

Pos logit contributions
aturdays+0.074
forcer+0.071
usra+0.070
oteric+0.070
umatic+0.070
Neg logit contributions
iped-0.067
pel-0.058
 HOR-0.054
rows-0.054
Py-0.053
Token palpable
Token ID45061
Feature activation+4.10e-02

Pos logit contributions
aturdays+0.023
forcer+0.022
usra+0.022
oteric+0.022
umatic+0.021
Neg logit contributions
iped-0.023
pel-0.020
 HOR-0.019
rows-0.019
Py-0.019
Token delight
Token ID10974
Feature activation+1.83e-02

Pos logit contributions
iped+0.032
pel+0.028
 HOR+0.026
rows+0.026
Py+0.026
Neg logit contributions
aturdays-0.026
forcer-0.025
usra-0.025
oteric-0.025
umatic-0.024
Token.
Token ID13
Feature activation-6.23e-03

Pos logit contributions
iped+0.012
pel+0.010
 HOR+0.010
rows+0.010
Py+0.010
Neg logit contributions
aturdays-0.014
forcer-0.013
oteric-0.013
usra-0.013
umatic-0.013
Token
Token ID564
Feature activation+6.57e-05

Pos logit contributions
aturdays+0.004
forcer+0.004
usra+0.004
oteric+0.004
orean+0.004
Neg logit contributions
iped-0.004
pel-0.004
rows-0.004
 HOR-0.004
Py-0.004
Tokenat
Token ID265
Feature activation-6.62e-02

Pos logit contributions
aturdays+0.075
forcer+0.071
oteric+0.070
usra+0.070
umatic+0.068
Neg logit contributions
iped-0.139
pel-0.125
 HOR-0.119
rows-0.119
Py-0.118
Tokenollah
Token ID17999
Feature activation-4.82e-02

Pos logit contributions
aturdays+0.038
forcer+0.038
usra+0.037
oteric+0.037
orean+0.036
Neg logit contributions
iped-0.055
pel-0.048
 HOR-0.046
rows-0.045
Phil-0.045
Token R
Token ID371
Feature activation-2.78e-02

Pos logit contributions
aturdays+0.029
forcer+0.027
oteric+0.027
usra+0.027
umatic+0.027
Neg logit contributions
iped-0.039
pel-0.035
 HOR-0.033
rows-0.033
Py-0.033
Tokenuh
Token ID7456
Feature activation-7.73e-02

Pos logit contributions
aturdays+0.016
forcer+0.015
usra+0.015
oteric+0.015
umatic+0.015
Neg logit contributions
iped-0.023
pel-0.021
 HOR-0.020
rows-0.019
Py-0.019
Tokenollah
Token ID17999
Feature activation-6.58e-02

Pos logit contributions
aturdays+0.047
forcer+0.045
usra+0.045
oteric+0.044
umatic+0.044
Neg logit contributions
iped-0.062
pel-0.055
 HOR-0.052
rows-0.052
Phil-0.051
Token Kh
Token ID5311
Feature activation-3.03e-01

Pos logit contributions
aturdays+0.026
forcer+0.024
usra+0.023
oteric+0.023
umatic+0.023
Neg logit contributions
iped-0.067
pel-0.061
 HOR-0.059
rows-0.058
Py-0.058
Tokenome
Token ID462
Feature activation-2.73e-01

Pos logit contributions
aturdays+0.093
forcer+0.083
oteric+0.079
umatic+0.075
usra+0.074
Neg logit contributions
iped-0.337
pel-0.307
rows-0.297
Py-0.296
 HOR-0.294
Tokenini
Token ID5362
Feature activation+4.59e-03

Pos logit contributions
aturdays+0.180
forcer+0.174
usra+0.172
oteric+0.172
umatic+0.169
Neg logit contributions
iped-0.204
pel-0.178
 HOR-0.168
rows-0.168
Phil-0.165
Token's
Token ID338
Feature activation-1.31e-02

Pos logit contributions
iped+0.003
pel+0.003
 HOR+0.003
rows+0.003
Py+0.003
Neg logit contributions
aturdays-0.003
forcer-0.003
usra-0.003
oteric-0.003
umatic-0.003
Token radical
Token ID7702
Feature activation+1.32e-01

Pos logit contributions
aturdays+0.009
forcer+0.009
usra+0.009
oteric+0.009
umatic+0.008
Neg logit contributions
iped-0.010
pel-0.008
 HOR-0.008
rows-0.008
Py-0.008
Token Islamists
Token ID34434
Feature activation+5.85e-02

Pos logit contributions
iped+0.094
pel+0.082
 HOR+0.077
rows+0.077
Py+0.076
Neg logit contributions
aturdays-0.091
forcer-0.088
oteric-0.087
usra-0.087
umatic-0.086
Token the
Token ID262
Feature activation-4.32e-02

Pos logit contributions
iped+0.069
pel+0.059
 HOR+0.055
rows+0.055
Py+0.055
Neg logit contributions
aturdays-0.070
forcer-0.067
oteric-0.067
usra-0.067
umatic-0.066
Token 64
Token ID5598
Feature activation-1.41e-01

Pos logit contributions
aturdays+0.031
forcer+0.030
usra+0.030
oteric+0.030
umatic+0.029
Neg logit contributions
iped-0.030
pel-0.025
 HOR-0.024
rows-0.024
Py-0.023
Token-
Token ID12
Feature activation-4.60e-02

Pos logit contributions
aturdays+0.100
forcer+0.097
usra+0.095
oteric+0.095
umatic+0.094
Neg logit contributions
iped-0.099
pel-0.085
 HOR-0.080
rows-0.080
Py-0.079
Tokenbit
Token ID2545
Feature activation-8.88e-02

Pos logit contributions
aturdays+0.026
forcer+0.025
usra+0.025
oteric+0.025
umatic+0.024
Neg logit contributions
iped-0.039
pel-0.034
 HOR-0.032
rows-0.032
Py-0.032
Token Hy
Token ID6707
Feature activation+1.52e-01

Pos logit contributions
aturdays+0.062
forcer+0.060
usra+0.059
oteric+0.059
umatic+0.058
Neg logit contributions
iped-0.064
pel-0.055
 HOR-0.052
rows-0.052
Py-0.051
Tokenrule
Token ID25135
Feature activation-2.96e-01

Pos logit contributions
iped+0.161
pel+0.147
rows+0.141
 HOR+0.141
Py+0.140
Neg logit contributions
aturdays-0.054
forcer-0.050
oteric-0.049
usra-0.048
umatic-0.047
Token Fi
Token ID23238
Feature activation+2.29e-01

Pos logit contributions
aturdays+0.216
forcer+0.209
usra+0.206
oteric+0.206
umatic+0.204
Neg logit contributions
iped-0.202
pel-0.173
 HOR-0.162
rows-0.161
Py-0.159
Tokenend
Token ID437
Feature activation+8.02e-02

Pos logit contributions
iped+0.143
pel+0.122
 HOR+0.114
rows+0.112
Py+0.111
Neg logit contributions
aturdays-0.179
forcer-0.173
usra-0.171
oteric-0.171
orean-0.169
Token,
Token ID11
Feature activation-4.30e-02

Pos logit contributions
iped+0.057
pel+0.050
 HOR+0.047
rows+0.046
Py+0.046
Neg logit contributions
aturdays-0.056
forcer-0.054
usra-0.053
oteric-0.053
umatic-0.052
Token O
Token ID440
Feature activation+1.01e-01

Pos logit contributions
aturdays+0.031
forcer+0.030
usra+0.029
oteric+0.029
umatic+0.029
Neg logit contributions
iped-0.030
pel-0.026
 HOR-0.024
rows-0.024
Py-0.024
Tokencar
Token ID7718
Feature activation+7.16e-02

Pos logit contributions
iped+0.063
pel+0.054
 HOR+0.050
rows+0.050
Py+0.049
Neg logit contributions
aturdays-0.079
forcer-0.076
oteric-0.076
usra-0.076
umatic-0.075
Token unnoticed
Token ID33755
Feature activation-2.63e-02

Pos logit contributions
iped+0.009
pel+0.008
 HOR+0.007
rows+0.007
Py+0.007
Neg logit contributions
aturdays-0.009
forcer-0.009
usra-0.009
oteric-0.009
umatic-0.009
Token.
Token ID13
Feature activation+2.38e-02

Pos logit contributions
aturdays+0.019
forcer+0.019
oteric+0.019
usra+0.019
umatic+0.018
Neg logit contributions
iped-0.017
pel-0.015
 HOR-0.014
rows-0.014
Py-0.014
Token This
Token ID770
Feature activation+2.95e-02

Pos logit contributions
iped+0.017
pel+0.015
rows+0.014
 HOR+0.014
Py+0.014
Neg logit contributions
aturdays-0.016
forcer-0.016
usra-0.016
oteric-0.016
orean-0.015
Token was
Token ID373
Feature activation+2.43e-03

Pos logit contributions
iped+0.022
pel+0.019
rows+0.018
 HOR+0.018
Py+0.018
Neg logit contributions
aturdays-0.020
forcer-0.019
usra-0.019
oteric-0.019
umatic-0.019
Token down
Token ID866
Feature activation-8.97e-02

Pos logit contributions
iped+0.002
pel+0.002
 HOR+0.001
rows+0.001
Py+0.001
Neg logit contributions
aturdays-0.002
forcer-0.002
usra-0.002
oteric-0.002
umatic-0.002
Tokengraded
Token ID21791
Feature activation-2.81e-01

Pos logit contributions
aturdays+0.055
forcer+0.053
usra+0.052
oteric+0.052
umatic+0.051
Neg logit contributions
iped-0.072
pel-0.063
 HOR-0.060
rows-0.060
Py-0.059
Token in
Token ID287
Feature activation+2.30e-02

Pos logit contributions
aturdays+0.209
forcer+0.202
usra+0.199
oteric+0.199
umatic+0.197
Neg logit contributions
iped-0.188
pel-0.161
rows-0.150
 HOR-0.150
Py-0.148
Token October
Token ID3267
Feature activation-6.71e-02

Pos logit contributions
iped+0.017
pel+0.015
 HOR+0.014
rows+0.014
Py+0.014
Neg logit contributions
aturdays-0.015
forcer-0.015
usra-0.015
oteric-0.015
umatic-0.014
Token 2015
Token ID1853
Feature activation+3.22e-02

Pos logit contributions
aturdays+0.049
forcer+0.048
oteric+0.047
usra+0.047
umatic+0.046
Neg logit contributions
iped-0.046
pel-0.039
 HOR-0.037
rows-0.036
Py-0.036
Token after
Token ID706
Feature activation+4.57e-02

Pos logit contributions
iped+0.022
pel+0.018
 HOR+0.017
rows+0.017
Py+0.017
Neg logit contributions
aturdays-0.024
forcer-0.023
usra-0.023
oteric-0.023
umatic-0.023
Token three
Token ID1115
Feature activation-1.75e-02

Pos logit contributions
iped+0.032
pel+0.028
 HOR+0.026
rows+0.026
Py+0.026
Neg logit contributions
aturdays-0.033
forcer-0.031
usra-0.031
oteric-0.031
umatic-0.030
Token {
Token ID1391
Feature activation-2.26e-01

Pos logit contributions
iped+0.046
pel+0.041
 HOR+0.039
rows+0.039
Py+0.039
Neg logit contributions
aturdays-0.027
forcer-0.025
usra-0.025
oteric-0.025
umatic-0.024
Token(
Token ID7
Feature activation-1.25e-01

Pos logit contributions
aturdays+0.142
forcer+0.139
usra+0.136
oteric+0.136
orean+0.134
Neg logit contributions
iped-0.175
pel-0.153
 HOR-0.145
rows-0.144
 squid-0.142
Token90
Token ID3829
Feature activation-4.04e-02

Pos logit contributions
aturdays+0.080
forcer+0.077
usra+0.076
oteric+0.076
umatic+0.075
Neg logit contributions
iped-0.096
pel-0.084
 HOR-0.079
rows-0.079
Py-0.078
Tokens
Token ID82
Feature activation-6.58e-02

Pos logit contributions
aturdays+0.026
forcer+0.025
usra+0.025
oteric+0.024
orean+0.024
Neg logit contributions
iped-0.031
pel-0.027
 HOR-0.026
rows-0.025
 squid-0.025
Token)}
Token ID38165
Feature activation+6.40e-03

Pos logit contributions
aturdays+0.049
forcer+0.047
usra+0.047
oteric+0.047
umatic+0.046
Neg logit contributions
iped-0.044
pel-0.038
 HOR-0.035
rows-0.035
Py-0.035
Token Q
Token ID1195
Feature activation-2.85e-01

Pos logit contributions
iped+0.005
pel+0.004
 HOR+0.004
rows+0.004
Py+0.004
Neg logit contributions
aturdays-0.004
forcer-0.004
usra-0.004
oteric-0.004
umatic-0.004
Tokenc
Token ID66
Feature activation-5.17e-02

Pos logit contributions
aturdays+0.181
forcer+0.173
usra+0.171
oteric+0.171
umatic+0.168
Neg logit contributions
iped-0.223
pel-0.195
 HOR-0.185
rows-0.185
Py-0.183
Token7
Token ID22
Feature activation-8.24e-03

Pos logit contributions
aturdays+0.033
forcer+0.032
usra+0.031
oteric+0.031
umatic+0.031
Neg logit contributions
iped-0.040
pel-0.035
 HOR-0.034
rows-0.033
 squid-0.033
Token {
Token ID1391
Feature activation-2.19e-01

Pos logit contributions
aturdays+0.004
forcer+0.004
oteric+0.004
usra+0.004
umatic+0.004
Neg logit contributions
iped-0.008
pel-0.007
 HOR-0.006
rows-0.006
Py-0.006
Token(
Token ID7
Feature activation-1.32e-01

Pos logit contributions
aturdays+0.139
forcer+0.135
usra+0.133
oteric+0.133
orean+0.130
Neg logit contributions
iped-0.169
pel-0.147
 HOR-0.140
rows-0.139
Py-0.137
Token108
Token ID15711
Feature activation-4.98e-02

Pos logit contributions
aturdays+0.086
forcer+0.083
usra+0.081
oteric+0.081
umatic+0.080
Neg logit contributions
iped-0.100
pel-0.088
 HOR-0.083
rows-0.083
Py-0.081
Token World
Token ID2159
Feature activation-4.09e-02

Pos logit contributions
aturdays+0.016
forcer+0.016
usra+0.016
oteric+0.016
umatic+0.015
Neg logit contributions
iped-0.015
pel-0.013
 HOR-0.012
rows-0.012
Py-0.012
Token War
Token ID1810
Feature activation-3.73e-02

Pos logit contributions
aturdays+0.022
forcer+0.021
usra+0.020
oteric+0.020
umatic+0.020
Neg logit contributions
iped-0.036
pel-0.032
 HOR-0.031
rows-0.031
Py-0.030
Token II
Token ID2873
Feature activation+3.90e-03

Pos logit contributions
aturdays+0.023
forcer+0.022
usra+0.022
oteric+0.021
orean+0.021
Neg logit contributions
iped-0.030
pel-0.026
 HOR-0.025
rows-0.025
Py-0.025
Token veteran
Token ID9298
Feature activation-1.31e-02

Pos logit contributions
iped+0.004
pel+0.003
 HOR+0.003
rows+0.003
Py+0.003
Neg logit contributions
aturdays-0.002
forcer-0.002
usra-0.002
oteric-0.002
umatic-0.002
Token to
Token ID284
Feature activation-1.69e-02

Pos logit contributions
aturdays+0.010
forcer+0.009
oteric+0.009
usra+0.009
umatic+0.009
Neg logit contributions
iped-0.009
pel-0.008
 HOR-0.007
rows-0.007
Py-0.007
Token death
Token ID1918
Feature activation-2.79e-01

Pos logit contributions
aturdays+0.005
forcer+0.004
usra+0.004
oteric+0.004
umatic+0.004
Neg logit contributions
iped-0.019
pel-0.017
 HOR-0.017
rows-0.017
Py-0.017
Token.
Token ID13
Feature activation-1.76e-02

Pos logit contributions
aturdays+0.205
forcer+0.199
usra+0.196
oteric+0.195
umatic+0.193
Neg logit contributions
iped-0.189
pel-0.162
 HOR-0.151
rows-0.151
Py-0.149
Token
Token ID198
Feature activation-1.79e-02

Pos logit contributions
aturdays+0.012
forcer+0.012
usra+0.012
oteric+0.012
umatic+0.011
Neg logit contributions
iped-0.013
pel-0.011
 HOR-0.010
rows-0.010
Py-0.010
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
aturdays+0.012
forcer+0.012
usra+0.012
oteric+0.012
orean+0.011
Neg logit contributions
iped-0.013
pel-0.011
 HOR-0.011
rows-0.011
Py-0.010
TokenDel
Token ID13856
Feature activation+2.39e-02

Pos logit contributions
aturdays+0.012
forcer+0.011
usra+0.011
oteric+0.011
orean+0.011
Neg logit contributions
iped-0.014
pel-0.012
 HOR-0.012
rows-0.012
 squid-0.012
Tokenbert
Token ID4835
Feature activation+1.39e-01

Pos logit contributions
iped+0.015
pel+0.013
 HOR+0.012
rows+0.012
Py+0.012
Neg logit contributions
aturdays-0.018
forcer-0.018
usra-0.017
oteric-0.017
orean-0.017