TOP ACTIVATIONS
MAX = 0.542

and J ansson 's pict orial and descriptive , yet rhy
home . Phil ip Rivers ( vs . MIA
so ." Phil ip Mont oro writes about beer
Master the Exc ursion Fun nel ( 1 )
Ay at ollah Ali Kham enei , warned that the day
as a function of prosecut orial discretion should not be used
the commander - in - chief of the United States when
British commander - in - chief , Sir Ian Hamilton .
's paired with " Del iver ance " on Aug .
Follow @ Jessica Dur ando on Twitter Read
omed . Sil ence envelop s us until my
s show “ Bel iever ” in the wake
want it to be digest ible . I was
King 's judgment )." Sah ih al - B uk hari
a stronger relationship with ab orig inals . The
claimed ? Trans gender people can undergo medically -
hours . In prolonged d rough ts , which are almost
That was Rob H oe gee 's episode and I think
A gentleman of genuine polit eness will not give all his
( R - S hen ando ah ) speaks on the

MIN ACTIVATIONS
MIN = -0.518

Mind claw Shaman vs Pyro blast Prim al Druid
yard sprint , and McF adden also threw for one touchdown
social justice issue . Athe ists are oppressed and persecuted the
Nut t campaigned for McF adden in the Heisman Trophy race
48 points , McF adden only accounted for one rushing
like the Purch asing Man agers Ind ices .
E pen a mo ek ahi ia n
on the issue on Kau ai . State legislators
to 10 men everything cons pired against us ," said Owen
Liberal Activ ists Than Man agers for City Posts .
if there are no que uing delays and no processing delays
this Al J our g ensen and Ge za X studio
chopped up society as unsu ited to state hood .
across the south - e astern Bahamas late Tuesday and early
Cash - Se eking Democratic Senator ial Campaign Committee
R & D ) tot alled 2 . 07 % of
, features a full Shim ano 600 tri - color group
Lu is Moreno Instit uto de Pol í tic as
- 0 Se eking just a point to clinch
s o eu vre . It feels nothing like

INTERVAL 0.277 - 0.542
CONTAINS 0.020%

the optics are still magn ifying those spaces as well as
Master the Exc ursion Fun nel ( 1 )
fees , the pl ump profits from home equity lines of
his work on electronic text uality , Matthew Kir sc hen
of Peoples ' Comm iss ariat of Defence No . 3

INTERVAL 0.012 - 0.277
CONTAINS 37.313%

their motive was . We 're not going to tolerate that
1 . Sa 'd dismissed the pleas of the
hold power anywhere today . But now we can see the
back ? Yes , they have a chance . But this
m a product of yellow fever or something

INTERVAL -0.253 - 0.012
CONTAINS 62.623%

be flowing freely during these events to prep the party
to certain guys , David Clarkson said .
suggest that this attack was being planned and the investigation had
tissue . In reality , the tapes show that they were
by books she wrote 40 years ago , and only occasionally

INTERVAL -0.518 - -0.253
CONTAINS 0.044%

/ any Mode : Man aged Access Point : Not -
Cash - Se eking Democratic Senator ial Campaign Committee
Dr Rune Fl ober gh agen . Mod elling
if there are no que uing delays and no processing delays
Scroll down and click on Vide opl ayer Input Stream .
Token and
Token ID290
Feature activation+4.10e-02

Pos logit contributions
astern+0.027
cry+0.023
blast+0.023
forcement+0.023
rots+0.022
Neg logit contributions
 LGBT-0.020
gee-0.019
 homosexuals-0.019
 POS-0.019
 interf-0.019
Token J
Token ID449
Feature activation+5.54e-02

Pos logit contributions
 LGBT+0.031
gee+0.030
 homosexuals+0.030
 POS+0.029
 interf+0.029
Neg logit contributions
astern-0.033
cry-0.028
forcement-0.027
blast-0.027
rots-0.026
Tokenansson
Token ID44038
Feature activation+5.17e-02

Pos logit contributions
 LGBT+0.037
gee+0.036
 homosexuals+0.036
 POS+0.035
nel+0.035
Neg logit contributions
astern-0.049
cry-0.042
forcement-0.041
blast-0.041
rots-0.040
Token's
Token ID338
Feature activation-2.90e-02

Pos logit contributions
 LGBT+0.044
gee+0.042
 homosexuals+0.042
 POS+0.042
 interf+0.041
Neg logit contributions
astern-0.037
cry-0.030
forcement-0.030
blast-0.030
rots-0.029
Token pict
Token ID2862
Feature activation-1.37e-02

Pos logit contributions
astern+0.023
cry+0.019
forcement+0.019
blast+0.019
rots+0.018
Neg logit contributions
 LGBT-0.022
 homosexuals-0.021
gee-0.021
 POS-0.021
 interf-0.021
Tokenorial
Token ID5132
Feature activation+5.42e-01

Pos logit contributions
astern+0.019
cry+0.017
forcement+0.017
blast+0.017
rots+0.017
Neg logit contributions
 LGBT-0.002
gee-0.002
 homosexuals-0.002
 POS-0.002
 interf-0.002
Token and
Token ID290
Feature activation+5.00e-02

Pos logit contributions
 LGBT+0.442
 homosexuals+0.421
gee+0.421
 POS+0.419
 interf+0.419
Neg logit contributions
astern-0.410
cry-0.335
forcement-0.330
blast-0.327
rots-0.318
Token descriptive
Token ID35644
Feature activation+8.29e-02

Pos logit contributions
 LGBT+0.037
gee+0.035
 homosexuals+0.035
 POS+0.035
 interf+0.034
Neg logit contributions
astern-0.042
cry-0.035
forcement-0.034
blast-0.034
rots-0.033
Token,
Token ID11
Feature activation+3.07e-02

Pos logit contributions
 LGBT+0.067
gee+0.064
 homosexuals+0.064
 POS+0.063
 interf+0.063
Neg logit contributions
astern-0.063
cry-0.052
forcement-0.051
blast-0.051
rots-0.049
Token yet
Token ID1865
Feature activation-7.31e-02

Pos logit contributions
 LGBT+0.022
gee+0.020
 homosexuals+0.020
 POS+0.020
 interf+0.020
Neg logit contributions
astern-0.026
cry-0.022
forcement-0.022
blast-0.022
rots-0.021
Token rhy
Token ID40005
Feature activation+1.16e-01

Pos logit contributions
astern+0.064
cry+0.055
forcement+0.053
blast+0.053
rots+0.052
Neg logit contributions
 LGBT-0.050
gee-0.048
 homosexuals-0.047
 POS-0.047
 interf-0.047
Token home
Token ID1363
Feature activation-9.31e-03

Pos logit contributions
 LGBT+0.042
 POS+0.040
gee+0.040
 homosexuals+0.040
 interf+0.040
Neg logit contributions
astern-0.047
cry-0.040
forcement-0.038
blast-0.038
rots-0.037
Token.
Token ID13
Feature activation+8.55e-03

Pos logit contributions
astern+0.008
cry+0.007
blast+0.007
forcement+0.007
rots+0.007
Neg logit contributions
 LGBT-0.006
gee-0.006
 POS-0.006
 homosexuals-0.006
 interf-0.006
Token\n
Token ID198
Feature activation+3.95e-02

Pos logit contributions
 LGBT+0.006
 POS+0.005
gee+0.005
nel+0.005
 homosexuals+0.005
Neg logit contributions
astern-0.008
cry-0.007
forcement-0.006
blast-0.006
rots-0.006
Token\n
Token ID198
Feature activation+2.47e-02

Pos logit contributions
 LGBT+0.022
gee+0.020
 homosexuals+0.020
 POS+0.020
nel+0.020
Neg logit contributions
astern-0.040
cry-0.035
forcement-0.034
blast-0.034
rots-0.033
TokenPhil
Token ID18673
Feature activation-1.16e-02

Pos logit contributions
 LGBT+0.019
gee+0.019
nel+0.019
 POS+0.018
 homosexuals+0.018
Neg logit contributions
astern-0.019
cry-0.016
forcement-0.015
blast-0.015
phabet-0.015
Tokenip
Token ID541
Feature activation+4.75e-01

Pos logit contributions
astern+0.014
cry+0.013
blast+0.013
forcement+0.013
phabet+0.012
Neg logit contributions
 LGBT-0.004
 homosexuals-0.003
gee-0.003
 POS-0.003
 interf-0.003
Token Rivers
Token ID18280
Feature activation+1.29e-01

Pos logit contributions
 LGBT+0.288
gee+0.273
 homosexuals+0.271
 POS+0.270
 interf+0.268
Neg logit contributions
astern-0.452
cry-0.393
forcement-0.383
blast-0.382
rots-0.374
Token (
Token ID357
Feature activation-3.42e-03

Pos logit contributions
 LGBT+0.084
gee+0.079
 homosexuals+0.079
 POS+0.079
 interf+0.078
Neg logit contributions
astern-0.117
cry-0.101
forcement-0.098
blast-0.098
rots-0.096
Tokenvs
Token ID14259
Feature activation+2.20e-02

Pos logit contributions
astern+0.003
cry+0.002
forcement+0.002
blast+0.002
ensen+0.002
Neg logit contributions
 LGBT-0.003
gee-0.003
 POS-0.003
nel-0.003
 homosexuals-0.003
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
 LGBT+0.015
gee+0.014
 POS+0.014
 homosexuals+0.014
nel+0.014
Neg logit contributions
astern-0.020
cry-0.017
forcement-0.016
blast-0.016
rots-0.016
Token MIA
Token ID44903
Feature activation+1.88e-01

Pos logit contributions
 LGBT+0.009
 POS+0.008
 homosexuals+0.008
gee+0.008
 interf+0.008
Neg logit contributions
astern-0.013
cry-0.011
forcement-0.011
blast-0.011
rots-0.010
Tokenso
Token ID568
Feature activation+1.75e-02

Pos logit contributions
 LGBT+0.133
gee+0.127
 homosexuals+0.126
 POS+0.126
nel+0.125
Neg logit contributions
astern-0.153
cry-0.130
forcement-0.126
blast-0.126
rots-0.123
Token."
Token ID526
Feature activation+6.15e-02

Pos logit contributions
 LGBT+0.012
gee+0.011
 homosexuals+0.011
 POS+0.011
nel+0.011
Neg logit contributions
astern-0.015
cry-0.013
forcement-0.013
blast-0.013
rots-0.013
Token\n
Token ID198
Feature activation+3.26e-02

Pos logit contributions
 LGBT+0.041
gee+0.039
 homosexuals+0.039
 POS+0.039
nel+0.039
Neg logit contributions
astern-0.055
cry-0.047
forcement-0.046
blast-0.046
rots-0.045
Token\n
Token ID198
Feature activation+1.51e-02

Pos logit contributions
 LGBT+0.018
gee+0.017
 homosexuals+0.017
 POS+0.017
nel+0.017
Neg logit contributions
astern-0.033
cry-0.029
forcement-0.028
blast-0.028
rots-0.027
TokenPhil
Token ID18673
Feature activation-2.99e-02

Pos logit contributions
 LGBT+0.012
gee+0.011
 homosexuals+0.011
nel+0.011
 POS+0.011
Neg logit contributions
astern-0.012
cry-0.010
forcement-0.010
blast-0.010
rots-0.009
Tokenip
Token ID541
Feature activation+4.69e-01

Pos logit contributions
astern+0.027
cry+0.024
forcement+0.023
blast+0.023
rots+0.023
Neg logit contributions
 LGBT-0.019
gee-0.018
 homosexuals-0.018
 POS-0.018
nel-0.018
Token Mont
Token ID5575
Feature activation-3.54e-02

Pos logit contributions
 LGBT+0.342
gee+0.327
 homosexuals+0.325
 POS+0.324
nel+0.322
Neg logit contributions
astern-0.389
cry-0.330
forcement-0.321
blast-0.320
rots-0.311
Tokenoro
Token ID16522
Feature activation+3.01e-03

Pos logit contributions
astern+0.029
cry+0.025
forcement+0.024
blast+0.024
rots+0.023
Neg logit contributions
 LGBT-0.026
gee-0.025
nel-0.025
 homosexuals-0.025
 POS-0.024
Token writes
Token ID6797
Feature activation+5.42e-02

Pos logit contributions
 LGBT+0.002
gee+0.002
 homosexuals+0.002
 POS+0.002
nel+0.002
Neg logit contributions
astern-0.003
cry-0.002
forcement-0.002
blast-0.002
rots-0.002
Token about
Token ID546
Feature activation+1.86e-02

Pos logit contributions
 LGBT+0.037
gee+0.036
 homosexuals+0.035
 POS+0.035
 interf+0.035
Neg logit contributions
astern-0.047
cry-0.040
forcement-0.039
blast-0.039
rots-0.039
Token beer
Token ID6099
Feature activation-1.58e-02

Pos logit contributions
 LGBT+0.014
gee+0.013
 homosexuals+0.013
 POS+0.013
 interf+0.013
Neg logit contributions
astern-0.015
cry-0.013
forcement-0.013
blast-0.013
rots-0.012
TokenMaster
Token ID18254
Feature activation+6.71e-02

Pos logit contributions
 LGBT+0.037
gee+0.036
 POS+0.035
 homosexuals+0.035
nel+0.035
Neg logit contributions
astern-0.032
cry-0.026
forcement-0.025
blast-0.025
rots-0.024
Token the
Token ID262
Feature activation+3.83e-02

Pos logit contributions
 LGBT+0.044
gee+0.042
 homosexuals+0.042
 POS+0.041
 interf+0.041
Neg logit contributions
astern-0.061
cry-0.052
forcement-0.051
blast-0.051
rots-0.050
Token Exc
Token ID25268
Feature activation-1.39e-01

Pos logit contributions
 LGBT+0.028
gee+0.027
 homosexuals+0.027
 POS+0.027
 interf+0.027
Neg logit contributions
astern-0.031
cry-0.027
forcement-0.026
blast-0.026
rots-0.025
Tokenursion
Token ID24197
Feature activation+8.70e-02

Pos logit contributions
astern+0.119
cry+0.101
forcement+0.099
blast+0.098
rots+0.096
Neg logit contributions
 LGBT-0.098
gee-0.095
nel-0.093
 homosexuals-0.093
 POS-0.093
Token Fun
Token ID11138
Feature activation-5.65e-02

Pos logit contributions
 LGBT+0.065
gee+0.063
 POS+0.062
 homosexuals+0.062
 interf+0.062
Neg logit contributions
astern-0.071
cry-0.059
forcement-0.058
blast-0.057
rots-0.056
Tokennel
Token ID4954
Feature activation+4.66e-01

Pos logit contributions
astern+0.052
cry+0.044
forcement+0.043
blast+0.043
rots+0.042
Neg logit contributions
 LGBT-0.036
gee-0.035
nel-0.035
 POS-0.034
 homosexuals-0.034
Token (
Token ID357
Feature activation+2.19e-02

Pos logit contributions
 LGBT+0.331
gee+0.321
nel+0.315
 POS+0.314
 interf+0.313
Neg logit contributions
astern-0.398
cry-0.337
forcement-0.327
blast-0.321
rots-0.321
Token1
Token ID16
Feature activation+2.80e-02

Pos logit contributions
 LGBT+0.017
gee+0.017
 POS+0.016
 homosexuals+0.016
nel+0.016
Neg logit contributions
astern-0.017
cry-0.014
forcement-0.014
blast-0.013
rots-0.013
Token)
Token ID8
Feature activation+3.64e-02

Pos logit contributions
 LGBT+0.019
gee+0.018
 POS+0.018
 homosexuals+0.018
nel+0.018
Neg logit contributions
astern-0.024
cry-0.021
forcement-0.020
blast-0.020
rots-0.020
Token\n
Token ID198
Feature activation+5.12e-02

Pos logit contributions
 LGBT+0.021
gee+0.020
 homosexuals+0.020
 POS+0.019
 interf+0.019
Neg logit contributions
astern-0.036
cry-0.031
forcement-0.031
blast-0.031
rots-0.030
Token\n
Token ID198
Feature activation+4.78e-02

Pos logit contributions
 LGBT+0.029
gee+0.027
 homosexuals+0.027
 POS+0.027
 interf+0.026
Neg logit contributions
astern-0.051
cry-0.045
forcement-0.044
blast-0.044
rots-0.043
Token Ay
Token ID13709
Feature activation+1.40e-02

Pos logit contributions
astern+0.010
cry+0.009
blast+0.008
forcement+0.008
rots+0.008
Neg logit contributions
 LGBT-0.007
 homosexuals-0.007
 interf-0.007
 POS-0.007
gee-0.007
Tokenat
Token ID265
Feature activation-3.32e-02

Pos logit contributions
 LGBT+0.010
gee+0.009
 POS+0.009
 homosexuals+0.009
nel+0.009
Neg logit contributions
astern-0.012
cry-0.010
blast-0.010
forcement-0.010
phabet-0.010
Tokenollah
Token ID17999
Feature activation-8.98e-02

Pos logit contributions
astern+0.032
cry+0.027
forcement+0.027
blast+0.027
rots+0.026
Neg logit contributions
 LGBT-0.020
gee-0.019
 homosexuals-0.019
 POS-0.019
 interf-0.019
Token Ali
Token ID12104
Feature activation+1.12e-01

Pos logit contributions
astern+0.092
cry+0.081
forcement+0.080
blast+0.080
rots+0.077
Neg logit contributions
 LGBT-0.047
gee-0.045
 homosexuals-0.044
 POS-0.044
nel-0.044
Token Kham
Token ID45159
Feature activation+1.09e-01

Pos logit contributions
 LGBT+0.076
 homosexuals+0.071
 POS+0.071
 interf+0.071
gee+0.071
Neg logit contributions
astern-0.099
cry-0.085
forcement-0.083
blast-0.083
rots-0.082
Tokenenei
Token ID46009
Feature activation+3.80e-01

Pos logit contributions
 LGBT+0.049
gee+0.045
 homosexuals+0.045
 POS+0.044
nel+0.044
Neg logit contributions
astern-0.121
cry-0.108
forcement-0.105
blast-0.105
rots-0.103
Token,
Token ID11
Feature activation-1.73e-02

Pos logit contributions
 LGBT+0.235
gee+0.223
 homosexuals+0.222
 POS+0.220
 interf+0.219
Neg logit contributions
astern-0.358
cry-0.310
forcement-0.302
blast-0.301
rots-0.295
Token warned
Token ID7728
Feature activation-3.08e-02

Pos logit contributions
astern+0.014
cry+0.012
forcement+0.012
blast+0.011
rots+0.011
Neg logit contributions
 LGBT-0.013
 homosexuals-0.012
 interf-0.012
gee-0.012
 POS-0.012
Token that
Token ID326
Feature activation+4.05e-02

Pos logit contributions
astern+0.027
cry+0.024
forcement+0.023
blast+0.023
rots+0.022
Neg logit contributions
 LGBT-0.021
 homosexuals-0.020
gee-0.020
 interf-0.019
 POS-0.019
Token the
Token ID262
Feature activation+3.64e-02

Pos logit contributions
 LGBT+0.027
 homosexuals+0.026
gee+0.026
 interf+0.025
 POS+0.025
Neg logit contributions
astern-0.036
cry-0.031
blast-0.030
forcement-0.030
rots-0.030
Token day
Token ID1110
Feature activation-1.52e-02

Pos logit contributions
 LGBT+0.026
 homosexuals+0.024
 interf+0.024
 POS+0.024
gee+0.024
Neg logit contributions
astern-0.031
cry-0.027
forcement-0.026
blast-0.026
ensen-0.025
Token as
Token ID355
Feature activation-1.28e-02

Pos logit contributions
 LGBT+0.004
gee+0.003
 homosexuals+0.003
 POS+0.003
 interf+0.003
Neg logit contributions
astern-0.004
cry-0.004
blast-0.004
forcement-0.004
rots-0.004
Token a
Token ID257
Feature activation-1.13e-02

Pos logit contributions
astern+0.012
cry+0.010
blast+0.010
forcement+0.010
rots+0.010
Neg logit contributions
 LGBT-0.008
gee-0.008
 homosexuals-0.008
 POS-0.008
 interf-0.008
Token function
Token ID2163
Feature activation-8.27e-03

Pos logit contributions
astern+0.009
cry+0.007
blast+0.007
forcement+0.007
rots+0.007
Neg logit contributions
 LGBT-0.009
gee-0.008
 POS-0.008
 homosexuals-0.008
 interf-0.008
Token of
Token ID286
Feature activation-1.74e-02

Pos logit contributions
astern+0.008
cry+0.007
forcement+0.007
blast+0.007
phabet+0.007
Neg logit contributions
 LGBT-0.005
gee-0.004
 homosexuals-0.004
 POS-0.004
 interf-0.004
Token prosecut
Token ID6627
Feature activation+8.43e-03

Pos logit contributions
astern+0.015
cry+0.013
blast+0.013
forcement+0.013
rots+0.013
Neg logit contributions
 LGBT-0.012
gee-0.011
 homosexuals-0.011
 POS-0.011
 interf-0.011
Tokenorial
Token ID5132
Feature activation+3.80e-01

Pos logit contributions
 LGBT+0.001
gee+0.001
 homosexuals+0.001
 POS+0.001
 interf+0.001
Neg logit contributions
astern-0.012
cry-0.011
blast-0.011
forcement-0.011
rots-0.011
Token discretion
Token ID14130
Feature activation+1.74e-01

Pos logit contributions
 LGBT+0.188
 homosexuals+0.173
gee+0.173
 POS+0.173
 interf+0.172
Neg logit contributions
astern-0.409
cry-0.360
forcement-0.352
blast-0.352
phabet-0.346
Token should
Token ID815
Feature activation+2.84e-02

Pos logit contributions
 LGBT+0.119
gee+0.114
 homosexuals+0.113
 POS+0.113
 interf+0.112
Neg logit contributions
astern-0.152
cry-0.130
blast-0.126
forcement-0.126
rots-0.123
Token not
Token ID407
Feature activation+2.68e-03

Pos logit contributions
 LGBT+0.019
gee+0.018
 homosexuals+0.018
 POS+0.017
 interf+0.017
Neg logit contributions
astern-0.026
cry-0.022
forcement-0.022
blast-0.022
rots-0.021
Token be
Token ID307
Feature activation-1.68e-02

Pos logit contributions
 LGBT+0.002
gee+0.002
 homosexuals+0.002
 POS+0.002
 interf+0.002
Neg logit contributions
astern-0.002
cry-0.002
forcement-0.002
blast-0.002
rots-0.002
Token used
Token ID973
Feature activation+3.12e-02

Pos logit contributions
astern+0.014
cry+0.011
blast+0.011
forcement+0.011
rots+0.011
Neg logit contributions
 LGBT-0.013
gee-0.012
 homosexuals-0.012
 POS-0.012
 interf-0.012
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
astern+0.020
cry+0.017
forcement+0.016
blast+0.016
rots+0.016
Neg logit contributions
 LGBT-0.015
gee-0.014
 homosexuals-0.014
 POS-0.014
 interf-0.014
Token commander
Token ID11561
Feature activation+7.27e-02

Pos logit contributions
astern+0.016
cry+0.013
blast+0.013
forcement+0.013
rots+0.013
Neg logit contributions
 LGBT-0.015
gee-0.015
 homosexuals-0.015
 POS-0.015
 interf-0.015
Token-
Token ID12
Feature activation-2.61e-02

Pos logit contributions
 LGBT+0.050
gee+0.048
 homosexuals+0.047
 POS+0.047
 interf+0.047
Neg logit contributions
astern-0.063
cry-0.054
forcement-0.053
blast-0.053
rots-0.051
Tokenin
Token ID259
Feature activation+1.17e-01

Pos logit contributions
astern+0.025
cry+0.021
forcement+0.021
blast+0.021
rots+0.020
Neg logit contributions
 LGBT-0.016
gee-0.015
 homosexuals-0.015
 POS-0.015
 interf-0.015
Token-
Token ID12
Feature activation-2.10e-02

Pos logit contributions
 LGBT+0.086
gee+0.082
 homosexuals+0.081
 POS+0.081
 interf+0.081
Neg logit contributions
astern-0.097
cry-0.082
blast-0.080
forcement-0.080
rots-0.078
Tokenchief
Token ID17351
Feature activation+3.79e-01

Pos logit contributions
astern+0.024
cry+0.022
blast+0.021
forcement+0.021
rots+0.021
Neg logit contributions
 LGBT-0.008
 homosexuals-0.008
 POS-0.008
gee-0.007
 interf-0.007
Token of
Token ID286
Feature activation+2.38e-03

Pos logit contributions
 LGBT+0.227
gee+0.214
 homosexuals+0.214
 POS+0.212
 interf+0.211
Neg logit contributions
astern-0.364
cry-0.316
blast-0.309
forcement-0.308
rots-0.302
Token the
Token ID262
Feature activation-1.29e-02

Pos logit contributions
 LGBT+0.002
gee+0.001
 homosexuals+0.001
 POS+0.001
 interf+0.001
Neg logit contributions
astern-0.002
cry-0.002
blast-0.002
forcement-0.002
rots-0.002
Token United
Token ID1578
Feature activation+3.34e-02

Pos logit contributions
astern+0.012
cry+0.010
forcement+0.010
blast+0.010
rots+0.010
Neg logit contributions
 LGBT-0.008
gee-0.008
 homosexuals-0.008
 POS-0.008
 interf-0.007
Token States
Token ID1829
Feature activation+5.28e-02

Pos logit contributions
 LGBT+0.019
gee+0.018
 POS+0.018
 homosexuals+0.018
nel+0.018
Neg logit contributions
astern-0.033
cry-0.029
blast-0.028
forcement-0.028
rots-0.028
Token when
Token ID618
Feature activation-6.77e-03

Pos logit contributions
 LGBT+0.035
gee+0.034
 homosexuals+0.033
 POS+0.033
 interf+0.033
Neg logit contributions
astern-0.047
cry-0.040
blast-0.039
forcement-0.039
rots-0.038
Token British
Token ID3517
Feature activation-6.80e-02

Pos logit contributions
astern+0.003
cry+0.002
forcement+0.002
blast+0.002
rots+0.002
Neg logit contributions
 LGBT-0.003
gee-0.002
 homosexuals-0.002
 POS-0.002
 interf-0.002
Token commander
Token ID11561
Feature activation+8.67e-02

Pos logit contributions
astern+0.057
cry+0.048
forcement+0.047
blast+0.047
ensen+0.046
Neg logit contributions
 LGBT-0.050
gee-0.047
 POS-0.047
 homosexuals-0.047
 interf-0.047
Token-
Token ID12
Feature activation-5.27e-03

Pos logit contributions
 LGBT+0.060
gee+0.058
 homosexuals+0.057
 POS+0.057
nel+0.057
Neg logit contributions
astern-0.075
cry-0.064
forcement-0.062
blast-0.062
rots-0.061
Tokenin
Token ID259
Feature activation+1.11e-01

Pos logit contributions
astern+0.005
cry+0.004
blast+0.004
forcement+0.004
rots+0.004
Neg logit contributions
 LGBT-0.003
gee-0.003
 homosexuals-0.003
 POS-0.003
 interf-0.003
Token-
Token ID12
Feature activation+3.60e-03

Pos logit contributions
 LGBT+0.081
gee+0.078
 homosexuals+0.077
 POS+0.077
 interf+0.076
Neg logit contributions
astern-0.093
cry-0.079
blast-0.076
forcement-0.076
rots-0.074
Tokenchief
Token ID17351
Feature activation+3.75e-01

Pos logit contributions
 LGBT+0.001
gee+0.001
 homosexuals+0.001
 POS+0.001
 interf+0.001
Neg logit contributions
astern-0.004
cry-0.004
blast-0.004
forcement-0.004
rots-0.004
Token,
Token ID11
Feature activation-1.73e-02

Pos logit contributions
 LGBT+0.250
gee+0.238
 homosexuals+0.237
 POS+0.236
 interf+0.234
Neg logit contributions
astern-0.336
cry-0.288
forcement-0.281
blast-0.280
rots-0.274
Token Sir
Token ID7361
Feature activation+1.35e-01

Pos logit contributions
astern+0.014
cry+0.012
forcement+0.011
blast+0.011
rots+0.011
Neg logit contributions
 LGBT-0.013
gee-0.012
 homosexuals-0.012
 POS-0.012
 interf-0.012
Token Ian
Token ID12930
Feature activation+1.34e-01

Pos logit contributions
 LGBT+0.100
gee+0.096
 homosexuals+0.095
 POS+0.095
nel+0.094
Neg logit contributions
astern-0.111
cry-0.094
forcement-0.091
blast-0.091
rots-0.089
Token Hamilton
Token ID11582
Feature activation-1.23e-01

Pos logit contributions
 LGBT+0.090
gee+0.086
 homosexuals+0.086
 POS+0.085
 interf+0.085
Neg logit contributions
astern-0.119
cry-0.102
forcement-0.099
blast-0.099
rots-0.097
Token.
Token ID13
Feature activation-7.99e-03

Pos logit contributions
astern+0.108
cry+0.093
forcement+0.090
blast+0.090
rots+0.088
Neg logit contributions
 LGBT-0.083
gee-0.079
 homosexuals-0.078
 POS-0.078
 interf-0.078
Token's
Token ID338
Feature activation+6.96e-02

Pos logit contributions
 LGBT+0.037
gee+0.035
 homosexuals+0.035
 POS+0.035
 interf+0.035
Neg logit contributions
astern-0.038
cry-0.032
forcement-0.031
blast-0.031
rots-0.030
Token paired
Token ID20312
Feature activation-8.16e-02

Pos logit contributions
 LGBT+0.046
gee+0.044
 homosexuals+0.044
 POS+0.043
 interf+0.043
Neg logit contributions
astern-0.063
cry-0.054
forcement-0.053
blast-0.052
rots-0.051
Token with
Token ID351
Feature activation-5.60e-02

Pos logit contributions
astern+0.069
cry+0.059
forcement+0.057
blast+0.057
rots+0.056
Neg logit contributions
 LGBT-0.058
gee-0.055
 homosexuals-0.055
 POS-0.055
 interf-0.055
Token "
Token ID366
Feature activation+4.48e-02

Pos logit contributions
astern+0.053
cry+0.046
forcement+0.045
blast+0.045
rots+0.044
Neg logit contributions
 LGBT-0.034
gee-0.032
 homosexuals-0.032
 POS-0.032
 interf-0.032
TokenDel
Token ID13856
Feature activation+3.64e-02

Pos logit contributions
 LGBT+0.036
gee+0.034
 homosexuals+0.034
 POS+0.034
nel+0.034
Neg logit contributions
astern-0.034
cry-0.028
forcement-0.028
blast-0.027
ensen-0.027
Tokeniver
Token ID1428
Feature activation+3.74e-01

Pos logit contributions
 LGBT+0.022
gee+0.020
 homosexuals+0.020
 POS+0.020
 interf+0.020
Neg logit contributions
astern-0.035
cry-0.031
forcement-0.030
blast-0.030
rots-0.029
Tokenance
Token ID590
Feature activation+1.62e-01

Pos logit contributions
 LGBT+0.227
gee+0.216
 homosexuals+0.213
 POS+0.212
nel+0.211
Neg logit contributions
astern-0.357
cry-0.309
forcement-0.302
blast-0.301
rots-0.295
Token"
Token ID1
Feature activation+4.46e-02

Pos logit contributions
 LGBT+0.100
gee+0.095
 homosexuals+0.094
 POS+0.094
nel+0.093
Neg logit contributions
astern-0.152
cry-0.132
forcement-0.128
blast-0.128
rots-0.125
Token on
Token ID319
Feature activation+1.07e-02

Pos logit contributions
 LGBT+0.030
gee+0.028
 homosexuals+0.028
 POS+0.028
 interf+0.028
Neg logit contributions
astern-0.040
cry-0.034
forcement-0.033
blast-0.033
rots-0.033
Token Aug
Token ID2447
Feature activation+1.75e-02

Pos logit contributions
 LGBT+0.008
gee+0.007
 homosexuals+0.007
 POS+0.007
 interf+0.007
Neg logit contributions
astern-0.009
cry-0.008
forcement-0.007
blast-0.007
rots-0.007
Token.
Token ID13
Feature activation+3.91e-02

Pos logit contributions
 LGBT+0.012
gee+0.011
 POS+0.011
 homosexuals+0.011
nel+0.011
Neg logit contributions
astern-0.016
cry-0.013
forcement-0.013
blast-0.013
rots-0.013
Token\n
Token ID198
Feature activation-4.12e-02

Pos logit contributions
astern+0.040
cry+0.035
forcement+0.034
blast+0.034
rots+0.034
Neg logit contributions
 LGBT-0.022
gee-0.021
 homosexuals-0.021
 POS-0.021
nel-0.021
TokenFollow
Token ID7155
Feature activation-9.32e-02

Pos logit contributions
astern+0.032
cry+0.027
forcement+0.026
blast+0.026
rots+0.025
Neg logit contributions
 LGBT-0.032
gee-0.031
 homosexuals-0.031
 POS-0.030
nel-0.030
Token @
Token ID2488
Feature activation+7.47e-03

Pos logit contributions
astern+0.076
cry+0.064
forcement+0.062
blast+0.062
rots+0.060
Neg logit contributions
 LGBT-0.070
gee-0.067
 homosexuals-0.067
 POS-0.066
 interf-0.066
TokenJessica
Token ID45572
Feature activation-1.07e-01

Pos logit contributions
 LGBT+0.006
gee+0.006
 homosexuals+0.006
 POS+0.005
nel+0.005
Neg logit contributions
astern-0.006
cry-0.005
forcement-0.005
blast-0.005
rots-0.005
TokenDur
Token ID36927
Feature activation-4.45e-02

Pos logit contributions
astern+0.086
cry+0.073
forcement+0.071
blast+0.071
rots+0.069
Neg logit contributions
 LGBT-0.080
gee-0.077
 homosexuals-0.076
 POS-0.076
nel-0.075
Tokenando
Token ID25440
Feature activation+3.71e-01

Pos logit contributions
astern+0.038
cry+0.032
forcement+0.031
blast+0.031
rots+0.031
Neg logit contributions
 LGBT-0.031
gee-0.030
nel-0.030
 homosexuals-0.030
 POS-0.030
Token on
Token ID319
Feature activation+4.35e-02

Pos logit contributions
 LGBT+0.244
gee+0.233
 homosexuals+0.230
 POS+0.229
nel+0.228
Neg logit contributions
astern-0.334
cry-0.288
forcement-0.280
blast-0.279
rots-0.273
Token Twitter
Token ID3009
Feature activation+1.06e-02

Pos logit contributions
 LGBT+0.039
gee+0.037
 POS+0.037
 homosexuals+0.037
 interf+0.037
Neg logit contributions
astern-0.029
cry-0.024
forcement-0.023
blast-0.023
rots-0.022
Token\n
Token ID198
Feature activation+7.19e-03

Pos logit contributions
 LGBT+0.007
gee+0.007
 homosexuals+0.007
 POS+0.006
 interf+0.006
Neg logit contributions
astern-0.010
cry-0.008
forcement-0.008
blast-0.008
rots-0.008
Token\n
Token ID198
Feature activation-2.83e-03

Pos logit contributions
 LGBT+0.004
gee+0.004
 homosexuals+0.004
 POS+0.004
nel+0.004
Neg logit contributions
astern-0.007
cry-0.006
forcement-0.006
blast-0.006
rots-0.006
TokenRead
Token ID5569
Feature activation+1.67e-03

Pos logit contributions
astern+0.002
cry+0.002
forcement+0.002
blast+0.002
rots+0.002
Neg logit contributions
 LGBT-0.002
gee-0.002
 homosexuals-0.002
nel-0.002
 POS-0.002
Tokenomed
Token ID12657
Feature activation-6.59e-02

Pos logit contributions
 LGBT+0.004
gee+0.004
 homosexuals+0.004
 POS+0.004
 interf+0.004
Neg logit contributions
astern-0.005
cry-0.004
blast-0.004
forcement-0.004
rots-0.004
Token.
Token ID13
Feature activation-3.13e-02

Pos logit contributions
astern+0.059
cry+0.050
forcement+0.049
blast+0.049
rots+0.048
Neg logit contributions
 LGBT-0.044
gee-0.042
 homosexuals-0.042
 POS-0.042
 interf-0.041
Token\n
Token ID198
Feature activation-2.72e-02

Pos logit contributions
astern+0.030
cry+0.026
forcement+0.025
blast+0.025
rots+0.024
Neg logit contributions
 LGBT-0.019
gee-0.018
 homosexuals-0.018
 POS-0.018
 interf-0.018
Token\n
Token ID198
Feature activation-3.92e-02

Pos logit contributions
astern+0.027
cry+0.024
forcement+0.023
blast+0.023
rots+0.023
Neg logit contributions
 LGBT-0.015
gee-0.014
 homosexuals-0.014
 POS-0.014
 interf-0.014
TokenSil
Token ID15086
Feature activation-7.18e-02

Pos logit contributions
astern+0.032
cry+0.027
forcement+0.026
blast+0.026
ensen+0.025
Neg logit contributions
 LGBT-0.029
gee-0.028
 homosexuals-0.028
 POS-0.028
nel-0.028
Tokenence
Token ID594
Feature activation+3.67e-01

Pos logit contributions
astern+0.069
cry+0.060
blast+0.058
forcement+0.058
rots+0.057
Neg logit contributions
 LGBT-0.043
gee-0.041
 homosexuals-0.040
 POS-0.040
 interf-0.040
Token envelop
Token ID16441
Feature activation-1.35e-01

Pos logit contributions
 LGBT+0.253
gee+0.241
 homosexuals+0.240
 POS+0.240
 interf+0.239
Neg logit contributions
astern-0.320
cry-0.272
forcement-0.266
blast-0.265
phabet-0.259
Tokens
Token ID82
Feature activation+6.19e-02

Pos logit contributions
astern+0.106
cry+0.089
blast+0.086
forcement+0.086
rots+0.084
Neg logit contributions
 LGBT-0.105
gee-0.101
 homosexuals-0.100
 POS-0.100
 interf-0.099
Token us
Token ID514
Feature activation-6.94e-02

Pos logit contributions
 LGBT+0.046
gee+0.043
 homosexuals+0.043
 POS+0.043
nel+0.043
Neg logit contributions
astern-0.051
cry-0.043
forcement-0.042
blast-0.042
ensen-0.041
Token until
Token ID1566
Feature activation+3.62e-02

Pos logit contributions
astern+0.063
cry+0.054
forcement+0.052
blast+0.052
rots+0.051
Neg logit contributions
 LGBT-0.046
gee-0.044
 homosexuals-0.043
 POS-0.043
 interf-0.043
Token my
Token ID616
Feature activation+3.63e-02

Pos logit contributions
 LGBT+0.025
gee+0.024
 homosexuals+0.024
 POS+0.023
 interf+0.023
Neg logit contributions
astern-0.032
cry-0.027
forcement-0.026
blast-0.026
ensen-0.026
Tokens
Token ID82
Feature activation+3.37e-02

Pos logit contributions
 LGBT+0.025
gee+0.024
 homosexuals+0.024
 POS+0.024
nel+0.023
Neg logit contributions
astern-0.028
cry-0.024
forcement-0.023
blast-0.023
rots-0.023
Token show
Token ID905
Feature activation+9.62e-03

Pos logit contributions
 LGBT+0.025
gee+0.024
 homosexuals+0.024
 POS+0.023
 interf+0.023
Neg logit contributions
astern-0.028
cry-0.024
forcement-0.023
blast-0.023
rots-0.022
Token
Token ID564
Feature activation+5.33e-02

Pos logit contributions
 LGBT+0.006
gee+0.006
 homosexuals+0.006
 POS+0.006
 interf+0.006
Neg logit contributions
astern-0.009
cry-0.007
blast-0.007
forcement-0.007
rots-0.007
Token
Token ID250
Feature activation+1.37e-02

Pos logit contributions
 LGBT+0.035
gee+0.033
 homosexuals+0.033
 POS+0.033
nel+0.032
Neg logit contributions
astern-0.049
cry-0.042
blast-0.041
forcement-0.040
rots-0.040
TokenBel
Token ID12193
Feature activation-4.18e-02

Pos logit contributions
 LGBT+0.010
gee+0.009
 homosexuals+0.009
 POS+0.009
nel+0.009
Neg logit contributions
astern-0.012
cry-0.010
forcement-0.010
blast-0.010
rots-0.009
Tokeniever
Token ID47818
Feature activation+3.67e-01

Pos logit contributions
astern+0.039
cry+0.034
blast+0.033
forcement+0.033
rots+0.032
Neg logit contributions
 LGBT-0.026
gee-0.025
 homosexuals-0.024
 interf-0.024
 POS-0.024
Token
Token ID447
Feature activation+1.81e-02

Pos logit contributions
 LGBT+0.253
gee+0.240
 POS+0.238
 homosexuals+0.238
nel+0.237
Neg logit contributions
astern-0.319
cry-0.273
blast-0.265
forcement-0.265
rots-0.259
Token
Token ID251
Feature activation-8.55e-03

Pos logit contributions
 LGBT+0.014
gee+0.013
 homosexuals+0.013
 POS+0.013
nel+0.013
Neg logit contributions
astern-0.015
cry-0.012
blast-0.012
forcement-0.012
rots-0.012
Token in
Token ID287
Feature activation+1.17e-03

Pos logit contributions
astern+0.007
cry+0.006
forcement+0.006
blast+0.006
rots+0.006
Neg logit contributions
 LGBT-0.006
gee-0.006
 homosexuals-0.006
 POS-0.006
nel-0.006
Token the
Token ID262
Feature activation+1.59e-03

Pos logit contributions
 LGBT+0.001
 homosexuals+0.001
gee+0.001
 POS+0.001
 interf+0.001
Neg logit contributions
astern-0.001
cry-0.001
blast-0.001
forcement-0.001
rots-0.001
Token wake
Token ID7765
Feature activation-9.85e-02

Pos logit contributions
 LGBT+0.001
 homosexuals+0.001
gee+0.001
 POS+0.001
 interf+0.001
Neg logit contributions
astern-0.001
cry-0.001
blast-0.001
forcement-0.001
ensen-0.001
Token want
Token ID765
Feature activation+4.78e-02

Pos logit contributions
 LGBT+0.011
gee+0.010
 homosexuals+0.010
 POS+0.010
 interf+0.010
Neg logit contributions
astern-0.012
cry-0.010
forcement-0.010
blast-0.010
rots-0.010
Token it
Token ID340
Feature activation+3.14e-02

Pos logit contributions
 LGBT+0.030
gee+0.028
 homosexuals+0.028
 POS+0.028
 interf+0.028
Neg logit contributions
astern-0.045
cry-0.038
blast-0.037
forcement-0.037
ensen-0.037
Token to
Token ID284
Feature activation+4.74e-02

Pos logit contributions
 LGBT+0.019
gee+0.018
 homosexuals+0.017
 POS+0.017
 interf+0.017
Neg logit contributions
astern-0.030
cry-0.026
blast-0.026
forcement-0.026
rots-0.025
Token be
Token ID307
Feature activation+3.53e-02

Pos logit contributions
 LGBT+0.033
gee+0.032
 homosexuals+0.031
 POS+0.031
 interf+0.031
Neg logit contributions
astern-0.041
cry-0.035
blast-0.034
forcement-0.034
rots-0.033
Token digest
Token ID16274
Feature activation+5.65e-03

Pos logit contributions
 LGBT+0.023
gee+0.022
 POS+0.022
 homosexuals+0.022
 interf+0.022
Neg logit contributions
astern-0.032
cry-0.027
blast-0.027
forcement-0.027
rots-0.026
Tokenible
Token ID856
Feature activation+3.65e-01

Pos logit contributions
 LGBT+0.001
gee+0.001
 homosexuals+0.001
 POS+0.001
 interf+0.001
Neg logit contributions
astern-0.007
cry-0.007
forcement-0.007
blast-0.007
rots-0.007
Token.
Token ID13
Feature activation+2.90e-02

Pos logit contributions
 LGBT+0.238
gee+0.225
 POS+0.224
 homosexuals+0.223
 interf+0.223
Neg logit contributions
astern-0.335
cry-0.286
blast-0.278
forcement-0.278
rots-0.273
Token
Token ID198
Feature activation+3.33e-02

Pos logit contributions
 LGBT+0.018
gee+0.017
 POS+0.017
 homosexuals+0.017
nel+0.017
Neg logit contributions
astern-0.027
cry-0.023
forcement-0.023
blast-0.023
rots-0.022
Token
Token ID198
Feature activation+2.38e-02

Pos logit contributions
 LGBT+0.018
gee+0.017
 homosexuals+0.017
 POS+0.017
nel+0.017
Neg logit contributions
astern-0.034
cry-0.029
forcement-0.029
blast-0.029
rots-0.028
TokenI
Token ID40
Feature activation+2.40e-02

Pos logit contributions
 LGBT+0.019
gee+0.018
 POS+0.018
 homosexuals+0.018
nel+0.018
Neg logit contributions
astern-0.018
cry-0.015
forcement-0.015
blast-0.015
ensen-0.014
Token was
Token ID373
Feature activation+2.89e-02

Pos logit contributions
 LGBT+0.017
gee+0.017
 homosexuals+0.017
 POS+0.017
 interf+0.016
Neg logit contributions
astern-0.020
cry-0.017
forcement-0.016
blast-0.016
rots-0.016
Token King
Token ID2677
Feature activation-1.56e-03

Pos logit contributions
astern+0.029
cry+0.024
blast+0.024
forcement+0.024
rots+0.023
Neg logit contributions
 LGBT-0.024
gee-0.023
 homosexuals-0.023
 POS-0.023
 interf-0.022
Token's
Token ID338
Feature activation-6.03e-02

Pos logit contributions
astern+0.001
cry+0.001
blast+0.001
forcement+0.001
rots+0.001
Neg logit contributions
 LGBT-0.001
gee-0.001
 homosexuals-0.001
 POS-0.001
 interf-0.001
Token judgment
Token ID8492
Feature activation-1.54e-02

Pos logit contributions
astern+0.051
cry+0.043
blast+0.042
forcement+0.042
rots+0.041
Neg logit contributions
 LGBT-0.043
gee-0.041
 homosexuals-0.041
 POS-0.041
 interf-0.041
Token)."
Token ID21387
Feature activation-6.23e-02

Pos logit contributions
astern+0.013
cry+0.011
blast+0.011
forcement+0.011
rots+0.010
Neg logit contributions
 LGBT-0.011
gee-0.011
 homosexuals-0.011
 POS-0.011
 interf-0.011
TokenSah
Token ID32194
Feature activation+1.54e-01

Pos logit contributions
astern+0.057
cry+0.049
forcement+0.048
blast+0.048
rots+0.047
Neg logit contributions
 LGBT-0.040
gee-0.038
 homosexuals-0.038
 POS-0.038
 interf-0.038
Tokenih
Token ID4449
Feature activation+3.55e-01

Pos logit contributions
 LGBT+0.019
gee+0.014
 homosexuals+0.014
 POS+0.013
nel+0.013
Neg logit contributions
astern-0.221
cry-0.202
blast-0.199
forcement-0.199
rots-0.196
Token al
Token ID435
Feature activation-4.88e-02

Pos logit contributions
 LGBT+0.212
gee+0.201
 homosexuals+0.200
 POS+0.199
 interf+0.198
Neg logit contributions
astern-0.341
cry-0.296
forcement-0.289
blast-0.289
rots-0.282
Token-
Token ID12
Feature activation+4.05e-02

Pos logit contributions
astern+0.040
cry+0.034
forcement+0.033
blast+0.033
rots+0.032
Neg logit contributions
 LGBT-0.036
gee-0.034
 homosexuals-0.034
 POS-0.034
 interf-0.034
TokenB
Token ID33
Feature activation+4.39e-02

Pos logit contributions
 LGBT+0.029
gee+0.028
 homosexuals+0.027
 interf+0.027
 POS+0.027
Neg logit contributions
astern-0.034
cry-0.029
forcement-0.028
blast-0.028
ensen-0.028
Tokenuk
Token ID2724
Feature activation+7.47e-05

Pos logit contributions
 LGBT+0.038
 homosexuals+0.037
gee+0.037
 POS+0.037
 interf+0.036
Neg logit contributions
astern-0.030
cry-0.025
blast-0.024
forcement-0.024
rots-0.023
Tokenhari
Token ID49573
Feature activation-1.62e-02

Pos logit contributions
 LGBT+0.000
gee+0.000
 homosexuals+0.000
 POS+0.000
 interf+0.000
Neg logit contributions
astern-0.000
cry-0.000
blast-0.000
forcement-0.000
rots-0.000
Token a
Token ID257
Feature activation+3.84e-02

Pos logit contributions
astern+0.001
cry+0.001
forcement+0.001
blast+0.001
rots+0.001
Neg logit contributions
 LGBT-0.001
gee-0.001
 homosexuals-0.001
 POS-0.001
 interf-0.001
Token stronger
Token ID7387
Feature activation+1.73e-02

Pos logit contributions
 LGBT+0.026
gee+0.025
 homosexuals+0.025
 POS+0.025
 interf+0.025
Neg logit contributions
astern-0.034
cry-0.029
forcement-0.028
blast-0.028
rots-0.027
Token relationship
Token ID2776
Feature activation+2.53e-02

Pos logit contributions
 LGBT+0.011
gee+0.011
 homosexuals+0.011
 POS+0.011
 interf+0.011
Neg logit contributions
astern-0.016
cry-0.013
forcement-0.013
blast-0.013
rots-0.013
Token with
Token ID351
Feature activation-2.90e-02

Pos logit contributions
 LGBT+0.018
gee+0.017
 homosexuals+0.017
 POS+0.017
 interf+0.017
Neg logit contributions
astern-0.021
cry-0.018
blast-0.018
forcement-0.018
rots-0.017
Token ab
Token ID450
Feature activation-1.19e-01

Pos logit contributions
astern+0.025
cry+0.022
blast+0.021
forcement+0.021
rots+0.021
Neg logit contributions
 LGBT-0.020
 homosexuals-0.019
gee-0.019
 POS-0.019
 interf-0.019
Tokenorig
Token ID11612
Feature activation+3.51e-01

Pos logit contributions
astern+0.143
cry+0.128
forcement+0.126
blast+0.125
rots+0.123
Neg logit contributions
 LGBT-0.042
gee-0.039
 homosexuals-0.038
 POS-0.038
nel-0.037
Tokeninals
Token ID6897
Feature activation-7.06e-03

Pos logit contributions
 LGBT+0.239
gee+0.229
 homosexuals+0.226
 POS+0.225
nel+0.224
Neg logit contributions
astern-0.307
cry-0.264
forcement-0.257
blast-0.257
rots-0.249
Token.
Token ID13
Feature activation+5.37e-02

Pos logit contributions
astern+0.006
cry+0.005
blast+0.005
forcement+0.005
rots+0.005
Neg logit contributions
 LGBT-0.005
gee-0.005
 homosexuals-0.004
 POS-0.004
 interf-0.004
Token
Token ID198
Feature activation+4.99e-02

Pos logit contributions
 LGBT+0.031
gee+0.029
 homosexuals+0.029
 POS+0.029
nel+0.028
Neg logit contributions
astern-0.053
cry-0.046
blast-0.045
forcement-0.045
rots-0.044
Token
Token ID198
Feature activation+3.18e-02

Pos logit contributions
 LGBT+0.028
gee+0.026
 homosexuals+0.026
 POS+0.026
nel+0.025
Neg logit contributions
astern-0.050
cry-0.044
forcement-0.043
blast-0.043
rots-0.042
TokenThe
Token ID464
Feature activation+2.06e-02

Pos logit contributions
 LGBT+0.024
gee+0.023
nel+0.023
 POS+0.023
 homosexuals+0.023
Neg logit contributions
astern-0.025
cry-0.021
forcement-0.021
blast-0.021
phabet-0.020
Token claimed
Token ID4752
Feature activation-1.24e-02

Pos logit contributions
astern+0.034
cry+0.029
forcement+0.028
blast+0.028
phabet+0.027
Neg logit contributions
 LGBT-0.028
 homosexuals-0.026
gee-0.026
 POS-0.026
 interf-0.026
Token?
Token ID30
Feature activation-4.89e-02

Pos logit contributions
astern+0.011
cry+0.009
forcement+0.009
blast+0.009
rots+0.009
Neg logit contributions
 LGBT-0.008
gee-0.008
 homosexuals-0.008
 POS-0.008
nel-0.008
Token
Token ID198
Feature activation+3.58e-02

Pos logit contributions
astern+0.043
cry+0.037
blast+0.036
forcement+0.036
rots+0.035
Neg logit contributions
 LGBT-0.033
gee-0.031
 homosexuals-0.031
 interf-0.031
 POS-0.031
Token
Token ID198
Feature activation+1.69e-02

Pos logit contributions
 LGBT+0.020
gee+0.019
 homosexuals+0.019
 POS+0.019
nel+0.018
Neg logit contributions
astern-0.036
cry-0.031
forcement-0.031
blast-0.031
rots-0.030
TokenTrans
Token ID8291
Feature activation+5.87e-02

Pos logit contributions
 LGBT+0.013
gee+0.013
 homosexuals+0.013
 POS+0.013
nel+0.012
Neg logit contributions
astern-0.013
cry-0.011
forcement-0.011
blast-0.011
rots-0.010
Tokengender
Token ID8388
Feature activation+3.48e-01

Pos logit contributions
 LGBT+0.017
gee+0.015
 homosexuals+0.015
 POS+0.015
 interf+0.014
Neg logit contributions
astern-0.075
cry-0.067
blast-0.066
forcement-0.066
rots-0.065
Token people
Token ID661
Feature activation-4.95e-02

Pos logit contributions
 LGBT+0.230
gee+0.218
 homosexuals+0.217
 POS+0.216
 interf+0.214
Neg logit contributions
astern-0.314
cry-0.270
forcement-0.263
blast-0.263
rots-0.256
Token can
Token ID460
Feature activation-1.62e-02

Pos logit contributions
astern+0.043
cry+0.037
blast+0.036
forcement+0.036
rots+0.035
Neg logit contributions
 LGBT-0.034
gee-0.033
 homosexuals-0.032
 POS-0.032
nel-0.032
Token undergo
Token ID17777
Feature activation+5.87e-02

Pos logit contributions
astern+0.014
cry+0.012
blast+0.011
forcement+0.011
ensen+0.011
Neg logit contributions
 LGBT-0.011
gee-0.011
 homosexuals-0.011
 POS-0.011
 interf-0.011
Token medically
Token ID35031
Feature activation-1.14e-03

Pos logit contributions
 LGBT+0.033
gee+0.032
 homosexuals+0.031
nel+0.031
 POS+0.031
Neg logit contributions
astern-0.058
cry-0.051
blast-0.049
forcement-0.049
rots-0.048
Token-
Token ID12
Feature activation-7.01e-02

Pos logit contributions
astern+0.001
cry+0.001
blast+0.001
forcement+0.001
rots+0.001
Neg logit contributions
 LGBT-0.001
gee-0.001
 homosexuals-0.001
nel-0.001
 POS-0.001
Token hours
Token ID2250
Feature activation-6.58e-02

Pos logit contributions
astern+0.048
cry+0.041
forcement+0.040
blast+0.039
rots+0.038
Neg logit contributions
 LGBT-0.047
gee-0.046
 homosexuals-0.045
 POS-0.045
 interf-0.045
Token.
Token ID13
Feature activation-7.62e-03

Pos logit contributions
astern+0.061
cry+0.053
forcement+0.051
blast+0.051
rots+0.050
Neg logit contributions
 LGBT-0.041
gee-0.039
 homosexuals-0.039
 POS-0.039
 interf-0.039
Token In
Token ID554
Feature activation-2.57e-02

Pos logit contributions
astern+0.006
cry+0.005
blast+0.005
forcement+0.005
rots+0.005
Neg logit contributions
 LGBT-0.005
gee-0.005
 homosexuals-0.005
 POS-0.005
 interf-0.005
Token prolonged
Token ID20573
Feature activation+2.09e-02

Pos logit contributions
astern+0.022
cry+0.019
blast+0.018
forcement+0.018
rots+0.018
Neg logit contributions
 LGBT-0.018
gee-0.017
 homosexuals-0.017
 POS-0.017
 interf-0.017
Token d
Token ID288
Feature activation+2.91e-02

Pos logit contributions
 LGBT+0.016
gee+0.015
 homosexuals+0.015
 POS+0.015
 interf+0.015
Neg logit contributions
astern-0.017
cry-0.014
forcement-0.014
blast-0.014
rots-0.013
Tokenrough
Token ID740
Feature activation+3.46e-01

Pos logit contributions
 LGBT+0.013
gee+0.013
 homosexuals+0.012
 POS+0.012
nel+0.012
Neg logit contributions
astern-0.032
cry-0.028
forcement-0.028
blast-0.028
phabet-0.027
Tokents
Token ID912
Feature activation+1.09e-01

Pos logit contributions
 LGBT+0.231
gee+0.220
 homosexuals+0.218
 POS+0.217
nel+0.217
Neg logit contributions
astern-0.309
cry-0.265
blast-0.259
forcement-0.258
rots-0.251
Token,
Token ID11
Feature activation-4.96e-02

Pos logit contributions
 LGBT+0.071
gee+0.068
 homosexuals+0.068
 POS+0.067
 interf+0.067
Neg logit contributions
astern-0.098
cry-0.084
blast-0.082
forcement-0.082
rots-0.080
Token which
Token ID543
Feature activation+1.50e-02

Pos logit contributions
astern+0.043
cry+0.037
forcement+0.036
blast+0.036
rots+0.035
Neg logit contributions
 LGBT-0.034
gee-0.033
 homosexuals-0.033
 POS-0.032
 interf-0.032
Token are
Token ID389
Feature activation-2.77e-02

Pos logit contributions
 LGBT+0.011
gee+0.010
 homosexuals+0.010
 POS+0.010
 interf+0.010
Neg logit contributions
astern-0.013
cry-0.011
forcement-0.011
blast-0.011
rots-0.010
Token almost
Token ID2048
Feature activation+3.42e-02

Pos logit contributions
astern+0.024
cry+0.020
forcement+0.019
blast+0.019
rots+0.019
Neg logit contributions
 LGBT-0.020
gee-0.019
 homosexuals-0.019
 POS-0.019
 interf-0.019
Token That
Token ID1320
Feature activation+2.37e-02

Pos logit contributions
 LGBT+0.017
gee+0.016
 POS+0.016
 homosexuals+0.016
nel+0.016
Neg logit contributions
astern-0.021
cry-0.018
forcement-0.018
blast-0.017
rots-0.017
Token was
Token ID373
Feature activation-2.13e-02

Pos logit contributions
 LGBT+0.018
gee+0.017
 POS+0.017
 homosexuals+0.017
 interf+0.017
Neg logit contributions
astern-0.019
cry-0.016
forcement-0.016
blast-0.016
rots-0.015
Token Rob
Token ID3851
Feature activation-4.47e-02

Pos logit contributions
astern+0.020
cry+0.017
blast+0.016
forcement+0.016
rots+0.016
Neg logit contributions
 LGBT-0.014
gee-0.013
 POS-0.013
 homosexuals-0.013
 interf-0.013
Token H
Token ID367
Feature activation-4.29e-02

Pos logit contributions
astern+0.038
cry+0.032
forcement+0.031
blast+0.031
rots+0.031
Neg logit contributions
 LGBT-0.032
gee-0.031
 homosexuals-0.030
 POS-0.030
 interf-0.030
Tokenoe
Token ID2577
Feature activation-1.20e-01

Pos logit contributions
astern+0.035
cry+0.030
forcement+0.029
blast+0.029
rots+0.028
Neg logit contributions
 LGBT-0.032
gee-0.030
 homosexuals-0.030
 POS-0.030
 interf-0.030
Tokengee
Token ID29622
Feature activation+3.46e-01

Pos logit contributions
astern+0.098
cry+0.083
forcement+0.081
blast+0.081
rots+0.079
Neg logit contributions
 LGBT-0.088
gee-0.085
 homosexuals-0.084
 POS-0.084
 interf-0.083
Token's
Token ID338
Feature activation-1.44e-02

Pos logit contributions
 LGBT+0.273
gee+0.263
 interf+0.260
 homosexuals+0.259
 POS+0.259
Neg logit contributions
astern-0.269
cry-0.224
blast-0.216
forcement-0.215
rots-0.211
Token episode
Token ID4471
Feature activation+4.35e-02

Pos logit contributions
astern+0.011
cry+0.009
forcement+0.008
blast+0.008
rots+0.008
Neg logit contributions
 LGBT-0.012
gee-0.011
 homosexuals-0.011
 POS-0.011
 interf-0.011
Token and
Token ID290
Feature activation+1.14e-04

Pos logit contributions
 LGBT+0.029
gee+0.027
 homosexuals+0.027
 POS+0.027
 interf+0.027
Neg logit contributions
astern-0.039
cry-0.033
blast-0.033
forcement-0.033
rots-0.032
Token I
Token ID314
Feature activation+3.21e-03

Pos logit contributions
 LGBT+0.000
gee+0.000
 homosexuals+0.000
 POS+0.000
 interf+0.000
Neg logit contributions
astern-0.000
cry-0.000
forcement-0.000
blast-0.000
rots-0.000
Token think
Token ID892
Feature activation-2.01e-02

Pos logit contributions
 LGBT+0.002
gee+0.002
 homosexuals+0.002
 POS+0.002
 interf+0.002
Neg logit contributions
astern-0.003
cry-0.002
forcement-0.002
blast-0.002
rots-0.002
TokenA
Token ID32
Feature activation-3.93e-03

Pos logit contributions
 LGBT+0.019
gee+0.018
 homosexuals+0.018
 POS+0.018
nel+0.018
Neg logit contributions
astern-0.019
cry-0.015
forcement-0.015
blast-0.015
ensen-0.015
Token gentleman
Token ID22427
Feature activation+9.05e-02

Pos logit contributions
astern+0.004
cry+0.003
forcement+0.003
blast+0.003
rots+0.003
Neg logit contributions
 LGBT-0.002
gee-0.002
 homosexuals-0.002
 POS-0.002
nel-0.002
Token of
Token ID286
Feature activation+3.46e-02

Pos logit contributions
 LGBT+0.055
gee+0.052
 homosexuals+0.052
 POS+0.052
 interf+0.051
Neg logit contributions
astern-0.086
cry-0.075
blast-0.073
forcement-0.073
rots-0.071
Token genuine
Token ID8768
Feature activation+1.05e-02

Pos logit contributions
 LGBT+0.023
gee+0.022
 homosexuals+0.022
 POS+0.022
 interf+0.022
Neg logit contributions
astern-0.030
cry-0.026
forcement-0.025
blast-0.025
rots-0.025
Token polit
Token ID1408
Feature activation-3.44e-02

Pos logit contributions
 LGBT+0.008
gee+0.007
 homosexuals+0.007
 POS+0.007
 interf+0.007
Neg logit contributions
astern-0.009
cry-0.007
forcement-0.007
blast-0.007
rots-0.007
Tokeneness
Token ID9449
Feature activation+3.45e-01

Pos logit contributions
astern+0.045
cry+0.041
forcement+0.040
blast+0.040
rots+0.040
Neg logit contributions
 LGBT-0.008
gee-0.007
 homosexuals-0.007
 POS-0.007
 interf-0.007
Token will
Token ID481
Feature activation+4.38e-02

Pos logit contributions
 LGBT+0.219
gee+0.209
 homosexuals+0.207
 POS+0.206
 interf+0.205
Neg logit contributions
astern-0.319
cry-0.275
forcement-0.268
blast-0.268
rots-0.262
Token not
Token ID407
Feature activation+3.15e-02

Pos logit contributions
 LGBT+0.030
gee+0.029
 homosexuals+0.028
 POS+0.028
 interf+0.028
Neg logit contributions
astern-0.038
cry-0.033
blast-0.032
forcement-0.032
rots-0.031
Token give
Token ID1577
Feature activation+4.51e-03

Pos logit contributions
 LGBT+0.024
gee+0.023
 homosexuals+0.022
 POS+0.022
 interf+0.022
Neg logit contributions
astern-0.026
cry-0.022
forcement-0.021
blast-0.021
rots-0.020
Token all
Token ID477
Feature activation+3.82e-02

Pos logit contributions
 LGBT+0.003
gee+0.003
 homosexuals+0.003
 POS+0.003
 interf+0.003
Neg logit contributions
astern-0.004
cry-0.003
forcement-0.003
blast-0.003
rots-0.003
Token his
Token ID465
Feature activation-6.36e-03

Pos logit contributions
 LGBT+0.027
 homosexuals+0.026
gee+0.026
 POS+0.026
 interf+0.025
Neg logit contributions
astern-0.033
cry-0.028
forcement-0.027
blast-0.027
ensen-0.026
Token (
Token ID357
Feature activation-5.52e-02

Pos logit contributions
astern+0.014
cry+0.012
forcement+0.012
blast+0.012
rots+0.011
Neg logit contributions
 LGBT-0.011
gee-0.011
 homosexuals-0.011
 POS-0.011
 interf-0.011
TokenR
Token ID49
Feature activation-2.81e-02

Pos logit contributions
astern+0.048
cry+0.041
forcement+0.040
blast+0.040
rots+0.039
Neg logit contributions
 LGBT-0.038
gee-0.036
 homosexuals-0.036
 POS-0.036
nel-0.036
Token-
Token ID12
Feature activation-2.15e-02

Pos logit contributions
astern+0.023
cry+0.020
forcement+0.019
blast+0.019
rots+0.018
Neg logit contributions
 LGBT-0.021
gee-0.020
 homosexuals-0.020
 POS-0.020
 interf-0.019
TokenS
Token ID50
Feature activation-8.26e-03

Pos logit contributions
astern+0.019
cry+0.016
forcement+0.016
blast+0.016
rots+0.016
Neg logit contributions
 LGBT-0.014
gee-0.014
 homosexuals-0.014
 POS-0.013
nel-0.013
Tokenhen
Token ID831
Feature activation+1.72e-01

Pos logit contributions
astern+0.007
cry+0.006
blast+0.006
forcement+0.006
phabet+0.006
Neg logit contributions
 LGBT-0.005
gee-0.005
nel-0.005
 POS-0.005
 homosexuals-0.005
Tokenando
Token ID25440
Feature activation+3.45e-01

Pos logit contributions
 LGBT+0.036
gee+0.031
 homosexuals+0.030
 POS+0.029
nel+0.029
Neg logit contributions
astern-0.232
cry-0.210
blast-0.206
forcement-0.206
rots-0.204
Tokenah
Token ID993
Feature activation+4.73e-02

Pos logit contributions
 LGBT+0.192
gee+0.181
 homosexuals+0.179
 POS+0.178
 interf+0.177
Neg logit contributions
astern-0.346
cry-0.302
forcement-0.295
blast-0.295
rots-0.289
Token)
Token ID8
Feature activation+8.61e-06

Pos logit contributions
 LGBT+0.035
gee+0.034
 homosexuals+0.034
 POS+0.034
 interf+0.033
Neg logit contributions
astern-0.038
cry-0.032
blast-0.031
forcement-0.031
rots-0.031
Token speaks
Token ID9209
Feature activation+7.19e-03

Feature not active on prev token; no predictions were affected.
Token on
Token ID319
Feature activation+3.07e-02

Pos logit contributions
 LGBT+0.005
gee+0.004
 homosexuals+0.004
 POS+0.004
 interf+0.004
Neg logit contributions
astern-0.007
cry-0.006
forcement-0.006
blast-0.006
rots-0.005
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
 LGBT+0.022
 homosexuals+0.020
gee+0.020
 POS+0.020
 interf+0.020
Neg logit contributions
astern-0.027
cry-0.023
forcement-0.022
blast-0.022
rots-0.022
TokenMind
Token ID28478
Feature activation-1.78e-01

Pos logit contributions
astern+0.027
cry+0.022
forcement+0.022
blast+0.021
rots+0.021
Neg logit contributions
 LGBT-0.026
gee-0.025
 homosexuals-0.025
 POS-0.025
nel-0.025
Tokenclaw
Token ID43143
Feature activation+2.98e-02

Pos logit contributions
astern+0.136
cry+0.112
forcement+0.108
blast+0.107
ensen+0.106
Neg logit contributions
 LGBT-0.141
gee-0.136
 homosexuals-0.135
nel-0.134
 POS-0.134
Token Shaman
Token ID34805
Feature activation-1.52e-01

Pos logit contributions
 LGBT+0.021
gee+0.020
 homosexuals+0.020
 POS+0.020
nel+0.020
Neg logit contributions
astern-0.025
cry-0.021
forcement-0.021
blast-0.021
rots-0.020
Token vs
Token ID3691
Feature activation-3.77e-02

Pos logit contributions
astern+0.123
cry+0.104
blast+0.101
forcement+0.101
rots+0.098
Neg logit contributions
 LGBT-0.114
gee-0.109
 homosexuals-0.109
 POS-0.108
 interf-0.108
Token Pyro
Token ID44954
Feature activation-1.16e-01

Pos logit contributions
astern+0.032
cry+0.027
forcement+0.027
blast+0.027
rots+0.026
Neg logit contributions
 LGBT-0.027
gee-0.025
 homosexuals-0.025
 POS-0.025
 interf-0.025
Tokenblast
Token ID39806
Feature activation-5.18e-01

Pos logit contributions
astern+0.048
cry+0.033
forcement+0.031
blast+0.029
rots+0.029
Neg logit contributions
 LGBT-0.132
gee-0.129
 homosexuals-0.128
 POS-0.128
nel-0.128
Token
Token ID198
Feature activation-3.31e-02

Pos logit contributions
astern+0.488
cry+0.426
blast+0.416
forcement+0.414
rots+0.403
Neg logit contributions
 LGBT-0.317
gee-0.302
 homosexuals-0.301
 POS-0.298
 interf-0.296
Token
Token ID198
Feature activation-4.01e-02

Pos logit contributions
astern+0.033
cry+0.029
forcement+0.028
blast+0.028
rots+0.028
Neg logit contributions
 LGBT-0.018
gee-0.017
 homosexuals-0.017
 POS-0.017
nel-0.017
TokenPrim
Token ID23828
Feature activation-1.67e-02

Pos logit contributions
astern+0.031
cry+0.026
forcement+0.025
blast+0.025
rots+0.025
Neg logit contributions
 LGBT-0.031
gee-0.030
 homosexuals-0.030
 POS-0.030
nel-0.030
Tokenal
Token ID282
Feature activation-6.00e-02

Pos logit contributions
astern+0.013
cry+0.011
forcement+0.010
blast+0.010
rots+0.010
Neg logit contributions
 LGBT-0.013
gee-0.013
 homosexuals-0.013
 POS-0.013
nel-0.013
Token Druid
Token ID32685
Feature activation-7.14e-02

Pos logit contributions
astern+0.056
cry+0.048
forcement+0.047
blast+0.046
rots+0.046
Neg logit contributions
 LGBT-0.038
gee-0.036
 homosexuals-0.036
 POS-0.036
nel-0.036
Tokenyard
Token ID9413
Feature activation-1.47e-01

Pos logit contributions
astern+0.008
cry+0.006
forcement+0.006
blast+0.006
rots+0.006
Neg logit contributions
 LGBT-0.015
gee-0.015
 POS-0.014
 homosexuals-0.014
nel-0.014
Token sprint
Token ID18553
Feature activation+1.13e-02

Pos logit contributions
astern+0.110
cry+0.091
forcement+0.089
blast+0.087
rots+0.086
Neg logit contributions
 LGBT-0.120
gee-0.115
 POS-0.114
 homosexuals-0.114
 interf-0.114
Token,
Token ID11
Feature activation+8.03e-03

Pos logit contributions
 LGBT+0.008
gee+0.007
 POS+0.007
nel+0.007
 homosexuals+0.007
Neg logit contributions
astern-0.010
cry-0.008
forcement-0.008
blast-0.008
rots-0.008
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
 LGBT+0.006
gee+0.005
 homosexuals+0.005
 POS+0.005
 interf+0.005
Neg logit contributions
astern-0.007
cry-0.006
forcement-0.006
blast-0.006
rots-0.006
Token McF
Token ID38222
Feature activation+5.68e-02

Pos logit contributions
astern+0.011
cry+0.010
forcement+0.010
blast+0.009
rots+0.009
Neg logit contributions
 LGBT-0.009
gee-0.008
 homosexuals-0.008
 POS-0.008
 interf-0.008
Tokenadden
Token ID38014
Feature activation-4.22e-01

Pos logit contributions
 LGBT+0.069
gee+0.068
 homosexuals+0.067
 POS+0.067
nel+0.067
Neg logit contributions
astern-0.019
cry-0.012
forcement-0.011
blast-0.011
rots-0.010
Token also
Token ID635
Feature activation+1.14e-02

Pos logit contributions
astern+0.347
cry+0.294
forcement+0.285
blast+0.285
rots+0.278
Neg logit contributions
 LGBT-0.312
gee-0.298
 homosexuals-0.296
 POS-0.296
 interf-0.296
Token threw
Token ID9617
Feature activation+2.02e-02

Pos logit contributions
 LGBT+0.009
gee+0.008
 homosexuals+0.008
 interf+0.008
 POS+0.008
Neg logit contributions
astern-0.009
cry-0.008
forcement-0.008
blast-0.008
rots-0.007
Token for
Token ID329
Feature activation+9.67e-03

Pos logit contributions
 LGBT+0.013
gee+0.012
 homosexuals+0.012
 POS+0.012
 interf+0.012
Neg logit contributions
astern-0.019
cry-0.016
forcement-0.016
blast-0.016
rots-0.015
Token one
Token ID530
Feature activation-5.32e-02

Pos logit contributions
 LGBT+0.006
gee+0.006
 homosexuals+0.006
 POS+0.006
 interf+0.006
Neg logit contributions
astern-0.009
cry-0.008
forcement-0.008
blast-0.008
rots-0.007
Token touchdown
Token ID10242
Feature activation-7.00e-04

Pos logit contributions
astern+0.043
cry+0.036
blast+0.035
forcement+0.035
rots+0.034
Neg logit contributions
 LGBT-0.040
gee-0.039
 homosexuals-0.038
 POS-0.038
 interf-0.038
Token social
Token ID1919
Feature activation-4.61e-02

Pos logit contributions
 LGBT+0.006
gee+0.006
 POS+0.006
 homosexuals+0.006
 interf+0.006
Neg logit contributions
astern-0.008
cry-0.007
forcement-0.007
blast-0.007
rots-0.007
Token justice
Token ID5316
Feature activation-1.86e-02

Pos logit contributions
astern+0.038
cry+0.033
blast+0.032
forcement+0.032
rots+0.031
Neg logit contributions
 LGBT-0.034
gee-0.032
 homosexuals-0.032
 POS-0.032
 interf-0.032
Token issue
Token ID2071
Feature activation+1.74e-02

Pos logit contributions
astern+0.016
cry+0.013
forcement+0.013
blast+0.013
rots+0.013
Neg logit contributions
 LGBT-0.013
gee-0.013
 homosexuals-0.013
 POS-0.013
 interf-0.013
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
 LGBT+0.012
gee+0.011
 homosexuals+0.011
 POS+0.011
 interf+0.011
Neg logit contributions
astern-0.016
cry-0.013
blast-0.013
forcement-0.013
rots-0.013
Token Athe
Token ID27751
Feature activation+6.79e-02

Pos logit contributions
astern+0.011
cry+0.009
forcement+0.009
blast+0.009
rots+0.009
Neg logit contributions
 LGBT-0.008
gee-0.008
 homosexuals-0.008
 POS-0.008
 interf-0.008
Tokenists
Token ID1023
Feature activation-4.08e-01

Pos logit contributions
 LGBT+0.048
gee+0.046
 homosexuals+0.045
 POS+0.045
nel+0.045
Neg logit contributions
astern-0.058
cry-0.050
forcement-0.048
blast-0.048
rots-0.047
Token are
Token ID389
Feature activation+1.43e-02

Pos logit contributions
astern+0.355
cry+0.304
blast+0.295
forcement+0.295
rots+0.288
Neg logit contributions
 LGBT-0.279
gee-0.268
 homosexuals-0.266
 POS-0.264
nel-0.263
Token oppressed
Token ID25765
Feature activation-2.89e-02

Pos logit contributions
 LGBT+0.009
gee+0.009
 homosexuals+0.009
 POS+0.009
 interf+0.009
Neg logit contributions
astern-0.013
cry-0.011
forcement-0.011
blast-0.011
rots-0.010
Token and
Token ID290
Feature activation+6.96e-03

Pos logit contributions
astern+0.026
cry+0.022
forcement+0.021
blast+0.021
ensen+0.021
Neg logit contributions
 LGBT-0.020
 homosexuals-0.019
gee-0.019
 POS-0.019
 interf-0.018
Token persecuted
Token ID37501
Feature activation+1.28e-01

Pos logit contributions
 LGBT+0.005
gee+0.004
 homosexuals+0.004
 POS+0.004
 interf+0.004
Neg logit contributions
astern-0.006
cry-0.005
forcement-0.005
blast-0.005
rots-0.005
Token the
Token ID262
Feature activation+9.86e-03

Pos logit contributions
 LGBT+0.090
 homosexuals+0.085
gee+0.085
 POS+0.084
 interf+0.084
Neg logit contributions
astern-0.111
cry-0.094
blast-0.092
forcement-0.092
phabet-0.089
Token Nut
Token ID11959
Feature activation+8.95e-02

Pos logit contributions
 LGBT+0.100
gee+0.096
 homosexuals+0.095
 POS+0.095
nel+0.094
Neg logit contributions
astern-0.123
cry-0.105
forcement-0.102
blast-0.102
rots-0.100
Tokent
Token ID83
Feature activation-6.17e-02

Pos logit contributions
 LGBT+0.069
gee+0.066
 homosexuals+0.066
 POS+0.066
nel+0.065
Neg logit contributions
astern-0.071
cry-0.059
forcement-0.057
blast-0.057
rots-0.056
Token campaigned
Token ID32853
Feature activation+4.67e-03

Pos logit contributions
astern+0.049
cry+0.041
blast+0.040
forcement+0.039
phabet+0.038
Neg logit contributions
 LGBT-0.048
gee-0.046
 interf-0.045
nel-0.045
 homosexuals-0.045
Token for
Token ID329
Feature activation+2.42e-02

Pos logit contributions
 LGBT+0.003
 homosexuals+0.003
gee+0.003
 POS+0.003
 interf+0.003
Neg logit contributions
astern-0.004
cry-0.004
blast-0.004
forcement-0.004
rots-0.003
Token McF
Token ID38222
Feature activation+8.24e-02

Pos logit contributions
 LGBT+0.016
gee+0.015
 homosexuals+0.015
 POS+0.015
 interf+0.015
Neg logit contributions
astern-0.022
cry-0.019
blast-0.018
forcement-0.018
rots-0.018
Tokenadden
Token ID38014
Feature activation-3.99e-01

Pos logit contributions
 LGBT+0.101
gee+0.098
 homosexuals+0.098
 POS+0.098
 interf+0.098
Neg logit contributions
astern-0.027
cry-0.017
forcement-0.015
blast-0.015
rots-0.014
Token in
Token ID287
Feature activation+8.08e-03

Pos logit contributions
astern+0.366
cry+0.316
blast+0.308
forcement+0.307
rots+0.301
Neg logit contributions
 LGBT-0.256
gee-0.243
 homosexuals-0.241
 POS-0.241
 interf-0.239
Token the
Token ID262
Feature activation+9.32e-03

Pos logit contributions
 LGBT+0.006
gee+0.005
 homosexuals+0.005
 POS+0.005
 interf+0.005
Neg logit contributions
astern-0.007
cry-0.006
blast-0.006
forcement-0.006
rots-0.006
Token Heisman
Token ID49537
Feature activation+1.27e-02

Pos logit contributions
 LGBT+0.007
gee+0.007
 homosexuals+0.007
 POS+0.007
 interf+0.007
Neg logit contributions
astern-0.008
cry-0.006
blast-0.006
forcement-0.006
rots-0.006
Token Trophy
Token ID25640
Feature activation-4.73e-02

Pos logit contributions
 LGBT+0.010
gee+0.010
 POS+0.010
 homosexuals+0.010
 interf+0.010
Neg logit contributions
astern-0.010
cry-0.008
blast-0.008
forcement-0.008
rots-0.008
Token race
Token ID3234
Feature activation-8.07e-02

Pos logit contributions
astern+0.037
cry+0.031
blast+0.030
forcement+0.030
rots+0.029
Neg logit contributions
 LGBT-0.037
 POS-0.035
gee-0.035
 interf-0.035
 homosexuals-0.035
Token
Token ID247
Feature activation+4.37e-02

Pos logit contributions
 LGBT+0.036
gee+0.034
 homosexuals+0.034
 POS+0.034
nel+0.034
Neg logit contributions
astern-0.033
cry-0.027
blast-0.026
forcement-0.026
rots-0.026
Token 48
Token ID4764
Feature activation-2.34e-02

Pos logit contributions
 LGBT+0.033
gee+0.031
 homosexuals+0.031
 POS+0.031
nel+0.031
Neg logit contributions
astern-0.036
cry-0.030
blast-0.029
forcement-0.029
rots-0.028
Token points
Token ID2173
Feature activation-3.96e-02

Pos logit contributions
astern+0.021
cry+0.018
forcement+0.018
blast+0.018
rots+0.017
Neg logit contributions
 LGBT-0.015
gee-0.015
 homosexuals-0.015
 POS-0.015
 interf-0.014
Token,
Token ID11
Feature activation+7.20e-02

Pos logit contributions
astern+0.035
cry+0.030
forcement+0.029
blast+0.029
rots+0.028
Neg logit contributions
 LGBT-0.027
 homosexuals-0.026
 POS-0.025
gee-0.025
 interf-0.025
Token McF
Token ID38222
Feature activation+7.69e-02

Pos logit contributions
 LGBT+0.048
gee+0.045
 homosexuals+0.045
 POS+0.045
 interf+0.044
Neg logit contributions
astern-0.065
cry-0.056
forcement-0.055
blast-0.055
rots-0.053
Tokenadden
Token ID38014
Feature activation-3.95e-01

Pos logit contributions
 LGBT+0.088
gee+0.086
 homosexuals+0.085
 POS+0.085
nel+0.085
Neg logit contributions
astern-0.032
cry-0.022
forcement-0.021
blast-0.021
rots-0.019
Token only
Token ID691
Feature activation-4.27e-03

Pos logit contributions
astern+0.323
cry+0.273
blast+0.266
forcement+0.265
rots+0.258
Neg logit contributions
 LGBT-0.293
gee-0.280
 POS-0.279
 homosexuals-0.278
 interf-0.277
Token accounted
Token ID17830
Feature activation-2.86e-02

Pos logit contributions
astern+0.004
cry+0.003
forcement+0.003
blast+0.003
phabet+0.003
Neg logit contributions
 LGBT-0.003
 POS-0.003
gee-0.003
 interf-0.003
 homosexuals-0.003
Token for
Token ID329
Feature activation-1.79e-03

Pos logit contributions
astern+0.026
cry+0.023
blast+0.022
forcement+0.022
rots+0.022
Neg logit contributions
 LGBT-0.018
gee-0.017
 POS-0.017
 homosexuals-0.017
 interf-0.017
Token one
Token ID530
Feature activation-2.79e-02

Pos logit contributions
astern+0.002
cry+0.001
forcement+0.001
blast+0.001
rots+0.001
Neg logit contributions
 LGBT-0.001
gee-0.001
 homosexuals-0.001
 POS-0.001
 interf-0.001
Token rushing
Token ID15795
Feature activation-5.75e-02

Pos logit contributions
astern+0.024
cry+0.021
forcement+0.020
blast+0.020
rots+0.020
Neg logit contributions
 LGBT-0.019
gee-0.018
 homosexuals-0.018
 POS-0.018
 interf-0.018
Token like
Token ID588
Feature activation+3.96e-02

Pos logit contributions
astern+0.060
cry+0.051
blast+0.050
forcement+0.050
rots+0.049
Neg logit contributions
 LGBT-0.044
gee-0.042
 homosexuals-0.042
 POS-0.042
 interf-0.042
Token the
Token ID262
Feature activation+3.08e-02

Pos logit contributions
 LGBT+0.027
gee+0.026
 POS+0.025
 homosexuals+0.025
 interf+0.025
Neg logit contributions
astern-0.035
cry-0.030
blast-0.029
forcement-0.029
rots-0.028
Token Purch
Token ID34459
Feature activation+3.54e-02

Pos logit contributions
 LGBT+0.022
 POS+0.021
gee+0.021
 homosexuals+0.021
 interf+0.020
Neg logit contributions
astern-0.026
cry-0.022
forcement-0.022
blast-0.022
rots-0.021
Tokenasing
Token ID2313
Feature activation-1.81e-01

Pos logit contributions
 LGBT+0.029
gee+0.028
 homosexuals+0.028
 POS+0.028
nel+0.028
Neg logit contributions
astern-0.026
cry-0.022
forcement-0.021
blast-0.021
rots-0.020
Token Man
Token ID1869
Feature activation-7.56e-02

Pos logit contributions
astern+0.121
cry+0.099
forcement+0.095
blast+0.095
rots+0.092
Neg logit contributions
 LGBT-0.161
gee-0.155
 homosexuals-0.154
 POS-0.154
 interf-0.152
Tokenagers
Token ID10321
Feature activation-3.82e-01

Pos logit contributions
astern+0.030
cry+0.020
forcement+0.019
blast+0.019
rots+0.018
Neg logit contributions
 LGBT-0.087
nel-0.085
gee-0.085
 POS-0.085
 homosexuals-0.085
Token
Token ID447
Feature activation-1.37e-02

Pos logit contributions
astern+0.317
cry+0.269
forcement+0.261
blast+0.261
rots+0.255
Neg logit contributions
 LGBT-0.278
gee-0.265
 POS-0.264
 homosexuals-0.264
 interf-0.262
Token
Token ID247
Feature activation+1.35e-02

Pos logit contributions
astern+0.010
cry+0.009
blast+0.008
forcement+0.008
rots+0.008
Neg logit contributions
 LGBT-0.011
gee-0.011
 POS-0.010
 homosexuals-0.010
nel-0.010
Token Ind
Token ID1423
Feature activation+3.57e-02

Pos logit contributions
 LGBT+0.010
gee+0.010
 POS+0.010
 homosexuals+0.010
nel+0.010
Neg logit contributions
astern-0.011
cry-0.009
blast-0.009
forcement-0.009
rots-0.008
Tokenices
Token ID1063
Feature activation-1.28e-02

Pos logit contributions
 LGBT+0.020
gee+0.019
 POS+0.019
 homosexuals+0.019
 interf+0.018
Neg logit contributions
astern-0.035
cry-0.031
forcement-0.030
blast-0.030
rots-0.030
Token.
Token ID13
Feature activation+4.39e-03

Pos logit contributions
astern+0.011
cry+0.010
blast+0.009
forcement+0.009
rots+0.009
Neg logit contributions
 LGBT-0.009
 POS-0.008
gee-0.008
 homosexuals-0.008
 interf-0.008
Token E
Token ID412
Feature activation-7.58e-02

Pos logit contributions
astern+0.015
cry+0.013
blast+0.012
forcement+0.012
rots+0.012
Neg logit contributions
 LGBT-0.010
gee-0.009
 homosexuals-0.009
 POS-0.009
 interf-0.009
Token pen
Token ID3112
Feature activation-1.88e-01

Pos logit contributions
astern+0.057
cry+0.048
blast+0.046
forcement+0.046
rots+0.045
Neg logit contributions
 LGBT-0.061
gee-0.059
 homosexuals-0.058
 POS-0.058
 interf-0.058
Tokena
Token ID64
Feature activation-1.00e-01

Pos logit contributions
astern+0.167
cry+0.143
forcement+0.139
blast+0.139
rots+0.136
Neg logit contributions
 LGBT-0.126
gee-0.120
 homosexuals-0.120
 POS-0.119
 interf-0.119
Token mo
Token ID6941
Feature activation-1.95e-01

Pos logit contributions
astern+0.089
cry+0.077
forcement+0.075
blast+0.075
rots+0.073
Neg logit contributions
 LGBT-0.066
gee-0.063
 homosexuals-0.063
 POS-0.063
 interf-0.062
Tokenek
Token ID988
Feature activation-3.79e-02

Pos logit contributions
astern+0.176
cry+0.152
forcement+0.147
blast+0.147
rots+0.144
Neg logit contributions
 LGBT-0.129
gee-0.123
 homosexuals-0.122
 POS-0.121
 interf-0.120
Tokenahi
Token ID32810
Feature activation-3.71e-01

Pos logit contributions
astern+0.033
cry+0.029
forcement+0.028
blast+0.028
rots+0.027
Neg logit contributions
 LGBT-0.026
gee-0.025
 homosexuals-0.024
 POS-0.024
 interf-0.024
Token
Token ID220
Feature activation-1.07e-01

Pos logit contributions
astern+0.332
cry+0.286
forcement+0.278
blast+0.278
rots+0.272
Neg logit contributions
 LGBT-0.244
gee-0.233
 homosexuals-0.232
 POS-0.230
nel-0.229
Token
Token ID134
Feature activation+1.27e-02

Pos logit contributions
astern+0.106
cry+0.093
forcement+0.091
blast+0.091
rots+0.089
Neg logit contributions
 LGBT-0.060
gee-0.056
 homosexuals-0.056
 POS-0.056
 interf-0.055
Token
Token ID119
Feature activation-2.42e-02

Pos logit contributions
 LGBT+0.008
gee+0.008
 homosexuals+0.008
 POS+0.008
nel+0.008
Neg logit contributions
astern-0.012
cry-0.010
blast-0.010
forcement-0.010
rots-0.010
Tokenia
Token ID544
Feature activation+6.84e-02

Pos logit contributions
astern+0.021
cry+0.018
blast+0.018
forcement+0.017
rots+0.017
Neg logit contributions
 LGBT-0.017
gee-0.016
 homosexuals-0.016
 POS-0.016
nel-0.016
Token n
Token ID299
Feature activation-1.25e-01

Pos logit contributions
 LGBT+0.046
gee+0.044
 homosexuals+0.043
 POS+0.043
 interf+0.043
Neg logit contributions
astern-0.061
cry-0.052
forcement-0.051
blast-0.051
rots-0.050
Token on
Token ID319
Feature activation-2.69e-02

Pos logit contributions
 LGBT+0.021
 homosexuals+0.020
 POS+0.020
gee+0.020
 interf+0.020
Neg logit contributions
astern-0.027
cry-0.023
blast-0.022
forcement-0.022
rots-0.022
Token the
Token ID262
Feature activation-3.09e-02

Pos logit contributions
astern+0.024
cry+0.020
blast+0.020
forcement+0.020
rots+0.019
Neg logit contributions
 LGBT-0.018
gee-0.017
 homosexuals-0.017
 POS-0.017
 interf-0.017
Token issue
Token ID2071
Feature activation-1.12e-02

Pos logit contributions
astern+0.025
cry+0.021
forcement+0.021
blast+0.021
rots+0.020
Neg logit contributions
 LGBT-0.023
gee-0.022
 homosexuals-0.022
 POS-0.022
 interf-0.022
Token on
Token ID319
Feature activation-1.62e-02

Pos logit contributions
astern+0.010
cry+0.009
blast+0.009
forcement+0.008
rots+0.008
Neg logit contributions
 LGBT-0.007
gee-0.007
 homosexuals-0.007
 POS-0.007
 interf-0.007
Token Kau
Token ID28148
Feature activation+7.22e-02

Pos logit contributions
astern+0.015
cry+0.013
blast+0.013
forcement+0.013
rots+0.012
Neg logit contributions
 LGBT-0.010
gee-0.010
 homosexuals-0.010
 POS-0.010
 interf-0.010
Tokenai
Token ID1872
Feature activation-3.49e-01

Pos logit contributions
 LGBT+0.066
gee+0.064
 homosexuals+0.064
 POS+0.063
 interf+0.063
Neg logit contributions
astern-0.046
cry-0.037
forcement-0.036
blast-0.036
rots-0.034
Token.
Token ID13
Feature activation+3.78e-03

Pos logit contributions
astern+0.317
cry+0.274
forcement+0.267
blast+0.267
rots+0.261
Neg logit contributions
 LGBT-0.227
gee-0.215
 homosexuals-0.214
 POS-0.213
 interf-0.212
Token
Token ID198
Feature activation+1.08e-02

Pos logit contributions
 LGBT+0.002
gee+0.002
 homosexuals+0.002
 POS+0.002
nel+0.002
Neg logit contributions
astern-0.003
cry-0.003
forcement-0.003
blast-0.003
rots-0.003
Token
Token ID198
Feature activation+5.34e-03

Pos logit contributions
 LGBT+0.006
gee+0.006
 homosexuals+0.006
 POS+0.006
 interf+0.006
Neg logit contributions
astern-0.011
cry-0.009
forcement-0.009
blast-0.009
rots-0.009
TokenState
Token ID9012
Feature activation-2.08e-02

Pos logit contributions
 LGBT+0.004
gee+0.004
 homosexuals+0.004
 POS+0.004
nel+0.004
Neg logit contributions
astern-0.004
cry-0.004
forcement-0.003
blast-0.003
rots-0.003
Token legislators
Token ID19964
Feature activation-2.15e-02

Pos logit contributions
astern+0.017
cry+0.015
blast+0.014
forcement+0.014
rots+0.014
Neg logit contributions
 LGBT-0.015
gee-0.014
 homosexuals-0.014
 POS-0.014
nel-0.014
Token to
Token ID284
Feature activation+5.81e-02

Pos logit contributions
astern+0.030
cry+0.026
forcement+0.025
blast+0.025
ensen+0.025
Neg logit contributions
 LGBT-0.022
 homosexuals-0.021
gee-0.021
 POS-0.021
 interf-0.021
Token 10
Token ID838
Feature activation+2.50e-02

Pos logit contributions
 LGBT+0.039
 homosexuals+0.037
gee+0.037
 POS+0.037
 interf+0.036
Neg logit contributions
astern-0.052
cry-0.044
forcement-0.043
blast-0.043
ensen-0.042
Token men
Token ID1450
Feature activation+1.18e-01

Pos logit contributions
 LGBT+0.016
gee+0.015
 homosexuals+0.015
 POS+0.015
 interf+0.015
Neg logit contributions
astern-0.023
cry-0.020
forcement-0.019
blast-0.019
rots-0.019
Token everything
Token ID2279
Feature activation+2.47e-02

Pos logit contributions
 LGBT+0.078
 homosexuals+0.074
gee+0.074
 POS+0.074
 interf+0.074
Neg logit contributions
astern-0.105
cry-0.090
forcement-0.088
blast-0.088
rots-0.086
Token cons
Token ID762
Feature activation-1.06e-02

Pos logit contributions
 LGBT+0.018
 homosexuals+0.017
gee+0.017
 POS+0.017
 interf+0.017
Neg logit contributions
astern-0.021
cry-0.017
forcement-0.017
blast-0.017
phabet-0.016
Tokenpired
Token ID6474
Feature activation-3.48e-01

Pos logit contributions
astern+0.006
cry+0.005
blast+0.005
forcement+0.005
phabet+0.005
Neg logit contributions
 LGBT-0.010
gee-0.010
 POS-0.010
 homosexuals-0.010
nel-0.010
Token against
Token ID1028
Feature activation-7.23e-02

Pos logit contributions
astern+0.321
cry+0.278
forcement+0.270
blast+0.270
rots+0.264
Neg logit contributions
 LGBT-0.220
gee-0.210
 homosexuals-0.208
 POS-0.207
 interf-0.206
Token us
Token ID514
Feature activation-1.09e-01

Pos logit contributions
astern+0.056
cry+0.047
forcement+0.045
blast+0.045
ensen+0.044
Neg logit contributions
 LGBT-0.057
gee-0.055
 homosexuals-0.055
 POS-0.054
 interf-0.054
Token,"
Token ID553
Feature activation+1.62e-02

Pos logit contributions
astern+0.099
cry+0.085
forcement+0.083
blast+0.083
rots+0.081
Neg logit contributions
 LGBT-0.071
gee-0.068
 homosexuals-0.067
 POS-0.067
 interf-0.066
Token said
Token ID531
Feature activation-2.23e-02

Pos logit contributions
 LGBT+0.012
gee+0.012
 homosexuals+0.012
 POS+0.012
nel+0.012
Neg logit contributions
astern-0.013
cry-0.011
blast-0.011
forcement-0.011
rots-0.010
Token Owen
Token ID22605
Feature activation-2.95e-02

Pos logit contributions
astern+0.018
cry+0.015
blast+0.015
forcement+0.015
ensen+0.014
Neg logit contributions
 LGBT-0.017
gee-0.016
 homosexuals-0.016
 POS-0.016
nel-0.016
Token Liberal
Token ID12266
Feature activation-1.87e-01

Pos logit contributions
astern+0.054
cry+0.046
forcement+0.044
blast+0.044
rots+0.043
Neg logit contributions
 LGBT-0.047
gee-0.045
 homosexuals-0.045
 interf-0.045
 POS-0.045
Token Activ
Token ID13144
Feature activation-6.31e-02

Pos logit contributions
astern+0.154
cry+0.130
forcement+0.126
blast+0.126
rots+0.123
Neg logit contributions
 LGBT-0.137
gee-0.132
 homosexuals-0.131
 POS-0.130
 interf-0.129
Tokenists
Token ID1023
Feature activation-9.22e-02

Pos logit contributions
astern+0.042
cry+0.034
forcement+0.033
blast+0.033
ensen+0.031
Neg logit contributions
 LGBT-0.057
gee-0.055
 homosexuals-0.054
 POS-0.054
nel-0.054
Token Than
Token ID17924
Feature activation-3.77e-02

Pos logit contributions
astern+0.083
cry+0.071
forcement+0.069
blast+0.069
rots+0.068
Neg logit contributions
 LGBT-0.061
gee-0.058
 homosexuals-0.058
 POS-0.058
nel-0.057
Token Man
Token ID1869
Feature activation-9.49e-02

Pos logit contributions
astern+0.032
cry+0.028
forcement+0.027
blast+0.027
rots+0.026
Neg logit contributions
 LGBT-0.026
gee-0.025
 homosexuals-0.025
 POS-0.025
 interf-0.025
Tokenagers
Token ID10321
Feature activation-3.47e-01

Pos logit contributions
astern+0.072
cry+0.060
forcement+0.058
blast+0.058
ensen+0.056
Neg logit contributions
 LGBT-0.076
gee-0.073
 homosexuals-0.073
 POS-0.073
nel-0.072
Token for
Token ID329
Feature activation-4.66e-03

Pos logit contributions
astern+0.309
cry+0.265
blast+0.258
forcement+0.258
rots+0.252
Neg logit contributions
 LGBT-0.232
gee-0.221
 homosexuals-0.219
 POS-0.219
nel-0.217
Token City
Token ID2254
Feature activation-1.13e-01

Pos logit contributions
astern+0.004
cry+0.003
forcement+0.003
blast+0.003
rots+0.003
Neg logit contributions
 LGBT-0.003
gee-0.003
 homosexuals-0.003
 POS-0.003
 interf-0.003
Token Posts
Token ID12043
Feature activation-3.13e-02

Pos logit contributions
astern+0.085
cry+0.071
blast+0.069
forcement+0.068
rots+0.067
Neg logit contributions
 LGBT-0.091
gee-0.087
 POS-0.087
 homosexuals-0.087
nel-0.086
Token.
Token ID13
Feature activation+5.33e-02

Pos logit contributions
astern+0.028
cry+0.024
blast+0.023
forcement+0.023
rots+0.023
Neg logit contributions
 LGBT-0.021
gee-0.020
 POS-0.020
 homosexuals-0.020
nel-0.019
Token
Token ID447
Feature activation-7.63e-03

Pos logit contributions
 LGBT+0.037
gee+0.035
 POS+0.035
 homosexuals+0.035
nel+0.035
Neg logit contributions
astern-0.046
cry-0.040
forcement-0.039
blast-0.039
rots-0.038
Token if
Token ID611
Feature activation-4.80e-02

Pos logit contributions
astern+0.150
cry+0.130
forcement+0.126
blast+0.126
rots+0.123
Neg logit contributions
 LGBT-0.108
gee-0.102
 homosexuals-0.102
 POS-0.102
 interf-0.101
Token there
Token ID612
Feature activation-7.01e-02

Pos logit contributions
astern+0.043
cry+0.037
blast+0.036
forcement+0.036
rots+0.035
Neg logit contributions
 LGBT-0.032
gee-0.031
 homosexuals-0.030
 POS-0.030
 interf-0.030
Token are
Token ID389
Feature activation-1.68e-02

Pos logit contributions
astern+0.060
cry+0.052
forcement+0.050
blast+0.050
rots+0.049
Neg logit contributions
 LGBT-0.049
gee-0.047
 homosexuals-0.046
 POS-0.046
 interf-0.046
Token no
Token ID645
Feature activation-2.81e-02

Pos logit contributions
astern+0.015
cry+0.013
forcement+0.013
blast+0.013
ensen+0.012
Neg logit contributions
 LGBT-0.011
gee-0.010
 homosexuals-0.010
 POS-0.010
nel-0.010
Token que
Token ID8358
Feature activation-3.07e-02

Pos logit contributions
astern+0.024
cry+0.020
blast+0.019
forcement+0.019
rots+0.019
Neg logit contributions
 LGBT-0.020
gee-0.019
 homosexuals-0.019
 POS-0.019
 interf-0.019
Tokenuing
Token ID4250
Feature activation-3.45e-01

Pos logit contributions
astern+0.024
cry+0.020
blast+0.020
forcement+0.019
rots+0.019
Neg logit contributions
 LGBT-0.024
gee-0.023
 homosexuals-0.023
 POS-0.023
 interf-0.023
Token delays
Token ID16119
Feature activation+1.82e-02

Pos logit contributions
astern+0.279
cry+0.237
blast+0.229
forcement+0.229
rots+0.223
Neg logit contributions
 LGBT-0.259
gee-0.248
 homosexuals-0.247
 POS-0.247
 interf-0.245
Token and
Token ID290
Feature activation-2.16e-02

Pos logit contributions
 LGBT+0.012
gee+0.011
 homosexuals+0.011
 POS+0.011
 interf+0.011
Neg logit contributions
astern-0.016
cry-0.014
blast-0.014
forcement-0.014
rots-0.013
Token no
Token ID645
Feature activation-5.43e-02

Pos logit contributions
astern+0.019
cry+0.016
blast+0.016
forcement+0.016
rots+0.016
Neg logit contributions
 LGBT-0.015
gee-0.014
 homosexuals-0.014
 POS-0.014
 interf-0.014
Token processing
Token ID7587
Feature activation-5.34e-02

Pos logit contributions
astern+0.046
cry+0.039
blast+0.038
forcement+0.038
rots+0.037
Neg logit contributions
 LGBT-0.039
gee-0.037
 homosexuals-0.037
 POS-0.037
 interf-0.037
Token delays
Token ID16119
Feature activation+5.12e-02

Pos logit contributions
astern+0.044
cry+0.037
forcement+0.036
blast+0.036
rots+0.035
Neg logit contributions
 LGBT-0.039
gee-0.038
 homosexuals-0.037
 POS-0.037
 interf-0.037
Token this
Token ID428
Feature activation-5.94e-03

Pos logit contributions
astern+0.002
cry+0.002
forcement+0.002
blast+0.002
rots+0.002
Neg logit contributions
 LGBT-0.002
 homosexuals-0.002
gee-0.002
 POS-0.002
 interf-0.002
Token Al
Token ID978
Feature activation-7.87e-02

Pos logit contributions
astern+0.005
cry+0.004
forcement+0.004
blast+0.004
rots+0.004
Neg logit contributions
 LGBT-0.004
gee-0.004
 POS-0.004
 homosexuals-0.004
 interf-0.004
Token J
Token ID449
Feature activation-1.79e-02

Pos logit contributions
astern+0.066
cry+0.055
forcement+0.054
blast+0.054
rots+0.053
Neg logit contributions
 LGBT-0.057
gee-0.055
 homosexuals-0.054
 POS-0.054
 interf-0.054
Tokenour
Token ID454
Feature activation-3.73e-02

Pos logit contributions
astern+0.012
cry+0.010
blast+0.010
forcement+0.010
rots+0.009
Neg logit contributions
 LGBT-0.016
gee-0.015
 homosexuals-0.015
 POS-0.015
 interf-0.015
Tokeng
Token ID70
Feature activation-3.77e-03

Pos logit contributions
astern+0.032
cry+0.027
blast+0.026
forcement+0.026
rots+0.026
Neg logit contributions
 LGBT-0.026
gee-0.025
 homosexuals-0.025
 POS-0.025
nel-0.025
Tokenensen
Token ID18756
Feature activation-3.44e-01

Pos logit contributions
astern+0.002
cry+0.001
forcement+0.001
blast+0.001
phabet+0.001
Neg logit contributions
 LGBT-0.004
gee-0.004
 homosexuals-0.004
 POS-0.004
nel-0.004
Token and
Token ID290
Feature activation+1.45e-02

Pos logit contributions
astern+0.266
cry+0.222
forcement+0.215
blast+0.215
rots+0.209
Neg logit contributions
 LGBT-0.271
gee-0.260
 homosexuals-0.258
 POS-0.258
 interf-0.257
Token Ge
Token ID2269
Feature activation-6.74e-02

Pos logit contributions
 LGBT+0.010
gee+0.010
 homosexuals+0.010
 POS+0.010
 interf+0.010
Neg logit contributions
astern-0.012
cry-0.011
forcement-0.010
blast-0.010
rots-0.010
Tokenza
Token ID4496
Feature activation-9.34e-02

Pos logit contributions
astern+0.057
cry+0.049
forcement+0.047
blast+0.047
rots+0.046
Neg logit contributions
 LGBT-0.048
gee-0.046
 homosexuals-0.046
 POS-0.045
 interf-0.045
Token X
Token ID1395
Feature activation+7.69e-02

Pos logit contributions
astern+0.073
cry+0.062
forcement+0.060
blast+0.059
rots+0.058
Neg logit contributions
 LGBT-0.072
gee-0.069
 homosexuals-0.069
 POS-0.069
 interf-0.069
Token studio
Token ID8034
Feature activation+7.66e-02

Pos logit contributions
 LGBT+0.055
gee+0.052
 homosexuals+0.052
 POS+0.052
nel+0.052
Neg logit contributions
astern-0.065
cry-0.055
forcement-0.054
blast-0.054
rots-0.052
Token chopped
Token ID20720
Feature activation-7.55e-02

Pos logit contributions
astern+0.008
cry+0.007
forcement+0.007
blast+0.007
rots+0.007
Neg logit contributions
 LGBT-0.007
gee-0.006
 homosexuals-0.006
 POS-0.006
 interf-0.006
Token up
Token ID510
Feature activation-4.23e-02

Pos logit contributions
astern+0.062
cry+0.052
forcement+0.051
blast+0.051
rots+0.049
Neg logit contributions
 LGBT-0.056
gee-0.054
 homosexuals-0.053
 POS-0.053
 interf-0.053
Token society
Token ID3592
Feature activation+1.58e-02

Pos logit contributions
astern+0.036
cry+0.031
forcement+0.030
blast+0.030
rots+0.029
Neg logit contributions
 LGBT-0.030
gee-0.029
 homosexuals-0.028
 POS-0.028
 interf-0.028
Token as
Token ID355
Feature activation-6.09e-02

Pos logit contributions
 LGBT+0.010
gee+0.010
 homosexuals+0.010
 POS+0.009
 interf+0.009
Neg logit contributions
astern-0.015
cry-0.013
blast-0.012
forcement-0.012
rots-0.012
Token unsu
Token ID48092
Feature activation+4.53e-02

Pos logit contributions
astern+0.057
cry+0.049
forcement+0.048
blast+0.048
rots+0.047
Neg logit contributions
 LGBT-0.038
gee-0.036
 homosexuals-0.036
 POS-0.036
 interf-0.036
Tokenited
Token ID863
Feature activation-3.41e-01

Pos logit contributions
 LGBT+0.029
 homosexuals+0.028
gee+0.028
 POS+0.028
 interf+0.027
Neg logit contributions
astern-0.041
cry-0.036
forcement-0.034
blast-0.034
rots-0.034
Token to
Token ID284
Feature activation-8.24e-02

Pos logit contributions
astern+0.314
cry+0.270
forcement+0.263
blast+0.262
rots+0.258
Neg logit contributions
 LGBT-0.216
gee-0.208
 homosexuals-0.205
 POS-0.204
nel-0.203
Token state
Token ID1181
Feature activation-9.82e-02

Pos logit contributions
astern+0.072
cry+0.061
forcement+0.060
blast+0.060
ensen+0.058
Neg logit contributions
 LGBT-0.057
gee-0.054
 homosexuals-0.054
 POS-0.054
 interf-0.054
Tokenhood
Token ID2894
Feature activation-3.35e-02

Pos logit contributions
astern+0.108
cry+0.096
blast+0.094
forcement+0.094
ensen+0.092
Neg logit contributions
 LGBT-0.046
gee-0.042
 homosexuals-0.042
 interf-0.042
 POS-0.041
Token.
Token ID13
Feature activation-6.17e-02

Pos logit contributions
astern+0.030
cry+0.026
blast+0.025
forcement+0.025
ensen+0.024
Neg logit contributions
 LGBT-0.022
gee-0.021
 homosexuals-0.021
 POS-0.021
 interf-0.021
Token
Token ID198
Feature activation-5.21e-02

Pos logit contributions
astern+0.058
cry+0.050
forcement+0.049
blast+0.049
rots+0.048
Neg logit contributions
 LGBT-0.038
gee-0.036
 homosexuals-0.036
 POS-0.036
nel-0.036
Token across
Token ID1973
Feature activation-7.70e-02

Pos logit contributions
astern+0.008
cry+0.007
blast+0.007
forcement+0.007
rots+0.007
Neg logit contributions
 LGBT-0.007
 homosexuals-0.007
gee-0.007
 interf-0.007
 POS-0.007
Token the
Token ID262
Feature activation+1.77e-02

Pos logit contributions
astern+0.067
cry+0.058
blast+0.056
forcement+0.056
rots+0.055
Neg logit contributions
 LGBT-0.052
gee-0.050
 homosexuals-0.050
 POS-0.050
 interf-0.049
Token south
Token ID5366
Feature activation-6.23e-03

Pos logit contributions
 LGBT+0.013
 homosexuals+0.012
gee+0.012
 POS+0.012
 interf+0.012
Neg logit contributions
astern-0.015
cry-0.013
blast-0.012
forcement-0.012
rots-0.012
Token-
Token ID12
Feature activation-5.94e-03

Pos logit contributions
astern+0.005
cry+0.004
forcement+0.004
blast+0.004
rots+0.004
Neg logit contributions
 LGBT-0.005
gee-0.004
 homosexuals-0.004
 POS-0.004
 interf-0.004
Tokene
Token ID68
Feature activation+6.89e-02

Pos logit contributions
astern+0.005
cry+0.005
forcement+0.005
blast+0.005
rots+0.004
Neg logit contributions
 LGBT-0.004
gee-0.004
 homosexuals-0.004
 POS-0.004
nel-0.004
Tokenastern
Token ID6470
Feature activation-3.40e-01

Pos logit contributions
 LGBT+0.104
gee+0.102
 homosexuals+0.101
 POS+0.101
 interf+0.100
Neg logit contributions
astern-0.001
cry+0.005
forcement+0.007
blast+0.007
rots+0.007
Token Bahamas
Token ID45726
Feature activation-7.70e-02

Pos logit contributions
astern+0.300
cry+0.257
forcement+0.250
blast+0.250
rots+0.244
Neg logit contributions
 LGBT-0.230
gee-0.220
 homosexuals-0.219
 POS-0.217
 interf-0.216
Token late
Token ID2739
Feature activation+9.22e-02

Pos logit contributions
astern+0.066
cry+0.057
blast+0.055
forcement+0.055
rots+0.054
Neg logit contributions
 LGBT-0.054
gee-0.051
 homosexuals-0.051
 POS-0.051
 interf-0.051
Token Tuesday
Token ID3431
Feature activation+1.10e-01

Pos logit contributions
 LGBT+0.072
 homosexuals+0.069
gee+0.069
 POS+0.069
 interf+0.068
Neg logit contributions
astern-0.072
cry-0.060
forcement-0.058
blast-0.058
rots-0.057
Token and
Token ID290
Feature activation+6.32e-03

Pos logit contributions
 LGBT+0.078
gee+0.074
 homosexuals+0.074
 POS+0.074
 interf+0.073
Neg logit contributions
astern-0.094
cry-0.080
forcement-0.078
blast-0.078
rots-0.076
Token early
Token ID1903
Feature activation+2.77e-03

Pos logit contributions
 LGBT+0.005
 homosexuals+0.005
 interf+0.005
gee+0.005
 POS+0.005
Neg logit contributions
astern-0.005
cry-0.004
blast-0.004
forcement-0.004
rots-0.004
Token
Token ID447
Feature activation-5.04e-02

Pos logit contributions
astern+0.019
cry+0.016
forcement+0.016
blast+0.015
phabet+0.015
Neg logit contributions
 LGBT-0.020
gee-0.020
 homosexuals-0.020
 POS-0.019
nel-0.019
Token
Token ID250
Feature activation-1.95e-02

Pos logit contributions
astern+0.044
cry+0.038
blast+0.037
forcement+0.036
ensen+0.036
Neg logit contributions
 LGBT-0.034
gee-0.033
 homosexuals-0.033
 POS-0.033
nel-0.032
TokenCash
Token ID35361
Feature activation+6.48e-03

Pos logit contributions
astern+0.016
cry+0.013
forcement+0.013
blast+0.013
ensen+0.013
Neg logit contributions
 LGBT-0.014
gee-0.014
 POS-0.014
 homosexuals-0.014
nel-0.014
Token-
Token ID12
Feature activation-5.67e-02

Pos logit contributions
 LGBT+0.005
gee+0.004
 POS+0.004
 homosexuals+0.004
nel+0.004
Neg logit contributions
astern-0.005
cry-0.005
forcement-0.004
blast-0.004
rots-0.004
TokenSe
Token ID4653
Feature activation-6.79e-02

Pos logit contributions
astern+0.046
cry+0.039
forcement+0.038
blast+0.038
rots+0.037
Neg logit contributions
 LGBT-0.042
gee-0.040
 homosexuals-0.040
 POS-0.040
nel-0.039
Tokeneking
Token ID18754
Feature activation-3.39e-01

Pos logit contributions
astern+0.038
cry+0.030
blast+0.029
forcement+0.029
rots+0.027
Neg logit contributions
 LGBT-0.067
gee-0.065
 homosexuals-0.065
 POS-0.065
nel-0.064
Token Democratic
Token ID4390
Feature activation+5.66e-02

Pos logit contributions
astern+0.288
cry+0.244
forcement+0.238
blast+0.238
rots+0.232
Neg logit contributions
 LGBT-0.243
gee-0.231
 POS-0.231
 homosexuals-0.230
nel-0.228
Token Senator
Token ID8962
Feature activation-4.47e-02

Pos logit contributions
 LGBT+0.043
 POS+0.041
gee+0.041
 homosexuals+0.041
nel+0.040
Neg logit contributions
astern-0.045
cry-0.038
forcement-0.037
blast-0.037
rots-0.036
Tokenial
Token ID498
Feature activation+3.42e-02

Pos logit contributions
astern+0.039
cry+0.033
blast+0.032
forcement+0.032
ensen+0.032
Neg logit contributions
 LGBT-0.031
 POS-0.029
gee-0.029
 homosexuals-0.029
nel-0.029
Token Campaign
Token ID13718
Feature activation-7.45e-02

Pos logit contributions
 LGBT+0.025
 POS+0.024
gee+0.024
 homosexuals+0.024
nel+0.023
Neg logit contributions
astern-0.029
cry-0.024
forcement-0.024
blast-0.024
rots-0.023
Token Committee
Token ID4606
Feature activation+8.21e-02

Pos logit contributions
astern+0.065
cry+0.056
forcement+0.054
blast+0.054
rots+0.053
Neg logit contributions
 LGBT-0.051
gee-0.048
 homosexuals-0.048
 POS-0.048
nel-0.048
TokenR
Token ID49
Feature activation+1.19e-02

Pos logit contributions
astern+0.014
cry+0.012
forcement+0.011
blast+0.011
rots+0.011
Neg logit contributions
 LGBT-0.011
gee-0.010
 homosexuals-0.010
 POS-0.010
 interf-0.010
Token&
Token ID5
Feature activation+7.48e-02

Pos logit contributions
 LGBT+0.007
gee+0.007
 homosexuals+0.007
 POS+0.006
nel+0.006
Neg logit contributions
astern-0.012
cry-0.010
forcement-0.010
blast-0.010
rots-0.010
TokenD
Token ID35
Feature activation+7.50e-02

Pos logit contributions
 LGBT+0.042
gee+0.040
 homosexuals+0.039
 POS+0.039
 interf+0.039
Neg logit contributions
astern-0.075
cry-0.065
blast-0.064
forcement-0.064
rots-0.062
Token)
Token ID8
Feature activation+1.95e-02

Pos logit contributions
 LGBT+0.053
gee+0.050
 homosexuals+0.050
 POS+0.050
 interf+0.050
Neg logit contributions
astern-0.064
cry-0.055
blast-0.053
forcement-0.053
rots-0.052
Token tot
Token ID2006
Feature activation+9.96e-02

Pos logit contributions
 LGBT+0.014
gee+0.014
 homosexuals+0.013
 POS+0.013
 interf+0.013
Neg logit contributions
astern-0.016
cry-0.014
forcement-0.013
blast-0.013
rots-0.013
Tokenalled
Token ID4262
Feature activation-3.39e-01

Pos logit contributions
 LGBT+0.114
gee+0.111
 homosexuals+0.111
 POS+0.110
 interf+0.110
Neg logit contributions
astern-0.041
cry-0.029
forcement-0.026
blast-0.026
rots-0.025
Token 2
Token ID362
Feature activation+6.97e-04

Pos logit contributions
astern+0.323
cry+0.280
forcement+0.273
blast+0.273
rots+0.267
Neg logit contributions
 LGBT-0.205
gee-0.195
 homosexuals-0.194
 POS-0.192
nel-0.191
Token.
Token ID13
Feature activation+2.90e-02

Pos logit contributions
 LGBT+0.000
gee+0.000
 homosexuals+0.000
 POS+0.000
 interf+0.000
Neg logit contributions
astern-0.001
cry-0.001
forcement-0.001
blast-0.001
rots-0.001
Token07
Token ID2998
Feature activation+1.44e-01

Pos logit contributions
 LGBT+0.023
gee+0.023
 homosexuals+0.022
 POS+0.022
nel+0.022
Neg logit contributions
astern-0.022
cry-0.018
forcement-0.018
blast-0.018
rots-0.017
Token%
Token ID4
Feature activation+7.80e-02

Pos logit contributions
 LGBT+0.086
gee+0.081
 homosexuals+0.080
 POS+0.080
 interf+0.080
Neg logit contributions
astern-0.138
cry-0.120
forcement-0.117
blast-0.117
rots-0.115
Token of
Token ID286
Feature activation+1.93e-02

Pos logit contributions
 LGBT+0.044
gee+0.041
 homosexuals+0.041
 POS+0.041
nel+0.040
Neg logit contributions
astern-0.078
cry-0.068
forcement-0.066
blast-0.066
rots-0.065
Token,
Token ID11
Feature activation-6.31e-02

Pos logit contributions
astern+0.037
cry+0.032
forcement+0.031
blast+0.031
rots+0.030
Neg logit contributions
 LGBT-0.028
gee-0.026
 homosexuals-0.026
 POS-0.026
 interf-0.026
Token features
Token ID3033
Feature activation+4.80e-02

Pos logit contributions
astern+0.053
cry+0.045
forcement+0.044
blast+0.044
rots+0.042
Neg logit contributions
 LGBT-0.045
gee-0.043
 homosexuals-0.043
 POS-0.043
 interf-0.043
Token a
Token ID257
Feature activation-5.35e-02

Pos logit contributions
 LGBT+0.030
gee+0.028
 homosexuals+0.028
 POS+0.028
 interf+0.028
Neg logit contributions
astern-0.045
cry-0.039
blast-0.038
forcement-0.038
rots-0.037
Token full
Token ID1336
Feature activation+6.49e-02

Pos logit contributions
astern+0.045
cry+0.038
forcement+0.037
blast+0.037
rots+0.036
Neg logit contributions
 LGBT-0.039
gee-0.037
 POS-0.037
 homosexuals-0.037
 interf-0.036
Token Shim
Token ID31698
Feature activation-4.15e-02

Pos logit contributions
 LGBT+0.048
gee+0.045
 homosexuals+0.045
 POS+0.045
 interf+0.045
Neg logit contributions
astern-0.054
cry-0.046
forcement-0.044
blast-0.044
rots-0.043
Tokenano
Token ID5733
Feature activation-3.39e-01

Pos logit contributions
astern+0.020
cry+0.015
forcement+0.014
blast+0.014
rots+0.014
Neg logit contributions
 LGBT-0.044
 POS-0.043
gee-0.043
nel-0.043
 homosexuals-0.043
Token 600
Token ID10053
Feature activation-2.93e-02

Pos logit contributions
astern+0.291
cry+0.249
blast+0.241
forcement+0.241
rots+0.236
Neg logit contributions
 LGBT-0.238
gee-0.227
 homosexuals-0.226
 POS-0.225
 interf-0.224
Token tri
Token ID1333
Feature activation-5.95e-02

Pos logit contributions
astern+0.024
cry+0.020
forcement+0.020
blast+0.020
rots+0.019
Neg logit contributions
 LGBT-0.022
gee-0.021
 POS-0.021
 homosexuals-0.020
 interf-0.020
Token-
Token ID12
Feature activation-3.53e-02

Pos logit contributions
astern+0.050
cry+0.042
forcement+0.041
blast+0.041
rots+0.040
Neg logit contributions
 LGBT-0.043
gee-0.041
 homosexuals-0.041
 POS-0.040
 interf-0.040
Tokencolor
Token ID8043
Feature activation-1.19e-01

Pos logit contributions
astern+0.028
cry+0.023
forcement+0.022
blast+0.022
ensen+0.022
Neg logit contributions
 LGBT-0.027
gee-0.026
 homosexuals-0.026
 POS-0.026
 interf-0.026
Token group
Token ID1448
Feature activation+8.36e-03

Pos logit contributions
astern+0.097
cry+0.082
forcement+0.080
blast+0.080
rots+0.078
Neg logit contributions
 LGBT-0.089
gee-0.085
 POS-0.085
 homosexuals-0.085
 interf-0.084
TokenLu
Token ID25596
Feature activation-1.09e-02

Pos logit contributions
 LGBT+0.013
gee+0.012
 homosexuals+0.012
 POS+0.012
nel+0.012
Neg logit contributions
astern-0.012
cry-0.010
forcement-0.010
blast-0.010
ensen-0.010
Tokenis
Token ID271
Feature activation-9.81e-02

Pos logit contributions
astern+0.010
cry+0.009
blast+0.008
forcement+0.008
ensen+0.008
Neg logit contributions
 LGBT-0.007
gee-0.007
 homosexuals-0.007
nel-0.007
 POS-0.007
Token Moreno
Token ID43269
Feature activation+2.36e-01

Pos logit contributions
astern+0.087
cry+0.074
forcement+0.072
blast+0.072
rots+0.071
Neg logit contributions
 LGBT-0.066
gee-0.063
 homosexuals-0.063
 POS-0.062
 interf-0.062
Token
Token ID784
Feature activation+5.68e-03

Pos logit contributions
 LGBT+0.168
gee+0.161
 homosexuals+0.160
 POS+0.159
 interf+0.158
Neg logit contributions
astern-0.199
cry-0.169
blast-0.164
forcement-0.164
rots-0.160
Token Instit
Token ID37931
Feature activation+1.87e-02

Pos logit contributions
 LGBT+0.004
gee+0.004
 homosexuals+0.004
 POS+0.004
 interf+0.004
Neg logit contributions
astern-0.005
cry-0.004
forcement-0.004
blast-0.004
rots-0.004
Tokenuto
Token ID9390
Feature activation-3.35e-01

Pos logit contributions
 LGBT+0.020
gee+0.019
 homosexuals+0.019
 POS+0.019
 interf+0.019
Neg logit contributions
astern-0.009
cry-0.007
blast-0.006
forcement-0.006
rots-0.006
Token de
Token ID390
Feature activation-4.85e-02

Pos logit contributions
astern+0.290
cry+0.247
blast+0.240
forcement+0.240
rots+0.234
Neg logit contributions
 LGBT-0.231
gee-0.223
 homosexuals-0.221
 POS-0.218
nel-0.218
Token Pol
Token ID2165
Feature activation+6.81e-03

Pos logit contributions
astern+0.039
cry+0.033
forcement+0.032
blast+0.032
rots+0.031
Neg logit contributions
 LGBT-0.036
gee-0.035
 homosexuals-0.035
 POS-0.035
 interf-0.034
Tokení
Token ID8836
Feature activation+5.70e-02

Pos logit contributions
 LGBT+0.004
gee+0.003
 homosexuals+0.003
 POS+0.003
 interf+0.003
Neg logit contributions
astern-0.007
cry-0.006
blast-0.006
forcement-0.006
rots-0.006
Tokentic
Token ID13370
Feature activation-4.79e-02

Pos logit contributions
 LGBT+0.045
gee+0.043
 homosexuals+0.043
 POS+0.043
nel+0.043
Neg logit contributions
astern-0.044
cry-0.037
forcement-0.036
blast-0.036
ensen-0.035
Tokenas
Token ID292
Feature activation-9.86e-02

Pos logit contributions
astern+0.043
cry+0.037
blast+0.036
forcement+0.036
rots+0.035
Neg logit contributions
 LGBT-0.031
gee-0.030
 homosexuals-0.030
 POS-0.029
 interf-0.029
Token-
Token ID12
Feature activation-2.43e-02

Pos logit contributions
astern+0.064
cry+0.054
forcement+0.053
blast+0.052
rots+0.052
Neg logit contributions
 LGBT-0.054
gee-0.052
 POS-0.051
 homosexuals-0.051
nel-0.051
Token0
Token ID15
Feature activation+5.25e-02

Pos logit contributions
astern+0.020
cry+0.017
forcement+0.017
blast+0.017
rots+0.016
Neg logit contributions
 LGBT-0.018
gee-0.017
 POS-0.017
 homosexuals-0.017
nel-0.016
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
 LGBT+0.029
gee+0.027
 POS+0.027
 homosexuals+0.027
 interf+0.027
Neg logit contributions
astern-0.053
cry-0.046
forcement-0.045
blast-0.045
rots-0.044
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
astern+0.016
cry+0.014
forcement+0.014
blast+0.014
rots+0.014
Neg logit contributions
 LGBT-0.009
gee-0.008
 homosexuals-0.008
 POS-0.008
nel-0.008
TokenSe
Token ID4653
Feature activation-5.60e-02

Pos logit contributions
astern+0.013
cry+0.010
forcement+0.010
blast+0.010
phabet+0.010
Neg logit contributions
 LGBT-0.015
gee-0.014
 POS-0.014
 homosexuals-0.014
nel-0.014
Tokeneking
Token ID18754
Feature activation-3.35e-01

Pos logit contributions
astern+0.034
cry+0.027
blast+0.026
forcement+0.026
rots+0.025
Neg logit contributions
 LGBT-0.053
gee-0.052
 homosexuals-0.051
 POS-0.051
 interf-0.051
Token just
Token ID655
Feature activation+5.83e-03

Pos logit contributions
astern+0.314
cry+0.271
blast+0.265
forcement+0.265
rots+0.259
Neg logit contributions
 LGBT-0.208
gee-0.197
 POS-0.196
 homosexuals-0.196
 interf-0.193
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
 LGBT+0.004
 POS+0.004
gee+0.004
 homosexuals+0.004
 interf+0.004
Neg logit contributions
astern-0.005
cry-0.005
blast-0.004
forcement-0.004
rots-0.004
Token point
Token ID966
Feature activation+4.56e-02

Pos logit contributions
astern+0.010
cry+0.009
forcement+0.008
blast+0.008
rots+0.008
Neg logit contributions
 LGBT-0.009
 POS-0.009
gee-0.009
 homosexuals-0.009
 interf-0.009
Token to
Token ID284
Feature activation-4.62e-02

Pos logit contributions
 LGBT+0.030
gee+0.028
 homosexuals+0.028
 POS+0.028
 interf+0.028
Neg logit contributions
astern-0.041
cry-0.035
forcement-0.034
blast-0.034
rots-0.034
Token clinch
Token ID45963
Feature activation-1.11e-02

Pos logit contributions
astern+0.034
cry+0.029
forcement+0.028
blast+0.028
rots+0.027
Neg logit contributions
 LGBT-0.038
 POS-0.036
 homosexuals-0.036
gee-0.036
 interf-0.036
Token
Token ID447
Feature activation-3.03e-03

Pos logit contributions
 LGBT+0.045
gee+0.043
 homosexuals+0.043
 POS+0.043
nel+0.042
Neg logit contributions
astern-0.041
cry-0.034
forcement-0.033
blast-0.033
rots-0.032
Token
Token ID247
Feature activation+2.90e-02

Pos logit contributions
astern+0.002
cry+0.002
blast+0.002
forcement+0.002
rots+0.002
Neg logit contributions
 LGBT-0.003
gee-0.002
 POS-0.002
 homosexuals-0.002
nel-0.002
Tokens
Token ID82
Feature activation+6.01e-03

Pos logit contributions
 LGBT+0.023
gee+0.022
nel+0.021
 POS+0.021
 homosexuals+0.021
Neg logit contributions
astern-0.023
cry-0.019
forcement-0.018
blast-0.018
rots-0.018
Token o
Token ID267
Feature activation-2.11e-02

Pos logit contributions
 LGBT+0.005
gee+0.004
 homosexuals+0.004
 POS+0.004
 interf+0.004
Neg logit contributions
astern-0.005
cry-0.004
forcement-0.004
blast-0.004
rots-0.004
Tokeneu
Token ID12496
Feature activation+2.26e-02

Pos logit contributions
astern+0.020
cry+0.017
forcement+0.017
blast+0.017
rots+0.016
Neg logit contributions
 LGBT-0.013
gee-0.013
 homosexuals-0.012
nel-0.012
 POS-0.012
Tokenvre
Token ID43933
Feature activation-3.34e-01

Pos logit contributions
 LGBT+0.027
gee+0.026
 homosexuals+0.026
 POS+0.026
 interf+0.026
Neg logit contributions
astern-0.008
cry-0.005
forcement-0.005
blast-0.005
rots-0.004
Token.
Token ID13
Feature activation+3.41e-02

Pos logit contributions
astern+0.301
cry+0.259
forcement+0.252
blast+0.252
rots+0.246
Neg logit contributions
 LGBT-0.219
gee-0.209
 homosexuals-0.207
 POS-0.207
 interf-0.205
Token It
Token ID632
Feature activation+1.45e-02

Pos logit contributions
 LGBT+0.024
gee+0.023
 POS+0.023
 homosexuals+0.023
nel+0.023
Neg logit contributions
astern-0.029
cry-0.025
forcement-0.024
blast-0.024
rots-0.024
Token feels
Token ID5300
Feature activation-5.31e-02

Pos logit contributions
 LGBT+0.011
gee+0.010
 homosexuals+0.010
 POS+0.010
 interf+0.010
Neg logit contributions
astern-0.012
cry-0.010
blast-0.010
forcement-0.010
ensen-0.009
Token nothing
Token ID2147
Feature activation+1.27e-02

Pos logit contributions
astern+0.047
cry+0.041
forcement+0.040
blast+0.040
rots+0.039
Neg logit contributions
 LGBT-0.035
gee-0.034
 homosexuals-0.033
 POS-0.033
 interf-0.033
Token like
Token ID588
Feature activation+8.69e-02

Pos logit contributions
 LGBT+0.008
gee+0.008
 homosexuals+0.008
 POS+0.008
 interf+0.008
Neg logit contributions
astern-0.011
cry-0.010
forcement-0.010
blast-0.010
rots-0.009
Token the
Token ID262
Feature activation-1.16e-02

Pos logit contributions
astern+0.011
cry+0.010
forcement+0.010
blast+0.010
rots+0.009
Neg logit contributions
 LGBT-0.009
gee-0.009
 homosexuals-0.009
 POS-0.009
 interf-0.009
Token optics
Token ID36237
Feature activation-7.89e-02

Pos logit contributions
astern+0.010
cry+0.009
blast+0.008
forcement+0.008
rots+0.008
Neg logit contributions
 LGBT-0.008
gee-0.008
 homosexuals-0.008
 POS-0.008
 interf-0.008
Token are
Token ID389
Feature activation-2.43e-02

Pos logit contributions
astern+0.070
cry+0.060
forcement+0.058
blast+0.058
rots+0.057
Neg logit contributions
 LGBT-0.053
gee-0.051
 homosexuals-0.050
 POS-0.050
 interf-0.050
Token still
Token ID991
Feature activation-3.12e-02

Pos logit contributions
astern+0.021
cry+0.018
forcement+0.017
blast+0.017
rots+0.017
Neg logit contributions
 LGBT-0.017
gee-0.016
 homosexuals-0.016
 POS-0.016
 interf-0.016
Token magn
Token ID7842
Feature activation-1.93e-02

Pos logit contributions
astern+0.027
cry+0.023
forcement+0.022
blast+0.022
rots+0.021
Neg logit contributions
 LGBT-0.022
gee-0.021
 homosexuals-0.021
 POS-0.021
 interf-0.021
Tokenifying
Token ID4035
Feature activation+2.97e-01

Pos logit contributions
astern+0.020
cry+0.018
forcement+0.017
blast+0.017
rots+0.017
Neg logit contributions
 LGBT-0.010
gee-0.009
 homosexuals-0.009
 POS-0.009
 interf-0.009
Token those
Token ID883
Feature activation+5.11e-02

Pos logit contributions
 LGBT+0.198
gee+0.189
 homosexuals+0.188
 POS+0.188
 interf+0.187
Neg logit contributions
astern-0.265
cry-0.227
forcement-0.221
blast-0.220
rots-0.216
Token spaces
Token ID9029
Feature activation-2.63e-02

Pos logit contributions
 LGBT+0.036
 homosexuals+0.035
gee+0.035
 POS+0.035
 interf+0.034
Neg logit contributions
astern-0.043
cry-0.037
forcement-0.036
blast-0.036
rots-0.035
Token as
Token ID355
Feature activation+2.04e-02

Pos logit contributions
astern+0.023
cry+0.020
forcement+0.019
blast+0.019
rots+0.019
Neg logit contributions
 LGBT-0.018
gee-0.017
 homosexuals-0.017
 POS-0.017
 interf-0.017
Token well
Token ID880
Feature activation-3.51e-02

Pos logit contributions
 LGBT+0.015
gee+0.014
 POS+0.014
 homosexuals+0.014
 interf+0.014
Neg logit contributions
astern-0.017
cry-0.014
forcement-0.014
blast-0.014
rots-0.014
Token as
Token ID355
Feature activation+5.32e-03

Pos logit contributions
astern+0.031
cry+0.027
forcement+0.026
blast+0.026
rots+0.025
Neg logit contributions
 LGBT-0.024
gee-0.023
 homosexuals-0.022
 POS-0.022
 interf-0.022
TokenMaster
Token ID18254
Feature activation+6.71e-02

Pos logit contributions
 LGBT+0.037
gee+0.036
 POS+0.035
 homosexuals+0.035
nel+0.035
Neg logit contributions
astern-0.032
cry-0.026
forcement-0.025
blast-0.025
rots-0.024
Token the
Token ID262
Feature activation+3.83e-02

Pos logit contributions
 LGBT+0.044
gee+0.042
 homosexuals+0.042
 POS+0.041
 interf+0.041
Neg logit contributions
astern-0.061
cry-0.052
forcement-0.051
blast-0.051
rots-0.050
Token Exc
Token ID25268
Feature activation-1.39e-01

Pos logit contributions
 LGBT+0.028
gee+0.027
 homosexuals+0.027
 POS+0.027
 interf+0.027
Neg logit contributions
astern-0.031
cry-0.027
forcement-0.026
blast-0.026
rots-0.025
Tokenursion
Token ID24197
Feature activation+8.70e-02

Pos logit contributions
astern+0.119
cry+0.101
forcement+0.099
blast+0.098
rots+0.096
Neg logit contributions
 LGBT-0.098
gee-0.095
nel-0.093
 homosexuals-0.093
 POS-0.093
Token Fun
Token ID11138
Feature activation-5.65e-02

Pos logit contributions
 LGBT+0.065
gee+0.063
 POS+0.062
 homosexuals+0.062
 interf+0.062
Neg logit contributions
astern-0.071
cry-0.059
forcement-0.058
blast-0.057
rots-0.056
Tokennel
Token ID4954
Feature activation+4.66e-01

Pos logit contributions
astern+0.052
cry+0.044
forcement+0.043
blast+0.043
rots+0.042
Neg logit contributions
 LGBT-0.036
gee-0.035
nel-0.035
 POS-0.034
 homosexuals-0.034
Token (
Token ID357
Feature activation+2.19e-02

Pos logit contributions
 LGBT+0.331
gee+0.321
nel+0.315
 POS+0.314
 interf+0.313
Neg logit contributions
astern-0.398
cry-0.337
forcement-0.327
blast-0.321
rots-0.321
Token1
Token ID16
Feature activation+2.80e-02

Pos logit contributions
 LGBT+0.017
gee+0.017
 POS+0.016
 homosexuals+0.016
nel+0.016
Neg logit contributions
astern-0.017
cry-0.014
forcement-0.014
blast-0.013
rots-0.013
Token)
Token ID8
Feature activation+3.64e-02

Pos logit contributions
 LGBT+0.019
gee+0.018
 POS+0.018
 homosexuals+0.018
nel+0.018
Neg logit contributions
astern-0.024
cry-0.021
forcement-0.020
blast-0.020
rots-0.020
Token
Token ID198
Feature activation+5.12e-02

Pos logit contributions
 LGBT+0.021
gee+0.020
 homosexuals+0.020
 POS+0.019
 interf+0.019
Neg logit contributions
astern-0.036
cry-0.031
forcement-0.031
blast-0.031
rots-0.030
Token
Token ID198
Feature activation+4.78e-02

Pos logit contributions
 LGBT+0.029
gee+0.027
 homosexuals+0.027
 POS+0.027
 interf+0.026
Neg logit contributions
astern-0.051
cry-0.045
forcement-0.044
blast-0.044
rots-0.043
Token fees
Token ID6642
Feature activation+1.57e-01

Pos logit contributions
 LGBT+0.094
gee+0.089
 homosexuals+0.089
 POS+0.088
 interf+0.088
Neg logit contributions
astern-0.168
cry-0.147
forcement-0.144
blast-0.143
ensen-0.141
Token,
Token ID11
Feature activation-2.28e-02

Pos logit contributions
 LGBT+0.101
gee+0.096
 homosexuals+0.096
 POS+0.095
 interf+0.095
Neg logit contributions
astern-0.143
cry-0.123
forcement-0.120
blast-0.120
ensen-0.117
Token the
Token ID262
Feature activation-9.45e-03

Pos logit contributions
astern+0.021
cry+0.018
blast+0.017
forcement+0.017
ensen+0.017
Neg logit contributions
 LGBT-0.015
gee-0.014
 homosexuals-0.014
 POS-0.014
 interf-0.014
Token pl
Token ID458
Feature activation+4.93e-02

Pos logit contributions
astern+0.008
cry+0.007
blast+0.007
forcement+0.007
ensen+0.006
Neg logit contributions
 LGBT-0.007
gee-0.006
 homosexuals-0.006
 POS-0.006
 interf-0.006
Tokenump
Token ID931
Feature activation+7.92e-02

Pos logit contributions
 LGBT+0.039
gee+0.037
nel+0.037
 homosexuals+0.037
 POS+0.037
Neg logit contributions
astern-0.037
cry-0.032
forcement-0.031
blast-0.030
rots-0.030
Token profits
Token ID10177
Feature activation+2.85e-01

Pos logit contributions
 LGBT+0.056
 homosexuals+0.053
gee+0.053
 POS+0.053
 interf+0.053
Neg logit contributions
astern-0.067
cry-0.057
forcement-0.056
blast-0.056
rots-0.055
Token from
Token ID422
Feature activation-5.84e-03

Pos logit contributions
 LGBT+0.187
 homosexuals+0.178
gee+0.178
 POS+0.176
 interf+0.176
Neg logit contributions
astern-0.257
cry-0.221
forcement-0.215
blast-0.215
rots-0.210
Token home
Token ID1363
Feature activation-1.24e-02

Pos logit contributions
astern+0.005
cry+0.004
forcement+0.004
blast+0.004
ensen+0.004
Neg logit contributions
 LGBT-0.004
 homosexuals-0.004
 POS-0.004
gee-0.004
 interf-0.004
Token equity
Token ID13598
Feature activation+6.63e-02

Pos logit contributions
astern+0.010
cry+0.008
forcement+0.008
blast+0.008
ensen+0.008
Neg logit contributions
 LGBT-0.010
gee-0.009
 POS-0.009
 homosexuals-0.009
 interf-0.009
Token lines
Token ID3951
Feature activation+1.93e-02

Pos logit contributions
 LGBT+0.049
 homosexuals+0.047
 POS+0.047
 interf+0.047
gee+0.046
Neg logit contributions
astern-0.055
cry-0.046
forcement-0.045
blast-0.045
ensen-0.044
Token of
Token ID286
Feature activation+1.70e-02

Pos logit contributions
 LGBT+0.011
gee+0.011
 homosexuals+0.011
 POS+0.011
 interf+0.011
Neg logit contributions
astern-0.019
cry-0.016
forcement-0.016
blast-0.016
ensen-0.016
Token his
Token ID465
Feature activation-2.06e-02

Pos logit contributions
astern+0.013
cry+0.011
blast+0.011
forcement+0.011
rots+0.011
Neg logit contributions
 LGBT-0.010
gee-0.010
 homosexuals-0.010
 POS-0.010
 interf-0.010
Token work
Token ID670
Feature activation-3.68e-03

Pos logit contributions
astern+0.016
cry+0.013
blast+0.013
forcement+0.013
ensen+0.012
Neg logit contributions
 LGBT-0.017
gee-0.016
 POS-0.016
 interf-0.016
 homosexuals-0.016
Token on
Token ID319
Feature activation-2.04e-02

Pos logit contributions
astern+0.003
cry+0.002
blast+0.002
forcement+0.002
ensen+0.002
Neg logit contributions
 LGBT-0.003
gee-0.003
 homosexuals-0.003
 POS-0.003
 interf-0.003
Token electronic
Token ID7914
Feature activation-7.28e-02

Pos logit contributions
astern+0.018
cry+0.015
blast+0.015
forcement+0.014
rots+0.014
Neg logit contributions
 LGBT-0.014
 homosexuals-0.014
gee-0.014
 POS-0.014
 interf-0.014
Token text
Token ID2420
Feature activation+7.07e-02

Pos logit contributions
astern+0.060
cry+0.051
forcement+0.050
blast+0.049
rots+0.048
Neg logit contributions
 LGBT-0.054
 homosexuals-0.051
 interf-0.051
gee-0.051
 POS-0.051
Tokenuality
Token ID25775
Feature activation+3.03e-01

Pos logit contributions
 LGBT+0.049
gee+0.047
 homosexuals+0.047
 interf+0.047
 POS+0.047
Neg logit contributions
astern-0.061
cry-0.052
forcement-0.050
blast-0.050
rots-0.049
Token,
Token ID11
Feature activation-1.64e-02

Pos logit contributions
 LGBT+0.226
 homosexuals+0.214
 POS+0.214
gee+0.214
 interf+0.214
Neg logit contributions
astern-0.247
cry-0.209
blast-0.203
forcement-0.203
ensen-0.196
Token Matthew
Token ID9308
Feature activation-2.77e-03

Pos logit contributions
astern+0.014
cry+0.012
blast+0.011
forcement+0.011
rots+0.011
Neg logit contributions
 LGBT-0.012
gee-0.011
 homosexuals-0.011
 POS-0.011
 interf-0.011
Token Kir
Token ID7385
Feature activation+1.02e-01

Pos logit contributions
astern+0.002
cry+0.002
blast+0.002
forcement+0.002
rots+0.002
Neg logit contributions
 LGBT-0.002
gee-0.002
 homosexuals-0.002
 POS-0.002
nel-0.002
Tokensc
Token ID1416
Feature activation-2.21e-01

Pos logit contributions
 LGBT+0.090
gee+0.087
nel+0.087
 POS+0.086
 homosexuals+0.086
Neg logit contributions
astern-0.069
cry-0.056
forcement-0.054
blast-0.054
phabet-0.052
Tokenhen
Token ID831
Feature activation+2.13e-01

Pos logit contributions
astern+0.191
cry+0.164
forcement+0.159
blast+0.159
rots+0.155
Neg logit contributions
 LGBT-0.153
gee-0.147
 homosexuals-0.145
 POS-0.145
nel-0.144
Token of
Token ID286
Feature activation-2.74e-02

Pos logit contributions
 LGBT+0.057
gee+0.054
 homosexuals+0.054
 POS+0.054
 interf+0.053
Neg logit contributions
astern-0.086
cry-0.074
forcement-0.072
blast-0.072
rots-0.071
Token Peoples
Token ID41770
Feature activation-3.60e-02

Pos logit contributions
astern+0.024
cry+0.021
blast+0.020
forcement+0.020
rots+0.020
Neg logit contributions
 LGBT-0.018
gee-0.018
 homosexuals-0.017
 POS-0.017
 interf-0.017
Token'
Token ID6
Feature activation-6.48e-02

Pos logit contributions
astern+0.032
cry+0.027
forcement+0.027
blast+0.027
rots+0.026
Neg logit contributions
 LGBT-0.024
gee-0.023
 homosexuals-0.023
 POS-0.023
 interf-0.023
Token Comm
Token ID1520
Feature activation+1.33e-01

Pos logit contributions
astern+0.057
cry+0.049
forcement+0.048
blast+0.048
rots+0.047
Neg logit contributions
 LGBT-0.044
gee-0.042
 homosexuals-0.041
 POS-0.041
 interf-0.041
Tokeniss
Token ID747
Feature activation+2.02e-01

Pos logit contributions
 LGBT+0.069
gee+0.065
 homosexuals+0.065
 POS+0.064
 interf+0.063
Neg logit contributions
astern-0.138
cry-0.121
forcement-0.118
blast-0.118
rots-0.116
Tokenariat
Token ID21621
Feature activation+3.06e-01

Pos logit contributions
 LGBT+0.141
gee+0.135
 homosexuals+0.134
 POS+0.133
 interf+0.133
Neg logit contributions
astern-0.174
cry-0.148
forcement-0.144
blast-0.144
rots-0.141
Token of
Token ID286
Feature activation-3.02e-02

Pos logit contributions
 LGBT+0.186
gee+0.176
 homosexuals+0.175
 POS+0.174
 interf+0.173
Neg logit contributions
astern-0.292
cry-0.253
forcement-0.247
blast-0.247
rots-0.241
Token Defence
Token ID16721
Feature activation-9.06e-02

Pos logit contributions
astern+0.024
cry+0.020
blast+0.020
forcement+0.020
rots+0.019
Neg logit contributions
 LGBT-0.023
gee-0.022
 homosexuals-0.022
 POS-0.022
 interf-0.022
Token No
Token ID1400
Feature activation-3.44e-02

Pos logit contributions
astern+0.086
cry+0.074
blast+0.072
forcement+0.072
rots+0.071
Neg logit contributions
 LGBT-0.056
gee-0.053
 homosexuals-0.052
 POS-0.052
 interf-0.052
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
astern+0.031
cry+0.026
forcement+0.025
blast+0.025
rots+0.025
Neg logit contributions
 LGBT-0.023
gee-0.022
 homosexuals-0.022
 POS-0.022
 interf-0.022
Token 3
Token ID513
Feature activation-3.89e-02

Pos logit contributions
astern+0.023
cry+0.020
forcement+0.019
blast+0.019
rots+0.019
Neg logit contributions
 LGBT-0.015
gee-0.015
 homosexuals-0.014
 POS-0.014
 interf-0.014
Token their
Token ID511
Feature activation-3.14e-02

Pos logit contributions
astern+0.030
cry+0.026
forcement+0.025
blast+0.025
rots+0.025
Neg logit contributions
 LGBT-0.026
 homosexuals-0.024
gee-0.024
 POS-0.024
 interf-0.024
Token motive
Token ID20289
Feature activation+8.92e-02

Pos logit contributions
astern+0.028
cry+0.024
forcement+0.023
blast+0.023
rots+0.023
Neg logit contributions
 LGBT-0.021
 homosexuals-0.020
gee-0.020
 POS-0.020
 interf-0.020
Token was
Token ID373
Feature activation+2.91e-02

Pos logit contributions
 LGBT+0.061
gee+0.058
 homosexuals+0.058
 POS+0.058
 interf+0.057
Neg logit contributions
astern-0.078
cry-0.067
forcement-0.065
blast-0.065
rots-0.063
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
 LGBT+0.019
gee+0.018
 homosexuals+0.018
 POS+0.018
 interf+0.018
Neg logit contributions
astern-0.026
cry-0.023
forcement-0.022
blast-0.022
rots-0.022
Token We
Token ID775
Feature activation+2.25e-02

Pos logit contributions
astern+0.016
cry+0.013
forcement+0.013
blast+0.013
rots+0.013
Neg logit contributions
 LGBT-0.013
gee-0.012
 homosexuals-0.012
 POS-0.012
nel-0.012
Token're
Token ID821
Feature activation+2.71e-02

Pos logit contributions
 LGBT+0.016
gee+0.016
 homosexuals+0.015
 POS+0.015
 interf+0.015
Neg logit contributions
astern-0.019
cry-0.016
blast-0.016
forcement-0.016
rots-0.015
Token not
Token ID407
Feature activation+2.05e-02

Pos logit contributions
 LGBT+0.021
gee+0.020
 homosexuals+0.020
 POS+0.020
 interf+0.019
Neg logit contributions
astern-0.022
cry-0.018
forcement-0.018
blast-0.018
rots-0.017
Token going
Token ID1016
Feature activation-1.54e-02

Pos logit contributions
 LGBT+0.016
 homosexuals+0.015
gee+0.015
 POS+0.015
 interf+0.015
Neg logit contributions
astern-0.016
cry-0.013
forcement-0.013
blast-0.013
rots-0.013
Token to
Token ID284
Feature activation+6.16e-03

Pos logit contributions
astern+0.015
cry+0.013
blast+0.013
forcement+0.013
rots+0.012
Neg logit contributions
 LGBT-0.009
gee-0.009
 homosexuals-0.009
 POS-0.008
nel-0.008
Token tolerate
Token ID21923
Feature activation+3.23e-02

Pos logit contributions
 LGBT+0.005
gee+0.004
 homosexuals+0.004
 POS+0.004
 interf+0.004
Neg logit contributions
astern-0.005
cry-0.004
forcement-0.004
blast-0.004
rots-0.004
Token that
Token ID326
Feature activation-9.13e-03

Pos logit contributions
 LGBT+0.022
 homosexuals+0.021
gee+0.021
 POS+0.021
 interf+0.021
Neg logit contributions
astern-0.029
cry-0.024
forcement-0.024
blast-0.024
rots-0.023
Token1
Token ID16
Feature activation-1.43e-04

Pos logit contributions
astern+0.026
cry+0.023
forcement+0.022
blast+0.022
rots+0.021
Neg logit contributions
 LGBT-0.019
gee-0.018
 homosexuals-0.018
 POS-0.018
 interf-0.018
Token.
Token ID13
Feature activation-1.52e-02

Pos logit contributions
astern+0.000
cry+0.000
forcement+0.000
blast+0.000
rots+0.000
Neg logit contributions
 LGBT-0.000
gee-0.000
 homosexuals-0.000
 POS-0.000
 interf-0.000
Token
Token ID198
Feature activation+7.23e-03

Pos logit contributions
astern+0.014
cry+0.013
forcement+0.012
blast+0.012
rots+0.012
Neg logit contributions
 LGBT-0.009
gee-0.009
 homosexuals-0.009
 POS-0.009
nel-0.009
Token
Token ID198
Feature activation-3.90e-03

Pos logit contributions
 LGBT+0.004
gee+0.004
 homosexuals+0.004
 POS+0.004
nel+0.004
Neg logit contributions
astern-0.007
cry-0.006
forcement-0.006
blast-0.006
rots-0.006
TokenSa
Token ID33890
Feature activation-1.07e-02

Pos logit contributions
astern+0.003
cry+0.003
forcement+0.003
blast+0.003
rots+0.002
Neg logit contributions
 LGBT-0.003
gee-0.003
 homosexuals-0.003
 POS-0.003
nel-0.003
Token'd
Token ID1549
Feature activation+9.60e-02

Pos logit contributions
astern+0.010
cry+0.008
forcement+0.008
blast+0.008
rots+0.008
Neg logit contributions
 LGBT-0.007
gee-0.007
 homosexuals-0.007
 POS-0.007
 interf-0.006
Token dismissed
Token ID11126
Feature activation+3.65e-02

Pos logit contributions
 LGBT+0.072
gee+0.069
 homosexuals+0.069
 POS+0.069
 interf+0.068
Neg logit contributions
astern-0.078
cry-0.065
forcement-0.063
blast-0.063
rots-0.062
Token the
Token ID262
Feature activation-1.93e-02

Pos logit contributions
 LGBT+0.024
gee+0.023
 homosexuals+0.023
 POS+0.023
 interf+0.023
Neg logit contributions
astern-0.033
cry-0.028
forcement-0.027
blast-0.027
rots-0.027
Token pleas
Token ID21289
Feature activation-4.32e-02

Pos logit contributions
astern+0.016
cry+0.014
forcement+0.013
blast+0.013
rots+0.013
Neg logit contributions
 LGBT-0.014
 homosexuals-0.014
gee-0.014
 POS-0.013
 interf-0.013
Token of
Token ID286
Feature activation+9.41e-04

Pos logit contributions
astern+0.042
cry+0.036
forcement+0.036
blast+0.036
rots+0.035
Neg logit contributions
 LGBT-0.025
gee-0.024
 homosexuals-0.024
 POS-0.024
 interf-0.024
Token the
Token ID262
Feature activation-1.66e-02

Pos logit contributions
 LGBT+0.001
gee+0.001
 homosexuals+0.001
 POS+0.001
 interf+0.001
Neg logit contributions
astern-0.001
cry-0.001
forcement-0.001
blast-0.001
rots-0.001
Token hold
Token ID1745
Feature activation-9.38e-02

Pos logit contributions
astern+0.027
cry+0.023
forcement+0.022
blast+0.022
ensen+0.022
Neg logit contributions
 LGBT-0.022
gee-0.021
 homosexuals-0.021
 POS-0.021
 interf-0.021
Token power
Token ID1176
Feature activation-8.50e-02

Pos logit contributions
astern+0.083
cry+0.071
forcement+0.069
blast+0.069
ensen+0.067
Neg logit contributions
 LGBT-0.064
 homosexuals-0.060
gee-0.060
 POS-0.060
 interf-0.060
Token anywhere
Token ID6609
Feature activation+1.26e-02

Pos logit contributions
astern+0.075
cry+0.064
forcement+0.062
blast+0.062
rots+0.061
Neg logit contributions
 LGBT-0.058
gee-0.055
 homosexuals-0.055
 POS-0.055
 interf-0.054
Token today
Token ID1909
Feature activation-5.42e-02

Pos logit contributions
 LGBT+0.008
gee+0.008
 homosexuals+0.008
 POS+0.008
 interf+0.008
Neg logit contributions
astern-0.011
cry-0.010
forcement-0.010
blast-0.010
rots-0.009
Token.
Token ID13
Feature activation-2.45e-02

Pos logit contributions
astern+0.049
cry+0.042
forcement+0.041
blast+0.041
rots+0.040
Neg logit contributions
 LGBT-0.035
gee-0.034
 homosexuals-0.034
 POS-0.033
 interf-0.033
Token But
Token ID887
Feature activation+4.27e-02

Pos logit contributions
astern+0.021
cry+0.018
forcement+0.018
blast+0.018
ensen+0.017
Neg logit contributions
 LGBT-0.017
 POS-0.016
gee-0.016
 homosexuals-0.016
nel-0.016
Token now
Token ID783
Feature activation-5.19e-02

Pos logit contributions
 LGBT+0.030
 homosexuals+0.028
 POS+0.028
gee+0.028
 interf+0.028
Neg logit contributions
astern-0.037
cry-0.032
forcement-0.031
blast-0.031
ensen-0.030
Token we
Token ID356
Feature activation-3.45e-02

Pos logit contributions
astern+0.046
cry+0.039
forcement+0.038
blast+0.038
ensen+0.037
Neg logit contributions
 LGBT-0.035
 homosexuals-0.034
gee-0.034
 POS-0.033
 interf-0.033
Token can
Token ID460
Feature activation-2.95e-02

Pos logit contributions
astern+0.030
cry+0.026
forcement+0.025
blast+0.025
rots+0.024
Neg logit contributions
 LGBT-0.024
gee-0.023
 homosexuals-0.023
 POS-0.022
 interf-0.022
Token see
Token ID766
Feature activation-1.43e-02

Pos logit contributions
astern+0.024
cry+0.020
forcement+0.020
blast+0.020
rots+0.019
Neg logit contributions
 LGBT-0.022
gee-0.021
 homosexuals-0.021
 POS-0.021
 interf-0.021
Token the
Token ID262
Feature activation-1.78e-02

Pos logit contributions
astern+0.012
cry+0.010
forcement+0.010
blast+0.010
phabet+0.010
Neg logit contributions
 LGBT-0.010
 homosexuals-0.010
 POS-0.009
gee-0.009
 interf-0.009
Token back
Token ID736
Feature activation-1.50e-02

Pos logit contributions
 LGBT+0.033
 POS+0.031
 homosexuals+0.031
gee+0.031
 interf+0.031
Neg logit contributions
astern-0.036
cry-0.031
blast-0.030
forcement-0.030
rots-0.029
Token?
Token ID30
Feature activation-2.51e-02

Pos logit contributions
astern+0.014
cry+0.012
blast+0.011
forcement+0.011
rots+0.011
Neg logit contributions
 LGBT-0.010
gee-0.009
 homosexuals-0.009
 POS-0.009
 interf-0.009
Token Yes
Token ID3363
Feature activation+1.11e-01

Pos logit contributions
astern+0.023
cry+0.019
forcement+0.019
blast+0.019
rots+0.018
Neg logit contributions
 LGBT-0.017
gee-0.016
 homosexuals-0.016
 POS-0.016
nel-0.016
Token,
Token ID11
Feature activation-1.52e-02

Pos logit contributions
 LGBT+0.074
gee+0.070
 homosexuals+0.070
 POS+0.070
 interf+0.069
Neg logit contributions
astern-0.100
cry-0.085
forcement-0.083
blast-0.083
rots-0.081
Token they
Token ID484
Feature activation+1.61e-02

Pos logit contributions
astern+0.013
cry+0.011
forcement+0.011
blast+0.011
rots+0.011
Neg logit contributions
 LGBT-0.010
gee-0.010
 homosexuals-0.010
 POS-0.010
 interf-0.010
Token have
Token ID423
Feature activation+1.87e-02

Pos logit contributions
 LGBT+0.011
gee+0.010
 homosexuals+0.010
 POS+0.010
 interf+0.010
Neg logit contributions
astern-0.014
cry-0.012
forcement-0.012
blast-0.012
rots-0.012
Token a
Token ID257
Feature activation+2.00e-02

Pos logit contributions
 LGBT+0.012
gee+0.011
 homosexuals+0.011
 POS+0.011
 interf+0.011
Neg logit contributions
astern-0.017
cry-0.015
forcement-0.014
blast-0.014
rots-0.014
Token chance
Token ID2863
Feature activation+9.15e-02

Pos logit contributions
 LGBT+0.013
gee+0.012
 POS+0.012
 homosexuals+0.012
 interf+0.012
Neg logit contributions
astern-0.019
cry-0.016
forcement-0.016
blast-0.016
rots-0.015
Token.
Token ID13
Feature activation+8.19e-03

Pos logit contributions
 LGBT+0.059
gee+0.056
 homosexuals+0.056
 POS+0.056
 interf+0.055
Neg logit contributions
astern-0.083
cry-0.072
forcement-0.070
blast-0.070
rots-0.068
Token But
Token ID887
Feature activation+6.43e-02

Pos logit contributions
 LGBT+0.005
gee+0.005
 homosexuals+0.005
 POS+0.005
nel+0.005
Neg logit contributions
astern-0.007
cry-0.006
forcement-0.006
blast-0.006
rots-0.006
Token this
Token ID428
Feature activation+3.59e-02

Pos logit contributions
 LGBT+0.045
gee+0.043
 homosexuals+0.042
 POS+0.042
 interf+0.042
Neg logit contributions
astern-0.056
cry-0.048
forcement-0.046
blast-0.046
rots-0.045
Token
Token ID247
Feature activation-2.21e-03

Pos logit contributions
 LGBT+0.005
gee+0.005
 homosexuals+0.005
 POS+0.005
 interf+0.005
Neg logit contributions
astern-0.005
cry-0.004
blast-0.004
forcement-0.004
rots-0.004
Tokenm
Token ID76
Feature activation+1.59e-02

Pos logit contributions
astern+0.002
cry+0.002
blast+0.001
forcement+0.001
rots+0.001
Neg logit contributions
 LGBT-0.002
gee-0.002
 homosexuals-0.002
 POS-0.002
 interf-0.002
Token a
Token ID257
Feature activation+1.73e-02

Pos logit contributions
 LGBT+0.011
gee+0.011
 homosexuals+0.010
 POS+0.010
 interf+0.010
Neg logit contributions
astern-0.014
cry-0.012
forcement-0.011
blast-0.011
rots-0.011
Token product
Token ID1720
Feature activation+1.27e-02

Pos logit contributions
 LGBT+0.013
gee+0.012
 POS+0.012
 homosexuals+0.012
 interf+0.012
Neg logit contributions
astern-0.014
cry-0.012
forcement-0.012
blast-0.012
rots-0.011
Token of
Token ID286
Feature activation-1.09e-02

Pos logit contributions
 LGBT+0.008
gee+0.007
 homosexuals+0.007
 POS+0.007
nel+0.007
Neg logit contributions
astern-0.012
cry-0.010
blast-0.010
forcement-0.010
rots-0.010
Token yellow
Token ID7872
Feature activation+1.29e-02

Pos logit contributions
astern+0.010
cry+0.009
forcement+0.009
blast+0.009
rots+0.008
Neg logit contributions
 LGBT-0.007
gee-0.006
 homosexuals-0.006
 POS-0.006
 interf-0.006
Token fever
Token ID17372
Feature activation+1.73e-02

Pos logit contributions
 LGBT+0.009
gee+0.009
 homosexuals+0.009
 POS+0.009
 interf+0.009
Neg logit contributions
astern-0.011
cry-0.009
forcement-0.009
blast-0.009
ensen-0.009
Token
Token ID447
Feature activation+3.69e-03

Pos logit contributions
 LGBT+0.011
gee+0.011
 homosexuals+0.011
 POS+0.011
 interf+0.011
Neg logit contributions
astern-0.016
cry-0.013
forcement-0.013
blast-0.013
rots-0.013
Token
Token ID251
Feature activation-7.84e-03

Pos logit contributions
 LGBT+0.003
gee+0.003
 homosexuals+0.003
 POS+0.003
nel+0.003
Neg logit contributions
astern-0.003
cry-0.002
blast-0.002
forcement-0.002
rots-0.002
Token or
Token ID393
Feature activation-1.86e-02

Pos logit contributions
astern+0.007
cry+0.006
forcement+0.006
blast+0.006
rots+0.005
Neg logit contributions
 LGBT-0.005
gee-0.005
 homosexuals-0.005
 POS-0.005
 interf-0.005
Token something
Token ID1223
Feature activation-3.40e-02

Pos logit contributions
astern+0.016
cry+0.014
blast+0.013
forcement+0.013
rots+0.013
Neg logit contributions
 LGBT-0.013
gee-0.013
 homosexuals-0.013
 POS-0.013
 interf-0.012
Token be
Token ID307
Feature activation-6.54e-02

Pos logit contributions
 LGBT+0.021
gee+0.020
 homosexuals+0.020
 POS+0.020
nel+0.020
Neg logit contributions
astern-0.027
cry-0.023
blast-0.022
forcement-0.022
rots-0.022
Token flowing
Token ID17609
Feature activation-7.87e-02

Pos logit contributions
astern+0.055
cry+0.046
forcement+0.045
blast+0.045
rots+0.044
Neg logit contributions
 LGBT-0.047
gee-0.045
 homosexuals-0.045
 POS-0.045
 interf-0.045
Token freely
Token ID12748
Feature activation+7.34e-02

Pos logit contributions
astern+0.070
cry+0.060
blast+0.059
forcement+0.058
rots+0.057
Neg logit contributions
 LGBT-0.053
 homosexuals-0.050
 POS-0.050
gee-0.050
 interf-0.050
Token during
Token ID1141
Feature activation-9.30e-03

Pos logit contributions
 LGBT+0.048
gee+0.045
 homosexuals+0.045
 POS+0.045
 interf+0.045
Neg logit contributions
astern-0.067
cry-0.057
forcement-0.056
blast-0.056
rots-0.055
Token these
Token ID777
Feature activation-5.64e-02

Pos logit contributions
astern+0.008
cry+0.007
blast+0.007
forcement+0.007
rots+0.007
Neg logit contributions
 LGBT-0.006
gee-0.006
 homosexuals-0.006
 POS-0.006
 interf-0.006
Token events
Token ID2995
Feature activation-1.32e-02

Pos logit contributions
astern+0.045
cry+0.038
forcement+0.037
blast+0.037
rots+0.036
Neg logit contributions
 LGBT-0.043
gee-0.041
 homosexuals-0.041
 POS-0.041
 interf-0.041
Token to
Token ID284
Feature activation-1.31e-02

Pos logit contributions
astern+0.012
cry+0.010
blast+0.010
forcement+0.010
rots+0.010
Neg logit contributions
 LGBT-0.009
gee-0.008
 homosexuals-0.008
 POS-0.008
 interf-0.008
Token prep
Token ID3143
Feature activation-1.13e-01

Pos logit contributions
astern+0.011
cry+0.010
forcement+0.009
blast+0.009
rots+0.009
Neg logit contributions
 LGBT-0.009
gee-0.009
 homosexuals-0.009
 POS-0.009
 interf-0.009
Token the
Token ID262
Feature activation-3.41e-02

Pos logit contributions
astern+0.099
cry+0.085
blast+0.082
forcement+0.082
rots+0.080
Neg logit contributions
 LGBT-0.077
gee-0.073
 homosexuals-0.073
 POS-0.072
 interf-0.072
Token party
Token ID2151
Feature activation+4.79e-02

Pos logit contributions
astern+0.028
cry+0.024
forcement+0.023
blast+0.023
rots+0.022
Neg logit contributions
 LGBT-0.025
gee-0.024
 homosexuals-0.024
 POS-0.024
 interf-0.024
Token
Token ID447
Feature activation-1.87e-02

Pos logit contributions
 LGBT+0.033
gee+0.031
 homosexuals+0.031
 POS+0.031
nel+0.030
Neg logit contributions
astern-0.042
cry-0.036
forcement-0.035
blast-0.035
rots-0.034
Token to
Token ID284
Feature activation+1.27e-02

Pos logit contributions
 LGBT+0.038
gee+0.036
 homosexuals+0.035
 POS+0.035
 interf+0.035
Neg logit contributions
astern-0.062
cry-0.054
blast-0.052
forcement-0.052
rots-0.051
Token certain
Token ID1728
Feature activation-2.06e-02

Pos logit contributions
 LGBT+0.009
gee+0.009
 homosexuals+0.009
 POS+0.009
 interf+0.009
Neg logit contributions
astern-0.011
cry-0.009
forcement-0.009
blast-0.009
ensen-0.009
Token guys
Token ID3730
Feature activation+1.15e-02

Pos logit contributions
astern+0.016
cry+0.014
forcement+0.013
blast+0.013
phabet+0.013
Neg logit contributions
 LGBT-0.016
gee-0.015
 homosexuals-0.015
 POS-0.015
 interf-0.015
Token,
Token ID11
Feature activation-8.81e-03

Pos logit contributions
 LGBT+0.008
gee+0.007
 homosexuals+0.007
 POS+0.007
 interf+0.007
Neg logit contributions
astern-0.010
cry-0.009
blast-0.009
forcement-0.008
rots-0.008
Token
Token ID447
Feature activation-1.01e-02

Pos logit contributions
astern+0.008
cry+0.006
forcement+0.006
blast+0.006
rots+0.006
Neg logit contributions
 LGBT-0.006
gee-0.006
 homosexuals-0.006
 POS-0.006
 interf-0.006
Token
Token ID251
Feature activation-2.07e-02

Pos logit contributions
astern+0.009
cry+0.007
blast+0.007
forcement+0.007
rots+0.007
Neg logit contributions
 LGBT-0.007
gee-0.007
 POS-0.007
 homosexuals-0.007
nel-0.007
Token David
Token ID3271
Feature activation+4.73e-03

Pos logit contributions
astern+0.018
cry+0.015
forcement+0.015
blast+0.015
rots+0.014
Neg logit contributions
 LGBT-0.014
gee-0.014
 POS-0.014
 homosexuals-0.014
nel-0.013
Token Clarkson
Token ID40524
Feature activation-1.11e-01

Pos logit contributions
 LGBT+0.004
gee+0.004
 POS+0.004
 homosexuals+0.004
nel+0.004
Neg logit contributions
astern-0.003
cry-0.003
forcement-0.003
blast-0.003
phabet-0.003
Token said
Token ID531
Feature activation-2.22e-02

Pos logit contributions
astern+0.097
cry+0.083
blast+0.080
forcement+0.080
rots+0.078
Neg logit contributions
 LGBT-0.076
gee-0.072
 POS-0.072
 homosexuals-0.072
 interf-0.071
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
astern+0.020
cry+0.017
blast+0.017
forcement+0.017
rots+0.016
Neg logit contributions
 LGBT-0.015
 homosexuals-0.014
gee-0.014
 POS-0.014
 interf-0.014
Token
Token ID564
Feature activation+9.64e-03

Pos logit contributions
astern+0.010
cry+0.009
forcement+0.009
blast+0.009
ensen+0.009
Neg logit contributions
 LGBT-0.006
 POS-0.006
gee-0.006
 homosexuals-0.006
nel-0.006
Token suggest
Token ID1950
Feature activation+5.95e-03

Pos logit contributions
astern+0.016
cry+0.014
forcement+0.013
blast+0.013
rots+0.013
Neg logit contributions
 LGBT-0.014
gee-0.014
 homosexuals-0.014
 POS-0.014
 interf-0.014
Token that
Token ID326
Feature activation-4.38e-02

Pos logit contributions
 LGBT+0.004
gee+0.004
 homosexuals+0.004
 POS+0.004
 interf+0.004
Neg logit contributions
astern-0.005
cry-0.004
blast-0.004
forcement-0.004
rots-0.004
Token this
Token ID428
Feature activation-7.24e-04

Pos logit contributions
astern+0.038
cry+0.032
blast+0.031
forcement+0.031
rots+0.031
Neg logit contributions
 LGBT-0.030
gee-0.029
 homosexuals-0.029
 POS-0.029
 interf-0.029
Token attack
Token ID1368
Feature activation+3.47e-03

Pos logit contributions
astern+0.001
cry+0.001
forcement+0.001
blast+0.001
rots+0.001
Neg logit contributions
 LGBT-0.001
gee-0.000
 homosexuals-0.000
 POS-0.000
 interf-0.000
Token was
Token ID373
Feature activation-1.28e-02

Pos logit contributions
 LGBT+0.002
gee+0.002
 homosexuals+0.002
 POS+0.002
 interf+0.002
Neg logit contributions
astern-0.003
cry-0.003
blast-0.003
forcement-0.003
rots-0.002
Token being
Token ID852
Feature activation-3.74e-02

Pos logit contributions
astern+0.011
cry+0.010
forcement+0.010
blast+0.010
rots+0.009
Neg logit contributions
 LGBT-0.009
 homosexuals-0.008
gee-0.008
 POS-0.008
 interf-0.008
Token planned
Token ID6027
Feature activation+1.27e-02

Pos logit contributions
astern+0.032
cry+0.027
forcement+0.026
blast+0.026
rots+0.026
Neg logit contributions
 LGBT-0.027
gee-0.025
 homosexuals-0.025
 POS-0.025
 interf-0.025
Token and
Token ID290
Feature activation-1.37e-02

Pos logit contributions
 LGBT+0.008
gee+0.008
 homosexuals+0.008
 POS+0.008
 interf+0.008
Neg logit contributions
astern-0.011
cry-0.010
forcement-0.009
blast-0.009
rots-0.009
Token the
Token ID262
Feature activation+4.89e-03

Pos logit contributions
astern+0.012
cry+0.010
forcement+0.010
blast+0.010
rots+0.010
Neg logit contributions
 LGBT-0.010
 homosexuals-0.009
gee-0.009
 POS-0.009
 interf-0.009
Token investigation
Token ID3645
Feature activation-1.40e-02

Pos logit contributions
 LGBT+0.003
gee+0.003
 homosexuals+0.003
 POS+0.003
 interf+0.003
Neg logit contributions
astern-0.004
cry-0.004
forcement-0.003
blast-0.003
rots-0.003
Token had
Token ID550
Feature activation+5.85e-02

Pos logit contributions
astern+0.012
cry+0.010
forcement+0.010
blast+0.010
rots+0.009
Neg logit contributions
 LGBT-0.010
gee-0.010
 homosexuals-0.010
 POS-0.010
 interf-0.009
Token tissue
Token ID10712
Feature activation+5.88e-02

Pos logit contributions
 LGBT+0.009
gee+0.009
 homosexuals+0.009
 POS+0.009
 interf+0.009
Neg logit contributions
astern-0.015
cry-0.013
forcement-0.013
blast-0.013
rots-0.013
Token.
Token ID13
Feature activation+2.61e-02

Pos logit contributions
 LGBT+0.038
gee+0.037
 homosexuals+0.036
 POS+0.036
 interf+0.036
Neg logit contributions
astern-0.053
cry-0.046
forcement-0.045
blast-0.045
rots-0.044
Token In
Token ID554
Feature activation+7.74e-03

Pos logit contributions
 LGBT+0.017
gee+0.017
 homosexuals+0.017
 POS+0.016
 interf+0.016
Neg logit contributions
astern-0.023
cry-0.020
forcement-0.019
blast-0.019
rots-0.019
Token reality
Token ID3950
Feature activation+1.78e-03

Pos logit contributions
 LGBT+0.006
gee+0.006
 homosexuals+0.005
 POS+0.005
 interf+0.005
Neg logit contributions
astern-0.006
cry-0.005
forcement-0.005
blast-0.005
rots-0.005
Token,
Token ID11
Feature activation-6.67e-04

Pos logit contributions
 LGBT+0.001
gee+0.001
 homosexuals+0.001
 POS+0.001
 interf+0.001
Neg logit contributions
astern-0.002
cry-0.001
forcement-0.001
blast-0.001
rots-0.001
Token the
Token ID262
Feature activation-2.13e-03

Pos logit contributions
astern+0.001
cry+0.001
forcement+0.001
blast+0.001
rots+0.001
Neg logit contributions
 LGBT-0.000
gee-0.000
 homosexuals-0.000
 POS-0.000
 interf-0.000
Token tapes
Token ID23695
Feature activation+8.79e-02

Pos logit contributions
astern+0.002
cry+0.001
blast+0.001
forcement+0.001
rots+0.001
Neg logit contributions
 LGBT-0.002
gee-0.001
 homosexuals-0.001
 POS-0.001
 interf-0.001
Token show
Token ID905
Feature activation-9.10e-02

Pos logit contributions
 LGBT+0.064
gee+0.061
 homosexuals+0.061
 POS+0.060
 interf+0.060
Neg logit contributions
astern-0.074
cry-0.062
forcement-0.061
blast-0.061
rots-0.059
Token that
Token ID326
Feature activation-3.32e-03

Pos logit contributions
astern+0.085
cry+0.074
forcement+0.072
blast+0.072
rots+0.070
Neg logit contributions
 LGBT-0.057
gee-0.054
 homosexuals-0.053
 POS-0.053
 interf-0.053
Token they
Token ID484
Feature activation-1.26e-04

Pos logit contributions
astern+0.003
cry+0.003
forcement+0.003
blast+0.003
rots+0.003
Neg logit contributions
 LGBT-0.002
gee-0.002
 homosexuals-0.002
 POS-0.002
 interf-0.002
Token were
Token ID547
Feature activation+4.67e-02

Pos logit contributions
astern+0.000
cry+0.000
forcement+0.000
blast+0.000
rots+0.000
Neg logit contributions
 LGBT-0.000
gee-0.000
 homosexuals-0.000
 POS-0.000
 interf-0.000
Token by
Token ID416
Feature activation+2.42e-02

Pos logit contributions
 LGBT+0.014
gee+0.013
 homosexuals+0.013
 POS+0.013
 interf+0.013
Neg logit contributions
astern-0.017
cry-0.015
blast-0.014
forcement-0.014
ensen-0.014
Token books
Token ID3835
Feature activation-8.09e-02

Pos logit contributions
 LGBT+0.017
 homosexuals+0.016
gee+0.016
 POS+0.016
 interf+0.016
Neg logit contributions
astern-0.021
cry-0.018
blast-0.018
forcement-0.018
rots-0.017
Token she
Token ID673
Feature activation+2.31e-02

Pos logit contributions
astern+0.072
cry+0.062
forcement+0.060
blast+0.060
rots+0.059
Neg logit contributions
 LGBT-0.054
gee-0.052
 homosexuals-0.051
 POS-0.051
 interf-0.051
Token wrote
Token ID2630
Feature activation-1.26e-02

Pos logit contributions
 LGBT+0.019
gee+0.018
 homosexuals+0.018
 POS+0.018
 interf+0.018
Neg logit contributions
astern-0.018
cry-0.015
forcement-0.014
blast-0.014
phabet-0.014
Token 40
Token ID2319
Feature activation+9.45e-03

Pos logit contributions
astern+0.011
cry+0.010
forcement+0.010
blast+0.010
rots+0.009
Neg logit contributions
 LGBT-0.008
 homosexuals-0.008
gee-0.008
 interf-0.008
 POS-0.008
Token years
Token ID812
Feature activation-4.15e-03

Pos logit contributions
 LGBT+0.007
gee+0.007
 POS+0.007
 homosexuals+0.007
 interf+0.007
Neg logit contributions
astern-0.008
cry-0.006
forcement-0.006
blast-0.006
ensen-0.006
Token ago
Token ID2084
Feature activation-2.78e-02

Pos logit contributions
astern+0.003
cry+0.003
forcement+0.003
blast+0.003
ensen+0.002
Neg logit contributions
 LGBT-0.003
gee-0.003
 homosexuals-0.003
 POS-0.003
nel-0.003
Token,
Token ID11
Feature activation+1.08e-02

Pos logit contributions
astern+0.026
cry+0.022
forcement+0.021
blast+0.021
ensen+0.021
Neg logit contributions
 LGBT-0.018
gee-0.017
 homosexuals-0.017
 POS-0.017
 interf-0.017
Token and
Token ID290
Feature activation-2.86e-03

Pos logit contributions
 LGBT+0.008
 homosexuals+0.007
gee+0.007
 interf+0.007
 POS+0.007
Neg logit contributions
astern-0.009
cry-0.008
forcement-0.008
blast-0.008
rots-0.008
Token only
Token ID691
Feature activation-1.21e-02

Pos logit contributions
astern+0.002
cry+0.002
forcement+0.002
blast+0.002
ensen+0.002
Neg logit contributions
 LGBT-0.002
 homosexuals-0.002
gee-0.002
 POS-0.002
 interf-0.002
Token occasionally
Token ID10491
Feature activation+2.82e-03

Pos logit contributions
astern+0.010
cry+0.008
forcement+0.008
blast+0.008
ensen+0.008
Neg logit contributions
 LGBT-0.009
 homosexuals-0.009
gee-0.009
 POS-0.009
 interf-0.009
Token/
Token ID14
Feature activation-1.99e-02

Pos logit contributions
 LGBT+0.005
gee+0.005
 POS+0.005
 homosexuals+0.005
 interf+0.005
Neg logit contributions
astern-0.008
cry-0.007
blast-0.006
forcement-0.006
rots-0.006
Tokenany
Token ID1092
Feature activation-7.81e-02

Pos logit contributions
astern+0.015
cry+0.013
blast+0.012
forcement+0.012
rots+0.012
Neg logit contributions
 LGBT-0.016
gee-0.015
 homosexuals-0.015
 POS-0.015
 interf-0.015
Token Mode
Token ID10363
Feature activation+2.18e-02

Pos logit contributions
astern+0.071
cry+0.061
forcement+0.059
blast+0.059
rots+0.058
Neg logit contributions
 LGBT-0.051
gee-0.049
 POS-0.048
 homosexuals-0.048
 interf-0.048
Token:
Token ID25
Feature activation-1.16e-02

Pos logit contributions
 LGBT+0.015
gee+0.014
 homosexuals+0.014
 POS+0.014
 interf+0.014
Neg logit contributions
astern-0.019
cry-0.016
blast-0.016
forcement-0.016
rots-0.015
TokenMan
Token ID5124
Feature activation-5.18e-02

Pos logit contributions
astern+0.010
cry+0.008
blast+0.008
forcement+0.008
rots+0.008
Neg logit contributions
 LGBT-0.008
gee-0.008
 POS-0.008
 homosexuals-0.008
 interf-0.008
Tokenaged
Token ID1886
Feature activation-2.64e-01

Pos logit contributions
astern+0.059
cry+0.052
blast+0.051
forcement+0.051
rots+0.050
Neg logit contributions
 LGBT-0.022
gee-0.021
 homosexuals-0.020
 POS-0.020
nel-0.020
Token Access
Token ID8798
Feature activation-4.25e-02

Pos logit contributions
astern+0.232
cry+0.198
blast+0.193
forcement+0.193
rots+0.189
Neg logit contributions
 LGBT-0.180
gee-0.172
 homosexuals-0.170
 POS-0.170
 interf-0.169
Token Point
Token ID6252
Feature activation+4.65e-02

Pos logit contributions
astern+0.036
cry+0.031
forcement+0.030
blast+0.030
rots+0.029
Neg logit contributions
 LGBT-0.030
gee-0.029
 homosexuals-0.029
 POS-0.029
 interf-0.028
Token:
Token ID25
Feature activation+5.44e-03

Pos logit contributions
 LGBT+0.032
gee+0.031
 homosexuals+0.031
 POS+0.031
 interf+0.030
Neg logit contributions
astern-0.040
cry-0.034
blast-0.033
forcement-0.033
rots-0.033
Token Not
Token ID1892
Feature activation-3.37e-02

Pos logit contributions
 LGBT+0.004
gee+0.004
 homosexuals+0.004
 POS+0.004
 interf+0.004
Neg logit contributions
astern-0.005
cry-0.004
blast-0.004
forcement-0.004
rots-0.004
Token-
Token ID12
Feature activation-3.52e-03

Pos logit contributions
astern+0.029
cry+0.025
blast+0.024
forcement+0.024
rots+0.024
Neg logit contributions
 LGBT-0.024
gee-0.023
 POS-0.022
 homosexuals-0.022
 interf-0.022
Token
Token ID447
Feature activation-5.04e-02

Pos logit contributions
astern+0.019
cry+0.016
forcement+0.016
blast+0.015
phabet+0.015
Neg logit contributions
 LGBT-0.020
gee-0.020
 homosexuals-0.020
 POS-0.019
nel-0.019
Token
Token ID250
Feature activation-1.95e-02

Pos logit contributions
astern+0.044
cry+0.038
blast+0.037
forcement+0.036
ensen+0.036
Neg logit contributions
 LGBT-0.034
gee-0.033
 homosexuals-0.033
 POS-0.033
nel-0.032
TokenCash
Token ID35361
Feature activation+6.48e-03

Pos logit contributions
astern+0.016
cry+0.013
forcement+0.013
blast+0.013
ensen+0.013
Neg logit contributions
 LGBT-0.014
gee-0.014
 POS-0.014
 homosexuals-0.014
nel-0.014
Token-
Token ID12
Feature activation-5.67e-02

Pos logit contributions
 LGBT+0.005
gee+0.004
 POS+0.004
 homosexuals+0.004
nel+0.004
Neg logit contributions
astern-0.005
cry-0.005
forcement-0.004
blast-0.004
rots-0.004
TokenSe
Token ID4653
Feature activation-6.79e-02

Pos logit contributions
astern+0.046
cry+0.039
forcement+0.038
blast+0.038
rots+0.037
Neg logit contributions
 LGBT-0.042
gee-0.040
 homosexuals-0.040
 POS-0.040
nel-0.039
Tokeneking
Token ID18754
Feature activation-3.39e-01

Pos logit contributions
astern+0.038
cry+0.030
blast+0.029
forcement+0.029
rots+0.027
Neg logit contributions
 LGBT-0.067
gee-0.065
 homosexuals-0.065
 POS-0.065
nel-0.064
Token Democratic
Token ID4390
Feature activation+5.66e-02

Pos logit contributions
astern+0.288
cry+0.244
forcement+0.238
blast+0.238
rots+0.232
Neg logit contributions
 LGBT-0.243
gee-0.231
 POS-0.231
 homosexuals-0.230
nel-0.228
Token Senator
Token ID8962
Feature activation-4.47e-02

Pos logit contributions
 LGBT+0.043
 POS+0.041
gee+0.041
 homosexuals+0.041
nel+0.040
Neg logit contributions
astern-0.045
cry-0.038
forcement-0.037
blast-0.037
rots-0.036
Tokenial
Token ID498
Feature activation+3.42e-02

Pos logit contributions
astern+0.039
cry+0.033
blast+0.032
forcement+0.032
ensen+0.032
Neg logit contributions
 LGBT-0.031
 POS-0.029
gee-0.029
 homosexuals-0.029
nel-0.029
Token Campaign
Token ID13718
Feature activation-7.45e-02

Pos logit contributions
 LGBT+0.025
 POS+0.024
gee+0.024
 homosexuals+0.024
nel+0.023
Neg logit contributions
astern-0.029
cry-0.024
forcement-0.024
blast-0.024
rots-0.023
Token Committee
Token ID4606
Feature activation+8.21e-02

Pos logit contributions
astern+0.065
cry+0.056
forcement+0.054
blast+0.054
rots+0.053
Neg logit contributions
 LGBT-0.051
gee-0.048
 homosexuals-0.048
 POS-0.048
nel-0.048
Token Dr
Token ID1583
Feature activation-6.70e-03

Pos logit contributions
astern+0.031
cry+0.026
blast+0.026
forcement+0.026
rots+0.025
Neg logit contributions
 LGBT-0.026
gee-0.025
 homosexuals-0.025
 POS-0.025
 interf-0.025
Token Rune
Token ID28228
Feature activation+8.90e-02

Pos logit contributions
astern+0.006
cry+0.005
blast+0.005
forcement+0.005
rots+0.004
Neg logit contributions
 LGBT-0.005
gee-0.005
 POS-0.005
 homosexuals-0.005
nel-0.005
Token Fl
Token ID1610
Feature activation-9.61e-02

Pos logit contributions
 LGBT+0.062
gee+0.060
 homosexuals+0.059
 POS+0.059
nel+0.059
Neg logit contributions
astern-0.076
cry-0.065
forcement-0.063
blast-0.063
rots-0.062
Tokenober
Token ID2023
Feature activation+7.27e-02

Pos logit contributions
astern+0.079
cry+0.067
forcement+0.065
blast+0.065
rots+0.063
Neg logit contributions
 LGBT-0.070
gee-0.067
nel-0.067
 homosexuals-0.066
 interf-0.066
Tokengh
Token ID456
Feature activation+1.41e-02

Pos logit contributions
 LGBT+0.052
gee+0.050
nel+0.050
 POS+0.049
 homosexuals+0.049
Neg logit contributions
astern-0.061
cry-0.052
forcement-0.050
blast-0.050
rots-0.049
Tokenagen
Token ID11286
Feature activation-2.63e-01

Pos logit contributions
 LGBT+0.009
gee+0.009
 homosexuals+0.009
 POS+0.009
nel+0.009
Neg logit contributions
astern-0.013
cry-0.011
forcement-0.010
blast-0.010
rots-0.010
Token.
Token ID13
Feature activation-8.94e-03

Pos logit contributions
astern+0.234
cry+0.201
forcement+0.195
blast+0.195
rots+0.191
Neg logit contributions
 LGBT-0.176
gee-0.168
 homosexuals-0.166
 POS-0.166
nel-0.166
Token
Token ID198
Feature activation-3.78e-02

Pos logit contributions
astern+0.009
cry+0.008
forcement+0.008
blast+0.008
rots+0.008
Neg logit contributions
 LGBT-0.005
gee-0.005
 homosexuals-0.005
 POS-0.005
nel-0.005
Token
Token ID198
Feature activation-3.86e-02

Pos logit contributions
astern+0.038
cry+0.033
forcement+0.032
blast+0.032
rots+0.032
Neg logit contributions
 LGBT-0.021
gee-0.020
 homosexuals-0.020
 POS-0.020
nel-0.019
TokenMod
Token ID5841
Feature activation-7.68e-02

Pos logit contributions
astern+0.032
cry+0.027
forcement+0.026
blast+0.026
rots+0.025
Neg logit contributions
 LGBT-0.029
gee-0.028
 homosexuals-0.027
 POS-0.027
nel-0.027
Tokenelling
Token ID9417
Feature activation+1.28e-01

Pos logit contributions
astern+0.075
cry+0.066
forcement+0.064
blast+0.064
rots+0.063
Neg logit contributions
 LGBT-0.044
gee-0.042
 POS-0.041
nel-0.041
 homosexuals-0.041
Token if
Token ID611
Feature activation-4.80e-02

Pos logit contributions
astern+0.150
cry+0.130
forcement+0.126
blast+0.126
rots+0.123
Neg logit contributions
 LGBT-0.108
gee-0.102
 homosexuals-0.102
 POS-0.102
 interf-0.101
Token there
Token ID612
Feature activation-7.01e-02

Pos logit contributions
astern+0.043
cry+0.037
blast+0.036
forcement+0.036
rots+0.035
Neg logit contributions
 LGBT-0.032
gee-0.031
 homosexuals-0.030
 POS-0.030
 interf-0.030
Token are
Token ID389
Feature activation-1.68e-02

Pos logit contributions
astern+0.060
cry+0.052
forcement+0.050
blast+0.050
rots+0.049
Neg logit contributions
 LGBT-0.049
gee-0.047
 homosexuals-0.046
 POS-0.046
 interf-0.046
Token no
Token ID645
Feature activation-2.81e-02

Pos logit contributions
astern+0.015
cry+0.013
forcement+0.013
blast+0.013
ensen+0.012
Neg logit contributions
 LGBT-0.011
gee-0.010
 homosexuals-0.010
 POS-0.010
nel-0.010
Token que
Token ID8358
Feature activation-3.07e-02

Pos logit contributions
astern+0.024
cry+0.020
blast+0.019
forcement+0.019
rots+0.019
Neg logit contributions
 LGBT-0.020
gee-0.019
 homosexuals-0.019
 POS-0.019
 interf-0.019
Tokenuing
Token ID4250
Feature activation-3.45e-01

Pos logit contributions
astern+0.024
cry+0.020
blast+0.020
forcement+0.019
rots+0.019
Neg logit contributions
 LGBT-0.024
gee-0.023
 homosexuals-0.023
 POS-0.023
 interf-0.023
Token delays
Token ID16119
Feature activation+1.82e-02

Pos logit contributions
astern+0.279
cry+0.237
blast+0.229
forcement+0.229
rots+0.223
Neg logit contributions
 LGBT-0.259
gee-0.248
 homosexuals-0.247
 POS-0.247
 interf-0.245
Token and
Token ID290
Feature activation-2.16e-02

Pos logit contributions
 LGBT+0.012
gee+0.011
 homosexuals+0.011
 POS+0.011
 interf+0.011
Neg logit contributions
astern-0.016
cry-0.014
blast-0.014
forcement-0.014
rots-0.013
Token no
Token ID645
Feature activation-5.43e-02

Pos logit contributions
astern+0.019
cry+0.016
blast+0.016
forcement+0.016
rots+0.016
Neg logit contributions
 LGBT-0.015
gee-0.014
 homosexuals-0.014
 POS-0.014
 interf-0.014
Token processing
Token ID7587
Feature activation-5.34e-02

Pos logit contributions
astern+0.046
cry+0.039
blast+0.038
forcement+0.038
rots+0.037
Neg logit contributions
 LGBT-0.039
gee-0.037
 homosexuals-0.037
 POS-0.037
 interf-0.037
Token delays
Token ID16119
Feature activation+5.12e-02

Pos logit contributions
astern+0.044
cry+0.037
forcement+0.036
blast+0.036
rots+0.035
Neg logit contributions
 LGBT-0.039
gee-0.038
 homosexuals-0.037
 POS-0.037
 interf-0.037
Token Scroll
Token ID17428
Feature activation-4.50e-02

Pos logit contributions
astern+0.003
cry+0.003
forcement+0.002
blast+0.002
rots+0.002
Neg logit contributions
 LGBT-0.003
gee-0.003
 POS-0.003
 homosexuals-0.003
nel-0.003
Token down
Token ID866
Feature activation-9.70e-03

Pos logit contributions
astern+0.031
cry+0.025
blast+0.024
forcement+0.024
rots+0.024
Neg logit contributions
 LGBT-0.039
gee-0.038
 homosexuals-0.038
 POS-0.037
 interf-0.037
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
astern+0.009
cry+0.008
blast+0.007
forcement+0.007
rots+0.007
Neg logit contributions
 LGBT-0.006
gee-0.006
 homosexuals-0.006
 POS-0.006
 interf-0.006
Token click
Token ID3904
Feature activation-4.39e-02

Pos logit contributions
 LGBT+0.008
gee+0.007
 homosexuals+0.007
 POS+0.007
 interf+0.007
Neg logit contributions
astern-0.008
cry-0.007
forcement-0.007
blast-0.007
rots-0.007
Token on
Token ID319
Feature activation+7.94e-02

Pos logit contributions
astern+0.039
cry+0.033
blast+0.032
forcement+0.032
rots+0.031
Neg logit contributions
 LGBT-0.030
gee-0.028
 homosexuals-0.028
 POS-0.028
 interf-0.028
Token Vide
Token ID42551
Feature activation-2.55e-01

Pos logit contributions
 LGBT+0.054
gee+0.051
 homosexuals+0.050
 POS+0.050
 interf+0.050
Neg logit contributions
astern-0.071
cry-0.061
blast-0.059
forcement-0.059
rots-0.058
Tokenopl
Token ID20106
Feature activation+5.00e-02

Pos logit contributions
astern+0.209
cry+0.177
forcement+0.172
blast+0.172
rots+0.166
Neg logit contributions
 LGBT-0.188
gee-0.181
 homosexuals-0.179
 POS-0.179
nel-0.178
Tokenayer
Token ID2794
Feature activation-2.15e-02

Pos logit contributions
 LGBT+0.035
gee+0.033
 homosexuals+0.033
 POS+0.032
nel+0.032
Neg logit contributions
astern-0.043
cry-0.037
forcement-0.036
blast-0.036
rots-0.035
Token Input
Token ID23412
Feature activation-1.50e-02

Pos logit contributions
astern+0.019
cry+0.016
forcement+0.016
blast+0.016
rots+0.015
Neg logit contributions
 LGBT-0.015
gee-0.014
 POS-0.014
 homosexuals-0.014
 interf-0.014
TokenStream
Token ID12124
Feature activation+1.18e-01

Pos logit contributions
astern+0.013
cry+0.011
blast+0.011
forcement+0.011
rots+0.010
Neg logit contributions
 LGBT-0.011
gee-0.010
 POS-0.010
 homosexuals-0.010
 interf-0.010
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
 LGBT+0.084
gee+0.080
 POS+0.079
 homosexuals+0.079
 interf+0.079
Neg logit contributions
astern-0.100
cry-0.085
blast-0.082
forcement-0.082
rots-0.081