TOP ACTIVATIONS
MAX = 7.713

plot details ? Drop some comments down below , share with
on Barrett gave me useful comments to improve the project description
who come over and leave comments with ideas , questions and
A Comment The comments to this entry have been
r ah in ah ib rahim , r ai han must
R S enth il Kumar / Press Trust of India
story , and Russell declined comment .) After a
old myths . Though her comments about him have been hed
process , so I cant comment much on the topic .
opus ." His comments caused confusion among fans ,
ent s past comments have been referenced as an
campaign ), called Milo 's comments in the video " very
Flake tweeted Saturday after his comments began to attract media attention
Syrian Benghazi , said Aus ama Mon aj ed
washed , Schwartz said , referring to the White
t remember what they said . ( Jones was not
someone a link with nasty comments from people with bad manners
2006 , CEO Henry Paul son left the firm to join
The Fox ", that Wes Anderson would adapt into a stop
Mc All ister said . He told

MIN ACTIVATIONS
MIN = -16.152

all the spoil thereof , shalt thou take unto thy self
unto thy hands , thou shalt sm ite every male therein
| count | | ---- -+ -------------------------------- + ------- | |
| count | | -------------------- -+ -------- -+ ------- | |
| | -------------------- -+ -------- -+ ------- | | org .
Moore s Prom ethe a , serial ized from
E ad gar A ethe ling fled to Scotland with
is required is simple . Cheap customer engines and restrictions on
those that are in the thro es of making the decision
. E ad gar A ethe ling and his supporters joined
( 3 ) Break 11 test chamber monitors (
, 2011 ). Temperature feedback s in effect
68 E ad gar A ethe ling , the last surviving
Geek Z odiac THE G EEK Z OD I
least 26 of 27 : THE FIRST PART IS E ASY
6 . 0 ( h undred times the major version plus
Her so - called urban thru - h ikes have st
igan 's surviving at the outer edges of Mont rose for
25 percent of GW Batt leground respondents saying they had a
If you prefer to sail yourself , the cruising grounds

INTERVAL 1.747 to 7.713
CONTAINS 14.982%

, Pay Attention to Your Partner .
Alfred is the Wayne family but ler ? Has been so
sometimes the goalkeeper , Jasper C ill essen , who started
trade also reun ites Fel ton and Thomas with center Marcus
public instead of anonymous makes someone immoral and selfish , I

INTERVAL -4.220 to 1.747
CONTAINS 84.233%

driving the vehicle that had it installed , according to court
est link in each direction ). Here
brainer human rights treaties , undermined the International Criminal Court ,
- woman ] and gay rights activist had received th rea
pulled a hood y tightly around her head to reveal only

INTERVAL -10.186 to -4.220
CONTAINS 0.723%

gallons of water off the runway each year . To give
center . When c es ium comes
in addition to plant ain flour . If you are buying
Assad and for embracing a warmer relationship with China .
the story had been record - breaking

INTERVAL -16.152 to -10.186
CONTAINS 0.062%

Token plot
Token ID7110
Feature activation+6.59e-01

Pos logit contributions
 said+0.149
 commented+0.141
 saying+0.130
 noted+0.127
 remark+0.127
Neg logit contributions
-0.190
-0.154
��-0.141
��-0.134
Temperature-0.133
Token details
Token ID3307
Feature activation+4.09e+00

Pos logit contributions
 said+0.075
 commented+0.071
 saying+0.065
 noted+0.064
 remark+0.064
Neg logit contributions
-0.084
-0.067
��-0.060
��-0.057
-0.057
Token?
Token ID30
Feature activation-5.10e-01

Pos logit contributions
 said+0.507
 commented+0.484
 saying+0.450
 remark+0.450
 comments+0.448
Neg logit contributions
-0.471
-0.363
��-0.317
-0.294
Temperature-0.294
Token Drop
Token ID14258
Feature activation-3.59e+00

Pos logit contributions
+0.055
+0.041
��+0.036
Temperature+0.035
▬▬+0.034
Neg logit contributions
 said-0.067
 commented-0.063
 saying-0.060
 noted-0.060
 remark-0.060
Token some
Token ID617
Feature activation+1.87e+00

Pos logit contributions
+0.442
+0.354
��+0.310
��+0.294
+0.290
Neg logit contributions
 said-0.407
 commented-0.389
 noted-0.360
 saying-0.360
 admitted-0.358
Token comments
Token ID3651
Feature activation+7.71e+00

Pos logit contributions
 said+0.122
 commented+0.110
 noted+0.098
 saying+0.096
 admitted+0.095
Neg logit contributions
-0.318
-0.269
��-0.246
Temperature-0.242
��-0.240
Token down
Token ID866
Feature activation-7.79e-01

Pos logit contributions
 said+0.864
 commented+0.809
 admitted+0.747
 noted+0.746
 remark+0.742
Neg logit contributions
-0.993
-0.778
��-0.692
Temperature-0.664
||||-0.661
Token below
Token ID2174
Feature activation-2.10e-01

Pos logit contributions
+0.096
+0.075
��+0.067
Temperature+0.064
||||+0.064
Neg logit contributions
 said-0.089
 commented-0.084
 admitted-0.078
 saying-0.078
 remark-0.077
Token,
Token ID11
Feature activation-1.96e-01

Pos logit contributions
+0.026
+0.020
��+0.018
Temperature+0.017
||||+0.017
Neg logit contributions
 said-0.025
 commented-0.023
 admitted-0.021
 noted-0.021
 remark-0.021
Token share
Token ID2648
Feature activation+2.16e+00

Pos logit contributions
+0.027
+0.022
��+0.020
▬▬+0.019
Temperature+0.019
Neg logit contributions
 said-0.019
 commented-0.018
 admitted-0.017
 noted-0.016
 cited-0.016
Token with
Token ID351
Feature activation+7.17e-01

Pos logit contributions
 said+0.266
 commented+0.253
 saying+0.236
 admitted+0.235
 noted+0.235
Neg logit contributions
-0.248
-0.190
��-0.170
Temperature-0.161
▬▬-0.159
Tokenon
Token ID261
Feature activation+1.74e+00

Pos logit contributions
+0.100
+0.079
��+0.070
Temperature+0.069
||||+0.067
Neg logit contributions
 said-0.084
 commented-0.080
 saying-0.074
 noted-0.073
 remark-0.072
Token Barrett
Token ID26971
Feature activation+4.51e+00

Pos logit contributions
 said+0.167
 commented+0.158
 saying+0.149
 remark+0.147
 comments+0.143
Neg logit contributions
-0.250
-0.201
��-0.182
Temperature-0.175
▬▬-0.173
Token gave
Token ID2921
Feature activation+2.51e+00

Pos logit contributions
 said+0.511
 commented+0.482
 saying+0.463
 remark+0.452
 noted+0.445
Neg logit contributions
-0.563
-0.436
��-0.396
Temperature-0.382
▬▬-0.376
Token me
Token ID502
Feature activation+1.88e+00

Pos logit contributions
 said+0.259
 commented+0.245
 saying+0.226
 noted+0.222
 admitted+0.221
Neg logit contributions
-0.340
-0.273
��-0.240
Temperature-0.237
▬▬-0.237
Token useful
Token ID4465
Feature activation+9.45e-02

Pos logit contributions
 said+0.164
 commented+0.153
 saying+0.137
 noted+0.136
 admitted+0.134
Neg logit contributions
-0.282
-0.233
▬▬-0.209
��-0.208
Temperature-0.207
Token comments
Token ID3651
Feature activation+7.71e+00

Pos logit contributions
 said+0.008
 commented+0.007
 saying+0.006
 noted+0.006
 admitted+0.006
Neg logit contributions
-0.015
-0.012
��-0.011
Temperature-0.011
▬▬-0.011
Token to
Token ID284
Feature activation-6.12e-01

Pos logit contributions
 said+0.735
 commented+0.684
 admitted+0.617
 noted+0.616
 remark+0.608
Neg logit contributions
-1.099
-0.888
��-0.794
の�-0.767
Temperature-0.767
Token improve
Token ID2987
Feature activation-1.12e+00

Pos logit contributions
+0.085
+0.069
��+0.062
Temperature+0.061
▬▬+0.060
Neg logit contributions
 said-0.059
 commented-0.057
 admitted-0.051
 saying-0.051
 noted-0.051
Token the
Token ID262
Feature activation-7.36e-01

Pos logit contributions
+0.148
+0.117
��+0.105
Temperature+0.103
||||+0.102
Neg logit contributions
 said-0.119
 commented-0.112
 saying-0.105
 admitted-0.103
 noted-0.103
Token project
Token ID1628
Feature activation+3.04e+00

Pos logit contributions
+0.105
+0.086
��+0.078
Temperature+0.077
▬▬+0.075
Neg logit contributions
 said-0.069
 commented-0.064
 saying-0.059
 noted-0.058
 admitted-0.058
Token description
Token ID6764
Feature activation+4.33e+00

Pos logit contributions
 said+0.324
 commented+0.307
 saying+0.280
 admitted+0.280
 noted+0.279
Neg logit contributions
-0.399
-0.313
Temperature-0.278
��-0.278
��-0.270
Token who
Token ID508
Feature activation+4.21e+00

Pos logit contributions
+0.109
+0.090
▬▬+0.082
Temperature+0.082
��+0.082
Neg logit contributions
 said-0.046
 commented-0.041
 noted-0.036
 admitted-0.036
 remark-0.036
Token come
Token ID1282
Feature activation-1.17e+00

Pos logit contributions
 said+0.385
 commented+0.350
 saying+0.338
 remark+0.323
 noted+0.322
Neg logit contributions
-0.617
-0.503
��-0.449
▬▬-0.442
Temperature-0.440
Token over
Token ID625
Feature activation-2.19e-01

Pos logit contributions
+0.150
+0.119
��+0.105
||||+0.102
▬▬+0.102
Neg logit contributions
 said-0.130
 commented-0.121
 noted-0.112
 saying-0.112
 admitted-0.111
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
+0.029
+0.023
��+0.020
Temperature+0.020
▬▬+0.020
Neg logit contributions
 said-0.024
 commented-0.022
 remark-0.020
 noted-0.020
 saying-0.020
Token leave
Token ID2666
Feature activation+6.02e-02

Pos logit contributions
 said+0.122
 commented+0.112
 saying+0.106
 noted+0.104
 comments+0.103
Neg logit contributions
-0.163
-0.131
��-0.117
Temperature-0.115
▬▬-0.114
Token comments
Token ID3651
Feature activation+7.70e+00

Pos logit contributions
 said+0.005
 commented+0.005
 noted+0.004
 admitted+0.004
 saying+0.004
Neg logit contributions
-0.009
-0.007
��-0.007
▬▬-0.007
Temperature-0.007
Token with
Token ID351
Feature activation+7.40e-01

Pos logit contributions
 said+0.774
 commented+0.719
 noted+0.656
 admitted+0.654
 remark+0.651
Neg logit contributions
-1.079
-0.868
��-0.782
▬▬-0.752
Temperature-0.750
Token ideas
Token ID4213
Feature activation+3.60e+00

Pos logit contributions
 said+0.068
 commented+0.062
 saying+0.057
 noted+0.057
 admitted+0.056
Neg logit contributions
-0.108
-0.088
��-0.080
▬▬-0.078
Temperature-0.078
Token,
Token ID11
Feature activation-1.60e-01

Pos logit contributions
 said+0.378
 commented+0.354
 saying+0.324
 noted+0.322
 remark+0.322
Neg logit contributions
-0.491
-0.391
��-0.353
▬▬-0.339
Temperature-0.338
Token questions
Token ID2683
Feature activation+4.65e+00

Pos logit contributions
+0.026
+0.022
��+0.020
▬▬+0.020
Temperature+0.020
Neg logit contributions
 said-0.012
 commented-0.011
 saying-0.010
 admitted-0.010
 noted-0.010
Token and
Token ID290
Feature activation+1.24e+00

Pos logit contributions
 said+0.571
 commented+0.540
 saying+0.506
 noted+0.500
 admitted+0.497
Neg logit contributions
-0.556
-0.428
��-0.378
Temperature-0.360
▬▬-0.356
Token A
Token ID317
Feature activation+1.90e+00

Pos logit contributions
+0.309
+0.254
��+0.227
▬▬+0.224
Temperature+0.224
Neg logit contributions
 said-0.166
 commented-0.152
 saying-0.140
 admitted-0.138
 noted-0.137
Token Comment
Token ID18957
Feature activation+5.79e+00

Pos logit contributions
 said+0.137
 commented+0.120
 admitted+0.111
 saying+0.109
 noted+0.108
Neg logit contributions
-0.318
-0.265
��-0.245
Temperature-0.240
▬▬-0.240
Token
Token ID198
Feature activation+7.92e-01

Pos logit contributions
 said+0.754
 commented+0.714
 saying+0.673
 remark+0.673
 noted+0.668
Neg logit contributions
-0.638
-0.475
��-0.413
Temperature-0.399
▬▬-0.394
Token
Token ID198
Feature activation+7.80e-01

Pos logit contributions
 said+0.097
 commented+0.092
 saying+0.087
 noted+0.085
 remark+0.085
Neg logit contributions
-0.093
-0.071
��-0.064
Temperature-0.063
▬▬-0.060
TokenThe
Token ID464
Feature activation-3.26e+00

Pos logit contributions
 said+0.111
 commented+0.106
 saying+0.101
 noted+0.099
 remark+0.099
Neg logit contributions
-0.078
-0.057
��-0.050
Temperature-0.050
▬▬-0.047
Token comments
Token ID3651
Feature activation+7.69e+00

Pos logit contributions
+0.463
+0.377
��+0.346
Temperature+0.336
▬▬+0.329
Neg logit contributions
 said-0.314
 commented-0.291
 saying-0.269
 admitted-0.263
 noted-0.263
Token to
Token ID284
Feature activation-6.27e-01

Pos logit contributions
 said+0.807
 commented+0.755
 saying+0.702
 remark+0.696
 admitted+0.688
Neg logit contributions
-1.032
-0.824
��-0.735
▬▬-0.709
Temperature-0.709
Token this
Token ID428
Feature activation+5.25e-01

Pos logit contributions
+0.084
+0.067
��+0.061
Temperature+0.059
▬▬+0.058
Neg logit contributions
 said-0.065
 commented-0.061
 saying-0.057
 noted-0.056
 admitted-0.055
Token entry
Token ID5726
Feature activation+8.99e-01

Pos logit contributions
 said+0.045
 commented+0.041
 saying+0.038
 admitted+0.037
 noted+0.037
Neg logit contributions
-0.080
-0.065
��-0.060
Temperature-0.058
▬▬-0.058
Token have
Token ID423
Feature activation+6.13e-01

Pos logit contributions
 said+0.120
 commented+0.115
 saying+0.108
 remark+0.107
 noted+0.106
Neg logit contributions
-0.096
-0.071
��-0.063
Temperature-0.058
▬▬-0.058
Token been
Token ID587
Feature activation+8.65e-01

Pos logit contributions
 said+0.067
 commented+0.063
 saying+0.060
 remark+0.059
say+0.058
Neg logit contributions
-0.079
-0.063
��-0.056
Temperature-0.054
-0.054
Token r
Token ID374
Feature activation-1.74e-01

Pos logit contributions
+0.103
+0.082
��+0.075
��+0.072
+0.071
Neg logit contributions
 said-0.079
 commented-0.076
 saying-0.070
 remark-0.068
 noted-0.068
Tokenah
Token ID993
Feature activation+1.53e+00

Pos logit contributions
+0.024
+0.019
��+0.018
��+0.017
Temperature+0.017
Neg logit contributions
 said-0.018
 commented-0.017
 remark-0.016
 saying-0.016
 noted-0.016
Tokenin
Token ID259
Feature activation+2.09e+00

Pos logit contributions
 said+0.132
 commented+0.122
 saying+0.111
 noted+0.109
 remark+0.108
Neg logit contributions
-0.241
-0.197
��-0.183
Temperature-0.175
��-0.174
Tokenah
Token ID993
Feature activation+1.60e+00

Pos logit contributions
 said+0.215
 commented+0.203
 saying+0.188
 noted+0.185
 remark+0.184
Neg logit contributions
-0.289
-0.232
��-0.212
Temperature-0.201
▬▬-0.200
Token ib
Token ID24283
Feature activation+1.13e-01

Pos logit contributions
 said+0.198
 commented+0.190
 saying+0.179
 noted+0.175
 remark+0.174
Neg logit contributions
-0.188
-0.141
��-0.128
Temperature-0.121
��-0.120
Tokenrahim
Token ID26922
Feature activation+7.68e+00

Pos logit contributions
 said+0.010
 commented+0.009
 saying+0.009
 remark+0.008
 comments+0.008
Neg logit contributions
-0.017
-0.014
��-0.012
-0.012
Temperature-0.012
Token,
Token ID11
Feature activation-7.71e-01

Pos logit contributions
 said+1.000
 commented+0.954
 saying+0.902
 noted+0.884
 remark+0.882
Neg logit contributions
-0.856
-0.641
��-0.574
��-0.543
Temperature-0.541
Token r
Token ID374
Feature activation-1.71e-01

Pos logit contributions
+0.104
+0.083
��+0.076
��+0.073
+0.072
Neg logit contributions
 said-0.079
 commented-0.076
 saying-0.069
 remark-0.068
 noted-0.068
Tokenai
Token ID1872
Feature activation+2.08e+00

Pos logit contributions
+0.024
+0.019
��+0.017
Temperature+0.017
��+0.017
Neg logit contributions
 said-0.017
 commented-0.016
 saying-0.015
 remark-0.015
 noted-0.015
Tokenhan
Token ID7637
Feature activation+3.75e+00

Pos logit contributions
 said+0.194
 commented+0.184
 saying+0.168
 noted+0.164
 remark+0.163
Neg logit contributions
-0.312
-0.254
��-0.231
▬▬-0.220
Temperature-0.220
Token must
Token ID1276
Feature activation-2.74e+00

Pos logit contributions
 said+0.434
 commented+0.414
 saying+0.385
 noted+0.377
 comments+0.376
Neg logit contributions
-0.472
-0.365
��-0.333
Temperature-0.316
▬▬-0.314
Token
Token ID198
Feature activation+3.01e-01

Pos logit contributions
 said+0.036
 commented+0.035
 saying+0.033
 noted+0.032
 comments+0.032
Neg logit contributions
-0.035
-0.027
��-0.024
Temperature-0.024
▬▬-0.023
TokenR
Token ID49
Feature activation+1.35e+00

Pos logit contributions
 said+0.044
 commented+0.042
 saying+0.040
 remark+0.040
 noted+0.040
Neg logit contributions
-0.029
-0.021
Temperature-0.018
��-0.018
These-0.017
Token S
Token ID311
Feature activation+2.71e+00

Pos logit contributions
 said+0.127
 commented+0.120
 saying+0.111
 noted+0.109
 remark+0.109
Neg logit contributions
-0.194
-0.155
��-0.144
Temperature-0.137
��-0.136
Tokenenth
Token ID7944
Feature activation-1.54e+00

Pos logit contributions
 said+0.324
 commented+0.309
 saying+0.290
 comments+0.286
 noted+0.284
Neg logit contributions
-0.334
-0.257
��-0.232
Temperature-0.224
��-0.216
Tokenil
Token ID346
Feature activation+2.06e-01

Pos logit contributions
+0.197
+0.154
��+0.138
Temperature+0.136
��+0.131
Neg logit contributions
 said-0.172
 commented-0.164
 saying-0.154
 remark-0.151
 noted-0.150
Token Kumar
Token ID26105
Feature activation+7.47e+00

Pos logit contributions
 said+0.010
 commented+0.009
 saying+0.007
 remark+0.007
 noted+0.007
Neg logit contributions
-0.039
-0.033
Temperature-0.031
��-0.030
の�-0.030
Token/
Token ID14
Feature activation-1.68e+00

Pos logit contributions
 said+1.135
 commented+1.087
 saying+1.040
 remark+1.026
 comments+1.020
Neg logit contributions
-0.670
-0.461
��-0.394
Temperature-0.366
��-0.356
Token Press
Token ID4332
Feature activation+1.03e+00

Pos logit contributions
+0.173
+0.125
��+0.109
Temperature+0.108
▬▬+0.104
Neg logit contributions
 said-0.233
 commented-0.226
 saying-0.213
 remark-0.211
 comments-0.211
Token Trust
Token ID9870
Feature activation+2.09e+00

Pos logit contributions
 said+0.090
 commented+0.083
 saying+0.077
 comments+0.074
 noted+0.074
Neg logit contributions
-0.158
-0.130
��-0.121
Temperature-0.118
��-0.116
Token of
Token ID286
Feature activation-1.76e+00

Pos logit contributions
 said+0.275
 commented+0.260
 saying+0.246
 noted+0.242
 comments+0.242
Neg logit contributions
-0.232
-0.175
��-0.158
Temperature-0.149
��-0.146
Token India
Token ID3794
Feature activation-1.42e+00

Pos logit contributions
+0.198
+0.150
��+0.133
Temperature+0.129
▬▬+0.125
Neg logit contributions
 said-0.227
 commented-0.216
 saying-0.204
 comments-0.200
 remark-0.200
Token story
Token ID1621
Feature activation+3.78e+00

Pos logit contributions
 said+0.040
 commented+0.037
 saying+0.033
 admitted+0.032
 noted+0.031
Neg logit contributions
-0.102
-0.085
��-0.078
��-0.077
Temperature-0.077
Token,
Token ID11
Feature activation-1.49e-01

Pos logit contributions
 said+0.412
 commented+0.392
 saying+0.357
 remark+0.356
 comments+0.355
Neg logit contributions
-0.492
-0.386
��-0.339
Temperature-0.333
��-0.332
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
+0.023
+0.019
の�+0.017
��+0.017
��+0.017
Neg logit contributions
 said-0.012
 commented-0.011
 comments-0.010
 admitted-0.010
 noted-0.010
Token Russell
Token ID11563
Feature activation+4.42e+00

Pos logit contributions
 said+0.102
 commented+0.098
 saying+0.090
 remark+0.088
 comments+0.086
Neg logit contributions
-0.182
-0.149
��-0.133
Temperature-0.132
の�-0.131
Token declined
Token ID7392
Feature activation+3.41e+00

Pos logit contributions
 said+0.418
 commented+0.396
 saying+0.377
 remark+0.367
 comments+0.363
Neg logit contributions
-0.624
-0.505
��-0.441
-0.440
の�-0.437
Token comment
Token ID2912
Feature activation+7.31e+00

Pos logit contributions
 said+0.369
 commented+0.346
 noted+0.320
 saying+0.317
 admitted+0.315
Neg logit contributions
-0.453
-0.359
��-0.316
Temperature-0.306
��-0.305
Token.)
Token ID2014
Feature activation+3.96e-01

Pos logit contributions
 said+0.752
 commented+0.709
 noted+0.637
 saying+0.636
 admitted+0.636
Neg logit contributions
-1.003
-0.798
��-0.719
-0.689
の�-0.688
Token
Token ID198
Feature activation+7.24e-01

Pos logit contributions
 said+0.048
 commented+0.047
 remark+0.044
 saying+0.044
 commenting+0.043
Neg logit contributions
-0.046
-0.036
Temperature-0.030
の�-0.030
These-0.030
Token
Token ID198
Feature activation+6.70e-01

Pos logit contributions
 said+0.089
 commented+0.084
 saying+0.079
 noted+0.078
 comments+0.078
Neg logit contributions
-0.086
-0.066
��-0.060
Temperature-0.058
▬▬-0.056
TokenAfter
Token ID3260
Feature activation-1.43e+00

Pos logit contributions
 said+0.095
 commented+0.092
 saying+0.088
 comments+0.087
 remark+0.087
Neg logit contributions
-0.066
-0.049
Temperature-0.043
These-0.040
��-0.040
Token a
Token ID257
Feature activation+3.11e+00

Pos logit contributions
+0.197
+0.160
��+0.141
Temperature+0.139
▬▬+0.135
Neg logit contributions
 said-0.144
 commented-0.139
 noted-0.123
 comments-0.122
 admitted-0.122
Token old
Token ID1468
Feature activation-2.84e-01

Pos logit contributions
+0.149
+0.116
��+0.107
Temperature+0.102
▬▬+0.101
Neg logit contributions
 said-0.139
 commented-0.134
 saying-0.121
 noted-0.120
 admitted-0.119
Token myths
Token ID22886
Feature activation+1.45e+00

Pos logit contributions
+0.037
+0.030
��+0.027
Temperature+0.026
▬▬+0.026
Neg logit contributions
 said-0.031
 commented-0.029
 noted-0.026
 admitted-0.026
 saying-0.026
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 said+0.163
 commented+0.156
 comments+0.144
 saying+0.144
 remark+0.142
Neg logit contributions
-0.184
-0.145
��-0.130
▬▬-0.123
-0.122
Token Though
Token ID7486
Feature activation-1.91e+00

Pos logit contributions
 said+0.142
 commented+0.135
 saying+0.127
 noted+0.125
 remark+0.125
Neg logit contributions
-0.133
-0.103
��-0.089
Temperature-0.087
▬▬-0.086
Token her
Token ID607
Feature activation+1.82e+00

Pos logit contributions
+0.286
+0.234
��+0.213
Temperature+0.207
▬▬+0.205
Neg logit contributions
 said-0.169
 commented-0.159
 saying-0.143
 comments-0.140
 noted-0.139
Token comments
Token ID3651
Feature activation+7.29e+00

Pos logit contributions
 said+0.178
 commented+0.169
 saying+0.152
 noted+0.151
 admitted+0.148
Neg logit contributions
-0.258
-0.208
��-0.189
Temperature-0.184
▬▬-0.182
Token about
Token ID546
Feature activation+1.97e+00

Pos logit contributions
 said+0.772
 commented+0.726
 saying+0.676
 comments+0.661
 noted+0.654
Neg logit contributions
-0.978
-0.788
��-0.711
▬▬-0.672
Temperature-0.671
Token him
Token ID683
Feature activation+3.27e+00

Pos logit contributions
 said+0.201
 commented+0.191
 saying+0.172
 noted+0.171
 admitted+0.169
Neg logit contributions
-0.271
-0.218
��-0.199
Temperature-0.191
-0.190
Token have
Token ID423
Feature activation+2.40e-01

Pos logit contributions
 said+0.377
 commented+0.357
 saying+0.334
 comments+0.328
 noted+0.327
Neg logit contributions
-0.405
-0.319
��-0.286
▬▬-0.272
Temperature-0.272
Token been
Token ID587
Feature activation+4.60e-01

Pos logit contributions
 said+0.024
 commented+0.022
 saying+0.021
 remark+0.020
 comments+0.020
Neg logit contributions
-0.034
-0.027
��-0.024
Temperature-0.024
▬▬-0.023
Token hed
Token ID16418
Feature activation+5.72e-01

Pos logit contributions
 said+0.042
 commented+0.039
 saying+0.036
 comments+0.035
 remark+0.035
Neg logit contributions
-0.067
-0.054
��-0.050
の�-0.048
▬▬-0.048
Token process
Token ID1429
Feature activation+4.60e-01

Pos logit contributions
 said+0.471
 commented+0.448
 admitted+0.402
 noted+0.402
 remark+0.402
Neg logit contributions
-0.628
-0.506
��-0.448
Temperature-0.436
-0.432
Token,
Token ID11
Feature activation-3.75e-01

Pos logit contributions
 said+0.048
 commented+0.045
 saying+0.041
 remark+0.041
 noted+0.041
Neg logit contributions
-0.062
-0.049
��-0.044
Temperature-0.043
��-0.043
Token so
Token ID523
Feature activation+2.44e+00

Pos logit contributions
+0.057
+0.047
��+0.043
Temperature+0.042
▬▬+0.041
Neg logit contributions
 said-0.032
 commented-0.030
 noted-0.027
 saying-0.027
 remark-0.026
Token I
Token ID314
Feature activation+1.06e+00

Pos logit contributions
 said+0.251
 commented+0.235
 saying+0.219
 noted+0.216
 remark+0.215
Neg logit contributions
-0.328
-0.259
��-0.236
▬▬-0.227
Temperature-0.227
Token cant
Token ID18548
Feature activation-2.53e+00

Pos logit contributions
 said+0.111
 commented+0.104
 saying+0.100
 comments+0.097
 remark+0.096
Neg logit contributions
-0.139
-0.110
��-0.099
▬▬-0.095
Temperature-0.095
Token comment
Token ID2912
Feature activation+7.29e+00

Pos logit contributions
+0.400
+0.330
��+0.303
Temperature+0.296
+0.292
Neg logit contributions
 said-0.198
 commented-0.180
 noted-0.163
 saying-0.161
 admitted-0.159
Token much
Token ID881
Feature activation+1.23e+00

Pos logit contributions
 said+0.687
 commented+0.639
 noted+0.583
 saying+0.578
 remark+0.576
Neg logit contributions
-1.062
-0.860
��-0.775
��-0.753
-0.751
Token on
Token ID319
Feature activation+1.05e+00

Pos logit contributions
 said+0.118
 commented+0.109
 saying+0.100
 noted+0.100
 remark+0.099
Neg logit contributions
-0.177
-0.141
��-0.129
Temperature-0.124
-0.124
Token the
Token ID262
Feature activation-7.67e-01

Pos logit contributions
 said+0.106
 commented+0.101
 saying+0.092
 noted+0.092
 admitted+0.092
Neg logit contributions
-0.143
-0.114
��-0.103
Temperature-0.100
▬▬-0.098
Token topic
Token ID7243
Feature activation+3.56e+00

Pos logit contributions
+0.111
+0.091
��+0.083
Temperature+0.080
▬▬+0.079
Neg logit contributions
 said-0.071
 commented-0.067
 saying-0.060
 noted-0.060
 admitted-0.059
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
 said+0.360
 commented+0.339
 saying+0.311
 remark+0.307
 noted+0.307
Neg logit contributions
-0.494
-0.392
��-0.356
Temperature-0.345
��-0.341
Tokenopus
Token ID25790
Feature activation+1.10e+00

Pos logit contributions
+0.241
+0.184
��+0.166
Temperature+0.158
��+0.155
Neg logit contributions
 said-0.244
 commented-0.232
 saying-0.221
 remark-0.216
 comments-0.216
Token."
Token ID526
Feature activation+1.03e+00

Pos logit contributions
 said+0.121
 commented+0.115
 saying+0.107
 comments+0.106
 remark+0.105
Neg logit contributions
-0.141
-0.110
��-0.099
Temperature-0.098
の�-0.095
Token
Token ID198
Feature activation+3.53e-01

Pos logit contributions
 said+0.131
 commented+0.125
 saying+0.118
 noted+0.117
 remark+0.117
Neg logit contributions
-0.114
-0.086
Temperature-0.074
��-0.073
▬▬-0.072
Token
Token ID198
Feature activation+3.31e-01

Pos logit contributions
 said+0.043
 commented+0.041
 saying+0.038
 noted+0.038
 comments+0.038
Neg logit contributions
-0.043
-0.033
��-0.030
Temperature-0.029
▬▬-0.028
TokenHis
Token ID6653
Feature activation+2.29e-01

Pos logit contributions
 said+0.050
 commented+0.048
 saying+0.046
 noted+0.045
 comments+0.045
Neg logit contributions
-0.030
-0.021
Temperature-0.018
��-0.018
▬▬-0.017
Token comments
Token ID3651
Feature activation+7.28e+00

Pos logit contributions
 said+0.017
 commented+0.016
 noted+0.014
 saying+0.014
 admitted+0.013
Neg logit contributions
-0.038
-0.031
��-0.029
Temperature-0.028
▬▬-0.028
Token caused
Token ID4073
Feature activation+2.87e-01

Pos logit contributions
 said+0.763
 commented+0.721
 saying+0.671
 comments+0.659
 noted+0.653
Neg logit contributions
-0.963
-0.771
��-0.688
の�-0.658
▬▬-0.653
Token confusion
Token ID10802
Feature activation+1.15e+00

Pos logit contributions
 said+0.024
 commented+0.023
 saying+0.020
 noted+0.020
 remark+0.020
Neg logit contributions
-0.044
-0.036
��-0.033
▬▬-0.032
の�-0.032
Token among
Token ID1871
Feature activation+2.04e-01

Pos logit contributions
 said+0.106
 commented+0.100
 admitted+0.090
 saying+0.089
 noted+0.089
Neg logit contributions
-0.165
-0.135
��-0.120
の�-0.118
Temperature-0.117
Token fans
Token ID3296
Feature activation+4.47e-01

Pos logit contributions
 said+0.021
 commented+0.020
 saying+0.018
 noted+0.018
 remark+0.018
Neg logit contributions
-0.027
-0.022
��-0.020
Temperature-0.019
▬▬-0.019
Token,
Token ID11
Feature activation-6.22e-01

Pos logit contributions
 said+0.043
 commented+0.041
 noted+0.037
 admitted+0.037
 comments+0.037
Neg logit contributions
-0.062
-0.050
��-0.045
▬▬-0.044
の�-0.044
Tokenent
Token ID298
Feature activation+2.31e-01

Pos logit contributions
+0.251
+0.196
��+0.177
Temperature+0.172
+0.168
Neg logit contributions
 said-0.219
 commented-0.208
 saying-0.196
 noted-0.191
 comments-0.191
Token
Token ID447
Feature activation-9.39e-02

Pos logit contributions
 said+0.018
 commented+0.017
 saying+0.016
 remark+0.016
 comments+0.015
Neg logit contributions
-0.036
-0.029
��-0.027
��-0.026
-0.026
Token
Token ID247
Feature activation+1.31e-01

Pos logit contributions
+0.007
+0.004
��+0.003
��+0.003
+0.003
Neg logit contributions
 said-0.016
 commented-0.016
 saying-0.015
 noted-0.015
 comments-0.015
Tokens
Token ID82
Feature activation+2.96e+00

Pos logit contributions
 said+0.008
 commented+0.008
 saying+0.007
 remark+0.007
 commenting+0.007
Neg logit contributions
-0.022
-0.019
��-0.017
の�-0.017
-0.017
Token past
Token ID1613
Feature activation+5.74e-01

Pos logit contributions
 said+0.199
 commented+0.187
 saying+0.160
 noted+0.159
 admitted+0.153
Neg logit contributions
-0.496
-0.416
��-0.385
▬▬-0.378
��-0.377
Token comments
Token ID3651
Feature activation+7.27e+00

Pos logit contributions
 said+0.027
 commented+0.024
 noted+0.019
 saying+0.019
 admitted+0.018
Neg logit contributions
-0.110
-0.093
��-0.087
▬▬-0.086
-0.085
Token have
Token ID423
Feature activation+2.64e-01

Pos logit contributions
 said+0.733
 commented+0.699
 saying+0.640
 comments+0.632
 admitted+0.626
Neg logit contributions
-0.984
-0.794
��-0.707
▬▬-0.678
の�-0.676
Token been
Token ID587
Feature activation+5.54e-01

Pos logit contributions
 said+0.024
 commented+0.022
 saying+0.021
 remark+0.020
 comments+0.020
Neg logit contributions
-0.039
-0.031
��-0.028
の�-0.027
��-0.027
Token referenced
Token ID20717
Feature activation+3.92e+00

Pos logit contributions
 said+0.042
 commented+0.040
 saying+0.036
 remark+0.035
 comments+0.035
Neg logit contributions
-0.088
-0.073
��-0.067
の�-0.065
��-0.065
Token as
Token ID355
Feature activation+3.20e+00

Pos logit contributions
 said+0.365
 commented+0.346
 admitted+0.310
 saying+0.310
 noted+0.306
Neg logit contributions
-0.569
-0.464
��-0.417
の�-0.407
-0.403
Token an
Token ID281
Feature activation+2.09e+00

Pos logit contributions
 said+0.260
 commented+0.246
 admitted+0.217
 noted+0.213
sen+0.206
Neg logit contributions
-0.501
-0.416
��-0.388
��-0.365
▬▬-0.365
Token campaign
Token ID1923
Feature activation-8.31e-02

Pos logit contributions
+0.134
+0.108
��+0.097
��+0.094
Temperature+0.093
Neg logit contributions
 said-0.101
 commented-0.097
 saying-0.087
 noted-0.087
 admitted-0.086
Token),
Token ID828
Feature activation-2.45e+00

Pos logit contributions
+0.008
+0.006
��+0.005
��+0.005
Temperature+0.004
Neg logit contributions
 said-0.012
 commented-0.012
 saying-0.011
 comments-0.011
 noted-0.011
Token called
Token ID1444
Feature activation+3.94e+00

Pos logit contributions
+0.371
+0.305
��+0.274
+0.270
の�+0.269
Neg logit contributions
 said-0.186
 commented-0.177
 saying-0.167
 remark-0.166
say-0.160
Token Milo
Token ID32662
Feature activation+1.16e+00

Pos logit contributions
 said+0.394
 commented+0.378
 saying+0.341
 noted+0.338
 remark+0.336
Neg logit contributions
-0.543
-0.430
��-0.387
Temperature-0.375
-0.371
Token's
Token ID338
Feature activation-6.32e-01

Pos logit contributions
 said+0.115
 commented+0.110
 noted+0.098
 saying+0.097
 remark+0.096
Neg logit contributions
-0.163
-0.127
��-0.114
��-0.111
▬▬-0.111
Token comments
Token ID3651
Feature activation+7.26e+00

Pos logit contributions
+0.093
+0.076
��+0.069
Temperature+0.067
▬▬+0.067
Neg logit contributions
 said-0.055
 commented-0.053
 saying-0.047
 noted-0.046
 admitted-0.045
Token in
Token ID287
Feature activation+1.40e+00

Pos logit contributions
 said+0.698
 commented+0.663
 noted+0.591
 saying+0.588
say+0.584
Neg logit contributions
-1.028
-0.829
��-0.758
��-0.714
▬▬-0.709
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 said+0.143
 commented+0.138
 noted+0.123
 saying+0.122
 admitted+0.121
Neg logit contributions
-0.189
-0.151
��-0.136
Temperature-0.131
▬▬-0.129
Token video
Token ID2008
Feature activation+1.33e+00

Pos logit contributions
+0.191
+0.155
��+0.141
��+0.138
Temperature+0.138
Neg logit contributions
 said-0.109
 commented-0.103
 saying-0.091
 noted-0.090
 admitted-0.088
Token "
Token ID366
Feature activation+1.25e+00

Pos logit contributions
 said+0.122
 commented+0.116
 noted+0.103
 saying+0.102
say+0.101
Neg logit contributions
-0.194
-0.156
��-0.140
Temperature-0.136
の�-0.136
Tokenvery
Token ID548
Feature activation-3.24e-01

Pos logit contributions
 said+0.140
 commented+0.135
 saying+0.124
 noted+0.124
 remark+0.123
Neg logit contributions
-0.163
-0.127
��-0.116
Temperature-0.112
These-0.109
Token Flake
Token ID39727
Feature activation+3.63e-01

Pos logit contributions
+0.326
+0.259
��+0.229
||||+0.225
Temperature+0.225
Neg logit contributions
 said-0.225
 commented-0.218
 saying-0.200
 remark-0.196
 comments-0.196
Token tweeted
Token ID10830
Feature activation+2.10e+00

Pos logit contributions
 said+0.028
 commented+0.027
 saying+0.024
 remark+0.024
 comments+0.024
Neg logit contributions
-0.057
-0.046
��-0.042
Temperature-0.041
��-0.041
Token Saturday
Token ID3909
Feature activation-2.49e+00

Pos logit contributions
 said+0.217
 commented+0.209
 noted+0.187
 admitted+0.187
 comments+0.186
Neg logit contributions
-0.283
-0.225
��-0.200
Temperature-0.195
||||-0.195
Token after
Token ID706
Feature activation+2.46e+00

Pos logit contributions
+0.316
+0.246
��+0.218
Temperature+0.214
||||+0.212
Neg logit contributions
 said-0.274
 commented-0.264
 noted-0.240
 admitted-0.239
 comments-0.236
Token his
Token ID465
Feature activation+1.83e+00

Pos logit contributions
 said+0.233
 commented+0.224
 admitted+0.197
 noted+0.197
 saying+0.194
Neg logit contributions
-0.354
-0.286
��-0.256
||||-0.250
Temperature-0.250
Token comments
Token ID3651
Feature activation+7.26e+00

Pos logit contributions
 said+0.118
 commented+0.110
 saying+0.092
 noted+0.091
 admitted+0.090
Neg logit contributions
-0.316
-0.264
��-0.244
Temperature-0.241
��-0.241
Token began
Token ID2540
Feature activation-8.30e-01

Pos logit contributions
 said+0.744
 commented+0.708
 saying+0.642
 comments+0.642
 noted+0.640
Neg logit contributions
-0.984
-0.785
��-0.703
��-0.667
の�-0.665
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
+0.109
+0.086
��+0.076
||||+0.073
��+0.073
Neg logit contributions
 said-0.091
 commented-0.086
 admitted-0.079
 noted-0.078
 saying-0.078
Token attract
Token ID4729
Feature activation-9.50e-01

Pos logit contributions
+0.123
+0.095
��+0.083
Temperature+0.080
��+0.078
Neg logit contributions
 said-0.134
 commented-0.127
 saying-0.120
 noted-0.117
 comments-0.117
Token media
Token ID2056
Feature activation+2.05e+00

Pos logit contributions
+0.139
+0.113
��+0.103
Temperature+0.101
||||+0.101
Neg logit contributions
 said-0.085
 commented-0.079
 saying-0.073
 admitted-0.072
 noted-0.070
Token attention
Token ID3241
Feature activation+9.63e-01

Pos logit contributions
 said+0.182
 commented+0.168
 admitted+0.153
 saying+0.151
 noted+0.150
Neg logit contributions
-0.302
-0.248
��-0.225
Temperature-0.223
||||-0.221
Token Syrian
Token ID6318
Feature activation-7.03e-01

Pos logit contributions
 said+0.344
 commented+0.327
 comments+0.299
 saying+0.297
 noted+0.296
Neg logit contributions
-0.429
-0.337
��-0.309
▬▬-0.295
Temperature-0.293
Token Benghazi
Token ID22497
Feature activation-1.59e+00

Pos logit contributions
+0.104
+0.084
��+0.078
Temperature+0.075
▬▬+0.074
Neg logit contributions
 said-0.063
 commented-0.060
 comments-0.053
 noted-0.053
 remark-0.053
Token,
Token ID11
Feature activation-1.49e-01

Pos logit contributions
+0.216
+0.172
��+0.157
Temperature+0.151
��+0.151
Neg logit contributions
 said-0.162
 commented-0.155
 comments-0.141
 saying-0.140
 noted-0.139
Token
Token ID447
Feature activation+4.04e-01

Pos logit contributions
+0.021
+0.017
��+0.016
Temperature+0.015
の�+0.015
Neg logit contributions
 said-0.013
 commented-0.013
 comments-0.012
 noted-0.012
 remark-0.011
Token
Token ID251
Feature activation-1.90e+00

Pos logit contributions
 said+0.091
 commented+0.089
 saying+0.086
 comments+0.086
 remark+0.085
Neg logit contributions
-0.009
+0.003
��+0.008
+0.009
��+0.010
Token said
Token ID531
Feature activation+7.26e+00

Pos logit contributions
+0.270
+0.217
��+0.196
Temperature+0.190
These+0.188
Neg logit contributions
 said-0.180
 commented-0.172
 remark-0.157
 comments-0.156
 saying-0.156
Token Aus
Token ID27545
Feature activation+2.45e+00

Pos logit contributions
 said+0.638
 commented+0.611
 saying+0.551
 remark+0.542
 comments+0.535
Neg logit contributions
-1.092
-0.890
��-0.800
Temperature-0.781
の�-0.775
Tokenama
Token ID1689
Feature activation+5.34e+00

Pos logit contributions
 said+0.237
 commented+0.229
 saying+0.210
 remark+0.209
 comments+0.205
Neg logit contributions
-0.350
-0.281
��-0.252
Temperature-0.247
-0.242
Token Mon
Token ID2892
Feature activation+1.87e+00

Pos logit contributions
 said+0.514
 commented+0.494
 remark+0.451
 saying+0.449
 comments+0.445
Neg logit contributions
-0.757
-0.607
��-0.549
-0.526
Temperature-0.524
Tokenaj
Token ID1228
Feature activation+4.05e+00

Pos logit contributions
 said+0.180
 commented+0.171
 saying+0.162
 remark+0.161
 comments+0.158
Neg logit contributions
-0.265
-0.216
��-0.194
-0.188
��-0.187
Tokened
Token ID276
Feature activation+2.53e+00

Pos logit contributions
 said+0.448
 commented+0.431
 saying+0.402
 remark+0.399
 comments+0.395
Neg logit contributions
-0.513
-0.400
��-0.356
Temperature-0.342
||||-0.340
Tokenwashed
Token ID45462
Feature activation-1.36e+00

Pos logit contributions
+0.026
+0.019
��+0.017
Temperature+0.016
��+0.016
Neg logit contributions
 said-0.035
 commented-0.033
 saying-0.032
 remark-0.032
 comments-0.032
Token,
Token ID11
Feature activation-1.45e-01

Pos logit contributions
+0.168
+0.132
��+0.117
��+0.113
Temperature+0.111
Neg logit contributions
 said-0.158
 commented-0.151
 noted-0.139
 saying-0.138
 remark-0.138
Token
Token ID447
Feature activation+3.98e-01

Pos logit contributions
+0.019
+0.015
��+0.014
▬▬+0.013
||||+0.013
Neg logit contributions
 said-0.015
 commented-0.014
 noted-0.013
 remark-0.013
 comments-0.013
Token
Token ID251
Feature activation-1.85e+00

Pos logit contributions
 said+0.067
 commented+0.065
 saying+0.062
 noted+0.062
 remark+0.061
Neg logit contributions
-0.032
-0.020
��-0.016
-0.014
��-0.014
Token Schwartz
Token ID28672
Feature activation+3.69e+00

Pos logit contributions
+0.256
+0.206
��+0.187
Temperature+0.182
▬▬+0.179
Neg logit contributions
 said-0.185
 commented-0.176
 noted-0.161
 remark-0.160
 saying-0.160
Token said
Token ID531
Feature activation+7.26e+00

Pos logit contributions
 said+0.246
 commented+0.230
 saying+0.209
 remark+0.204
 comment+0.198
Neg logit contributions
-0.620
-0.519
��-0.480
Temperature-0.468
-0.467
Token,
Token ID11
Feature activation-1.55e-01

Pos logit contributions
 said+0.727
 commented+0.698
 saying+0.624
 noted+0.622
 admitted+0.620
Neg logit contributions
-1.009
-0.804
��-0.728
Temperature-0.712
の�-0.700
Token referring
Token ID9759
Feature activation+5.36e+00

Pos logit contributions
+0.023
+0.018
��+0.017
Temperature+0.016
||||+0.016
Neg logit contributions
 said-0.014
 commented-0.013
 admitted-0.012
 noted-0.012
 comments-0.012
Token to
Token ID284
Feature activation-5.51e-01

Pos logit contributions
 said+0.646
 commented+0.615
 saying+0.569
 noted+0.562
 admitted+0.560
Neg logit contributions
-0.655
-0.509
��-0.454
��-0.423
Temperature-0.421
Token the
Token ID262
Feature activation-7.58e-01

Pos logit contributions
+0.082
+0.068
��+0.062
+0.059
��+0.059
Neg logit contributions
 said-0.048
 commented-0.046
 saying-0.041
 admitted-0.040
 noted-0.040
Token White
Token ID2635
Feature activation-3.91e-01

Pos logit contributions
+0.111
+0.091
��+0.084
��+0.081
Temperature+0.080
Neg logit contributions
 said-0.068
 commented-0.065
 saying-0.057
 noted-0.056
 admitted-0.056
Token
Token ID247
Feature activation+6.33e-01

Pos logit contributions
 said+0.083
 commented+0.081
 saying+0.078
 noted+0.077
 comments+0.077
Neg logit contributions
-0.012
-0.002
��+0.002
��+0.004
+0.004
Tokent
Token ID83
Feature activation+9.64e-01

Pos logit contributions
 said+0.064
 commented+0.061
 saying+0.057
 comments+0.056
 noted+0.056
Neg logit contributions
-0.089
-0.071
��-0.063
Temperature-0.063
-0.061
Token remember
Token ID3505
Feature activation+3.03e+00

Pos logit contributions
 said+0.081
 commented+0.076
 saying+0.068
 noted+0.067
 comments+0.067
Neg logit contributions
-0.147
-0.122
��-0.110
Temperature-0.109
の�-0.108
Token what
Token ID644
Feature activation+1.78e+00

Pos logit contributions
 said+0.284
 commented+0.270
 noted+0.241
 admitted+0.240
 comments+0.237
Neg logit contributions
-0.437
-0.357
��-0.317
Temperature-0.311
の�-0.308
Token they
Token ID484
Feature activation+1.42e+00

Pos logit contributions
 said+0.164
 commented+0.158
 saying+0.144
 noted+0.141
 admitted+0.140
Neg logit contributions
-0.250
-0.204
Temperature-0.183
��-0.178
▬▬-0.177
Token said
Token ID531
Feature activation+7.26e+00

Pos logit contributions
 said+0.050
 commented+0.047
 comments+0.041
 saying+0.039
 commenting+0.037
Neg logit contributions
-0.274
-0.236
��-0.218
の�-0.214
-0.213
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
 said+0.741
 commented+0.707
 saying+0.643
 noted+0.641
 comments+0.641
Neg logit contributions
-0.981
-0.788
��-0.696
の�-0.680
Temperature-0.677
Token (
Token ID357
Feature activation+2.58e-01

Pos logit contributions
 said+0.196
 commented+0.189
 saying+0.178
 remark+0.178
 noted+0.175
Neg logit contributions
-0.182
-0.141
Temperature-0.119
-0.118
の�-0.117
TokenJones
Token ID25784
Feature activation+4.77e+00

Pos logit contributions
 said+0.034
 commented+0.033
 saying+0.031
 noted+0.030
 comments+0.030
Neg logit contributions
-0.028
-0.021
Temperature-0.018
��-0.018
These-0.017
Token was
Token ID373
Feature activation+5.29e-01

Pos logit contributions
 said+0.429
 commented+0.403
 saying+0.378
 comments+0.365
 remark+0.364
Neg logit contributions
-0.696
-0.566
��-0.507
Temperature-0.500
の�-0.496
Token not
Token ID407
Feature activation+2.21e+00

Pos logit contributions
 said+0.044
 commented+0.042
 comments+0.038
 saying+0.037
 remark+0.036
Neg logit contributions
-0.082
-0.067
��-0.060
の�-0.059
��-0.059
Token someone
Token ID2130
Feature activation+3.81e+00

Pos logit contributions
 said+0.075
 commented+0.070
 saying+0.064
 admitted+0.063
 noted+0.063
Neg logit contributions
-0.128
-0.103
��-0.093
 THESE-0.091
Temperature-0.091
Token a
Token ID257
Feature activation+2.70e+00

Pos logit contributions
 said+0.346
 commented+0.319
 noted+0.288
 saying+0.287
 remark+0.286
Neg logit contributions
-0.564
-0.450
��-0.404
▬▬-0.401
Temperature-0.401
Token link
Token ID2792
Feature activation+1.31e+00

Pos logit contributions
 said+0.231
 commented+0.210
 saying+0.191
 admitted+0.190
 noted+0.188
Neg logit contributions
-0.417
-0.339
��-0.311
▬▬-0.303
Temperature-0.302
Token with
Token ID351
Feature activation+1.58e-01

Pos logit contributions
 said+0.139
 commented+0.132
 noted+0.120
 admitted+0.119
 comments+0.118
Neg logit contributions
-0.172
-0.135
��-0.121
の�-0.115
▬▬-0.115
Token nasty
Token ID17166
Feature activation-1.44e+00

Pos logit contributions
 said+0.014
 commented+0.013
 saying+0.012
 noted+0.012
 remark+0.012
Neg logit contributions
-0.023
-0.019
��-0.017
Temperature-0.017
��-0.017
Token comments
Token ID3651
Feature activation+7.25e+00

Pos logit contributions
+0.226
+0.184
��+0.172
Temperature+0.166
��+0.165
Neg logit contributions
 said-0.115
 commented-0.106
 saying-0.096
 noted-0.094
 admitted-0.093
Token from
Token ID422
Feature activation+7.56e-01

Pos logit contributions
 said+0.675
 commented+0.634
 noted+0.567
 admitted+0.560
 remark+0.559
Neg logit contributions
-1.052
-0.854
��-0.777
の�-0.738
These-0.738
Token people
Token ID661
Feature activation+2.46e+00

Pos logit contributions
 said+0.066
 commented+0.063
 saying+0.055
 remark+0.054
 admitted+0.054
Neg logit contributions
-0.114
-0.092
��-0.086
Temperature-0.082
��-0.082
Token with
Token ID351
Feature activation+1.82e-01

Pos logit contributions
 said+0.148
 commented+0.134
 comments+0.112
 remark+0.111
 noted+0.111
Neg logit contributions
-0.434
-0.363
��-0.336
の�-0.326
Temperature-0.326
Token bad
Token ID2089
Feature activation-6.65e-01

Pos logit contributions
 said+0.017
 commented+0.016
 saying+0.015
 remark+0.014
 noted+0.014
Neg logit contributions
-0.026
-0.021
��-0.019
Temperature-0.019
��-0.018
Token manners
Token ID33558
Feature activation+6.38e-01

Pos logit contributions
+0.100
+0.082
��+0.075
の�+0.072
��+0.072
Neg logit contributions
 said-0.059
 commented-0.055
 saying-0.050
 noted-0.049
 remark-0.048
Token 2006
Token ID4793
Feature activation+1.10e+00

Pos logit contributions
+0.097
+0.078
��+0.071
▬▬+0.069
Temperature+0.068
Neg logit contributions
 said-0.073
 commented-0.071
 saying-0.063
 noted-0.063
 remark-0.062
Token,
Token ID11
Feature activation-1.55e-01

Pos logit contributions
 said+0.139
 commented+0.134
 saying+0.125
 comments+0.123
 remark+0.123
Neg logit contributions
-0.126
-0.094
��-0.085
-0.080
Temperature-0.080
Token CEO
Token ID6123
Feature activation+3.75e+00

Pos logit contributions
+0.022
+0.018
��+0.016
Temperature+0.015
+0.015
Neg logit contributions
 said-0.015
 commented-0.014
 saying-0.013
 remark-0.013
 comments-0.013
Token Henry
Token ID8616
Feature activation+2.98e+00

Pos logit contributions
 said+0.277
 commented+0.259
 saying+0.234
 remark+0.229
 noted+0.222
Neg logit contributions
-0.617
-0.511
��-0.471
Temperature-0.457
��-0.454
Token Paul
Token ID3362
Feature activation+6.33e+00

Pos logit contributions
 said+0.255
 commented+0.244
 remark+0.227
 saying+0.226
 commenting+0.218
Neg logit contributions
-0.448
-0.364
��-0.328
-0.320
��-0.317
Tokenson
Token ID1559
Feature activation+7.25e+00

Pos logit contributions
 said+0.283
 commented+0.253
 saying+0.226
 remark+0.219
 comment+0.214
Neg logit contributions
-1.212
-1.034
��-0.968
の�-0.929
-0.924
Token left
Token ID1364
Feature activation-3.55e-02

Pos logit contributions
 said+0.585
 commented+0.548
 saying+0.518
 comments+0.505
 remark+0.500
Neg logit contributions
-1.127
-0.924
��-0.845
��-0.815
-0.811
Token the
Token ID262
Feature activation-7.87e-01

Pos logit contributions
+0.005
+0.004
��+0.004
▬▬+0.003
+0.003
Neg logit contributions
 said-0.003
 commented-0.003
 saying-0.003
 noted-0.003
 admitted-0.003
Token firm
Token ID4081
Feature activation+2.31e+00

Pos logit contributions
+0.119
+0.098
��+0.091
▬▬+0.087
��+0.087
Neg logit contributions
 said-0.068
 commented-0.064
 saying-0.056
 noted-0.055
 admitted-0.055
Token to
Token ID284
Feature activation-6.09e-01

Pos logit contributions
 said+0.250
 commented+0.240
 comments+0.221
 remark+0.218
 comment+0.216
Neg logit contributions
-0.301
-0.232
��-0.205
の�-0.199
▬▬-0.199
Token join
Token ID4654
Feature activation-5.32e-01

Pos logit contributions
+0.073
+0.055
��+0.049
Temperature+0.046
��+0.046
Neg logit contributions
 said-0.073
 commented-0.070
 saying-0.066
 comments-0.064
 noted-0.064
TokenThe
Token ID464
Feature activation-3.56e+00

Pos logit contributions
 said+0.206
 commented+0.198
 saying+0.187
 noted+0.186
 comments+0.185
Neg logit contributions
-0.135
-0.097
��-0.084
Temperature-0.083
▬▬-0.078
Token Fox
Token ID5426
Feature activation+1.32e+00

Pos logit contributions
+0.437
+0.342
��+0.307
Temperature+0.298
||||+0.288
Neg logit contributions
 said-0.414
 commented-0.398
 saying-0.366
 noted-0.363
 remark-0.363
Token",
Token ID1600
Feature activation-1.79e+00

Pos logit contributions
 said+0.143
 commented+0.136
 saying+0.126
 remark+0.126
 comments+0.125
Neg logit contributions
-0.172
-0.137
��-0.124
Temperature-0.119
��-0.117
Token that
Token ID326
Feature activation+1.95e+00

Pos logit contributions
+0.275
+0.227
��+0.208
��+0.200
▬▬+0.199
Neg logit contributions
 said-0.143
 commented-0.135
 comments-0.120
 noted-0.118
 remark-0.117
Token Wes
Token ID18622
Feature activation+2.92e+00

Pos logit contributions
 said+0.172
 commented+0.161
 saying+0.152
 comments+0.146
 remark+0.146
Neg logit contributions
-0.291
-0.236
��-0.217
Temperature-0.209
-0.208
Token Anderson
Token ID9918
Feature activation+7.24e+00

Pos logit contributions
 said+0.261
 commented+0.244
 saying+0.228
 remark+0.224
 comments+0.221
Neg logit contributions
-0.434
-0.356
��-0.315
▬▬-0.310
-0.309
Token would
Token ID561
Feature activation-6.76e-01

Pos logit contributions
 said+0.603
 commented+0.558
 saying+0.527
 comments+0.513
 remark+0.510
Neg logit contributions
-1.120
-0.925
��-0.847
��-0.816
Temperature-0.811
Token adapt
Token ID6068
Feature activation+4.44e-01

Pos logit contributions
+0.094
+0.077
��+0.069
Temperature+0.066
��+0.065
Neg logit contributions
 said-0.067
 commented-0.063
 saying-0.058
 comments-0.057
 noted-0.057
Token into
Token ID656
Feature activation-5.18e-01

Pos logit contributions
 said+0.045
 commented+0.043
 comments+0.039
 noted+0.038
 remark+0.038
Neg logit contributions
-0.061
-0.049
��-0.043
Temperature-0.042
��-0.042
Token a
Token ID257
Feature activation+3.10e+00

Pos logit contributions
+0.085
+0.070
��+0.066
��+0.063
Temperature+0.062
Neg logit contributions
 said-0.039
 commented-0.037
 saying-0.032
 remark-0.032
 comments-0.032
Token stop
Token ID2245
Feature activation-2.05e+00

Pos logit contributions
 said+0.331
 commented+0.314
 saying+0.288
 comments+0.287
 remark+0.282
Neg logit contributions
-0.415
-0.328
��-0.300
��-0.286
Temperature-0.284
Token
Token ID447
Feature activation+2.76e-01

Pos logit contributions
 said+0.059
 commented+0.057
 saying+0.053
 noted+0.052
 remark+0.052
Neg logit contributions
-0.058
-0.044
��-0.039
Temperature-0.038
▬▬-0.038
Token
Token ID251
Feature activation-1.93e+00

Pos logit contributions
 said+0.056
 commented+0.055
 saying+0.053
 noted+0.052
 comments+0.052
Neg logit contributions
-0.013
-0.005
��-0.001
-0.000
-0.000
Token Mc
Token ID1982
Feature activation+2.79e+00

Pos logit contributions
+0.225
+0.171
��+0.150
Temperature+0.148
▬▬+0.144
Neg logit contributions
 said-0.236
 commented-0.227
 saying-0.212
 remark-0.211
 noted-0.211
TokenAll
Token ID3237
Feature activation-1.99e+00

Pos logit contributions
 said+0.346
 commented+0.331
 saying+0.311
 comments+0.307
 noted+0.307
Neg logit contributions
-0.327
-0.249
Temperature-0.216
��-0.216
▬▬-0.207
Tokenister
Token ID1694
Feature activation+5.34e+00

Pos logit contributions
+0.305
+0.250
��+0.231
��+0.222
Temperature+0.221
Neg logit contributions
 said-0.168
 commented-0.160
 saying-0.147
 remark-0.144
 noted-0.144
Token said
Token ID531
Feature activation+7.24e+00

Pos logit contributions
 said+0.447
 commented+0.420
 saying+0.391
 remark+0.380
 comments+0.378
Neg logit contributions
-0.822
-0.670
��-0.609
Temperature-0.601
▬▬-0.594
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
 said+0.739
 commented+0.708
 saying+0.636
 noted+0.636
 comments+0.627
Neg logit contributions
-0.999
-0.792
��-0.708
Temperature-0.702
▬▬-0.686
Token
Token ID564
Feature activation+7.28e-01

Pos logit contributions
 said+0.188
 commented+0.181
 saying+0.169
 remark+0.168
 noted+0.168
Neg logit contributions
-0.194
-0.151
��-0.130
Temperature-0.129
-0.128
Token
Token ID250
Feature activation-3.85e+00

Pos logit contributions
 said+0.135
 commented+0.130
 saying+0.125
 noted+0.124
 comments+0.124
Neg logit contributions
-0.046
-0.026
��-0.018
-0.014
-0.014
TokenHe
Token ID1544
Feature activation+1.60e+00

Pos logit contributions
+0.397
+0.289
Temperature+0.250
��+0.246
▬▬+0.239
Neg logit contributions
 said-0.532
 commented-0.513
 saying-0.483
 noted-0.482
 remark-0.481
Token told
Token ID1297
Feature activation+5.92e+00

Pos logit contributions
 said+0.104
 commented+0.097
 saying+0.089
 remark+0.087
 comments+0.086
Neg logit contributions
-0.272
-0.227
��-0.209
Temperature-0.203
▬▬-0.203
Token all
Token ID477
Feature activation-5.26e-01

Pos logit contributions
 said+0.174
 commented+0.168
 comments+0.152
 saying+0.152
 noted+0.151
Neg logit contributions
-0.223
-0.178
��-0.161
Temperature-0.156
▬▬-0.155
Token the
Token ID262
Feature activation-1.58e+00

Pos logit contributions
+0.076
+0.062
��+0.057
Temperature+0.055
▬▬+0.054
Neg logit contributions
 said-0.050
 commented-0.048
 saying-0.043
 comments-0.043
 noted-0.043
Token spoil
Token ID20851
Feature activation-4.36e+00

Pos logit contributions
+0.200
+0.159
��+0.142
Temperature+0.137
▬▬+0.136
Neg logit contributions
 said-0.174
 commented-0.169
 saying-0.153
 comments-0.153
 remark-0.153
Token thereof
Token ID15370
Feature activation-5.38e+00

Pos logit contributions
+0.487
+0.369
��+0.326
Temperature+0.307
��+0.302
Neg logit contributions
 said-0.556
 commented-0.537
 saying-0.499
 comments-0.498
 remark-0.492
Token,
Token ID11
Feature activation-9.92e-01

Pos logit contributions
+0.602
+0.456
��+0.406
Temperature+0.385
▬▬+0.382
Neg logit contributions
 said-0.679
 commented-0.655
 comments-0.609
 saying-0.606
 noted-0.601
Token shalt
Token ID36258
Feature activation-8.62e+00

Pos logit contributions
+0.117
+0.091
��+0.081
Temperature+0.077
▬▬+0.077
Neg logit contributions
 said-0.119
 commented-0.115
 comments-0.106
 noted-0.105
 saying-0.104
Token thou
Token ID14210
Feature activation-4.62e+00

Pos logit contributions
+1.006
+0.782
��+0.674
Temperature+0.648
��+0.646
Neg logit contributions
 said-1.040
 commented-1.004
 saying-0.927
 comments-0.920
 noted-0.920
Token take
Token ID1011
Feature activation-1.15e+00

Pos logit contributions
+0.573
+0.446
��+0.392
Temperature+0.386
の�+0.381
Neg logit contributions
 said-0.520
 commented-0.503
 comments-0.458
 saying-0.457
 noted-0.456
Token unto
Token ID12722
Feature activation-5.83e+00

Pos logit contributions
+0.151
+0.119
��+0.107
Temperature+0.104
▬▬+0.103
Neg logit contributions
 said-0.124
 commented-0.120
 saying-0.109
 comments-0.108
 noted-0.107
Token thy
Token ID11906
Feature activation-4.68e+00

Pos logit contributions
+0.563
+0.407
��+0.343
Temperature+0.325
��+0.317
Neg logit contributions
 said-0.836
 commented-0.818
 saying-0.761
 comments-0.756
 noted-0.752
Tokenself
Token ID944
Feature activation+1.37e-01

Pos logit contributions
+0.623
+0.498
��+0.446
Temperature+0.431
||||+0.426
Neg logit contributions
 said-0.492
 commented-0.480
 noted-0.430
 comments-0.428
 saying-0.428
Token unto
Token ID12722
Feature activation-5.50e+00

Pos logit contributions
 said+0.006
 commented+0.006
 comments+0.005
 saying+0.005
 noted+0.005
Neg logit contributions
-0.007
-0.006
��-0.005
Temperature-0.005
��-0.005
Token thy
Token ID11906
Feature activation-4.35e+00

Pos logit contributions
+0.676
+0.536
��+0.474
▬▬+0.449
��+0.448
Neg logit contributions
 said-0.633
 commented-0.620
 comments-0.562
 saying-0.561
 noted-0.558
Token hands
Token ID2832
Feature activation-1.58e+00

Pos logit contributions
+0.599
+0.486
��+0.436
▬▬+0.418
||||+0.417
Neg logit contributions
 said-0.432
 commented-0.424
 comments-0.377
 noted-0.376
 commenting-0.373
Token,
Token ID11
Feature activation-7.15e-01

Pos logit contributions
+0.191
+0.148
��+0.132
Temperature+0.124
▬▬+0.124
Neg logit contributions
 said-0.187
 commented-0.181
 comments-0.166
 noted-0.165
 saying-0.164
Token thou
Token ID14210
Feature activation-4.39e+00

Pos logit contributions
+0.097
+0.077
��+0.071
Temperature+0.068
の�+0.067
Neg logit contributions
 said-0.072
 commented-0.070
 comments-0.063
 noted-0.062
 saying-0.062
Token shalt
Token ID36258
Feature activation-8.39e+00

Pos logit contributions
+0.259
+0.137
��+0.090
 shalt+0.083
▬▬+0.075
Neg logit contributions
 said-0.777
 commented-0.765
 comments-0.727
 saying-0.721
 noted-0.720
Token sm
Token ID895
Feature activation-3.96e-01

Pos logit contributions
+1.081
+0.865
��+0.759
の�+0.737
Temperature+0.737
Neg logit contributions
 said-0.887
 commented-0.855
 saying-0.780
 comments-0.777
 noted-0.772
Tokenite
Token ID578
Feature activation-4.97e-01

Pos logit contributions
+0.047
+0.036
Temperature+0.032
��+0.032
▬▬+0.031
Neg logit contributions
 said-0.047
 commented-0.046
 saying-0.043
 remark-0.042
 noted-0.042
Token every
Token ID790
Feature activation-1.77e+00

Pos logit contributions
+0.067
+0.054
��+0.049
▬▬+0.047
Temperature+0.047
Neg logit contributions
 said-0.051
 commented-0.049
 saying-0.045
 comments-0.045
 noted-0.044
Token male
Token ID4257
Feature activation+8.64e-01

Pos logit contributions
+0.256
+0.210
��+0.190
▬▬+0.185
Temperature+0.183
Neg logit contributions
 said-0.166
 commented-0.161
 comments-0.145
 saying-0.141
 noted-0.141
Token therein
Token ID27258
Feature activation-2.32e+00

Pos logit contributions
 said+0.083
 commented+0.080
 comments+0.073
 saying+0.071
 remark+0.070
Neg logit contributions
-0.124
-0.100
��-0.091
▬▬-0.089
の�-0.088
Token |
Token ID930
Feature activation-1.89e+00

Pos logit contributions
 said+0.010
 commented+0.010
 saying+0.009
 noted+0.009
 admitted+0.009
Neg logit contributions
-0.008
-0.006
��-0.006
Temperature-0.005
||||-0.005
Token count
Token ID954
Feature activation-5.51e-01

Pos logit contributions
+0.251
+0.202
��+0.176
||||+0.175
+0.173
Neg logit contributions
 said-0.194
 commented-0.184
 saying-0.171
 noted-0.169
 admitted-0.167
Token |
Token ID930
Feature activation-1.88e+00

Pos logit contributions
+0.061
+0.046
��+0.040
Temperature+0.039
▬▬+0.038
Neg logit contributions
 said-0.071
 commented-0.068
 saying-0.064
 noted-0.063
 admitted-0.063
Token |
Token ID930
Feature activation-1.84e+00

Pos logit contributions
+0.213
+0.163
��+0.142
||||+0.138
+0.137
Neg logit contributions
 said-0.233
 commented-0.222
 saying-0.209
 noted-0.206
 admitted-0.205
Token ----
Token ID13498
Feature activation-1.13e+00

Pos logit contributions
+0.189
+0.140
��+0.121
+0.116
▬▬+0.115
Neg logit contributions
 said-0.250
 commented-0.239
 saying-0.226
 noted-0.223
 admitted-0.221
Token-+
Token ID19529
Feature activation-6.89e+00

Pos logit contributions
+0.081
+0.048
��+0.038
Temperature+0.033
+0.033
Neg logit contributions
 said-0.196
 commented-0.188
 saying-0.181
 noted-0.178
 admitted-0.177
Token--------------------------------
Token ID3880
Feature activation-3.05e+00

Pos logit contributions
+0.515
+0.325
��+0.269
+0.241
▬▬+0.239
Neg logit contributions
 said-1.153
 commented-1.104
 saying-1.063
 noted-1.048
 admitted-1.042
Token+
Token ID10
Feature activation-4.84e+00

Pos logit contributions
+0.276
+0.191
��+0.162
+0.148
▬▬+0.148
Neg logit contributions
 said-0.468
 commented-0.447
 saying-0.427
 noted-0.421
 admitted-0.419
Token-------
Token ID26866
Feature activation-4.20e+00

Pos logit contributions
+0.411
+0.275
��+0.237
+0.210
Temperature+0.210
Neg logit contributions
 said-0.765
 commented-0.730
 saying-0.702
 noted-0.690
 admitted-0.688
Token|
Token ID91
Feature activation-4.01e+00

Pos logit contributions
+0.309
+0.192
��+0.153
Temperature+0.136
+0.132
Neg logit contributions
 said-0.715
 commented-0.685
 saying-0.658
 noted-0.650
 comments-0.646
Token |
Token ID930
Feature activation-1.83e+00

Pos logit contributions
+0.438
+0.328
��+0.290
+0.274
||||+0.272
Neg logit contributions
 said-0.528
 commented-0.501
 saying-0.478
 noted-0.471
 admitted-0.466
Token |
Token ID930
Feature activation-1.79e+00

Pos logit contributions
 said+0.098
 commented+0.093
 saying+0.088
 noted+0.087
 admitted+0.086
Neg logit contributions
-0.081
-0.061
��-0.053
Temperature-0.051
||||-0.050
Token count
Token ID954
Feature activation-4.79e-01

Pos logit contributions
+0.235
+0.187
��+0.167
||||+0.162
+0.161
Neg logit contributions
 said-0.191
 commented-0.179
 saying-0.168
 noted-0.165
 admitted-0.165
Token |
Token ID930
Feature activation-1.79e+00

Pos logit contributions
+0.053
+0.039
��+0.034
Temperature+0.033
▬▬+0.033
Neg logit contributions
 said-0.063
 commented-0.060
 saying-0.057
 noted-0.056
 admitted-0.056
Token |
Token ID930
Feature activation-1.73e+00

Pos logit contributions
+0.226
+0.178
��+0.160
Temperature+0.153
||||+0.153
Neg logit contributions
 said-0.201
 commented-0.188
 saying-0.177
 noted-0.175
 admitted-0.174
Token --------------------
Token ID41436
Feature activation-1.43e+00

Pos logit contributions
+0.219
+0.172
��+0.154
||||+0.148
Temperature+0.148
Neg logit contributions
 said-0.196
 commented-0.184
 saying-0.173
 noted-0.171
 admitted-0.169
Token-+
Token ID19529
Feature activation-6.80e+00

Pos logit contributions
+0.130
+0.090
��+0.077
Temperature+0.072
+0.071
Neg logit contributions
 said-0.216
 commented-0.207
 saying-0.197
 noted-0.194
 admitted-0.193
Token--------
Token ID982
Feature activation-2.75e+00

Pos logit contributions
+0.601
+0.410
��+0.352
+0.330
▬▬+0.328
Neg logit contributions
 said-1.036
 commented-0.989
 saying-0.947
 noted-0.935
 admitted-0.933
Token-+
Token ID19529
Feature activation-6.77e+00

Pos logit contributions
+0.138
+0.060
��+0.034
Temperature+0.024
��+0.023
Neg logit contributions
 said-0.531
 commented-0.511
 saying-0.494
 noted-0.488
 admitted-0.486
Token-------
Token ID26866
Feature activation-4.07e+00

Pos logit contributions
+0.583
+0.393
��+0.333
+0.313
▬▬+0.309
Neg logit contributions
 said-1.050
 commented-1.002
 saying-0.964
 noted-0.950
 admitted-0.946
Token|
Token ID91
Feature activation-3.91e+00

Pos logit contributions
+0.321
+0.206
��+0.165
Temperature+0.154
||||+0.152
Neg logit contributions
 said-0.663
 commented-0.635
 saying-0.612
 noted-0.602
 admitted-0.599
Token |
Token ID930
Feature activation-1.74e+00

Pos logit contributions
+0.362
+0.251
��+0.219
+0.206
▬▬+0.204
Neg logit contributions
 said-0.579
 commented-0.551
 saying-0.531
 noted-0.521
 admitted-0.518
Token |
Token ID930
Feature activation-1.79e+00

Pos logit contributions
+0.053
+0.039
��+0.034
Temperature+0.033
▬▬+0.033
Neg logit contributions
 said-0.063
 commented-0.060
 saying-0.057
 noted-0.056
 admitted-0.056
Token |
Token ID930
Feature activation-1.73e+00

Pos logit contributions
+0.226
+0.178
��+0.160
Temperature+0.153
||||+0.153
Neg logit contributions
 said-0.201
 commented-0.188
 saying-0.177
 noted-0.175
 admitted-0.174
Token --------------------
Token ID41436
Feature activation-1.43e+00

Pos logit contributions
+0.219
+0.172
��+0.154
||||+0.148
Temperature+0.148
Neg logit contributions
 said-0.196
 commented-0.184
 saying-0.173
 noted-0.171
 admitted-0.169
Token-+
Token ID19529
Feature activation-6.80e+00

Pos logit contributions
+0.130
+0.090
��+0.077
Temperature+0.072
+0.071
Neg logit contributions
 said-0.216
 commented-0.207
 saying-0.197
 noted-0.194
 admitted-0.193
Token--------
Token ID982
Feature activation-2.75e+00

Pos logit contributions
+0.601
+0.410
��+0.352
+0.330
▬▬+0.328
Neg logit contributions
 said-1.036
 commented-0.989
 saying-0.947
 noted-0.935
 admitted-0.933
Token-+
Token ID19529
Feature activation-6.77e+00

Pos logit contributions
+0.138
+0.060
��+0.034
Temperature+0.024
��+0.023
Neg logit contributions
 said-0.531
 commented-0.511
 saying-0.494
 noted-0.488
 admitted-0.486
Token-------
Token ID26866
Feature activation-4.07e+00

Pos logit contributions
+0.583
+0.393
��+0.333
+0.313
▬▬+0.309
Neg logit contributions
 said-1.050
 commented-1.002
 saying-0.964
 noted-0.950
 admitted-0.946
Token|
Token ID91
Feature activation-3.91e+00

Pos logit contributions
+0.321
+0.206
��+0.165
Temperature+0.154
||||+0.152
Neg logit contributions
 said-0.663
 commented-0.635
 saying-0.612
 noted-0.602
 admitted-0.599
Token |
Token ID930
Feature activation-1.74e+00

Pos logit contributions
+0.362
+0.251
��+0.219
+0.206
▬▬+0.204
Neg logit contributions
 said-0.579
 commented-0.551
 saying-0.531
 noted-0.521
 admitted-0.518
Token org
Token ID8745
Feature activation+4.81e-01

Pos logit contributions
+0.222
+0.176
��+0.157
||||+0.151
Temperature+0.150
Neg logit contributions
 said-0.196
 commented-0.185
 saying-0.175
 noted-0.172
 admitted-0.170
Token.
Token ID764
Feature activation+1.34e+00

Pos logit contributions
 said+0.055
 commented+0.052
 saying+0.049
 noted+0.049
 admitted+0.048
Neg logit contributions
-0.061
-0.047
��-0.043
Temperature-0.041
||||-0.041
Token Moore
Token ID8877
Feature activation+3.05e+00

Pos logit contributions
+0.156
+0.123
��+0.110
Temperature+0.105
��+0.104
Neg logit contributions
 said-0.142
 commented-0.136
 saying-0.124
 comments-0.123
 remark-0.123
Token
Token ID447
Feature activation-2.68e-01

Pos logit contributions
 said+0.397
 commented+0.378
 saying+0.353
 noted+0.350
 remark+0.350
Neg logit contributions
-0.338
-0.255
��-0.223
Temperature-0.213
▬▬-0.208
Token
Token ID247
Feature activation-9.12e-02

Pos logit contributions
+0.023
+0.015
��+0.013
��+0.012
+0.011
Neg logit contributions
 said-0.043
 commented-0.041
 saying-0.040
 noted-0.039
 comments-0.039
Tokens
Token ID82
Feature activation+2.74e+00

Pos logit contributions
+0.017
+0.014
��+0.013
Temperature+0.013
+0.013
Neg logit contributions
 said-0.005
 commented-0.005
 saying-0.004
 remark-0.004
 noted-0.004
Token Prom
Token ID10335
Feature activation-8.56e-01

Pos logit contributions
 said+0.264
 commented+0.250
 saying+0.227
 noted+0.220
 remark+0.218
Neg logit contributions
-0.392
-0.320
��-0.291
Temperature-0.282
��-0.282
Tokenethe
Token ID10567
Feature activation-6.53e+00

Pos logit contributions
+0.074
+0.050
��+0.043
Temperature+0.040
��+0.040
Neg logit contributions
 said-0.132
 commented-0.127
 saying-0.121
 comments-0.119
 remark-0.119
Tokena
Token ID64
Feature activation+3.20e+00

Pos logit contributions
+1.156
+0.983
��+0.907
Temperature+0.890
+0.877
Neg logit contributions
 said-0.400
 commented-0.377
 commenting-0.327
 remark-0.324
 saying-0.324
Token,
Token ID11
Feature activation-8.34e-01

Pos logit contributions
 said+0.372
 commented+0.354
 saying+0.326
 noted+0.323
 admitted+0.321
Neg logit contributions
-0.398
-0.313
��-0.277
Temperature-0.268
��-0.266
Token serial
Token ID11389
Feature activation+1.78e-01

Pos logit contributions
+0.126
+0.105
��+0.096
��+0.092
Temperature+0.092
Neg logit contributions
 said-0.072
 commented-0.068
 comments-0.060
 admitted-0.060
 saying-0.060
Tokenized
Token ID1143
Feature activation+5.16e-01

Pos logit contributions
 said+0.020
 commented+0.019
 saying+0.018
 noted+0.017
 remark+0.017
Neg logit contributions
-0.022
-0.018
��-0.016
Temperature-0.016
の�-0.015
Token from
Token ID422
Feature activation+6.57e-01

Pos logit contributions
 said+0.048
 commented+0.045
 saying+0.041
 noted+0.040
 remark+0.040
Neg logit contributions
-0.076
-0.062
��-0.057
Temperature-0.056
-0.055
Token
Token ID198
Feature activation+2.34e-01

Pos logit contributions
 said+0.031
 commented+0.030
 saying+0.028
 noted+0.027
 comments+0.027
Neg logit contributions
-0.031
-0.024
��-0.022
Temperature-0.021
▬▬-0.020
TokenE
Token ID36
Feature activation+1.64e-01

Pos logit contributions
 said+0.035
 commented+0.034
 saying+0.032
 comments+0.032
 noted+0.032
Neg logit contributions
-0.021
-0.015
��-0.013
Temperature-0.013
▬▬-0.012
Tokenad
Token ID324
Feature activation+3.37e+00

Pos logit contributions
 said+0.016
 commented+0.015
 saying+0.014
 remark+0.014
 noted+0.014
Neg logit contributions
-0.023
-0.019
��-0.018
Temperature-0.017
-0.016
Tokengar
Token ID4563
Feature activation+2.91e-01

Pos logit contributions
 said+0.357
 commented+0.334
 saying+0.312
 comments+0.306
 noted+0.305
Neg logit contributions
-0.460
-0.367
��-0.337
Temperature-0.323
▬▬-0.318
Token A
Token ID317
Feature activation+1.39e+00

Pos logit contributions
 said+0.029
 commented+0.027
 saying+0.026
 comments+0.025
 remark+0.025
Neg logit contributions
-0.040
-0.032
��-0.029
��-0.028
の�-0.028
Tokenethe
Token ID10567
Feature activation-6.45e+00

Pos logit contributions
 said+0.221
 commented+0.210
 saying+0.201
 noted+0.198
 comments+0.198
Neg logit contributions
-0.119
-0.080
��-0.070
Temperature-0.063
��-0.062
Tokenling
Token ID1359
Feature activation-2.17e-01

Pos logit contributions
+0.814
+0.633
��+0.573
Temperature+0.550
��+0.540
Neg logit contributions
 said-0.743
 commented-0.703
 saying-0.656
 comments-0.648
 remark-0.646
Token fled
Token ID11468
Feature activation-2.52e+00

Pos logit contributions
+0.030
+0.024
��+0.022
Temperature+0.021
��+0.021
Neg logit contributions
 said-0.022
 commented-0.020
 saying-0.019
 comments-0.019
 remark-0.019
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
+0.321
+0.251
��+0.227
Temperature+0.219
+0.217
Neg logit contributions
 said-0.282
 commented-0.267
 saying-0.247
 comments-0.244
 noted-0.243
Token Scotland
Token ID8838
Feature activation-1.99e+00

Pos logit contributions
+0.151
+0.120
��+0.109
Temperature+0.104
��+0.104
Neg logit contributions
 said-0.122
 commented-0.117
 saying-0.108
 comments-0.106
 noted-0.105
Token with
Token ID351
Feature activation+1.59e-01

Pos logit contributions
+0.268
+0.214
��+0.193
+0.185
Temperature+0.185
Neg logit contributions
 said-0.210
 commented-0.198
 comments-0.181
 noted-0.179
 saying-0.179
Token is
Token ID318
Feature activation-1.32e+00

Pos logit contributions
+0.221
+0.173
��+0.156
Temperature+0.151
▬▬+0.150
Neg logit contributions
 said-0.196
 commented-0.186
 saying-0.173
 noted-0.170
 remark-0.170
Token required
Token ID2672
Feature activation-2.50e-01

Pos logit contributions
+0.178
+0.141
��+0.128
Temperature+0.123
▬▬+0.122
Neg logit contributions
 said-0.137
 commented-0.130
 saying-0.120
 remark-0.120
 comments-0.119
Token is
Token ID318
Feature activation-1.32e+00

Pos logit contributions
+0.032
+0.025
��+0.022
Temperature+0.021
▬▬+0.021
Neg logit contributions
 said-0.029
 commented-0.027
 saying-0.025
 noted-0.025
 comments-0.025
Token simple
Token ID2829
Feature activation-9.14e-03

Pos logit contributions
+0.205
+0.168
��+0.154
▬▬+0.151
Temperature+0.150
Neg logit contributions
 said-0.109
 commented-0.104
 remark-0.092
 saying-0.091
 noted-0.091
Token.
Token ID13
Feature activation+9.46e-01

Pos logit contributions
+0.001
+0.001
��+0.001
▬▬+0.001
Temperature+0.001
Neg logit contributions
 said-0.001
 commented-0.001
 noted-0.001
 saying-0.001
 remark-0.001
Token Cheap
Token ID41087
Feature activation-6.42e+00

Pos logit contributions
 said+0.128
 commented+0.123
 saying+0.116
 noted+0.115
 remark+0.115
Neg logit contributions
-0.099
-0.073
��-0.064
Temperature-0.061
▬▬-0.060
Token customer
Token ID6491
Feature activation+5.06e-01

Pos logit contributions
+0.808
+0.635
��+0.570
Temperature+0.549
▬▬+0.538
Neg logit contributions
 said-0.730
 commented-0.695
 noted-0.644
 saying-0.642
 remark-0.628
Token engines
Token ID11874
Feature activation-3.05e+00

Pos logit contributions
 said+0.065
 commented+0.062
 saying+0.058
 noted+0.058
 remark+0.057
Neg logit contributions
-0.056
-0.042
��-0.037
▬▬-0.036
Temperature-0.035
Token and
Token ID290
Feature activation+5.70e-01

Pos logit contributions
+0.383
+0.300
��+0.267
Temperature+0.258
▬▬+0.257
Neg logit contributions
 said-0.348
 commented-0.330
 saying-0.306
 comments-0.304
 noted-0.302
Token restrictions
Token ID8733
Feature activation-8.22e-01

Pos logit contributions
 said+0.061
 commented+0.058
 saying+0.054
 noted+0.053
 remark+0.053
Neg logit contributions
-0.074
-0.059
��-0.053
Temperature-0.052
▬▬-0.052
Token on
Token ID319
Feature activation+4.32e-01

Pos logit contributions
+0.119
+0.096
��+0.087
Temperature+0.085
▬▬+0.085
Neg logit contributions
 said-0.079
 commented-0.074
 saying-0.068
 comments-0.067
 noted-0.067
Token those
Token ID883
Feature activation-5.19e-01

Pos logit contributions
 said+0.027
 commented+0.026
 remark+0.023
 noted+0.023
 saying+0.023
Neg logit contributions
-0.041
-0.033
��-0.030
��-0.029
Temperature-0.029
Token that
Token ID326
Feature activation+1.64e+00

Pos logit contributions
+0.074
+0.060
��+0.055
▬▬+0.053
��+0.052
Neg logit contributions
 said-0.049
 commented-0.046
 remark-0.041
 noted-0.041
 saying-0.041
Token are
Token ID389
Feature activation-1.93e+00

Pos logit contributions
 said+0.194
 commented+0.185
 saying+0.174
 comments+0.172
 noted+0.171
Neg logit contributions
-0.195
-0.153
��-0.134
��-0.128
Temperature-0.128
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
+0.261
+0.209
��+0.187
��+0.182
▬▬+0.181
Neg logit contributions
 said-0.194
 commented-0.186
 noted-0.168
 comments-0.167
 remark-0.167
Token the
Token ID262
Feature activation-1.36e+00

Pos logit contributions
 said+0.129
 commented+0.123
 noted+0.111
 remark+0.111
 saying+0.111
Neg logit contributions
-0.170
-0.138
��-0.124
▬▬-0.121
Temperature-0.120
Token thro
Token ID9244
Feature activation-6.38e+00

Pos logit contributions
+0.180
+0.143
��+0.129
Temperature+0.123
▬▬+0.123
Neg logit contributions
 said-0.146
 commented-0.139
 saying-0.127
 noted-0.126
 remark-0.125
Tokenes
Token ID274
Feature activation+2.91e+00

Pos logit contributions
+1.053
+0.878
��+0.823
Temperature+0.800
��+0.788
Neg logit contributions
 said-0.490
 commented-0.449
 saying-0.404
 noted-0.393
 comments-0.391
Token of
Token ID286
Feature activation-1.86e+00

Pos logit contributions
 said+0.386
 commented+0.366
 saying+0.345
 noted+0.340
 comments+0.339
Neg logit contributions
-0.321
-0.240
��-0.217
Temperature-0.204
▬▬-0.200
Token making
Token ID1642
Feature activation+7.87e-01

Pos logit contributions
+0.268
+0.220
��+0.200
Temperature+0.192
▬▬+0.192
Neg logit contributions
 said-0.170
 commented-0.163
 noted-0.145
 remark-0.144
 admitted-0.143
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
 said+0.079
 commented+0.076
 saying+0.069
 noted+0.068
 remark+0.068
Neg logit contributions
-0.108
-0.087
��-0.079
Temperature-0.076
▬▬-0.075
Token decision
Token ID2551
Feature activation+2.21e+00

Pos logit contributions
+0.182
+0.146
��+0.134
Temperature+0.128
��+0.127
Neg logit contributions
 said-0.134
 commented-0.127
 saying-0.115
 noted-0.113
 remark-0.112
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
+0.379
+0.303
��+0.279
Temperature+0.264
||||+0.264
Neg logit contributions
 said-0.289
 commented-0.272
 saying-0.248
 comments-0.247
 noted-0.245
Token E
Token ID412
Feature activation+1.27e+00

Pos logit contributions
 said+0.125
 commented+0.118
 saying+0.112
 remark+0.110
 noted+0.110
Neg logit contributions
-0.121
-0.094
��-0.083
Temperature-0.081
||||-0.079
Tokenad
Token ID324
Feature activation+3.44e+00

Pos logit contributions
 said+0.102
 commented+0.097
 saying+0.088
 remark+0.085
 comments+0.085
Neg logit contributions
-0.201
-0.167
��-0.157
Temperature-0.149
の�-0.147
Tokengar
Token ID4563
Feature activation+3.14e-01

Pos logit contributions
 said+0.362
 commented+0.337
 saying+0.315
 comments+0.307
 noted+0.307
Neg logit contributions
-0.476
-0.380
��-0.352
Temperature-0.338
��-0.332
Token A
Token ID317
Feature activation+1.39e+00

Pos logit contributions
 said+0.032
 commented+0.030
 saying+0.028
 comments+0.027
 remark+0.027
Neg logit contributions
-0.043
-0.035
��-0.031
��-0.030
の�-0.030
Tokenethe
Token ID10567
Feature activation-6.35e+00

Pos logit contributions
 said+0.220
 commented+0.209
 saying+0.200
 noted+0.197
 comments+0.197
Neg logit contributions
-0.120
-0.081
��-0.071
Temperature-0.064
��-0.063
Tokenling
Token ID1359
Feature activation-2.29e-01

Pos logit contributions
+0.803
+0.624
��+0.565
Temperature+0.544
��+0.535
Neg logit contributions
 said-0.732
 commented-0.692
 saying-0.646
 comments-0.638
 noted-0.636
Token and
Token ID290
Feature activation+6.98e-01

Pos logit contributions
+0.032
+0.026
��+0.023
の�+0.022
��+0.022
Neg logit contributions
 said-0.023
 commented-0.021
 saying-0.020
 comments-0.019
 remark-0.019
Token his
Token ID465
Feature activation+1.75e+00

Pos logit contributions
 said+0.056
 commented+0.052
 saying+0.047
 comments+0.045
 remark+0.045
Neg logit contributions
-0.111
-0.093
��-0.084
||||-0.082
Temperature-0.082
Token supporters
Token ID5941
Feature activation+2.35e+00

Pos logit contributions
 said+0.148
 commented+0.142
 saying+0.125
 commenting+0.123
 noted+0.121
Neg logit contributions
-0.268
-0.221
��-0.203
||||-0.195
-0.195
Token joined
Token ID5399
Feature activation+6.38e-01

Pos logit contributions
 said+0.237
 commented+0.221
 saying+0.209
 comments+0.204
 noted+0.201
Neg logit contributions
-0.324
-0.260
��-0.239
��-0.226
Temperature-0.225
Token (
Token ID357
Feature activation-4.92e-01

Pos logit contributions
+0.082
+0.064
��+0.059
Temperature+0.056
+0.055
Neg logit contributions
 said-0.076
 commented-0.072
 saying-0.067
 noted-0.066
 remark-0.065
Token3
Token ID18
Feature activation-1.44e+00

Pos logit contributions
+0.053
+0.039
��+0.034
Temperature+0.034
▬▬+0.032
Neg logit contributions
 said-0.066
 commented-0.063
 saying-0.060
 remark-0.059
 noted-0.059
Token)
Token ID8
Feature activation-2.18e+00

Pos logit contributions
+0.135
+0.095
��+0.084
Temperature+0.077
��+0.075
Neg logit contributions
 said-0.215
 commented-0.205
 saying-0.195
 noted-0.192
 comments-0.191
Token
Token ID198
Feature activation-9.25e-03

Pos logit contributions
+0.264
+0.204
��+0.187
Temperature+0.177
��+0.174
Neg logit contributions
 said-0.266
 commented-0.250
 saying-0.236
 noted-0.231
 comments-0.231
Token
Token ID198
Feature activation-7.85e-03

Pos logit contributions
+0.001
+0.001
��+0.001
Temperature+0.001
▬▬+0.001
Neg logit contributions
 said-0.001
 commented-0.001
 saying-0.001
 noted-0.001
 comments-0.001
TokenBreak
Token ID31737
Feature activation-6.30e+00

Pos logit contributions
+0.001
+0.000
��+0.000
Temperature+0.000
▬▬+0.000
Neg logit contributions
 said-0.001
 commented-0.001
 saying-0.001
 comments-0.001
 noted-0.001
Token 11
Token ID1367
Feature activation+3.55e-01

Pos logit contributions
+0.842
+0.671
��+0.608
Temperature+0.584
��+0.575
Neg logit contributions
 said-0.675
 commented-0.644
 saying-0.594
 comments-0.581
 noted-0.581
Token test
Token ID1332
Feature activation+5.03e-01

Pos logit contributions
 said+0.043
 commented+0.041
 saying+0.038
 remark+0.037
 noted+0.037
Neg logit contributions
-0.043
-0.034
��-0.031
Temperature-0.029
��-0.029
Token chamber
Token ID11847
Feature activation-8.75e-03

Pos logit contributions
 said+0.065
 commented+0.062
 saying+0.058
 remark+0.057
 noted+0.057
Neg logit contributions
-0.056
-0.043
��-0.038
Temperature-0.036
-0.035
Token monitors
Token ID19374
Feature activation+4.12e-01

Pos logit contributions
+0.001
+0.001
��+0.001
Temperature+0.001
+0.001
Neg logit contributions
 said-0.001
 commented-0.001
 saying-0.001
 noted-0.001
 remark-0.001
Token (
Token ID357
Feature activation-5.68e-01

Pos logit contributions
 said+0.041
 commented+0.038
 saying+0.035
 comments+0.035
 remark+0.035
Neg logit contributions
-0.058
-0.047
��-0.042
Temperature-0.041
▬▬-0.040
Token,
Token ID11
Feature activation-8.07e-01

Pos logit contributions
+0.018
+0.014
��+0.012
Temperature+0.012
)/+0.011
Neg logit contributions
 said-0.019
 commented-0.018
 saying-0.017
 remark-0.016
 admitted-0.016
Token 2011
Token ID2813
Feature activation+6.43e-01

Pos logit contributions
+0.113
+0.092
��+0.081
Temperature+0.080
+0.079
Neg logit contributions
 said-0.079
 commented-0.074
 saying-0.068
 admitted-0.066
 remark-0.066
Token).
Token ID737
Feature activation-1.04e+00

Pos logit contributions
 said+0.079
 commented+0.074
 saying+0.069
 remark+0.068
 admitted+0.068
Neg logit contributions
-0.076
-0.059
��-0.052
Temperature-0.050
)/-0.049
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
+0.108
+0.080
Temperature+0.071
��+0.069
+0.066
Neg logit contributions
 said-0.142
 commented-0.136
 saying-0.129
 remark-0.127
 noted-0.126
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
 said+0.015
 commented+0.015
 saying+0.014
 noted+0.013
 comments+0.013
Neg logit contributions
-0.015
-0.011
��-0.010
Temperature-0.010
▬▬-0.010
TokenTemperature
Token ID42492
Feature activation-6.30e+00

Pos logit contributions
 said+0.021
 commented+0.021
 saying+0.020
 comments+0.019
 noted+0.019
Neg logit contributions
-0.011
-0.008
Temperature-0.007
��-0.006
These-0.006
Token feedback
Token ID7538
Feature activation+1.93e+00

Pos logit contributions
+0.855
+0.685
Temperature+0.628
��+0.622
+0.597
Neg logit contributions
 said-0.647
 commented-0.610
 saying-0.563
 noted-0.546
 comments-0.545
Tokens
Token ID82
Feature activation+2.73e+00

Pos logit contributions
 said+0.122
 commented+0.113
 saying+0.097
 remark+0.093
 noted+0.092
Neg logit contributions
-0.340
-0.289
Temperature-0.267
��-0.267
-0.260
Token
Token ID784
Feature activation-1.09e+00

Pos logit contributions
 said+0.316
 commented+0.298
 saying+0.280
 comments+0.275
 noted+0.271
Neg logit contributions
-0.339
-0.266
��-0.238
Temperature-0.232
の�-0.225
Token in
Token ID287
Feature activation+1.18e+00

Pos logit contributions
+0.153
+0.123
Temperature+0.113
��+0.113
+0.109
Neg logit contributions
 said-0.108
 commented-0.102
 saying-0.093
 remark-0.091
 comments-0.091
Token effect
Token ID1245
Feature activation+5.58e-01

Pos logit contributions
 said+0.116
 commented+0.111
 saying+0.099
 noted+0.098
 remark+0.097
Neg logit contributions
-0.168
-0.136
Temperature-0.125
��-0.124
-0.120
Token68
Token ID3104
Feature activation-3.47e-01

Pos logit contributions
+0.043
+0.033
��+0.029
Temperature+0.028
��+0.027
Neg logit contributions
 said-0.049
 commented-0.047
 saying-0.044
 remark-0.043
 noted-0.043
Token E
Token ID412
Feature activation+1.41e+00

Pos logit contributions
+0.049
+0.039
��+0.036
Temperature+0.034
��+0.034
Neg logit contributions
 said-0.035
 commented-0.033
 saying-0.030
 remark-0.030
 comments-0.030
Tokenad
Token ID324
Feature activation+3.52e+00

Pos logit contributions
 said+0.151
 commented+0.145
 saying+0.135
 remark+0.133
 comments+0.132
Neg logit contributions
-0.187
-0.149
��-0.138
Temperature-0.128
-0.127
Tokengar
Token ID4563
Feature activation+5.12e-01

Pos logit contributions
 said+0.396
 commented+0.376
 saying+0.351
 comments+0.346
 remark+0.346
Neg logit contributions
-0.452
-0.358
��-0.321
-0.298
Temperature-0.298
Token A
Token ID317
Feature activation+1.56e+00

Pos logit contributions
 said+0.050
 commented+0.047
 saying+0.044
 comments+0.043
 remark+0.043
Neg logit contributions
-0.073
-0.059
��-0.054
��-0.051
の�-0.051
Tokenethe
Token ID10567
Feature activation-6.26e+00

Pos logit contributions
 said+0.167
 commented+0.159
 saying+0.148
 comments+0.146
 remark+0.145
Neg logit contributions
-0.208
-0.167
��-0.152
-0.144
��-0.144
Tokenling
Token ID1359
Feature activation-6.42e-02

Pos logit contributions
+0.855
+0.686
��+0.618
+0.587
+0.584
Neg logit contributions
 said-0.642
 commented-0.608
 saying-0.574
 remark-0.572
 comments-0.566
Token,
Token ID11
Feature activation-5.72e-01

Pos logit contributions
+0.009
+0.007
��+0.007
��+0.006
の�+0.006
Neg logit contributions
 said-0.006
 commented-0.006
 saying-0.006
 comments-0.005
 remark-0.005
Token the
Token ID262
Feature activation-1.16e+00

Pos logit contributions
+0.085
+0.070
��+0.064
��+0.062
の�+0.062
Neg logit contributions
 said-0.050
 commented-0.048
 comments-0.043
 saying-0.042
 comment-0.042
Token last
Token ID938
Feature activation+3.16e-01

Pos logit contributions
+0.166
+0.135
��+0.124
��+0.118
Temperature+0.118
Neg logit contributions
 said-0.110
 commented-0.104
 saying-0.095
 comments-0.094
 commenting-0.093
Token surviving
Token ID16997
Feature activation-4.42e-01

Pos logit contributions
 said+0.035
 commented+0.033
 saying+0.031
 comments+0.031
 remark+0.030
Neg logit contributions
-0.041
-0.032
��-0.029
Temperature-0.028
の�-0.028
Token Geek
Token ID33639
Feature activation+2.84e-01

Pos logit contributions
+0.556
+0.447
��+0.409
Temperature+0.396
��+0.386
Neg logit contributions
 said-0.418
 commented-0.401
 saying-0.359
 noted-0.354
 admitted-0.351
Token Z
Token ID1168
Feature activation+1.79e+00

Pos logit contributions
 said+0.033
 commented+0.031
 saying+0.029
 comments+0.029
 remark+0.029
Neg logit contributions
-0.036
-0.028
��-0.025
Temperature-0.025
▬▬-0.024
Tokenodiac
Token ID40096
Feature activation-1.04e+00

Pos logit contributions
 said+0.199
 commented+0.190
 saying+0.175
 comments+0.174
 noted+0.173
Neg logit contributions
-0.235
-0.186
��-0.168
Temperature-0.162
��-0.161
Token
Token ID198
Feature activation+1.51e-02

Pos logit contributions
+0.132
+0.104
��+0.094
Temperature+0.091
��+0.090
Neg logit contributions
 said-0.117
 commented-0.111
 saying-0.104
 remark-0.102
 comments-0.101
Token
Token ID198
Feature activation+3.84e-02

Pos logit contributions
 said+0.002
 commented+0.002
 saying+0.002
 remark+0.002
 noted+0.002
Neg logit contributions
-0.002
-0.001
Temperature-0.001
��-0.001
||||-0.001
TokenTHE
Token ID10970
Feature activation-6.18e+00

Pos logit contributions
 said+0.006
 commented+0.005
 saying+0.005
 noted+0.005
 comments+0.005
Neg logit contributions
-0.004
-0.003
��-0.002
Temperature-0.002
▬▬-0.002
Token G
Token ID402
Feature activation+1.40e+00

Pos logit contributions
+0.771
+0.608
��+0.544
Temperature+0.523
▬▬+0.512
Neg logit contributions
 said-0.709
 commented-0.675
 saying-0.626
 noted-0.617
 remark-0.616
TokenEEK
Token ID33823
Feature activation-4.77e-01

Pos logit contributions
 said+0.175
 commented+0.166
 saying+0.158
 remark+0.155
 comments+0.155
Neg logit contributions
-0.164
-0.125
��-0.110
Temperature-0.107
▬▬-0.104
Token Z
Token ID1168
Feature activation+1.81e+00

Pos logit contributions
+0.066
+0.053
��+0.049
▬▬+0.047
Temperature+0.046
Neg logit contributions
 said-0.049
 commented-0.046
 saying-0.043
 comments-0.042
 remark-0.041
TokenOD
Token ID3727
Feature activation+7.59e-01

Pos logit contributions
 said+0.216
 commented+0.203
 saying+0.192
 comments+0.188
 noted+0.188
Neg logit contributions
-0.224
-0.174
��-0.158
Temperature-0.152
▬▬-0.149
TokenI
Token ID40
Feature activation+5.03e-01

Pos logit contributions
 said+0.095
 commented+0.091
 saying+0.086
 remark+0.084
 noted+0.084
Neg logit contributions
-0.088
-0.067
��-0.058
Temperature-0.056
▬▬-0.056
Token least
Token ID1551
Feature activation-1.40e+00

Pos logit contributions
 said+0.115
 commented+0.110
 saying+0.101
 remark+0.099
 comments+0.099
Neg logit contributions
-0.146
-0.116
��-0.106
▬▬-0.102
Temperature-0.102
Token 26
Token ID2608
Feature activation+1.22e-01

Pos logit contributions
+0.192
+0.153
��+0.140
Temperature+0.135
▬▬+0.132
Neg logit contributions
 said-0.140
 commented-0.134
 saying-0.121
 remark-0.120
 comments-0.120
Token of
Token ID286
Feature activation-2.11e+00

Pos logit contributions
 said+0.014
 commented+0.013
 saying+0.012
 remark+0.012
 noted+0.012
Neg logit contributions
-0.016
-0.012
��-0.011
Temperature-0.011
��-0.011
Token 27
Token ID2681
Feature activation-2.95e-01

Pos logit contributions
+0.268
+0.211
��+0.186
Temperature+0.180
��+0.178
Neg logit contributions
 said-0.238
 commented-0.229
 saying-0.212
 remark-0.208
 noted-0.207
Token:
Token ID25
Feature activation-1.16e+00

Pos logit contributions
+0.040
+0.032
��+0.029
Temperature+0.028
��+0.028
Neg logit contributions
 said-0.030
 commented-0.029
 saying-0.026
 remark-0.026
 noted-0.026
TokenTHE
Token ID10970
Feature activation-6.16e+00

Pos logit contributions
+0.140
+0.108
��+0.096
Temperature+0.094
▬▬+0.091
Neg logit contributions
 said-0.137
 commented-0.131
 saying-0.122
 noted-0.120
 remark-0.120
Token FIRST
Token ID31328
Feature activation-2.85e+00

Pos logit contributions
+0.790
+0.628
��+0.562
Temperature+0.545
▬▬+0.534
Neg logit contributions
 said-0.686
 commented-0.651
 saying-0.601
 noted-0.594
 remark-0.587
Token PART
Token ID16652
Feature activation-1.40e-01

Pos logit contributions
+0.343
+0.264
��+0.237
Temperature+0.228
��+0.226
Neg logit contributions
 said-0.341
 commented-0.324
 saying-0.304
 noted-0.298
 remark-0.296
Token IS
Token ID3180
Feature activation-2.66e+00

Pos logit contributions
+0.014
+0.010
��+0.009
Temperature+0.009
��+0.008
Neg logit contributions
 said-0.019
 commented-0.019
 saying-0.018
 remark-0.017
 comments-0.017
Token E
Token ID412
Feature activation+1.03e+00

Pos logit contributions
+0.299
+0.224
��+0.199
▬▬+0.191
Temperature+0.190
Neg logit contributions
 said-0.342
 commented-0.325
 saying-0.307
 remark-0.305
 comments-0.303
TokenASY
Token ID26483
Feature activation-1.32e+00

Pos logit contributions
 said+0.132
 commented+0.126
 saying+0.119
 remark+0.117
 comments+0.116
Neg logit contributions
-0.116
-0.087
��-0.079
Temperature-0.074
��-0.073
Token 6
Token ID718
Feature activation-1.48e+00

Pos logit contributions
+0.044
+0.034
��+0.030
Temperature+0.030
||||+0.029
Neg logit contributions
 said-0.043
 commented-0.040
 saying-0.038
 remark-0.038
 noted-0.037
Token.
Token ID13
Feature activation+8.73e-01

Pos logit contributions
+0.197
+0.157
��+0.140
Temperature+0.137
||||+0.136
Neg logit contributions
 said-0.158
 commented-0.148
 saying-0.138
 remark-0.136
 admitted-0.135
Token0
Token ID15
Feature activation-1.81e+00

Pos logit contributions
 said+0.120
 commented+0.114
 saying+0.108
 admitted+0.106
 noted+0.106
Neg logit contributions
-0.090
-0.067
��-0.057
Temperature-0.055
||||-0.054
Token (
Token ID357
Feature activation-3.69e-01

Pos logit contributions
+0.234
+0.186
��+0.165
Temperature+0.162
||||+0.160
Neg logit contributions
 said-0.198
 commented-0.186
 saying-0.174
 remark-0.171
 comments-0.170
Tokenh
Token ID71
Feature activation+6.28e-02

Pos logit contributions
+0.046
+0.036
��+0.032
Temperature+0.032
||||+0.031
Neg logit contributions
 said-0.042
 commented-0.040
 saying-0.038
 remark-0.037
 comments-0.037
Tokenundred
Token ID3229
Feature activation-6.16e+00

Pos logit contributions
 said+0.007
 commented+0.007
 saying+0.006
 noted+0.006
 remark+0.006
Neg logit contributions
-0.008
-0.006
��-0.006
Temperature-0.006
||||-0.005
Token times
Token ID1661
Feature activation-1.76e+00

Pos logit contributions
+0.747
+0.581
��+0.519
Temperature+0.495
||||+0.484
Neg logit contributions
 said-0.736
 commented-0.695
 saying-0.653
 noted-0.644
 admitted-0.641
Token the
Token ID262
Feature activation-1.43e+00

Pos logit contributions
+0.214
+0.167
��+0.149
Temperature+0.144
||||+0.141
Neg logit contributions
 said-0.206
 commented-0.194
 saying-0.182
 comments-0.180
 noted-0.178
Token major
Token ID1688
Feature activation-5.72e-01

Pos logit contributions
+0.189
+0.151
��+0.136
Temperature+0.132
||||+0.130
Neg logit contributions
 said-0.151
 commented-0.140
 saying-0.131
 noted-0.128
 remark-0.128
Token version
Token ID2196
Feature activation+5.94e-01

Pos logit contributions
+0.078
+0.063
��+0.057
Temperature+0.055
||||+0.055
Neg logit contributions
 said-0.059
 commented-0.055
 saying-0.051
 noted-0.050
 admitted-0.050
Token plus
Token ID5556
Feature activation-1.54e+00

Pos logit contributions
 said+0.072
 commented+0.068
 saying+0.064
 comments+0.063
 remark+0.063
Neg logit contributions
-0.071
-0.055
��-0.048
Temperature-0.047
||||-0.045
Token Her
Token ID2332
Feature activation+4.46e-01

Pos logit contributions
 said+0.198
 commented+0.189
 saying+0.178
 noted+0.176
 remark+0.176
Neg logit contributions
-0.168
-0.127
��-0.112
Temperature-0.108
▬▬-0.105
Token so
Token ID523
Feature activation+1.21e+00

Pos logit contributions
 said+0.042
 commented+0.040
 saying+0.036
 noted+0.036
 admitted+0.035
Neg logit contributions
-0.064
-0.053
��-0.048
Temperature-0.047
▬▬-0.046
Token-
Token ID12
Feature activation-3.43e+00

Pos logit contributions
 said+0.149
 commented+0.141
 saying+0.133
 comments+0.132
 remark+0.131
Neg logit contributions
-0.143
-0.109
��-0.098
Temperature-0.093
▬▬-0.092
Tokencalled
Token ID7174
Feature activation+1.77e+00

Pos logit contributions
+0.566
+0.471
Temperature+0.434
��+0.432
の�+0.426
Neg logit contributions
 said-0.244
 commented-0.227
 saying-0.205
 remark-0.204
 comments-0.201
Token urban
Token ID7876
Feature activation-3.09e+00

Pos logit contributions
 said+0.179
 commented+0.169
 saying+0.153
 noted+0.152
 admitted+0.149
Neg logit contributions
-0.246
-0.199
��-0.182
Temperature-0.176
��-0.173
Token thru
Token ID33834
Feature activation-6.15e+00

Pos logit contributions
+0.412
+0.329
��+0.300
Temperature+0.290
▬▬+0.284
Neg logit contributions
 said-0.330
 commented-0.315
 saying-0.287
 noted-0.284
 admitted-0.281
Token-
Token ID12
Feature activation-3.11e+00

Pos logit contributions
+0.589
+0.420
��+0.363
Temperature+0.334
��+0.334
Neg logit contributions
 said-0.891
 commented-0.858
 saying-0.809
 noted-0.800
 comments-0.796
Tokenh
Token ID71
Feature activation-2.92e-01

Pos logit contributions
+0.411
+0.327
��+0.293
Temperature+0.283
||||+0.279
Neg logit contributions
 said-0.338
 commented-0.323
 saying-0.299
 noted-0.295
 remark-0.294
Tokenikes
Token ID7938
Feature activation+1.74e+00

Pos logit contributions
+0.043
+0.035
��+0.032
Temperature+0.030
||||+0.030
Neg logit contributions
 said-0.028
 commented-0.026
 saying-0.024
 noted-0.024
 comments-0.023
Token have
Token ID423
Feature activation-1.34e-01

Pos logit contributions
 said+0.202
 commented+0.192
 saying+0.180
 comments+0.177
 noted+0.175
Neg logit contributions
-0.213
-0.167
��-0.149
Temperature-0.142
▬▬-0.140
Token st
Token ID336
Feature activation-1.28e+00

Pos logit contributions
+0.019
+0.015
��+0.014
Temperature+0.013
��+0.013
Neg logit contributions
 said-0.013
 commented-0.012
 saying-0.012
 remark-0.011
 comments-0.011
Tokenigan
Token ID5516
Feature activation+2.33e+00

Pos logit contributions
 said+0.446
 commented+0.422
 saying+0.390
 comments+0.384
 noted+0.384
Neg logit contributions
-0.610
-0.490
��-0.445
Temperature-0.430
��-0.423
Token's
Token ID338
Feature activation-9.20e-01

Pos logit contributions
 said+0.277
 commented+0.265
 saying+0.247
 comments+0.245
 noted+0.243
Neg logit contributions
-0.282
-0.219
��-0.194
▬▬-0.185
Temperature-0.184
Token surviving
Token ID16997
Feature activation-8.20e-01

Pos logit contributions
+0.124
+0.099
��+0.089
Temperature+0.086
��+0.085
Neg logit contributions
 said-0.097
 commented-0.092
 saying-0.084
 comments-0.084
 noted-0.083
Token at
Token ID379
Feature activation+1.13e+00

Pos logit contributions
+0.112
+0.089
��+0.081
Temperature+0.078
+0.078
Neg logit contributions
 said-0.083
 commented-0.079
 saying-0.072
 noted-0.071
 comments-0.070
Token the
Token ID262
Feature activation-1.46e+00

Pos logit contributions
 said+0.118
 commented+0.114
 saying+0.103
 comments+0.102
 noted+0.101
Neg logit contributions
-0.154
-0.123
��-0.113
Temperature-0.108
▬▬-0.107
Token outer
Token ID12076
Feature activation-6.13e+00

Pos logit contributions
+0.186
+0.146
��+0.133
Temperature+0.129
▬▬+0.126
Neg logit contributions
 said-0.163
 commented-0.157
 saying-0.143
 comments-0.141
 noted-0.140
Token edges
Token ID13015
Feature activation-2.66e+00

Pos logit contributions
+0.642
+0.469
��+0.419
Temperature+0.395
▬▬+0.391
Neg logit contributions
 said-0.834
 commented-0.801
 saying-0.748
 comments-0.738
 noted-0.737
Token of
Token ID286
Feature activation-2.04e+00

Pos logit contributions
+0.295
+0.222
��+0.199
Temperature+0.186
▬▬+0.184
Neg logit contributions
 said-0.350
 commented-0.332
 saying-0.312
 comments-0.308
 noted-0.307
Token Mont
Token ID5575
Feature activation+1.72e+00

Pos logit contributions
+0.270
+0.215
��+0.195
▬▬+0.185
Temperature+0.185
Neg logit contributions
 said-0.220
 commented-0.212
 saying-0.192
 comments-0.191
 remark-0.189
Tokenrose
Token ID13698
Feature activation-7.19e-01

Pos logit contributions
 said+0.255
 commented+0.246
 saying+0.234
 remark+0.232
 comments+0.231
Neg logit contributions
-0.161
-0.114
��-0.096
Temperature-0.090
-0.087
Token for
Token ID329
Feature activation+1.96e-01

Pos logit contributions
+0.097
+0.076
��+0.070
Temperature+0.067
��+0.066
Neg logit contributions
 said-0.076
 commented-0.073
 saying-0.066
 remark-0.066
 comments-0.066
Token 25
Token ID1679
Feature activation-6.76e-01

Pos logit contributions
+0.524
+0.422
��+0.384
Temperature+0.373
��+0.373
Neg logit contributions
 said-0.358
 commented-0.347
 saying-0.310
 remark-0.301
 noted-0.301
Token percent
Token ID1411
Feature activation-6.97e-01

Pos logit contributions
+0.076
+0.057
��+0.051
��+0.047
Temperature+0.047
Neg logit contributions
 said-0.086
 commented-0.083
 saying-0.076
 remark-0.076
 noted-0.075
Token of
Token ID286
Feature activation-2.14e+00

Pos logit contributions
+0.082
+0.063
��+0.055
の�+0.052
▬▬+0.052
Neg logit contributions
 said-0.082
 commented-0.080
 comments-0.073
 noted-0.073
 remark-0.072
Token GW
Token ID27164
Feature activation+5.28e-01

Pos logit contributions
+0.286
+0.228
��+0.207
Temperature+0.196
��+0.195
Neg logit contributions
 said-0.218
 commented-0.210
 saying-0.189
 remark-0.187
 noted-0.187
Token Batt
Token ID12350
Feature activation-3.01e-01

Pos logit contributions
 said+0.059
 commented+0.057
 saying+0.052
 remark+0.052
 comments+0.051
Neg logit contributions
-0.067
-0.052
��-0.047
Temperature-0.045
��-0.045
Tokenleground
Token ID28272
Feature activation-6.12e+00

Pos logit contributions
+0.021
+0.013
��+0.011
Temperature+0.009
��+0.009
Neg logit contributions
 said-0.052
 commented-0.050
 saying-0.048
 comments-0.047
 noted-0.047
Token respondents
Token ID14502
Feature activation+3.12e+00

Pos logit contributions
+0.936
+0.760
��+0.692
��+0.672
Temperature+0.668
Neg logit contributions
 said-0.514
 commented-0.493
 noted-0.428
 remark-0.425
 saying-0.422
Token saying
Token ID2282
Feature activation+5.67e+00

Pos logit contributions
 said+0.183
 commented+0.173
 comments+0.143
 noted+0.142
 remarks+0.136
Neg logit contributions
-0.551
-0.467
��-0.424
Temperature-0.417
||||-0.415
Token they
Token ID484
Feature activation+6.44e-01

Pos logit contributions
 said+0.543
 commented+0.530
 remark+0.467
 saying+0.466
 noted+0.464
Neg logit contributions
-0.800
-0.638
��-0.571
Temperature-0.564
-0.562
Token had
Token ID550
Feature activation+8.31e-01

Pos logit contributions
 said+0.069
 commented+0.066
 saying+0.062
 comments+0.062
 remark+0.061
Neg logit contributions
-0.082
-0.065
��-0.056
��-0.055
Temperature-0.054
Token a
Token ID257
Feature activation+2.42e+00

Pos logit contributions
 said+0.074
 commented+0.070
 saying+0.065
 remark+0.065
 commenting+0.063
Neg logit contributions
-0.121
-0.099
Temperature-0.086
��-0.086
��-0.086
Token
Token ID198
Feature activation+1.30e-01

Pos logit contributions
 said+0.014
 commented+0.013
 saying+0.012
 noted+0.012
 comments+0.012
Neg logit contributions
-0.013
-0.010
��-0.009
Temperature-0.009
▬▬-0.009
TokenIf
Token ID1532
Feature activation-2.20e+00

Pos logit contributions
 said+0.018
 commented+0.017
 saying+0.016
 comments+0.016
 remark+0.016
Neg logit contributions
-0.013
-0.010
��-0.009
Temperature-0.009
▬▬-0.008
Token you
Token ID345
Feature activation-1.32e+00

Pos logit contributions
+0.260
+0.200
��+0.179
Temperature+0.173
▬▬+0.169
Neg logit contributions
 said-0.265
 commented-0.253
 saying-0.236
 remark-0.232
 noted-0.231
Token prefer
Token ID4702
Feature activation+3.65e-01

Pos logit contributions
+0.146
+0.111
��+0.098
Temperature+0.092
||||+0.091
Neg logit contributions
 said-0.167
 commented-0.159
 saying-0.152
 comments-0.149
 noted-0.147
Token to
Token ID284
Feature activation-1.21e+00

Pos logit contributions
 said+0.038
 commented+0.036
 saying+0.033
 noted+0.033
 comments+0.033
Neg logit contributions
-0.049
-0.039
��-0.036
Temperature-0.035
▬▬-0.034
Token sail
Token ID14936
Feature activation-6.12e+00

Pos logit contributions
+0.142
+0.109
��+0.098
Temperature+0.094
��+0.091
Neg logit contributions
 said-0.148
 commented-0.141
 saying-0.132
 noted-0.130
 comments-0.130
Token yourself
Token ID3511
Feature activation-7.77e-01

Pos logit contributions
+0.833
+0.665
��+0.610
Temperature+0.589
��+0.581
Neg logit contributions
 said-0.628
 commented-0.591
 saying-0.545
 comments-0.536
 remark-0.534
Token,
Token ID11
Feature activation-8.12e-01

Pos logit contributions
+0.101
+0.080
��+0.072
Temperature+0.070
▬▬+0.069
Neg logit contributions
 said-0.085
 commented-0.080
 saying-0.074
 comments-0.073
 noted-0.073
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
+0.108
+0.086
��+0.079
Temperature+0.076
▬▬+0.074
Neg logit contributions
 said-0.084
 commented-0.080
 saying-0.073
 comments-0.073
 noted-0.072
Token cruising
Token ID44339
Feature activation-4.74e+00

Pos logit contributions
+0.185
+0.148
��+0.136
Temperature+0.130
▬▬+0.128
Neg logit contributions
 said-0.151
 commented-0.144
 saying-0.132
 remark-0.131
 comments-0.130
Token grounds
Token ID9384
Feature activation+1.50e-01

Pos logit contributions
+0.606
+0.481
��+0.437
Temperature+0.430
▬▬+0.417
Neg logit contributions
 said-0.519
 commented-0.491
 saying-0.455
 noted-0.447
 remark-0.444
Token,
Token ID11
Feature activation-3.73e-01

Pos logit contributions
+0.422
+0.330
��+0.300
Temperature+0.289
��+0.284
Neg logit contributions
 said-0.372
 commented-0.352
 saying-0.331
 comments-0.323
 noted-0.323
Token Pay
Token ID7119
Feature activation-8.25e-01

Pos logit contributions
+0.046
+0.036
��+0.033
Temperature+0.031
▬▬+0.031
Neg logit contributions
 said-0.042
 commented-0.040
 saying-0.037
 noted-0.037
 comments-0.037
Token Attention
Token ID47406
Feature activation-1.24e+00

Pos logit contributions
+0.106
+0.084
��+0.076
Temperature+0.073
��+0.072
Neg logit contributions
 said-0.092
 commented-0.087
 saying-0.082
 comments-0.079
 admitted-0.079
Token to
Token ID284
Feature activation-8.48e-01

Pos logit contributions
+0.148
+0.113
��+0.101
Temperature+0.097
��+0.095
Neg logit contributions
 said-0.151
 commented-0.144
 saying-0.135
 comments-0.132
 noted-0.132
Token Your
Token ID3406
Feature activation-4.06e+00

Pos logit contributions
+0.100
+0.077
��+0.070
Temperature+0.066
��+0.065
Neg logit contributions
 said-0.103
 commented-0.098
 saying-0.092
 admitted-0.090
 noted-0.090
Token Partner
Token ID35532
Feature activation+3.41e+00

Pos logit contributions
+0.517
+0.409
��+0.374
Temperature+0.362
▬▬+0.346
Neg logit contributions
 said-0.449
 commented-0.428
 saying-0.396
 noted-0.391
 remark-0.390
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
 said+0.408
 commented+0.387
 saying+0.361
 remark+0.357
 noted+0.357
Neg logit contributions
-0.412
-0.318
��-0.285
Temperature-0.275
▬▬-0.270
Token
Token ID198
Feature activation+4.80e-01

Pos logit contributions
 said+0.168
 commented+0.160
 saying+0.151
 noted+0.149
 remark+0.149
Neg logit contributions
-0.143
-0.108
��-0.096
Temperature-0.093
▬▬-0.091
Token
Token ID198
Feature activation+5.10e-01

Pos logit contributions
 said+0.059
 commented+0.056
 saying+0.053
 noted+0.052
 comments+0.052
Neg logit contributions
-0.057
-0.043
��-0.039
Temperature-0.038
▬▬-0.037
Token
Token ID447
Feature activation+1.05e-01

Pos logit contributions
 said+0.078
 commented+0.075
 saying+0.072
 noted+0.071
 comments+0.071
Neg logit contributions
-0.045
-0.031
��-0.027
Temperature-0.027
▬▬-0.025
Token
Token ID250
Feature activation-4.11e+00

Pos logit contributions
 said+0.022
 commented+0.021
 saying+0.020
 noted+0.020
 comments+0.020
Neg logit contributions
-0.004
-0.002
��-0.000
+0.000
��+0.000
Token Alfred
Token ID22044
Feature activation+3.35e+00

Pos logit contributions
 said+0.084
 commented+0.080
 saying+0.072
 comments+0.071
 remark+0.070
Neg logit contributions
-0.139
-0.114
��-0.104
▬▬-0.100
Temperature-0.100
Token is
Token ID318
Feature activation-1.33e+00

Pos logit contributions
 said+0.392
 commented+0.374
 saying+0.350
 noted+0.345
 remark+0.343
Neg logit contributions
-0.414
-0.323
��-0.288
Temperature-0.273
-0.273
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
+0.193
+0.157
��+0.141
▬▬+0.138
Temperature+0.137
Neg logit contributions
 said-0.125
 commented-0.119
 comments-0.108
 saying-0.106
 noted-0.105
Token Wayne
Token ID13329
Feature activation+1.07e+00

Pos logit contributions
+0.180
+0.146
��+0.132
▬▬+0.126
Temperature+0.126
Neg logit contributions
 said-0.139
 commented-0.133
 saying-0.121
 comments-0.120
 remark-0.119
Token family
Token ID1641
Feature activation+3.36e+00

Pos logit contributions
 said+0.113
 commented+0.107
 saying+0.098
 comments+0.097
 noted+0.097
Neg logit contributions
-0.144
-0.117
��-0.105
-0.101
▬▬-0.101
Token but
Token ID475
Feature activation+2.09e+00

Pos logit contributions
 said+0.410
 commented+0.394
 saying+0.363
 comments+0.362
 noted+0.360
Neg logit contributions
-0.397
-0.306
��-0.275
Temperature-0.258
▬▬-0.254
Tokenler
Token ID1754
Feature activation+2.07e+00

Pos logit contributions
 said+0.197
 commented+0.186
 saying+0.169
 remark+0.167
 comments+0.167
Neg logit contributions
-0.309
-0.251
��-0.231
▬▬-0.219
Temperature-0.219
Token?
Token ID30
Feature activation-1.11e+00

Pos logit contributions
 said+0.258
 commented+0.247
 saying+0.229
 comments+0.228
 noted+0.226
Neg logit contributions
-0.241
-0.183
��-0.161
Temperature-0.152
||||-0.151
Token Has
Token ID7875
Feature activation-2.35e-01

Pos logit contributions
+0.122
+0.091
��+0.080
Temperature+0.076
▬▬+0.076
Neg logit contributions
 said-0.145
 commented-0.139
 saying-0.131
 remark-0.130
 noted-0.129
Token been
Token ID587
Feature activation+3.51e-01

Pos logit contributions
+0.037
+0.031
��+0.028
Temperature+0.027
▬▬+0.026
Neg logit contributions
 said-0.020
 commented-0.019
 saying-0.017
 remark-0.017
 noted-0.017
Token so
Token ID523
Feature activation+2.02e+00

Pos logit contributions
 said+0.035
 commented+0.033
 comments+0.030
 saying+0.030
 remark+0.030
Neg logit contributions
-0.050
-0.040
��-0.036
▬▬-0.035
Temperature-0.035
Token sometimes
Token ID3360
Feature activation+1.64e+00

Pos logit contributions
+0.136
+0.110
��+0.101
Temperature+0.098
+0.098
Neg logit contributions
 said-0.080
 commented-0.077
 comments-0.068
 noted-0.067
 saying-0.067
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
 said+0.169
 commented+0.160
 comments+0.146
 saying+0.145
 noted+0.144
Neg logit contributions
-0.221
-0.174
��-0.159
Temperature-0.154
▬▬-0.154
Token goalkeeper
Token ID26318
Feature activation-2.52e+00

Pos logit contributions
+0.191
+0.149
��+0.134
Temperature+0.131
+0.128
Neg logit contributions
 said-0.176
 commented-0.169
 saying-0.155
 noted-0.153
 remark-0.152
Token,
Token ID11
Feature activation-9.40e-01

Pos logit contributions
+0.331
+0.260
��+0.233
Temperature+0.229
の�+0.224
Neg logit contributions
 said-0.265
 commented-0.250
 comments-0.231
 saying-0.231
 remark-0.228
Token Jasper
Token ID42992
Feature activation+3.67e+00

Pos logit contributions
+0.132
+0.106
��+0.097
Temperature+0.094
の�+0.093
Neg logit contributions
 said-0.089
 commented-0.085
 saying-0.077
 noted-0.076
 comments-0.076
Token C
Token ID327
Feature activation+2.18e+00

Pos logit contributions
 said+0.333
 commented+0.320
 saying+0.293
 remark+0.286
 noted+0.283
Neg logit contributions
-0.551
-0.442
��-0.403
��-0.393
-0.386
Tokenill
Token ID359
Feature activation+1.86e+00

Pos logit contributions
 said+0.219
 commented+0.208
 saying+0.192
 noted+0.187
 remark+0.187
Neg logit contributions
-0.310
-0.250
��-0.228
Temperature-0.219
��-0.216
Tokenessen
Token ID44483
Feature activation+2.97e+00

Pos logit contributions
 said+0.202
 commented+0.196
 remark+0.184
 saying+0.182
 comments+0.180
Neg logit contributions
-0.237
-0.185
��-0.160
-0.159
Temperature-0.157
Token,
Token ID11
Feature activation-9.56e-01

Pos logit contributions
 said+0.310
 commented+0.294
 saying+0.270
 comments+0.270
 remark+0.265
Neg logit contributions
-0.399
-0.318
��-0.288
Temperature-0.278
��-0.275
Token who
Token ID508
Feature activation+3.44e+00

Pos logit contributions
+0.140
+0.114
��+0.103
Temperature+0.101
の�+0.100
Neg logit contributions
 said-0.084
 commented-0.080
 comments-0.072
 saying-0.072
 remark-0.071
Token started
Token ID2067
Feature activation+4.74e-01

Pos logit contributions
 said+0.332
 commented+0.310
 saying+0.294
 comments+0.284
 remark+0.283
Neg logit contributions
-0.490
-0.396
��-0.363
Temperature-0.350
��-0.347
Token trade
Token ID3292
Feature activation-2.38e+00

Pos logit contributions
+0.400
+0.310
��+0.280
Temperature+0.269
+0.263
Neg logit contributions
 said-0.375
 commented-0.364
 saying-0.331
 noted-0.324
 admitted-0.324
Token also
Token ID635
Feature activation+3.19e+00

Pos logit contributions
+0.327
+0.262
��+0.237
▬▬+0.231
Temperature+0.230
Neg logit contributions
 said-0.231
 commented-0.220
 saying-0.203
 commenting-0.198
 remark-0.197
Token reun
Token ID15398
Feature activation-1.50e+00

Pos logit contributions
 said+0.298
 commented+0.283
 saying+0.264
 remark+0.255
 comments+0.253
Neg logit contributions
-0.454
-0.367
��-0.332
の�-0.320
Temperature-0.319
Tokenites
Token ID2737
Feature activation-3.41e-01

Pos logit contributions
+0.175
+0.137
��+0.117
Temperature+0.116
の�+0.114
Neg logit contributions
 said-0.182
 commented-0.175
 saying-0.168
 remark-0.167
 commenting-0.163
Token Fel
Token ID13937
Feature activation+3.09e-02

Pos logit contributions
+0.050
+0.041
��+0.037
Temperature+0.036
||||+0.036
Neg logit contributions
 said-0.030
 commented-0.029
 saying-0.026
 remark-0.026
 commenting-0.026
Tokenton
Token ID1122
Feature activation+2.49e+00

Pos logit contributions
 said+0.003
 commented+0.003
 remark+0.003
 saying+0.003
 commenting+0.003
Neg logit contributions
-0.004
-0.003
��-0.003
Temperature-0.003
||||-0.003
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
 said+0.251
 commented+0.241
 saying+0.220
 comments+0.220
 remark+0.220
Neg logit contributions
-0.342
-0.273
��-0.242
-0.234
||||-0.234
Token Thomas
Token ID5658
Feature activation+5.02e+00

Pos logit contributions
 said+0.108
 commented+0.105
 saying+0.093
 remark+0.091
 comments+0.091
Neg logit contributions
-0.197
-0.161
��-0.147
||||-0.143
Temperature-0.142
Token with
Token ID351
Feature activation+7.05e-01

Pos logit contributions
 said+0.543
 commented+0.520
 remark+0.476
 saying+0.474
 comments+0.470
Neg logit contributions
-0.656
-0.518
��-0.459
-0.441
の�-0.440
Token center
Token ID3641
Feature activation-6.20e-01

Pos logit contributions
 said+0.064
 commented+0.062
 saying+0.055
 remark+0.055
 commenting+0.054
Neg logit contributions
-0.102
-0.083
��-0.076
||||-0.073
-0.073
Token Marcus
Token ID17068
Feature activation+5.71e+00

Pos logit contributions
+0.100
+0.082
��+0.076
��+0.073
Temperature+0.073
Neg logit contributions
 said-0.050
 commented-0.047
 saying-0.042
 remark-0.041
 commenting-0.041
Token public
Token ID1171
Feature activation+5.15e-01

Pos logit contributions
 said+0.042
 commented+0.040
 saying+0.037
 noted+0.036
 comments+0.036
Neg logit contributions
-0.044
-0.034
��-0.030
▬▬-0.029
Temperature-0.029
Token instead
Token ID2427
Feature activation+7.58e-01

Pos logit contributions
 said+0.059
 commented+0.056
 saying+0.052
 noted+0.052
 remark+0.051
Neg logit contributions
-0.064
-0.050
��-0.045
Temperature-0.044
-0.043
Token of
Token ID286
Feature activation-1.94e+00

Pos logit contributions
 said+0.100
 commented+0.095
 saying+0.090
 noted+0.088
 comments+0.088
Neg logit contributions
-0.084
-0.063
��-0.057
Temperature-0.053
��-0.052
Token anonymous
Token ID11614
Feature activation+4.09e+00

Pos logit contributions
+0.285
+0.230
��+0.208
▬▬+0.205
��+0.204
Neg logit contributions
 said-0.173
 commented-0.162
 noted-0.145
 comments-0.142
 remark-0.142
Token makes
Token ID1838
Feature activation-7.75e-01

Pos logit contributions
 said+0.465
 commented+0.441
 saying+0.407
 noted+0.406
 remark+0.401
Neg logit contributions
-0.515
-0.400
��-0.361
Temperature-0.347
▬▬-0.344
Token someone
Token ID2130
Feature activation+3.80e+00

Pos logit contributions
+0.109
+0.087
��+0.079
▬▬+0.077
+0.076
Neg logit contributions
 said-0.077
 commented-0.074
 saying-0.067
 noted-0.066
 admitted-0.066
Token immoral
Token ID31756
Feature activation-4.07e-01

Pos logit contributions
 said+0.378
 commented+0.355
 saying+0.323
 noted+0.321
 comments+0.318
Neg logit contributions
-0.538
-0.431
��-0.388
▬▬-0.383
-0.380
Token and
Token ID290
Feature activation+6.51e-01

Pos logit contributions
+0.049
+0.037
��+0.033
▬▬+0.031
Temperature+0.031
Neg logit contributions
 said-0.050
 commented-0.047
 saying-0.044
 noted-0.044
 remark-0.043
Token selfish
Token ID20363
Feature activation+1.68e-01

Pos logit contributions
 said+0.070
 commented+0.066
 saying+0.060
 noted+0.060
 remark+0.060
Neg logit contributions
-0.087
-0.068
��-0.063
▬▬-0.059
��-0.059
Token,
Token ID11
Feature activation-7.89e-01

Pos logit contributions
 said+0.021
 commented+0.020
 saying+0.019
 noted+0.019
 remark+0.018
Neg logit contributions
-0.020
-0.015
��-0.013
Temperature-0.012
��-0.012
Token I
Token ID314
Feature activation+6.06e-01

Pos logit contributions
+0.113
+0.091
��+0.083
Temperature+0.080
▬▬+0.080
Neg logit contributions
 said-0.075
 commented-0.071
 saying-0.064
 noted-0.064
 admitted-0.063
Token driving
Token ID5059
Feature activation-1.95e+00

Pos logit contributions
+0.036
+0.030
��+0.027
▬▬+0.026
Temperature+0.026
Neg logit contributions
 said-0.021
 commented-0.019
 comments-0.018
 saying-0.018
 remark-0.017
Token the
Token ID262
Feature activation-1.35e+00

Pos logit contributions
+0.271
+0.216
��+0.192
Temperature+0.192
▬▬+0.189
Neg logit contributions
 said-0.188
 commented-0.179
 remark-0.165
 saying-0.163
 comments-0.162
Token vehicle
Token ID4038
Feature activation-8.63e-01

Pos logit contributions
+0.167
+0.132
��+0.118
Temperature+0.112
��+0.112
Neg logit contributions
 said-0.155
 commented-0.149
 saying-0.139
 comments-0.137
 remark-0.136
Token that
Token ID326
Feature activation+1.62e+00

Pos logit contributions
+0.120
+0.096
Temperature+0.086
��+0.085
▬▬+0.084
Neg logit contributions
 said-0.082
 commented-0.078
 comments-0.072
 remark-0.072
 saying-0.071
Token had
Token ID550
Feature activation+9.67e-01

Pos logit contributions
 said+0.180
 commented+0.172
 saying+0.162
 remark+0.160
 comments+0.159
Neg logit contributions
-0.205
-0.161
��-0.143
Temperature-0.141
▬▬-0.136
Token it
Token ID340
Feature activation-3.38e-02

Pos logit contributions
 said+0.090
 commented+0.085
 saying+0.080
 remark+0.079
 commenting+0.078
Neg logit contributions
-0.139
-0.113
Temperature-0.099
��-0.099
-0.098
Token installed
Token ID6589
Feature activation-6.26e-01

Pos logit contributions
+0.005
+0.004
Temperature+0.003
の�+0.003
▬▬+0.003
Neg logit contributions
 said-0.003
 commented-0.003
 comments-0.003
 remark-0.003
 saying-0.003
Token,
Token ID11
Feature activation-7.95e-01

Pos logit contributions
+0.082
+0.065
||||+0.057
��+0.057
Temperature+0.057
Neg logit contributions
 said-0.063
 commented-0.061
 remark-0.056
 comments-0.056
 saying-0.055
Token according
Token ID1864
Feature activation+4.42e+00

Pos logit contributions
+0.125
+0.104
��+0.094
Temperature+0.093
の�+0.091
Neg logit contributions
 said-0.057
 commented-0.056
 comments-0.049
 remark-0.048
 remarks-0.047
Token to
Token ID284
Feature activation-1.20e+00

Pos logit contributions
 said+0.518
 commented+0.490
 saying+0.459
 noted+0.451
 remark+0.449
Neg logit contributions
-0.550
-0.428
��-0.385
Temperature-0.375
▬▬-0.364
Token court
Token ID2184
Feature activation+1.80e+00

Pos logit contributions
+0.173
+0.140
��+0.128
Temperature+0.123
��+0.122
Neg logit contributions
 said-0.106
 commented-0.105
 remark-0.093
 saying-0.093
say-0.091
Tokenest
Token ID395
Feature activation+1.41e-01

Pos logit contributions
+0.433
+0.338
��+0.301
Temperature+0.291
+0.284
Neg logit contributions
 said-0.414
 commented-0.394
 saying-0.368
 noted-0.362
 comments-0.361
Token link
Token ID2792
Feature activation+1.30e+00

Pos logit contributions
 said+0.016
 commented+0.015
 saying+0.014
 comments+0.014
 noted+0.014
Neg logit contributions
-0.018
-0.014
��-0.013
Temperature-0.013
-0.012
Token in
Token ID287
Feature activation+1.43e+00

Pos logit contributions
 said+0.155
 commented+0.147
 saying+0.137
 comments+0.135
 noted+0.135
Neg logit contributions
-0.159
-0.125
��-0.111
Temperature-0.108
-0.105
Token each
Token ID1123
Feature activation-8.17e-01

Pos logit contributions
 said+0.146
 commented+0.141
 saying+0.129
 noted+0.127
 remark+0.125
Neg logit contributions
-0.194
-0.158
��-0.143
Temperature-0.138
-0.137
Token direction
Token ID4571
Feature activation+6.03e-01

Pos logit contributions
+0.107
+0.085
��+0.077
Temperature+0.073
+0.073
Neg logit contributions
 said-0.089
 commented-0.084
 saying-0.079
 noted-0.077
 comments-0.077
Token).
Token ID737
Feature activation-8.78e-01

Pos logit contributions
 said+0.079
 commented+0.075
 saying+0.071
 comments+0.070
 remark+0.069
Neg logit contributions
-0.066
-0.050
��-0.044
Temperature-0.042
-0.042
Token
Token ID198
Feature activation+3.24e-01

Pos logit contributions
+0.093
+0.070
��+0.060
Temperature+0.059
+0.057
Neg logit contributions
 said-0.119
 commented-0.113
 saying-0.107
 noted-0.106
 remark-0.105
Token
Token ID198
Feature activation+3.22e-01

Pos logit contributions
 said+0.040
 commented+0.038
 saying+0.035
 noted+0.035
 comments+0.035
Neg logit contributions
-0.039
-0.030
��-0.027
Temperature-0.026
▬▬-0.025
TokenHere
Token ID4342
Feature activation-3.03e+00

Pos logit contributions
 said+0.051
 commented+0.049
 saying+0.047
 comments+0.046
 noted+0.046
Neg logit contributions
-0.028
-0.019
��-0.016
Temperature-0.015
▬▬-0.014
Token
Token ID447
Feature activation-6.46e-02

Pos logit contributions
+0.355
+0.274
��+0.241
Temperature+0.234
��+0.227
Neg logit contributions
 said-0.377
 commented-0.360
 saying-0.338
 noted-0.331
 remark-0.329
Token
Token ID247
Feature activation+1.51e-01

Pos logit contributions
+0.006
+0.004
��+0.003
��+0.003
+0.003
Neg logit contributions
 said-0.010
 commented-0.010
 saying-0.010
 noted-0.009
 comments-0.009
Tokenbrainer
Token ID49334
Feature activation+1.21e+00

Pos logit contributions
+0.325
+0.249
Temperature+0.225
��+0.224
These+0.213
Neg logit contributions
 said-0.311
 commented-0.296
 saying-0.279
 noted-0.276
 remark-0.276
Token human
Token ID1692
Feature activation-6.86e-01

Pos logit contributions
 said+0.125
 commented+0.119
 saying+0.109
 noted+0.108
 remark+0.106
Neg logit contributions
-0.163
-0.131
��-0.120
Temperature-0.115
��-0.114
Token rights
Token ID2489
Feature activation+1.37e-01

Pos logit contributions
+0.094
+0.076
��+0.069
+0.067
Temperature+0.066
Neg logit contributions
 said-0.069
 commented-0.066
 saying-0.060
 noted-0.059
 remark-0.059
Token treaties
Token ID30734
Feature activation-2.47e+00

Pos logit contributions
 said+0.015
 commented+0.014
 saying+0.013
 noted+0.013
 remark+0.013
Neg logit contributions
-0.018
-0.014
��-0.013
▬▬-0.012
��-0.012
Token,
Token ID11
Feature activation-7.71e-01

Pos logit contributions
+0.333
+0.267
��+0.240
▬▬+0.229
Temperature+0.228
Neg logit contributions
 said-0.254
 commented-0.242
 comments-0.222
 remark-0.220
 saying-0.218
Token undermined
Token ID31256
Feature activation-1.67e+00

Pos logit contributions
+0.112
+0.091
��+0.083
��+0.081
Temperature+0.080
Neg logit contributions
 said-0.069
 commented-0.066
 comments-0.060
 remark-0.060
 saying-0.060
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
+0.215
+0.169
��+0.155
��+0.147
▬▬+0.147
Neg logit contributions
 said-0.183
 commented-0.175
 saying-0.163
 remark-0.159
 noted-0.158
Token International
Token ID4037
Feature activation-1.41e+00

Pos logit contributions
+0.179
+0.143
��+0.129
▬▬+0.126
Temperature+0.125
Neg logit contributions
 said-0.142
 commented-0.135
 saying-0.124
 remark-0.121
 noted-0.121
Token Criminal
Token ID15531
Feature activation-2.31e-01

Pos logit contributions
+0.183
+0.147
��+0.134
Temperature+0.129
▬▬+0.128
Neg logit contributions
 said-0.152
 commented-0.145
 saying-0.135
 remark-0.133
 noted-0.132
Token Court
Token ID3078
Feature activation+8.13e-01

Pos logit contributions
+0.032
+0.026
��+0.023
▬▬+0.023
Temperature+0.023
Neg logit contributions
 said-0.023
 commented-0.022
 saying-0.020
 comments-0.020
 noted-0.020
Token,
Token ID11
Feature activation-7.77e-01

Pos logit contributions
 said+0.094
 commented+0.090
 saying+0.083
 noted+0.082
 comments+0.082
Neg logit contributions
-0.102
-0.079
��-0.070
��-0.068
▬▬-0.068
Token-
Token ID12
Feature activation-2.52e+00

Pos logit contributions
 said+0.201
 commented+0.195
 saying+0.185
 remark+0.183
 comments+0.182
Neg logit contributions
-0.109
-0.072
��-0.060
��-0.055
||||-0.054
Tokenwoman
Token ID8580
Feature activation+4.00e+00

Pos logit contributions
+0.329
+0.256
��+0.233
Temperature+0.226
+0.217
Neg logit contributions
 said-0.286
 commented-0.278
 saying-0.255
 comments-0.253
 remark-0.251
Token]
Token ID60
Feature activation-1.05e+00

Pos logit contributions
 said+0.509
 commented+0.491
 saying+0.458
 remark+0.454
 comments+0.452
Neg logit contributions
-0.457
-0.340
��-0.302
��-0.280
-0.278
Token and
Token ID290
Feature activation+6.75e-01

Pos logit contributions
+0.168
+0.139
��+0.127
||||+0.123
��+0.123
Neg logit contributions
 said-0.076
 commented-0.073
 remark-0.065
 saying-0.065
 comments-0.064
Token gay
Token ID5650
Feature activation-1.43e+00

Pos logit contributions
 said+0.058
 commented+0.056
 saying+0.050
 remark+0.049
 commenting+0.049
Neg logit contributions
-0.103
-0.084
��-0.077
��-0.074
-0.073
Token rights
Token ID2489
Feature activation+1.62e-01

Pos logit contributions
+0.216
+0.178
��+0.165
��+0.157
▬▬+0.155
Neg logit contributions
 said-0.127
 commented-0.120
 saying-0.109
 noted-0.106
 remark-0.106
Token activist
Token ID11276
Feature activation+2.96e+00

Pos logit contributions
 said+0.011
 commented+0.010
 saying+0.009
 noted+0.008
 remark+0.008
Neg logit contributions
-0.028
-0.023
��-0.022
��-0.021
▬▬-0.021
Token had
Token ID550
Feature activation+1.10e+00

Pos logit contributions
 said+0.265
 commented+0.256
 saying+0.235
 remark+0.234
 comments+0.231
Neg logit contributions
-0.430
-0.347
��-0.314
��-0.307
-0.303
Token received
Token ID2722
Feature activation+1.74e+00

Pos logit contributions
 said+0.094
 commented+0.090
 saying+0.083
 remark+0.083
sen+0.081
Neg logit contributions
-0.163
-0.134
��-0.119
▬▬-0.117
Temperature-0.116
Token th
Token ID294
Feature activation-2.62e+00

Pos logit contributions
 said+0.173
 commented+0.165
 saying+0.149
 noted+0.147
 remark+0.145
Neg logit contributions
-0.244
-0.194
��-0.177
▬▬-0.171
Temperature-0.170
Tokenrea
Token ID21468
Feature activation+4.42e+00

Pos logit contributions
+0.277
+0.206
��+0.177
Temperature+0.171
��+0.170
Neg logit contributions
 said-0.350
 commented-0.336
 saying-0.314
 noted-0.312
 remark-0.310
Token pulled
Token ID5954
Feature activation-4.06e-01

Pos logit contributions
 said+0.058
 commented+0.054
 saying+0.050
 remark+0.049
 noted+0.049
Neg logit contributions
-0.099
-0.080
��-0.072
▬▬-0.072
Temperature-0.071
Token a
Token ID257
Feature activation+2.65e+00

Pos logit contributions
+0.056
+0.045
��+0.041
▬▬+0.039
��+0.039
Neg logit contributions
 said-0.040
 commented-0.038
 saying-0.035
 noted-0.035
 remark-0.034
Token hood
Token ID14263
Feature activation-2.14e-01

Pos logit contributions
 said+0.298
 commented+0.285
 saying+0.262
 comments+0.259
 noted+0.258
Neg logit contributions
-0.335
-0.263
��-0.238
Temperature-0.228
▬▬-0.228
Tokeny
Token ID88
Feature activation+1.39e+00

Pos logit contributions
+0.031
+0.025
��+0.023
Temperature+0.022
||||+0.022
Neg logit contributions
 said-0.020
 commented-0.019
 comments-0.017
 noted-0.017
 admitted-0.017
Token tightly
Token ID17707
Feature activation-2.26e+00

Pos logit contributions
 said+0.151
 commented+0.144
 noted+0.133
 comments+0.132
 saying+0.131
Neg logit contributions
-0.177
-0.138
��-0.125
▬▬-0.121
Temperature-0.120
Token around
Token ID1088
Feature activation-1.32e+00

Pos logit contributions
+0.278
+0.217
��+0.190
▬▬+0.187
の�+0.185
Neg logit contributions
 said-0.253
 commented-0.241
 comments-0.224
 remark-0.223
 noted-0.223
Token her
Token ID607
Feature activation+1.71e+00

Pos logit contributions
+0.203
+0.167
��+0.153
▬▬+0.148
Temperature+0.147
Neg logit contributions
 said-0.107
 commented-0.103
 saying-0.092
 remark-0.091
 noted-0.090
Token head
Token ID1182
Feature activation+1.17e+00

Pos logit contributions
 said+0.194
 commented+0.185
 saying+0.171
 noted+0.170
 remark+0.168
Neg logit contributions
-0.209
-0.161
��-0.143
Temperature-0.140
▬▬-0.139
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
 said+0.121
 commented+0.117
 noted+0.106
 comments+0.105
 admitted+0.105
Neg logit contributions
-0.157
-0.125
��-0.111
Temperature-0.109
||||-0.107
Token reveal
Token ID7766
Feature activation+2.62e+00

Pos logit contributions
+0.139
+0.109
��+0.098
Temperature+0.095
▬▬+0.094
Neg logit contributions
 said-0.125
 commented-0.120
 saying-0.111
 noted-0.110
 cited-0.108
Token only
Token ID691
Feature activation+1.34e+00

Pos logit contributions
 said+0.207
 commented+0.198
 saying+0.177
 remark+0.174
 cited+0.172
Neg logit contributions
-0.411
-0.339
��-0.311
Temperature-0.304
▬▬-0.303
Token gallons
Token ID23239
Feature activation-1.82e+00

Pos logit contributions
+0.073
+0.052
��+0.045
Temperature+0.043
▬▬+0.041
Neg logit contributions
 said-0.107
 commented-0.103
 saying-0.098
 remark-0.097
 noted-0.097
Token of
Token ID286
Feature activation-1.98e+00

Pos logit contributions
+0.236
+0.185
��+0.163
Temperature+0.163
の�+0.160
Neg logit contributions
 said-0.194
 commented-0.186
 saying-0.171
 remark-0.171
 comments-0.170
Token water
Token ID1660
Feature activation-3.98e+00

Pos logit contributions
+0.197
+0.142
��+0.121
Temperature+0.121
の�+0.113
Neg logit contributions
 said-0.273
 commented-0.265
 saying-0.250
 remark-0.246
 comments-0.246
Token off
Token ID572
Feature activation-2.73e+00

Pos logit contributions
+0.533
+0.420
Temperature+0.377
��+0.371
の�+0.366
Neg logit contributions
 said-0.402
 commented-0.385
 remark-0.353
 saying-0.353
 comments-0.348
Token the
Token ID262
Feature activation-1.45e+00

Pos logit contributions
+0.363
+0.287
��+0.258
Temperature+0.254
の�+0.250
Neg logit contributions
 said-0.289
 commented-0.279
 saying-0.258
 remark-0.253
 comments-0.251
Token runway
Token ID23443
Feature activation-4.76e+00

Pos logit contributions
+0.171
+0.134
��+0.117
Temperature+0.115
||||+0.112
Neg logit contributions
 said-0.171
 commented-0.166
 saying-0.153
 comments-0.152
 remark-0.152
Token each
Token ID1123
Feature activation-1.03e+00

Pos logit contributions
+0.602
+0.471
Temperature+0.416
��+0.413
||||+0.403
Neg logit contributions
 said-0.523
 commented-0.504
 saying-0.459
 remark-0.459
 comments-0.458
Token year
Token ID614
Feature activation-1.72e+00

Pos logit contributions
+0.108
+0.080
��+0.069
Temperature+0.067
��+0.065
Neg logit contributions
 said-0.138
 commented-0.134
 saying-0.125
 comments-0.124
 noted-0.124
Token.
Token ID13
Feature activation+8.80e-01

Pos logit contributions
+0.225
+0.176
��+0.157
Temperature+0.156
の�+0.152
Neg logit contributions
 said-0.184
 commented-0.176
 comments-0.161
 saying-0.161
 remark-0.160
Token To
Token ID1675
Feature activation-4.29e+00

Pos logit contributions
 said+0.120
 commented+0.115
 saying+0.109
 remark+0.109
 noted+0.108
Neg logit contributions
-0.091
-0.068
Temperature-0.058
��-0.058
||||-0.055
Token give
Token ID1577
Feature activation+1.13e+00

Pos logit contributions
+0.526
+0.411
��+0.365
Temperature+0.355
��+0.346
Neg logit contributions
 said-0.510
 commented-0.487
 saying-0.453
 noted-0.445
 comments-0.444
Token center
Token ID3641
Feature activation-1.28e+00

Pos logit contributions
 said+0.305
 commented+0.293
 saying+0.265
 noted+0.261
 remark+0.260
Neg logit contributions
-0.438
-0.350
の�-0.314
Temperature-0.314
��-0.312
Token.
Token ID13
Feature activation+9.45e-01

Pos logit contributions
+0.183
+0.146
��+0.134
Temperature+0.134
の�+0.133
Neg logit contributions
 said-0.117
 commented-0.113
 saying-0.101
 remark-0.100
 comments-0.100
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
 said+0.122
 commented+0.117
 saying+0.111
 remark+0.110
 noted+0.109
Neg logit contributions
-0.104
-0.079
Temperature-0.069
��-0.068
-0.065
Token
Token ID198
Feature activation+1.33e-01

Pos logit contributions
 said+0.019
 commented+0.018
 saying+0.017
 noted+0.017
 remark+0.017
Neg logit contributions
-0.019
-0.014
��-0.013
Temperature-0.013
▬▬-0.012
Token
Token ID447
Feature activation-2.81e-01

Pos logit contributions
 said+0.019
 commented+0.018
 saying+0.017
 remark+0.017
 comments+0.017
Neg logit contributions
-0.013
-0.010
Temperature-0.009
��-0.008
▬▬-0.008
Token
Token ID250
Feature activation-4.52e+00

Pos logit contributions
+0.001
-0.006
��-0.009
Temperature-0.010
��-0.010
Neg logit contributions
 said-0.067
 commented-0.065
 saying-0.063
 noted-0.063
 comments-0.063
TokenWhen
Token ID2215
Feature activation-1.52e+00

Pos logit contributions
+0.400
+0.279
Temperature+0.240
��+0.231
▬▬+0.219
Neg logit contributions
 said-0.691
 commented-0.666
 saying-0.635
 remark-0.627
 noted-0.626
Token c
Token ID269
Feature activation-8.50e-02

Pos logit contributions
+0.203
+0.163
Temperature+0.147
��+0.146
▬▬+0.143
Neg logit contributions
 said-0.158
 commented-0.150
 saying-0.139
 remark-0.137
 comments-0.136
Tokenes
Token ID274
Feature activation+2.86e+00

Pos logit contributions
+0.014
+0.011
��+0.011
Temperature+0.011
▬▬+0.010
Neg logit contributions
 said-0.007
 commented-0.006
 saying-0.006
 noted-0.005
 comments-0.005
Tokenium
Token ID1505
Feature activation+1.08e-01

Pos logit contributions
 said+0.341
 commented+0.321
 saying+0.303
 noted+0.296
 comments+0.296
Neg logit contributions
-0.353
-0.275
��-0.248
Temperature-0.240
��-0.235
Token comes
Token ID2058
Feature activation-3.08e+00

Pos logit contributions
 said+0.014
 commented+0.013
 saying+0.012
 noted+0.012
 remark+0.012
Neg logit contributions
-0.012
-0.009
��-0.008
Temperature-0.008
��-0.008
Token in
Token ID287
Feature activation+1.15e+00

Pos logit contributions
+0.203
+0.161
��+0.143
Temperature+0.142
��+0.140
Neg logit contributions
 said-0.164
 commented-0.158
 remark-0.144
 saying-0.144
 admitted-0.142
Token addition
Token ID3090
Feature activation+7.03e-01

Pos logit contributions
 said+0.117
 commented+0.112
 saying+0.103
 remark+0.101
 admitted+0.100
Neg logit contributions
-0.157
-0.127
��-0.114
Temperature-0.113
▬▬-0.111
Token to
Token ID284
Feature activation-1.32e+00

Pos logit contributions
 said+0.080
 commented+0.075
 saying+0.070
 comments+0.069
 noted+0.069
Neg logit contributions
-0.091
-0.071
��-0.065
Temperature-0.063
��-0.062
Token plant
Token ID4618
Feature activation-7.92e-02

Pos logit contributions
+0.166
+0.129
��+0.115
Temperature+0.113
��+0.112
Neg logit contributions
 said-0.148
 commented-0.143
 saying-0.130
 remark-0.129
 comments-0.129
Tokenain
Token ID391
Feature activation+1.78e+00

Pos logit contributions
+0.012
+0.010
��+0.009
Temperature+0.009
��+0.009
Neg logit contributions
 said-0.007
 commented-0.006
 saying-0.006
 comments-0.006
 remark-0.006
Token flour
Token ID10601
Feature activation-4.31e+00

Pos logit contributions
 said+0.202
 commented+0.194
 saying+0.177
 remark+0.176
 admitted+0.175
Neg logit contributions
-0.225
-0.175
��-0.154
Temperature-0.153
��-0.152
Token.
Token ID13
Feature activation+8.91e-01

Pos logit contributions
+0.575
+0.457
��+0.410
Temperature+0.408
��+0.403
Neg logit contributions
 said-0.444
 commented-0.426
 remark-0.388
 saying-0.386
 admitted-0.384
Token If
Token ID1002
Feature activation-1.89e+00

Pos logit contributions
 said+0.123
 commented+0.118
 saying+0.112
 remark+0.111
 noted+0.110
Neg logit contributions
-0.091
-0.067
��-0.057
Temperature-0.057
▬▬-0.055
Token you
Token ID345
Feature activation-1.31e+00

Pos logit contributions
+0.225
+0.171
��+0.150
Temperature+0.149
��+0.148
Neg logit contributions
 said-0.227
 commented-0.218
 saying-0.202
 remark-0.201
 comments-0.201
Token are
Token ID389
Feature activation-2.05e+00

Pos logit contributions
+0.163
+0.127
��+0.113
Temperature+0.109
��+0.109
Neg logit contributions
 said-0.154
 commented-0.145
 saying-0.137
 comments-0.134
 noted-0.133
Token buying
Token ID7067
Feature activation-1.39e+00

Pos logit contributions
+0.286
+0.228
��+0.202
��+0.201
Temperature+0.200
Neg logit contributions
 said-0.207
 commented-0.196
 comments-0.176
 saying-0.175
 noted-0.175
Token Assad
Token ID13032
Feature activation+9.94e-01

Pos logit contributions
 said+0.037
 commented+0.035
 remark+0.032
 saying+0.032
 comments+0.032
Neg logit contributions
-0.055
-0.044
��-0.039
Temperature-0.039
��-0.038
Token and
Token ID290
Feature activation+6.28e-01

Pos logit contributions
 said+0.105
 commented+0.101
 noted+0.091
 remark+0.091
 comments+0.091
Neg logit contributions
-0.131
-0.103
��-0.091
の�-0.089
Temperature-0.088
Token for
Token ID329
Feature activation+2.72e-01

Pos logit contributions
 said+0.039
 commented+0.036
 comments+0.031
 saying+0.031
 remark+0.030
Neg logit contributions
-0.108
-0.091
��-0.084
��-0.083
-0.081
Token embracing
Token ID26005
Feature activation-2.38e+00

Pos logit contributions
 said+0.026
 commented+0.024
 noted+0.022
 admitted+0.021
 saying+0.021
Neg logit contributions
-0.039
-0.032
��-0.029
��-0.028
Temperature-0.028
Token a
Token ID257
Feature activation+2.61e+00

Pos logit contributions
+0.343
+0.279
��+0.257
��+0.246
+0.245
Neg logit contributions
 said-0.222
 commented-0.213
 saying-0.190
 remark-0.187
 noted-0.186
Token warmer
Token ID23254
Feature activation-4.60e+00

Pos logit contributions
 said+0.262
 commented+0.248
 saying+0.224
 comments+0.219
 noted+0.219
Neg logit contributions
-0.363
-0.292
��-0.268
��-0.259
▬▬-0.258
Token relationship
Token ID2776
Feature activation+2.49e+00

Pos logit contributions
+0.704
+0.580
��+0.535
Temperature+0.527
▬▬+0.517
Neg logit contributions
 said-0.393
 commented-0.367
 saying-0.328
 noted-0.322
 comments-0.318
Token with
Token ID351
Feature activation+6.21e-02

Pos logit contributions
 said+0.244
 commented+0.231
 saying+0.209
 comments+0.208
 remark+0.205
Neg logit contributions
-0.350
-0.283
��-0.259
▬▬-0.248
Temperature-0.248
Token China
Token ID2807
Feature activation-3.17e+00

Pos logit contributions
 said+0.008
 commented+0.008
 saying+0.007
 remark+0.007
 comments+0.007
Neg logit contributions
-0.007
-0.005
��-0.005
Temperature-0.004
��-0.004
Token.
Token ID13
Feature activation+9.42e-01

Pos logit contributions
+0.431
+0.345
��+0.306
Temperature+0.294
の�+0.293
Neg logit contributions
 said-0.326
 commented-0.312
 remark-0.282
 saying-0.282
 comments-0.281
Token
Token ID198
Feature activation+1.36e-01

Pos logit contributions
 said+0.120
 commented+0.116
 saying+0.109
 remark+0.109
 noted+0.108
Neg logit contributions
-0.105
-0.079
��-0.069
Temperature-0.068
These-0.066
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
+0.186
+0.148
��+0.134
Temperature+0.130
▬▬+0.128
Neg logit contributions
 said-0.142
 commented-0.136
 saying-0.124
 admitted-0.123
 noted-0.123
Token story
Token ID1621
Feature activation+3.02e+00

Pos logit contributions
+0.235
+0.192
��+0.178
Temperature+0.173
▬▬+0.172
Neg logit contributions
 said-0.131
 commented-0.123
 saying-0.109
 admitted-0.107
 noted-0.107
Token had
Token ID550
Feature activation+8.19e-01

Pos logit contributions
 said+0.279
 commented+0.261
 saying+0.240
 noted+0.235
 comments+0.234
Neg logit contributions
-0.441
-0.359
��-0.329
Temperature-0.319
-0.319
Token been
Token ID587
Feature activation+1.55e-01

Pos logit contributions
 said+0.081
 commented+0.075
 saying+0.071
 remark+0.069
 comments+0.069
Neg logit contributions
-0.114
-0.093
��-0.083
Temperature-0.081
��-0.080
Token
Token ID564
Feature activation-1.66e-02

Pos logit contributions
 said+0.016
 commented+0.015
 saying+0.014
 comments+0.014
 remark+0.013
Neg logit contributions
-0.021
-0.017
��-0.015
Temperature-0.015
▬▬-0.015
Token
Token ID250
Feature activation-4.60e+00

Pos logit contributions
+0.002
+0.001
��+0.001
+0.001
��+0.001
Neg logit contributions
 said-0.003
 commented-0.002
 saying-0.002
 noted-0.002
 comments-0.002
Tokenrecord
Token ID22105
Feature activation-4.48e-01

Pos logit contributions
+0.571
+0.442
��+0.393
▬▬+0.384
Temperature+0.382
Neg logit contributions
 said-0.542
 commented-0.512
 saying-0.482
 remark-0.475
 noted-0.474
Token-
Token ID12
Feature activation-2.76e+00

Pos logit contributions
+0.054
+0.041
��+0.037
Temperature+0.036
▬▬+0.035
Neg logit contributions
 said-0.054
 commented-0.052
 saying-0.048
 noted-0.048
 remark-0.047
Tokenbreaking
Token ID13395
Feature activation-6.12e+00

Pos logit contributions
+0.259
+0.181
Temperature+0.153
��+0.152
▬▬+0.140
Neg logit contributions
 said-0.410
 commented-0.393
 saying-0.372
 remark-0.369
 comments-0.369
Token
Token ID447
Feature activation-3.74e-01

Pos logit contributions
+0.677
+0.507
��+0.442
Temperature+0.433
▬▬+0.422
Neg logit contributions
 said-0.792
 commented-0.752
 saying-0.709
 noted-0.700
 admitted-0.697
Token
Token ID251
Feature activation-2.55e+00

Pos logit contributions
+0.026
+0.015
��+0.010
+0.009
��+0.009
Neg logit contributions
 said-0.068
 commented-0.065
 saying-0.062
 noted-0.062
 remark-0.062