TOP ACTIVATIONS
MAX = 25.353

" F ruits and Sand wic hes !" Mama agreed and
and listening . <|endoftext|> M omm a and Billy were walking
The lion was scept ical but he decided to
mustache !" M omm a replied , " Well
. M omm a taught them a special
their faces . <|endoftext|> M omm a and Ron ny were
Bear said , " M omm a Bear , it looks
it . M omm a and Tom went to
â M omm a said ,
up and said " M omm a , I need a
L ily felt very bad and didn 't know
loose tooth . M omm a told the baby ,
small yellow bird who was very proud . He loved to
The baby was very happy and the m ama
they arrived home , M omm a gave Ron ny a
yawn . M omm a smiled and said ,
. M omm a and Billy stayed by
Tom and Lily are very sorry . They say ,
Lily 's mom my was very smart and knew how to
The little boy was very glad when the doctor told

MIN ACTIVATIONS
MIN = -31.959

and Jimmy 's face sc run ched up as he ate
do it . She sc run ched up her face .
the park . Julia sc run ched up her face and
M am ma sc run ched up all of the
could , trying to out run the backwards me adow .
knew he couldn 't out run the lion , so he
but she couldn 't out run the man . He quickly
toys instead . He sc run ched his face up and
was missing . She sc run ched up her face and
hated it . He sc run ched his face up as
you ?" B run o said , " Of
never be able to out run the lion . He lay
but he couldn 't out run the enormous wolves . Stru
her candy , she sc run ched her eyes tight .
paws . So he sc run ched up his face and
and always tried to out run everyone else . One day
silly to try and out run a vehicle ! Always think
the dirt . He sc run ched up his face as
play a game called " run and catch ". Ben was
. He tried to out run the train but he was

INTERVAL 11.025 - 25.353
CONTAINS 1.289%

really hard to make the c actus like it , but
and the swings and the l adders . They ring the
was a girl who was very anxious about something . She
, but it was also very dirty . So all day
upset . She saw the jelly everywhere . She said ,

INTERVAL -3.303 - 11.025
CONTAINS 74.768%

a point y stick on the ground . She picked it
make mistakes and no one is perfect . Being careful and
were very happy . They had shared a lot and learned
a big hug and praised her for completing the toy .
day John went to visit his grand pa . Grand pa

INTERVAL -17.631 - -3.303
CONTAINS 23.939%

to melt ! At first , Peter did not
and knowledge . <|endoftext|> Once upon a time there were two
" I can 't run anymore ." Charlie
Suddenly , Tim my had an idea . He
When the pig gy pe ered inside the hole , she

INTERVAL -31.959 - -17.631
CONTAINS 0.004%

a ship that sank long ago . She said they could
ball ." Buddy and Wh isk ers stopped fighting . They
y ran over to Wh isk ers and bark ed very
ers . Mr . Wh isk ers loved to study .
outside the door . Wh isk ers wanted to catch the
Token "
Token ID366
Feature activation-1.50e+00

Pos logit contributions
 very+0.123
 so+0.104
 best+0.103
 scr+0.100
 inches+0.092
Neg logit contributions
use-0.169
 greet-0.138
 claim-0.134
 visit-0.133
|-0.128
TokenF
Token ID37
Feature activation-5.38e+00

Pos logit contributions
use+0.036
 greet+0.027
 claim+0.026
 visit+0.025
 meet+0.023
Neg logit contributions
 very-0.055
 so-0.049
 best-0.049
 scr-0.048
 dots-0.046
Tokenruits
Token ID50187
Feature activation-5.82e+00

Pos logit contributions
use+0.135
 greet+0.105
 claim+0.101
 visit+0.098
to+0.092
Neg logit contributions
 very-0.183
 so-0.161
 scr-0.160
 best-0.160
 inches-0.154
Token and
Token ID290
Feature activation+4.21e+00

Pos logit contributions
use+0.211
 visit+0.172
 greet+0.167
 meet+0.165
 claim+0.157
Neg logit contributions
 very-0.120
 scr-0.108
 best-0.104
 dots-0.099
der-0.098
Token Sand
Token ID3837
Feature activation-4.89e+00

Pos logit contributions
 very+0.090
 best+0.075
 so+0.073
 scr+0.070
 inches+0.066
Neg logit contributions
use-0.162
 greet-0.136
 visit-0.133
 claim-0.131
 meet-0.128
Tokenwic
Token ID22664
Feature activation+2.54e+01

Pos logit contributions
use+0.176
 greet+0.143
 visit+0.139
 claim+0.137
to+0.133
Neg logit contributions
 very-0.118
 so-0.099
 best-0.098
 scr-0.096
 inches-0.090
Tokenhes
Token ID956
Feature activation-5.90e+00

Pos logit contributions
 very+0.683
 best+0.615
 scr+0.612
 ears+0.599
 so+0.598
Neg logit contributions
use-0.769
 greet-0.609
to-0.579
|-0.575
war-0.573
Token!"
Token ID2474
Feature activation-3.38e+00

Pos logit contributions
use+0.205
 greet+0.168
 visit+0.162
 claim+0.161
 meet+0.155
Neg logit contributions
 very-0.147
 best-0.125
 so-0.124
 scr-0.123
 inches-0.114
Token Mama
Token ID40084
Feature activation-1.47e+00

Pos logit contributions
use+0.100
 greet+0.080
 visit+0.078
 claim+0.075
 meet+0.073
Neg logit contributions
 very-0.102
 best-0.090
 so-0.088
 scr-0.087
 inches-0.085
Token agreed
Token ID4987
Feature activation+2.40e+00

Pos logit contributions
use+0.051
 greet+0.042
 claim+0.041
 visit+0.041
 meet+0.039
Neg logit contributions
 very-0.037
 best-0.031
 so-0.031
 scr-0.030
 inches-0.028
Token and
Token ID290
Feature activation+4.35e+00

Pos logit contributions
 very+0.045
 best+0.041
 scr+0.038
 blades+0.038
 dots+0.038
Neg logit contributions
use-0.090
 greet-0.078
 visit-0.078
 claim-0.075
to-0.073
Token and
Token ID290
Feature activation+1.10e+01

Pos logit contributions
 very+0.294
 so+0.263
 best+0.229
 scr+0.217
 most+0.200
Neg logit contributions
use-0.422
 greet-0.338
talk-0.336
|-0.335
war-0.335
Token listening
Token ID8680
Feature activation+5.04e+00

Pos logit contributions
 very+0.292
 so+0.255
 best+0.236
 scr+0.218
 most+0.206
Neg logit contributions
use-0.407
 claim-0.328
 greet-0.325
|-0.319
war-0.318
Token.
Token ID13
Feature activation+4.53e+00

Pos logit contributions
 very+0.141
 so+0.123
 best+0.116
 scr+0.112
der+0.106
Neg logit contributions
use-0.175
 claim-0.140
 greet-0.139
 visit-0.133
|-0.133
Token<|endoftext|>
Token ID50256
Feature activation+5.38e-01

Pos logit contributions
 very+0.117
 so+0.109
 scr+0.090
 best+0.090
 because+0.089
Neg logit contributions
use-0.167
 greet-0.130
 claim-0.127
war-0.124
away-0.124
TokenM
Token ID44
Feature activation+3.35e+00

Pos logit contributions
 very+0.016
 so+0.013
 best+0.013
 air+0.011
 too+0.011
Neg logit contributions
use-0.019
 greet-0.015
 claim-0.015
shadow-0.014
talk-0.014
Tokenomm
Token ID2002
Feature activation+2.17e+01

Pos logit contributions
 very+0.090
 best+0.077
 so+0.076
 scr+0.072
 inches+0.067
Neg logit contributions
use-0.116
 greet-0.097
 claim-0.095
war-0.091
|-0.089
Tokena
Token ID64
Feature activation+3.08e+00

Pos logit contributions
 very+0.637
 best+0.549
 so+0.547
 scr+0.534
 inches+0.504
Neg logit contributions
use-0.678
 greet-0.546
 claim-0.528
 visit-0.525
 meet-0.497
Token and
Token ID290
Feature activation+7.28e+00

Pos logit contributions
 very+0.039
 so+0.028
 best+0.022
 scr+0.020
 ears+0.016
Neg logit contributions
use-0.147
 greet-0.132
 claim-0.130
|-0.127
war-0.126
Token Billy
Token ID15890
Feature activation-5.59e+00

Pos logit contributions
 very+0.212
 so+0.189
 best+0.163
 scr+0.155
 because+0.147
Neg logit contributions
use-0.234
war-0.201
 greet-0.191
 claim-0.191
|-0.191
Token were
Token ID547
Feature activation+7.39e+00

Pos logit contributions
use+0.188
war+0.167
 greet+0.165
etch+0.161
to+0.161
Neg logit contributions
 very-0.137
 so-0.115
 best-0.103
 scr-0.101
 ears-0.090
Token walking
Token ID6155
Feature activation-4.09e+00

Pos logit contributions
 very+0.181
 so+0.152
 best+0.149
 scr+0.126
 inches+0.120
Neg logit contributions
use-0.292
 greet-0.242
 claim-0.239
war-0.232
 visit-0.229
Token
Token ID198
Feature activation-5.39e+00

Pos logit contributions
 very+0.099
 so+0.089
 best+0.085
der+0.066
 most+0.064
Neg logit contributions
use-0.166
 greet-0.128
 claim-0.120
away-0.117
 trace-0.115
Token
Token ID198
Feature activation-5.36e+00

Pos logit contributions
use+0.221
 claim+0.159
etch+0.158
 greet+0.157
 trace+0.153
Neg logit contributions
 very-0.137
 so-0.114
 best-0.086
 a-0.081
 and-0.080
TokenThe
Token ID464
Feature activation-1.88e+00

Pos logit contributions
use+0.194
 greet+0.149
etch+0.145
|+0.145
 claim+0.143
Neg logit contributions
 very-0.142
 so-0.122
 inches-0.108
 best-0.108
 scr-0.108
Token lion
Token ID18744
Feature activation-1.89e+00

Pos logit contributions
use+0.065
 greet+0.053
 visit+0.051
 claim+0.051
 meet+0.049
Neg logit contributions
 very-0.049
 so-0.042
 best-0.041
 scr-0.041
 inches-0.038
Token was
Token ID373
Feature activation+7.11e+00

Pos logit contributions
use+0.069
 greet+0.057
 claim+0.056
 visit+0.055
 meet+0.053
Neg logit contributions
 very-0.045
 so-0.038
 best-0.037
 scr-0.036
 inches-0.034
Token scept
Token ID30186
Feature activation+2.14e+01

Pos logit contributions
 very+0.086
 best+0.064
 scr+0.062
 so+0.058
 dots+0.057
Neg logit contributions
use-0.326
 greet-0.291
 claim-0.280
 visit-0.280
 meet-0.274
Tokenical
Token ID605
Feature activation-2.48e+00

Pos logit contributions
 very+0.587
 so+0.503
 best+0.499
 scr+0.486
der+0.455
Neg logit contributions
use-0.707
 greet-0.582
 visit-0.560
 claim-0.559
 meet-0.532
Token but
Token ID475
Feature activation-6.63e-01

Pos logit contributions
use+0.099
 greet+0.084
 claim+0.082
 visit+0.082
 meet+0.078
Neg logit contributions
 very-0.046
 scr-0.038
 best-0.038
 so-0.035
 dots-0.034
Token he
Token ID339
Feature activation+2.74e-01

Pos logit contributions
use+0.022
 greet+0.018
 claim+0.017
 visit+0.017
 meet+0.017
Neg logit contributions
 very-0.018
 best-0.015
 so-0.015
 scr-0.015
 inches-0.014
Token decided
Token ID3066
Feature activation+5.50e+00

Pos logit contributions
 very+0.007
 best+0.006
 so+0.005
 scr+0.005
 inches+0.005
Neg logit contributions
use-0.010
 greet-0.008
 claim-0.008
 visit-0.008
 meet-0.008
Token to
Token ID284
Feature activation-8.11e+00

Pos logit contributions
 very+0.171
 so+0.156
 best+0.136
 because+0.123
 too+0.122
Neg logit contributions
use-0.190
|-0.164
 greet-0.145
etch-0.143
war-0.141
Token mustache
Token ID49303
Feature activation+7.28e+00

Pos logit contributions
 very+0.158
 best+0.132
 so+0.128
 scr+0.123
 inches+0.109
Neg logit contributions
use-0.227
 greet-0.189
|-0.185
 claim-0.183
etch-0.177
Token!"
Token ID2474
Feature activation-1.20e+00

Pos logit contributions
 very+0.219
 so+0.191
 best+0.191
 scr+0.186
 inches+0.176
Neg logit contributions
use-0.220
 greet-0.174
 claim-0.168
 visit-0.167
 meet-0.158
Token
Token ID198
Feature activation-4.68e+00

Pos logit contributions
use+0.032
 greet+0.024
 claim+0.023
 visit+0.023
 meet+0.021
Neg logit contributions
 very-0.041
 so-0.036
 best-0.036
 scr-0.035
 dots-0.034
Token
Token ID198
Feature activation-4.62e+00

Pos logit contributions
use+0.199
 claim+0.165
 greet+0.152
race+0.142
talk+0.141
Neg logit contributions
 very-0.124
 so-0.101
 best-0.079
 really-0.075
 too-0.073
TokenM
Token ID44
Feature activation+2.15e+00

Pos logit contributions
use+0.124
 claim+0.090
 greet+0.087
 visit+0.078
|+0.075
Neg logit contributions
 very-0.172
 so-0.150
 scr-0.144
 best-0.141
 dots-0.135
Tokenomm
Token ID2002
Feature activation+2.12e+01

Pos logit contributions
 very+0.055
 so+0.047
 best+0.046
 scr+0.045
 dots+0.042
Neg logit contributions
use-0.075
 greet-0.061
 visit-0.059
 claim-0.059
to-0.057
Tokena
Token ID64
Feature activation+2.18e+00

Pos logit contributions
 very+0.528
 best+0.446
 so+0.444
 scr+0.430
 inches+0.401
Neg logit contributions
use-0.758
 greet-0.624
 claim-0.606
 visit-0.603
 meet-0.577
Token replied
Token ID8712
Feature activation-1.23e+00

Pos logit contributions
 very+0.053
 best+0.045
 so+0.045
 scr+0.043
 inches+0.040
Neg logit contributions
use-0.078
 greet-0.065
 claim-0.063
 visit-0.062
 meet-0.060
Token,
Token ID11
Feature activation+7.47e+00

Pos logit contributions
use+0.041
to+0.036
 To+0.035
 greet+0.035
 visit+0.035
Neg logit contributions
 very-0.025
 best-0.024
 sizes-0.024
 dots-0.023
 inches-0.023
Token "
Token ID366
Feature activation+1.15e+00

Pos logit contributions
 very+0.204
 best+0.184
 so+0.175
 scr+0.175
 inches+0.172
Neg logit contributions
use-0.233
 greet-0.189
 visit-0.184
 claim-0.181
to-0.175
TokenWell
Token ID5779
Feature activation-2.07e+00

Pos logit contributions
 very+0.037
 so+0.033
 scr+0.031
 best+0.031
 dots+0.031
Neg logit contributions
use-0.034
 claim-0.026
 greet-0.026
 visit-0.025
|-0.024
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
 very+0.001
 so+0.001
 best+0.001
 scr+0.001
 inches+0.001
Neg logit contributions
use-0.003
 greet-0.002
 claim-0.002
 visit-0.002
to-0.002
Token
Token ID220
Feature activation+4.68e+00

Pos logit contributions
 very+0.055
 so+0.049
 scr+0.044
 best+0.044
der+0.041
Neg logit contributions
use-0.077
 claim-0.060
 greet-0.060
war-0.058
etch-0.057
Token
Token ID198
Feature activation-4.75e+00

Pos logit contributions
 very+0.123
 so+0.110
 best+0.105
 most+0.089
 scr+0.088
Neg logit contributions
use-0.178
away-0.145
 greet-0.138
 claim-0.132
 trace-0.131
Token
Token ID198
Feature activation-4.69e+00

Pos logit contributions
use+0.157
 claim+0.119
 greet+0.108
talk+0.105
 confront+0.101
Neg logit contributions
 very-0.160
 so-0.135
 best-0.115
 scr-0.115
 incredibly-0.111
TokenM
Token ID44
Feature activation+2.02e+00

Pos logit contributions
use+0.132
 greet+0.100
 claim+0.098
 visit+0.094
 meet+0.089
Neg logit contributions
 very-0.157
 so-0.138
 best-0.135
 scr-0.134
 dots-0.126
Tokenomm
Token ID2002
Feature activation+2.10e+01

Pos logit contributions
 very+0.040
 so+0.035
 blades+0.032
 scr+0.032
 best+0.032
Neg logit contributions
use-0.080
 visit-0.064
 greet-0.064
 claim-0.063
to-0.062
Tokena
Token ID64
Feature activation+1.99e+00

Pos logit contributions
 very+0.548
 best+0.473
 so+0.473
 scr+0.466
 dots+0.442
Neg logit contributions
use-0.700
 greet-0.568
 claim-0.548
 visit-0.545
to-0.528
Token taught
Token ID7817
Feature activation+1.69e+00

Pos logit contributions
 very+0.046
 best+0.040
 scr+0.039
 so+0.038
 dots+0.037
Neg logit contributions
use-0.070
 greet-0.058
 visit-0.057
 claim-0.055
 meet-0.055
Token them
Token ID606
Feature activation-9.44e-01

Pos logit contributions
 very+0.053
 best+0.047
 scr+0.046
 so+0.045
 inches+0.044
Neg logit contributions
use-0.048
 greet-0.038
 visit-0.037
 claim-0.036
 meet-0.034
Token a
Token ID257
Feature activation+6.60e+00

Pos logit contributions
use+0.026
 greet+0.024
 visit+0.023
 meet+0.022
 join+0.022
Neg logit contributions
 very-0.024
 scr-0.022
 inches-0.021
 best-0.021
 dots-0.021
Token special
Token ID2041
Feature activation-6.85e-01

Pos logit contributions
 very+0.155
 scr+0.139
 best+0.138
 inches+0.137
 dots+0.137
Neg logit contributions
use-0.216
 visit-0.185
 greet-0.184
 claim-0.174
to-0.173
Token their
Token ID511
Feature activation+6.39e+00

Pos logit contributions
use+0.284
 greet+0.222
 claim+0.219
 visit+0.212
|+0.212
Neg logit contributions
 very-0.223
 so-0.191
 best-0.186
 scr-0.180
 ears-0.166
Token faces
Token ID6698
Feature activation+1.95e+00

Pos logit contributions
 very+0.135
 best+0.109
 so+0.107
 scr+0.103
 skin+0.095
Neg logit contributions
use-0.259
 claim-0.213
 greet-0.213
to-0.205
 visit-0.204
Token.
Token ID13
Feature activation+2.45e+00

Pos logit contributions
 very+0.047
 so+0.039
 best+0.039
 scr+0.037
 inches+0.035
Neg logit contributions
use-0.072
 greet-0.059
 claim-0.058
 visit-0.057
|-0.056
Token<|endoftext|>
Token ID50256
Feature activation-7.60e-01

Pos logit contributions
 very+0.079
 so+0.071
 best+0.066
 because+0.061
 scr+0.061
Neg logit contributions
use-0.078
 greet-0.057
 claim-0.055
talk-0.054
 oblig-0.053
TokenM
Token ID44
Feature activation+2.16e+00

Pos logit contributions
use+0.027
 greet+0.023
 claim+0.022
 confront+0.021
 oblig+0.021
Neg logit contributions
 very-0.021
 so-0.018
 best-0.017
 air-0.015
 too-0.015
Tokenomm
Token ID2002
Feature activation+2.10e+01

Pos logit contributions
 very+0.072
 best+0.064
 so+0.064
 scr+0.061
 inches+0.056
Neg logit contributions
use-0.060
 greet-0.048
 claim-0.046
war-0.043
to-0.042
Tokena
Token ID64
Feature activation+1.96e+00

Pos logit contributions
 very+0.632
 best+0.552
 so+0.551
 scr+0.539
 inches+0.510
Neg logit contributions
use-0.634
 greet-0.501
 claim-0.483
 visit-0.480
 meet-0.454
Token and
Token ID290
Feature activation+6.31e+00

Pos logit contributions
 very+0.030
 so+0.023
 best+0.020
 scr+0.018
 dots+0.015
Neg logit contributions
use-0.090
 greet-0.078
 claim-0.077
war-0.075
|-0.074
Token Ron
Token ID6575
Feature activation-3.27e+00

Pos logit contributions
 very+0.191
 so+0.169
 best+0.164
 scr+0.160
 dots+0.149
Neg logit contributions
use-0.194
 greet-0.152
 claim-0.147
 visit-0.143
|-0.141
Tokenny
Token ID3281
Feature activation+3.80e+00

Pos logit contributions
use+0.116
 greet+0.098
 claim+0.097
to+0.095
war+0.094
Neg logit contributions
 very-0.078
 so-0.064
 best-0.061
 scr-0.059
 extremely-0.053
Token were
Token ID547
Feature activation+7.58e+00

Pos logit contributions
 very+0.081
 so+0.067
 best+0.066
 scr+0.061
 skin+0.054
Neg logit contributions
use-0.149
 greet-0.125
 claim-0.123
war-0.121
to-0.119
Token Bear
Token ID14732
Feature activation+4.34e+00

Pos logit contributions
 very+0.050
 so+0.043
 best+0.042
 scr+0.036
 ears+0.033
Neg logit contributions
use-0.082
 greet-0.069
 claim-0.068
|-0.066
war-0.064
Token said
Token ID531
Feature activation+2.65e-01

Pos logit contributions
 very+0.104
 so+0.088
 best+0.083
 scr+0.078
 inches+0.070
Neg logit contributions
use-0.162
 claim-0.135
 greet-0.134
|-0.128
war-0.128
Token,
Token ID11
Feature activation+8.72e+00

Pos logit contributions
 very+0.006
 so+0.005
 best+0.005
 scr+0.004
 inches+0.004
Neg logit contributions
use-0.011
 greet-0.009
 claim-0.009
 visit-0.009
|-0.009
Token "
Token ID366
Feature activation+1.81e+00

Pos logit contributions
 very+0.210
 so+0.183
 scr+0.160
 best+0.155
 because+0.138
Neg logit contributions
use-0.347
 claim-0.279
|-0.276
 greet-0.274
talk-0.265
TokenM
Token ID44
Feature activation+3.24e+00

Pos logit contributions
 very+0.045
 so+0.040
 dots+0.040
 scr+0.040
 most+0.040
Neg logit contributions
use-0.054
 claim-0.047
 greet-0.045
etch-0.044
 confront-0.044
Tokenomm
Token ID2002
Feature activation+2.10e+01

Pos logit contributions
 very+0.073
 best+0.061
 so+0.060
 scr+0.058
 inches+0.054
Neg logit contributions
use-0.123
 greet-0.103
 claim-0.100
 visit-0.099
 meet-0.096
Tokena
Token ID64
Feature activation+2.35e+00

Pos logit contributions
 very+0.564
 best+0.485
 so+0.479
 scr+0.465
 inches+0.439
Neg logit contributions
use-0.705
 greet-0.580
 claim-0.563
 visit-0.557
 meet-0.532
Token Bear
Token ID14732
Feature activation+3.20e+00

Pos logit contributions
 very+0.044
 best+0.035
 so+0.035
 scr+0.033
 inches+0.030
Neg logit contributions
use-0.097
 greet-0.083
 claim-0.081
 visit-0.080
 meet-0.077
Token,
Token ID11
Feature activation+7.70e+00

Pos logit contributions
 very+0.069
 best+0.057
 so+0.057
 scr+0.055
 inches+0.050
Neg logit contributions
use-0.123
 greet-0.103
 claim-0.101
 visit-0.100
to-0.096
Token it
Token ID340
Feature activation+4.26e+00

Pos logit contributions
 very+0.225
 so+0.201
 best+0.193
 scr+0.188
 inches+0.174
Neg logit contributions
use-0.243
 greet-0.194
 claim-0.193
|-0.184
 visit-0.182
Token looks
Token ID3073
Feature activation+1.11e+01

Pos logit contributions
 very+0.105
 best+0.089
 so+0.088
 scr+0.087
 dots+0.083
Neg logit contributions
use-0.149
 greet-0.121
 visit-0.118
 claim-0.115
 meet-0.113
Token it
Token ID340
Feature activation+3.15e+00

Pos logit contributions
use+0.299
 greet+0.229
 claim+0.220
to+0.219
|+0.219
Neg logit contributions
 very-0.226
 so-0.196
 scr-0.178
 best-0.177
 dots-0.177
Token.
Token ID13
Feature activation+2.15e+00

Pos logit contributions
 very+0.070
 so+0.059
 best+0.054
 scr+0.049
 inches+0.045
Neg logit contributions
use-0.128
 greet-0.107
|-0.104
 claim-0.104
war-0.101
Token
Token ID198
Feature activation-4.99e+00

Pos logit contributions
 very+0.061
 so+0.057
 scr+0.051
 best+0.050
 dots+0.048
Neg logit contributions
use-0.077
 greet-0.057
 claim-0.056
away-0.055
 visit-0.053
Token
Token ID198
Feature activation-4.94e+00

Pos logit contributions
use+0.216
 claim+0.169
 greet+0.160
 confront+0.154
 trace+0.153
Neg logit contributions
 very-0.130
 so-0.114
 best-0.084
 really-0.083
 and-0.077
TokenM
Token ID44
Feature activation+1.75e+00

Pos logit contributions
use+0.149
 greet+0.113
 claim+0.109
 visit+0.105
|+0.100
Neg logit contributions
 very-0.163
 so-0.146
 scr-0.142
 best-0.137
 dots-0.134
Tokenomm
Token ID2002
Feature activation+2.10e+01

Pos logit contributions
 very+0.032
 so+0.027
 scr+0.026
 best+0.025
 blades+0.025
Neg logit contributions
use-0.069
 visit-0.057
 greet-0.057
 claim-0.057
to-0.056
Tokena
Token ID64
Feature activation+2.03e+00

Pos logit contributions
 very+0.525
 best+0.444
 so+0.443
 scr+0.431
 inches+0.402
Neg logit contributions
use-0.741
 greet-0.608
 claim-0.591
 visit-0.587
 meet-0.562
Token and
Token ID290
Feature activation+6.18e+00

Pos logit contributions
 very+0.050
 so+0.042
 best+0.041
 scr+0.040
 dots+0.037
Neg logit contributions
use-0.074
 greet-0.061
 claim-0.060
 visit-0.059
|-0.057
Token Tom
Token ID4186
Feature activation-5.15e+00

Pos logit contributions
 very+0.229
 best+0.206
 so+0.204
 scr+0.203
 inches+0.195
Neg logit contributions
use-0.141
 greet-0.103
 claim-0.098
 visit-0.097
 meet-0.089
Token went
Token ID1816
Feature activation+7.58e-01

Pos logit contributions
use+0.176
 greet+0.144
 claim+0.139
 visit+0.139
 meet+0.133
Neg logit contributions
 very-0.133
 best-0.115
 so-0.113
 scr-0.111
 inches-0.105
Token to
Token ID284
Feature activation-7.24e+00

Pos logit contributions
 very+0.024
 so+0.020
 best+0.018
 too+0.016
 because+0.015
Neg logit contributions
use-0.026
 claim-0.020
|-0.019
war-0.019
Ell-0.019
Tokenâ
Token ID22940
Feature activation+4.41e-01

Pos logit contributions
 very+0.031
 so+0.029
 best+0.025
 scr+0.024
 most+0.023
Neg logit contributions
use-0.042
 greet-0.032
 claim-0.031
talk-0.031
away-0.030
Token
Token ID26391
Feature activation+6.69e+00

Pos logit contributions
 very+0.014
 so+0.012
 best+0.012
 scr+0.012
 inches+0.011
Neg logit contributions
use-0.012
 greet-0.010
 claim-0.009
 visit-0.009
to-0.009
Token
Token ID198
Feature activation-4.54e+00

Pos logit contributions
 very+0.225
 so+0.200
 best+0.198
 scr+0.192
 inches+0.183
Neg logit contributions
use-0.181
 greet-0.140
 claim-0.133
 visit-0.128
|-0.121
Token
Token ID198
Feature activation-4.45e+00

Pos logit contributions
use+0.171
 claim+0.125
talk+0.118
 greet+0.116
 visit+0.114
Neg logit contributions
 very-0.141
 so-0.120
 best-0.101
 scr-0.096
 incredibly-0.092
TokenM
Token ID44
Feature activation+2.14e+00

Pos logit contributions
use+0.144
 greet+0.111
 claim+0.110
 visit+0.106
talk+0.100
Neg logit contributions
 very-0.135
 so-0.119
 best-0.114
 scr-0.113
 dots-0.103
Tokenomm
Token ID2002
Feature activation+2.09e+01

Pos logit contributions
 very+0.046
 so+0.038
 best+0.037
 scr+0.036
 blades+0.033
Neg logit contributions
use-0.083
 greet-0.068
 visit-0.068
 claim-0.067
to-0.065
Tokena
Token ID64
Feature activation+2.05e+00

Pos logit contributions
 very+0.518
 best+0.436
 so+0.435
 scr+0.420
 inches+0.393
Neg logit contributions
use-0.750
 greet-0.618
 claim-0.601
 visit-0.598
 meet-0.571
Token said
Token ID531
Feature activation+5.12e-01

Pos logit contributions
 very+0.048
 so+0.040
 best+0.038
 scr+0.036
 inches+0.034
Neg logit contributions
use-0.078
 greet-0.064
 claim-0.064
 visit-0.062
|-0.061
Token,
Token ID11
Feature activation+7.73e+00

Pos logit contributions
 very+0.007
 so+0.005
 best+0.005
 scr+0.003
 skin+0.003
Neg logit contributions
use-0.025
 greet-0.022
 claim-0.022
|-0.021
 visit-0.021
Token
Token ID6184
Feature activation-1.08e+00

Pos logit contributions
 very+0.162
 so+0.134
 scr+0.106
 best+0.103
der+0.096
Neg logit contributions
use-0.335
|-0.283
 claim-0.276
 greet-0.274
talk-0.272
Token
Token ID95
Feature activation+1.19e+00

Pos logit contributions
use+0.029
 greet+0.022
 claim+0.021
 visit+0.021
 meet+0.020
Neg logit contributions
 very-0.036
 so-0.032
 best-0.032
 scr-0.031
 inches-0.030
Token up
Token ID510
Feature activation-2.48e+00

Pos logit contributions
 very+0.229
 scr+0.219
 dots+0.218
 best+0.212
 inches+0.212
Neg logit contributions
use-0.398
 greet-0.330
 claim-0.323
 visit-0.322
to-0.302
Token and
Token ID290
Feature activation+5.63e+00

Pos logit contributions
use+0.089
 greet+0.073
 claim+0.071
 visit+0.070
 meet+0.067
Neg logit contributions
 very-0.061
 best-0.052
 so-0.052
 scr-0.050
 inches-0.047
Token said
Token ID531
Feature activation-2.04e-01

Pos logit contributions
 very+0.171
 so+0.151
 scr+0.128
 best+0.122
 most+0.119
Neg logit contributions
use-0.189
etch-0.164
|-0.161
war-0.156
 greet-0.151
Token "
Token ID366
Feature activation+3.12e-01

Pos logit contributions
use+0.008
 greet+0.007
 visit+0.007
 claim+0.007
to+0.006
Neg logit contributions
 very-0.004
 best-0.003
 scr-0.003
 so-0.003
 inches-0.003
TokenM
Token ID44
Feature activation+1.65e+00

Pos logit contributions
 very+0.009
 so+0.008
 dots+0.008
 scr+0.008
 most+0.008
Neg logit contributions
use-0.009
 greet-0.008
 claim-0.007
etch-0.007
irs-0.007
Tokenomm
Token ID2002
Feature activation+2.08e+01

Pos logit contributions
 very+0.044
 so+0.037
 best+0.036
 scr+0.036
 dots+0.034
Neg logit contributions
use-0.057
 greet-0.045
 visit-0.044
 claim-0.044
to-0.043
Tokena
Token ID64
Feature activation+1.72e+00

Pos logit contributions
 very+0.548
 best+0.469
 so+0.464
 scr+0.446
 inches+0.421
Neg logit contributions
use-0.712
 greet-0.588
 claim-0.568
 visit-0.563
|-0.538
Token,
Token ID11
Feature activation+6.91e+00

Pos logit contributions
 very+0.033
 best+0.026
 so+0.026
 scr+0.026
 dots+0.023
Neg logit contributions
use-0.070
 greet-0.059
 visit-0.058
 claim-0.057
 meet-0.056
Token I
Token ID314
Feature activation-3.74e+00

Pos logit contributions
 very+0.196
 so+0.174
 best+0.167
 scr+0.158
 inches+0.148
Neg logit contributions
use-0.231
 greet-0.184
 claim-0.178
|-0.176
 visit-0.172
Token need
Token ID761
Feature activation-5.77e-01

Pos logit contributions
use+0.110
 greet+0.090
 claim+0.085
war+0.082
 visit+0.082
Neg logit contributions
 very-0.115
 so-0.099
 best-0.099
 scr-0.095
 inches-0.088
Token a
Token ID257
Feature activation+6.28e+00

Pos logit contributions
use+0.020
|+0.018
 greet+0.017
war+0.017
 claim+0.016
Neg logit contributions
 very-0.015
 so-0.013
 best-0.012
 scr-0.011
 inches-0.011
Token
Token ID198
Feature activation+7.90e-03

Pos logit contributions
use+0.147
enn+0.115
to+0.113
war+0.112
etch+0.112
Neg logit contributions
 very-0.086
 so-0.073
 best-0.063
 scr-0.062
 skin-0.061
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
 very+0.000
 so+0.000
 best+0.000
 incredibly+0.000
 much+0.000
Neg logit contributions
use-0.000
 greet-0.000
 claim-0.000
 awaited-0.000
war-0.000
TokenL
Token ID43
Feature activation+3.41e+00

Pos logit contributions
 very+0.033
 so+0.028
 incredibly+0.024
 best+0.024
 scr+0.024
Neg logit contributions
use-0.043
 greet-0.033
 claim-0.032
 visit-0.029
 gra-0.029
Tokenily
Token ID813
Feature activation-2.29e-01

Pos logit contributions
 very+0.101
 best+0.088
 so+0.088
 scr+0.086
 inches+0.082
Neg logit contributions
use-0.105
 greet-0.084
 claim-0.081
 visit-0.080
 meet-0.076
Token felt
Token ID2936
Feature activation+1.41e+01

Pos logit contributions
use+0.008
 greet+0.007
 visit+0.007
 claim+0.007
 meet+0.007
Neg logit contributions
 very-0.005
 best-0.004
 so-0.004
 scr-0.004
 inches-0.004
Token very
Token ID845
Feature activation+2.04e+01

Pos logit contributions
 dots+0.265
 sizes+0.258
 inches+0.250
 blades+0.241
 best+0.225
Neg logit contributions
use-0.486
 greet-0.436
 claim-0.416
 meet-0.416
 visit-0.413
Token bad
Token ID2089
Feature activation+4.08e+00

Pos logit contributions
 very+0.330
 dots+0.300
 sizes+0.285
 best+0.282
 scr+0.277
Neg logit contributions
use-0.812
 greet-0.690
 visit-0.683
 claim-0.675
 meet-0.662
Token and
Token ID290
Feature activation+8.82e+00

Pos logit contributions
 scr+0.101
 very+0.099
 blades+0.096
 best+0.092
 inches+0.091
Neg logit contributions
use-0.119
 visit-0.098
to-0.094
 greet-0.092
 To-0.092
Token didn
Token ID1422
Feature activation+8.31e+00

Pos logit contributions
 very+0.168
 dots+0.155
 best+0.155
 inches+0.150
 scr+0.145
Neg logit contributions
use-0.329
 greet-0.275
 visit-0.271
 claim-0.268
 meet-0.261
Token't
Token ID470
Feature activation+8.18e-01

Pos logit contributions
 very+0.311
 so+0.278
 best+0.274
 scr+0.267
 inches+0.252
Neg logit contributions
use-0.203
 greet-0.148
 claim-0.139
 visit-0.134
 meet-0.124
Token know
Token ID760
Feature activation-6.93e-01

Pos logit contributions
 very+0.024
 so+0.020
 best+0.020
 scr+0.020
 dots+0.018
Neg logit contributions
use-0.026
 greet-0.021
 claim-0.020
 visit-0.020
to-0.020
Token loose
Token ID9155
Feature activation-3.13e+00

Pos logit contributions
 very+0.150
der+0.147
 inches+0.141
 dots+0.140
 so+0.139
Neg logit contributions
use-0.281
 visit-0.246
 greet-0.235
 claim-0.228
 meet-0.219
Token tooth
Token ID16162
Feature activation+9.80e+00

Pos logit contributions
use+0.105
 visit+0.087
 claim+0.084
 greet+0.083
 aid+0.082
Neg logit contributions
 very-0.071
 scr-0.066
 best-0.066
 sizes-0.065
 so-0.063
Token.
Token ID13
Feature activation+2.58e+00

Pos logit contributions
 very+0.235
 best+0.202
 scr+0.200
 so+0.196
 inches+0.186
Neg logit contributions
use-0.343
 greet-0.279
 claim-0.274
 visit-0.273
to-0.258
Token
Token ID198
Feature activation-4.26e+00

Pos logit contributions
 very+0.064
 so+0.058
 scr+0.053
 eyes+0.049
 best+0.049
Neg logit contributions
use-0.099
 greet-0.077
 claim-0.074
war-0.074
talk-0.073
TokenM
Token ID44
Feature activation+2.00e+00

Pos logit contributions
use+0.160
 claim+0.122
 greet+0.121
 fulfill+0.109
talk+0.109
Neg logit contributions
 very-0.122
 so-0.092
 best-0.082
 much-0.081
 really-0.080
Tokenomm
Token ID2002
Feature activation+2.04e+01

Pos logit contributions
 very+0.049
 so+0.042
 best+0.041
 scr+0.040
 dots+0.038
Neg logit contributions
use-0.071
 greet-0.058
 claim-0.057
 visit-0.057
to-0.054
Tokena
Token ID64
Feature activation+1.02e+00

Pos logit contributions
 very+0.509
 best+0.430
 so+0.426
 scr+0.412
 inches+0.386
Neg logit contributions
use-0.728
 greet-0.603
 claim-0.585
 visit-0.583
 meet-0.557
Token told
Token ID1297
Feature activation-5.91e+00

Pos logit contributions
 very+0.024
 so+0.020
 best+0.020
 scr+0.019
 inches+0.018
Neg logit contributions
use-0.038
 greet-0.031
 claim-0.031
 visit-0.030
|-0.029
Token the
Token ID262
Feature activation+3.38e+00

Pos logit contributions
use+0.190
 greet+0.148
 claim+0.146
 visit+0.142
|+0.138
Neg logit contributions
 very-0.177
 so-0.156
 best-0.149
 scr-0.147
 skin-0.136
Token baby
Token ID5156
Feature activation+4.19e+00

Pos logit contributions
 very+0.066
 best+0.053
 so+0.052
 scr+0.048
 inches+0.043
Neg logit contributions
use-0.144
 greet-0.120
 claim-0.118
 visit-0.115
war-0.115
Token,
Token ID11
Feature activation+6.23e+00

Pos logit contributions
 very+0.117
 best+0.101
 so+0.100
 scr+0.100
 inches+0.095
Neg logit contributions
use-0.134
 greet-0.107
 visit-0.104
 claim-0.103
 meet-0.098
Token small
Token ID1402
Feature activation+3.63e+00

Pos logit contributions
 very+0.079
 so+0.069
 best+0.067
 scr+0.066
 dots+0.061
Neg logit contributions
use-0.096
 greet-0.076
 claim-0.074
 visit-0.072
|-0.070
Token yellow
Token ID7872
Feature activation+1.05e+01

Pos logit contributions
 very+0.081
 best+0.067
 so+0.067
 scr+0.065
 inches+0.060
Neg logit contributions
use-0.137
 greet-0.114
 claim-0.111
 visit-0.110
 meet-0.106
Token bird
Token ID6512
Feature activation+5.90e+00

Pos logit contributions
 very+0.223
 best+0.188
 scr+0.187
 so+0.184
 inches+0.175
Neg logit contributions
use-0.387
 visit-0.320
 greet-0.317
 claim-0.310
 meet-0.299
Token who
Token ID508
Feature activation+2.74e+00

Pos logit contributions
 very+0.137
 so+0.114
 best+0.113
 scr+0.109
 inches+0.101
Neg logit contributions
use-0.220
 greet-0.183
 claim-0.179
 visit-0.177
 meet-0.169
Token was
Token ID373
Feature activation+1.08e+01

Pos logit contributions
 very+0.061
 so+0.048
 best+0.046
 scr+0.045
 dots+0.041
Neg logit contributions
use-0.108
 greet-0.090
 claim-0.089
 visit-0.086
etch-0.085
Token very
Token ID845
Feature activation+2.04e+01

Pos logit contributions
 very+0.197
 so+0.154
 best+0.147
 scr+0.133
 inches+0.119
Neg logit contributions
use-0.477
 greet-0.401
 claim-0.396
 visit-0.388
war-0.381
Token proud
Token ID6613
Feature activation+8.36e+00

Pos logit contributions
 very+0.347
 so+0.258
 best+0.258
 scr+0.240
 inches+0.209
Neg logit contributions
use-0.913
 greet-0.775
 claim-0.764
 visit-0.755
|-0.735
Token.
Token ID13
Feature activation+4.39e+00

Pos logit contributions
 very+0.240
 so+0.209
 best+0.207
 scr+0.200
 inches+0.190
Neg logit contributions
use-0.270
 greet-0.215
 claim-0.208
 visit-0.205
|-0.198
Token He
Token ID679
Feature activation+3.09e+00

Pos logit contributions
 very+0.110
 so+0.103
 because+0.084
 best+0.081
 scr+0.081
Neg logit contributions
use-0.168
 greet-0.131
war-0.128
 claim-0.125
talk-0.123
Token loved
Token ID6151
Feature activation+3.86e+00

Pos logit contributions
 very+0.076
 so+0.064
 best+0.062
 scr+0.061
 inches+0.055
Neg logit contributions
use-0.113
 greet-0.092
 claim-0.091
 visit-0.088
war-0.086
Token to
Token ID284
Feature activation-6.34e+00

Pos logit contributions
 very+0.120
 so+0.105
 best+0.099
 scr+0.097
 dots+0.092
Neg logit contributions
use-0.125
 greet-0.094
 claim-0.093
|-0.090
war-0.087
Token
Token ID198
Feature activation+7.90e-03

Pos logit contributions
 very+0.089
 so+0.077
 best+0.074
 scr+0.073
der+0.069
Neg logit contributions
use-0.110
 greet-0.087
 claim-0.085
 visit-0.083
|-0.080
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
 very+0.000
 so+0.000
 best+0.000
 incredibly+0.000
 much+0.000
Neg logit contributions
use-0.000
 greet-0.000
 claim-0.000
 awaited-0.000
war-0.000
TokenThe
Token ID464
Feature activation+4.43e+00

Pos logit contributions
 very+0.033
 so+0.028
 incredibly+0.024
 best+0.024
 scr+0.024
Neg logit contributions
use-0.043
 greet-0.033
 claim-0.032
 visit-0.029
 gra-0.029
Token baby
Token ID5156
Feature activation+1.01e+01

Pos logit contributions
 very+0.101
 best+0.085
 so+0.084
 scr+0.081
 inches+0.075
Neg logit contributions
use-0.167
 greet-0.139
 claim-0.135
 visit-0.134
to-0.129
Token was
Token ID373
Feature activation+1.10e+01

Pos logit contributions
 very+0.231
 best+0.192
 so+0.192
 scr+0.183
 inches+0.171
Neg logit contributions
use-0.380
 greet-0.318
 claim-0.310
 visit-0.304
to-0.295
Token very
Token ID845
Feature activation+2.03e+01

Pos logit contributions
 very+0.109
 best+0.068
 so+0.066
 scr+0.062
 inches+0.047
Neg logit contributions
use-0.547
 greet-0.480
 claim-0.470
 visit-0.469
 meet-0.456
Token happy
Token ID3772
Feature activation+6.80e+00

Pos logit contributions
 very+0.323
 best+0.244
 so+0.243
 scr+0.231
 inches+0.202
Neg logit contributions
use-0.910
 greet-0.779
 claim-0.763
 visit-0.759
 meet-0.734
Token and
Token ID290
Feature activation+8.78e+00

Pos logit contributions
 very+0.143
 best+0.118
 so+0.116
 scr+0.114
 inches+0.104
Neg logit contributions
use-0.265
 greet-0.223
 visit-0.217
 claim-0.217
 meet-0.208
Token the
Token ID262
Feature activation+6.19e+00

Pos logit contributions
 very+0.222
 best+0.195
 scr+0.189
 so+0.186
 inches+0.183
Neg logit contributions
use-0.289
 greet-0.244
 visit-0.237
 claim-0.233
 meet-0.226
Token m
Token ID285
Feature activation+7.66e+00

Pos logit contributions
 very+0.137
 best+0.114
 so+0.113
 scr+0.109
 inches+0.101
Neg logit contributions
use-0.236
 greet-0.197
 claim-0.192
 visit-0.190
 meet-0.183
Tokenama
Token ID1689
Feature activation-6.83e-01

Pos logit contributions
 very+0.188
 best+0.159
 so+0.157
 scr+0.154
 inches+0.144
Neg logit contributions
use-0.278
 greet-0.230
 claim-0.222
 visit-0.220
 meet-0.212
Token they
Token ID484
Feature activation-6.62e-02

Pos logit contributions
use+0.041
 greet+0.033
 claim+0.031
 visit+0.031
 meet+0.030
Neg logit contributions
 very-0.039
 best-0.034
 so-0.033
 scr-0.033
 inches-0.031
Token arrived
Token ID5284
Feature activation+4.60e-01

Pos logit contributions
use+0.002
 greet+0.002
 claim+0.002
etch+0.002
to+0.002
Neg logit contributions
 very-0.002
 so-0.001
 scr-0.001
 best-0.001
 extremely-0.001
Token home
Token ID1363
Feature activation-2.19e+00

Pos logit contributions
 very+0.008
 best+0.006
 so+0.006
 inches+0.004
 eyes+0.004
Neg logit contributions
use-0.022
 greet-0.018
 claim-0.018
|-0.018
war-0.018
Token,
Token ID11
Feature activation+6.98e+00

Pos logit contributions
use+0.100
 greet+0.085
 claim+0.083
 visit+0.083
war+0.081
Neg logit contributions
 very-0.035
 so-0.027
 best-0.026
 scr-0.023
 inches-0.021
Token M
Token ID337
Feature activation+4.20e+00

Pos logit contributions
 very+0.216
 best+0.189
 so+0.188
 scr+0.188
 inches+0.180
Neg logit contributions
use-0.198
 greet-0.157
 visit-0.152
 claim-0.151
 meet-0.141
Tokenomm
Token ID2002
Feature activation+2.03e+01

Pos logit contributions
 very+0.095
 so+0.078
 best+0.077
 scr+0.075
 dots+0.071
Neg logit contributions
use-0.156
to-0.126
 greet-0.125
 claim-0.124
 visit-0.124
Tokena
Token ID64
Feature activation+1.36e+00

Pos logit contributions
 very+0.515
 best+0.439
 so+0.437
 scr+0.427
 inches+0.399
Neg logit contributions
use-0.702
 greet-0.575
 claim-0.557
 visit-0.553
to-0.530
Token gave
Token ID2921
Feature activation-8.01e-01

Pos logit contributions
 very+0.031
 so+0.026
 best+0.025
 scr+0.025
 inches+0.022
Neg logit contributions
use-0.052
 greet-0.043
 claim-0.042
 visit-0.041
etch-0.040
Token Ron
Token ID6575
Feature activation-3.89e+00

Pos logit contributions
use+0.023
 greet+0.018
 claim+0.017
 visit+0.017
 meet+0.016
Neg logit contributions
 very-0.026
 so-0.023
 best-0.023
 scr-0.022
 inches-0.021
Tokenny
Token ID3281
Feature activation+2.99e+00

Pos logit contributions
use+0.119
 greet+0.098
 claim+0.097
 visit+0.095
 meet+0.091
Neg logit contributions
 very-0.115
 best-0.099
 so-0.099
 scr-0.092
 inches-0.088
Token a
Token ID257
Feature activation+6.37e+00

Pos logit contributions
 very+0.055
 best+0.049
 scr+0.048
 dots+0.047
 inches+0.047
Neg logit contributions
use-0.112
 greet-0.099
 visit-0.098
 meet-0.092
 claim-0.091
Token yawn
Token ID40908
Feature activation+9.36e+00

Pos logit contributions
use+0.027
 greet+0.020
 visit+0.019
 claim+0.018
 meet+0.016
Neg logit contributions
 very-0.051
 best-0.048
 so-0.046
 inches-0.046
 scr-0.046
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
 very+0.223
 so+0.189
 best+0.182
 scr+0.173
der+0.163
Neg logit contributions
use-0.354
 greet-0.292
 claim-0.284
 visit-0.282
|-0.274
Token
Token ID198
Feature activation-5.11e+00

Pos logit contributions
 very+0.054
 so+0.049
 best+0.045
 scr+0.043
 eyes+0.041
Neg logit contributions
use-0.054
 greet-0.039
 claim-0.038
talk-0.037
away-0.036
Token
Token ID198
Feature activation-5.05e+00

Pos logit contributions
use+0.210
 claim+0.152
 greet+0.145
talk+0.143
away+0.139
Neg logit contributions
 very-0.152
 so-0.123
 best-0.103
 really-0.095
 scr-0.095
TokenM
Token ID44
Feature activation+1.56e+00

Pos logit contributions
use+0.182
 greet+0.138
 claim+0.133
 visit+0.125
talk+0.125
Neg logit contributions
 very-0.144
 so-0.121
 scr-0.118
 best-0.114
 extremely-0.105
Tokenomm
Token ID2002
Feature activation+2.03e+01

Pos logit contributions
 very+0.030
 so+0.025
 best+0.023
 scr+0.022
 blades+0.022
Neg logit contributions
use-0.063
 visit-0.052
 greet-0.051
 claim-0.051
to-0.051
Tokena
Token ID64
Feature activation+1.36e+00

Pos logit contributions
 very+0.512
 best+0.436
 so+0.435
 scr+0.424
 inches+0.396
Neg logit contributions
use-0.704
 greet-0.576
 claim-0.559
 visit-0.555
to-0.531
Token smiled
Token ID13541
Feature activation+1.11e+01

Pos logit contributions
 very+0.032
 best+0.027
 so+0.027
 scr+0.026
 inches+0.024
Neg logit contributions
use-0.049
 greet-0.041
 claim-0.040
 visit-0.040
 meet-0.038
Token and
Token ID290
Feature activation+5.95e+00

Pos logit contributions
 sizes+0.230
 inches+0.230
 blades+0.227
 very+0.225
 scr+0.218
Neg logit contributions
use-0.338
 visit-0.310
 greet-0.304
 claim-0.292
to-0.280
Token said
Token ID531
Feature activation-1.74e-01

Pos logit contributions
 very+0.168
 best+0.146
 so+0.145
 scr+0.142
 inches+0.134
Neg logit contributions
use-0.190
 greet-0.153
 claim-0.148
 visit-0.147
 meet-0.140
Token,
Token ID11
Feature activation+7.01e+00

Pos logit contributions
use+0.008
 greet+0.007
 claim+0.006
 visit+0.006
 meet+0.006
Neg logit contributions
 very-0.003
 best-0.002
 so-0.002
 scr-0.002
 inches-0.002
Token.
Token ID13
Feature activation+1.93e+00

Pos logit contributions
use+0.201
 greet+0.165
 claim+0.161
|+0.161
 visit+0.158
Neg logit contributions
 very-0.117
 so-0.099
 best-0.093
 scr-0.087
 inches-0.080
Token
Token ID220
Feature activation+4.24e+00

Pos logit contributions
 very+0.063
 so+0.056
 best+0.054
 scr+0.051
 most+0.049
Neg logit contributions
use-0.060
 greet-0.045
 claim-0.044
etch-0.042
war-0.041
Token
Token ID198
Feature activation-5.12e+00

Pos logit contributions
 very+0.119
 so+0.107
 best+0.102
 most+0.084
 scr+0.083
Neg logit contributions
use-0.164
 greet-0.128
away-0.126
 claim-0.123
talk-0.117
Token
Token ID198
Feature activation-5.08e+00

Pos logit contributions
use+0.200
 claim+0.152
talk+0.143
 greet+0.141
away+0.141
Neg logit contributions
 very-0.153
 so-0.131
 best-0.111
 scr-0.103
 incredibly-0.100
TokenM
Token ID44
Feature activation+1.46e+00

Pos logit contributions
use+0.181
 greet+0.140
 claim+0.138
 visit+0.131
etch+0.128
Neg logit contributions
 very-0.144
 scr-0.124
 so-0.123
 best-0.119
 incredibly-0.110
Tokenomm
Token ID2002
Feature activation+2.03e+01

Pos logit contributions
 very+0.029
 so+0.024
 best+0.023
 scr+0.022
 blades+0.022
Neg logit contributions
use-0.058
 greet-0.048
 visit-0.047
 claim-0.047
to-0.046
Tokena
Token ID64
Feature activation+1.47e+00

Pos logit contributions
 very+0.539
 best+0.478
 so+0.471
 scr+0.467
 dots+0.459
Neg logit contributions
use-0.632
 greet-0.510
 claim-0.487
 visit-0.483
to-0.483
Token and
Token ID290
Feature activation+6.03e+00

Pos logit contributions
 very+0.034
 best+0.029
 so+0.028
 scr+0.027
 inches+0.026
Neg logit contributions
use-0.054
 greet-0.045
 visit-0.044
 claim-0.043
 meet-0.042
Token Billy
Token ID15890
Feature activation-5.86e+00

Pos logit contributions
 very+0.212
 so+0.190
 best+0.187
 scr+0.183
 inches+0.173
Neg logit contributions
use-0.157
 greet-0.117
 claim-0.111
 visit-0.108
to-0.103
Token stayed
Token ID9658
Feature activation+3.63e+00

Pos logit contributions
use+0.214
 greet+0.178
 claim+0.172
 visit+0.172
 meet+0.165
Neg logit contributions
 very-0.137
 best-0.116
 so-0.115
 scr-0.111
 inches-0.105
Token by
Token ID416
Feature activation-6.62e+00

Pos logit contributions
 very+0.099
 so+0.084
 best+0.084
 scr+0.074
 inches+0.072
Neg logit contributions
use-0.131
 greet-0.103
 claim-0.102
war-0.101
to-0.099
Token
Token ID198
Feature activation+7.90e-03

Pos logit contributions
 very+0.173
 best+0.144
 so+0.136
 scr+0.130
 dots+0.119
Neg logit contributions
use-0.242
 greet-0.198
|-0.195
 claim-0.191
to-0.186
TokenTom
Token ID13787
Feature activation+1.38e+00

Pos logit contributions
 very+0.000
 so+0.000
 best+0.000
 incredibly+0.000
 much+0.000
Neg logit contributions
use-0.000
 greet-0.000
 claim-0.000
 awaited-0.000
war-0.000
Token and
Token ID290
Feature activation+1.12e+01

Pos logit contributions
 very+0.030
 best+0.025
 so+0.024
 scr+0.024
 inches+0.023
Neg logit contributions
use-0.053
 greet-0.044
 visit-0.043
 claim-0.042
 meet-0.042
Token Lily
Token ID20037
Feature activation+1.23e+00

Pos logit contributions
 very+0.324
 so+0.280
 best+0.244
 ears+0.225
 eyes+0.223
Neg logit contributions
use-0.390
irs-0.321
 claim-0.313
to-0.313
war-0.306
Token are
Token ID389
Feature activation+1.06e+01

Pos logit contributions
 very+0.029
 best+0.024
 so+0.024
 scr+0.023
 inches+0.022
Neg logit contributions
use-0.046
 greet-0.038
 claim-0.037
 visit-0.037
 meet-0.035
Token very
Token ID845
Feature activation+2.02e+01

Pos logit contributions
 very+0.174
 scr+0.159
 dots+0.151
der+0.150
 blades+0.145
Neg logit contributions
use-0.402
 greet-0.358
 visit-0.348
 claim-0.346
 meet-0.342
Token sorry
Token ID7926
Feature activation+2.75e+00

Pos logit contributions
 very+0.337
 best+0.254
 so+0.245
 scr+0.233
 inches+0.201
Neg logit contributions
use-0.916
 greet-0.781
 claim-0.767
 visit-0.758
to-0.738
Token.
Token ID13
Feature activation+4.65e+00

Pos logit contributions
 very+0.067
 best+0.061
 blades+0.061
 scr+0.060
 inches+0.059
Neg logit contributions
use-0.087
 visit-0.074
 greet-0.070
 claim-0.068
 meet-0.068
Token They
Token ID1119
Feature activation+3.33e+00

Pos logit contributions
 very+0.118
 so+0.100
 best+0.095
 scr+0.094
 dots+0.089
Neg logit contributions
use-0.169
 greet-0.138
 claim-0.132
 visit-0.129
to-0.128
Token say
Token ID910
Feature activation+5.15e-01

Pos logit contributions
 very+0.112
 so+0.098
 best+0.098
 scr+0.096
 inches+0.091
Neg logit contributions
use-0.090
 greet-0.069
 claim-0.066
 visit-0.066
to-0.062
Token,
Token ID11
Feature activation+8.05e+00

Pos logit contributions
 very+0.011
 best+0.009
 so+0.009
 scr+0.009
 inches+0.008
Neg logit contributions
use-0.020
 greet-0.016
 claim-0.016
 visit-0.016
 meet-0.015
Token Lily
Token ID20037
Feature activation+3.10e+00

Pos logit contributions
 very+0.068
 so+0.059
 best+0.056
 scr+0.051
 most+0.048
Neg logit contributions
use-0.105
 greet-0.088
 claim-0.085
war-0.085
to-0.084
Token's
Token ID338
Feature activation+1.38e+01

Pos logit contributions
 very+0.075
 best+0.063
 so+0.063
 scr+0.061
 inches+0.057
Neg logit contributions
use-0.112
 greet-0.093
 claim-0.090
 visit-0.089
 meet-0.086
Token mom
Token ID1995
Feature activation+4.69e+00

Pos logit contributions
 very+0.352
 best+0.301
 so+0.295
 scr+0.285
 inches+0.266
Neg logit contributions
use-0.491
 greet-0.402
 claim-0.390
 visit-0.383
|-0.373
Tokenmy
Token ID1820
Feature activation-7.84e-01

Pos logit contributions
 very+0.121
 so+0.102
 scr+0.096
 best+0.094
 because+0.078
Neg logit contributions
use-0.163
 greet-0.149
|-0.147
war-0.141
 claim-0.140
Token was
Token ID373
Feature activation+1.08e+01

Pos logit contributions
use+0.027
 greet+0.023
|+0.022
 claim+0.022
war+0.021
Neg logit contributions
 very-0.021
 so-0.018
 best-0.017
 scr-0.017
 inches-0.014
Token very
Token ID845
Feature activation+2.02e+01

Pos logit contributions
 very+0.152
 so+0.109
 best+0.107
 scr+0.100
 inches+0.085
Neg logit contributions
use-0.511
 greet-0.440
 claim-0.431
 visit-0.429
 meet-0.416
Token smart
Token ID4451
Feature activation+3.64e+00

Pos logit contributions
 very+0.352
 so+0.252
 best+0.250
 scr+0.243
 inches+0.202
Neg logit contributions
use-0.921
 greet-0.790
 claim-0.775
 visit-0.763
|-0.755
Token and
Token ID290
Feature activation+9.18e+00

Pos logit contributions
 very+0.079
 scr+0.078
 best+0.073
 blades+0.070
 inches+0.069
Neg logit contributions
use-0.121
 visit-0.102
 greet-0.098
 claim-0.097
to-0.091
Token knew
Token ID2993
Feature activation+9.45e+00

Pos logit contributions
 very+0.216
 best+0.185
 so+0.182
 scr+0.179
 inches+0.171
Neg logit contributions
use-0.328
 greet-0.272
 visit-0.266
 claim-0.264
 meet-0.253
Token how
Token ID703
Feature activation+9.32e+00

Pos logit contributions
 very+0.235
 so+0.200
 best+0.199
 scr+0.188
 inches+0.176
Neg logit contributions
use-0.344
 greet-0.282
 claim-0.272
 visit-0.271
|-0.262
Token to
Token ID284
Feature activation-6.08e+00

Pos logit contributions
 very+0.324
 so+0.277
 best+0.271
 scr+0.256
 inches+0.242
Neg logit contributions
use-0.281
 greet-0.216
 claim-0.207
|-0.206
talk-0.195
Token
Token ID198
Feature activation+7.90e-03

Pos logit contributions
use+0.127
 visit+0.106
 greet+0.105
 claim+0.101
 meet+0.100
Neg logit contributions
 very-0.096
 best-0.086
 scr-0.084
 dots-0.082
 inches-0.081
TokenThe
Token ID464
Feature activation+5.81e+00

Pos logit contributions
 very+0.000
 so+0.000
 best+0.000
 incredibly+0.000
 much+0.000
Neg logit contributions
use-0.000
 greet-0.000
 claim-0.000
 awaited-0.000
war-0.000
Token little
Token ID1310
Feature activation+4.30e+00

Pos logit contributions
 very+0.132
 best+0.110
 so+0.108
 scr+0.104
 inches+0.097
Neg logit contributions
use-0.221
 greet-0.184
 claim-0.179
 visit-0.177
to-0.173
Token boy
Token ID2933
Feature activation+7.21e+00

Pos logit contributions
 very+0.091
 best+0.074
 so+0.074
 scr+0.069
 inches+0.065
Neg logit contributions
use-0.170
 greet-0.144
 claim-0.140
 visit-0.138
to-0.135
Token was
Token ID373
Feature activation+1.10e+01

Pos logit contributions
 very+0.167
 so+0.139
 best+0.138
 scr+0.134
 inches+0.124
Neg logit contributions
use-0.269
 greet-0.223
 claim-0.218
 visit-0.216
 meet-0.207
Token very
Token ID845
Feature activation+2.02e+01

Pos logit contributions
 very+0.095
 best+0.057
 so+0.052
 scr+0.052
 dots+0.039
Neg logit contributions
use-0.556
 greet-0.493
 visit-0.481
 claim-0.480
 meet-0.469
Token glad
Token ID9675
Feature activation+6.28e+00

Pos logit contributions
 very+0.329
 best+0.257
 so+0.254
 scr+0.244
 dots+0.219
Neg logit contributions
use-0.881
 greet-0.757
 claim-0.737
 visit-0.735
 meet-0.712
Token when
Token ID618
Feature activation+4.05e+00

Pos logit contributions
 very+0.153
 so+0.132
 best+0.124
 scr+0.118
 inches+0.111
Neg logit contributions
use-0.238
 greet-0.195
 claim-0.189
|-0.185
 visit-0.184
Token the
Token ID262
Feature activation+5.88e+00

Pos logit contributions
 very+0.104
 so+0.089
 best+0.087
 inches+0.073
 most+0.073
Neg logit contributions
use-0.157
war-0.128
|-0.124
etch-0.122
 greet-0.122
Token doctor
Token ID6253
Feature activation+1.63e+00

Pos logit contributions
 very+0.128
 best+0.105
 so+0.100
 scr+0.095
 inches+0.089
Neg logit contributions
use-0.234
 greet-0.195
 claim-0.190
to-0.186
|-0.186
Token told
Token ID1297
Feature activation-4.45e+00

Pos logit contributions
 very+0.040
 so+0.034
 best+0.032
 scr+0.030
 inches+0.028
Neg logit contributions
use-0.058
 greet-0.050
|-0.048
etch-0.048
 claim-0.048
Token and
Token ID290
Feature activation+4.07e+00

Pos logit contributions
 very+0.127
 extremely+0.117
 incredibly+0.113
 best+0.111
 scr+0.110
Neg logit contributions
use-0.125
 visit-0.100
 claim-0.094
 greet-0.093
of-0.092
Token Jimmy
Token ID12963
Feature activation-2.95e+00

Pos logit contributions
 very+0.072
 best+0.064
 so+0.058
der+0.057
 inches+0.056
Neg logit contributions
use-0.157
 greet-0.135
 visit-0.131
 claim-0.128
 meet-0.126
Token's
Token ID338
Feature activation+5.66e+00

Pos logit contributions
use+0.090
 greet+0.073
 visit+0.073
to+0.070
 meet+0.070
Neg logit contributions
 very-0.067
 best-0.065
der-0.061
 dots-0.060
 inches-0.060
Token face
Token ID1986
Feature activation-1.43e+00

Pos logit contributions
 very+0.092
 so+0.072
 best+0.071
 scr+0.069
der+0.065
Neg logit contributions
use-0.237
 greet-0.204
 visit-0.203
 claim-0.198
 meet-0.192
Token sc
Token ID629
Feature activation+5.20e+00

Pos logit contributions
use+0.042
 visit+0.034
 greet+0.033
to+0.033
 claim+0.031
Neg logit contributions
 very-0.033
 best-0.033
 inches-0.030
 so-0.029
 worst-0.029
Tokenrun
Token ID5143
Feature activation-3.20e+01

Pos logit contributions
 very+0.157
 scr+0.137
 so+0.135
 best+0.134
 inches+0.128
Neg logit contributions
use-0.154
 greet-0.125
 visit-0.124
 claim-0.119
|-0.113
Tokenched
Token ID1740
Feature activation-6.57e+00

Pos logit contributions
use+0.974
 greet+0.784
 claim+0.767
 visit+0.753
 meet+0.709
Neg logit contributions
 very-0.967
 so-0.837
 best-0.827
 scr-0.809
 inches-0.755
Token up
Token ID510
Feature activation-5.34e+00

Pos logit contributions
use+0.197
 greet+0.159
 visit+0.152
 claim+0.151
to+0.147
Neg logit contributions
 very-0.177
 scr-0.159
 best-0.157
 so-0.152
 inches-0.151
Token as
Token ID355
Feature activation+2.94e+00

Pos logit contributions
use+0.192
 greet+0.159
 claim+0.155
 visit+0.154
 meet+0.147
Neg logit contributions
 very-0.127
 best-0.108
 so-0.106
 scr-0.104
 inches-0.097
Token he
Token ID339
Feature activation-2.20e+00

Pos logit contributions
 very+0.068
 best+0.058
 so+0.057
 scr+0.056
 inches+0.052
Neg logit contributions
use-0.106
 greet-0.088
 claim-0.086
 visit-0.086
to-0.082
Token ate
Token ID15063
Feature activation-3.76e+00

Pos logit contributions
use+0.079
 greet+0.065
 claim+0.064
 visit+0.063
|+0.061
Neg logit contributions
 very-0.055
 so-0.046
 scr-0.046
 best-0.045
 inches-0.042
Token do
Token ID466
Feature activation-3.74e+00

Pos logit contributions
use+0.160
 greet+0.125
 claim+0.123
 visit+0.118
to+0.118
Neg logit contributions
 very-0.142
 so-0.124
 best-0.117
 scr-0.114
 because-0.106
Token it
Token ID340
Feature activation+1.06e+00

Pos logit contributions
use+0.115
 greet+0.093
 visit+0.091
 claim+0.091
 meet+0.086
Neg logit contributions
 very-0.102
 best-0.089
 scr-0.089
 so-0.087
 dots-0.085
Token.
Token ID13
Feature activation-5.04e-01

Pos logit contributions
 very+0.025
 best+0.021
 so+0.021
 scr+0.020
 inches+0.019
Neg logit contributions
use-0.039
 greet-0.032
 claim-0.032
 visit-0.031
 meet-0.030
Token She
Token ID1375
Feature activation+5.31e-01

Pos logit contributions
use+0.018
 greet+0.014
 claim+0.014
 visit+0.013
|+0.013
Neg logit contributions
 very-0.013
 so-0.012
 best-0.011
 scr-0.011
 skin-0.010
Token sc
Token ID629
Feature activation+5.15e+00

Pos logit contributions
 very+0.012
 best+0.010
 so+0.010
 scr+0.010
 inches+0.009
Neg logit contributions
use-0.020
 greet-0.016
 visit-0.016
 claim-0.016
to-0.015
Tokenrun
Token ID5143
Feature activation-3.16e+01

Pos logit contributions
 very+0.173
 so+0.156
 scr+0.153
 best+0.151
 inches+0.144
Neg logit contributions
use-0.136
 greet-0.103
 visit-0.102
 claim-0.099
|-0.095
Tokenched
Token ID1740
Feature activation-6.15e+00

Pos logit contributions
use+0.967
 greet+0.786
 claim+0.780
 visit+0.754
 meet+0.709
Neg logit contributions
 very-0.948
 so-0.831
 best-0.769
 scr-0.762
 incredibly-0.712
Token up
Token ID510
Feature activation-5.17e+00

Pos logit contributions
use+0.172
 greet+0.138
 visit+0.130
 claim+0.129
to+0.127
Neg logit contributions
 very-0.183
 best-0.164
 scr-0.164
 inches-0.161
der-0.158
Token her
Token ID607
Feature activation+2.77e+00

Pos logit contributions
use+0.186
 greet+0.156
 visit+0.152
 claim+0.150
to+0.144
Neg logit contributions
 very-0.115
 best-0.100
 scr-0.096
 so-0.093
 inches-0.091
Token face
Token ID1986
Feature activation-1.17e+00

Pos logit contributions
 very+0.052
 best+0.042
 so+0.042
 scr+0.041
 inches+0.038
Neg logit contributions
use-0.110
 greet-0.093
 visit-0.092
 claim-0.090
 meet-0.087
Token.
Token ID13
Feature activation-5.67e-01

Pos logit contributions
use+0.042
 visit+0.035
 greet+0.035
to+0.034
 claim+0.034
Neg logit contributions
 very-0.021
 best-0.019
 inches-0.018
 scr-0.017
 dots-0.017
Token the
Token ID262
Feature activation+5.20e-01

Pos logit contributions
use+0.316
 greet+0.252
 claim+0.243
 visit+0.241
to+0.233
Neg logit contributions
 very-0.273
 so-0.235
 best-0.233
 scr-0.225
 inches-0.214
Token park
Token ID3952
Feature activation-2.04e+00

Pos logit contributions
 very+0.012
 best+0.010
 so+0.010
 scr+0.009
 inches+0.009
Neg logit contributions
use-0.020
 greet-0.016
 claim-0.016
 visit-0.016
 meet-0.015
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
use+0.075
 greet+0.061
 claim+0.060
 visit+0.059
|+0.057
Neg logit contributions
 very-0.050
 so-0.042
 best-0.041
 scr-0.040
 inches-0.037
Token Julia
Token ID22300
Feature activation-7.44e-01

Pos logit contributions
use+0.004
 greet+0.003
 claim+0.003
 visit+0.003
 oblig+0.003
Neg logit contributions
 very-0.003
 so-0.003
 best-0.003
 scr-0.003
 dots-0.003
Token sc
Token ID629
Feature activation+5.37e+00

Pos logit contributions
use+0.025
 greet+0.020
 visit+0.020
 claim+0.019
 meet+0.019
Neg logit contributions
 very-0.019
 best-0.017
 so-0.016
 scr-0.016
 inches-0.015
Tokenrun
Token ID5143
Feature activation-3.15e+01

Pos logit contributions
 very+0.182
 so+0.162
 scr+0.161
 best+0.159
 inches+0.153
Neg logit contributions
use-0.144
 visit-0.107
 greet-0.107
 claim-0.102
|-0.100
Tokenched
Token ID1740
Feature activation-5.94e+00

Pos logit contributions
use+0.976
 claim+0.774
 greet+0.774
 visit+0.740
 meet+0.693
Neg logit contributions
 very-0.960
 so-0.837
 best-0.792
 scr-0.776
 dots-0.734
Token up
Token ID510
Feature activation-4.91e+00

Pos logit contributions
use+0.172
 greet+0.140
 visit+0.132
 claim+0.130
to+0.125
Neg logit contributions
 very-0.173
 scr-0.155
 best-0.152
 so-0.150
der-0.148
Token her
Token ID607
Feature activation+3.39e+00

Pos logit contributions
use+0.181
 greet+0.153
 visit+0.149
 claim+0.148
 meet+0.140
Neg logit contributions
 very-0.107
 best-0.091
 scr-0.089
 so-0.087
 inches-0.081
Token face
Token ID1986
Feature activation-7.90e-01

Pos logit contributions
 very+0.076
 so+0.064
 best+0.063
 scr+0.063
 inches+0.059
Neg logit contributions
use-0.124
 greet-0.102
 visit-0.102
 claim-0.100
 meet-0.095
Token and
Token ID290
Feature activation+4.09e+00

Pos logit contributions
use+0.027
 visit+0.023
 greet+0.023
 claim+0.022
to+0.022
Neg logit contributions
 very-0.015
 best-0.014
 inches-0.013
 scr-0.013
 dots-0.013
Token
Token ID198
Feature activation-7.09e+00

Pos logit contributions
use+0.313
 claim+0.245
 greet+0.234
 duel+0.228
 trace+0.226
Neg logit contributions
 very-0.183
 so-0.140
 best-0.111
 really-0.110
 much-0.101
TokenM
Token ID44
Feature activation-5.48e-01

Pos logit contributions
use+0.191
 greet+0.147
 claim+0.140
 visit+0.140
 meet+0.131
Neg logit contributions
 very-0.235
 best-0.210
 so-0.208
 scr-0.204
 inches-0.195
Tokenam
Token ID321
Feature activation-1.73e+00

Pos logit contributions
use+0.016
 visit+0.012
 greet+0.012
to+0.012
 claim+0.012
Neg logit contributions
 very-0.015
 so-0.014
 sizes-0.013
 best-0.013
 dots-0.013
Tokenma
Token ID2611
Feature activation-4.42e+00

Pos logit contributions
use+0.058
 claim+0.046
 greet+0.045
 visit+0.044
|+0.042
Neg logit contributions
 very-0.052
 so-0.045
 best-0.043
 scr-0.040
 most-0.037
Token sc
Token ID629
Feature activation+5.35e+00

Pos logit contributions
use+0.175
 greet+0.149
 claim+0.146
|+0.143
etch+0.143
Neg logit contributions
 very-0.105
 so-0.088
 best-0.073
 scr-0.066
 ears-0.064
Tokenrun
Token ID5143
Feature activation-3.15e+01

Pos logit contributions
 very+0.140
 so+0.119
 scr+0.118
 best+0.115
 inches+0.110
Neg logit contributions
use-0.182
 visit-0.152
 greet-0.151
|-0.144
 claim-0.143
Tokenched
Token ID1740
Feature activation-6.05e+00

Pos logit contributions
use+1.106
 claim+0.933
 greet+0.926
 visit+0.897
 meet+0.840
Neg logit contributions
 very-0.850
 so-0.716
 best-0.668
 scr-0.608
 blades-0.583
Token up
Token ID510
Feature activation-4.96e+00

Pos logit contributions
use+0.179
 greet+0.144
 visit+0.137
 claim+0.136
to+0.131
Neg logit contributions
 very-0.177
 best-0.158
 scr-0.157
 so-0.154
 inches-0.148
Token all
Token ID477
Feature activation-1.19e+00

Pos logit contributions
use+0.178
 greet+0.148
 visit+0.144
 claim+0.144
 meet+0.136
Neg logit contributions
 very-0.118
 best-0.101
 so-0.099
 scr-0.098
 inches-0.091
Token of
Token ID286
Feature activation-1.04e+01

Pos logit contributions
use+0.040
 claim+0.032
 greet+0.032
 visit+0.030
to+0.030
Neg logit contributions
 very-0.034
 so-0.030
 best-0.029
 scr-0.027
 inches-0.026
Token the
Token ID262
Feature activation+4.28e-01

Pos logit contributions
use+0.314
 greet+0.249
 claim+0.240
 visit+0.239
 meet+0.226
Neg logit contributions
 very-0.313
 best-0.274
 so-0.272
 scr-0.267
 inches-0.253
Token could
Token ID714
Feature activation-3.42e+00

Pos logit contributions
use+0.078
 greet+0.065
 claim+0.063
 visit+0.062
|+0.061
Neg logit contributions
 very-0.050
 so-0.042
 best-0.041
 scr-0.038
 dots-0.036
Token,
Token ID11
Feature activation+4.46e+00

Pos logit contributions
use+0.144
 greet+0.114
|+0.113
 claim+0.112
 visit+0.109
Neg logit contributions
 very-0.076
 so-0.063
 best-0.059
 scr-0.053
der-0.048
Token trying
Token ID2111
Feature activation+2.92e+00

Pos logit contributions
 very+0.098
 so+0.081
 best+0.081
 scr+0.078
 inches+0.072
Neg logit contributions
use-0.172
 greet-0.143
 claim-0.139
 visit-0.139
 meet-0.133
Token to
Token ID284
Feature activation-1.01e+01

Pos logit contributions
 very+0.102
 so+0.091
 best+0.090
 scr+0.087
 inches+0.083
Neg logit contributions
use-0.077
 greet-0.057
 claim-0.055
 visit-0.054
|-0.052
Token out
Token ID503
Feature activation-6.85e+00

Pos logit contributions
use+0.348
|+0.250
to+0.250
enn+0.244
etch+0.243
Neg logit contributions
 very-0.315
 so-0.279
 best-0.258
 scr-0.243
 inches-0.231
Tokenrun
Token ID5143
Feature activation-3.14e+01

Pos logit contributions
use+0.269
 greet+0.214
 claim+0.210
|+0.210
 visit+0.208
Neg logit contributions
 very-0.165
 so-0.142
 best-0.132
 scr-0.125
 inches-0.119
Token the
Token ID262
Feature activation-9.96e-02

Pos logit contributions
use+1.078
 greet+0.880
 claim+0.877
 visit+0.837
|+0.821
Neg logit contributions
 very-0.878
 so-0.753
 best-0.739
 scr-0.640
 most-0.636
Token backwards
Token ID16196
Feature activation-3.64e+00

Pos logit contributions
use+0.004
 greet+0.003
 visit+0.003
 claim+0.003
 meet+0.003
Neg logit contributions
 very-0.002
 so-0.002
 best-0.002
 scr-0.002
 inches-0.002
Token me
Token ID502
Feature activation-5.81e+00

Pos logit contributions
use+0.128
 greet+0.106
 visit+0.103
 claim+0.103
 meet+0.098
Neg logit contributions
 very-0.088
 best-0.075
 scr-0.073
 so-0.073
 inches-0.068
Tokenadow
Token ID4584
Feature activation+1.26e+00

Pos logit contributions
use+0.206
 greet+0.169
 claim+0.164
 visit+0.164
 meet+0.157
Neg logit contributions
 very-0.141
 best-0.121
 so-0.119
 scr-0.117
 dots-0.109
Token.
Token ID13
Feature activation-3.40e-01

Pos logit contributions
 very+0.031
 best+0.027
 scr+0.026
 so+0.026
 dots+0.024
Neg logit contributions
use-0.043
 greet-0.036
 visit-0.035
 claim-0.035
 meet-0.033
Token knew
Token ID2993
Feature activation+5.14e+00

Pos logit contributions
use+0.227
 greet+0.186
 claim+0.183
 visit+0.178
|+0.171
Neg logit contributions
 very-0.170
 so-0.145
 best-0.140
 scr-0.136
 inches-0.127
Token he
Token ID339
Feature activation-2.11e+00

Pos logit contributions
 very+0.132
 so+0.117
 best+0.109
 scr+0.096
 inches+0.094
Neg logit contributions
use-0.200
 greet-0.159
|-0.157
 claim-0.154
war-0.153
Token couldn
Token ID3521
Feature activation+1.06e+00

Pos logit contributions
use+0.074
 greet+0.061
 claim+0.059
 visit+0.059
 meet+0.056
Neg logit contributions
 very-0.053
 best-0.045
 so-0.045
 scr-0.044
 inches-0.041
Token't
Token ID470
Feature activation-3.43e+00

Pos logit contributions
 very+0.037
 so+0.033
 best+0.032
 scr+0.030
 inches+0.029
Neg logit contributions
use-0.029
 greet-0.022
 claim-0.021
 visit-0.020
war-0.019
Token out
Token ID503
Feature activation-6.63e+00

Pos logit contributions
use+0.106
to+0.083
 greet+0.079
war+0.078
irs+0.078
Neg logit contributions
 very-0.105
 so-0.092
 best-0.084
 scr-0.083
 because-0.080
Tokenrun
Token ID5143
Feature activation-3.13e+01

Pos logit contributions
use+0.256
 greet+0.207
 claim+0.203
|+0.201
 visit+0.200
Neg logit contributions
 very-0.160
 so-0.137
 best-0.129
 scr-0.119
 inches-0.117
Token the
Token ID262
Feature activation+4.04e-01

Pos logit contributions
use+1.119
 greet+0.934
 claim+0.927
 visit+0.875
|+0.846
Neg logit contributions
 very-0.820
 so-0.699
 best-0.679
 scr-0.587
 most-0.576
Token lion
Token ID18744
Feature activation-3.84e+00

Pos logit contributions
 very+0.009
 best+0.007
 so+0.007
 scr+0.007
 inches+0.007
Neg logit contributions
use-0.016
 greet-0.013
 claim-0.013
 visit-0.013
 meet-0.012
Token,
Token ID11
Feature activation+4.56e+00

Pos logit contributions
use+0.152
 greet+0.126
 claim+0.122
|+0.119
 visit+0.119
Neg logit contributions
 very-0.088
 so-0.076
 best-0.070
 scr-0.065
 inches-0.060
Token so
Token ID523
Feature activation+1.07e+01

Pos logit contributions
 very+0.055
 best+0.039
 so+0.037
 scr+0.036
 inches+0.030
Neg logit contributions
use-0.218
 greet-0.190
 claim-0.186
 visit-0.185
 meet-0.180
Token he
Token ID339
Feature activation-2.06e+00

Pos logit contributions
 very+0.290
 so+0.252
 best+0.239
 scr+0.223
 inches+0.211
Neg logit contributions
use-0.400
 greet-0.319
 claim-0.311
|-0.305
 visit-0.299
Token but
Token ID475
Feature activation-2.74e+00

Pos logit contributions
 very+0.115
 so+0.097
 best+0.092
 scr+0.089
 inches+0.082
Neg logit contributions
use-0.188
 greet-0.155
 claim-0.152
 visit-0.151
|-0.146
Token she
Token ID673
Feature activation-1.14e-01

Pos logit contributions
use+0.104
 greet+0.084
 claim+0.081
 visit+0.080
|+0.079
Neg logit contributions
 very-0.069
 so-0.058
 best-0.056
 scr-0.052
 inches-0.048
Token couldn
Token ID3521
Feature activation+1.16e+00

Pos logit contributions
use+0.004
 greet+0.004
 claim+0.004
 visit+0.004
 meet+0.003
Neg logit contributions
 very-0.002
 best-0.002
 so-0.002
 scr-0.002
 inches-0.002
Token't
Token ID470
Feature activation-3.44e+00

Pos logit contributions
 very+0.041
 so+0.036
 best+0.035
 scr+0.034
 inches+0.032
Neg logit contributions
use-0.032
 greet-0.024
 claim-0.023
 visit-0.021
war-0.021
Token out
Token ID503
Feature activation-6.62e+00

Pos logit contributions
use+0.094
etch+0.066
to+0.066
 greet+0.065
war+0.064
Neg logit contributions
 very-0.120
 so-0.107
 best-0.099
 scr-0.098
 because-0.093
Tokenrun
Token ID5143
Feature activation-3.13e+01

Pos logit contributions
use+0.274
 greet+0.221
|+0.221
war+0.220
to+0.218
Neg logit contributions
 very-0.147
 so-0.125
 best-0.112
 scr-0.103
 inches-0.100
Token the
Token ID262
Feature activation+7.38e-01

Pos logit contributions
use+1.096
 greet+0.908
 claim+0.898
 visit+0.852
etch+0.822
Neg logit contributions
 very-0.849
 so-0.729
 best-0.704
 scr-0.603
 most-0.597
Token man
Token ID582
Feature activation-1.80e+00

Pos logit contributions
 very+0.018
 best+0.015
 so+0.015
 scr+0.014
 inches+0.014
Neg logit contributions
use-0.027
 greet-0.022
 claim-0.022
to-0.021
 visit-0.021
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
use+0.071
 greet+0.058
|+0.057
Ell+0.056
irs+0.056
Neg logit contributions
 very-0.044
 so-0.038
 best-0.032
 scr-0.031
 because-0.028
Token He
Token ID679
Feature activation-8.79e-01

Pos logit contributions
use+0.005
 greet+0.004
 claim+0.004
war+0.004
etch+0.004
Neg logit contributions
 very-0.003
 so-0.003
 scr-0.003
 best-0.003
 eyes-0.003
Token quickly
Token ID2952
Feature activation-2.23e+00

Pos logit contributions
use+0.033
 greet+0.028
 claim+0.027
 visit+0.027
 meet+0.026
Neg logit contributions
 very-0.020
 so-0.017
 best-0.017
 scr-0.016
 inches-0.015
Token toys
Token ID14958
Feature activation-1.92e+00

Pos logit contributions
 very+0.063
 best+0.051
 so+0.050
 scr+0.048
 inches+0.044
Neg logit contributions
use-0.126
 greet-0.106
 claim-0.103
 visit-0.102
 meet-0.099
Token instead
Token ID2427
Feature activation-1.67e+00

Pos logit contributions
use+0.070
 greet+0.058
 claim+0.056
 visit+0.056
 meet+0.054
Neg logit contributions
 very-0.046
 so-0.039
 best-0.039
 scr-0.037
 inches-0.035
Token.
Token ID13
Feature activation+5.11e-01

Pos logit contributions
use+0.070
 greet+0.058
 claim+0.057
 visit+0.055
|+0.055
Neg logit contributions
 very-0.036
 so-0.031
 best-0.028
 scr-0.025
 because-0.023
Token He
Token ID679
Feature activation-4.33e-01

Pos logit contributions
 very+0.013
 so+0.012
 scr+0.010
 best+0.010
 because+0.009
Neg logit contributions
use-0.020
 greet-0.016
 claim-0.015
|-0.015
 meet-0.015
Token sc
Token ID629
Feature activation+5.82e+00

Pos logit contributions
use+0.015
 greet+0.012
 claim+0.012
 visit+0.011
 meet+0.011
Neg logit contributions
 very-0.012
 so-0.010
 best-0.010
 scr-0.010
 inches-0.009
Tokenrun
Token ID5143
Feature activation-3.12e+01

Pos logit contributions
 very+0.160
 so+0.139
 scr+0.137
 best+0.134
der+0.127
Neg logit contributions
use-0.187
 visit-0.151
 greet-0.147
 claim-0.141
|-0.141
Tokenched
Token ID1740
Feature activation-5.49e+00

Pos logit contributions
use+0.999
 greet+0.824
 claim+0.814
 visit+0.784
 meet+0.740
Neg logit contributions
 very-0.915
 so-0.794
 best-0.731
 scr-0.694
 incredibly-0.669
Token his
Token ID465
Feature activation+2.90e+00

Pos logit contributions
use+0.172
 greet+0.136
 claim+0.132
 visit+0.131
 meet+0.125
Neg logit contributions
 very-0.163
 so-0.142
 best-0.141
 scr-0.136
 inches-0.129
Token face
Token ID1986
Feature activation-8.48e-01

Pos logit contributions
 very+0.056
der+0.049
 scr+0.048
 so+0.047
 dots+0.047
Neg logit contributions
use-0.106
 greet-0.092
 visit-0.092
 claim-0.086
 meet-0.084
Token up
Token ID510
Feature activation-4.71e+00

Pos logit contributions
use+0.030
 visit+0.026
 greet+0.025
to+0.025
 claim+0.025
Neg logit contributions
 very-0.015
 best-0.014
 dots-0.014
 inches-0.014
 scr-0.013
Token and
Token ID290
Feature activation+4.03e+00

Pos logit contributions
use+0.195
 greet+0.166
 visit+0.162
 claim+0.162
to+0.155
Neg logit contributions
 very-0.086
 best-0.069
 so-0.067
 scr-0.066
 dots-0.060
Token was
Token ID373
Feature activation+5.04e+00

Pos logit contributions
 very+0.160
 best+0.134
 so+0.134
 scr+0.129
 inches+0.120
Neg logit contributions
use-0.257
 greet-0.213
 claim-0.207
 visit-0.206
 meet-0.198
Token missing
Token ID4814
Feature activation-5.19e+00

Pos logit contributions
 very+0.105
 so+0.085
 best+0.085
 scr+0.082
 inches+0.075
Neg logit contributions
use-0.200
 greet-0.168
 claim-0.164
 visit-0.163
 meet-0.157
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
use+0.178
 greet+0.146
 claim+0.142
 visit+0.142
 meet+0.134
Neg logit contributions
 very-0.127
 best-0.109
 scr-0.109
 so-0.107
 dots-0.101
Token She
Token ID1375
Feature activation+3.81e-01

Pos logit contributions
use+0.011
 greet+0.009
 claim+0.008
 visit+0.008
etch+0.008
Neg logit contributions
 very-0.008
 so-0.007
 best-0.006
 scr-0.006
der-0.006
Token sc
Token ID629
Feature activation+5.46e+00

Pos logit contributions
 very+0.009
 best+0.008
 so+0.008
 scr+0.008
 dots+0.007
Neg logit contributions
use-0.013
 greet-0.011
 claim-0.011
 visit-0.011
 meet-0.010
Tokenrun
Token ID5143
Feature activation-3.11e+01

Pos logit contributions
 very+0.166
 so+0.146
 scr+0.142
 best+0.142
 inches+0.135
Neg logit contributions
use-0.166
 visit-0.130
 greet-0.128
 claim-0.124
|-0.119
Tokenched
Token ID1740
Feature activation-5.62e+00

Pos logit contributions
use+0.971
 claim+0.795
 greet+0.792
 visit+0.772
 meet+0.717
Neg logit contributions
 very-0.915
 so-0.798
 best-0.742
 scr-0.737
 incredibly-0.691
Token up
Token ID510
Feature activation-4.91e+00

Pos logit contributions
use+0.177
 greet+0.141
 claim+0.137
 visit+0.136
|+0.129
Neg logit contributions
 very-0.167
 so-0.146
 best-0.144
 scr-0.138
 dots-0.131
Token her
Token ID607
Feature activation+3.35e+00

Pos logit contributions
use+0.197
 greet+0.163
 claim+0.160
 visit+0.156
 meet+0.154
Neg logit contributions
 very-0.108
 so-0.091
 best-0.085
 scr-0.080
 inches-0.074
Token face
Token ID1986
Feature activation-7.00e-01

Pos logit contributions
 very+0.070
 best+0.058
 so+0.057
 scr+0.057
 inches+0.052
Neg logit contributions
use-0.129
 greet-0.109
 visit-0.106
 claim-0.106
 meet-0.101
Token and
Token ID290
Feature activation+3.75e+00

Pos logit contributions
use+0.027
 greet+0.023
 visit+0.023
 claim+0.023
to+0.022
Neg logit contributions
 very-0.013
 best-0.011
 scr-0.010
 so-0.010
 inches-0.010
Token hated
Token ID16563
Feature activation-1.70e+00

Pos logit contributions
use+0.031
 greet+0.025
 visit+0.025
 claim+0.024
 meet+0.024
Neg logit contributions
 very-0.019
 best-0.017
 so-0.016
 scr-0.016
 inches-0.016
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
use+0.055
 visit+0.046
 greet+0.046
 meet+0.044
 claim+0.043
Neg logit contributions
 very-0.041
 best-0.037
 scr-0.034
 inches-0.033
 extremely-0.033
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
 dots+0.017
 scr+0.016
 blades+0.015
 inches+0.015
 scalp+0.015
Neg logit contributions
use-0.036
 visit-0.032
etch-0.030
 claim-0.029
 meet-0.029
Token He
Token ID679
Feature activation-8.00e-01

Pos logit contributions
use+0.007
 greet+0.006
 claim+0.005
 visit+0.005
 oblig+0.005
Neg logit contributions
 very-0.007
 so-0.006
 best-0.006
 most-0.005
 scr-0.005
Token sc
Token ID629
Feature activation+5.46e+00

Pos logit contributions
use+0.029
 greet+0.023
 visit+0.023
 claim+0.023
 meet+0.022
Neg logit contributions
 very-0.019
 best-0.017
 so-0.016
 scr-0.016
 inches-0.015
Tokenrun
Token ID5143
Feature activation-3.10e+01

Pos logit contributions
 very+0.161
 so+0.140
 scr+0.137
 best+0.136
 inches+0.127
Neg logit contributions
use-0.171
 greet-0.136
 visit-0.135
 claim-0.130
|-0.125
Tokenched
Token ID1740
Feature activation-5.60e+00

Pos logit contributions
use+0.992
 greet+0.800
 claim+0.778
 visit+0.767
 meet+0.722
Neg logit contributions
 very-0.891
 so-0.769
 best-0.749
 scr-0.735
 dots-0.686
Token his
Token ID465
Feature activation+2.50e+00

Pos logit contributions
use+0.144
 greet+0.113
 visit+0.106
 claim+0.105
war+0.104
Neg logit contributions
 very-0.171
 scr-0.162
 best-0.158
 inches-0.153
der-0.152
Token face
Token ID1986
Feature activation-6.39e-01

Pos logit contributions
 very+0.046
der+0.041
 scr+0.040
 so+0.039
 inches+0.039
Neg logit contributions
use-0.090
 visit-0.079
 greet-0.077
 claim-0.072
 meet-0.072
Token up
Token ID510
Feature activation-4.77e+00

Pos logit contributions
use+0.021
to+0.018
 visit+0.018
war+0.017
 greet+0.017
Neg logit contributions
 dots-0.011
 best-0.011
 very-0.011
 inches-0.011
 sizes-0.011
Token as
Token ID355
Feature activation+3.49e+00

Pos logit contributions
use+0.189
 visit+0.160
 greet+0.159
 claim+0.156
to+0.150
Neg logit contributions
 very-0.083
 best-0.073
 scr-0.070
 dots-0.068
 so-0.065
Token you
Token ID345
Feature activation-4.62e+00

Pos logit contributions
use+0.172
 greet+0.139
 claim+0.136
|+0.132
war+0.129
Neg logit contributions
 very-0.185
 so-0.164
 best-0.156
 scr-0.150
 skin-0.143
Token?"
Token ID1701
Feature activation-4.34e+00

Pos logit contributions
use+0.147
 greet+0.123
 claim+0.119
|+0.118
war+0.115
Neg logit contributions
 very-0.138
 so-0.119
 best-0.117
 scr-0.109
 most-0.101
Token
Token ID198
Feature activation-6.90e+00

Pos logit contributions
use+0.134
 greet+0.106
etch+0.104
|+0.099
war+0.098
Neg logit contributions
 very-0.142
 so-0.126
 best-0.113
 because-0.107
 scr-0.101
Token
Token ID198
Feature activation-6.86e+00

Pos logit contributions
use+0.275
 greet+0.226
etch+0.217
 confront+0.215
 claim+0.211
Neg logit contributions
 very-0.182
 so-0.143
 best-0.121
 and-0.114
 a-0.102
TokenB
Token ID33
Feature activation-1.42e+00

Pos logit contributions
use+0.198
 greet+0.164
 claim+0.149
|+0.149
etch+0.148
Neg logit contributions
 very-0.222
 so-0.190
 best-0.181
 scr-0.178
 most-0.164
Tokenrun
Token ID5143
Feature activation-3.08e+01

Pos logit contributions
use+0.036
 greet+0.031
 claim+0.029
 visit+0.027
 meet+0.026
Neg logit contributions
 very-0.045
 so-0.040
 best-0.039
 scr-0.039
der-0.037
Tokeno
Token ID78
Feature activation-4.22e+00

Pos logit contributions
use+0.987
 greet+0.802
 claim+0.797
 visit+0.770
 meet+0.726
Neg logit contributions
 very-0.890
 so-0.763
 best-0.742
 scr-0.722
 dots-0.676
Token said
Token ID531
Feature activation-2.18e+00

Pos logit contributions
use+0.150
 greet+0.124
 visit+0.120
 claim+0.119
to+0.114
Neg logit contributions
 very-0.103
 best-0.088
 so-0.086
 scr-0.084
 inches-0.079
Token,
Token ID11
Feature activation+4.91e+00

Pos logit contributions
use+0.093
 greet+0.080
 visit+0.077
 claim+0.077
to+0.075
Neg logit contributions
 very-0.037
 best-0.029
 so-0.029
 scr-0.028
 inches-0.026
Token "
Token ID366
Feature activation-1.36e+00

Pos logit contributions
 very+0.123
 so+0.104
 best+0.101
 scr+0.099
 dots+0.090
Neg logit contributions
use-0.179
 greet-0.145
 claim-0.142
 visit-0.140
|-0.137
TokenOf
Token ID5189
Feature activation-4.22e+00

Pos logit contributions
use+0.033
 greet+0.025
 claim+0.025
 visit+0.023
|+0.023
Neg logit contributions
 very-0.049
 so-0.043
 best-0.042
 scr-0.041
 dots-0.040
Token never
Token ID1239
Feature activation-5.75e-01

Pos logit contributions
 very+0.018
 so+0.015
 best+0.014
 scr+0.012
 because+0.012
Neg logit contributions
use-0.029
war-0.024
to-0.024
Em-0.023
 greet-0.023
Token be
Token ID307
Feature activation+4.60e+00

Pos logit contributions
use+0.018
irs+0.017
Em+0.017
to+0.016
war+0.016
Neg logit contributions
 very-0.016
 so-0.014
 best-0.013
 too-0.012
 scr-0.012
Token able
Token ID1498
Feature activation+1.71e-01

Pos logit contributions
 very+0.105
 so+0.088
 best+0.083
 scr+0.073
 inches+0.071
Neg logit contributions
use-0.187
 greet-0.154
 claim-0.151
 visit-0.150
to-0.144
Token to
Token ID284
Feature activation-1.00e+01

Pos logit contributions
 very+0.005
 so+0.005
 best+0.004
 too+0.004
 inches+0.004
Neg logit contributions
use-0.006
 greet-0.005
 claim-0.005
|-0.005
talk-0.004
Token out
Token ID503
Feature activation-6.60e+00

Pos logit contributions
use+0.321
Em+0.284
to+0.265
war+0.263
enn+0.262
Neg logit contributions
 very-0.316
 so-0.274
 best-0.250
 too-0.229
 scr-0.228
Tokenrun
Token ID5143
Feature activation-3.08e+01

Pos logit contributions
use+0.275
 greet+0.220
war+0.217
|+0.216
 claim+0.215
Neg logit contributions
 very-0.153
 so-0.133
 best-0.112
 inches-0.106
 because-0.102
Token the
Token ID262
Feature activation+5.02e-02

Pos logit contributions
use+1.100
 greet+0.916
 claim+0.905
 visit+0.858
|+0.844
Neg logit contributions
 very-0.822
 so-0.696
 best-0.688
 most-0.581
 scr-0.566
Token lion
Token ID18744
Feature activation-3.80e+00

Pos logit contributions
 very+0.001
 best+0.001
 so+0.001
 scr+0.001
 inches+0.001
Neg logit contributions
use-0.002
 greet-0.002
 claim-0.002
 visit-0.002
to-0.002
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
use+0.150
 greet+0.124
 claim+0.118
 oblig+0.116
|+0.115
Neg logit contributions
 very-0.089
 so-0.077
 best-0.069
 scr-0.065
 inches-0.060
Token He
Token ID679
Feature activation-7.65e-01

Pos logit contributions
 very+0.004
 so+0.004
 best+0.003
 because+0.003
 most+0.003
Neg logit contributions
use-0.006
 greet-0.005
 claim-0.005
irs-0.005
away-0.004
Token lay
Token ID3830
Feature activation+2.26e+00

Pos logit contributions
use+0.025
 greet+0.021
 visit+0.020
 claim+0.019
 meet+0.019
Neg logit contributions
 very-0.019
 best-0.017
 dots-0.017
 scr-0.017
 so-0.016
Token but
Token ID475
Feature activation-2.59e+00

Pos logit contributions
 very+0.105
 so+0.089
 best+0.082
 scr+0.077
 inches+0.071
Neg logit contributions
use-0.190
 greet-0.156
 claim-0.152
 visit-0.150
to-0.148
Token he
Token ID339
Feature activation-1.65e+00

Pos logit contributions
use+0.095
 greet+0.075
 claim+0.073
to+0.072
war+0.072
Neg logit contributions
 very-0.072
 so-0.062
 best-0.058
 scr-0.052
 most-0.051
Token couldn
Token ID3521
Feature activation+1.74e+00

Pos logit contributions
use+0.063
 greet+0.052
 claim+0.051
 visit+0.051
 meet+0.049
Neg logit contributions
 very-0.036
 best-0.030
 so-0.030
 scr-0.029
 dots-0.027
Token't
Token ID470
Feature activation-3.41e+00

Pos logit contributions
 very+0.059
 so+0.053
 best+0.050
 scr+0.048
 inches+0.045
Neg logit contributions
use-0.050
 greet-0.039
 claim-0.036
 visit-0.034
talk-0.034
Token out
Token ID503
Feature activation-6.39e+00

Pos logit contributions
use+0.091
etch+0.067
war+0.065
to+0.064
|+0.064
Neg logit contributions
 very-0.123
 so-0.110
 best-0.101
 scr-0.099
 because-0.095
Tokenrun
Token ID5143
Feature activation-3.06e+01

Pos logit contributions
use+0.274
|+0.221
 greet+0.221
war+0.221
to+0.218
Neg logit contributions
 very-0.138
 so-0.118
 best-0.100
 inches-0.091
 because-0.089
Token the
Token ID262
Feature activation+4.33e-01

Pos logit contributions
use+1.075
 greet+0.904
 claim+0.886
 visit+0.839
|+0.817
Neg logit contributions
 very-0.821
 so-0.700
 best-0.684
 most-0.576
 scr-0.573
Token enormous
Token ID9812
Feature activation-1.37e+00

Pos logit contributions
 very+0.009
 best+0.008
 so+0.008
 scr+0.007
 inches+0.007
Neg logit contributions
use-0.017
 greet-0.014
 claim-0.014
to-0.014
 visit-0.014
Token wolves
Token ID23214
Feature activation-6.23e-01

Pos logit contributions
use+0.049
 greet+0.040
 claim+0.039
 visit+0.039
 meet+0.037
Neg logit contributions
 very-0.034
 best-0.029
 so-0.029
 scr-0.028
 inches-0.026
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
use+0.026
 greet+0.022
to+0.022
war+0.022
iffs+0.021
Neg logit contributions
 very-0.013
 so-0.012
 best-0.009
 because-0.008
 too-0.008
Token Stru
Token ID34887
Feature activation+5.29e+00

Pos logit contributions
 very+0.005
 so+0.004
 best+0.004
der+0.004
 scr+0.004
Neg logit contributions
use-0.007
 greet-0.006
 claim-0.005
 visit-0.005
 meet-0.005
Token her
Token ID607
Feature activation+3.83e+00

Pos logit contributions
use+0.045
 greet+0.036
 claim+0.035
|+0.035
war+0.035
Neg logit contributions
 very-0.031
 so-0.026
 best-0.025
 scr-0.024
 inches-0.022
Token candy
Token ID18550
Feature activation+3.90e+00

Pos logit contributions
 very+0.100
 best+0.085
 so+0.084
 scr+0.082
 inches+0.076
Neg logit contributions
use-0.133
 greet-0.109
 claim-0.105
 visit-0.103
|-0.100
Token,
Token ID11
Feature activation+5.26e+00

Pos logit contributions
 very+0.070
 best+0.057
 scr+0.057
 so+0.054
 dots+0.053
Neg logit contributions
use-0.158
 visit-0.134
 claim-0.133
 greet-0.133
to-0.124
Token she
Token ID673
Feature activation+6.92e-01

Pos logit contributions
 very+0.137
 so+0.118
 best+0.116
 scr+0.109
 dots+0.102
Neg logit contributions
use-0.191
 greet-0.152
 claim-0.147
etch-0.144
 visit-0.144
Token sc
Token ID629
Feature activation+6.26e+00

Pos logit contributions
 very+0.018
 best+0.016
 so+0.015
 scr+0.015
 inches+0.014
Neg logit contributions
use-0.023
 greet-0.019
 visit-0.019
 claim-0.018
 meet-0.017
Tokenrun
Token ID5143
Feature activation-3.05e+01

Pos logit contributions
 very+0.179
 scr+0.154
 so+0.154
 best+0.149
 inches+0.143
Neg logit contributions
use-0.198
 greet-0.155
 visit-0.151
|-0.146
 meet-0.141
Tokenched
Token ID1740
Feature activation-4.87e+00

Pos logit contributions
use+0.951
 greet+0.770
 claim+0.755
 visit+0.735
 meet+0.695
Neg logit contributions
 very-0.918
 so-0.795
 best-0.753
 scr-0.732
 dots-0.689
Token her
Token ID607
Feature activation+4.04e+00

Pos logit contributions
use+0.155
 greet+0.125
 claim+0.121
 visit+0.120
 meet+0.113
Neg logit contributions
 very-0.139
 best-0.120
 so-0.120
 scr-0.118
 inches-0.112
Token eyes
Token ID2951
Feature activation+8.07e+00

Pos logit contributions
 very+0.087
 so+0.073
 scr+0.072
 best+0.071
 inches+0.070
Neg logit contributions
use-0.149
 visit-0.126
 greet-0.125
 claim-0.123
 meet-0.115
Token tight
Token ID5381
Feature activation+3.05e+00

Pos logit contributions
 very+0.165
 best+0.158
 inches+0.146
 sizes+0.143
 dots+0.142
Neg logit contributions
use-0.282
 greet-0.240
 visit-0.238
 claim-0.235
to-0.222
Token.
Token ID13
Feature activation+5.88e-01

Pos logit contributions
 very+0.072
 best+0.065
 scr+0.065
 dots+0.062
 inches+0.061
Neg logit contributions
use-0.100
 greet-0.080
 visit-0.080
 claim-0.079
to-0.077
Token paws
Token ID49193
Feature activation-1.58e+00

Pos logit contributions
 very+0.057
 so+0.047
 scr+0.046
 best+0.045
der+0.042
Neg logit contributions
use-0.123
 greet-0.104
 visit-0.104
 claim-0.101
 meet-0.098
Token.
Token ID13
Feature activation+5.23e-01

Pos logit contributions
use+0.047
 greet+0.038
 claim+0.037
 visit+0.036
|+0.034
Neg logit contributions
 very-0.049
 so-0.043
 best-0.042
 scr-0.041
 inches-0.038
Token So
Token ID1406
Feature activation+3.16e+00

Pos logit contributions
 very+0.012
 so+0.011
 because+0.009
 best+0.009
 scr+0.009
Neg logit contributions
use-0.022
 greet-0.017
 claim-0.016
irs-0.016
 oblig-0.016
Token he
Token ID339
Feature activation-1.23e+00

Pos logit contributions
 very+0.057
 scr+0.048
 best+0.046
 so+0.045
 dots+0.042
Neg logit contributions
use-0.131
 greet-0.111
 visit-0.110
 claim-0.108
 meet-0.104
Token sc
Token ID629
Feature activation+6.24e+00

Pos logit contributions
use+0.040
 greet+0.032
 visit+0.032
 claim+0.031
to+0.030
Neg logit contributions
 very-0.033
 best-0.029
 so-0.028
 scr-0.028
 inches-0.027
Tokenrun
Token ID5143
Feature activation-3.04e+01

Pos logit contributions
 very+0.187
 so+0.163
 best+0.161
 scr+0.157
 inches+0.149
Neg logit contributions
use-0.189
 greet-0.144
 visit-0.143
 claim-0.138
|-0.134
Tokenched
Token ID1740
Feature activation-4.85e+00

Pos logit contributions
use+0.927
 greet+0.745
 claim+0.726
 visit+0.706
 meet+0.664
Neg logit contributions
 very-0.930
 so-0.806
 best-0.772
 scr-0.754
 dots-0.710
Token up
Token ID510
Feature activation-4.18e+00

Pos logit contributions
use+0.127
 greet+0.103
 visit+0.099
to+0.097
 claim+0.096
Neg logit contributions
 very-0.144
 scr-0.137
 best-0.134
 inches-0.130
der-0.129
Token his
Token ID465
Feature activation+3.30e+00

Pos logit contributions
use+0.144
 visit+0.123
 greet+0.122
 claim+0.118
to+0.113
Neg logit contributions
 very-0.090
 best-0.085
 scr-0.080
 dots-0.077
 sizes-0.074
Token face
Token ID1986
Feature activation+8.84e-02

Pos logit contributions
 very+0.067
 so+0.057
 scr+0.056
 best+0.054
der+0.054
Neg logit contributions
use-0.124
 visit-0.106
 greet-0.103
 claim-0.101
 meet-0.098
Token and
Token ID290
Feature activation+4.33e+00

Pos logit contributions
 very+0.002
 best+0.002
 dots+0.002
 scr+0.002
 inches+0.002
Neg logit contributions
use-0.003
 visit-0.003
to-0.002
 greet-0.002
 claim-0.002
Token and
Token ID290
Feature activation+4.20e+00

Pos logit contributions
use+0.015
 greet+0.013
 claim+0.012
 visit+0.012
 meet+0.012
Neg logit contributions
 very-0.011
 best-0.009
 so-0.009
 scr-0.009
 inches-0.008
Token always
Token ID1464
Feature activation-8.28e-02

Pos logit contributions
 very+0.104
 so+0.087
 best+0.087
 scr+0.083
 inches+0.077
Neg logit contributions
use-0.153
 greet-0.125
 claim-0.122
 visit-0.119
to-0.115
Token tried
Token ID3088
Feature activation+1.73e+00

Pos logit contributions
use+0.003
 greet+0.002
 claim+0.002
 visit+0.002
to+0.002
Neg logit contributions
 very-0.002
 best-0.002
 so-0.002
 scr-0.002
 inches-0.002
Token to
Token ID284
Feature activation-9.28e+00

Pos logit contributions
 very+0.050
 best+0.044
 so+0.043
 scr+0.042
 inches+0.040
Neg logit contributions
use-0.054
 greet-0.043
 claim-0.042
 visit-0.042
 meet-0.040
Token out
Token ID503
Feature activation-5.98e+00

Pos logit contributions
use+0.310
enn+0.234
to+0.228
|+0.225
 claim+0.219
Neg logit contributions
 very-0.294
 so-0.256
 best-0.249
 scr-0.227
 most-0.217
Tokenrun
Token ID5143
Feature activation-3.01e+01

Pos logit contributions
use+0.240
 greet+0.197
 claim+0.193
 visit+0.190
|+0.189
Neg logit contributions
 very-0.132
 so-0.108
 best-0.106
 scr-0.101
 inches-0.093
Token everyone
Token ID2506
Feature activation-2.12e+00

Pos logit contributions
use+1.098
 greet+0.902
 claim+0.894
 visit+0.854
|+0.846
Neg logit contributions
 very-0.773
 best-0.655
 so-0.650
 scr-0.574
 most-0.548
Token else
Token ID2073
Feature activation-5.12e+00

Pos logit contributions
use+0.081
 greet+0.066
 claim+0.065
|+0.063
 visit+0.062
Neg logit contributions
 very-0.054
 so-0.046
 best-0.044
 scr-0.040
 inches-0.038
Token.
Token ID13
Feature activation+6.29e-01

Pos logit contributions
use+0.188
 greet+0.154
 claim+0.150
 visit+0.148
|+0.144
Neg logit contributions
 very-0.126
 best-0.106
 so-0.105
 scr-0.100
 inches-0.093
Token One
Token ID1881
Feature activation-1.45e+00

Pos logit contributions
 very+0.018
 so+0.016
 best+0.015
 because+0.014
 most+0.013
Neg logit contributions
use-0.023
 greet-0.017
 claim-0.016
away-0.016
Aud-0.016
Token day
Token ID1110
Feature activation-1.09e+00

Pos logit contributions
use+0.043
 visit+0.037
 greet+0.035
 meet+0.033
 claim+0.033
Neg logit contributions
 very-0.039
 scr-0.036
 so-0.035
der-0.035
 best-0.035
Token silly
Token ID14397
Feature activation+1.48e+00

Pos logit contributions
 very+0.094
 scr+0.076
 best+0.075
 dots+0.071
der+0.069
Neg logit contributions
use-0.253
 greet-0.224
 claim-0.216
 visit-0.216
 meet-0.210
Token to
Token ID284
Feature activation-8.12e+00

Pos logit contributions
 very+0.034
 scr+0.029
 best+0.028
 so+0.027
 inches+0.026
Neg logit contributions
use-0.053
 greet-0.044
 visit-0.043
 claim-0.042
 meet-0.041
Token try
Token ID1949
Feature activation+2.21e+00

Pos logit contributions
use+0.223
|+0.184
war+0.177
to+0.176
enn+0.176
Neg logit contributions
 very-0.277
 so-0.248
 scr-0.231
 best-0.229
 skin-0.226
Token and
Token ID290
Feature activation+5.62e+00

Pos logit contributions
 very+0.065
 so+0.057
 best+0.055
 scr+0.052
 inches+0.049
Neg logit contributions
use-0.072
 greet-0.058
 claim-0.056
|-0.055
 visit-0.055
Token out
Token ID503
Feature activation-5.30e+00

Pos logit contributions
 very+0.190
 so+0.170
 scr+0.159
 best+0.156
 skin+0.146
Neg logit contributions
use-0.164
|-0.127
enn-0.125
war-0.120
etch-0.119
Tokenrun
Token ID5143
Feature activation-2.99e+01

Pos logit contributions
use+0.203
 claim+0.170
 greet+0.168
|+0.164
war+0.162
Neg logit contributions
 very-0.129
 so-0.111
 best-0.099
 scr-0.094
 inches-0.088
Token a
Token ID257
Feature activation+5.77e+00

Pos logit contributions
use+0.946
 claim+0.800
 greet+0.798
 visit+0.754
 meet+0.712
Neg logit contributions
 very-0.866
 so-0.751
 best-0.731
 scr-0.660
 most-0.631
Token vehicle
Token ID4038
Feature activation+7.51e-01

Pos logit contributions
 very+0.140
 best+0.113
 so+0.112
 scr+0.107
 inches+0.096
Neg logit contributions
use-0.218
 greet-0.181
 claim-0.174
|-0.173
 visit-0.170
Token!
Token ID0
Feature activation+4.12e-01

Pos logit contributions
 very+0.019
 so+0.016
 best+0.014
 scr+0.014
 inches+0.012
Neg logit contributions
use-0.028
 greet-0.024
 claim-0.023
|-0.023
 visit-0.022
Token Always
Token ID16622
Feature activation-3.30e+00

Pos logit contributions
 very+0.011
 so+0.010
 best+0.008
 scr+0.008
 because+0.008
Neg logit contributions
use-0.015
 greet-0.012
talk-0.012
 claim-0.011
etch-0.011
Token think
Token ID892
Feature activation+3.58e+00

Pos logit contributions
use+0.096
 greet+0.076
 claim+0.074
|+0.074
to+0.072
Neg logit contributions
 very-0.104
 so-0.091
 best-0.091
 scr-0.087
 inches-0.082
Token the
Token ID262
Feature activation+5.01e-01

Pos logit contributions
 very+0.112
 so+0.096
 best+0.096
 scr+0.093
 inches+0.087
Neg logit contributions
use-0.148
 greet-0.120
 claim-0.117
 visit-0.116
 meet-0.111
Token dirt
Token ID13647
Feature activation+6.55e+00

Pos logit contributions
 very+0.010
 best+0.008
 so+0.008
 scr+0.007
 inches+0.007
Neg logit contributions
use-0.021
 greet-0.017
 claim-0.017
 visit-0.017
to-0.016
Token.
Token ID13
Feature activation+5.29e-01

Pos logit contributions
 very+0.168
 best+0.150
 scr+0.148
 so+0.142
 extremely+0.137
Neg logit contributions
use-0.193
 greet-0.161
 visit-0.158
 claim-0.153
to-0.145
Token He
Token ID679
Feature activation+1.64e-01

Pos logit contributions
 very+0.014
 so+0.013
 best+0.012
 scr+0.012
 dots+0.011
Neg logit contributions
use-0.018
 greet-0.014
 claim-0.014
 visit-0.014
|-0.013
Token sc
Token ID629
Feature activation+6.52e+00

Pos logit contributions
 very+0.004
 best+0.003
 so+0.003
 scr+0.003
 inches+0.003
Neg logit contributions
use-0.006
 greet-0.005
 visit-0.005
 claim-0.005
 meet-0.004
Tokenrun
Token ID5143
Feature activation-2.99e+01

Pos logit contributions
 very+0.193
 so+0.169
 scr+0.167
 best+0.167
 inches+0.161
Neg logit contributions
use-0.197
 visit-0.153
 greet-0.153
 claim-0.149
|-0.147
Tokenched
Token ID1740
Feature activation-4.43e+00

Pos logit contributions
use+0.899
 greet+0.718
 claim+0.702
 visit+0.688
 meet+0.648
Neg logit contributions
 very-0.914
 so-0.794
 best-0.773
 scr-0.762
 dots-0.718
Token up
Token ID510
Feature activation-3.80e+00

Pos logit contributions
use+0.134
 greet+0.111
 visit+0.104
 claim+0.103
to+0.101
Neg logit contributions
 very-0.118
 scr-0.107
 best-0.105
 so-0.101
 inches-0.101
Token his
Token ID465
Feature activation+3.45e+00

Pos logit contributions
use+0.150
 greet+0.128
 visit+0.125
 claim+0.124
to+0.118
Neg logit contributions
 very-0.072
 best-0.061
 scr-0.058
 so-0.056
 dots-0.053
Token face
Token ID1986
Feature activation+5.19e-01

Pos logit contributions
 very+0.066
 so+0.053
 best+0.053
 scr+0.052
 inches+0.047
Neg logit contributions
use-0.141
 greet-0.119
 visit-0.117
 claim-0.116
 meet-0.112
Token as
Token ID355
Feature activation+4.36e+00

Pos logit contributions
 very+0.011
 best+0.010
 worst+0.009
 dots+0.009
 inches+0.009
Neg logit contributions
use-0.017
 visit-0.015
 greet-0.014
to-0.014
 claim-0.014
Token play
Token ID711
Feature activation-4.64e+00

Pos logit contributions
use+0.279
enn+0.206
to+0.201
war+0.194
 claim+0.188
Neg logit contributions
 very-0.293
 so-0.263
 best-0.252
 scr-0.239
 skin-0.232
Token a
Token ID257
Feature activation+4.82e+00

Pos logit contributions
use+0.173
 greet+0.144
 claim+0.139
 visit+0.139
 meet+0.133
Neg logit contributions
 very-0.107
 best-0.089
 so-0.089
 scr-0.087
 inches-0.080
Token game
Token ID983
Feature activation+2.88e-01

Pos logit contributions
 very+0.120
 scr+0.109
 so+0.108
 inches+0.107
 best+0.107
Neg logit contributions
use-0.150
 visit-0.129
 greet-0.124
 claim-0.121
 meet-0.117
Token called
Token ID1444
Feature activation+8.40e-01

Pos logit contributions
 very+0.007
 so+0.006
 best+0.006
 scr+0.005
 inches+0.005
Neg logit contributions
use-0.011
 greet-0.009
 claim-0.009
 visit-0.009
|-0.008
Token "
Token ID366
Feature activation-9.50e-01

Pos logit contributions
 very+0.020
 so+0.017
 best+0.016
 scr+0.015
 inches+0.014
Neg logit contributions
use-0.033
 greet-0.026
 claim-0.026
 visit-0.025
to-0.024
Tokenrun
Token ID5143
Feature activation-2.96e+01

Pos logit contributions
use+0.027
 greet+0.022
 claim+0.021
 visit+0.019
 confront+0.019
Neg logit contributions
 very-0.031
 so-0.027
 best-0.026
 scr-0.025
 dots-0.025
Token and
Token ID290
Feature activation+4.68e+00

Pos logit contributions
use+0.976
 greet+0.791
 claim+0.784
 visit+0.751
 meet+0.703
Neg logit contributions
 very-0.834
 so-0.715
 best-0.702
 scr-0.669
 dots-0.626
Token catch
Token ID4929
Feature activation-5.82e+00

Pos logit contributions
 very+0.159
 so+0.144
 best+0.135
 scr+0.131
 ears+0.124
Neg logit contributions
use-0.132
 claim-0.095
 greet-0.095
to-0.094
|-0.091
Token".
Token ID1911
Feature activation-3.72e+00

Pos logit contributions
use+0.203
 greet+0.165
 claim+0.161
 visit+0.159
 meet+0.152
Neg logit contributions
 very-0.151
 so-0.129
 best-0.128
 scr-0.124
 inches-0.116
Token Ben
Token ID3932
Feature activation-3.79e+00

Pos logit contributions
use+0.088
 greet+0.068
 visit+0.067
 To+0.065
war+0.063
Neg logit contributions
 very-0.127
 best-0.117
 inches-0.114
 scr-0.111
 so-0.107
Token was
Token ID373
Feature activation+6.31e+00

Pos logit contributions
use+0.129
 visit+0.107
 greet+0.105
 meet+0.102
|+0.099
Neg logit contributions
 very-0.087
 best-0.079
 scr-0.079
 inches-0.077
 dots-0.075
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
use+0.067
 greet+0.054
war+0.053
|+0.053
 claim+0.053
Neg logit contributions
 very-0.024
 so-0.019
 best-0.016
 scr-0.013
 because-0.013
Token He
Token ID679
Feature activation+9.94e-01

Pos logit contributions
 very+0.035
 so+0.031
 best+0.028
der+0.026
 eyes+0.026
Neg logit contributions
use-0.051
 greet-0.038
 claim-0.037
|-0.036
etch-0.036
Token tried
Token ID3088
Feature activation+2.93e+00

Pos logit contributions
 very+0.025
 so+0.021
 best+0.020
 scr+0.020
 inches+0.018
Neg logit contributions
use-0.036
 greet-0.029
 claim-0.029
 visit-0.028
etch-0.028
Token to
Token ID284
Feature activation-8.18e+00

Pos logit contributions
 very+0.099
 best+0.089
 so+0.088
 scr+0.088
 inches+0.084
Neg logit contributions
use-0.076
 greet-0.058
 claim-0.056
 visit-0.056
 meet-0.052
Token out
Token ID503
Feature activation-4.76e+00

Pos logit contributions
use+0.250
to+0.179
war+0.172
|+0.166
enn+0.165
Neg logit contributions
 very-0.273
 so-0.246
 best-0.229
 scr-0.221
 skin-0.210
Tokenrun
Token ID5143
Feature activation-2.88e+01

Pos logit contributions
use+0.181
 greet+0.148
 claim+0.145
 visit+0.143
|+0.140
Neg logit contributions
 very-0.111
 so-0.094
 best-0.091
 scr-0.086
 inches-0.082
Token the
Token ID262
Feature activation+2.05e+00

Pos logit contributions
use+0.984
 claim+0.803
 greet+0.797
 visit+0.766
|+0.741
Neg logit contributions
 very-0.813
 so-0.704
 best-0.675
 most-0.577
 incredibly-0.554
Token train
Token ID4512
Feature activation+3.90e+00

Pos logit contributions
 very+0.049
 scr+0.045
 so+0.042
 best+0.040
der+0.040
Neg logit contributions
use-0.070
 greet-0.059
 visit-0.058
 claim-0.058
 meet-0.056
Token but
Token ID475
Feature activation-5.59e-01

Pos logit contributions
 very+0.067
 so+0.054
 best+0.050
 scr+0.044
 inches+0.039
Neg logit contributions
use-0.175
 greet-0.147
 claim-0.144
 visit-0.141
to-0.141
Token he
Token ID339
Feature activation+2.69e-01

Pos logit contributions
use+0.020
 greet+0.016
 claim+0.015
 visit+0.015
to+0.014
Neg logit contributions
 very-0.016
 so-0.014
 best-0.013
 scr-0.012
 inches-0.012
Token was
Token ID373
Feature activation+7.25e+00

Pos logit contributions
 very+0.006
 best+0.005
 so+0.005
 scr+0.005
 dots+0.004
Neg logit contributions
use-0.010
 greet-0.009
 claim-0.008
 visit-0.008
 meet-0.008
Token really
Token ID1107
Feature activation+8.47e+00

Pos logit contributions
 very+0.128
 scr+0.118
 best+0.117
 so+0.115
 inches+0.113
Neg logit contributions
use-0.074
 greet-0.055
 visit-0.054
 claim-0.051
to-0.049
Token hard
Token ID1327
Feature activation+6.20e+00

Pos logit contributions
 very+0.106
 best+0.077
 so+0.076
 scr+0.076
 dots+0.065
Neg logit contributions
use-0.396
 greet-0.342
 visit-0.336
 claim-0.334
to-0.325
Token to
Token ID284
Feature activation-7.70e+00

Pos logit contributions
 very+0.160
 scr+0.160
 best+0.145
 dots+0.142
 inches+0.142
Neg logit contributions
use-0.189
 visit-0.153
to-0.147
 greet-0.146
 claim-0.141
Token make
Token ID787
Feature activation-4.43e-01

Pos logit contributions
use+0.232
to+0.173
enn+0.170
war+0.166
|+0.165
Neg logit contributions
 very-0.252
 so-0.220
 scr-0.213
 best-0.202
 skin-0.197
Token the
Token ID262
Feature activation+2.29e+00

Pos logit contributions
use+0.015
 greet+0.013
 visit+0.012
 claim+0.012
 meet+0.012
Neg logit contributions
 very-0.011
 scr-0.009
 best-0.009
 so-0.009
 inches-0.008
Token c
Token ID269
Feature activation+1.16e+01

Pos logit contributions
 very+0.051
 scr+0.048
der+0.048
 inches+0.046
 best+0.044
Neg logit contributions
use-0.079
 greet-0.066
 visit-0.065
 meet-0.064
 claim-0.062
Tokenactus
Token ID34144
Feature activation+7.31e+00

Pos logit contributions
 very+0.303
 so+0.265
 best+0.264
 scr+0.248
 inches+0.242
Neg logit contributions
use-0.388
 greet-0.310
 claim-0.308
to-0.306
 visit-0.305
Token like
Token ID588
Feature activation+1.20e+00

Pos logit contributions
der+0.152
 dots+0.144
 blades+0.135
 inches+0.135
 very+0.134
Neg logit contributions
use-0.221
 greet-0.211
 visit-0.207
 meet-0.195
 claim-0.191
Token it
Token ID340
Feature activation+3.38e+00

Pos logit contributions
 very+0.025
 best+0.022
 scr+0.021
 so+0.020
 inches+0.020
Neg logit contributions
use-0.043
 greet-0.037
 visit-0.037
 claim-0.036
 meet-0.035
Token,
Token ID11
Feature activation+7.16e+00

Pos logit contributions
 very+0.074
 best+0.065
 scr+0.064
 dots+0.063
 inches+0.063
Neg logit contributions
use-0.120
 visit-0.099
 greet-0.098
 claim-0.095
 meet-0.093
Token but
Token ID475
Feature activation+5.44e-02

Pos logit contributions
 very+0.160
 best+0.139
 scr+0.135
 inches+0.127
 dots+0.126
Neg logit contributions
use-0.256
 greet-0.216
 visit-0.212
 claim-0.210
 meet-0.202
Token and
Token ID290
Feature activation+3.95e+00

Pos logit contributions
 very+0.047
 best+0.039
 so+0.039
 scr+0.039
 inches+0.036
Neg logit contributions
use-0.073
 greet-0.061
 claim-0.059
 visit-0.059
 meet-0.056
Token the
Token ID262
Feature activation+5.97e-01

Pos logit contributions
 very+0.126
 so+0.114
 best+0.103
 scr+0.103
 skin+0.097
Neg logit contributions
use-0.120
to-0.089
enn-0.089
etch-0.083
|-0.083
Token swings
Token ID26728
Feature activation+9.79e-01

Pos logit contributions
 very+0.014
 best+0.012
 so+0.012
 scr+0.011
 inches+0.010
Neg logit contributions
use-0.023
 greet-0.019
 claim-0.018
 visit-0.018
to-0.018
Token and
Token ID290
Feature activation+3.98e+00

Pos logit contributions
 very+0.022
 so+0.018
 best+0.017
 scr+0.017
 inches+0.015
Neg logit contributions
use-0.039
 greet-0.032
 claim-0.031
 visit-0.031
to-0.030
Token the
Token ID262
Feature activation+5.89e-01

Pos logit contributions
 very+0.103
 so+0.091
 best+0.081
 scr+0.077
 blades+0.074
Neg logit contributions
use-0.145
to-0.117
enn-0.113
 greet-0.111
 claim-0.111
Token l
Token ID300
Feature activation+1.11e+01

Pos logit contributions
 very+0.015
 best+0.013
 so+0.013
 inches+0.012
 scr+0.012
Neg logit contributions
use-0.021
 greet-0.018
 claim-0.017
to-0.017
 visit-0.017
Tokenadders
Token ID45940
Feature activation-6.23e+00

Pos logit contributions
 very+0.309
 best+0.267
 so+0.262
 scr+0.251
 inches+0.248
Neg logit contributions
use-0.374
 greet-0.299
 visit-0.294
 claim-0.291
 meet-0.272
Token.
Token ID13
Feature activation+8.62e-02

Pos logit contributions
use+0.249
 greet+0.207
 claim+0.200
 visit+0.198
|+0.196
Neg logit contributions
 very-0.138
 so-0.115
 best-0.108
 scr-0.101
 inches-0.095
Token They
Token ID1119
Feature activation-2.21e-01

Pos logit contributions
 very+0.002
 so+0.001
 best+0.001
 because+0.001
 eyes+0.001
Neg logit contributions
use-0.004
 greet-0.003
 claim-0.003
 oblig-0.003
enn-0.003
Token ring
Token ID5858
Feature activation-2.90e+00

Pos logit contributions
use+0.007
to+0.006
 greet+0.005
enn+0.005
 claim+0.005
Neg logit contributions
 very-0.007
 so-0.006
 best-0.006
 scr-0.006
 dots-0.005
Token the
Token ID262
Feature activation+6.83e-01

Pos logit contributions
use+0.093
 greet+0.076
 visit+0.073
 claim+0.073
 meet+0.069
Neg logit contributions
 very-0.078
 best-0.067
 scr-0.067
 so-0.067
 inches-0.063
Token was
Token ID373
Feature activation+5.80e+00

Pos logit contributions
use+0.498
 greet+0.419
 claim+0.408
 visit+0.403
 meet+0.389
Neg logit contributions
 very-0.251
 so-0.203
 best-0.203
 scr-0.192
 inches-0.174
Token a
Token ID257
Feature activation+4.38e+00

Pos logit contributions
der+0.134
 inches+0.126
 scr+0.125
 blades+0.122
 sizes+0.121
Neg logit contributions
 visit-0.168
use-0.161
|-0.153
 meet-0.150
to-0.148
Token girl
Token ID2576
Feature activation+9.55e-01

Pos logit contributions
 very+0.092
 so+0.080
 best+0.079
 scr+0.078
der+0.074
Neg logit contributions
use-0.162
 visit-0.137
 greet-0.136
 claim-0.134
 meet-0.128
Token who
Token ID508
Feature activation-3.05e+00

Pos logit contributions
 very+0.026
 so+0.022
 best+0.021
 scr+0.020
 skin+0.019
Neg logit contributions
use-0.033
 greet-0.027
 claim-0.026
war-0.026
to-0.025
Token was
Token ID373
Feature activation+5.92e+00

Pos logit contributions
use+0.113
 visit+0.098
 greet+0.094
|+0.093
 meet+0.092
Neg logit contributions
der-0.059
 very-0.059
 inches-0.058
 scr-0.056
 dots-0.056
Token very
Token ID845
Feature activation+1.44e+01

Pos logit contributions
der+0.080
 dots+0.061
 sizes+0.057
 inches+0.057
 blades+0.053
Neg logit contributions
 visit-0.243
use-0.233
 greet-0.224
 meet-0.222
 gra-0.216
Token anxious
Token ID18116
Feature activation+3.13e+00

Pos logit contributions
 very+0.236
 so+0.199
der+0.198
 best+0.193
 scr+0.192
Neg logit contributions
use-0.589
 greet-0.510
 visit-0.506
 claim-0.486
 meet-0.485
Token about
Token ID546
Feature activation-4.54e+00

Pos logit contributions
 scr+0.082
 very+0.082
 best+0.081
 inches+0.080
 dots+0.079
Neg logit contributions
use-0.072
 visit-0.070
 greet-0.062
to-0.060
 claim-0.060
Token something
Token ID1223
Feature activation+9.47e+00

Pos logit contributions
use+0.166
 greet+0.133
 claim+0.130
 visit+0.126
war+0.125
Neg logit contributions
 very-0.115
 so-0.098
 best-0.094
 scr-0.092
 inches-0.085
Token.
Token ID13
Feature activation+8.24e-01

Pos logit contributions
 very+0.205
 scr+0.181
 best+0.177
 dots+0.171
 so+0.171
Neg logit contributions
use-0.336
 greet-0.281
 visit-0.276
 claim-0.269
 meet-0.261
Token She
Token ID1375
Feature activation+1.32e+00

Pos logit contributions
 very+0.021
 so+0.018
 best+0.016
 scr+0.016
 skin+0.015
Neg logit contributions
use-0.031
 greet-0.025
 claim-0.024
to-0.023
 visit-0.023
Token,
Token ID11
Feature activation+5.23e+00

Pos logit contributions
use+0.071
 visit+0.060
 greet+0.053
 claim+0.050
 meet+0.049
Neg logit contributions
 scr-0.111
 very-0.102
 extremely-0.100
 inches-0.099
 incredibly-0.098
Token but
Token ID475
Feature activation-2.37e+00

Pos logit contributions
 very+0.100
 scr+0.099
 best+0.097
 inches+0.095
der+0.094
Neg logit contributions
use-0.168
 visit-0.153
 greet-0.149
 meet-0.141
 claim-0.141
Token it
Token ID340
Feature activation+1.92e+00

Pos logit contributions
use+0.029
 visit+0.024
 greet+0.021
 behold+0.020
 meet+0.018
Neg logit contributions
 scr-0.087
der-0.084
 inches-0.082
 centimeters-0.081
 ankle-0.080
Token was
Token ID373
Feature activation+5.89e+00

Pos logit contributions
 very+0.046
 scr+0.045
der+0.044
 best+0.044
 inches+0.043
Neg logit contributions
use-0.060
 visit-0.051
 greet-0.050
 meet-0.047
|-0.047
Token also
Token ID635
Feature activation+2.95e+00

Pos logit contributions
der+0.059
 scr+0.049
 dots+0.046
 inches+0.042
 best+0.039
Neg logit contributions
use-0.244
 greet-0.232
 visit-0.225
 meet-0.220
 claim-0.213
Token very
Token ID845
Feature activation+1.49e+01

Pos logit contributions
der+0.042
 scr+0.033
itive+0.029
 dots+0.029
 inches+0.029
Neg logit contributions
use-0.114
 greet-0.105
 visit-0.104
 meet-0.101
 gra-0.095
Token dirty
Token ID11841
Feature activation+3.64e+00

Pos logit contributions
 very+0.277
 best+0.224
 so+0.223
 scr+0.217
 inches+0.197
Neg logit contributions
use-0.616
 greet-0.524
 visit-0.510
 claim-0.509
 meet-0.491
Token.
Token ID13
Feature activation+9.78e-01

Pos logit contributions
 scr+0.119
 very+0.109
 dots+0.108
 inches+0.108
 best+0.104
Neg logit contributions
use-0.071
 visit-0.055
 claim-0.048
 greet-0.048
away-0.046
Token So
Token ID1406
Feature activation+2.80e+00

Pos logit contributions
 very+0.024
 so+0.021
 best+0.019
 scr+0.018
 skin+0.017
Neg logit contributions
use-0.039
 greet-0.031
 claim-0.030
war-0.029
to-0.028
Token all
Token ID477
Feature activation-6.34e-01

Pos logit contributions
 very+0.054
 scr+0.048
der+0.044
 so+0.043
 best+0.043
Neg logit contributions
use-0.109
 visit-0.094
 greet-0.091
 claim-0.089
 meet-0.088
Token day
Token ID1110
Feature activation-1.52e+00

Pos logit contributions
use+0.016
 greet+0.013
 visit+0.013
 meet+0.012
|+0.012
Neg logit contributions
 very-0.020
 scr-0.019
 best-0.018
der-0.018
 inches-0.018
Token upset
Token ID9247
Feature activation+4.43e+00

Pos logit contributions
 very+0.157
 best+0.119
 so+0.119
 scr+0.113
 inches+0.099
Neg logit contributions
use-0.440
 greet-0.377
 claim-0.369
 visit-0.367
 meet-0.355
Token.
Token ID13
Feature activation+3.39e+00

Pos logit contributions
 very+0.097
 best+0.086
 scr+0.084
 inches+0.079
 blades+0.078
Neg logit contributions
use-0.159
 greet-0.130
 visit-0.129
 claim-0.127
to-0.123
Token She
Token ID1375
Feature activation+4.25e+00

Pos logit contributions
 very+0.079
 so+0.069
 scr+0.059
 best+0.056
 because+0.054
Neg logit contributions
use-0.137
 greet-0.115
 claim-0.111
war-0.109
|-0.108
Token saw
Token ID2497
Feature activation-3.67e+00

Pos logit contributions
 very+0.115
 so+0.096
 scr+0.088
 best+0.086
 ears+0.077
Neg logit contributions
use-0.150
 greet-0.127
 claim-0.123
war-0.121
 visit-0.120
Token the
Token ID262
Feature activation+4.54e+00

Pos logit contributions
use+0.132
 greet+0.106
 visit+0.102
 claim+0.102
|+0.101
Neg logit contributions
 very-0.099
 so-0.088
 best-0.080
 scr-0.078
der-0.073
Token jelly
Token ID27644
Feature activation+1.29e+01

Pos logit contributions
 very+0.099
 so+0.083
 scr+0.082
 best+0.082
 inches+0.074
Neg logit contributions
use-0.169
 greet-0.141
 claim-0.138
 visit-0.138
 meet-0.131
Token everywhere
Token ID8347
Feature activation-2.48e+00

Pos logit contributions
 very+0.323
 scr+0.283
 best+0.282
 so+0.273
 ears+0.263
Neg logit contributions
use-0.413
 greet-0.329
 visit-0.321
 claim-0.318
 meet-0.302
Token.
Token ID13
Feature activation+2.17e+00

Pos logit contributions
use+0.091
 greet+0.075
 claim+0.073
 visit+0.073
 meet+0.070
Neg logit contributions
 very-0.057
 best-0.049
 so-0.048
 scr-0.047
 inches-0.044
Token She
Token ID1375
Feature activation+2.86e+00

Pos logit contributions
 very+0.050
 so+0.045
 scr+0.038
 best+0.038
 eyes+0.036
Neg logit contributions
use-0.086
 greet-0.071
 claim-0.068
 visit-0.067
war-0.066
Token said
Token ID531
Feature activation-1.04e+00

Pos logit contributions
 very+0.078
 so+0.067
 best+0.065
 scr+0.064
 inches+0.060
Neg logit contributions
use-0.097
 greet-0.079
 claim-0.076
 visit-0.076
 meet-0.072
Token,
Token ID11
Feature activation+6.27e+00

Pos logit contributions
use+0.042
 greet+0.036
 claim+0.035
 visit+0.035
|+0.034
Neg logit contributions
 very-0.022
 so-0.018
 best-0.017
 scr-0.016
 inches-0.015
Token a
Token ID257
Feature activation+3.96e+00

Pos logit contributions
 very+0.075
 best+0.072
 scr+0.065
 inches+0.063
der+0.062
Neg logit contributions
use-0.173
 greet-0.153
 visit-0.151
 claim-0.150
 meet-0.143
Token point
Token ID966
Feature activation-2.96e+00

Pos logit contributions
 very+0.087
 best+0.077
 so+0.077
 scr+0.077
der+0.075
Neg logit contributions
use-0.138
 greet-0.115
 visit-0.115
 claim-0.113
 meet-0.107
Tokeny
Token ID88
Feature activation-2.36e+00

Pos logit contributions
use+0.098
 greet+0.078
 visit+0.077
 claim+0.077
to+0.076
Neg logit contributions
 very-0.074
 best-0.066
 scr-0.063
 so-0.062
 inches-0.059
Token stick
Token ID4859
Feature activation-1.14e+00

Pos logit contributions
use+0.068
 visit+0.061
 claim+0.057
 greet+0.057
 meet+0.054
Neg logit contributions
 very-0.055
 best-0.055
 scr-0.052
 sizes-0.050
der-0.049
Token on
Token ID319
Feature activation-1.05e+01

Pos logit contributions
use+0.039
 greet+0.032
 claim+0.032
 visit+0.031
to+0.030
Neg logit contributions
 very-0.027
 best-0.024
 scr-0.023
 so-0.022
 inches-0.021
Token the
Token ID262
Feature activation+5.23e-01

Pos logit contributions
use+0.256
 greet+0.211
 claim+0.203
 visit+0.202
 meet+0.183
Neg logit contributions
 very-0.337
 best-0.321
 scr-0.304
 so-0.293
 inches-0.290
Token ground
Token ID2323
Feature activation+1.83e+00

Pos logit contributions
 very+0.010
 scr+0.009
 best+0.009
der+0.008
 so+0.008
Neg logit contributions
use-0.019
 greet-0.016
 visit-0.016
 claim-0.016
 meet-0.016
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
 very+0.048
 best+0.043
 scr+0.041
 so+0.040
 inches+0.038
Neg logit contributions
use-0.057
 greet-0.047
 claim-0.046
 visit-0.046
to-0.043
Token She
Token ID1375
Feature activation+7.19e-01

Pos logit contributions
 very+0.006
 so+0.005
 scr+0.005
 best+0.004
 skin+0.004
Neg logit contributions
use-0.009
 greet-0.007
 claim-0.007
 visit-0.007
|-0.007
Token picked
Token ID6497
Feature activation-1.37e+00

Pos logit contributions
 very+0.021
 best+0.019
 so+0.018
 scr+0.018
 inches+0.017
Neg logit contributions
use-0.022
 greet-0.017
 visit-0.017
 claim-0.017
to-0.016
Token it
Token ID340
Feature activation+1.32e+00

Pos logit contributions
use+0.039
 greet+0.031
 claim+0.030
 visit+0.030
to+0.029
Neg logit contributions
 very-0.042
 scr-0.037
 best-0.037
 so-0.036
 inches-0.035
Token make
Token ID787
Feature activation-2.41e+00

Pos logit contributions
use+0.017
 greet+0.014
 claim+0.013
to+0.013
 visit+0.013
Neg logit contributions
 very-0.018
 so-0.016
 best-0.015
 scr-0.014
 inches-0.013
Token mistakes
Token ID10135
Feature activation-1.74e+00

Pos logit contributions
use+0.084
 greet+0.070
 visit+0.069
 claim+0.068
to+0.065
Neg logit contributions
 very-0.058
 scr-0.049
 so-0.048
 best-0.048
 dots-0.046
Token and
Token ID290
Feature activation+4.31e+00

Pos logit contributions
use+0.072
 greet+0.061
 visit+0.060
 claim+0.059
 meet+0.057
Neg logit contributions
 very-0.032
 best-0.026
 scr-0.025
 so-0.025
 inches-0.023
Token no
Token ID645
Feature activation+3.57e+00

Pos logit contributions
 very+0.137
 so+0.121
 best+0.120
 scr+0.116
 inches+0.110
Neg logit contributions
use-0.125
 greet-0.097
 claim-0.093
 visit-0.091
|-0.088
Token one
Token ID530
Feature activation+3.88e+00

Pos logit contributions
 very+0.097
 best+0.084
 so+0.082
 scr+0.078
 inches+0.075
Neg logit contributions
use-0.122
 greet-0.099
 claim-0.095
 visit-0.093
|-0.093
Token is
Token ID318
Feature activation+5.50e+00

Pos logit contributions
 very+0.101
 so+0.083
 best+0.082
 inches+0.071
 scr+0.069
Neg logit contributions
use-0.141
 greet-0.121
 claim-0.118
|-0.114
Ell-0.113
Token perfect
Token ID2818
Feature activation-1.66e+00

Pos logit contributions
 very+0.114
 so+0.093
 best+0.092
 scr+0.085
 inches+0.079
Neg logit contributions
use-0.224
 greet-0.188
 claim-0.183
 visit-0.182
|-0.178
Token.
Token ID13
Feature activation+4.23e-01

Pos logit contributions
use+0.061
 greet+0.051
 claim+0.049
 visit+0.048
|+0.048
Neg logit contributions
 very-0.041
 so-0.035
 best-0.034
 scr-0.031
 inches-0.030
Token Being
Token ID11204
Feature activation+2.51e+00

Pos logit contributions
 very+0.013
 so+0.012
 best+0.011
 scr+0.010
 because+0.010
Neg logit contributions
use-0.013
 greet-0.010
 claim-0.010
war-0.010
talk-0.009
Token careful
Token ID8161
Feature activation+2.43e+00

Pos logit contributions
 very+0.046
 scr+0.042
 dots+0.039
 skin+0.038
 inches+0.038
Neg logit contributions
use-0.093
 greet-0.079
 visit-0.078
 claim-0.075
 meet-0.075
Token and
Token ID290
Feature activation+3.97e+00

Pos logit contributions
 very+0.052
 best+0.043
 so+0.043
 scr+0.042
 inches+0.039
Neg logit contributions
use-0.093
 greet-0.078
 visit-0.076
 claim-0.076
 meet-0.073
Token were
Token ID547
Feature activation+4.71e+00

Pos logit contributions
use+0.151
 greet+0.124
 visit+0.122
 claim+0.120
 meet+0.117
Neg logit contributions
 very-0.104
 best-0.090
 so-0.087
 scr-0.086
 inches-0.084
Token very
Token ID845
Feature activation+1.38e+01

Pos logit contributions
der+0.057
 dots+0.052
 very+0.051
 scr+0.048
 blades+0.046
Neg logit contributions
use-0.192
 greet-0.179
 visit-0.177
 meet-0.175
 claim-0.169
Token happy
Token ID3772
Feature activation+2.57e+00

Pos logit contributions
 very+0.244
der+0.239
 dots+0.231
 blades+0.223
 scr+0.215
Neg logit contributions
use-0.510
 greet-0.458
 visit-0.443
 meet-0.433
 claim-0.426
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
 very+0.066
 scr+0.058
 best+0.057
 so+0.056
 inches+0.053
Neg logit contributions
use-0.084
 greet-0.070
 visit-0.069
 claim-0.068
 meet-0.065
Token They
Token ID1119
Feature activation-1.39e-01

Pos logit contributions
 very+0.004
 so+0.003
 best+0.003
 scr+0.003
 dots+0.003
Neg logit contributions
use-0.004
 greet-0.003
 claim-0.003
 visit-0.003
 meet-0.003
Token had
Token ID550
Feature activation-2.57e-01

Pos logit contributions
use+0.005
 greet+0.004
 visit+0.004
 claim+0.004
 meet+0.004
Neg logit contributions
 very-0.004
 best-0.003
 so-0.003
 scr-0.003
 inches-0.003
Token shared
Token ID4888
Feature activation-1.82e+00

Pos logit contributions
use+0.009
 greet+0.007
 visit+0.007
 claim+0.007
 meet+0.007
Neg logit contributions
 very-0.006
 best-0.005
 scr-0.005
 so-0.005
 inches-0.005
Token a
Token ID257
Feature activation+4.30e+00

Pos logit contributions
use+0.064
 greet+0.052
 claim+0.051
 visit+0.050
to+0.048
Neg logit contributions
 very-0.047
 so-0.040
 best-0.040
 scr-0.038
 inches-0.036
Token lot
Token ID1256
Feature activation+6.98e+00

Pos logit contributions
 very+0.092
 so+0.079
 best+0.078
 scr+0.077
 inches+0.075
Neg logit contributions
use-0.158
 greet-0.135
 visit-0.134
 claim-0.130
 meet-0.126
Token and
Token ID290
Feature activation+4.24e+00

Pos logit contributions
 very+0.214
 scr+0.202
 best+0.198
 ears+0.195
 blades+0.194
Neg logit contributions
use-0.178
 greet-0.146
 visit-0.139
 claim-0.135
 meet-0.129
Token learned
Token ID4499
Feature activation+5.11e+00

Pos logit contributions
 very+0.107
 scr+0.097
 best+0.095
 inches+0.093
 blades+0.092
Neg logit contributions
use-0.131
 visit-0.115
 greet-0.112
 meet-0.109
 claim-0.106
Token a
Token ID257
Feature activation+6.15e+00

Pos logit contributions
 very+0.103
 scr+0.085
 best+0.083
 inches+0.081
 dots+0.080
Neg logit contributions
use-0.226
 greet-0.198
 visit-0.195
 claim-0.193
 meet-0.185
Token big
Token ID1263
Feature activation-8.72e-01

Pos logit contributions
 very+0.137
 best+0.119
 scr+0.118
 so+0.118
 inches+0.117
Neg logit contributions
use-0.219
 greet-0.186
 visit-0.184
 claim-0.179
 meet-0.172
Token hug
Token ID16225
Feature activation+2.28e+00

Pos logit contributions
use+0.018
 greet+0.014
 visit+0.013
 claim+0.012
 meet+0.011
Neg logit contributions
 very-0.033
 scr-0.030
 best-0.030
 so-0.030
der-0.029
Token and
Token ID290
Feature activation+6.27e+00

Pos logit contributions
 very+0.039
 scr+0.033
 best+0.032
 inches+0.030
 dots+0.029
Neg logit contributions
use-0.093
 greet-0.079
 visit-0.079
 claim-0.078
 meet-0.075
Token praised
Token ID15342
Feature activation+5.26e-01

Pos logit contributions
 very+0.181
 best+0.159
 scr+0.156
 so+0.156
 inches+0.151
Neg logit contributions
use-0.188
 greet-0.154
 visit-0.150
 claim-0.147
 meet-0.139
Token her
Token ID607
Feature activation+5.93e+00

Pos logit contributions
 very+0.013
 so+0.012
 best+0.010
 scr+0.010
 inches+0.009
Neg logit contributions
use-0.021
 greet-0.016
 claim-0.016
etch-0.015
 visit-0.015
Token for
Token ID329
Feature activation-5.63e+00

Pos logit contributions
 very+0.148
 best+0.125
 so+0.124
 scr+0.118
 inches+0.110
Neg logit contributions
use-0.218
 greet-0.177
 claim-0.172
 visit-0.170
|-0.165
Token completing
Token ID14339
Feature activation+3.56e-01

Pos logit contributions
use+0.230
 greet+0.184
 claim+0.181
 visit+0.180
war+0.179
Neg logit contributions
 very-0.128
 so-0.107
 best-0.101
 scr-0.093
 inches-0.090
Token the
Token ID262
Feature activation+2.84e+00

Pos logit contributions
 very+0.010
 best+0.009
 so+0.009
 scr+0.009
 dots+0.009
Neg logit contributions
use-0.011
 greet-0.009
 claim-0.008
 visit-0.008
 meet-0.008
Token toy
Token ID13373
Feature activation+4.60e+00

Pos logit contributions
 very+0.070
 so+0.059
 best+0.059
 scr+0.057
 inches+0.053
Neg logit contributions
use-0.102
 greet-0.084
 claim-0.082
 visit-0.082
 meet-0.078
Token.
Token ID13
Feature activation+2.52e+00

Pos logit contributions
 very+0.116
 best+0.098
 so+0.098
 scr+0.096
 inches+0.090
Neg logit contributions
use-0.160
 greet-0.131
 claim-0.128
 visit-0.128
 meet-0.121
Token day
Token ID1110
Feature activation+3.05e+00

Pos logit contributions
 very+0.081
 scr+0.072
 best+0.071
 so+0.070
der+0.069
Neg logit contributions
use-0.093
 visit-0.078
 greet-0.076
 meet-0.072
 claim-0.072
Token John
Token ID1757
Feature activation-2.63e+00

Pos logit contributions
 very+0.046
 best+0.034
 so+0.033
 scr+0.028
 inches+0.024
Neg logit contributions
use-0.142
 greet-0.124
 claim-0.122
 visit-0.118
 meet-0.115
Token went
Token ID1816
Feature activation+1.95e+00

Pos logit contributions
use+0.101
 greet+0.085
 claim+0.083
 visit+0.081
war+0.079
Neg logit contributions
 very-0.057
 so-0.047
 best-0.046
 scr-0.044
 inches-0.040
Token to
Token ID284
Feature activation-5.53e+00

Pos logit contributions
 very+0.055
 so+0.047
 best+0.043
 because+0.038
 too+0.036
Neg logit contributions
use-0.070
 claim-0.057
 greet-0.056
war-0.052
|-0.052
Token visit
Token ID3187
Feature activation-6.80e+00

Pos logit contributions
use+0.239
war+0.191
talk+0.188
etch+0.182
enn+0.181
Neg logit contributions
 very-0.124
 so-0.107
 best-0.087
 a-0.079
 skin-0.078
Token his
Token ID465
Feature activation+6.05e+00

Pos logit contributions
use+0.307
war+0.248
|+0.242
talk+0.235
Ell+0.235
Neg logit contributions
 very-0.152
 so-0.130
 best-0.104
 because-0.102
 a-0.086
Token grand
Token ID4490
Feature activation-2.66e+00

Pos logit contributions
 very+0.154
 best+0.132
 so+0.128
 scr+0.118
 most+0.111
Neg logit contributions
use-0.226
 greet-0.181
 claim-0.174
|-0.172
war-0.172
Tokenpa
Token ID8957
Feature activation-1.18e+00

Pos logit contributions
use+0.081
 greet+0.063
 claim+0.061
war+0.059
to+0.058
Neg logit contributions
 very-0.081
 so-0.071
 best-0.070
 scr-0.067
 inches-0.064
Token.
Token ID13
Feature activation+2.81e+00

Pos logit contributions
use+0.051
war+0.043
 greet+0.043
Ell+0.042
|+0.042
Neg logit contributions
 very-0.024
 so-0.020
 best-0.016
 because-0.015
 scr-0.014
Token Grand
Token ID5675
Feature activation-3.20e+00

Pos logit contributions
 very+0.076
 so+0.067
 scr+0.062
 best+0.062
 dots+0.057
Neg logit contributions
use-0.098
 greet-0.075
 claim-0.073
|-0.072
war-0.070
Tokenpa
Token ID8957
Feature activation-2.11e+00

Pos logit contributions
use+0.077
 greet+0.057
 claim+0.054
 visit+0.054
 meet+0.050
Neg logit contributions
 very-0.116
 best-0.103
 so-0.103
 scr-0.102
 inches-0.097
Token to
Token ID284
Feature activation-7.71e+00

Pos logit contributions
 very+0.156
 so+0.139
 best+0.135
 scr+0.131
 inches+0.126
Neg logit contributions
use-0.113
 greet-0.085
 claim-0.082
 visit-0.078
 meet-0.075
Token melt
Token ID16867
Feature activation+8.14e-01

Pos logit contributions
use+0.250
 greet+0.203
 claim+0.196
 visit+0.196
 meet+0.186
Neg logit contributions
 very-0.213
 best-0.185
 so-0.183
 scr-0.178
 inches-0.169
Token!
Token ID0
Feature activation+1.10e+00

Pos logit contributions
 very+0.021
 scr+0.019
 best+0.019
der+0.018
 so+0.018
Neg logit contributions
use-0.025
 greet-0.020
 visit-0.019
 claim-0.019
to-0.018
Token
Token ID198
Feature activation-4.79e+00

Pos logit contributions
 very+0.030
 so+0.027
 best+0.023
 most+0.022
 eyes+0.022
Neg logit contributions
use-0.040
 greet-0.032
 claim-0.031
etch-0.030
Aud-0.029
Token
Token ID198
Feature activation-4.74e+00

Pos logit contributions
use+0.198
 claim+0.153
 greet+0.149
 gra+0.145
etch+0.144
Neg logit contributions
 very-0.132
 so-0.107
 really-0.089
 best-0.084
 incredibly-0.080
TokenAt
Token ID2953
Feature activation-7.18e+00

Pos logit contributions
use+0.154
 greet+0.117
 claim+0.115
|+0.110
etch+0.109
Neg logit contributions
 very-0.150
 so-0.127
 incredibly-0.116
 best-0.114
 extremely-0.114
Token first
Token ID717
Feature activation-3.64e+00

Pos logit contributions
use+0.251
 greet+0.201
 claim+0.196
war+0.188
to+0.188
Neg logit contributions
 very-0.195
 so-0.169
 best-0.163
 scr-0.146
 inches-0.144
Token,
Token ID11
Feature activation+7.17e+00

Pos logit contributions
use+0.176
 greet+0.153
 claim+0.149
to+0.147
etch+0.144
Neg logit contributions
 very-0.050
 so-0.038
 best-0.035
 inches-0.025
 ears-0.023
Token Peter
Token ID5613
Feature activation-6.60e-01

Pos logit contributions
 very+0.252
 so+0.224
 best+0.223
 scr+0.216
 inches+0.207
Neg logit contributions
use-0.186
 greet-0.139
 claim-0.133
 visit-0.130
|-0.123
Token did
Token ID750
Feature activation+2.84e+00

Pos logit contributions
use+0.024
 greet+0.020
 claim+0.020
 visit+0.020
 meet+0.019
Neg logit contributions
 very-0.015
 best-0.013
 so-0.013
 scr-0.012
 inches-0.012
Token not
Token ID407
Feature activation+2.54e+00

Pos logit contributions
 very+0.058
 best+0.049
 so+0.047
 scr+0.046
 dots+0.043
Neg logit contributions
use-0.111
 greet-0.094
 visit-0.091
 claim-0.091
 meet-0.088
Token and
Token ID290
Feature activation+3.90e+00

Pos logit contributions
use+0.029
 greet+0.024
 claim+0.023
 visit+0.022
|+0.022
Neg logit contributions
 very-0.017
 so-0.014
 best-0.014
 scr-0.012
 inches-0.012
Token knowledge
Token ID3725
Feature activation-3.47e-01

Pos logit contributions
 very+0.112
 so+0.097
 best+0.097
 scr+0.092
 inches+0.087
Neg logit contributions
use-0.127
 greet-0.099
 claim-0.097
|-0.094
 visit-0.094
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
use+0.013
 greet+0.011
 claim+0.011
 visit+0.010
|+0.010
Neg logit contributions
 very-0.008
 so-0.007
 best-0.007
 scr-0.006
 inches-0.006
Token<|endoftext|>
Token ID50256
Feature activation-3.04e+00

Pos logit contributions
 very+0.007
 so+0.006
 best+0.006
 because+0.005
 most+0.005
Neg logit contributions
use-0.006
 greet-0.005
 claim-0.004
war-0.004
etch-0.004
TokenOnce
Token ID7454
Feature activation-1.85e+00

Pos logit contributions
use+0.107
 claim+0.085
 greet+0.085
etch+0.079
 confront+0.078
Neg logit contributions
 very-0.090
 so-0.077
 best-0.072
 air-0.063
 too-0.063
Token upon
Token ID2402
Feature activation-5.02e+00

Pos logit contributions
use+0.065
 greet+0.055
 claim+0.052
talk+0.052
war+0.049
Neg logit contributions
 very-0.057
 best-0.049
 so-0.048
 because-0.041
 too-0.039
Token a
Token ID257
Feature activation+3.86e+00

Pos logit contributions
use+0.186
 visit+0.171
 greet+0.168
 claim+0.162
to+0.162
Neg logit contributions
 very-0.086
 best-0.079
 scr-0.076
 blades-0.074
 inches-0.074
Token time
Token ID640
Feature activation-4.41e+00

Pos logit contributions
 very+0.108
 best+0.092
 so+0.092
 scr+0.089
 inches+0.081
Neg logit contributions
use-0.128
 greet-0.105
 claim-0.103
 visit-0.096
 meet-0.094
Token there
Token ID612
Feature activation-1.31e+01

Pos logit contributions
use+0.199
 greet+0.171
 claim+0.168
 visit+0.165
 meet+0.160
Neg logit contributions
 very-0.069
 so-0.052
 best-0.052
 scr-0.047
 inches-0.041
Token were
Token ID547
Feature activation+4.43e+00

Pos logit contributions
use+0.539
 greet+0.453
 claim+0.442
 visit+0.434
 meet+0.420
Neg logit contributions
 very-0.263
 so-0.211
 best-0.210
 scr-0.196
 inches-0.176
Token two
Token ID734
Feature activation-2.95e+00

Pos logit contributions
 very+0.088
 scr+0.072
 so+0.070
 best+0.070
 inches+0.064
Neg logit contributions
use-0.175
 greet-0.147
 visit-0.145
 claim-0.142
 meet-0.140
Token "
Token ID366
Feature activation-1.66e+00

Pos logit contributions
 very+0.119
 so+0.101
 best+0.097
 scr+0.096
der+0.087
Neg logit contributions
use-0.174
 greet-0.142
 claim-0.139
 visit-0.137
|-0.135
TokenI
Token ID40
Feature activation-5.92e+00

Pos logit contributions
use+0.053
 greet+0.042
etch+0.041
irs+0.040
 claim+0.040
Neg logit contributions
 very-0.043
 dots-0.039
 so-0.039
 scr-0.038
 most-0.037
Token can
Token ID460
Feature activation-1.35e+00

Pos logit contributions
use+0.192
 greet+0.158
 claim+0.155
to+0.146
war+0.146
Neg logit contributions
 very-0.168
 so-0.142
 best-0.137
 scr-0.135
 because-0.118
Token't
Token ID470
Feature activation-3.14e+00

Pos logit contributions
use+0.040
 greet+0.029
war+0.029
to+0.028
 claim+0.028
Neg logit contributions
 very-0.045
 so-0.040
 best-0.039
 scr-0.037
 skin-0.034
Token run
Token ID1057
Feature activation-2.64e+00

Pos logit contributions
use+0.099
to+0.079
war+0.076
enn+0.076
 greet+0.075
Neg logit contributions
 very-0.096
 so-0.087
 best-0.078
 because-0.077
 scr-0.076
Token anymore
Token ID7471
Feature activation-5.26e+00

Pos logit contributions
use+0.098
 greet+0.082
 claim+0.080
 visit+0.079
 meet+0.076
Neg logit contributions
 very-0.062
 so-0.052
 best-0.051
 scr-0.049
 inches-0.045
Token."
Token ID526
Feature activation-3.74e+00

Pos logit contributions
use+0.190
 greet+0.157
 claim+0.152
 visit+0.148
war+0.144
Neg logit contributions
 very-0.133
 so-0.116
 best-0.111
 scr-0.105
 inches-0.100
Token
Token ID220
Feature activation+2.15e+00

Pos logit contributions
use+0.118
 greet+0.090
etch+0.087
 claim+0.087
war+0.083
Neg logit contributions
 very-0.119
 so-0.109
 best-0.098
 ears-0.092
 scr-0.092
Token
Token ID198
Feature activation-6.94e+00

Pos logit contributions
 very+0.056
 best+0.051
 so+0.051
der+0.039
 most+0.039
Neg logit contributions
use-0.089
 greet-0.068
away-0.065
talk-0.063
|-0.063
Token
Token ID198
Feature activation-6.92e+00

Pos logit contributions
use+0.286
 greet+0.223
 claim+0.218
etch+0.209
 trace+0.207
Neg logit contributions
 very-0.184
 so-0.148
 best-0.125
 really-0.107
 and-0.103
TokenCharlie
Token ID37136
Feature activation-8.65e+00

Pos logit contributions
use+0.229
 greet+0.181
|+0.170
 claim+0.170
etch+0.163
Neg logit contributions
 very-0.212
 so-0.178
 best-0.165
 scr-0.165
 extremely-0.155
Token
Token ID198
Feature activation-6.08e+00

Pos logit contributions
 very+0.077
 so+0.067
 best+0.066
 because+0.054
 most+0.054
Neg logit contributions
use-0.100
 greet-0.082
 claim-0.074
away-0.071
 oblig-0.071
Token
Token ID198
Feature activation-6.04e+00

Pos logit contributions
use+0.235
 greet+0.198
 claim+0.184
 gra+0.175
 visit+0.169
Neg logit contributions
 very-0.172
 so-0.143
 best-0.128
 really-0.114
 most-0.107
TokenSuddenly
Token ID38582
Feature activation-6.94e+00

Pos logit contributions
use+0.178
 greet+0.144
 claim+0.128
 visit+0.125
 oblig+0.121
Neg logit contributions
 very-0.207
 so-0.179
 best-0.173
 scr-0.170
 incredibly-0.161
Token,
Token ID11
Feature activation+5.31e+00

Pos logit contributions
use+0.270
 visit+0.226
 greet+0.222
 claim+0.222
 meet+0.210
Neg logit contributions
 very-0.134
 scr-0.120
 best-0.108
 so-0.107
 inches-0.105
Token Tim
Token ID5045
Feature activation-6.76e+00

Pos logit contributions
 very+0.188
 scr+0.169
 best+0.169
 so+0.167
 inches+0.162
Neg logit contributions
use-0.126
 greet-0.094
 visit-0.092
 claim-0.092
 meet-0.083
Tokenmy
Token ID1820
Feature activation-6.67e+00

Pos logit contributions
use+0.196
 greet+0.157
 claim+0.150
 visit+0.147
 meet+0.138
Neg logit contributions
 very-0.215
 best-0.188
 so-0.188
 scr-0.181
 inches-0.172
Token had
Token ID550
Feature activation+6.79e-02

Pos logit contributions
use+0.217
 greet+0.170
 visit+0.169
 claim+0.168
to+0.162
Neg logit contributions
 very-0.176
 best-0.155
 scr-0.153
 so-0.150
 inches-0.147
Token an
Token ID281
Feature activation+5.61e+00

Pos logit contributions
 very+0.001
 best+0.001
 scr+0.001
 so+0.001
 inches+0.001
Neg logit contributions
use-0.003
 greet-0.002
 visit-0.002
 claim-0.002
 meet-0.002
Token idea
Token ID2126
Feature activation+5.47e+00

Pos logit contributions
 dots+0.132
 scalp+0.120
der+0.118
 sizes+0.117
 inches+0.115
Neg logit contributions
use-0.148
 visit-0.140
 greet-0.136
etch-0.134
 ask-0.134
Token.
Token ID13
Feature activation+2.28e-01

Pos logit contributions
 very+0.115
 so+0.096
 best+0.082
 because+0.078
 scr+0.077
Neg logit contributions
use-0.227
 greet-0.193
 claim-0.185
|-0.185
 visit-0.182
Token He
Token ID679
Feature activation-1.90e-01

Pos logit contributions
 very+0.007
 so+0.006
 best+0.006
 scr+0.005
 inches+0.005
Neg logit contributions
use-0.007
 greet-0.006
 claim-0.006
 visit-0.006
 meet-0.005
TokenWhen
Token ID2215
Feature activation-3.38e+00

Pos logit contributions
use+0.253
 claim+0.197
 greet+0.196
 visit+0.189
|+0.181
Neg logit contributions
 very-0.191
 so-0.163
 scr-0.150
 incredibly-0.139
 best-0.136
Token the
Token ID262
Feature activation+3.02e-01

Pos logit contributions
use+0.124
 greet+0.100
 claim+0.097
etch+0.095
 visit+0.094
Neg logit contributions
 very-0.087
 so-0.073
 best-0.072
 scr-0.066
 inches-0.063
Token pig
Token ID12967
Feature activation+1.30e+00

Pos logit contributions
 very+0.008
 so+0.007
 scr+0.007
 best+0.007
 inches+0.006
Neg logit contributions
use-0.010
 greet-0.008
 visit-0.008
 claim-0.008
 meet-0.008
Tokengy
Token ID1360
Feature activation-4.96e+00

Pos logit contributions
 very+0.035
 so+0.030
 best+0.030
 scr+0.028
 ears+0.026
Neg logit contributions
use-0.044
 greet-0.036
 claim-0.035
etch-0.034
 visit-0.033
Token pe
Token ID613
Feature activation+6.27e+00

Pos logit contributions
use+0.176
 greet+0.145
 claim+0.140
 visit+0.137
to+0.133
Neg logit contributions
 very-0.124
 so-0.105
 best-0.104
 scr-0.100
 inches-0.093
Tokenered
Token ID1068
Feature activation-3.64e+00

Pos logit contributions
 very+0.152
 best+0.131
 so+0.125
 inches+0.125
 scr+0.125
Neg logit contributions
use-0.223
 greet-0.190
 visit-0.187
 claim-0.184
 meet-0.178
Token inside
Token ID2641
Feature activation-1.04e+01

Pos logit contributions
use+0.111
 greet+0.089
 claim+0.085
 visit+0.084
 meet+0.079
Neg logit contributions
 very-0.107
 best-0.095
 scr-0.093
 so-0.093
 inches-0.087
Token the
Token ID262
Feature activation+2.93e-01

Pos logit contributions
use+0.537
war+0.465
talk+0.463
|+0.455
etch+0.453
Neg logit contributions
 very-0.140
 so-0.113
 best-0.075
 because-0.060
 eyes-0.051
Token hole
Token ID7604
Feature activation+6.74e+00

Pos logit contributions
 very+0.006
 so+0.005
 best+0.005
 scr+0.005
 inches+0.005
Neg logit contributions
use-0.012
 greet-0.010
 claim-0.009
 visit-0.009
to-0.009
Token,
Token ID11
Feature activation+4.51e+00

Pos logit contributions
 very+0.112
 so+0.087
 best+0.086
 scr+0.081
 inches+0.072
Neg logit contributions
use-0.295
 greet-0.252
 claim-0.246
 visit-0.245
 meet-0.237
Token she
Token ID673
Feature activation+1.35e-01

Pos logit contributions
 very+0.115
 so+0.099
 best+0.096
 scr+0.091
 inches+0.085
Neg logit contributions
use-0.165
 greet-0.133
 claim-0.129
 visit-0.127
etch-0.125
Token a
Token ID257
Feature activation+3.70e+00

Pos logit contributions
use+0.074
to+0.063
war+0.062
of+0.061
|+0.060
Neg logit contributions
 very-0.052
 so-0.041
 best-0.036
 inches-0.033
 because-0.033
Token ship
Token ID4074
Feature activation+1.69e+00

Pos logit contributions
 very+0.100
 so+0.078
 best+0.072
 scr+0.068
 skin+0.061
Neg logit contributions
use-0.135
to-0.111
 greet-0.111
talk-0.110
|-0.108
Token that
Token ID326
Feature activation-6.36e+00

Pos logit contributions
 very+0.035
 so+0.029
 best+0.024
 because+0.021
 too+0.021
Neg logit contributions
use-0.072
 greet-0.061
to-0.059
|-0.059
 claim-0.058
Token sank
Token ID30895
Feature activation-2.61e+00

Pos logit contributions
use+0.249
 greet+0.207
 claim+0.204
etch+0.194
war+0.194
Neg logit contributions
 very-0.146
 best-0.121
 so-0.120
 scr-0.108
 most-0.101
Token long
Token ID890
Feature activation-3.37e+00

Pos logit contributions
use+0.085
 greet+0.069
 claim+0.067
 visit+0.066
|+0.063
Neg logit contributions
 very-0.075
 so-0.064
 best-0.064
 scr-0.060
 inches-0.058
Token ago
Token ID2084
Feature activation-2.07e+01

Pos logit contributions
use+0.131
 greet+0.110
 claim+0.107
 visit+0.106
to+0.104
Neg logit contributions
 very-0.073
 so-0.061
 best-0.060
 scr-0.056
 inches-0.054
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
use+0.674
 visit+0.544
 greet+0.542
 claim+0.526
 meet+0.511
Neg logit contributions
 very-0.527
 scr-0.461
 best-0.458
 so-0.442
 dots-0.428
Token She
Token ID1375
Feature activation+9.47e-01

Pos logit contributions
use+0.000
 greet+0.000
 claim+0.000
 visit+0.000
war+0.000
Neg logit contributions
 very-0.000
 so-0.000
 best-0.000
 scr-0.000
der-0.000
Token said
Token ID531
Feature activation-2.42e+00

Pos logit contributions
 very+0.026
 best+0.023
 so+0.023
 scr+0.022
 inches+0.022
Neg logit contributions
use-0.030
 greet-0.024
 visit-0.024
 claim-0.024
 meet-0.022
Token they
Token ID484
Feature activation-2.39e+00

Pos logit contributions
use+0.091
 greet+0.076
 claim+0.074
 visit+0.073
|+0.070
Neg logit contributions
 very-0.056
 so-0.047
 best-0.046
 scr-0.044
 inches-0.041
Token could
Token ID714
Feature activation-2.78e+00

Pos logit contributions
use+0.084
 greet+0.069
 visit+0.067
 claim+0.067
 meet+0.065
Neg logit contributions
 very-0.058
 best-0.051
 scr-0.049
 so-0.049
 inches-0.047
Token ball
Token ID2613
Feature activation+1.04e+00

Pos logit contributions
 very+0.050
 scr+0.044
 so+0.041
der+0.041
 inches+0.041
Neg logit contributions
use-0.071
 visit-0.060
 greet-0.058
 claim-0.057
 meet-0.056
Token."
Token ID526
Feature activation-1.89e+00

Pos logit contributions
 very+0.026
 scr+0.024
 best+0.023
 dots+0.023
 so+0.022
Neg logit contributions
use-0.033
 greet-0.027
 visit-0.027
 claim-0.026
|-0.025
Token Buddy
Token ID36896
Feature activation-4.13e+00

Pos logit contributions
use+0.056
 greet+0.044
 claim+0.042
 visit+0.039
to+0.038
Neg logit contributions
 very-0.063
 so-0.057
 best-0.052
 scr-0.050
 ears-0.049
Token and
Token ID290
Feature activation+5.59e+00

Pos logit contributions
use+0.145
 visit+0.121
 greet+0.120
to+0.117
 meet+0.115
Neg logit contributions
 very-0.080
 best-0.078
 dots-0.077
 inches-0.075
 blades-0.072
Token Wh
Token ID854
Feature activation-2.83e+00

Pos logit contributions
 very+0.185
 best+0.164
 so+0.163
 scr+0.160
 inches+0.151
Neg logit contributions
use-0.154
 greet-0.118
 claim-0.113
 visit-0.112
 meet-0.105
Tokenisk
Token ID1984
Feature activation-2.11e+01

Pos logit contributions
use+0.044
 greet+0.024
 claim+0.022
 visit+0.022
|+0.020
Neg logit contributions
 very-0.127
 so-0.116
 best-0.116
 scr-0.115
 dots-0.111
Tokeners
Token ID364
Feature activation-1.18e+00

Pos logit contributions
use+0.714
 greet+0.581
 claim+0.563
 visit+0.560
 meet+0.533
Neg logit contributions
 very-0.560
 best-0.479
 so-0.478
 scr-0.463
 inches-0.436
Token stopped
Token ID5025
Feature activation+1.87e+00

Pos logit contributions
use+0.047
 greet+0.039
 claim+0.038
 visit+0.038
to+0.037
Neg logit contributions
 very-0.025
 so-0.020
 best-0.020
 scr-0.019
 inches-0.017
Token fighting
Token ID4330
Feature activation-8.70e-01

Pos logit contributions
 very+0.047
 so+0.039
 best+0.038
 scr+0.037
 inches+0.034
Neg logit contributions
use-0.069
 greet-0.057
 claim-0.054
 visit-0.054
|-0.053
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
use+0.034
 greet+0.029
 visit+0.029
 claim+0.029
 meet+0.027
Neg logit contributions
 very-0.016
 best-0.014
 scr-0.014
 so-0.013
 inches-0.012
Token They
Token ID1119
Feature activation+1.96e+00

Pos logit contributions
 very+0.045
 so+0.039
 best+0.038
 scr+0.035
 ears+0.034
Neg logit contributions
use-0.057
 greet-0.045
 claim-0.043
 visit-0.042
 meet-0.040
Tokeny
Token ID88
Feature activation-1.16e+00

Pos logit contributions
use+0.031
 greet+0.025
 visit+0.025
 claim+0.025
to+0.024
Neg logit contributions
 very-0.019
 best-0.017
 so-0.016
 scr-0.016
 inches-0.015
Token ran
Token ID4966
Feature activation+3.81e-01

Pos logit contributions
use+0.039
 visit+0.032
 greet+0.031
 claim+0.030
 meet+0.030
Neg logit contributions
 very-0.028
 best-0.025
 scr-0.024
 inches-0.024
 so-0.024
Token over
Token ID625
Feature activation-7.84e+00

Pos logit contributions
 very+0.013
 best+0.012
 scr+0.012
 so+0.012
 inches+0.011
Neg logit contributions
use-0.010
 greet-0.007
 visit-0.007
 claim-0.007
 meet-0.007
Token to
Token ID284
Feature activation-8.15e+00

Pos logit contributions
use+0.255
 greet+0.190
 claim+0.189
|+0.180
 oblig+0.179
Neg logit contributions
 very-0.256
 so-0.225
 best-0.198
 too-0.183
 because-0.182
Token Wh
Token ID854
Feature activation-2.75e+00

Pos logit contributions
use+0.289
war+0.228
enn+0.221
to+0.221
Em+0.217
Neg logit contributions
 very-0.245
 so-0.207
 best-0.172
 because-0.171
 scr-0.166
Tokenisk
Token ID1984
Feature activation-2.08e+01

Pos logit contributions
use+0.044
 greet+0.027
 claim+0.024
 visit+0.024
 meet+0.021
Neg logit contributions
 very-0.122
 best-0.112
 so-0.112
 scr-0.110
 inches-0.106
Tokeners
Token ID364
Feature activation-7.67e-01

Pos logit contributions
use+0.856
 claim+0.699
 greet+0.682
 visit+0.678
 oblig+0.633
Neg logit contributions
 very-0.453
 best-0.351
 so-0.349
 inches-0.294
 most-0.269
Token and
Token ID290
Feature activation+5.64e+00

Pos logit contributions
use+0.035
 greet+0.030
 claim+0.030
 visit+0.029
|+0.029
Neg logit contributions
 very-0.011
 so-0.008
 best-0.008
 scr-0.007
 inches-0.006
Token bark
Token ID21405
Feature activation+5.67e+00

Pos logit contributions
 very+0.176
 so+0.155
 best+0.148
 scr+0.148
 dots+0.137
Neg logit contributions
use-0.173
 greet-0.134
 claim-0.131
 visit-0.128
|-0.126
Tokened
Token ID276
Feature activation-6.94e+00

Pos logit contributions
 inches+0.146
 blades+0.144
 sizes+0.143
 best+0.142
 very+0.141
Neg logit contributions
use-0.140
 greet-0.124
to-0.123
 visit-0.110
 meet-0.110
Token very
Token ID845
Feature activation+1.57e+01

Pos logit contributions
use+0.234
 greet+0.197
 visit+0.190
 claim+0.189
to+0.179
Neg logit contributions
 very-0.171
 best-0.151
 so-0.146
 scr-0.145
 inches-0.139
Tokeners
Token ID364
Feature activation-2.32e+00

Pos logit contributions
use+0.852
 greet+0.691
 claim+0.684
 visit+0.670
etch+0.631
Neg logit contributions
 very-0.536
 best-0.439
 so-0.437
 inches-0.383
 scr-0.383
Token.
Token ID13
Feature activation+4.85e-01

Pos logit contributions
use+0.068
 visit+0.058
 meet+0.053
 greet+0.053
 claim+0.052
Neg logit contributions
 very-0.060
 dots-0.059
 inches-0.059
 best-0.058
 scr-0.057
Token Mr
Token ID1770
Feature activation+4.20e+00

Pos logit contributions
 very+0.015
 so+0.013
 best+0.013
 scr+0.013
 dots+0.012
Neg logit contributions
use-0.015
 greet-0.012
 claim-0.011
 visit-0.011
to-0.011
Token.
Token ID13
Feature activation+4.82e-01

Pos logit contributions
 very+0.115
 scr+0.106
 so+0.105
 best+0.101
 inches+0.099
Neg logit contributions
use-0.129
 visit-0.101
 claim-0.100
 greet-0.099
|-0.094
Token Wh
Token ID854
Feature activation-3.66e+00

Pos logit contributions
 very+0.014
 best+0.012
 so+0.012
 scr+0.012
 ears+0.011
Neg logit contributions
use-0.015
 greet-0.012
to-0.011
 claim-0.011
war-0.011
Tokenisk
Token ID1984
Feature activation-2.21e+01

Pos logit contributions
use+0.044
|+0.016
 visit+0.015
 claim+0.015
 greet+0.014
Neg logit contributions
 very-0.176
 so-0.164
 scr-0.162
 best-0.162
 dots-0.161
Tokeners
Token ID364
Feature activation-2.18e+00

Pos logit contributions
use+0.748
 greet+0.607
 claim+0.589
 visit+0.585
 meet+0.557
Neg logit contributions
 very-0.588
 best-0.503
 so-0.502
 scr-0.487
 inches-0.458
Token loved
Token ID6151
Feature activation+1.73e+00

Pos logit contributions
use+0.066
 visit+0.060
 meet+0.056
 greet+0.052
|+0.052
Neg logit contributions
 inches-0.052
 dots-0.052
der-0.050
 scr-0.049
 blades-0.048
Token to
Token ID284
Feature activation-8.98e+00

Pos logit contributions
 very+0.066
 best+0.063
 scr+0.062
 so+0.060
 inches+0.060
Neg logit contributions
use-0.033
 visit-0.024
 greet-0.024
 claim-0.021
to-0.021
Token study
Token ID2050
Feature activation+6.72e-01

Pos logit contributions
use+0.192
 greet+0.136
 visit+0.128
 claim+0.128
 meet+0.117
Neg logit contributions
 very-0.349
 best-0.316
 so-0.315
 scr-0.309
 inches-0.299
Token.
Token ID13
Feature activation+6.81e-01

Pos logit contributions
 very+0.015
 scr+0.013
 best+0.013
 so+0.012
 inches+0.012
Neg logit contributions
use-0.023
 greet-0.020
 visit-0.020
 claim-0.019
to-0.018
Token outside
Token ID2354
Feature activation-9.08e+00

Pos logit contributions
 very+0.038
 so+0.032
 best+0.030
 scr+0.030
 inches+0.027
Neg logit contributions
use-0.057
 greet-0.047
 claim-0.045
 visit-0.045
war-0.043
Token the
Token ID262
Feature activation+2.69e+00

Pos logit contributions
use+0.318
 greet+0.254
 claim+0.246
 visit+0.239
to+0.232
Neg logit contributions
 very-0.250
 so-0.216
 best-0.202
 scr-0.201
 inches-0.187
Token door
Token ID3420
Feature activation-1.33e+00

Pos logit contributions
 very+0.062
 so+0.051
 best+0.049
 scr+0.047
 dots+0.044
Neg logit contributions
use-0.104
 greet-0.086
 claim-0.083
to-0.081
 visit-0.081
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
use+0.049
 greet+0.041
 claim+0.040
 visit+0.039
 meet+0.038
Neg logit contributions
 very-0.031
 best-0.026
 so-0.026
 scr-0.025
 inches-0.024
Token Wh
Token ID854
Feature activation-3.35e+00

Pos logit contributions
 very+0.034
 so+0.030
 scr+0.028
 best+0.027
 ears+0.026
Neg logit contributions
use-0.037
 greet-0.029
 claim-0.028
 visit-0.027
|-0.026
Tokenisk
Token ID1984
Feature activation-2.17e+01

Pos logit contributions
use+0.050
 greet+0.026
 claim+0.025
 visit+0.024
|+0.021
Neg logit contributions
 very-0.154
 so-0.141
 best-0.141
 scr-0.139
 dots-0.135
Tokeners
Token ID364
Feature activation-2.19e+00

Pos logit contributions
use+0.897
 greet+0.733
 claim+0.732
 visit+0.727
etch+0.682
Neg logit contributions
 very-0.464
 best-0.363
 so-0.361
 inches-0.311
 most-0.291
Token wanted
Token ID2227
Feature activation-1.06e+00

Pos logit contributions
use+0.076
 greet+0.062
 visit+0.061
 claim+0.060
 meet+0.058
Neg logit contributions
 very-0.053
 best-0.048
 so-0.045
 scr-0.044
 inches-0.043
Token to
Token ID284
Feature activation-8.56e+00

Pos logit contributions
use+0.031
 greet+0.023
 claim+0.022
|+0.022
etch+0.021
Neg logit contributions
 very-0.037
 so-0.033
 best-0.031
 scr-0.029
 skin-0.028
Token catch
Token ID4929
Feature activation-6.15e+00

Pos logit contributions
use+0.257
to+0.184
enn+0.184
etch+0.180
Em+0.180
Neg logit contributions
 very-0.282
 so-0.254
 scr-0.238
 skin-0.225
 best-0.213
Token the
Token ID262
Feature activation+1.90e+00

Pos logit contributions
use+0.200
 greet+0.158
 visit+0.152
 claim+0.151
|+0.144
Neg logit contributions
 very-0.180
 so-0.157
 best-0.150
 scr-0.147
 inches-0.138