TOP ACTIVATIONS
MAX = 9.665

It became the happiest books he lf in the world !
it was a beautiful t eddy bear ! But Lucy was
is a very nice t eddy bear . I love it
was her furry orange t eddy bear . Every day ,
favorite was a leather t eddy bear . One day ,
toy was a green t eddy bear . One day ,
toy was a soft t eddy bear named Mr . Bear
could not believe how care free T essa was . She
favorite toy was a t eddy bear . One day ,
You are the best t eddy bear in the world ."
is a very nice t eddy bear , but it is
favorite toy was a t eddy bear named Brown ie .
" This is my t eddy bear . He is very
: " What is bat he ? Why is it good
and their stomach s ac he . They spit
favorite toy was a t eddy bear . One day ,
favorite toy was a t eddy bear named Teddy . One
favorite toy was a t eddy bear named Mr . Sn
The toy was a t eddy bear , and it was
favorite toy was a t eddy bear . One day ,

MIN ACTIVATIONS
MIN = -10.088

. So he knocked on lots of doors . He gave
. After lots of squ ashing , it
Grand ma would go on lots of adventures . They would
wanted . While they took lots of pictures , Jimmy found
play !" They tried on lots of clothes and had so
for dinner . She took lots of things out of the
prince and princess went on lots of fun adventures and then
to the festival and took lots of fun things . They
friend . They went on lots of adventures together , and
friend . They went on lots of adventures together , and
He tried on lots of different ones , until
, and they went on lots of fun adventures together .
The team went on lots of adventures together , and
the park . They took lots of delicious treats and spread
into the garden and took lots of lovely photos of the
checked her foot . After lots of tests , the doctors
to sleep . Lucy took lots of cozy n aps with
adows ! After lots of hopping around , the
. He asked lots of people , but he
worked very hard . After lots of trying , he finally

INTERVAL 4.727 - 9.665
CONTAINS 0.319%

, " What are you doing ?" The man
" There , it is fixed . Good as new ."
" Wow , this is amazing !" Ben says .
Anna thinks the car is pretty . She wants to see
what Mrs . Brown meant by " sp oil ", but

INTERVAL -0.211 - 4.727
CONTAINS 56.647%

was a little girl called Sarah . She was three years
time , there was a gray fish who lived in the
, round balloons . Sue 's mom bought her a red
wash him . Then he will be clean and fresh .
. One day , she was playing with her favorite toy

INTERVAL -5.149 - -0.211
CONTAINS 42.582%

Come on , let 's go !" He opened
there ." They looked around the slide and saw
happy little girl named Sarah . She loved going for walks
bed and his mom my read him a story . Tim
in the park and he saw a bright yellow notebook lying

INTERVAL -10.088 - -5.149
CONTAINS 0.452%

playing . <|endoftext|> Once upon a time there was a happy
little brother had knocked over some books . Lily 's mom
time . <|endoftext|> Once upon a time , there was a
sad . <|endoftext|> Once upon a time , in a small
. I won â t hurt you . â
Token It
Token ID632
Feature activation+1.32e+00

Pos logit contributions
icer+0.001
 scoring+0.001
bing+0.001
 exploding+0.001
 ramp+0.001
Neg logit contributions
 lots-0.001
First-0.001
 many-0.001
up-0.001
drawn-0.001
Token became
Token ID2627
Feature activation+2.66e+00

Pos logit contributions
icer+0.042
 scoring+0.040
 ramp+0.039
etti+0.038
 exploding+0.038
Neg logit contributions
up-0.032
drawn-0.030
aid-0.029
large-0.028
zzy-0.028
Token the
Token ID262
Feature activation+2.67e+00

Pos logit contributions
icer+0.099
 scoring+0.097
 exploding+0.092
 burying+0.092
bing+0.091
Neg logit contributions
up-0.050
drawn-0.046
aid-0.043
ave-0.042
large-0.041
Token happiest
Token ID49414
Feature activation+1.55e+00

Pos logit contributions
icer+0.109
 scoring+0.102
 ramp+0.101
 exploding+0.098
etti+0.098
Neg logit contributions
up-0.042
drawn-0.039
aid-0.037
large-0.035
zzy-0.034
Token books
Token ID3835
Feature activation+2.62e+00

Pos logit contributions
icer+0.056
 scoring+0.053
bing+0.052
omp+0.051
ocol+0.051
Neg logit contributions
up-0.031
 to-0.031
 lots-0.031
 big-0.030
 many-0.030
Tokenhe
Token ID258
Feature activation+9.67e+00

Pos logit contributions
icer+0.108
 scoring+0.099
 ramp+0.097
ocol+0.095
 dent+0.095
Neg logit contributions
 to-0.043
up-0.042
 lots-0.039
 extra-0.039
drawn-0.038
Tokenlf
Token ID1652
Feature activation+5.90e+00

Pos logit contributions
icer+0.210
 dent+0.188
etti+0.177
 spoiler+0.175
ocol+0.174
Neg logit contributions
 to-0.340
 off-0.325
 many-0.316
 lots-0.316
 very-0.314
Token in
Token ID287
Feature activation+5.04e+00

Pos logit contributions
icer+0.243
 scoring+0.224
 ramp+0.220
 dent+0.216
ocol+0.214
Neg logit contributions
up-0.097
 to-0.097
 lots-0.089
drawn-0.087
 extra-0.087
Token the
Token ID262
Feature activation+1.73e+00

Pos logit contributions
icer+0.217
oc+0.210
 ramp+0.210
etti+0.201
 scoring+0.199
Neg logit contributions
 many-0.068
 lots-0.066
 to-0.064
 big-0.061
 a-0.058
Token world
Token ID995
Feature activation+1.60e+00

Pos logit contributions
 scoring+0.048
icer+0.047
omp+0.047
etti+0.046
bing+0.046
Neg logit contributions
up-0.046
drawn-0.044
 lots-0.044
 many-0.043
 big-0.043
Token!
Token ID0
Feature activation-3.09e-02

Pos logit contributions
icer+0.061
 scoring+0.054
 ramp+0.054
 dent+0.053
ocol+0.052
Neg logit contributions
up-0.032
 to-0.031
drawn-0.029
 lots-0.029
 extra-0.027
Token it
Token ID340
Feature activation+3.44e-01

Pos logit contributions
up+0.041
drawn+0.040
large+0.037
zzy+0.036
 sunglasses+0.036
Neg logit contributions
icer-0.062
 scoring-0.056
oc-0.055
etti-0.054
bing-0.053
Token was
Token ID373
Feature activation+3.62e+00

Pos logit contributions
icer+0.011
 scoring+0.011
 ramp+0.010
etti+0.010
 exploding+0.010
Neg logit contributions
up-0.009
drawn-0.008
aid-0.007
large-0.007
ave-0.007
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
icer+0.153
 scoring+0.145
 ramp+0.142
etti+0.140
 exploding+0.140
Neg logit contributions
up-0.050
drawn-0.046
aid-0.042
large-0.041
First-0.040
Token beautiful
Token ID4950
Feature activation+4.59e+00

Pos logit contributions
icer+0.043
 scoring+0.041
 ramp+0.040
etti+0.040
 exploding+0.039
Neg logit contributions
up-0.015
drawn-0.014
aid-0.013
large-0.012
First-0.012
Token t
Token ID256
Feature activation+1.75e+00

Pos logit contributions
icer+0.182
 scoring+0.173
omp+0.168
bing+0.165
 ramp+0.163
Neg logit contributions
up-0.079
drawn-0.073
 lots-0.069
 news-0.069
 to-0.068
Tokeneddy
Token ID21874
Feature activation+9.27e+00

Pos logit contributions
icer+0.027
etti+0.022
oc+0.021
eros+0.020
 ramp+0.019
Neg logit contributions
up-0.070
drawn-0.070
First-0.069
aid-0.067
zzy-0.067
Token bear
Token ID6842
Feature activation+4.53e+00

Pos logit contributions
icer+0.235
 scoring+0.230
 ramp+0.216
omp+0.214
 exploding+0.213
Neg logit contributions
up-0.275
drawn-0.263
aid-0.260
 extra-0.253
 to-0.250
Token!
Token ID0
Feature activation+1.54e-01

Pos logit contributions
icer+0.154
 scoring+0.143
 ramp+0.140
 exploding+0.134
etti+0.133
Neg logit contributions
up-0.105
drawn-0.095
large-0.093
aid-0.092
 to-0.092
Token But
Token ID887
Feature activation+1.74e+00

Pos logit contributions
icer+0.005
 scoring+0.004
 exploding+0.004
bing+0.004
 ramp+0.004
Neg logit contributions
up-0.004
drawn-0.004
large-0.004
First-0.004
aid-0.004
Token Lucy
Token ID22162
Feature activation+1.41e+00

Pos logit contributions
icer+0.055
 scoring+0.050
 ramp+0.049
etti+0.049
 exploding+0.048
Neg logit contributions
up-0.044
 lots-0.040
drawn-0.039
aid-0.039
 extra-0.039
Token was
Token ID373
Feature activation+1.29e+00

Pos logit contributions
icer+0.049
 scoring+0.046
 ramp+0.045
 exploding+0.044
etti+0.044
Neg logit contributions
up-0.033
bs-0.027
ave-0.027
 to-0.027
drawn-0.027
Token is
Token ID318
Feature activation+4.74e+00

Pos logit contributions
icer+0.072
 scoring+0.070
etti+0.068
 exploding+0.066
bing+0.066
Neg logit contributions
up-0.087
drawn-0.081
 lots-0.079
large-0.078
aid-0.078
Token a
Token ID257
Feature activation+2.12e+00

Pos logit contributions
icer+0.173
 scoring+0.163
 ramp+0.159
etti+0.156
 exploding+0.156
Neg logit contributions
up-0.094
drawn-0.088
aid-0.083
large-0.082
First-0.080
Token very
Token ID845
Feature activation-3.85e-01

Pos logit contributions
icer+0.100
 scoring+0.094
etti+0.094
bing+0.093
 ramp+0.092
Neg logit contributions
up-0.018
drawn-0.016
aid-0.015
 lots-0.014
 extra-0.014
Token nice
Token ID3621
Feature activation+1.73e+00

Pos logit contributions
drawn+0.005
up+0.005
First+0.005
aid+0.004
zzy+0.004
Neg logit contributions
icer-0.016
 scoring-0.015
 ramp-0.015
oc-0.015
 burying-0.015
Token t
Token ID256
Feature activation+1.88e+00

Pos logit contributions
icer+0.069
 scoring+0.068
bing+0.067
etti+0.066
 exploding+0.066
Neg logit contributions
up-0.026
 to-0.026
 lots-0.026
 big-0.026
 extra-0.025
Tokeneddy
Token ID21874
Feature activation+9.00e+00

Pos logit contributions
icer+0.038
 scoring+0.036
 ramp+0.034
 exploding+0.033
etti+0.032
Neg logit contributions
up-0.068
drawn-0.064
aid-0.063
large-0.062
zzy-0.061
Token bear
Token ID6842
Feature activation+4.31e+00

Pos logit contributions
icer+0.228
 scoring+0.220
etti+0.211
 exploding+0.209
omp+0.208
Neg logit contributions
up-0.269
aid-0.259
 extra-0.253
drawn-0.250
 to-0.246
Token.
Token ID13
Feature activation+2.55e+00

Pos logit contributions
icer+0.161
 scoring+0.149
 ramp+0.146
etti+0.142
 exploding+0.142
Neg logit contributions
up-0.087
drawn-0.077
aid-0.076
 to-0.074
large-0.073
Token I
Token ID314
Feature activation-1.63e+00

Pos logit contributions
icer+0.071
 scoring+0.067
bing+0.063
 ramp+0.063
 exploding+0.062
Neg logit contributions
up-0.073
aid-0.071
large-0.071
drawn-0.070
First-0.069
Token love
Token ID1842
Feature activation+3.28e-01

Pos logit contributions
up+0.030
aid+0.027
drawn+0.026
zzy+0.026
large+0.025
Neg logit contributions
icer-0.064
 scoring-0.060
 ramp-0.059
bing-0.058
etti-0.058
Token it
Token ID340
Feature activation+3.83e-01

Pos logit contributions
icer+0.011
 ramp+0.011
etti+0.010
 scoring+0.010
omp+0.010
Neg logit contributions
 to-0.008
 lots-0.007
up-0.007
 many-0.007
drawn-0.007
Token was
Token ID373
Feature activation+3.47e+00

Pos logit contributions
icer+0.046
 scoring+0.043
 ramp+0.042
etti+0.041
 exploding+0.041
Neg logit contributions
up-0.037
drawn-0.036
aid-0.034
large-0.034
First-0.033
Token her
Token ID607
Feature activation+3.90e+00

Pos logit contributions
icer+0.142
 scoring+0.135
 ramp+0.132
etti+0.130
 exploding+0.129
Neg logit contributions
up-0.053
drawn-0.049
aid-0.046
large-0.044
First-0.043
Token furry
Token ID40857
Feature activation+3.97e+00

Pos logit contributions
icer+0.164
 scoring+0.154
oc+0.151
 ramp+0.151
etti+0.150
Neg logit contributions
up-0.052
drawn-0.048
 lots-0.047
 prett-0.045
 extra-0.044
Token orange
Token ID10912
Feature activation+3.24e+00

Pos logit contributions
icer+0.126
 scoring+0.120
 exploding+0.116
etti+0.115
 ramp+0.114
Neg logit contributions
up-0.094
drawn-0.087
 sunglasses-0.086
 lots-0.085
 to-0.082
Token t
Token ID256
Feature activation+2.29e+00

Pos logit contributions
icer+0.102
 scoring+0.096
 exploding+0.092
 ramp+0.091
omp+0.091
Neg logit contributions
up-0.078
drawn-0.074
 sunglasses-0.070
aid-0.070
 lots-0.069
Tokeneddy
Token ID21874
Feature activation+8.93e+00

Pos logit contributions
icer+0.038
etti+0.030
 ramp+0.029
 scoring+0.028
oc+0.028
Neg logit contributions
up-0.091
drawn-0.090
First-0.087
aid-0.087
large-0.086
Token bear
Token ID6842
Feature activation+4.36e+00

Pos logit contributions
icer+0.226
 scoring+0.215
ocol+0.209
 ramp+0.207
 exploding+0.207
Neg logit contributions
up-0.260
drawn-0.247
 lots-0.243
aid-0.243
 extra-0.242
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
icer+0.180
 scoring+0.166
ocol+0.166
 ramp+0.165
 dent+0.162
Neg logit contributions
up-0.068
 to-0.065
 lots-0.064
 very-0.063
 extra-0.061
Token Every
Token ID3887
Feature activation+1.31e-01

Pos logit contributions
 lots+0.008
 to+0.008
First+0.008
 many+0.007
Of+0.007
Neg logit contributions
 scoring-0.021
 exploding-0.021
bing-0.021
 defeating-0.020
icer-0.020
Token day
Token ID1110
Feature activation-1.94e-01

Pos logit contributions
icer+0.004
 scoring+0.004
 ramp+0.004
etti+0.004
 exploding+0.004
Neg logit contributions
up-0.003
drawn-0.003
aid-0.003
large-0.003
zzy-0.003
Token,
Token ID11
Feature activation-1.61e+00

Pos logit contributions
up+0.005
drawn+0.004
 lots+0.004
 to+0.004
aid+0.004
Neg logit contributions
icer-0.007
 ramp-0.006
 scoring-0.006
etti-0.006
 exploding-0.006
Token favorite
Token ID4004
Feature activation+1.13e+00

Pos logit contributions
icer+0.052
 ramp+0.049
 pretending+0.049
 scoring+0.049
 burying+0.046
Neg logit contributions
up-0.023
drawn-0.022
aid-0.021
First-0.021
large-0.020
Token was
Token ID373
Feature activation+3.54e+00

Pos logit contributions
icer+0.034
 scoring+0.031
 ramp+0.031
etti+0.030
 exploding+0.030
Neg logit contributions
up-0.030
drawn-0.029
aid-0.028
large-0.027
First-0.027
Token a
Token ID257
Feature activation+1.50e+00

Pos logit contributions
icer+0.129
 scoring+0.128
 pretending+0.126
 sharing+0.123
 burying+0.121
Neg logit contributions
drawn-0.066
up-0.064
aid-0.060
large-0.058
First-0.058
Token leather
Token ID11620
Feature activation+3.90e+00

Pos logit contributions
icer+0.059
 ramp+0.056
 scoring+0.056
 exploding+0.054
oc+0.054
Neg logit contributions
up-0.026
drawn-0.024
aid-0.023
First-0.023
large-0.022
Token t
Token ID256
Feature activation+1.49e+00

Pos logit contributions
icer+0.129
 scoring+0.123
 exploding+0.119
etti+0.116
omp+0.116
Neg logit contributions
up-0.090
drawn-0.081
aid-0.080
 sunglasses-0.078
 lots-0.077
Tokeneddy
Token ID21874
Feature activation+8.92e+00

Pos logit contributions
icer+0.024
etti+0.019
 ramp+0.018
oc+0.018
 scoring+0.018
Neg logit contributions
up-0.059
drawn-0.059
First-0.057
aid-0.057
large-0.056
Token bear
Token ID6842
Feature activation+4.20e+00

Pos logit contributions
icer+0.219
 scoring+0.204
ocol+0.201
 exploding+0.197
omp+0.197
Neg logit contributions
up-0.265
aid-0.256
drawn-0.253
 sunglasses-0.251
 extra-0.249
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
icer+0.152
 scoring+0.140
 ramp+0.138
 exploding+0.135
ocol+0.134
Neg logit contributions
up-0.087
drawn-0.078
aid-0.078
 lots-0.075
 to-0.074
Token One
Token ID1881
Feature activation-2.54e+00

Pos logit contributions
up+0.002
First+0.002
 lots+0.002
drawn+0.002
 to+0.002
Neg logit contributions
icer-0.004
 scoring-0.004
bing-0.004
 exploding-0.004
 burying-0.004
Token day
Token ID1110
Feature activation-2.29e-01

Pos logit contributions
up+0.074
drawn+0.072
aid+0.069
First+0.067
large+0.067
Neg logit contributions
icer-0.068
 scoring-0.061
oc-0.060
 ramp-0.059
 exploding-0.058
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
up+0.006
drawn+0.006
aid+0.006
large+0.006
 lots+0.006
Neg logit contributions
icer-0.007
 scoring-0.006
 ramp-0.006
etti-0.006
 exploding-0.006
Token toy
Token ID13373
Feature activation+1.31e+00

Pos logit contributions
icer+0.019
 scoring+0.017
 ramp+0.017
etti+0.016
 exploding+0.016
Neg logit contributions
up-0.015
drawn-0.015
aid-0.014
large-0.014
zzy-0.014
Token was
Token ID373
Feature activation+4.07e+00

Pos logit contributions
icer+0.038
 scoring+0.035
 ramp+0.033
etti+0.033
 exploding+0.033
Neg logit contributions
up-0.036
drawn-0.034
aid-0.033
 lots-0.032
large-0.032
Token a
Token ID257
Feature activation+1.69e+00

Pos logit contributions
icer+0.170
 scoring+0.163
 ramp+0.159
 exploding+0.157
etti+0.157
Neg logit contributions
up-0.056
drawn-0.053
aid-0.048
large-0.047
First-0.046
Token green
Token ID4077
Feature activation+4.23e+00

Pos logit contributions
icer+0.066
 scoring+0.062
 ramp+0.060
etti+0.060
bing+0.059
Neg logit contributions
up-0.029
drawn-0.027
aid-0.026
large-0.025
zzy-0.024
Token t
Token ID256
Feature activation+1.64e+00

Pos logit contributions
icer+0.130
 scoring+0.122
omp+0.120
 exploding+0.117
bing+0.117
Neg logit contributions
up-0.103
drawn-0.098
 sunglasses-0.098
 lots-0.095
 to-0.094
Tokeneddy
Token ID21874
Feature activation+8.83e+00

Pos logit contributions
icer+0.028
etti+0.023
 ramp+0.022
 scoring+0.022
oc+0.021
Neg logit contributions
up-0.064
drawn-0.063
aid-0.061
First-0.061
large-0.060
Token bear
Token ID6842
Feature activation+4.38e+00

Pos logit contributions
icer+0.224
 scoring+0.212
omp+0.209
ocol+0.208
 exploding+0.201
Neg logit contributions
up-0.249
aid-0.240
drawn-0.239
 extra-0.237
 sunglasses-0.237
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
icer+0.165
 scoring+0.152
 ramp+0.150
ocol+0.146
 exploding+0.145
Neg logit contributions
up-0.083
drawn-0.075
aid-0.074
 to-0.073
 lots-0.072
Token One
Token ID1881
Feature activation-2.64e+00

Pos logit contributions
up+0.003
First+0.003
 to+0.003
 lots+0.003
drawn+0.003
Neg logit contributions
icer-0.007
 scoring-0.007
bing-0.007
 exploding-0.007
eros-0.007
Token day
Token ID1110
Feature activation-3.56e-01

Pos logit contributions
up+0.080
drawn+0.077
aid+0.075
First+0.072
large+0.071
Neg logit contributions
icer-0.068
 scoring-0.061
oc-0.061
 ramp-0.059
 exploding-0.058
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
up+0.010
drawn+0.009
aid+0.009
large+0.009
 lots+0.009
Neg logit contributions
icer-0.010
 scoring-0.009
 ramp-0.009
etti-0.009
 exploding-0.009
Token toy
Token ID13373
Feature activation+1.19e+00

Pos logit contributions
icer+0.033
 scoring+0.031
etti+0.031
 exploding+0.030
 ramp+0.030
Neg logit contributions
up-0.021
drawn-0.019
 lots-0.019
 extra-0.019
large-0.018
Token was
Token ID373
Feature activation+4.30e+00

Pos logit contributions
icer+0.037
 scoring+0.035
omp+0.033
ocol+0.033
 exploding+0.033
Neg logit contributions
up-0.031
drawn-0.028
 lots-0.027
 to-0.027
 many-0.026
Token a
Token ID257
Feature activation+1.68e+00

Pos logit contributions
icer+0.198
 scoring+0.186
 ramp+0.184
oc+0.182
etti+0.182
Neg logit contributions
up-0.046
drawn-0.039
 lots-0.035
aid-0.035
large-0.034
Token soft
Token ID2705
Feature activation+4.58e+00

Pos logit contributions
icer+0.070
 scoring+0.066
bing+0.064
etti+0.064
omp+0.064
Neg logit contributions
up-0.023
drawn-0.022
aid-0.020
 lots-0.020
large-0.020
Token t
Token ID256
Feature activation+1.38e+00

Pos logit contributions
icer+0.169
omp+0.164
 scoring+0.162
ocol+0.154
cks+0.154
Neg logit contributions
 big-0.086
 to-0.085
 sunglasses-0.083
 lots-0.082
 extra-0.081
Tokeneddy
Token ID21874
Feature activation+8.82e+00

Pos logit contributions
icer+0.025
 scoring+0.021
 ramp+0.021
etti+0.020
oc+0.019
Neg logit contributions
up-0.052
drawn-0.051
aid-0.050
large-0.049
First-0.049
Token bear
Token ID6842
Feature activation+4.30e+00

Pos logit contributions
icer+0.245
ocol+0.243
omp+0.242
 scoring+0.234
 exploding+0.228
Neg logit contributions
 very-0.230
 to-0.221
 extra-0.219
up-0.219
 so-0.216
Token named
Token ID3706
Feature activation+3.71e+00

Pos logit contributions
icer+0.177
ocol+0.165
 ramp+0.160
 scoring+0.159
omp+0.159
Neg logit contributions
 to-0.074
up-0.067
 very-0.066
 lots-0.063
 so-0.062
Token Mr
Token ID1770
Feature activation+3.68e+00

Pos logit contributions
icer+0.096
 scoring+0.086
 ramp+0.084
 exploding+0.082
bing+0.081
Neg logit contributions
up-0.112
drawn-0.108
 lots-0.105
large-0.104
aid-0.103
Token.
Token ID13
Feature activation+2.33e+00

Pos logit contributions
icer+0.130
 scoring+0.124
 ramp+0.120
bing+0.117
 exploding+0.117
Neg logit contributions
up-0.076
drawn-0.071
aid-0.068
zzy-0.066
large-0.065
Token Bear
Token ID14732
Feature activation+1.90e+00

Pos logit contributions
 scoring+0.076
 exploding+0.074
 ramp+0.074
omp+0.073
ebra+0.073
Neg logit contributions
 lots-0.049
 a-0.046
 to-0.046
 many-0.045
First-0.043
Token could
Token ID714
Feature activation-1.66e-01

Pos logit contributions
icer+0.011
 scoring+0.010
 ramp+0.010
 exploding+0.010
 burying+0.010
Neg logit contributions
up-0.008
drawn-0.008
large-0.007
aid-0.007
First-0.007
Token not
Token ID407
Feature activation-2.54e+00

Pos logit contributions
up+0.004
drawn+0.004
aid+0.004
large+0.004
ave+0.004
Neg logit contributions
icer-0.005
 scoring-0.005
 ramp-0.004
etti-0.004
 exploding-0.004
Token believe
Token ID1975
Feature activation+2.45e+00

Pos logit contributions
up+0.062
drawn+0.062
aid+0.058
large+0.058
First+0.057
Neg logit contributions
 ramp-0.072
icer-0.072
 scoring-0.071
 exploding-0.069
oc-0.068
Token how
Token ID703
Feature activation-3.50e+00

Pos logit contributions
icer+0.089
 scoring+0.085
 ramp+0.082
etti+0.081
 exploding+0.081
Neg logit contributions
up-0.048
drawn-0.045
aid-0.043
large-0.042
First-0.041
Token care
Token ID1337
Feature activation-1.66e+00

Pos logit contributions
drawn+0.060
aid+0.059
up+0.058
ave+0.054
First+0.054
Neg logit contributions
 scoring-0.125
icer-0.123
 burying-0.121
 exploding-0.121
 ramp-0.119
Tokenfree
Token ID5787
Feature activation+8.79e+00

Pos logit contributions
up+0.040
drawn+0.039
aid+0.037
large+0.036
zzy+0.036
Neg logit contributions
icer-0.052
 scoring-0.049
 ramp-0.048
 exploding-0.047
bing-0.046
Token T
Token ID309
Feature activation+2.22e+00

Pos logit contributions
icer+0.308
 scoring+0.285
 ramp+0.280
etti+0.278
 exploding+0.269
Neg logit contributions
up-0.191
 lots-0.173
drawn-0.168
zzy-0.164
aid-0.161
Tokenessa
Token ID21411
Feature activation+2.52e+00

Pos logit contributions
icer+0.079
 scoring+0.073
etti+0.073
bing+0.072
eros+0.072
Neg logit contributions
drawn-0.040
First-0.040
aid-0.039
up-0.038
large-0.037
Token was
Token ID373
Feature activation+1.92e+00

Pos logit contributions
icer+0.088
 scoring+0.081
 ramp+0.080
etti+0.078
 exploding+0.078
Neg logit contributions
up-0.057
drawn-0.051
aid-0.049
zzy-0.048
ave-0.048
Token.
Token ID13
Feature activation-4.91e-01

Pos logit contributions
icer+0.059
 scoring+0.054
 ramp+0.053
etti+0.052
oc+0.051
Neg logit contributions
up-0.050
drawn-0.047
aid-0.045
large-0.045
zzy-0.044
Token She
Token ID1375
Feature activation+1.55e+00

Pos logit contributions
up+0.015
drawn+0.014
aid+0.014
large+0.014
zzy+0.013
Neg logit contributions
icer-0.013
 scoring-0.012
 ramp-0.011
etti-0.011
 exploding-0.011
Token favorite
Token ID4004
Feature activation+4.90e-01

Pos logit contributions
icer+0.021
 scoring+0.019
 ramp+0.019
 pretending+0.018
etti+0.018
Neg logit contributions
up-0.017
drawn-0.016
large-0.015
aid-0.015
First-0.015
Token toy
Token ID13373
Feature activation+1.20e+00

Pos logit contributions
icer+0.015
 scoring+0.014
 ramp+0.014
etti+0.013
 exploding+0.013
Neg logit contributions
up-0.012
drawn-0.012
aid-0.011
large-0.011
zzy-0.011
Token was
Token ID373
Feature activation+3.77e+00

Pos logit contributions
icer+0.034
 scoring+0.032
 exploding+0.030
 ramp+0.030
etti+0.030
Neg logit contributions
up-0.034
drawn-0.032
 lots-0.031
aid-0.030
large-0.030
Token a
Token ID257
Feature activation+1.62e+00

Pos logit contributions
icer+0.155
 scoring+0.150
 ramp+0.145
 exploding+0.144
etti+0.144
Neg logit contributions
up-0.053
drawn-0.051
aid-0.046
large-0.045
First-0.044
Token t
Token ID256
Feature activation+1.31e+00

Pos logit contributions
icer+0.063
 scoring+0.059
etti+0.057
bing+0.057
 ramp+0.057
Neg logit contributions
up-0.028
drawn-0.026
aid-0.024
large-0.024
 lots-0.023
Tokeneddy
Token ID21874
Feature activation+8.78e+00

Pos logit contributions
icer+0.024
 scoring+0.020
 ramp+0.019
etti+0.019
oc+0.018
Neg logit contributions
up-0.050
drawn-0.049
aid-0.047
large-0.047
First-0.047
Token bear
Token ID6842
Feature activation+4.34e+00

Pos logit contributions
icer+0.214
 scoring+0.205
omp+0.197
 ramp+0.195
ocol+0.193
Neg logit contributions
up-0.260
aid-0.251
drawn-0.249
 sunglasses-0.243
 extra-0.242
Token.
Token ID13
Feature activation-3.24e-01

Pos logit contributions
icer+0.161
 scoring+0.149
 ramp+0.146
ocol+0.143
 exploding+0.142
Neg logit contributions
up-0.086
aid-0.076
drawn-0.076
 to-0.075
 lots-0.073
Token One
Token ID1881
Feature activation-2.64e+00

Pos logit contributions
up+0.006
First+0.006
drawn+0.005
 to+0.005
 lots+0.005
Neg logit contributions
icer-0.012
 scoring-0.012
bing-0.011
 exploding-0.011
 burying-0.011
Token day
Token ID1110
Feature activation-2.21e-01

Pos logit contributions
up+0.079
drawn+0.076
aid+0.074
First+0.071
zzy+0.071
Neg logit contributions
icer-0.070
 scoring-0.062
oc-0.062
 ramp-0.061
 exploding-0.059
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
up+0.006
drawn+0.006
aid+0.006
large+0.006
First+0.006
Neg logit contributions
icer-0.006
 scoring-0.006
 ramp-0.006
etti-0.005
 exploding-0.005
Token You
Token ID921
Feature activation-6.28e-01

Pos logit contributions
icer+0.065
 scoring+0.064
bing+0.064
 exploding+0.061
 ramp+0.058
Neg logit contributions
Of-0.079
large-0.078
up-0.077
drawn-0.077
aid-0.076
Token are
Token ID389
Feature activation+2.14e+00

Pos logit contributions
drawn+0.019
up+0.018
large+0.018
zzy+0.017
First+0.017
Neg logit contributions
icer-0.015
 scoring-0.014
 ramp-0.014
etti-0.014
 exploding-0.014
Token the
Token ID262
Feature activation+3.41e+00

Pos logit contributions
icer+0.083
 scoring+0.077
 ramp+0.077
etti+0.075
oc+0.075
Neg logit contributions
up-0.037
drawn-0.034
aid-0.033
large-0.032
 lots-0.032
Token best
Token ID1266
Feature activation+1.08e+00

Pos logit contributions
icer+0.130
 scoring+0.119
 ramp+0.118
 exploding+0.111
 dent+0.111
Neg logit contributions
up-0.063
drawn-0.061
ave-0.056
aid-0.056
bs-0.056
Token t
Token ID256
Feature activation+1.76e+00

Pos logit contributions
icer+0.036
 scoring+0.035
bing+0.033
 ramp+0.033
 exploding+0.033
Neg logit contributions
up-0.025
drawn-0.023
aid-0.022
large-0.022
 lots-0.022
Tokeneddy
Token ID21874
Feature activation+8.77e+00

Pos logit contributions
icer+0.035
 scoring+0.032
 ramp+0.031
 exploding+0.029
etti+0.029
Neg logit contributions
up-0.064
drawn-0.061
aid-0.060
large-0.059
zzy-0.058
Token bear
Token ID6842
Feature activation+4.11e+00

Pos logit contributions
icer+0.200
 scoring+0.190
 ramp+0.182
 exploding+0.179
etti+0.178
Neg logit contributions
up-0.292
drawn-0.276
aid-0.274
ave-0.263
 extra-0.262
Token in
Token ID287
Feature activation+4.10e+00

Pos logit contributions
icer+0.134
 scoring+0.124
 ramp+0.121
etti+0.118
 exploding+0.118
Neg logit contributions
up-0.099
drawn-0.093
aid-0.090
large-0.088
zzy-0.086
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
icer+0.153
 ramp+0.146
etti+0.146
 scoring+0.145
bing+0.143
Neg logit contributions
up-0.073
 lots-0.073
 many-0.073
 big-0.069
 to-0.068
Token world
Token ID995
Feature activation+1.61e+00

Pos logit contributions
icer+0.031
 scoring+0.029
etti+0.029
bing+0.028
 ramp+0.028
Neg logit contributions
up-0.037
drawn-0.036
 lots-0.034
large-0.034
aid-0.033
Token."
Token ID526
Feature activation-1.72e+00

Pos logit contributions
icer+0.052
 scoring+0.049
 ramp+0.048
etti+0.047
 exploding+0.047
Neg logit contributions
up-0.037
drawn-0.035
aid-0.034
large-0.034
zzy-0.033
Token is
Token ID318
Feature activation+3.80e+00

Pos logit contributions
icer+0.090
 scoring+0.087
ocol+0.087
 exploding+0.085
etti+0.084
Neg logit contributions
up-0.062
 lots-0.050
 to-0.049
drawn-0.049
 prett-0.048
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
icer+0.165
 ramp+0.155
etti+0.155
 scoring+0.154
oc+0.153
Neg logit contributions
up-0.046
 lots-0.043
 extra-0.041
drawn-0.041
 prett-0.040
Token very
Token ID845
Feature activation-9.45e-01

Pos logit contributions
icer+0.062
 scoring+0.059
etti+0.058
bing+0.058
 ramp+0.057
Neg logit contributions
up-0.015
drawn-0.014
aid-0.013
large-0.012
 extra-0.012
Token nice
Token ID3621
Feature activation+1.42e+00

Pos logit contributions
drawn+0.013
up+0.012
First+0.012
aid+0.011
ave+0.010
Neg logit contributions
icer-0.039
 scoring-0.038
 ramp-0.037
oc-0.036
 burying-0.036
Token t
Token ID256
Feature activation+1.42e+00

Pos logit contributions
icer+0.053
 scoring+0.051
bing+0.050
etti+0.049
 exploding+0.049
Neg logit contributions
up-0.028
 to-0.026
 news-0.026
 lots-0.026
 big-0.026
Tokeneddy
Token ID21874
Feature activation+8.71e+00

Pos logit contributions
icer+0.028
 scoring+0.027
 ramp+0.025
 exploding+0.024
etti+0.024
Neg logit contributions
up-0.051
drawn-0.049
aid-0.048
large-0.047
zzy-0.046
Token bear
Token ID6842
Feature activation+4.14e+00

Pos logit contributions
icer+0.239
 scoring+0.235
 exploding+0.225
omp+0.224
etti+0.219
Neg logit contributions
up-0.240
aid-0.233
 to-0.226
 extra-0.226
 very-0.225
Token,
Token ID11
Feature activation+3.56e+00

Pos logit contributions
icer+0.176
 scoring+0.159
 ramp+0.157
 dent+0.155
bing+0.154
Neg logit contributions
 to-0.069
up-0.067
 extra-0.055
 very-0.055
aid-0.054
Token but
Token ID475
Feature activation+1.95e+00

Pos logit contributions
icer+0.141
etti+0.133
bing+0.132
oc+0.132
 scoring+0.132
Neg logit contributions
 so-0.069
 to-0.066
 very-0.059
 lots-0.059
 a-0.058
Token it
Token ID340
Feature activation+1.06e+00

Pos logit contributions
icer+0.066
etti+0.059
 ramp+0.059
 scoring+0.059
oc+0.057
Neg logit contributions
up-0.045
 lots-0.043
 to-0.043
 prett-0.043
 many-0.042
Token is
Token ID318
Feature activation+4.05e+00

Pos logit contributions
etti+0.034
icer+0.034
 scoring+0.034
bing+0.033
ocol+0.032
Neg logit contributions
up-0.026
 prett-0.022
 very-0.022
 listened-0.022
 extra-0.021
Token favorite
Token ID4004
Feature activation+6.27e-01

Pos logit contributions
icer+0.022
 scoring+0.020
 ramp+0.020
 pretending+0.019
bing+0.019
Neg logit contributions
up-0.018
drawn-0.017
large-0.016
aid-0.016
zzy-0.016
Token toy
Token ID13373
Feature activation+1.36e+00

Pos logit contributions
icer+0.020
 scoring+0.018
 ramp+0.018
etti+0.018
 exploding+0.017
Neg logit contributions
up-0.015
drawn-0.015
aid-0.014
large-0.014
zzy-0.014
Token was
Token ID373
Feature activation+4.07e+00

Pos logit contributions
icer+0.039
 scoring+0.037
 exploding+0.034
etti+0.034
 ramp+0.034
Neg logit contributions
up-0.038
drawn-0.035
 lots-0.034
aid-0.033
 sunglasses-0.032
Token a
Token ID257
Feature activation+1.76e+00

Pos logit contributions
icer+0.171
 scoring+0.164
 ramp+0.159
 exploding+0.158
etti+0.157
Neg logit contributions
up-0.056
drawn-0.053
aid-0.047
large-0.047
First-0.045
Token t
Token ID256
Feature activation+1.40e+00

Pos logit contributions
icer+0.068
 scoring+0.064
 ramp+0.062
etti+0.062
bing+0.062
Neg logit contributions
up-0.030
drawn-0.028
aid-0.027
large-0.026
 lots-0.025
Tokeneddy
Token ID21874
Feature activation+8.70e+00

Pos logit contributions
icer+0.025
 scoring+0.022
 ramp+0.021
etti+0.021
 exploding+0.020
Neg logit contributions
up-0.053
drawn-0.052
aid-0.050
large-0.050
First-0.049
Token bear
Token ID6842
Feature activation+4.27e+00

Pos logit contributions
icer+0.220
 scoring+0.207
omp+0.204
ocol+0.204
etti+0.198
Neg logit contributions
up-0.247
aid-0.239
drawn-0.237
 sunglasses-0.235
 extra-0.234
Token named
Token ID3706
Feature activation+3.49e+00

Pos logit contributions
icer+0.162
 scoring+0.150
 ramp+0.147
ocol+0.145
omp+0.143
Neg logit contributions
up-0.080
 to-0.072
aid-0.071
drawn-0.071
 lots-0.069
Token Brown
Token ID4373
Feature activation+2.52e+00

Pos logit contributions
icer+0.090
 scoring+0.083
 ramp+0.081
 exploding+0.079
oc+0.079
Neg logit contributions
up-0.103
drawn-0.100
 lots-0.099
large-0.097
 sunglasses-0.095
Tokenie
Token ID494
Feature activation+2.64e+00

Pos logit contributions
 scoring+0.079
icer+0.079
 ramp+0.079
etti+0.076
 dent+0.075
Neg logit contributions
up-0.057
 to-0.053
drawn-0.050
 lots-0.049
 extra-0.049
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
icer+0.102
 scoring+0.095
 ramp+0.094
ocol+0.094
etti+0.094
Neg logit contributions
up-0.044
drawn-0.041
 to-0.041
 lots-0.040
aid-0.038
Token "
Token ID366
Feature activation-1.34e+00

Pos logit contributions
icer+0.017
 ramp+0.014
ocol+0.014
 scoring+0.014
oc+0.014
Neg logit contributions
 to-0.011
 lots-0.010
Of-0.010
 many-0.010
aid-0.010
TokenThis
Token ID1212
Feature activation+3.16e+00

Pos logit contributions
up+0.024
drawn+0.023
aid+0.021
large+0.021
zzy+0.021
Neg logit contributions
icer-0.051
 scoring-0.048
 ramp-0.046
etti-0.046
 exploding-0.046
Token is
Token ID318
Feature activation+4.27e+00

Pos logit contributions
icer+0.077
 scoring+0.074
bing+0.069
etti+0.069
 exploding+0.068
Neg logit contributions
up-0.101
drawn-0.094
 lots-0.093
First-0.090
aid-0.090
Token my
Token ID616
Feature activation+3.32e+00

Pos logit contributions
icer+0.141
 scoring+0.129
 ramp+0.128
etti+0.126
bing+0.125
Neg logit contributions
up-0.099
drawn-0.094
 lots-0.090
large-0.089
aid-0.089
Token t
Token ID256
Feature activation+1.98e+00

Pos logit contributions
icer+0.124
 scoring+0.119
bing+0.115
omp+0.115
etti+0.114
Neg logit contributions
up-0.059
drawn-0.056
 lots-0.055
 extra-0.054
 prett-0.053
Tokeneddy
Token ID21874
Feature activation+8.69e+00

Pos logit contributions
icer+0.036
 scoring+0.031
 ramp+0.030
etti+0.029
oc+0.028
Neg logit contributions
up-0.075
drawn-0.073
aid-0.071
large-0.070
First-0.070
Token bear
Token ID6842
Feature activation+4.10e+00

Pos logit contributions
 scoring+0.202
icer+0.201
 exploding+0.190
 ramp+0.189
etti+0.186
Neg logit contributions
up-0.271
drawn-0.255
aid-0.253
 extra-0.251
 lots-0.248
Token.
Token ID13
Feature activation+3.12e+00

Pos logit contributions
icer+0.161
 scoring+0.152
 ramp+0.148
 exploding+0.145
ocol+0.145
Neg logit contributions
up-0.076
 to-0.067
 lots-0.063
aid-0.062
 extra-0.062
Token He
Token ID679
Feature activation+4.05e+00

Pos logit contributions
icer+0.094
 scoring+0.090
bing+0.087
 exploding+0.084
 ramp+0.082
Neg logit contributions
up-0.083
large-0.081
drawn-0.081
Of-0.080
aid-0.079
Token is
Token ID318
Feature activation+2.37e+00

Pos logit contributions
icer+0.128
 scoring+0.128
bing+0.125
omp+0.120
 exploding+0.120
Neg logit contributions
up-0.109
 lots-0.089
 to-0.087
ave-0.087
aid-0.086
Token very
Token ID845
Feature activation-1.30e+00

Pos logit contributions
icer+0.104
oc+0.099
etti+0.098
 ramp+0.098
bing+0.097
Neg logit contributions
 very-0.030
 big-0.029
 lots-0.029
 prett-0.029
 extra-0.028
Token:
Token ID25
Feature activation-2.58e+00

Pos logit contributions
up+0.114
drawn+0.106
aid+0.102
large+0.101
First+0.100
Neg logit contributions
icer-0.155
 scoring-0.144
 ramp-0.141
etti-0.139
 exploding-0.137
Token "
Token ID366
Feature activation-2.43e+00

Pos logit contributions
 to+0.064
 many+0.062
 lots+0.061
 very+0.058
Of+0.056
Neg logit contributions
icer-0.089
 ramp-0.078
etti-0.076
 dent-0.076
ocol-0.075
TokenWhat
Token ID2061
Feature activation-1.04e+00

Pos logit contributions
up+0.058
drawn+0.055
aid+0.053
First+0.052
large+0.052
Neg logit contributions
icer-0.078
 scoring-0.073
 ramp-0.072
etti-0.070
 exploding-0.070
Token is
Token ID318
Feature activation+2.80e+00

Pos logit contributions
drawn+0.037
up+0.037
aid+0.036
zzy+0.035
ave+0.035
Neg logit contributions
icer-0.020
 scoring-0.019
 sharing-0.018
 ramp-0.018
oc-0.017
Token bat
Token ID7365
Feature activation+8.51e-01

Pos logit contributions
icer+0.090
 scoring+0.086
 ramp+0.083
 exploding+0.083
bing+0.082
Neg logit contributions
up-0.068
drawn-0.064
aid-0.062
ave-0.060
large-0.060
Tokenhe
Token ID258
Feature activation+8.66e+00

Pos logit contributions
icer+0.028
 scoring+0.026
 ramp+0.025
etti+0.025
 exploding+0.025
Neg logit contributions
up-0.020
drawn-0.019
aid-0.018
large-0.018
First-0.017
Token?
Token ID30
Feature activation+1.04e+00

Pos logit contributions
icer+0.351
 scoring+0.331
 ramp+0.329
etti+0.327
 dent+0.322
Neg logit contributions
up-0.134
 extra-0.111
 lots-0.109
 to-0.109
large-0.108
Token Why
Token ID4162
Feature activation-8.60e-01

Pos logit contributions
icer+0.027
 scoring+0.025
 exploding+0.023
 ramp+0.023
bing+0.023
Neg logit contributions
up-0.030
First-0.029
drawn-0.029
large-0.029
aid-0.029
Token is
Token ID318
Feature activation+1.76e+00

Pos logit contributions
up+0.023
 lots+0.021
aid+0.021
drawn+0.021
First+0.020
Neg logit contributions
icer-0.025
 scoring-0.024
etti-0.023
 ramp-0.023
 exploding-0.022
Token it
Token ID340
Feature activation+8.21e-01

Pos logit contributions
icer+0.048
 scoring+0.046
 exploding+0.043
 ramp+0.043
bing+0.042
Neg logit contributions
up-0.051
drawn-0.050
ave-0.048
zzy-0.048
aid-0.047
Token good
Token ID922
Feature activation+7.16e-01

Pos logit contributions
icer+0.030
 scoring+0.028
 ramp+0.028
 exploding+0.027
bing+0.027
Neg logit contributions
up-0.016
drawn-0.016
aid-0.015
large-0.014
First-0.014
Token and
Token ID290
Feature activation+2.68e+00

Pos logit contributions
icer+0.082
 scoring+0.075
 ramp+0.075
oc+0.073
etti+0.073
Neg logit contributions
up-0.051
 lots-0.048
drawn-0.047
 many-0.046
aid-0.046
Token their
Token ID511
Feature activation+2.49e+00

Pos logit contributions
icer+0.095
 scoring+0.089
 ramp+0.087
etti+0.086
 exploding+0.086
Neg logit contributions
up-0.055
drawn-0.052
aid-0.049
large-0.048
First-0.047
Token stomach
Token ID11384
Feature activation-9.31e-01

Pos logit contributions
icer+0.085
 scoring+0.080
etti+0.078
 ramp+0.078
 exploding+0.076
Neg logit contributions
up-0.053
drawn-0.050
aid-0.048
 sunglasses-0.047
 lots-0.047
Tokens
Token ID82
Feature activation+3.58e+00

Pos logit contributions
up+0.017
aid+0.014
drawn+0.014
 lots+0.014
ave+0.013
Neg logit contributions
icer-0.037
 scoring-0.034
 ramp-0.033
etti-0.033
eros-0.033
Token ac
Token ID936
Feature activation+3.88e+00

Pos logit contributions
icer+0.127
 scoring+0.116
 ramp+0.115
etti+0.112
oc+0.111
Neg logit contributions
up-0.076
aid-0.070
drawn-0.070
 lots-0.067
ave-0.065
Tokenhe
Token ID258
Feature activation+8.62e+00

Pos logit contributions
 scoring+0.097
icer+0.097
 exploding+0.092
 ramp+0.090
etti+0.089
Neg logit contributions
up-0.120
aid-0.115
drawn-0.108
ave-0.106
zzy-0.105
Token.
Token ID13
Feature activation+2.07e+00

Pos logit contributions
icer+0.305
ocol+0.282
etti+0.282
 scoring+0.276
omp+0.275
Neg logit contributions
up-0.163
 to-0.161
 lots-0.161
 many-0.159
 extra-0.154
Token
Token ID198
Feature activation+1.84e+00

Pos logit contributions
icer+0.066
 scoring+0.061
bing+0.059
 ramp+0.058
 exploding+0.057
Neg logit contributions
up-0.049
 lots-0.049
First-0.048
aid-0.048
drawn-0.047
Token
Token ID198
Feature activation+1.82e+00

Pos logit contributions
icer+0.066
 scoring+0.064
etti+0.060
 ramp+0.060
 exploding+0.059
Neg logit contributions
up-0.037
First-0.035
 lots-0.035
aid-0.034
 many-0.034
TokenThey
Token ID2990
Feature activation+4.13e-01

Pos logit contributions
icer+0.060
 scoring+0.055
etti+0.053
 ramp+0.053
 exploding+0.052
Neg logit contributions
up-0.046
First-0.046
aid-0.045
A-0.042
 lots-0.041
Token spit
Token ID27591
Feature activation-3.15e+00

Pos logit contributions
icer+0.013
 scoring+0.012
 ramp+0.012
 exploding+0.012
etti+0.012
Neg logit contributions
up-0.009
drawn-0.009
large-0.009
First-0.009
earned-0.009
Token favorite
Token ID4004
Feature activation+7.47e-01

Pos logit contributions
icer+0.018
 scoring+0.017
 ramp+0.017
etti+0.016
bing+0.016
Neg logit contributions
up-0.014
drawn-0.013
large-0.013
aid-0.013
First-0.012
Token toy
Token ID13373
Feature activation+1.39e+00

Pos logit contributions
icer+0.023
 scoring+0.022
 ramp+0.021
etti+0.020
 exploding+0.020
Neg logit contributions
up-0.019
drawn-0.018
aid-0.018
large-0.017
zzy-0.017
Token was
Token ID373
Feature activation+4.03e+00

Pos logit contributions
icer+0.041
 scoring+0.039
 ramp+0.037
etti+0.036
 exploding+0.036
Neg logit contributions
up-0.037
drawn-0.034
 lots-0.033
aid-0.033
large-0.032
Token a
Token ID257
Feature activation+1.63e+00

Pos logit contributions
icer+0.169
 scoring+0.162
 ramp+0.158
etti+0.156
 exploding+0.155
Neg logit contributions
up-0.056
drawn-0.052
aid-0.048
large-0.047
First-0.045
Token t
Token ID256
Feature activation+1.27e+00

Pos logit contributions
icer+0.063
 scoring+0.059
etti+0.058
 ramp+0.058
bing+0.057
Neg logit contributions
up-0.028
drawn-0.026
aid-0.025
large-0.024
 lots-0.023
Tokeneddy
Token ID21874
Feature activation+8.60e+00

Pos logit contributions
icer+0.023
 scoring+0.019
 ramp+0.018
etti+0.018
oc+0.018
Neg logit contributions
up-0.048
drawn-0.047
aid-0.046
large-0.046
First-0.045
Token bear
Token ID6842
Feature activation+4.25e+00

Pos logit contributions
icer+0.218
 scoring+0.203
ocol+0.202
omp+0.199
etti+0.198
Neg logit contributions
up-0.244
 sunglasses-0.235
aid-0.234
drawn-0.233
 extra-0.231
Token.
Token ID13
Feature activation-2.57e-01

Pos logit contributions
icer+0.162
 scoring+0.148
 ramp+0.147
ocol+0.144
etti+0.143
Neg logit contributions
up-0.080
 to-0.071
aid-0.070
drawn-0.070
 lots-0.069
Token One
Token ID1881
Feature activation-2.62e+00

Pos logit contributions
up+0.004
First+0.004
drawn+0.004
 lots+0.004
 to+0.004
Neg logit contributions
icer-0.010
 scoring-0.009
bing-0.009
 exploding-0.009
 burying-0.009
Token day
Token ID1110
Feature activation-3.62e-01

Pos logit contributions
up+0.078
drawn+0.075
aid+0.073
First+0.070
large+0.070
Neg logit contributions
icer-0.069
 scoring-0.062
oc-0.061
 ramp-0.061
 exploding-0.059
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
up+0.010
drawn+0.009
aid+0.009
large+0.009
 lots+0.009
Neg logit contributions
icer-0.011
 scoring-0.010
 ramp-0.010
etti-0.009
 exploding-0.009
Token favorite
Token ID4004
Feature activation+3.47e-01

Pos logit contributions
icer+0.016
 scoring+0.015
 ramp+0.015
 pretending+0.014
bing+0.014
Neg logit contributions
up-0.016
drawn-0.015
large-0.015
zzy-0.014
First-0.014
Token toy
Token ID13373
Feature activation+1.17e+00

Pos logit contributions
icer+0.010
 scoring+0.010
 ramp+0.009
etti+0.009
 exploding+0.009
Neg logit contributions
up-0.009
drawn-0.009
aid-0.009
large-0.008
zzy-0.008
Token was
Token ID373
Feature activation+3.78e+00

Pos logit contributions
icer+0.030
 scoring+0.028
 ramp+0.027
etti+0.026
 exploding+0.026
Neg logit contributions
up-0.035
drawn-0.034
aid-0.033
large-0.033
zzy-0.032
Token a
Token ID257
Feature activation+1.58e+00

Pos logit contributions
icer+0.148
 scoring+0.145
 pretending+0.140
 exploding+0.139
 sharing+0.138
Neg logit contributions
up-0.058
drawn-0.057
large-0.051
aid-0.050
First-0.050
Token t
Token ID256
Feature activation+1.28e+00

Pos logit contributions
icer+0.057
 scoring+0.054
 ramp+0.053
etti+0.052
 exploding+0.051
Neg logit contributions
up-0.032
drawn-0.030
aid-0.028
large-0.028
First-0.027
Tokeneddy
Token ID21874
Feature activation+8.60e+00

Pos logit contributions
icer+0.023
 scoring+0.019
 ramp+0.019
etti+0.018
oc+0.018
Neg logit contributions
up-0.049
drawn-0.048
aid-0.046
large-0.046
First-0.046
Token bear
Token ID6842
Feature activation+4.18e+00

Pos logit contributions
icer+0.213
 scoring+0.199
ocol+0.196
omp+0.196
etti+0.193
Neg logit contributions
up-0.250
aid-0.241
drawn-0.239
 extra-0.235
 sunglasses-0.235
Token named
Token ID3706
Feature activation+3.53e+00

Pos logit contributions
icer+0.157
 scoring+0.145
 ramp+0.143
ocol+0.141
 exploding+0.139
Neg logit contributions
up-0.080
aid-0.071
 to-0.071
drawn-0.070
 lots-0.069
Token Teddy
Token ID29345
Feature activation+3.51e+00

Pos logit contributions
icer+0.090
 scoring+0.082
 ramp+0.080
 exploding+0.079
oc+0.079
Neg logit contributions
up-0.105
drawn-0.102
 lots-0.101
large-0.098
First-0.097
Token.
Token ID13
Feature activation-3.49e-01

Pos logit contributions
icer+0.150
ocol+0.143
 ramp+0.139
omp+0.138
 dent+0.138
Neg logit contributions
 to-0.049
up-0.045
 lots-0.042
 very-0.042
 extra-0.040
Token One
Token ID1881
Feature activation-2.50e+00

Pos logit contributions
First+0.006
 to+0.006
 lots+0.006
up+0.006
 many+0.005
Neg logit contributions
 scoring-0.013
eros-0.013
icer-0.013
 exploding-0.012
omp-0.012
Token favorite
Token ID4004
Feature activation+8.38e-01

Pos logit contributions
icer+0.022
 scoring+0.021
 ramp+0.021
 pretending+0.020
bing+0.019
Neg logit contributions
up-0.018
drawn-0.017
large-0.016
aid-0.016
zzy-0.016
Token toy
Token ID13373
Feature activation+1.47e+00

Pos logit contributions
icer+0.026
 scoring+0.024
 ramp+0.023
etti+0.022
oc+0.022
Neg logit contributions
up-0.021
drawn-0.021
aid-0.020
zzy-0.020
large-0.020
Token was
Token ID373
Feature activation+4.17e+00

Pos logit contributions
icer+0.042
 scoring+0.039
 ramp+0.037
 exploding+0.037
etti+0.036
Neg logit contributions
up-0.041
drawn-0.039
aid-0.037
 lots-0.037
large-0.036
Token a
Token ID257
Feature activation+1.80e+00

Pos logit contributions
icer+0.175
 scoring+0.167
 ramp+0.164
etti+0.161
 exploding+0.161
Neg logit contributions
up-0.057
drawn-0.054
aid-0.049
large-0.048
First-0.047
Token t
Token ID256
Feature activation+1.37e+00

Pos logit contributions
icer+0.070
 scoring+0.065
etti+0.063
bing+0.063
 ramp+0.063
Neg logit contributions
up-0.031
drawn-0.029
aid-0.027
large-0.027
 lots-0.026
Tokeneddy
Token ID21874
Feature activation+8.59e+00

Pos logit contributions
icer+0.024
 scoring+0.020
 ramp+0.020
etti+0.020
oc+0.019
Neg logit contributions
up-0.053
drawn-0.051
aid-0.050
large-0.049
First-0.049
Token bear
Token ID6842
Feature activation+4.30e+00

Pos logit contributions
icer+0.213
 scoring+0.201
omp+0.198
ocol+0.196
etti+0.193
Neg logit contributions
up-0.250
aid-0.242
drawn-0.239
 sunglasses-0.236
 extra-0.235
Token named
Token ID3706
Feature activation+3.59e+00

Pos logit contributions
icer+0.161
 scoring+0.149
 ramp+0.147
ocol+0.144
 exploding+0.143
Neg logit contributions
up-0.083
aid-0.073
drawn-0.073
 to-0.072
 lots-0.071
Token Mr
Token ID1770
Feature activation+3.61e+00

Pos logit contributions
icer+0.089
 scoring+0.081
 ramp+0.078
 exploding+0.077
oc+0.076
Neg logit contributions
up-0.111
drawn-0.108
 lots-0.105
large-0.104
First-0.102
Token.
Token ID13
Feature activation+1.90e+00

Pos logit contributions
icer+0.131
 scoring+0.125
 ramp+0.122
bing+0.119
etti+0.118
Neg logit contributions
up-0.069
drawn-0.064
aid-0.063
zzy-0.060
large-0.058
Token Sn
Token ID5489
Feature activation+1.48e+00

Pos logit contributions
rim+0.065
opl+0.065
omp+0.063
 exploding+0.063
orse+0.062
Neg logit contributions
 lots-0.035
 a-0.035
 many-0.034
 to-0.034
First-0.032
Token The
Token ID383
Feature activation+1.53e+00

Pos logit contributions
 lots+0.002
 to+0.002
 a+0.002
large+0.002
 many+0.002
Neg logit contributions
icer-0.004
bing-0.004
 scoring-0.004
rim-0.004
 exploding-0.004
Token toy
Token ID13373
Feature activation+6.87e-01

Pos logit contributions
icer+0.067
omp+0.065
ocol+0.065
etti+0.064
 scoring+0.064
Neg logit contributions
 big-0.021
 pretty-0.019
 nice-0.017
 very-0.016
 bright-0.016
Token was
Token ID373
Feature activation+3.04e+00

Pos logit contributions
ocol+0.029
omp+0.027
icer+0.026
 scoring+0.026
arter+0.025
Neg logit contributions
 very-0.012
 big-0.011
 lots-0.011
 to-0.011
 decided-0.011
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
icer+0.152
etti+0.143
oc+0.142
 scoring+0.142
 ramp+0.141
Neg logit contributions
up-0.017
 lots-0.017
 many-0.014
 big-0.014
 extra-0.014
Token t
Token ID256
Feature activation+1.18e+00

Pos logit contributions
icer+0.048
 scoring+0.045
etti+0.045
bing+0.044
omp+0.044
Neg logit contributions
up-0.011
drawn-0.009
aid-0.009
large-0.009
 lots-0.009
Tokeneddy
Token ID21874
Feature activation+8.58e+00

Pos logit contributions
icer+0.024
 scoring+0.023
 ramp+0.022
 exploding+0.021
etti+0.020
Neg logit contributions
up-0.042
drawn-0.040
aid-0.039
large-0.038
zzy-0.037
Token bear
Token ID6842
Feature activation+3.94e+00

Pos logit contributions
ocol+0.247
icer+0.244
omp+0.236
 scoring+0.228
etti+0.227
Neg logit contributions
 very-0.224
 to-0.209
 extra-0.207
 so-0.206
 decided-0.206
Token,
Token ID11
Feature activation+2.49e+00

Pos logit contributions
icer+0.172
ocol+0.164
 ramp+0.158
 dent+0.157
 scoring+0.157
Neg logit contributions
 to-0.058
 very-0.056
 lots-0.050
up-0.050
 many-0.048
Token and
Token ID290
Feature activation+1.38e+00

Pos logit contributions
ocol+0.115
icer+0.113
oc+0.108
etti+0.107
omp+0.106
Neg logit contributions
 lots-0.036
 so-0.034
 a-0.033
 very-0.033
 many-0.031
Token it
Token ID340
Feature activation+1.32e+00

Pos logit contributions
icer+0.047
 scoring+0.044
 ramp+0.043
etti+0.042
 exploding+0.042
Neg logit contributions
up-0.031
drawn-0.029
aid-0.028
large-0.028
 lots-0.027
Token was
Token ID373
Feature activation+3.60e+00

Pos logit contributions
icer+0.051
ocol+0.049
etti+0.047
 scoring+0.046
 ramp+0.046
Neg logit contributions
up-0.026
 decided-0.023
 lots-0.023
 very-0.023
 listened-0.022
Token favorite
Token ID4004
Feature activation+2.97e-01

Pos logit contributions
icer+0.014
 scoring+0.013
 ramp+0.012
 pretending+0.012
bing+0.012
Neg logit contributions
up-0.014
drawn-0.013
large-0.012
aid-0.012
zzy-0.012
Token toy
Token ID13373
Feature activation+1.13e+00

Pos logit contributions
icer+0.009
 scoring+0.008
 ramp+0.008
 exploding+0.007
etti+0.007
Neg logit contributions
up-0.008
drawn-0.008
aid-0.008
large-0.007
zzy-0.007
Token was
Token ID373
Feature activation+3.76e+00

Pos logit contributions
icer+0.029
 scoring+0.027
 ramp+0.026
etti+0.025
 exploding+0.025
Neg logit contributions
up-0.034
drawn-0.033
aid-0.032
large-0.031
zzy-0.031
Token a
Token ID257
Feature activation+1.52e+00

Pos logit contributions
icer+0.151
 scoring+0.147
 ramp+0.141
 exploding+0.141
 pretending+0.140
Neg logit contributions
up-0.055
drawn-0.053
aid-0.048
large-0.048
First-0.046
Token t
Token ID256
Feature activation+1.24e+00

Pos logit contributions
icer+0.055
 scoring+0.052
 ramp+0.051
etti+0.050
 exploding+0.050
Neg logit contributions
up-0.030
drawn-0.028
aid-0.027
large-0.026
First-0.026
Tokeneddy
Token ID21874
Feature activation+8.57e+00

Pos logit contributions
icer+0.022
 scoring+0.017
etti+0.017
 ramp+0.017
oc+0.016
Neg logit contributions
up-0.048
drawn-0.047
aid-0.046
large-0.045
First-0.045
Token bear
Token ID6842
Feature activation+4.12e+00

Pos logit contributions
icer+0.216
 scoring+0.204
omp+0.200
ocol+0.199
etti+0.194
Neg logit contributions
up-0.245
aid-0.236
drawn-0.235
 extra-0.233
 sunglasses-0.231
Token.
Token ID13
Feature activation-4.99e-01

Pos logit contributions
icer+0.155
 scoring+0.143
 ramp+0.141
ocol+0.138
omp+0.137
Neg logit contributions
up-0.079
aid-0.070
drawn-0.070
 to-0.069
 lots-0.068
Token One
Token ID1881
Feature activation-2.74e+00

Pos logit contributions
First+0.009
up+0.008
 lots+0.008
 to+0.008
drawn+0.008
Neg logit contributions
icer-0.019
 scoring-0.018
bing-0.018
eros-0.018
 exploding-0.018
Token day
Token ID1110
Feature activation-3.08e-01

Pos logit contributions
up+0.081
drawn+0.078
aid+0.076
First+0.073
large+0.073
Neg logit contributions
icer-0.073
 scoring-0.065
oc-0.064
 ramp-0.063
 exploding-0.062
Token,
Token ID11
Feature activation-1.47e+00

Pos logit contributions
up+0.008
drawn+0.008
aid+0.008
large+0.008
 lots+0.008
Neg logit contributions
icer-0.009
 scoring-0.008
 ramp-0.008
etti-0.008
 exploding-0.008
Token.
Token ID13
Feature activation-5.75e-01

Pos logit contributions
up+0.054
drawn+0.048
 lots+0.048
aid+0.047
 to+0.047
Neg logit contributions
icer-0.087
 scoring-0.079
 ramp-0.078
etti-0.077
bing-0.076
Token So
Token ID1406
Feature activation-1.17e+00

Pos logit contributions
up+0.015
drawn+0.014
First+0.014
aid+0.013
 lots+0.013
Neg logit contributions
icer-0.017
 scoring-0.017
 exploding-0.016
etti-0.016
bing-0.016
Token he
Token ID339
Feature activation-7.67e-01

Pos logit contributions
up+0.031
drawn+0.030
 lots+0.029
aid+0.029
large+0.029
Neg logit contributions
icer-0.034
 scoring-0.031
etti-0.031
 ramp-0.031
 exploding-0.030
Token knocked
Token ID13642
Feature activation-3.91e+00

Pos logit contributions
up+0.015
large+0.015
drawn+0.015
aid+0.015
First+0.014
Neg logit contributions
icer-0.025
 scoring-0.024
 ramp-0.024
oc-0.023
etti-0.023
Token on
Token ID319
Feature activation-8.52e-01

Pos logit contributions
up+0.103
drawn+0.095
large+0.093
zzy+0.093
 prett+0.089
Neg logit contributions
icer-0.117
 scoring-0.108
bing-0.102
omp-0.100
 ramp-0.100
Token lots
Token ID6041
Feature activation-1.01e+01

Pos logit contributions
up+0.025
aid+0.024
ave+0.024
drawn+0.024
First+0.024
Neg logit contributions
icer-0.022
 scoring-0.021
 ramp-0.020
oc-0.019
 exploding-0.019
Token of
Token ID286
Feature activation-5.51e-01

Pos logit contributions
up+0.248
drawn+0.238
aid+0.228
large+0.225
zzy+0.222
Neg logit contributions
icer-0.315
 scoring-0.292
 ramp-0.285
etti-0.279
 exploding-0.278
Token doors
Token ID8215
Feature activation-4.86e+00

Pos logit contributions
zzy+0.014
ave+0.014
drawn+0.013
up+0.013
aid+0.012
Neg logit contributions
 pretending-0.016
 burying-0.015
icer-0.015
ocol-0.015
 exploding-0.015
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
up+0.125
drawn+0.120
aid+0.116
large+0.113
zzy+0.112
Neg logit contributions
icer-0.146
 scoring-0.135
 ramp-0.133
etti-0.131
 exploding-0.130
Token He
Token ID679
Feature activation+1.14e+00

Pos logit contributions
up+0.035
drawn+0.033
First+0.032
aid+0.032
 lots+0.032
Neg logit contributions
icer-0.033
 scoring-0.030
 exploding-0.029
bing-0.029
 ramp-0.029
Token gave
Token ID2921
Feature activation-2.88e-01

Pos logit contributions
icer+0.035
 ramp+0.033
 scoring+0.032
oc+0.032
 pretending+0.032
Neg logit contributions
drawn-0.026
aid-0.026
large-0.026
up-0.026
First-0.025
Token.
Token ID13
Feature activation-4.35e-01

Pos logit contributions
icer+0.059
 scoring+0.055
bing+0.053
 ramp+0.053
etti+0.052
Neg logit contributions
up-0.054
drawn-0.053
aid-0.052
large-0.050
First-0.049
Token
Token ID220
Feature activation-3.19e+00

Pos logit contributions
up+0.013
aid+0.013
drawn+0.013
large+0.012
zzy+0.012
Neg logit contributions
icer-0.010
 scoring-0.010
etti-0.010
 ramp-0.009
bing-0.009
Token
Token ID198
Feature activation+1.68e-01

Pos logit contributions
up+0.077
drawn+0.072
First+0.071
aid+0.070
 lots+0.069
Neg logit contributions
icer-0.102
 scoring-0.097
 ramp-0.093
 exploding-0.091
oc-0.090
Token
Token ID198
Feature activation+3.16e-01

Pos logit contributions
icer+0.004
 scoring+0.004
 ramp+0.004
bing+0.004
etti+0.004
Neg logit contributions
up-0.005
drawn-0.005
aid-0.005
large-0.005
 prett-0.005
TokenAfter
Token ID3260
Feature activation+1.94e-01

Pos logit contributions
 scoring+0.009
bing+0.009
icer+0.009
 exploding+0.008
etti+0.008
Neg logit contributions
aid-0.008
drawn-0.008
up-0.008
large-0.008
 ju-0.007
Token lots
Token ID6041
Feature activation-9.50e+00

Pos logit contributions
icer+0.007
 ramp+0.007
 scoring+0.007
oc+0.007
etti+0.006
Neg logit contributions
up-0.004
drawn-0.003
aid-0.003
 lots-0.003
large-0.003
Token of
Token ID286
Feature activation+4.17e-01

Pos logit contributions
up+0.204
aid+0.184
drawn+0.179
large+0.174
bs+0.170
Neg logit contributions
icer-0.355
 ramp-0.321
 scoring-0.316
etti-0.314
bing-0.302
Token squ
Token ID2809
Feature activation-3.03e+00

Pos logit contributions
icer+0.018
 scoring+0.017
 ramp+0.017
etti+0.017
oc+0.016
Neg logit contributions
up-0.006
drawn-0.005
aid-0.005
First-0.005
large-0.005
Tokenashing
Token ID2140
Feature activation-1.74e-01

Pos logit contributions
up+0.069
drawn+0.064
aid+0.062
large+0.060
zzy+0.059
Neg logit contributions
icer-0.103
 scoring-0.096
 ramp-0.095
etti-0.093
 exploding-0.092
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
up+0.004
drawn+0.004
aid+0.004
large+0.004
zzy+0.004
Neg logit contributions
icer-0.005
 scoring-0.005
 ramp-0.005
 exploding-0.005
etti-0.005
Token it
Token ID340
Feature activation-5.86e-01

Pos logit contributions
up+0.010
aid+0.010
drawn+0.010
ave+0.010
large+0.009
Neg logit contributions
icer-0.010
 scoring-0.010
etti-0.009
 exploding-0.009
bing-0.009
Token Grand
Token ID5675
Feature activation+1.51e-01

Pos logit contributions
up+0.010
drawn+0.009
aid+0.009
large+0.008
First+0.008
Neg logit contributions
icer-0.018
 scoring-0.017
 ramp-0.016
etti-0.016
 exploding-0.016
Tokenma
Token ID2611
Feature activation-1.01e+00

Pos logit contributions
icer+0.004
 scoring+0.003
 ramp+0.003
 exploding+0.003
etti+0.003
Neg logit contributions
up-0.005
drawn-0.005
aid-0.004
large-0.004
First-0.004
Token would
Token ID561
Feature activation-6.33e-01

Pos logit contributions
up+0.019
drawn+0.017
aid+0.016
ave+0.016
 lots+0.016
Neg logit contributions
icer-0.039
 scoring-0.036
 ramp-0.036
etti-0.035
 exploding-0.035
Token go
Token ID467
Feature activation-2.59e+00

Pos logit contributions
up+0.012
 lots+0.012
drawn+0.012
First+0.011
aid+0.011
Neg logit contributions
icer-0.024
 scoring-0.022
 ramp-0.021
etti-0.021
eros-0.021
Token on
Token ID319
Feature activation-1.47e+00

Pos logit contributions
up+0.034
drawn+0.031
aid+0.029
large+0.027
First+0.026
Neg logit contributions
icer-0.111
 scoring-0.106
 ramp-0.104
etti-0.103
 exploding-0.102
Token lots
Token ID6041
Feature activation-9.48e+00

Pos logit contributions
up+0.029
aid+0.027
ave+0.026
drawn+0.025
large+0.024
Neg logit contributions
icer-0.055
 scoring-0.052
 ramp-0.051
 exploding-0.050
etti-0.050
Token of
Token ID286
Feature activation-4.04e-01

Pos logit contributions
up+0.207
drawn+0.181
large+0.173
 extra+0.173
aid+0.172
Neg logit contributions
icer-0.341
 scoring-0.317
 ramp-0.311
etti-0.310
 exploding-0.301
Token adventures
Token ID17545
Feature activation-9.95e-01

Pos logit contributions
up+0.010
aid+0.010
ave+0.010
drawn+0.009
large+0.009
Neg logit contributions
icer-0.012
 scoring-0.012
 ramp-0.012
 pretending-0.012
 burying-0.012
Token.
Token ID13
Feature activation-2.96e-01

Pos logit contributions
up+0.023
drawn+0.022
 lots+0.022
aid+0.021
First+0.021
Neg logit contributions
icer-0.033
 scoring-0.030
 ramp-0.029
etti-0.029
 dent-0.028
Token They
Token ID1119
Feature activation+1.18e+00

Pos logit contributions
up+0.008
First+0.007
drawn+0.007
 lots+0.007
large+0.007
Neg logit contributions
icer-0.009
 scoring-0.008
 ramp-0.008
bing-0.008
 exploding-0.008
Token would
Token ID561
Feature activation-5.27e-01

Pos logit contributions
icer+0.041
 scoring+0.039
 ramp+0.038
 exploding+0.038
etti+0.038
Neg logit contributions
up-0.024
drawn-0.023
aid-0.022
large-0.021
First-0.020
Token wanted
Token ID2227
Feature activation-2.11e+00

Pos logit contributions
up+0.021
drawn+0.020
large+0.020
aid+0.019
First+0.019
Neg logit contributions
icer-0.034
 scoring-0.032
etti-0.031
oc-0.031
 ramp-0.031
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
up+0.027
drawn+0.023
 lots+0.021
large+0.020
aid+0.020
Neg logit contributions
icer-0.094
 scoring-0.088
 ramp-0.087
etti-0.085
oc-0.085
Token While
Token ID2893
Feature activation-2.83e+00

Pos logit contributions
up+0.007
drawn+0.006
First+0.006
large+0.006
aid+0.006
Neg logit contributions
icer-0.011
 scoring-0.010
 ramp-0.010
 exploding-0.010
etti-0.010
Token they
Token ID484
Feature activation-2.34e+00

Pos logit contributions
 lots+0.060
 to+0.058
 many+0.057
 off+0.051
 extra+0.051
Neg logit contributions
icer-0.109
 ramp-0.096
oc-0.095
 dent-0.093
 scoring-0.092
Token took
Token ID1718
Feature activation-2.82e+00

Pos logit contributions
up+0.052
drawn+0.049
aid+0.047
large+0.046
zzy+0.045
Neg logit contributions
icer-0.080
 scoring-0.075
 ramp-0.073
etti-0.071
 exploding-0.071
Token lots
Token ID6041
Feature activation-9.45e+00

Pos logit contributions
up+0.065
drawn+0.061
aid+0.061
ave+0.059
First+0.059
Neg logit contributions
icer-0.090
 scoring-0.086
 exploding-0.083
 ramp-0.083
etti-0.082
Token of
Token ID286
Feature activation-7.83e-01

Pos logit contributions
up+0.191
aid+0.160
 to+0.156
 lots+0.155
drawn+0.155
Neg logit contributions
icer-0.375
 scoring-0.341
 ramp-0.337
etti-0.327
bing-0.320
Token pictures
Token ID5986
Feature activation-3.85e+00

Pos logit contributions
up+0.017
drawn+0.016
zzy+0.016
aid+0.016
ave+0.015
Neg logit contributions
icer-0.026
 scoring-0.024
 pretending-0.024
 exploding-0.024
ocol-0.024
Token,
Token ID11
Feature activation-3.75e-01

Pos logit contributions
up+0.086
 to+0.082
 lots+0.077
 off+0.076
 extra+0.073
Neg logit contributions
icer-0.149
 scoring-0.129
oc-0.127
 ramp-0.124
eros-0.122
Token Jimmy
Token ID12963
Feature activation+1.31e+00

Pos logit contributions
up+0.010
drawn+0.009
aid+0.009
large+0.009
First+0.008
Neg logit contributions
icer-0.011
 scoring-0.011
 ramp-0.010
etti-0.010
 exploding-0.010
Token found
Token ID1043
Feature activation-6.58e-01

Pos logit contributions
icer+0.050
 scoring+0.046
 ramp+0.046
ocol+0.045
 exploding+0.045
Neg logit contributions
up-0.025
drawn-0.023
ave-0.021
zzy-0.021
 lots-0.021
Token play
Token ID711
Feature activation+2.99e+00

Pos logit contributions
icer+0.077
 scoring+0.077
 exploding+0.074
 ramp+0.073
etti+0.072
Neg logit contributions
up-0.062
drawn-0.060
aid-0.058
zzy-0.057
large-0.057
Token!"
Token ID2474
Feature activation-3.44e+00

Pos logit contributions
icer+0.101
etti+0.092
ocol+0.091
 scoring+0.090
 ramp+0.089
Neg logit contributions
up-0.067
 lots-0.065
drawn-0.062
 to-0.062
 many-0.060
Token They
Token ID1119
Feature activation+4.34e-01

Pos logit contributions
up+0.090
drawn+0.087
aid+0.083
First+0.082
 lots+0.082
Neg logit contributions
icer-0.106
 scoring-0.095
 ramp-0.094
etti-0.093
 exploding-0.092
Token tried
Token ID3088
Feature activation-4.10e+00

Pos logit contributions
icer+0.014
 scoring+0.013
 ramp+0.013
etti+0.013
 exploding+0.013
Neg logit contributions
up-0.010
drawn-0.009
aid-0.009
large-0.009
First-0.009
Token on
Token ID319
Feature activation-1.13e+00

Pos logit contributions
up+0.030
drawn+0.025
aid+0.020
large+0.020
zzy+0.017
Neg logit contributions
icer-0.199
 scoring-0.194
 ramp-0.190
 exploding-0.187
etti-0.186
Token lots
Token ID6041
Feature activation-9.44e+00

Pos logit contributions
up+0.022
drawn+0.021
 lots+0.021
First+0.020
large+0.019
Neg logit contributions
icer-0.041
 scoring-0.038
etti-0.037
 ramp-0.037
oc-0.037
Token of
Token ID286
Feature activation-2.62e-01

Pos logit contributions
up+0.188
aid+0.161
 off+0.154
 lots+0.153
 to+0.151
Neg logit contributions
icer-0.371
etti-0.338
 scoring-0.335
 ramp-0.327
 exploding-0.323
Token clothes
Token ID8242
Feature activation-3.32e+00

Pos logit contributions
up+0.006
drawn+0.006
aid+0.005
large+0.005
zzy+0.005
Neg logit contributions
icer-0.009
 scoring-0.008
 ramp-0.008
etti-0.008
 exploding-0.008
Token and
Token ID290
Feature activation-9.03e-01

Pos logit contributions
up+0.084
aid+0.077
drawn+0.076
 lots+0.076
 to+0.075
Neg logit contributions
icer-0.106
 scoring-0.095
oc-0.093
 ramp-0.092
etti-0.090
Token had
Token ID550
Feature activation-1.11e+00

Pos logit contributions
up+0.021
drawn+0.020
aid+0.019
large+0.019
zzy+0.019
Neg logit contributions
icer-0.029
 scoring-0.028
 ramp-0.027
 exploding-0.027
bing-0.026
Token so
Token ID523
Feature activation-3.78e+00

Pos logit contributions
up+0.017
drawn+0.016
aid+0.016
ave+0.015
large+0.015
Neg logit contributions
 scoring-0.043
icer-0.042
 ramp-0.041
 pretending-0.040
 exploding-0.040
Token for
Token ID329
Feature activation-1.09e+00

Pos logit contributions
up+0.008
aid+0.007
drawn+0.007
zzy+0.007
large+0.007
Neg logit contributions
icer-0.017
 scoring-0.017
 exploding-0.016
 burying-0.016
 ramp-0.016
Token dinner
Token ID8073
Feature activation-9.80e-01

Pos logit contributions
up+0.022
drawn+0.021
aid+0.020
large+0.019
zzy+0.019
Neg logit contributions
icer-0.039
 scoring-0.037
 ramp-0.036
etti-0.035
 exploding-0.035
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
up+0.023
drawn+0.021
aid+0.020
large+0.020
First+0.020
Neg logit contributions
icer-0.032
 scoring-0.030
 ramp-0.029
etti-0.029
 exploding-0.029
Token She
Token ID1375
Feature activation+1.23e+00

Pos logit contributions
up+0.027
drawn+0.026
First+0.025
large+0.025
aid+0.024
Neg logit contributions
icer-0.036
 scoring-0.034
 ramp-0.033
 exploding-0.032
etti-0.032
Token took
Token ID1718
Feature activation-2.68e+00

Pos logit contributions
icer+0.044
 scoring+0.042
 ramp+0.041
etti+0.040
 exploding+0.040
Neg logit contributions
up-0.024
drawn-0.023
aid-0.022
large-0.021
zzy-0.021
Token lots
Token ID6041
Feature activation-9.44e+00

Pos logit contributions
zzy+0.056
Of+0.056
aid+0.051
up+0.051
First+0.051
Neg logit contributions
icer-0.082
ocol-0.077
 repairing-0.076
 pretending-0.073
omp-0.073
Token of
Token ID286
Feature activation-5.57e-01

Pos logit contributions
drawn+0.273
up+0.261
zzy+0.258
large+0.252
First+0.251
Neg logit contributions
icer-0.237
 scoring-0.213
oc-0.213
 ramp-0.210
bing-0.209
Token things
Token ID1243
Feature activation-5.38e+00

Pos logit contributions
drawn+0.013
up+0.013
zzy+0.013
ave+0.013
aid+0.012
Neg logit contributions
icer-0.018
 pretending-0.016
 burying-0.016
ocol-0.016
 scoring-0.016
Token out
Token ID503
Feature activation-1.50e+00

Pos logit contributions
up+0.115
drawn+0.113
large+0.105
aid+0.105
ave+0.102
Neg logit contributions
icer-0.179
 scoring-0.168
 ramp-0.166
etti-0.165
bing-0.164
Token of
Token ID286
Feature activation+1.65e+00

Pos logit contributions
up+0.036
drawn+0.035
aid+0.033
large+0.033
zzy+0.032
Neg logit contributions
icer-0.047
 scoring-0.044
 ramp-0.043
etti-0.042
 exploding-0.042
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
icer+0.059
 scoring+0.055
 ramp+0.054
etti+0.054
 exploding+0.053
Neg logit contributions
up-0.034
drawn-0.032
aid-0.030
large-0.030
 lots-0.029
Token prince
Token ID19716
Feature activation+1.15e+00

Pos logit contributions
icer+0.046
 scoring+0.043
 ramp+0.042
 exploding+0.041
etti+0.041
Neg logit contributions
up-0.028
drawn-0.026
aid-0.025
large-0.024
First-0.024
Token and
Token ID290
Feature activation+3.95e-02

Pos logit contributions
icer+0.042
 scoring+0.039
 ramp+0.039
etti+0.038
 exploding+0.038
Neg logit contributions
up-0.023
drawn-0.021
aid-0.020
zzy-0.020
ave-0.020
Token princess
Token ID21752
Feature activation+1.26e+00

Pos logit contributions
icer+0.001
 scoring+0.001
 ramp+0.001
 exploding+0.001
etti+0.001
Neg logit contributions
up-0.001
drawn-0.001
aid-0.001
large-0.001
First-0.001
Token went
Token ID1816
Feature activation-3.17e+00

Pos logit contributions
icer+0.042
 scoring+0.040
etti+0.038
oc+0.038
 ramp+0.038
Neg logit contributions
up-0.027
drawn-0.026
large-0.025
aid-0.024
First-0.024
Token on
Token ID319
Feature activation-1.61e+00

Pos logit contributions
up+0.047
aid+0.043
drawn+0.042
large+0.041
ave+0.039
Neg logit contributions
icer-0.124
 scoring-0.123
 ramp-0.121
 exploding-0.119
etti-0.118
Token lots
Token ID6041
Feature activation-9.44e+00

Pos logit contributions
up+0.040
aid+0.036
ave+0.035
drawn+0.034
large+0.034
Neg logit contributions
icer-0.053
 scoring-0.051
 exploding-0.049
 burying-0.049
 ramp-0.048
Token of
Token ID286
Feature activation-3.17e-01

Pos logit contributions
up+0.204
drawn+0.180
 lots+0.172
 extra+0.172
aid+0.171
Neg logit contributions
icer-0.343
 scoring-0.314
 ramp-0.310
etti-0.309
 dent-0.298
Token fun
Token ID1257
Feature activation-1.89e+00

Pos logit contributions
up+0.008
aid+0.008
ave+0.008
drawn+0.007
large+0.007
Neg logit contributions
 scoring-0.010
icer-0.010
 pretending-0.010
 burying-0.009
 exploding-0.009
Token adventures
Token ID17545
Feature activation-1.72e-01

Pos logit contributions
up+0.046
drawn+0.044
aid+0.042
large+0.041
First+0.040
Neg logit contributions
icer-0.060
 scoring-0.056
 ramp-0.055
etti-0.054
 exploding-0.053
Token and
Token ID290
Feature activation-2.88e-01

Pos logit contributions
up+0.004
drawn+0.004
 lots+0.004
aid+0.004
 to+0.004
Neg logit contributions
icer-0.006
 scoring-0.005
 ramp-0.005
etti-0.005
 dent-0.005
Token then
Token ID788
Feature activation-1.58e+00

Pos logit contributions
up+0.006
drawn+0.006
aid+0.006
large+0.006
 lots+0.005
Neg logit contributions
icer-0.010
 scoring-0.009
 ramp-0.009
etti-0.009
bing-0.009
Token to
Token ID284
Feature activation-2.15e+00

Pos logit contributions
aid+0.039
up+0.039
drawn+0.035
large+0.035
ave+0.033
Neg logit contributions
 scoring-0.108
 ramp-0.107
icer-0.107
 pretending-0.104
 exploding-0.104
Token the
Token ID262
Feature activation-3.24e+00

Pos logit contributions
up+0.050
drawn+0.046
aid+0.045
large+0.044
ave+0.043
Neg logit contributions
icer-0.072
 scoring-0.068
 ramp-0.066
etti-0.064
 exploding-0.064
Token festival
Token ID10876
Feature activation-1.91e+00

Pos logit contributions
aid+0.112
ave+0.106
zzy+0.105
bs+0.105
up+0.102
Neg logit contributions
 scoring-0.071
 ramp-0.070
icer-0.069
 exploding-0.065
 sharing-0.063
Token and
Token ID290
Feature activation-9.51e-01

Pos logit contributions
up+0.046
drawn+0.044
 lots+0.042
 to+0.041
First+0.041
Neg logit contributions
icer-0.062
 scoring-0.056
ocol-0.055
 ramp-0.055
etti-0.054
Token took
Token ID1718
Feature activation-2.67e+00

Pos logit contributions
up+0.023
aid+0.023
drawn+0.021
large+0.021
First+0.019
Neg logit contributions
 scoring-0.029
icer-0.028
 exploding-0.027
 pretending-0.027
 ramp-0.027
Token lots
Token ID6041
Feature activation-9.44e+00

Pos logit contributions
Of+0.058
aid+0.058
 hesitate+0.056
ache+0.055
zzy+0.055
Neg logit contributions
 pretending-0.071
 exploding-0.071
icer-0.071
etti-0.071
 scoring-0.071
Token of
Token ID286
Feature activation-2.57e-01

Pos logit contributions
drawn+0.271
zzy+0.261
up+0.258
First+0.253
large+0.251
Neg logit contributions
icer-0.230
 scoring-0.216
 ramp-0.211
oc-0.211
 exploding-0.208
Token fun
Token ID1257
Feature activation-2.26e+00

Pos logit contributions
ave+0.006
zzy+0.006
drawn+0.006
up+0.006
Of+0.006
Neg logit contributions
icer-0.008
 pretending-0.007
ocol-0.007
 ramp-0.007
 scoring-0.007
Token things
Token ID1243
Feature activation-4.22e+00

Pos logit contributions
up+0.056
drawn+0.055
aid+0.054
large+0.052
ave+0.052
Neg logit contributions
icer-0.067
 scoring-0.064
 ramp-0.063
 exploding-0.062
bing-0.061
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
up+0.090
drawn+0.088
aid+0.084
large+0.083
First+0.080
Neg logit contributions
icer-0.136
 scoring-0.134
 ramp-0.131
 exploding-0.130
etti-0.130
Token They
Token ID1119
Feature activation+8.51e-01

Pos logit contributions
up+0.033
drawn+0.032
First+0.032
 lots+0.031
large+0.031
Neg logit contributions
icer-0.037
 scoring-0.035
 ramp-0.033
bing-0.032
 exploding-0.032
Token friend
Token ID1545
Feature activation-9.74e-01

Pos logit contributions
up+0.015
 sunglasses+0.014
 to+0.013
 extra+0.013
 big+0.013
Neg logit contributions
icer-0.032
etti-0.030
 scoring-0.030
 exploding-0.029
 ramp-0.028
Token.
Token ID13
Feature activation-6.29e-01

Pos logit contributions
up+0.019
 to+0.018
 lots+0.018
drawn+0.018
 extra+0.017
Neg logit contributions
icer-0.036
 ramp-0.033
 scoring-0.032
 dent-0.032
etti-0.032
Token They
Token ID1119
Feature activation+1.05e+00

Pos logit contributions
 lots+0.013
First+0.012
large+0.012
 to+0.012
 many+0.012
Neg logit contributions
icer-0.023
 ramp-0.022
 scoring-0.022
bing-0.021
 dent-0.021
Token went
Token ID1816
Feature activation-2.69e+00

Pos logit contributions
icer+0.039
 scoring+0.036
 ramp+0.036
etti+0.035
 exploding+0.035
Neg logit contributions
up-0.020
drawn-0.019
aid-0.018
large-0.017
zzy-0.017
Token on
Token ID319
Feature activation-1.21e+00

Pos logit contributions
up+0.046
drawn+0.043
 lots+0.040
 to+0.039
aid+0.039
Neg logit contributions
icer-0.107
 scoring-0.099
etti-0.097
 ramp-0.096
oc-0.096
Token lots
Token ID6041
Feature activation-9.43e+00

Pos logit contributions
up+0.027
aid+0.025
drawn+0.025
large+0.024
ave+0.024
Neg logit contributions
icer-0.042
 scoring-0.040
 ramp-0.038
 exploding-0.038
etti-0.037
Token of
Token ID286
Feature activation-2.96e-01

Pos logit contributions
up+0.187
 lots+0.156
 extra+0.156
drawn+0.153
large+0.151
Neg logit contributions
icer-0.366
etti-0.337
 scoring-0.334
 ramp-0.332
 dent-0.325
Token adventures
Token ID17545
Feature activation-8.75e-01

Pos logit contributions
up+0.006
aid+0.005
drawn+0.005
ave+0.005
large+0.005
Neg logit contributions
icer-0.011
 scoring-0.010
 ramp-0.010
 burying-0.010
 exploding-0.010
Token together
Token ID1978
Feature activation-1.01e+00

Pos logit contributions
 to+0.019
 lots+0.019
up+0.019
 many+0.018
drawn+0.018
Neg logit contributions
icer-0.031
 scoring-0.027
 dent-0.027
 ramp-0.027
ocol-0.027
Token,
Token ID11
Feature activation+4.21e-01

Pos logit contributions
 to+0.021
 lots+0.020
 many+0.019
 so+0.018
 extra+0.018
Neg logit contributions
icer-0.040
 ramp-0.037
 dent-0.036
etti-0.035
 spoiler-0.035
Token and
Token ID290
Feature activation-2.55e-01

Pos logit contributions
icer+0.015
ocol+0.014
 dent+0.014
etti+0.013
eros+0.013
Neg logit contributions
 lots-0.011
 to-0.011
 many-0.010
 so-0.010
 adventurous-0.010
Token friend
Token ID1545
Feature activation-9.74e-01

Pos logit contributions
up+0.015
 sunglasses+0.014
 to+0.013
 extra+0.013
 big+0.013
Neg logit contributions
icer-0.032
etti-0.030
 scoring-0.030
 exploding-0.029
 ramp-0.028
Token.
Token ID13
Feature activation-6.29e-01

Pos logit contributions
up+0.019
 to+0.018
 lots+0.018
drawn+0.018
 extra+0.017
Neg logit contributions
icer-0.036
 ramp-0.033
 scoring-0.032
 dent-0.032
etti-0.032
Token They
Token ID1119
Feature activation+1.05e+00

Pos logit contributions
 lots+0.013
First+0.012
large+0.012
 to+0.012
 many+0.012
Neg logit contributions
icer-0.023
 ramp-0.022
 scoring-0.022
bing-0.021
 dent-0.021
Token went
Token ID1816
Feature activation-2.69e+00

Pos logit contributions
icer+0.039
 scoring+0.036
 ramp+0.036
etti+0.035
 exploding+0.035
Neg logit contributions
up-0.020
drawn-0.019
aid-0.018
large-0.017
zzy-0.017
Token on
Token ID319
Feature activation-1.21e+00

Pos logit contributions
up+0.046
drawn+0.043
 lots+0.040
 to+0.039
aid+0.039
Neg logit contributions
icer-0.107
 scoring-0.099
etti-0.097
 ramp-0.096
oc-0.096
Token lots
Token ID6041
Feature activation-9.43e+00

Pos logit contributions
up+0.027
aid+0.025
drawn+0.025
large+0.024
ave+0.024
Neg logit contributions
icer-0.042
 scoring-0.040
 ramp-0.038
 exploding-0.038
etti-0.037
Token of
Token ID286
Feature activation-2.96e-01

Pos logit contributions
up+0.187
 lots+0.156
 extra+0.156
drawn+0.153
large+0.151
Neg logit contributions
icer-0.366
etti-0.337
 scoring-0.334
 ramp-0.332
 dent-0.325
Token adventures
Token ID17545
Feature activation-8.75e-01

Pos logit contributions
up+0.006
aid+0.005
drawn+0.005
ave+0.005
large+0.005
Neg logit contributions
icer-0.011
 scoring-0.010
 ramp-0.010
 burying-0.010
 exploding-0.010
Token together
Token ID1978
Feature activation-1.01e+00

Pos logit contributions
 to+0.019
 lots+0.019
up+0.019
 many+0.018
drawn+0.018
Neg logit contributions
icer-0.031
 scoring-0.027
 dent-0.027
 ramp-0.027
ocol-0.027
Token,
Token ID11
Feature activation+4.21e-01

Pos logit contributions
 to+0.021
 lots+0.020
 many+0.019
 so+0.018
 extra+0.018
Neg logit contributions
icer-0.040
 ramp-0.037
 dent-0.036
etti-0.035
 spoiler-0.035
Token and
Token ID290
Feature activation-2.55e-01

Pos logit contributions
icer+0.015
ocol+0.014
 dent+0.014
etti+0.013
eros+0.013
Neg logit contributions
 lots-0.011
 to-0.011
 many-0.010
 so-0.010
 adventurous-0.010
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
 lots+0.007
 many+0.007
 to+0.007
First+0.007
Of+0.006
Neg logit contributions
icer-0.013
bing-0.013
 exploding-0.013
ocol-0.013
 scoring-0.012
Token
Token ID198
Feature activation+3.46e-01

Pos logit contributions
icer+0.007
 scoring+0.007
etti+0.007
ocol+0.007
 exploding+0.007
Neg logit contributions
up-0.004
First-0.004
zzy-0.004
aid-0.004
drawn-0.004
TokenHe
Token ID1544
Feature activation+9.84e-01

Pos logit contributions
icer+0.016
arter+0.015
etti+0.015
eros+0.015
ocol+0.015
Neg logit contributions
First-0.004
up-0.004
aid-0.004
 lots-0.003
zzy-0.003
Token tried
Token ID3088
Feature activation-3.50e+00

Pos logit contributions
icer+0.037
 scoring+0.034
 ramp+0.033
ocol+0.033
etti+0.033
Neg logit contributions
up-0.019
 lots-0.017
drawn-0.017
aid-0.017
zzy-0.016
Token on
Token ID319
Feature activation-9.79e-01

Pos logit contributions
 lots+0.027
 to+0.026
 many+0.022
up+0.020
 extra+0.020
Neg logit contributions
icer-0.176
etti-0.165
eros-0.163
oc-0.163
 ramp-0.162
Token lots
Token ID6041
Feature activation-9.39e+00

Pos logit contributions
 lots+0.019
up+0.019
 extra+0.017
 many+0.017
 to+0.017
Neg logit contributions
icer-0.037
etti-0.035
arter-0.035
 exploding-0.034
 ramp-0.034
Token of
Token ID286
Feature activation+5.22e-02

Pos logit contributions
up+0.160
aid+0.135
 to+0.131
 extra+0.131
 lots+0.126
Neg logit contributions
icer-0.406
etti-0.379
 exploding-0.366
 ramp-0.363
 scoring-0.361
Token different
Token ID1180
Feature activation-2.25e+00

Pos logit contributions
icer+0.002
 scoring+0.002
 ramp+0.002
 exploding+0.002
etti+0.002
Neg logit contributions
up-0.001
drawn-0.001
aid-0.001
large-0.001
zzy-0.001
Token ones
Token ID3392
Feature activation-6.11e+00

Pos logit contributions
drawn+0.050
up+0.048
aid+0.048
zzy+0.047
large+0.046
Neg logit contributions
icer-0.074
 scoring-0.069
 ramp-0.067
bing-0.066
oc-0.065
Token,
Token ID11
Feature activation+8.06e-01

Pos logit contributions
up+0.147
 to+0.137
 lots+0.134
aid+0.131
 extra+0.131
Neg logit contributions
icer-0.204
 scoring-0.184
 ramp-0.184
etti-0.182
 exploding-0.178
Token until
Token ID1566
Feature activation-5.40e-01

Pos logit contributions
icer+0.026
etti+0.024
 scoring+0.024
 exploding+0.023
bing+0.023
Neg logit contributions
up-0.019
 lots-0.018
 extra-0.017
aid-0.017
large-0.017
Token,
Token ID11
Feature activation+1.71e-01

Pos logit contributions
 to+0.005
 lots+0.005
 extra+0.005
 so+0.005
up+0.004
Neg logit contributions
icer-0.010
ocol-0.009
 dent-0.009
 exploding-0.009
 scoring-0.009
Token and
Token ID290
Feature activation-4.28e-01

Pos logit contributions
icer+0.007
ocol+0.006
arter+0.006
etti+0.006
 ramp+0.006
Neg logit contributions
 to-0.004
 lots-0.004
 so-0.003
 decided-0.003
 many-0.003
Token they
Token ID484
Feature activation-2.96e-01

Pos logit contributions
up+0.009
drawn+0.008
aid+0.008
large+0.008
 lots+0.008
Neg logit contributions
icer-0.016
 scoring-0.014
 ramp-0.014
etti-0.014
 exploding-0.014
Token went
Token ID1816
Feature activation-2.64e+00

Pos logit contributions
up+0.006
drawn+0.005
aid+0.005
large+0.005
First+0.005
Neg logit contributions
icer-0.011
 scoring-0.010
 ramp-0.010
etti-0.010
 exploding-0.010
Token on
Token ID319
Feature activation-1.33e+00

Pos logit contributions
up+0.030
drawn+0.027
 to+0.025
 lots+0.025
 off+0.024
Neg logit contributions
icer-0.123
 scoring-0.112
oc-0.111
etti-0.110
 ramp-0.109
Token lots
Token ID6041
Feature activation-9.39e+00

Pos logit contributions
up+0.024
aid+0.021
drawn+0.021
ave+0.020
large+0.020
Neg logit contributions
icer-0.052
 scoring-0.050
 ramp-0.048
 exploding-0.048
etti-0.048
Token of
Token ID286
Feature activation-1.64e-01

Pos logit contributions
up+0.187
 extra+0.156
 to+0.152
 lots+0.152
drawn+0.151
Neg logit contributions
icer-0.370
etti-0.333
 scoring-0.332
 ramp-0.329
 dent-0.322
Token fun
Token ID1257
Feature activation-1.82e+00

Pos logit contributions
up+0.004
aid+0.004
drawn+0.004
ave+0.004
large+0.004
Neg logit contributions
icer-0.005
 scoring-0.005
 ramp-0.005
 exploding-0.005
 burying-0.005
Token adventures
Token ID17545
Feature activation-3.10e-02

Pos logit contributions
up+0.040
 lots+0.038
 to+0.037
 many+0.036
drawn+0.036
Neg logit contributions
icer-0.067
 dent-0.057
 ramp-0.057
eros-0.057
etti-0.057
Token together
Token ID1978
Feature activation-2.39e-01

Pos logit contributions
up+0.001
 lots+0.001
 to+0.001
drawn+0.001
 many+0.001
Neg logit contributions
icer-0.001
 scoring-0.001
 ramp-0.001
 dent-0.001
ocol-0.001
Token.
Token ID13
Feature activation-4.83e-01

Pos logit contributions
 to+0.004
 lots+0.003
 many+0.003
 off+0.003
 extra+0.003
Neg logit contributions
icer-0.011
 ramp-0.010
 dent-0.009
eros-0.009
ocol-0.009
Token
Token ID198
Feature activation+1.15e-01

Pos logit contributions
icer+0.002
 scoring+0.002
 ramp+0.002
bing+0.002
etti+0.002
Neg logit contributions
up-0.002
drawn-0.002
aid-0.002
large-0.002
ave-0.002
TokenThe
Token ID464
Feature activation+4.92e-01

Pos logit contributions
icer+0.004
 scoring+0.004
bing+0.003
 ramp+0.003
oc+0.003
Neg logit contributions
up-0.002
drawn-0.002
aid-0.002
large-0.002
 sunglasses-0.002
Token team
Token ID1074
Feature activation-2.48e+00

Pos logit contributions
icer+0.014
 scoring+0.014
 ramp+0.013
 exploding+0.013
etti+0.013
Neg logit contributions
up-0.013
drawn-0.012
aid-0.012
zzy-0.012
large-0.011
Token went
Token ID1816
Feature activation-2.42e+00

Pos logit contributions
up+0.046
 lots+0.038
drawn+0.038
ave+0.038
 decided+0.037
Neg logit contributions
icer-0.100
ocol-0.090
 scoring-0.090
etti-0.090
 ramp-0.089
Token on
Token ID319
Feature activation-1.34e+00

Pos logit contributions
up+0.030
 to+0.027
drawn+0.027
 lots+0.025
 off+0.024
Neg logit contributions
icer-0.112
oc-0.102
etti-0.101
ocol-0.100
 ramp-0.099
Token lots
Token ID6041
Feature activation-9.37e+00

Pos logit contributions
 lots+0.016
up+0.014
 many+0.014
 to+0.014
drawn+0.014
Neg logit contributions
icer-0.060
oc-0.056
 ramp-0.055
etti-0.055
ocol-0.054
Token of
Token ID286
Feature activation-5.03e-01

Pos logit contributions
up+0.177
 to+0.144
 extra+0.143
 lots+0.140
 off+0.136
Neg logit contributions
icer-0.384
etti-0.344
 ramp-0.343
 scoring-0.341
 dent-0.334
Token adventures
Token ID17545
Feature activation-9.87e-01

Pos logit contributions
up+0.010
drawn+0.010
aid+0.010
large+0.009
zzy+0.009
Neg logit contributions
icer-0.018
 scoring-0.017
 ramp-0.017
 exploding-0.016
bing-0.016
Token together
Token ID1978
Feature activation-1.28e+00

Pos logit contributions
 to+0.021
 lots+0.020
up+0.020
 many+0.018
 so+0.018
Neg logit contributions
icer-0.038
ocol-0.032
 scoring-0.032
 ramp-0.032
 dent-0.032
Token,
Token ID11
Feature activation+3.96e-01

Pos logit contributions
 to+0.025
 lots+0.023
up+0.022
 many+0.021
 off+0.020
Neg logit contributions
icer-0.055
 ramp-0.048
 dent-0.047
oc-0.046
eros-0.046
Token and
Token ID290
Feature activation-3.17e-01

Pos logit contributions
icer+0.015
ocol+0.014
eros+0.013
etti+0.013
 dent+0.013
Neg logit contributions
 lots-0.009
 so-0.009
 to-0.009
 many-0.008
 very-0.008
Token the
Token ID262
Feature activation-3.01e-01

Pos logit contributions
icer+0.071
 scoring+0.066
 ramp+0.065
bing+0.064
etti+0.064
Neg logit contributions
up-0.033
 lots-0.031
drawn-0.031
large-0.029
First-0.029
Token park
Token ID3952
Feature activation+1.81e+00

Pos logit contributions
aid+0.009
up+0.008
drawn+0.008
ave+0.008
zzy+0.008
Neg logit contributions
icer-0.008
 ramp-0.007
 scoring-0.007
 exploding-0.007
oc-0.007
Token.
Token ID13
Feature activation-4.78e-01

Pos logit contributions
icer+0.068
 scoring+0.062
 ramp+0.061
ocol+0.060
etti+0.060
Neg logit contributions
up-0.036
drawn-0.032
 lots-0.032
 to-0.030
zzy-0.030
Token They
Token ID1119
Feature activation+8.42e-01

Pos logit contributions
up+0.013
drawn+0.012
First+0.012
 lots+0.012
large+0.012
Neg logit contributions
icer-0.014
 scoring-0.013
bing-0.013
 ramp-0.012
ocol-0.012
Token took
Token ID1718
Feature activation-2.72e+00

Pos logit contributions
icer+0.029
 scoring+0.027
 ramp+0.027
etti+0.026
 exploding+0.026
Neg logit contributions
up-0.018
drawn-0.017
aid-0.016
large-0.016
First-0.015
Token lots
Token ID6041
Feature activation-9.37e+00

Pos logit contributions
up+0.059
aid+0.057
Of+0.056
ave+0.055
zzy+0.055
Neg logit contributions
icer-0.087
etti-0.078
 exploding-0.078
 scoring-0.077
 ramp-0.077
Token of
Token ID286
Feature activation-4.86e-01

Pos logit contributions
up+0.218
drawn+0.200
aid+0.196
large+0.191
First+0.185
Neg logit contributions
icer-0.316
 scoring-0.296
 ramp-0.287
etti-0.283
 exploding-0.280
Token delicious
Token ID12625
Feature activation-1.94e+00

Pos logit contributions
drawn+0.011
up+0.011
ave+0.010
zzy+0.010
large+0.010
Neg logit contributions
icer-0.016
 ramp-0.015
 burying-0.014
 scoring-0.014
 pretending-0.014
Token treats
Token ID18432
Feature activation-1.88e+00

Pos logit contributions
up+0.038
drawn+0.037
aid+0.035
zzy+0.034
large+0.034
Neg logit contributions
icer-0.069
 scoring-0.064
 ramp-0.063
 exploding-0.062
etti-0.062
Token and
Token ID290
Feature activation-6.19e-01

Pos logit contributions
up+0.046
drawn+0.044
aid+0.042
large+0.041
First+0.041
Neg logit contributions
icer-0.059
 scoring-0.055
 ramp-0.054
etti-0.053
 exploding-0.053
Token spread
Token ID4104
Feature activation-1.51e+00

Pos logit contributions
up+0.014
drawn+0.013
aid+0.013
zzy+0.012
large+0.012
Neg logit contributions
icer-0.019
 scoring-0.019
 ramp-0.018
 exploding-0.018
etti-0.018
Token into
Token ID656
Feature activation-2.51e+00

Pos logit contributions
up+0.094
drawn+0.088
aid+0.083
large+0.081
 lots+0.081
Neg logit contributions
icer-0.163
 scoring-0.150
 ramp-0.148
etti-0.146
oc-0.145
Token the
Token ID262
Feature activation-2.00e+00

Pos logit contributions
up+0.057
drawn+0.054
aid+0.052
large+0.050
zzy+0.049
Neg logit contributions
icer-0.085
 scoring-0.079
 ramp-0.077
etti-0.076
 exploding-0.076
Token garden
Token ID11376
Feature activation-6.46e-02

Pos logit contributions
up+0.057
drawn+0.056
aid+0.055
zzy+0.055
large+0.054
Neg logit contributions
icer-0.054
 ramp-0.049
 scoring-0.049
 exploding-0.046
oc-0.046
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
 to+0.001
 lots+0.001
up+0.001
 many+0.001
 so+0.001
Neg logit contributions
icer-0.002
ocol-0.002
 scoring-0.002
eros-0.002
 ramp-0.002
Token took
Token ID1718
Feature activation-2.92e+00

Pos logit contributions
up+0.026
drawn+0.024
aid+0.024
large+0.024
First+0.023
Neg logit contributions
 scoring-0.038
icer-0.037
 burying-0.036
 pretending-0.036
 ramp-0.036
Token lots
Token ID6041
Feature activation-9.37e+00

Pos logit contributions
First+0.063
ache+0.062
ave+0.062
zzy+0.062
earned+0.061
Neg logit contributions
etti-0.083
icer-0.081
ocol-0.080
 pretending-0.080
omp-0.078
Token of
Token ID286
Feature activation-2.50e-01

Pos logit contributions
up+0.229
drawn+0.219
aid+0.209
large+0.206
zzy+0.203
Neg logit contributions
icer-0.295
 scoring-0.275
 ramp-0.268
etti-0.263
 exploding-0.261
Token lovely
Token ID14081
Feature activation-1.43e+00

Pos logit contributions
up+0.005
drawn+0.005
zzy+0.005
ave+0.005
large+0.005
Neg logit contributions
icer-0.008
ocol-0.008
 burying-0.008
 scoring-0.008
 pretending-0.008
Token photos
Token ID5205
Feature activation-3.12e+00

Pos logit contributions
up+0.029
drawn+0.028
aid+0.027
large+0.026
zzy+0.026
Neg logit contributions
icer-0.050
 scoring-0.047
 ramp-0.046
 exploding-0.045
etti-0.045
Token of
Token ID286
Feature activation+4.46e+00

Pos logit contributions
up+0.071
drawn+0.067
aid+0.064
large+0.063
First+0.062
Neg logit contributions
icer-0.104
 scoring-0.097
 ramp-0.095
etti-0.093
 exploding-0.092
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
icer+0.165
 scoring+0.153
 ramp+0.153
etti+0.149
bing+0.147
Neg logit contributions
up-0.086
drawn-0.081
aid-0.076
 lots-0.075
large-0.075
Token checked
Token ID10667
Feature activation-8.66e-01

Pos logit contributions
drawn+0.004
up+0.004
zzy+0.004
talk+0.004
First+0.004
Neg logit contributions
 pretending-0.005
 ramp-0.005
 burying-0.005
 scoring-0.005
 repairing-0.005
Token her
Token ID607
Feature activation-6.43e-01

Pos logit contributions
up+0.019
drawn+0.018
large+0.017
aid+0.016
ave+0.016
Neg logit contributions
icer-0.030
bing-0.028
 scoring-0.027
 burying-0.026
 ramp-0.026
Token foot
Token ID2366
Feature activation-1.78e+00

Pos logit contributions
up+0.016
drawn+0.015
aid+0.015
large+0.015
First+0.014
Neg logit contributions
icer-0.020
 scoring-0.019
 ramp-0.018
bing-0.018
etti-0.018
Token.
Token ID13
Feature activation-6.27e-01

Pos logit contributions
up+0.042
drawn+0.041
large+0.039
aid+0.039
First+0.038
Neg logit contributions
icer-0.056
 scoring-0.052
 ramp-0.052
bing-0.050
etti-0.050
Token After
Token ID2293
Feature activation+1.77e-01

Pos logit contributions
up+0.016
drawn+0.016
aid+0.015
large+0.015
First+0.015
Neg logit contributions
icer-0.019
 scoring-0.018
 ramp-0.017
etti-0.017
 exploding-0.017
Token lots
Token ID6041
Feature activation-9.27e+00

Pos logit contributions
icer+0.008
 scoring+0.007
 ramp+0.007
oc+0.007
etti+0.007
Neg logit contributions
up-0.002
 lots-0.002
drawn-0.002
aid-0.002
 many-0.002
Token of
Token ID286
Feature activation+5.29e-01

Pos logit contributions
up+0.188
aid+0.171
 extra+0.157
drawn+0.157
bs+0.157
Neg logit contributions
icer-0.360
 scoring-0.328
 ramp-0.327
etti-0.326
 exploding-0.313
Token tests
Token ID5254
Feature activation-3.96e+00

Pos logit contributions
icer+0.020
 scoring+0.019
 ramp+0.018
 exploding+0.018
etti+0.018
Neg logit contributions
up-0.010
drawn-0.009
aid-0.009
large-0.009
zzy-0.008
Token,
Token ID11
Feature activation-9.89e-01

Pos logit contributions
up+0.097
aid+0.089
drawn+0.088
 lots+0.087
 to+0.087
Neg logit contributions
icer-0.134
 scoring-0.118
 ramp-0.118
oc-0.114
etti-0.114
Token the
Token ID262
Feature activation+5.27e-01

Pos logit contributions
up+0.027
drawn+0.026
aid+0.026
large+0.025
ave+0.025
Neg logit contributions
icer-0.028
 scoring-0.026
bing-0.026
 ramp-0.025
 burying-0.025
Token doctors
Token ID7519
Feature activation-1.02e+00

Pos logit contributions
icer+0.017
 ramp+0.016
 scoring+0.016
 dent+0.015
 exploding+0.015
Neg logit contributions
drawn-0.012
up-0.012
large-0.012
zzy-0.011
aid-0.011
Token to
Token ID284
Feature activation-8.42e-01

Pos logit contributions
up+0.028
drawn+0.026
aid+0.025
large+0.025
First+0.024
Neg logit contributions
icer-0.051
 scoring-0.048
 ramp-0.047
etti-0.047
 exploding-0.046
Token sleep
Token ID3993
Feature activation+2.02e+00

Pos logit contributions
up+0.021
drawn+0.020
aid+0.020
large+0.019
First+0.019
Neg logit contributions
icer-0.026
 scoring-0.024
 ramp-0.023
etti-0.023
 exploding-0.023
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
icer+0.067
 scoring+0.063
 ramp+0.061
etti+0.060
 exploding+0.060
Neg logit contributions
up-0.046
drawn-0.044
aid-0.042
large-0.041
zzy-0.040
Token Lucy
Token ID22162
Feature activation+5.02e-01

Pos logit contributions
up+0.008
drawn+0.008
aid+0.008
First+0.008
 lots+0.008
Neg logit contributions
icer-0.010
 scoring-0.009
 ramp-0.009
 exploding-0.009
bing-0.009
Token took
Token ID1718
Feature activation-2.50e+00

Pos logit contributions
icer+0.018
 scoring+0.016
 ramp+0.016
etti+0.016
 exploding+0.016
Neg logit contributions
up-0.011
drawn-0.010
aid-0.009
large-0.009
zzy-0.009
Token lots
Token ID6041
Feature activation-9.26e+00

Pos logit contributions
nam+0.054
 ju+0.053
large+0.053
drawn+0.052
First+0.051
Neg logit contributions
 pretending-0.068
omp-0.066
icer-0.065
 Be-0.065
bing-0.065
Token of
Token ID286
Feature activation-5.35e-01

Pos logit contributions
up+0.238
drawn+0.237
large+0.222
zzy+0.221
aid+0.221
Neg logit contributions
icer-0.265
 scoring-0.247
 ramp-0.240
bing-0.236
oc-0.236
Token cozy
Token ID37438
Feature activation-9.63e-01

Pos logit contributions
drawn+0.010
up+0.010
large+0.010
zzy+0.010
ave+0.010
Neg logit contributions
icer-0.018
 pretending-0.017
 ramp-0.017
 scoring-0.017
ocol-0.016
Token n
Token ID299
Feature activation+3.91e-01

Pos logit contributions
up+0.023
drawn+0.022
aid+0.021
large+0.021
zzy+0.021
Neg logit contributions
icer-0.031
 scoring-0.028
 ramp-0.028
etti-0.028
 exploding-0.027
Tokenaps
Token ID1686
Feature activation+1.07e+00

Pos logit contributions
icer+0.008
 scoring+0.007
 ramp+0.007
 exploding+0.007
oc+0.006
Neg logit contributions
up-0.014
drawn-0.014
aid-0.013
zzy-0.013
large-0.013
Token with
Token ID351
Feature activation+2.40e-01

Pos logit contributions
icer+0.031
 scoring+0.029
 ramp+0.028
etti+0.028
bing+0.027
Neg logit contributions
up-0.028
drawn-0.027
aid-0.026
large-0.026
First-0.025
Tokenadows
Token ID9797
Feature activation+2.40e+00

Pos logit contributions
icer+0.030
 scoring+0.029
 ramp+0.028
etti+0.028
 exploding+0.027
Neg logit contributions
up-0.015
drawn-0.014
aid-0.013
large-0.013
zzy-0.013
Token!
Token ID0
Feature activation-1.74e-01

Pos logit contributions
icer+0.088
 scoring+0.080
 ramp+0.079
etti+0.077
 dent+0.076
Neg logit contributions
up-0.049
 lots-0.045
drawn-0.045
 to-0.043
 extra-0.042
Token
Token ID198
Feature activation+6.98e-01

Pos logit contributions
 lots+0.003
First+0.003
large+0.003
drawn+0.003
Of+0.003
Neg logit contributions
icer-0.006
bing-0.006
 exploding-0.006
 scoring-0.006
etti-0.006
Token
Token ID198
Feature activation+6.85e-01

Pos logit contributions
icer+0.024
etti+0.023
 scoring+0.023
 exploding+0.022
 ramp+0.022
Neg logit contributions
up-0.015
First-0.014
zzy-0.014
 lots-0.014
aid-0.014
TokenAfter
Token ID3260
Feature activation+3.34e-01

Pos logit contributions
icer+0.028
etti+0.027
arter+0.026
 exploding+0.026
 dent+0.026
Neg logit contributions
First-0.012
up-0.011
zzy-0.011
aid-0.010
 lots-0.010
Token lots
Token ID6041
Feature activation-9.24e+00

Pos logit contributions
icer+0.015
 ramp+0.013
etti+0.013
 dent+0.013
bing+0.013
Neg logit contributions
 lots-0.005
 many-0.005
up-0.004
drawn-0.004
 extra-0.004
Token of
Token ID286
Feature activation+3.79e-01

Pos logit contributions
up+0.161
 extra+0.141
aid+0.140
 many+0.137
bs+0.135
Neg logit contributions
icer-0.396
etti-0.364
 ramp-0.354
 dent-0.341
 scoring-0.338
Token hopping
Token ID47153
Feature activation-6.54e-01

Pos logit contributions
icer+0.016
etti+0.015
 ramp+0.015
 scoring+0.015
bing+0.014
Neg logit contributions
up-0.006
drawn-0.005
 lots-0.005
aid-0.005
First-0.005
Token around
Token ID1088
Feature activation+2.16e-01

Pos logit contributions
up+0.013
drawn+0.012
aid+0.011
large+0.011
 lots+0.011
Neg logit contributions
icer-0.024
 scoring-0.022
 ramp-0.022
etti-0.022
 exploding-0.022
Token,
Token ID11
Feature activation-5.19e-01

Pos logit contributions
icer+0.009
etti+0.009
 dent+0.008
unch+0.008
omo+0.008
Neg logit contributions
 to-0.004
 lots-0.004
 many-0.003
 so-0.003
 a-0.003
Token the
Token ID262
Feature activation+8.83e-02

Pos logit contributions
 lots+0.011
 many+0.010
 to+0.010
 a+0.009
 an+0.009
Neg logit contributions
icer-0.019
etti-0.018
opl-0.017
arter-0.017
 dent-0.017
Token.
Token ID13
Feature activation-7.78e-01

Pos logit contributions
up+0.035
drawn+0.034
aid+0.033
large+0.032
First+0.031
Neg logit contributions
icer-0.046
 scoring-0.042
 ramp-0.042
etti-0.041
bing-0.040
Token
Token ID198
Feature activation-8.22e-02

Pos logit contributions
First+0.017
 lots+0.017
up+0.017
drawn+0.016
 to+0.016
Neg logit contributions
icer-0.026
 scoring-0.026
ocol-0.025
 exploding-0.025
bing-0.024
Token
Token ID198
Feature activation+1.79e-02

Pos logit contributions
up+0.002
drawn+0.002
aid+0.002
large+0.002
First+0.002
Neg logit contributions
icer-0.002
 scoring-0.002
 ramp-0.002
etti-0.002
 exploding-0.002
TokenHe
Token ID1544
Feature activation+8.52e-01

Pos logit contributions
icer+0.001
 scoring+0.001
etti+0.001
ocol+0.001
eros+0.001
Neg logit contributions
up-0.000
First-0.000
drawn-0.000
zzy-0.000
aid-0.000
Token asked
Token ID1965
Feature activation+2.22e-01

Pos logit contributions
icer+0.029
 ramp+0.027
 scoring+0.027
oc+0.026
 burying+0.026
Neg logit contributions
up-0.018
drawn-0.017
aid-0.017
large-0.017
First-0.016
Token lots
Token ID6041
Feature activation-9.23e+00

Pos logit contributions
icer+0.004
oc+0.004
 burying+0.004
 scoring+0.004
 sharing+0.004
Neg logit contributions
up-0.008
drawn-0.007
zzy-0.007
ave-0.007
aid-0.007
Token of
Token ID286
Feature activation+4.24e-01

Pos logit contributions
up+0.223
drawn+0.210
aid+0.202
large+0.199
First+0.194
Neg logit contributions
icer-0.296
 scoring-0.276
 ramp-0.269
etti-0.265
 exploding-0.263
Token people
Token ID661
Feature activation-3.89e+00

Pos logit contributions
icer+0.016
 scoring+0.015
ocol+0.015
 burying+0.015
 pretending+0.015
Neg logit contributions
drawn-0.007
up-0.007
ave-0.007
large-0.007
zzy-0.007
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
up+0.071
 to+0.062
drawn+0.061
 lots+0.060
aid+0.058
Neg logit contributions
icer-0.155
etti-0.141
 ramp-0.140
 scoring-0.139
 dent-0.138
Token but
Token ID475
Feature activation-3.42e-01

Pos logit contributions
up+0.024
 lots+0.024
drawn+0.023
First+0.023
aid+0.023
Neg logit contributions
icer-0.036
etti-0.033
 scoring-0.032
 ramp-0.031
bing-0.031
Token he
Token ID339
Feature activation+8.20e-01

Pos logit contributions
up+0.008
drawn+0.007
 lots+0.007
large+0.007
aid+0.007
Neg logit contributions
icer-0.012
 scoring-0.010
 ramp-0.010
etti-0.010
oc-0.010
Token worked
Token ID3111
Feature activation-1.23e+00

Pos logit contributions
up+0.028
drawn+0.025
 lots+0.025
aid+0.024
 to+0.024
Neg logit contributions
icer-0.054
 scoring-0.048
ocol-0.047
 ramp-0.047
etti-0.046
Token very
Token ID845
Feature activation-6.12e+00

Pos logit contributions
up+0.022
drawn+0.021
aid+0.019
large+0.019
First+0.019
Neg logit contributions
icer-0.047
 scoring-0.044
 ramp-0.043
etti-0.043
 exploding-0.042
Token hard
Token ID1327
Feature activation-2.97e+00

Pos logit contributions
up+0.064
drawn+0.055
aid+0.052
 lots+0.051
large+0.049
Neg logit contributions
icer-0.281
 scoring-0.260
etti-0.260
 ramp-0.256
 exploding-0.255
Token.
Token ID13
Feature activation-7.68e-01

Pos logit contributions
 to+0.048
up+0.047
 lots+0.046
 many+0.044
 extra+0.043
Neg logit contributions
icer-0.121
etti-0.108
 ramp-0.106
 scoring-0.105
ocol-0.105
Token After
Token ID2293
Feature activation+3.25e-01

Pos logit contributions
 lots+0.019
 many+0.018
First+0.018
 to+0.017
drawn+0.017
Neg logit contributions
icer-0.025
 scoring-0.024
bing-0.023
 exploding-0.023
ebra-0.023
Token lots
Token ID6041
Feature activation-9.21e+00

Pos logit contributions
icer+0.017
ebra+0.015
oc+0.015
arter+0.015
bing+0.015
Neg logit contributions
 lots-0.003
 many-0.003
 a-0.002
 to-0.002
 extra-0.002
Token of
Token ID286
Feature activation+7.09e-01

Pos logit contributions
up+0.125
 many+0.123
 to+0.121
 extra+0.121
 care+0.111
Neg logit contributions
icer-0.432
etti-0.388
 ramp-0.373
ric-0.371
stack-0.369
Token trying
Token ID2111
Feature activation-1.79e+00

Pos logit contributions
icer+0.034
etti+0.032
eros+0.030
bing+0.030
arter+0.030
Neg logit contributions
 many-0.008
 lots-0.008
 extra-0.008
 big-0.007
First-0.007
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 to+0.035
 lots+0.033
 many+0.032
 off+0.031
 extra+0.029
Neg logit contributions
icer-0.075
etti-0.066
oc-0.066
ebra-0.065
eros-0.065
Token he
Token ID339
Feature activation-2.35e-01

Pos logit contributions
 lots+0.020
 many+0.020
 to+0.019
 a+0.017
 an+0.017
Neg logit contributions
icer-0.040
etti-0.037
opl-0.036
arter-0.036
ebra-0.036
Token finally
Token ID3443
Feature activation-1.53e+00

Pos logit contributions
up+0.004
ave+0.004
 lots+0.004
 decided+0.004
 to+0.004
Neg logit contributions
icer-0.009
ocol-0.009
 scoring-0.008
arter-0.008
etti-0.008
Token,
Token ID11
Feature activation-1.56e+00

Pos logit contributions
up+0.041
 to+0.036
drawn+0.036
 lots+0.035
aid+0.034
Neg logit contributions
icer-0.087
 ramp-0.080
etti-0.079
 scoring-0.079
 exploding-0.077
Token "
Token ID366
Feature activation-2.71e+00

Pos logit contributions
 to+0.040
 very+0.037
 lots+0.037
 so+0.036
 extra+0.035
Neg logit contributions
icer-0.053
 ramp-0.047
etti-0.045
opl-0.044
 scoring-0.044
TokenWhat
Token ID2061
Feature activation-1.19e+00

Pos logit contributions
up+0.058
drawn+0.054
aid+0.053
First+0.052
large+0.051
Neg logit contributions
icer-0.092
 ramp-0.090
 scoring-0.088
etti-0.088
 exploding-0.086
Token are
Token ID389
Feature activation+2.79e+00

Pos logit contributions
up+0.033
drawn+0.032
aid+0.031
large+0.030
zzy+0.030
Neg logit contributions
icer-0.033
 scoring-0.031
 ramp-0.030
etti-0.029
 exploding-0.029
Token you
Token ID345
Feature activation+9.11e-01

Pos logit contributions
icer+0.058
 scoring+0.052
 burying+0.050
 exploding+0.049
ocol+0.048
Neg logit contributions
drawn-0.097
up-0.097
zzy-0.095
ave-0.093
aid-0.093
Token doing
Token ID1804
Feature activation+4.90e+00

Pos logit contributions
icer+0.018
 ramp+0.017
etti+0.016
 scoring+0.016
oc+0.015
Neg logit contributions
up-0.031
 lots-0.030
drawn-0.030
large-0.030
 extra-0.030
Token?"
Token ID1701
Feature activation-2.43e+00

Pos logit contributions
icer+0.193
 ramp+0.176
etti+0.175
 scoring+0.175
oc+0.174
Neg logit contributions
up-0.081
 to-0.079
 news-0.074
 lots-0.074
 extra-0.072
Token
Token ID198
Feature activation+1.81e-01

Pos logit contributions
up+0.069
drawn+0.066
aid+0.063
large+0.062
First+0.061
Neg logit contributions
icer-0.068
 scoring-0.063
 ramp-0.061
etti-0.060
 exploding-0.059
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
icer+0.005
 scoring+0.005
 ramp+0.005
etti+0.005
 exploding+0.005
Neg logit contributions
up-0.005
drawn-0.005
aid-0.005
large-0.005
First-0.004
TokenThe
Token ID464
Feature activation+2.74e-01

Pos logit contributions
icer+0.008
 scoring+0.007
 ramp+0.007
etti+0.007
oc+0.007
Neg logit contributions
up-0.005
drawn-0.004
aid-0.004
large-0.004
zzy-0.004
Token man
Token ID582
Feature activation-3.17e-01

Pos logit contributions
icer+0.006
 scoring+0.006
 ramp+0.006
oc+0.005
eros+0.005
Neg logit contributions
drawn-0.009
up-0.009
First-0.009
zzy-0.009
aid-0.008
Token "
Token ID366
Feature activation-1.27e+00

Pos logit contributions
icer+0.022
 ramp+0.018
ocol+0.018
omp+0.018
oc+0.018
Neg logit contributions
 to-0.015
 lots-0.014
 many-0.013
 very-0.013
Of-0.013
TokenThere
Token ID1858
Feature activation+4.46e+00

Pos logit contributions
up+0.028
drawn+0.027
First+0.026
aid+0.026
large+0.025
Neg logit contributions
icer-0.043
 burying-0.042
 scoring-0.041
 ramp-0.041
arter-0.040
Token,
Token ID11
Feature activation+2.62e-01

Pos logit contributions
icer+0.148
 scoring+0.143
ocol+0.140
oc+0.139
 ramp+0.138
Neg logit contributions
up-0.104
 lots-0.099
 to-0.095
 many-0.094
 big-0.091
Token it
Token ID340
Feature activation-3.97e-01

Pos logit contributions
icer+0.010
 scoring+0.009
oc+0.009
bing+0.009
omp+0.009
Neg logit contributions
up-0.005
 lots-0.005
drawn-0.005
aid-0.005
 many-0.005
Token is
Token ID318
Feature activation+3.29e+00

Pos logit contributions
up+0.010
 prett+0.008
 big+0.008
 off+0.008
 to+0.008
Neg logit contributions
ocol-0.013
icer-0.013
 scoring-0.013
eros-0.012
 lending-0.012
Token fixed
Token ID5969
Feature activation+5.07e+00

Pos logit contributions
icer+0.134
 scoring+0.126
oc+0.125
 ramp+0.122
etti+0.122
Neg logit contributions
up-0.050
drawn-0.046
 lots-0.045
large-0.042
 extra-0.042
Token.
Token ID13
Feature activation+2.84e+00

Pos logit contributions
icer+0.179
 scoring+0.173
ocol+0.165
 ramp+0.163
omp+0.161
Neg logit contributions
up-0.103
 to-0.094
 lots-0.093
 off-0.091
drawn-0.089
Token Good
Token ID4599
Feature activation+2.45e+00

Pos logit contributions
icer+0.074
 scoring+0.070
bing+0.066
 burying+0.064
 exploding+0.063
Neg logit contributions
up-0.084
aid-0.083
drawn-0.083
large-0.082
First-0.080
Token as
Token ID355
Feature activation-8.41e-01

Pos logit contributions
icer+0.085
 scoring+0.079
 ramp+0.078
etti+0.076
 exploding+0.076
Neg logit contributions
up-0.053
drawn-0.051
aid-0.048
large-0.048
First-0.047
Token new
Token ID649
Feature activation+1.02e+00

Pos logit contributions
up+0.018
drawn+0.017
aid+0.016
 lots+0.016
large+0.016
Neg logit contributions
icer-0.029
 scoring-0.027
 ramp-0.027
oc-0.026
etti-0.026
Token."
Token ID526
Feature activation-2.74e-01

Pos logit contributions
icer+0.037
 scoring+0.032
omp+0.032
etti+0.032
 exploding+0.031
Neg logit contributions
up-0.022
 to-0.020
drawn-0.020
 lots-0.020
aid-0.019
Token"
Token ID1
Feature activation-3.73e+00

Pos logit contributions
icer+0.066
arter+0.060
etti+0.060
eros+0.059
 scoring+0.059
Neg logit contributions
First-0.032
up-0.030
aid-0.029
A-0.027
zzy-0.027
TokenWow
Token ID22017
Feature activation+3.72e+00

Pos logit contributions
up+0.092
drawn+0.087
aid+0.084
large+0.082
First+0.082
Neg logit contributions
icer-0.119
 scoring-0.110
 ramp-0.108
etti-0.105
 exploding-0.105
Token,
Token ID11
Feature activation+1.86e+00

Pos logit contributions
icer+0.124
 scoring+0.118
oc+0.112
 ramp+0.112
 exploding+0.110
Neg logit contributions
up-0.084
 lots-0.077
drawn-0.076
 to-0.073
 off-0.073
Token this
Token ID428
Feature activation+2.17e+00

Pos logit contributions
icer+0.058
 scoring+0.055
 ramp+0.054
etti+0.052
 exploding+0.052
Neg logit contributions
up-0.046
drawn-0.043
aid-0.042
large-0.041
First-0.041
Token is
Token ID318
Feature activation+4.76e+00

Pos logit contributions
icer+0.050
 scoring+0.045
 ramp+0.043
etti+0.042
 exploding+0.042
Neg logit contributions
up-0.072
drawn-0.069
aid-0.066
large-0.066
First-0.065
Token amazing
Token ID4998
Feature activation+5.04e+00

Pos logit contributions
icer+0.199
 scoring+0.190
 ramp+0.187
 exploding+0.185
etti+0.185
Neg logit contributions
up-0.069
drawn-0.062
aid-0.061
large-0.056
ave-0.055
Token!"
Token ID2474
Feature activation-2.34e+00

Pos logit contributions
icer+0.163
 scoring+0.150
 ramp+0.145
etti+0.143
 exploding+0.142
Neg logit contributions
up-0.123
drawn-0.115
aid-0.108
large-0.108
 lots-0.107
Token Ben
Token ID3932
Feature activation-3.59e-01

Pos logit contributions
up+0.065
drawn+0.061
 lots+0.058
aid+0.058
First+0.058
Neg logit contributions
icer-0.071
 scoring-0.063
bing-0.061
etti-0.061
 ramp-0.060
Token says
Token ID1139
Feature activation+1.88e+00

Pos logit contributions
up+0.008
drawn+0.008
aid+0.007
large+0.007
First+0.007
Neg logit contributions
icer-0.012
 scoring-0.011
 ramp-0.011
etti-0.011
 exploding-0.011
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
icer+0.060
 scoring+0.057
 ramp+0.055
etti+0.054
 exploding+0.054
Neg logit contributions
up-0.045
drawn-0.042
aid-0.041
large-0.040
First-0.039
Token
Token ID198
Feature activation+1.90e+00

Pos logit contributions
icer+0.046
 scoring+0.041
etti+0.041
bing+0.040
 exploding+0.040
Neg logit contributions
up-0.034
 to-0.033
drawn-0.032
 lots-0.032
aid-0.032
Token Anna
Token ID11735
Feature activation+3.45e-01

Pos logit contributions
icer+0.048
 scoring+0.045
 ramp+0.043
bing+0.043
 exploding+0.043
Neg logit contributions
up-0.037
drawn-0.035
First-0.034
aid-0.034
 lots-0.033
Token thinks
Token ID6834
Feature activation-1.47e+00

Pos logit contributions
icer+0.012
ocol+0.011
etti+0.011
arter+0.011
 scoring+0.011
Neg logit contributions
up-0.008
drawn-0.007
 to-0.007
 many-0.007
 lots-0.007
Token the
Token ID262
Feature activation+2.79e+00

Pos logit contributions
up+0.033
 to+0.032
 many+0.032
 lots+0.031
drawn+0.030
Neg logit contributions
icer-0.052
oc-0.048
 scoring-0.046
bing-0.045
 ramp-0.044
Token car
Token ID1097
Feature activation+1.40e+00

Pos logit contributions
icer+0.067
 scoring+0.060
etti+0.057
bing+0.057
 ramp+0.056
Neg logit contributions
up-0.091
drawn-0.087
aid-0.084
large-0.083
First-0.082
Token is
Token ID318
Feature activation+4.03e+00

Pos logit contributions
icer+0.038
 scoring+0.035
etti+0.033
bing+0.033
 exploding+0.033
Neg logit contributions
up-0.043
drawn-0.039
aid-0.039
ave-0.037
 lots-0.037
Token pretty
Token ID2495
Feature activation+5.02e+00

Pos logit contributions
icer+0.182
oc+0.167
 scoring+0.165
 ramp+0.165
etti+0.165
Neg logit contributions
up-0.049
drawn-0.042
aid-0.041
 lots-0.041
 prett-0.040
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
icer+0.149
 scoring+0.138
 ramp+0.135
etti+0.133
 exploding+0.131
Neg logit contributions
up-0.131
drawn-0.127
aid-0.123
large-0.120
zzy-0.119
Token She
Token ID1375
Feature activation+2.24e+00

Pos logit contributions
icer+0.048
 scoring+0.044
 ramp+0.043
 You+0.042
etti+0.041
Neg logit contributions
up-0.048
drawn-0.046
aid-0.045
large-0.044
zzy-0.043
Token wants
Token ID3382
Feature activation-1.92e+00

Pos logit contributions
icer+0.067
 scoring+0.062
 ramp+0.060
etti+0.059
 exploding+0.059
Neg logit contributions
up-0.059
drawn-0.056
aid-0.054
large-0.053
First-0.052
Token to
Token ID284
Feature activation+2.58e-01

Pos logit contributions
zzy+0.039
large+0.035
ave+0.035
drawn+0.035
First+0.033
Neg logit contributions
 scoring-0.066
 pretending-0.066
 sharing-0.066
 Teach-0.063
 fighting-0.062
Token see
Token ID766
Feature activation-1.25e-01

Pos logit contributions
 ramp+0.007
 scoring+0.007
icer+0.006
etti+0.006
 exploding+0.006
Neg logit contributions
up-0.007
ave-0.007
drawn-0.007
aid-0.007
large-0.007
Token what
Token ID644
Feature activation-1.53e+00

Pos logit contributions
icer+0.031
 scoring+0.029
bing+0.028
 ramp+0.027
oc+0.027
Neg logit contributions
up-0.023
drawn-0.022
large-0.021
aid-0.021
zzy-0.020
Token Mrs
Token ID9074
Feature activation+2.38e+00

Pos logit contributions
up+0.037
drawn+0.036
zzy+0.032
large+0.032
First+0.032
Neg logit contributions
 scoring-0.047
icer-0.046
 sharing-0.045
 lending-0.045
 repairing-0.044
Token.
Token ID13
Feature activation+1.73e+00

Pos logit contributions
icer+0.089
 scoring+0.084
 ramp+0.084
omp+0.080
 exploding+0.080
Neg logit contributions
up-0.047
 to-0.040
 news-0.040
aid-0.040
drawn-0.040
Token Brown
Token ID4373
Feature activation+2.27e+00

Pos logit contributions
 scoring+0.051
icer+0.050
omp+0.050
ocol+0.049
 exploding+0.049
Neg logit contributions
 to-0.039
drawn-0.038
 lots-0.036
up-0.035
 nos-0.035
Token meant
Token ID4001
Feature activation+1.44e+00

Pos logit contributions
icer+0.089
 ramp+0.083
 scoring+0.082
etti+0.080
ocol+0.079
Neg logit contributions
up-0.046
aid-0.039
ave-0.037
 decided-0.037
 listened-0.036
Token by
Token ID416
Feature activation+5.17e+00

Pos logit contributions
icer+0.058
 ramp+0.051
etti+0.050
oc+0.049
 scoring+0.049
Neg logit contributions
 to-0.030
up-0.029
 lots-0.027
 extra-0.027
 many-0.026
Token "
Token ID366
Feature activation-6.79e-01

Pos logit contributions
icer+0.161
 ramp+0.151
etti+0.146
 exploding+0.139
oc+0.139
Neg logit contributions
 lots-0.131
 many-0.124
up-0.121
 sunglasses-0.120
 extra-0.119
Tokensp
Token ID2777
Feature activation+2.10e+00

Pos logit contributions
up+0.015
drawn+0.015
large+0.014
zzy+0.014
First+0.013
Neg logit contributions
icer-0.022
 scoring-0.021
oc-0.020
bing-0.020
 ramp-0.019
Tokenoil
Token ID9437
Feature activation+2.19e+00

Pos logit contributions
 scoring+0.056
 ramp+0.054
 burying+0.053
 pretending+0.053
 exploding+0.051
Neg logit contributions
up-0.063
aid-0.059
zzy-0.057
drawn-0.057
ave-0.056
Token",
Token ID1600
Feature activation-9.28e-01

Pos logit contributions
icer+0.064
 scoring+0.059
bing+0.057
oc+0.057
 ramp+0.056
Neg logit contributions
up-0.057
drawn-0.057
aid-0.054
large-0.053
First-0.052
Token but
Token ID475
Feature activation+5.92e-01

Pos logit contributions
up+0.022
aid+0.020
 lots+0.020
drawn+0.020
 to+0.019
Neg logit contributions
icer-0.032
 scoring-0.029
 ramp-0.028
 exploding-0.028
ocol-0.027
Token was
Token ID373
Feature activation-1.55e+00

Pos logit contributions
 scoring+0.039
icer+0.038
ocol+0.038
 ramp+0.038
arter+0.037
Neg logit contributions
 lots-0.013
 to-0.013
 decided-0.013
 very-0.012
up-0.012
Token a
Token ID257
Feature activation-2.70e+00

Pos logit contributions
up+0.018
drawn+0.015
aid+0.014
ave+0.013
large+0.013
Neg logit contributions
icer-0.070
 scoring-0.066
etti-0.064
 ramp-0.064
 exploding-0.064
Token little
Token ID1310
Feature activation+3.80e-01

Pos logit contributions
 big+0.027
up+0.023
 pretty+0.022
 nice+0.021
 very+0.021
Neg logit contributions
icer-0.124
etti-0.118
 scoring-0.116
omp-0.116
bing-0.116
Token girl
Token ID2576
Feature activation+1.47e+00

Pos logit contributions
icer+0.015
 scoring+0.014
etti+0.014
 exploding+0.014
 ramp+0.013
Neg logit contributions
up-0.006
drawn-0.006
aid-0.006
 big-0.005
 prett-0.005
Token called
Token ID1444
Feature activation+9.33e-01

Pos logit contributions
icer+0.035
 scoring+0.032
 ramp+0.031
etti+0.030
oc+0.030
Neg logit contributions
up-0.047
drawn-0.045
aid-0.044
large-0.043
First-0.043
Token Sarah
Token ID10490
Feature activation+1.67e+00

Pos logit contributions
icer+0.024
 scoring+0.022
etti+0.022
 ramp+0.022
 exploding+0.021
Neg logit contributions
up-0.028
drawn-0.026
aid-0.026
ave-0.025
large-0.025
Token.
Token ID13
Feature activation-7.89e-01

Pos logit contributions
icer+0.047
 scoring+0.043
oc+0.043
etti+0.042
 ramp+0.042
Neg logit contributions
up-0.045
drawn-0.043
aid-0.042
large-0.042
First-0.041
Token She
Token ID1375
Feature activation+1.59e+00

Pos logit contributions
up+0.021
drawn+0.020
aid+0.019
large+0.019
zzy+0.018
Neg logit contributions
icer-0.023
 scoring-0.022
 ramp-0.021
etti-0.021
 exploding-0.021
Token was
Token ID373
Feature activation+6.82e-01

Pos logit contributions
icer+0.053
 scoring+0.050
 ramp+0.048
 exploding+0.047
etti+0.047
Neg logit contributions
up-0.036
drawn-0.034
aid-0.033
large-0.033
First-0.032
Token three
Token ID1115
Feature activation-7.48e-01

Pos logit contributions
icer+0.030
 scoring+0.028
 ramp+0.028
etti+0.027
 exploding+0.027
Neg logit contributions
up-0.009
drawn-0.008
aid-0.007
large-0.007
First-0.007
Token years
Token ID812
Feature activation-1.03e+00

Pos logit contributions
drawn+0.024
up+0.024
aid+0.024
zzy+0.023
First+0.023
Neg logit contributions
icer-0.018
oc-0.017
bing-0.016
 scoring-0.015
 pretending-0.015
Token time
Token ID640
Feature activation-7.14e-01

Pos logit contributions
up+0.123
drawn+0.117
aid+0.112
zzy+0.111
large+0.111
Neg logit contributions
icer-0.159
 scoring-0.149
 ramp-0.146
etti-0.144
 exploding-0.142
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
 to+0.013
 lots+0.012
 many+0.011
 big+0.011
 very+0.011
Neg logit contributions
icer-0.028
 ramp-0.027
 scoring-0.026
 exploding-0.026
etti-0.026
Token there
Token ID612
Feature activation+1.44e+00

Pos logit contributions
 lots+0.029
up+0.029
 many+0.029
zzy+0.028
drawn+0.028
Neg logit contributions
icer-0.076
 scoring-0.073
etti-0.072
 ramp-0.072
opl-0.070
Token was
Token ID373
Feature activation-8.59e-01

Pos logit contributions
ocol+0.062
etti+0.060
icer+0.060
 ramp+0.060
 scoring+0.059
Neg logit contributions
 lots-0.020
 to-0.018
 many-0.018
 big-0.018
 very-0.017
Token a
Token ID257
Feature activation-2.03e+00

Pos logit contributions
up+0.009
drawn+0.008
aid+0.007
large+0.006
zzy+0.006
Neg logit contributions
icer-0.040
 scoring-0.038
 ramp-0.037
etti-0.037
 exploding-0.036
Token gray
Token ID12768
Feature activation+6.98e-01

Pos logit contributions
 big+0.019
up+0.018
 pretty+0.016
drawn+0.015
 nice+0.015
Neg logit contributions
icer-0.094
etti-0.088
 scoring-0.087
bing-0.087
omp-0.086
Token fish
Token ID5916
Feature activation+1.22e+00

Pos logit contributions
icer+0.023
 scoring+0.022
bing+0.021
etti+0.021
 exploding+0.021
Neg logit contributions
up-0.015
 big-0.014
 sunglasses-0.014
drawn-0.014
 lots-0.014
Token who
Token ID508
Feature activation+1.56e+00

Pos logit contributions
icer+0.040
etti+0.038
 ramp+0.038
 scoring+0.037
ocol+0.036
Neg logit contributions
up-0.028
 lots-0.026
 to-0.026
 many-0.026
aid-0.025
Token lived
Token ID5615
Feature activation+2.69e+00

Pos logit contributions
icer+0.065
etti+0.062
ocol+0.061
 scoring+0.061
 ramp+0.060
Neg logit contributions
 lots-0.023
 decided-0.023
 very-0.022
up-0.022
 many-0.021
Token in
Token ID287
Feature activation+2.33e+00

Pos logit contributions
icer+0.105
etti+0.098
 ramp+0.098
 scoring+0.098
oc+0.095
Neg logit contributions
up-0.045
 lots-0.041
drawn-0.041
aid-0.040
 many-0.039
Token the
Token ID262
Feature activation-1.49e-01

Pos logit contributions
icer+0.121
etti+0.118
 ramp+0.117
 scoring+0.117
bing+0.116
Neg logit contributions
 lots-0.011
 many-0.010
 big-0.009
 a-0.009
 an-0.006
Token,
Token ID11
Feature activation+8.89e-01

Pos logit contributions
up+0.004
drawn+0.004
aid+0.004
large+0.004
zzy+0.004
Neg logit contributions
icer-0.007
 scoring-0.006
 ramp-0.006
 exploding-0.006
oc-0.006
Token round
Token ID2835
Feature activation+2.84e-01

Pos logit contributions
icer+0.039
etti+0.036
 ramp+0.035
 scoring+0.035
bing+0.035
Neg logit contributions
 lots-0.011
up-0.011
 sunglasses-0.010
 many-0.010
 extra-0.010
Token balloons
Token ID35485
Feature activation+2.79e-01

Pos logit contributions
icer+0.010
 scoring+0.009
 ramp+0.009
etti+0.009
 exploding+0.009
Neg logit contributions
up-0.006
drawn-0.006
aid-0.005
large-0.005
zzy-0.005
Token.
Token ID13
Feature activation-6.30e-02

Pos logit contributions
icer+0.013
oc+0.011
 ramp+0.011
ocol+0.011
omp+0.011
Neg logit contributions
 to-0.005
 off-0.004
 very-0.004
 lots-0.004
 so-0.004
Token Sue
Token ID26089
Feature activation-1.32e-01

Pos logit contributions
up+0.002
drawn+0.002
aid+0.002
large+0.002
First+0.002
Neg logit contributions
icer-0.002
 scoring-0.002
 ramp-0.002
etti-0.002
 exploding-0.002
Token's
Token ID338
Feature activation-1.20e-01

Pos logit contributions
up+0.002
drawn+0.002
ave+0.002
aid+0.002
 lots+0.002
Neg logit contributions
icer-0.006
 ramp-0.005
 scoring-0.005
etti-0.005
ocol-0.005
Token mom
Token ID1995
Feature activation-1.88e+00

Pos logit contributions
up+0.002
drawn+0.002
aid+0.002
large+0.002
 lots+0.002
Neg logit contributions
icer-0.005
 scoring-0.004
 ramp-0.004
etti-0.004
 exploding-0.004
Token bought
Token ID5839
Feature activation+1.98e-01

Pos logit contributions
up+0.022
 decided+0.019
 listened+0.017
ave+0.016
 lots+0.016
Neg logit contributions
icer-0.088
 ramp-0.084
ocol-0.084
 scoring-0.082
arter-0.081
Token her
Token ID607
Feature activation-7.73e-01

Pos logit contributions
icer+0.008
 scoring+0.007
 ramp+0.007
etti+0.007
 exploding+0.007
Neg logit contributions
up-0.003
drawn-0.003
aid-0.003
large-0.003
zzy-0.003
Token a
Token ID257
Feature activation-2.68e+00

Pos logit contributions
up+0.015
aid+0.014
drawn+0.014
ave+0.013
zzy+0.013
Neg logit contributions
icer-0.029
 scoring-0.027
oc-0.026
 burying-0.026
 pretending-0.026
Token red
Token ID2266
Feature activation+9.78e-01

Pos logit contributions
up+0.055
drawn+0.051
aid+0.050
ave+0.049
zzy+0.048
Neg logit contributions
icer-0.096
 scoring-0.091
 ramp-0.091
oc-0.086
 exploding-0.086
Token wash
Token ID13502
Feature activation+1.22e+00

Pos logit contributions
icer+0.007
 scoring+0.007
 ramp+0.006
etti+0.006
 exploding+0.006
Neg logit contributions
up-0.006
drawn-0.005
aid-0.005
large-0.005
First-0.005
Token him
Token ID683
Feature activation+1.56e+00

Pos logit contributions
icer+0.036
 scoring+0.034
 ramp+0.034
etti+0.033
 exploding+0.033
Neg logit contributions
up-0.032
drawn-0.030
aid-0.029
 lots-0.029
large-0.028
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
icer+0.052
 scoring+0.049
 ramp+0.048
etti+0.047
 exploding+0.047
Neg logit contributions
up-0.036
drawn-0.033
aid-0.032
large-0.031
First-0.031
Token Then
Token ID3244
Feature activation+1.36e+00

Pos logit contributions
icer+0.059
 scoring+0.056
bing+0.054
 exploding+0.053
etti+0.051
Neg logit contributions
up-0.059
large-0.057
First-0.056
drawn-0.056
aid-0.056
Token he
Token ID339
Feature activation-5.09e-02

Pos logit contributions
icer+0.039
 scoring+0.035
 ramp+0.034
etti+0.033
 exploding+0.033
Neg logit contributions
up-0.039
drawn-0.036
aid-0.035
large-0.035
 lots-0.035
Token will
Token ID481
Feature activation+1.65e-01

Pos logit contributions
up+0.001
aid+0.001
ave+0.001
drawn+0.001
zzy+0.001
Neg logit contributions
icer-0.002
 scoring-0.002
etti-0.002
bing-0.002
 exploding-0.002
Token be
Token ID307
Feature activation-1.73e-01

Pos logit contributions
icer+0.006
 scoring+0.005
 ramp+0.005
etti+0.005
bing+0.005
Neg logit contributions
up-0.004
drawn-0.003
aid-0.003
 lots-0.003
zzy-0.003
Token clean
Token ID3424
Feature activation+3.53e+00

Pos logit contributions
up+0.003
drawn+0.003
large+0.003
aid+0.003
First+0.002
Neg logit contributions
icer-0.007
 scoring-0.006
 ramp-0.006
 exploding-0.006
bing-0.006
Token and
Token ID290
Feature activation+4.61e-01

Pos logit contributions
icer+0.115
 scoring+0.110
 ramp+0.106
etti+0.106
ocol+0.106
Neg logit contributions
up-0.079
 lots-0.075
 to-0.073
 extra-0.072
 off-0.071
Token fresh
Token ID4713
Feature activation+2.34e+00

Pos logit contributions
icer+0.017
 scoring+0.016
 ramp+0.016
 exploding+0.016
 burying+0.016
Neg logit contributions
up-0.009
drawn-0.008
zzy-0.008
large-0.007
First-0.007
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
icer+0.070
 scoring+0.064
oc+0.063
 ramp+0.062
bing+0.062
Neg logit contributions
up-0.060
drawn-0.059
large-0.056
aid-0.056
zzy-0.055
Token.
Token ID13
Feature activation-7.94e-01

Pos logit contributions
up+0.001
drawn+0.001
aid+0.001
large+0.001
zzy+0.001
Neg logit contributions
icer-0.002
 scoring-0.002
 ramp-0.002
etti-0.002
 exploding-0.002
Token One
Token ID1881
Feature activation-2.78e+00

Pos logit contributions
up+0.014
First+0.014
drawn+0.014
 lots+0.013
 to+0.013
Neg logit contributions
icer-0.029
 scoring-0.028
 exploding-0.027
bing-0.027
 ramp-0.027
Token day
Token ID1110
Feature activation-3.33e-01

Pos logit contributions
up+0.091
drawn+0.088
aid+0.086
First+0.082
talk+0.082
Neg logit contributions
icer-0.064
oc-0.057
 scoring-0.057
 exploding-0.055
 sharing-0.054
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
up+0.010
drawn+0.009
aid+0.009
large+0.009
First+0.009
Neg logit contributions
icer-0.009
 scoring-0.008
 ramp-0.008
etti-0.008
 exploding-0.008
Token she
Token ID673
Feature activation-9.00e-01

Pos logit contributions
up+0.031
drawn+0.029
aid+0.028
large+0.027
First+0.026
Neg logit contributions
icer-0.056
 scoring-0.052
 ramp-0.052
etti-0.051
 exploding-0.050
Token was
Token ID373
Feature activation-3.73e-02

Pos logit contributions
aid+0.019
up+0.019
drawn+0.019
large+0.018
First+0.017
Neg logit contributions
icer-0.030
 scoring-0.028
 ramp-0.027
 exploding-0.027
oc-0.027
Token playing
Token ID2712
Feature activation+2.65e+00

Pos logit contributions
up+0.002
drawn+0.002
aid+0.002
large+0.002
zzy+0.002
Neg logit contributions
icer-0.000
 scoring-0.000
 pretending-0.000
 burying-0.000
 exploding-0.000
Token with
Token ID351
Feature activation+5.81e-01

Pos logit contributions
icer+0.092
etti+0.083
 scoring+0.082
 ramp+0.081
ocol+0.081
Neg logit contributions
up-0.059
 lots-0.054
drawn-0.054
aid-0.053
 extra-0.052
Token her
Token ID607
Feature activation-2.48e-01

Pos logit contributions
icer+0.022
 scoring+0.021
 pretending+0.020
 ramp+0.020
 exploding+0.020
Neg logit contributions
up-0.010
drawn-0.010
ave-0.010
zzy-0.010
large-0.009
Token favorite
Token ID4004
Feature activation-9.34e-01

Pos logit contributions
up+0.006
drawn+0.006
aid+0.006
large+0.006
zzy+0.006
Neg logit contributions
icer-0.008
 scoring-0.007
 ramp-0.007
etti-0.007
bing-0.007
Token toy
Token ID13373
Feature activation-3.67e-03

Pos logit contributions
up+0.024
drawn+0.023
aid+0.022
zzy+0.022
large+0.022
Neg logit contributions
icer-0.029
 scoring-0.027
 ramp-0.026
bing-0.025
etti-0.025
Token Come
Token ID7911
Feature activation+9.33e-01

Pos logit contributions
icer+0.044
 exploding+0.038
bing+0.038
 scoring+0.038
 burying+0.037
Neg logit contributions
Of-0.035
up-0.032
First-0.032
drawn-0.032
large-0.032
Token on
Token ID319
Feature activation-5.71e-01

Pos logit contributions
icer+0.040
ocol+0.038
arter+0.037
 dent+0.037
opl+0.036
Neg logit contributions
 to-0.017
 off-0.014
 back-0.014
 very-0.013
 lots-0.012
Token,
Token ID11
Feature activation-3.03e-01

Pos logit contributions
up+0.014
 lots+0.013
 to+0.013
drawn+0.013
 off+0.012
Neg logit contributions
icer-0.018
 scoring-0.017
ocol-0.016
 ramp-0.016
 exploding-0.016
Token let
Token ID1309
Feature activation-1.30e+00

Pos logit contributions
up+0.004
 to+0.004
Of+0.004
 lots+0.004
 off+0.004
Neg logit contributions
icer-0.014
ocol-0.012
bing-0.012
arter-0.012
itched-0.012
Token's
Token ID338
Feature activation-3.53e+00

Pos logit contributions
up+0.031
drawn+0.031
aid+0.031
large+0.029
zzy+0.029
Neg logit contributions
icer-0.041
 scoring-0.037
 ramp-0.036
oc-0.036
etti-0.035
Token go
Token ID467
Feature activation-2.64e+00

Pos logit contributions
up+0.068
 lots+0.062
drawn+0.060
 off+0.059
 sunglasses+0.058
Neg logit contributions
icer-0.135
ocol-0.121
 scoring-0.120
eros-0.120
etti-0.119
Token!"
Token ID2474
Feature activation-2.11e+00

Pos logit contributions
up+0.062
drawn+0.058
aid+0.055
large+0.054
First+0.053
Neg logit contributions
icer-0.088
 scoring-0.082
 ramp-0.078
etti-0.078
 exploding-0.078
Token
Token ID198
Feature activation+9.27e-01

Pos logit contributions
up+0.052
drawn+0.050
aid+0.049
 lots+0.048
First+0.047
Neg logit contributions
icer-0.067
 scoring-0.062
 exploding-0.060
ocol-0.060
bing-0.060
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
icer+0.030
 scoring+0.028
etti+0.027
 ramp+0.027
 exploding+0.027
Neg logit contributions
up-0.023
drawn-0.021
aid-0.021
First-0.020
 lots-0.020
TokenHe
Token ID1544
Feature activation+1.03e+00

Pos logit contributions
icer+0.043
ocol+0.041
arter+0.040
etti+0.040
 burying+0.040
Neg logit contributions
up-0.018
First-0.016
aid-0.016
drawn-0.016
 lots-0.015
Token opened
Token ID4721
Feature activation+2.01e-01

Pos logit contributions
icer+0.034
 scoring+0.033
 ramp+0.033
etti+0.031
oc+0.031
Neg logit contributions
up-0.022
drawn-0.021
aid-0.020
large-0.020
First-0.020
Token there
Token ID612
Feature activation+5.69e-01

Pos logit contributions
icer+0.007
etti+0.007
ocol+0.007
 ramp+0.006
ebra+0.006
Neg logit contributions
 to-0.003
 very-0.003
 off-0.002
 many-0.002
 lots-0.002
Token."
Token ID526
Feature activation-2.38e+00

Pos logit contributions
icer+0.019
 scoring+0.018
 ramp+0.018
etti+0.018
ocol+0.018
Neg logit contributions
up-0.013
 lots-0.012
 to-0.011
drawn-0.011
 decided-0.011
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
up+0.054
drawn+0.052
First+0.051
 lots+0.051
 to+0.050
Neg logit contributions
icer-0.081
 ramp-0.073
 exploding-0.072
 scoring-0.072
etti-0.071
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
icer+0.044
 scoring+0.042
etti+0.041
 exploding+0.041
 ramp+0.041
Neg logit contributions
up-0.025
First-0.024
zzy-0.024
drawn-0.023
aid-0.023
TokenThey
Token ID2990
Feature activation-2.99e-01

Pos logit contributions
icer+0.047
 exploding+0.044
etti+0.044
 scoring+0.044
 ramp+0.043
Neg logit contributions
up-0.025
First-0.025
zzy-0.023
drawn-0.023
aid-0.023
Token looked
Token ID3114
Feature activation-2.35e+00

Pos logit contributions
up+0.007
drawn+0.006
large+0.006
aid+0.006
First+0.006
Neg logit contributions
icer-0.010
 scoring-0.010
 ramp-0.009
etti-0.009
 exploding-0.009
Token around
Token ID1088
Feature activation+2.62e+00

Pos logit contributions
up+0.055
aid+0.052
drawn+0.051
large+0.049
First+0.049
Neg logit contributions
icer-0.077
 scoring-0.073
 exploding-0.070
 ramp-0.070
 burying-0.070
Token the
Token ID262
Feature activation+1.05e-01

Pos logit contributions
icer+0.098
 ramp+0.088
etti+0.087
 scoring+0.086
oc+0.085
Neg logit contributions
 to-0.056
 lots-0.051
 many-0.049
drawn-0.048
 extra-0.048
Token slide
Token ID10649
Feature activation-9.02e-02

Pos logit contributions
icer+0.003
omp+0.003
etti+0.003
 scoring+0.003
bing+0.003
Neg logit contributions
up-0.002
 big-0.002
drawn-0.002
 prett-0.002
 sunglasses-0.002
Token and
Token ID290
Feature activation-1.94e-01

Pos logit contributions
up+0.002
 to+0.002
 lots+0.002
 extra+0.002
 so+0.002
Neg logit contributions
icer-0.003
ocol-0.003
 scoring-0.003
oc-0.003
arter-0.003
Token saw
Token ID2497
Feature activation-1.82e-01

Pos logit contributions
up+0.004
drawn+0.004
aid+0.004
large+0.003
First+0.003
Neg logit contributions
icer-0.007
 scoring-0.007
 ramp-0.006
 exploding-0.006
etti-0.006
Token happy
Token ID3772
Feature activation+3.99e-01

Pos logit contributions
 big+0.018
up+0.017
 pretty+0.016
 nice+0.015
 adventurous+0.015
Neg logit contributions
icer-0.091
etti-0.086
 scoring-0.085
bing-0.085
omp-0.084
Token little
Token ID1310
Feature activation+1.23e+00

Pos logit contributions
icer+0.016
 scoring+0.015
bing+0.015
etti+0.015
omp+0.015
Neg logit contributions
 big-0.006
up-0.006
 to-0.006
 news-0.006
 lots-0.005
Token girl
Token ID2576
Feature activation+2.30e+00

Pos logit contributions
icer+0.044
 scoring+0.042
etti+0.040
bing+0.040
 ramp+0.040
Neg logit contributions
up-0.024
drawn-0.023
aid-0.022
large-0.021
 lots-0.021
Token named
Token ID3706
Feature activation+2.31e+00

Pos logit contributions
icer+0.065
 scoring+0.059
 ramp+0.058
etti+0.056
 exploding+0.055
Neg logit contributions
up-0.066
drawn-0.063
aid-0.061
large-0.060
zzy-0.059
Token Sarah
Token ID10490
Feature activation+2.16e+00

Pos logit contributions
icer+0.036
etti+0.034
 You+0.033
 pretending+0.033
 scoring+0.033
Neg logit contributions
up-0.088
aid-0.088
drawn-0.086
large-0.083
zzy-0.081
Token.
Token ID13
Feature activation-7.22e-01

Pos logit contributions
icer+0.067
 scoring+0.063
 ramp+0.060
oc+0.060
etti+0.060
Neg logit contributions
up-0.054
drawn-0.051
aid-0.050
large-0.049
First-0.048
Token She
Token ID1375
Feature activation+1.77e+00

Pos logit contributions
up+0.019
drawn+0.017
aid+0.017
large+0.017
zzy+0.016
Neg logit contributions
icer-0.022
 scoring-0.020
 ramp-0.020
etti-0.020
 exploding-0.019
Token loved
Token ID6151
Feature activation+3.53e-01

Pos logit contributions
icer+0.064
 scoring+0.061
 ramp+0.059
 exploding+0.058
etti+0.058
Neg logit contributions
up-0.035
drawn-0.033
aid-0.032
large-0.031
First-0.031
Token going
Token ID1016
Feature activation-1.34e-01

Pos logit contributions
 scoring+0.014
icer+0.014
 sharing+0.014
 pretending+0.014
 burying+0.014
Neg logit contributions
up-0.005
zzy-0.005
ave-0.004
drawn-0.004
aid-0.004
Token for
Token ID329
Feature activation-1.21e+00

Pos logit contributions
up+0.002
aid+0.002
zzy+0.002
ave+0.002
drawn+0.002
Neg logit contributions
icer-0.005
 ramp-0.005
 scoring-0.005
 pretending-0.005
 burying-0.005
Token walks
Token ID11114
Feature activation-2.69e-02

Pos logit contributions
up+0.032
ave+0.031
aid+0.029
drawn+0.028
large+0.028
Neg logit contributions
 scoring-0.037
 pretending-0.036
icer-0.035
 ramp-0.034
 exploding-0.033
Token bed
Token ID3996
Feature activation+4.14e-01

Pos logit contributions
up+0.028
drawn+0.027
 lots+0.027
 extra+0.026
aid+0.025
Neg logit contributions
icer-0.043
oc-0.039
 ramp-0.039
 scoring-0.039
etti-0.038
Token and
Token ID290
Feature activation-6.71e-01

Pos logit contributions
icer+0.012
 scoring+0.011
 ramp+0.011
etti+0.010
 exploding+0.010
Neg logit contributions
up-0.011
drawn-0.011
aid-0.010
large-0.010
First-0.010
Token his
Token ID465
Feature activation+2.68e-02

Pos logit contributions
up+0.015
drawn+0.015
aid+0.014
large+0.014
First+0.014
Neg logit contributions
icer-0.021
 scoring-0.020
 ramp-0.020
 burying-0.019
 exploding-0.019
Token mom
Token ID1995
Feature activation-2.21e+00

Pos logit contributions
icer+0.001
 scoring+0.001
 ramp+0.001
etti+0.001
 exploding+0.001
Neg logit contributions
up-0.001
drawn-0.001
aid-0.001
First-0.001
large-0.001
Tokenmy
Token ID1820
Feature activation-7.58e-01

Pos logit contributions
up+0.061
drawn+0.060
large+0.057
aid+0.057
zzy+0.056
Neg logit contributions
icer-0.060
 scoring-0.055
 ramp-0.054
etti-0.053
oc-0.052
Token read
Token ID1100
Feature activation-1.78e+00

Pos logit contributions
up+0.014
drawn+0.014
large+0.013
aid+0.013
First+0.013
Neg logit contributions
icer-0.026
 scoring-0.025
 ramp-0.025
etti-0.024
oc-0.024
Token him
Token ID683
Feature activation-5.51e-01

Pos logit contributions
up+0.044
drawn+0.042
aid+0.040
large+0.039
zzy+0.039
Neg logit contributions
icer-0.056
 scoring-0.052
 ramp-0.051
etti-0.050
 exploding-0.050
Token a
Token ID257
Feature activation-2.84e+00

Pos logit contributions
up+0.010
aid+0.010
drawn+0.010
First+0.010
large+0.010
Neg logit contributions
 scoring-0.019
icer-0.018
 pretending-0.018
oc-0.018
bing-0.017
Token story
Token ID1621
Feature activation+2.98e-01

Pos logit contributions
up+0.084
drawn+0.079
ups+0.077
ave+0.077
aid+0.077
Neg logit contributions
icer-0.072
 ramp-0.069
 scoring-0.067
oc-0.067
 pretending-0.064
Token.
Token ID13
Feature activation-3.72e-01

Pos logit contributions
icer+0.009
 scoring+0.009
 ramp+0.009
etti+0.008
 exploding+0.008
Neg logit contributions
up-0.007
drawn-0.007
aid-0.007
large-0.006
First-0.006
Token Tim
Token ID5045
Feature activation+1.13e+00

Pos logit contributions
 lots+0.010
First+0.009
 to+0.009
drawn+0.009
 many+0.009
Neg logit contributions
icer-0.012
 scoring-0.011
ebra-0.011
ocol-0.011
 exploding-0.011
Token in
Token ID287
Feature activation-2.92e-01

Pos logit contributions
icer+0.016
 scoring+0.015
 ramp+0.015
etti+0.015
bing+0.015
Neg logit contributions
up-0.009
drawn-0.008
aid-0.008
large-0.008
First-0.008
Token the
Token ID262
Feature activation-2.04e+00

Pos logit contributions
 a+0.003
 to+0.003
 big+0.003
 lots+0.003
 an+0.003
Neg logit contributions
icer-0.014
arter-0.014
etti-0.014
oc-0.014
oke-0.013
Token park
Token ID3952
Feature activation+9.23e-01

Pos logit contributions
up+0.055
drawn+0.053
aid+0.052
large+0.050
zzy+0.050
Neg logit contributions
icer-0.059
 scoring-0.054
 ramp-0.053
etti-0.051
 exploding-0.051
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
icer+0.036
ocol+0.032
 ramp+0.032
etti+0.031
 dent+0.031
Neg logit contributions
 to-0.019
up-0.019
 lots-0.018
 many-0.017
 extra-0.017
Token he
Token ID339
Feature activation+3.98e-01

Pos logit contributions
up+0.028
drawn+0.027
 lots+0.025
large+0.025
aid+0.025
Neg logit contributions
icer-0.032
 ramp-0.029
 scoring-0.029
etti-0.029
ocol-0.028
Token saw
Token ID2497
Feature activation-5.10e-01

Pos logit contributions
icer+0.017
etti+0.016
ocol+0.016
 ramp+0.016
 scoring+0.016
Neg logit contributions
up-0.006
drawn-0.005
 lots-0.005
zzy-0.004
ave-0.004
Token a
Token ID257
Feature activation-2.98e+00

Pos logit contributions
ave+0.013
bs+0.013
talk+0.013
zzy+0.012
fy+0.012
Neg logit contributions
 burying-0.014
 sharing-0.013
 scoring-0.013
 shooting-0.013
 Tim-0.013
Token bright
Token ID6016
Feature activation-1.28e-01

Pos logit contributions
up+0.044
drawn+0.039
aid+0.037
large+0.036
 lots+0.034
Neg logit contributions
icer-0.123
 scoring-0.116
etti-0.113
 ramp-0.113
bing-0.112
Token yellow
Token ID7872
Feature activation+9.95e-01

Pos logit contributions
 big+0.002
 pretty+0.002
 bright+0.002
up+0.002
 very+0.002
Neg logit contributions
icer-0.005
 scoring-0.005
omp-0.005
bing-0.005
 defeating-0.005
Token notebook
Token ID20922
Feature activation-3.69e-03

Pos logit contributions
icer+0.035
 scoring+0.032
omp+0.032
bing+0.032
 exploding+0.031
Neg logit contributions
up-0.020
 sunglasses-0.019
drawn-0.019
 lots-0.019
 news-0.018
Token lying
Token ID9105
Feature activation+3.85e+00

Pos logit contributions
up+0.000
 to+0.000
drawn+0.000
 lots+0.000
aid+0.000
Neg logit contributions
icer-0.000
 scoring-0.000
omp-0.000
ocol-0.000
oc-0.000
Token playing
Token ID2712
Feature activation+2.28e+00

Pos logit contributions
icer+0.012
 scoring+0.011
 ramp+0.011
etti+0.011
 exploding+0.010
Neg logit contributions
up-0.014
drawn-0.013
aid-0.013
large-0.012
First-0.012
Token.
Token ID13
Feature activation-2.56e-01

Pos logit contributions
icer+0.076
 scoring+0.070
 ramp+0.068
etti+0.067
 exploding+0.067
Neg logit contributions
up-0.053
drawn-0.050
aid-0.048
 lots-0.046
large-0.046
Token<|endoftext|>
Token ID50256
Feature activation+2.00e+00

Pos logit contributions
up+0.006
drawn+0.006
First+0.006
 lots+0.006
large+0.006
Neg logit contributions
icer-0.008
 scoring-0.008
 ramp-0.007
bing-0.007
 exploding-0.007
TokenOnce
Token ID7454
Feature activation-4.14e-01

Pos logit contributions
 scoring+0.075
icer+0.073
ocol+0.072
bing+0.072
 exploding+0.072
Neg logit contributions
 to-0.033
 lots-0.033
up-0.033
zzy-0.032
First-0.032
Token upon
Token ID2402
Feature activation+8.79e-01

Pos logit contributions
up+0.012
drawn+0.011
zzy+0.011
large+0.011
 lots+0.011
Neg logit contributions
icer-0.011
 scoring-0.011
 ramp-0.010
etti-0.010
 exploding-0.010
Token a
Token ID257
Feature activation-5.19e+00

Pos logit contributions
icer+0.034
 scoring+0.033
 ramp+0.032
 pretending+0.032
 exploding+0.032
Neg logit contributions
up-0.014
drawn-0.012
ave-0.012
aid-0.011
large-0.011
Token time
Token ID640
Feature activation-8.88e-01

Pos logit contributions
up+0.127
drawn+0.121
aid+0.116
zzy+0.115
large+0.115
Neg logit contributions
icer-0.161
 scoring-0.152
 ramp-0.148
etti-0.146
 exploding-0.144
Token there
Token ID612
Feature activation+1.01e+00

Pos logit contributions
 to+0.017
 lots+0.016
 many+0.016
 big+0.015
up+0.015
Neg logit contributions
icer-0.033
 scoring-0.031
 ramp-0.031
 exploding-0.030
 dent-0.030
Token was
Token ID373
Feature activation-1.43e+00

Pos logit contributions
ocol+0.043
 scoring+0.043
icer+0.042
 ramp+0.042
etti+0.041
Neg logit contributions
 lots-0.014
 big-0.012
 decided-0.012
 to-0.012
 many-0.012
Token a
Token ID257
Feature activation-2.64e+00

Pos logit contributions
up+0.017
drawn+0.014
aid+0.013
large+0.012
ave+0.012
Neg logit contributions
icer-0.064
 scoring-0.061
 ramp-0.059
etti-0.059
 exploding-0.059
Token happy
Token ID3772
Feature activation-3.49e-02

Pos logit contributions
up+0.025
 big+0.024
drawn+0.022
 pretty+0.021
aid+0.021
Neg logit contributions
icer-0.120
etti-0.114
 scoring-0.113
bing-0.111
omp-0.111
Token little
Token ID1310
Feature activation+1.22e+00

Pos logit contributions
icer+0.033
 scoring+0.030
 ramp+0.030
etti+0.029
 exploding+0.029
Neg logit contributions
up-0.022
drawn-0.021
aid-0.020
large-0.020
First-0.019
Token brother
Token ID3956
Feature activation-4.33e-01

Pos logit contributions
icer+0.032
 scoring+0.030
 ramp+0.029
etti+0.028
 exploding+0.028
Neg logit contributions
up-0.036
drawn-0.034
aid-0.033
large-0.033
First-0.032
Token had
Token ID550
Feature activation-2.81e-01

Pos logit contributions
 listened+0.007
 decided+0.007
 to+0.006
up+0.006
 very+0.006
Neg logit contributions
icer-0.020
ocol-0.018
 ramp-0.017
etti-0.016
bing-0.016
Token knocked
Token ID13642
Feature activation-3.24e+00

Pos logit contributions
up+0.006
drawn+0.006
aid+0.006
large+0.006
First+0.006
Neg logit contributions
icer-0.009
 scoring-0.009
 exploding-0.008
etti-0.008
 ramp-0.008
Token over
Token ID625
Feature activation-2.41e+00

Pos logit contributions
up+0.070
drawn+0.065
large+0.063
aid+0.062
zzy+0.062
Neg logit contributions
icer-0.114
 scoring-0.107
bing-0.101
 exploding-0.100
 ramp-0.099
Token some
Token ID617
Feature activation-5.81e+00

Pos logit contributions
up+0.041
drawn+0.038
aid+0.036
large+0.035
First+0.034
Neg logit contributions
icer-0.095
 scoring-0.089
 ramp-0.087
etti-0.086
 exploding-0.086
Token books
Token ID3835
Feature activation-2.83e+00

Pos logit contributions
up+0.142
drawn+0.141
zzy+0.137
aid+0.132
First+0.132
Neg logit contributions
icer-0.178
 ramp-0.165
 scoring-0.165
 exploding-0.159
oc-0.157
Token.
Token ID13
Feature activation-2.21e-01

Pos logit contributions
up+0.066
drawn+0.062
 lots+0.059
aid+0.058
large+0.058
Neg logit contributions
icer-0.094
 scoring-0.087
 ramp-0.083
ocol-0.082
etti-0.082
Token Lily
Token ID20037
Feature activation-6.14e-01

Pos logit contributions
up+0.007
drawn+0.006
aid+0.006
large+0.006
First+0.006
Neg logit contributions
icer-0.006
 scoring-0.005
 ramp-0.005
etti-0.005
 exploding-0.005
Token's
Token ID338
Feature activation-9.59e-01

Pos logit contributions
up+0.012
drawn+0.011
ave+0.010
 lots+0.010
aid+0.010
Neg logit contributions
icer-0.023
 scoring-0.021
 ramp-0.021
ocol-0.021
etti-0.020
Token mom
Token ID1995
Feature activation-2.47e+00

Pos logit contributions
up+0.028
drawn+0.027
aid+0.027
First+0.026
large+0.026
Neg logit contributions
icer-0.026
 scoring-0.024
 ramp-0.024
 exploding-0.022
 pretending-0.022
Token time
Token ID640
Feature activation+1.18e+00

Pos logit contributions
icer+0.036
etti+0.034
 scoring+0.033
 ramp+0.032
bing+0.032
Neg logit contributions
up-0.021
drawn-0.020
First-0.019
large-0.018
 lots-0.018
Token.
Token ID13
Feature activation-2.91e-01

Pos logit contributions
icer+0.053
oc+0.046
 ramp+0.045
arter+0.045
 scoring+0.045
Neg logit contributions
 to-0.020
up-0.019
 lots-0.017
 many-0.016
 off-0.015
Token<|endoftext|>
Token ID50256
Feature activation+2.18e+00

Pos logit contributions
 lots+0.005
First+0.005
 many+0.004
 to+0.004
Of+0.004
Neg logit contributions
icer-0.011
bing-0.011
 scoring-0.011
 exploding-0.011
ocol-0.011
TokenOnce
Token ID7454
Feature activation-1.01e-01

Pos logit contributions
 scoring+0.084
ocol+0.083
arter+0.082
bing+0.081
icer+0.081
Neg logit contributions
 lots-0.034
 to-0.034
 many-0.033
zzy-0.033
up-0.032
Token upon
Token ID2402
Feature activation+1.21e+00

Pos logit contributions
up+0.003
drawn+0.003
 lots+0.003
zzy+0.003
 many+0.003
Neg logit contributions
icer-0.003
 scoring-0.003
etti-0.003
 ramp-0.003
 exploding-0.003
Token a
Token ID257
Feature activation-5.25e+00

Pos logit contributions
icer+0.050
 scoring+0.048
 ramp+0.046
 exploding+0.046
 pretending+0.046
Neg logit contributions
up-0.018
drawn-0.016
aid-0.014
ave-0.014
large-0.014
Token time
Token ID640
Feature activation-7.35e-01

Pos logit contributions
up+0.122
drawn+0.117
zzy+0.113
large+0.111
aid+0.111
Neg logit contributions
icer-0.167
 scoring-0.159
 ramp-0.155
etti-0.155
bing-0.152
Token,
Token ID11
Feature activation-1.99e+00

Pos logit contributions
 to+0.013
 lots+0.012
 many+0.012
 big+0.011
 very+0.011
Neg logit contributions
icer-0.029
 scoring-0.028
 ramp-0.027
 exploding-0.027
etti-0.027
Token there
Token ID612
Feature activation+1.46e+00

Pos logit contributions
up+0.032
 lots+0.031
drawn+0.031
 many+0.030
zzy+0.030
Neg logit contributions
icer-0.075
 scoring-0.073
 ramp-0.071
etti-0.071
 exploding-0.069
Token was
Token ID373
Feature activation-7.95e-01

Pos logit contributions
ocol+0.063
 scoring+0.061
etti+0.061
icer+0.060
 ramp+0.060
Neg logit contributions
 lots-0.020
 to-0.019
 big-0.019
 many-0.018
 very-0.018
Token a
Token ID257
Feature activation-2.12e+00

Pos logit contributions
up+0.009
drawn+0.007
aid+0.007
large+0.006
ave+0.006
Neg logit contributions
icer-0.036
 scoring-0.035
 ramp-0.034
etti-0.034
 exploding-0.033
Token sad
Token ID6507
Feature activation+1.91e+00

Pos logit contributions
icer+0.018
 scoring+0.017
 ramp+0.016
etti+0.016
oc+0.016
Neg logit contributions
up-0.014
drawn-0.014
large-0.013
aid-0.013
First-0.013
Token.
Token ID13
Feature activation+7.70e-01

Pos logit contributions
icer+0.052
 scoring+0.049
 ramp+0.047
 exploding+0.046
bing+0.046
Neg logit contributions
up-0.053
drawn-0.052
aid-0.049
large-0.049
First-0.049
Token<|endoftext|>
Token ID50256
Feature activation+1.37e+00

Pos logit contributions
icer+0.028
 exploding+0.026
bing+0.026
 ramp+0.025
 scoring+0.025
Neg logit contributions
 to-0.015
 lots-0.015
 many-0.014
 so-0.014
First-0.014
TokenOnce
Token ID7454
Feature activation-3.66e-02

Pos logit contributions
 scoring+0.050
icer+0.049
etti+0.049
bing+0.048
 exploding+0.048
Neg logit contributions
 to-0.023
up-0.023
 lots-0.023
 many-0.022
zzy-0.022
Token upon
Token ID2402
Feature activation+1.08e+00

Pos logit contributions
up+0.001
 many+0.001
 lots+0.001
drawn+0.001
 to+0.001
Neg logit contributions
icer-0.001
 scoring-0.001
etti-0.001
 ramp-0.001
oc-0.001
Token a
Token ID257
Feature activation-5.28e+00

Pos logit contributions
icer+0.041
 scoring+0.040
 pretending+0.039
 ramp+0.038
 exploding+0.038
Neg logit contributions
up-0.018
drawn-0.016
ave-0.015
ups-0.015
large-0.015
Token time
Token ID640
Feature activation-6.54e-01

Pos logit contributions
up+0.124
drawn+0.117
zzy+0.112
aid+0.111
large+0.111
Neg logit contributions
icer-0.167
 scoring-0.160
etti-0.157
 ramp-0.155
 exploding-0.153
Token,
Token ID11
Feature activation-2.12e+00

Pos logit contributions
 to+0.012
 many+0.011
 lots+0.011
 big+0.011
 very+0.010
Neg logit contributions
icer-0.025
 scoring-0.024
 ramp-0.024
oc-0.023
 exploding-0.023
Token in
Token ID287
Feature activation+1.02e+00

Pos logit contributions
up+0.038
drawn+0.036
 lots+0.034
aid+0.034
zzy+0.034
Neg logit contributions
icer-0.078
 scoring-0.074
 ramp-0.073
etti-0.072
oc-0.072
Token a
Token ID257
Feature activation-3.88e+00

Pos logit contributions
icer+0.051
 scoring+0.049
 ramp+0.048
etti+0.048
bing+0.048
Neg logit contributions
up-0.005
drawn-0.004
 lots-0.004
aid-0.004
large-0.004
Token small
Token ID1402
Feature activation-1.70e+00

Pos logit contributions
up+0.027
drawn+0.023
 big+0.022
 many+0.019
 lots+0.018
Neg logit contributions
icer-0.184
etti-0.181
 scoring-0.180
 ramp-0.175
 exploding-0.175
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
icer+0.029
 scoring+0.026
 ramp+0.026
etti+0.026
 exploding+0.025
Neg logit contributions
up-0.026
drawn-0.024
aid-0.023
large-0.023
First-0.023
Token I
Token ID314
Feature activation-2.22e+00

Pos logit contributions
icer+0.030
 scoring+0.029
 exploding+0.028
bing+0.027
 ramp+0.026
Neg logit contributions
up-0.026
Of-0.026
First-0.026
drawn-0.026
aid-0.025
Token won
Token ID1839
Feature activation-1.28e+00

Pos logit contributions
up+0.045
drawn+0.042
aid+0.040
zzy+0.039
large+0.039
Neg logit contributions
icer-0.081
 scoring-0.076
 ramp-0.074
etti-0.073
 exploding-0.072
Tokenâ
Token ID22940
Feature activation+5.77e-01

Pos logit contributions
up+0.042
drawn+0.040
 lots+0.039
 to+0.039
 off+0.038
Neg logit contributions
icer-0.031
 ramp-0.026
 scoring-0.026
etti-0.026
ocol-0.025
Token
Token ID26391
Feature activation+3.37e-02

Pos logit contributions
icer+0.021
 scoring+0.019
 ramp+0.019
etti+0.019
 exploding+0.019
Neg logit contributions
up-0.012
drawn-0.011
aid-0.011
large-0.010
zzy-0.010
Token
Token ID8151
Feature activation-6.66e+00

Pos logit contributions
icer+0.001
ocol+0.001
 scoring+0.001
 ramp+0.001
 exploding+0.001
Neg logit contributions
up-0.001
 lots-0.001
 to-0.001
drawn-0.001
First-0.001
Tokent
Token ID83
Feature activation-5.03e+00

Pos logit contributions
up+0.136
drawn+0.125
aid+0.120
large+0.119
First+0.116
Neg logit contributions
icer-0.237
 scoring-0.224
 ramp-0.220
etti-0.216
 exploding-0.215
Token hurt
Token ID5938
Feature activation+1.75e-01

Pos logit contributions
drawn+0.120
up+0.119
large+0.112
aid+0.111
First+0.110
Neg logit contributions
 scoring-0.150
icer-0.143
 ramp-0.142
 exploding-0.141
oc-0.140
Token you
Token ID345
Feature activation-1.04e+00

Pos logit contributions
icer+0.005
etti+0.005
 ramp+0.005
 scoring+0.005
 exploding+0.005
Neg logit contributions
up-0.005
drawn-0.004
 lots-0.004
aid-0.004
large-0.004
Token.
Token ID13
Feature activation+9.49e-01

Pos logit contributions
up+0.025
drawn+0.023
aid+0.022
large+0.022
First+0.021
Neg logit contributions
icer-0.034
 scoring-0.031
 ramp-0.031
etti-0.030
 exploding-0.030
Tokenâ
Token ID22940
Feature activation+1.60e+00

Pos logit contributions
icer+0.026
 scoring+0.025
 exploding+0.025
bing+0.024
 ramp+0.023
Neg logit contributions
Of-0.027
First-0.026
up-0.026
drawn-0.025
large-0.025