TOP ACTIVATIONS
MAX = 0.213

baseball returned . Hours later , the sun had
alone . Hours passed , but nobody came
do . Hours passed and Lily 's parents
to work . Hours later , the prince and
wrong . Hours passed and the animal looked
down and fell asleep . Hours later it woke up and
appear from the clay . Hours passed , and eventually the
the door would open . Hours passed , but no -
it was smooth again . Hours and hours went by ,
I 'm sorry , Your Majesty . I won 't do
the king , " Your Majesty , it has been too
, " Yes , Your Majesty . I promise I 'll
everyone called him " Your Majesty ". The king loved to
the village back to its former glory !"
, leaving it to its former state .
the tree to its once former beauty . He was happy
back the village to its former state . With the help
to be restored to its former glory ."
to sort the oranges . First it ate the ripe oranges
robot got to work . First , they found a big

MIN ACTIVATIONS
MIN = -0.170

told Jim that the zo oke ep ers had split the
the bear and the zo oke ep ers and went back
. It was the zo oke eper . He had come
Soon , a zo oke eper came and unlocked the
have met a famous zo oke eper . She had learned
Suddenly , the zo oke eper noticed . He shouted
other animals . The zo oke eper was worried that the
Soon , a zo oke eper came with a big
and dad heard the zo oke eper and they were very
the cage . The zo oke eper closed the door and
. She thanked the zo oke eper for his advice and
. He thanked the zo oke eper for taking care of
. The zo oke eper picked up Lily and
he knew that the zo oke ep ers were just trying
. The zo oke ep ers arrived and threw
said that sometimes the zo oke ep ers have to order
. He hoped the zo oke eper would say yes .
of him . The zo oke eper rushed in and warned
. Ben asked the zo oke eper if he could touch
but only if the zo oke eper said it was okay

INTERVAL 0.118 - 0.213
CONTAINS 0.663%

the man had said and quickly ran away . She was
are lively and happy . One day , they go to
her flock . As they were walking , they
come back to play again soon ." And from that day
. He looks strange . Maybe he is a traveler .

INTERVAL 0.022 - 0.118
CONTAINS 27.157%

tw ig . He wished he had shared it with Sam
at the trees . His wife always stayed at home and
. The girl searched and searched , until she
Mom says , " Thank you , Ben and
time had passed . He went to bed feeling proud of

INTERVAL -0.074 - 0.022
CONTAINS 69.335%

was very fashionable . When she got to
exploring the new area , thrilled to discover all sorts of
and the fly out of the house . The dog and
, there was a man called Nick . He was a
out and kissed Molly 's forehead . " Good bye Molly

INTERVAL -0.170 - -0.074
CONTAINS 2.845%

of special things to take on a journey .
outside , and its leaves were moving in the wind .
. She loved to play with her dolls and t eddy
field of flowers . " Wow !" she said . She
Lily her shapes . They were brown and hard and smelled
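
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX, and the CONTAINS percentages appear to partition all sampled tokens. The sketch below checks that reading. It is an assumption inferred from the printed numbers, not a description of how the dashboard was actually generated, and the helper name `equal_width_edges` is hypothetical.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins spanning [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed above (already rounded to three decimals).
edges = equal_width_edges(-0.170, 0.213, 4)

# Bin edges as printed in the INTERVAL headers, lowest first.
displayed_edges = [-0.170, -0.074, 0.022, 0.118, 0.213]

# CONTAINS percentages, listed in the same lowest-first order.
shares = [2.845, 69.335, 27.157, 0.663]

# Largest gap between the computed and the printed edges.
max_gap = max(abs(a - b) for a, b in zip(edges, displayed_edges))

for low, high, share in zip(edges[:-1], edges[1:], shares):
    print(f"{low:+.3f} to {high:+.3f}: {share:.3f}% of tokens")
```

Under this assumption the computed edges match the printed ones to within about 0.001, which is the slack that rounding MIN and MAX to three decimals would allow, and the four shares add up to 100%. Roughly 97% of tokens fall in the two middle bins, so the sequences sampled from the top interval are rare cases.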
Token baseball
Token ID9283
Feature activation-6.33e-02

Pos logit contributions
 making+0.011
 poking+0.010
:+0.010
's+0.010
 drawing+0.010
Neg logit contributions
des-0.022
use-0.021
 former-0.021
powers-0.019
ited-0.019
Token returned
Token ID4504
Feature activation+9.69e-02

Pos logit contributions
 making+0.055
 poking+0.047
 drawing+0.045
:+0.045
's+0.044
Neg logit contributions
des-0.163
use-0.160
 former-0.155
powers-0.145
ited-0.144
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
des+0.303
use+0.296
 former+0.291
ited+0.282
God+0.280
Neg logit contributions
 making-0.043
:-0.035
 in-0.032
's-0.032
 on-0.030
Token
Token ID198
Feature activation+1.86e-02

Pos logit contributions
 making+0.025
 poking+0.022
:+0.022
's+0.022
 drawing+0.022
Neg logit contributions
des-0.052
use-0.051
 former-0.050
powers-0.046
ited-0.046
Token
Token ID198
Feature activation+1.82e-02

Pos logit contributions
des+0.043
use+0.041
 former+0.039
powers+0.038
God+0.037
Neg logit contributions
 making-0.022
 poking-0.019
 drawing-0.019
 fighting-0.018
 video-0.018
TokenHours
Token ID39792
Feature activation+2.13e-01

Pos logit contributions
des+0.033
use+0.033
 former+0.031
powers+0.028
ited+0.028
Neg logit contributions
 making-0.031
's-0.029
:-0.029
 poking-0.028
 video-0.028
Token later
Token ID1568
Feature activation+8.95e-02

Pos logit contributions
des+0.345
use+0.337
ited+0.290
 former+0.281
irst+0.279
Neg logit contributions
 making-0.371
 video-0.359
 poking-0.356
 drawing-0.348
's-0.337
Token,
Token ID11
Feature activation-4.67e-02

Pos logit contributions
des+0.225
use+0.223
 former+0.215
 anytime+0.203
God+0.202
Neg logit contributions
 making-0.056
 poking-0.055
 video-0.054
 drawing-0.051
 spraying-0.049
Token the
Token ID262
Feature activation-9.04e-03

Pos logit contributions
 making+0.028
:+0.025
 poking+0.024
's+0.024
 video+0.023
Neg logit contributions
des-0.129
 former-0.125
use-0.125
 greater-0.116
God-0.116
Token sun
Token ID4252
Feature activation-3.61e-02

Pos logit contributions
 making+0.013
 poking+0.012
:+0.012
 drawing+0.012
's+0.011
Neg logit contributions
des-0.017
use-0.016
 former-0.016
powers-0.015
 greater-0.015
Token had
Token ID550
Feature activation-6.19e-02

Pos logit contributions
 making+0.041
 drawing+0.041
 poking+0.040
 video+0.039
 sharing+0.039
Neg logit contributions
des-0.068
use-0.066
ited-0.066
 former-0.063
God-0.060
Token alone
Token ID3436
Feature activation+1.23e-02

Pos logit contributions
 making+0.041
's+0.037
 poking+0.037
 drawing+0.036
 playing+0.036
Neg logit contributions
use-0.055
des-0.054
 former-0.052
powers-0.047
God-0.047
Token.
Token ID13
Feature activation-1.89e-02

Pos logit contributions
des+0.036
use+0.035
 former+0.034
powers+0.033
ited+0.033
Neg logit contributions
 making-0.007
:-0.006
's-0.005
 poking-0.005
 drawing-0.005
Token
Token ID220
Feature activation+2.87e-02

Pos logit contributions
 making+0.020
 poking+0.018
 drawing+0.018
:+0.017
's+0.017
Neg logit contributions
des-0.046
use-0.044
 former-0.043
powers-0.040
ited-0.040
Token
Token ID198
Feature activation+1.94e-02

Pos logit contributions
des+0.068
use+0.066
 former+0.064
powers+0.060
God+0.059
Neg logit contributions
 making-0.031
 poking-0.028
 drawing-0.027
:-0.027
's-0.027
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
des+0.052
use+0.052
 former+0.051
 unnecessary+0.048
ited+0.048
Neg logit contributions
 making-0.015
:-0.014
's-0.014
 was-0.013
 poking-0.012
TokenHours
Token ID39792
Feature activation+2.13e-01

Pos logit contributions
use+0.032
des+0.032
 former+0.030
 unnecessary+0.027
irst+0.027
Neg logit contributions
 making-0.035
's-0.034
 was-0.032
:-0.032
 video-0.032
Token passed
Token ID3804
Feature activation+4.43e-02

Pos logit contributions
des+0.302
use+0.293
ited+0.244
 former+0.236
irst+0.232
Neg logit contributions
 making-0.413
 video-0.405
 poking-0.394
 drawing-0.389
 spraying-0.380
Token,
Token ID11
Feature activation-4.52e-02

Pos logit contributions
des+0.135
use+0.131
 former+0.130
powers+0.122
 reins+0.121
Neg logit contributions
 making-0.023
's-0.021
:-0.020
 in-0.019
 was-0.019
Token but
Token ID475
Feature activation-1.57e-02

Pos logit contributions
 making+0.039
:+0.035
 poking+0.035
 drawing+0.035
's+0.035
Neg logit contributions
des-0.114
use-0.111
 former-0.109
God-0.101
 greater-0.101
Token nobody
Token ID8168
Feature activation+1.04e-01

Pos logit contributions
 making+0.017
 video+0.017
 poking+0.016
 drawing+0.016
:+0.016
Neg logit contributions
des-0.035
use-0.034
 former-0.034
 greater-0.031
minded-0.031
Token came
Token ID1625
Feature activation+1.48e-02

Pos logit contributions
des+0.259
use+0.252
 former+0.244
powers+0.229
ited+0.228
Neg logit contributions
 making-0.105
 poking-0.091
 drawing-0.089
:-0.089
 video-0.088
Token do
Token ID466
Feature activation-1.11e-01

Pos logit contributions
use+0.060
des+0.058
away+0.050
yes+0.049
 someday+0.049
Neg logit contributions
 spraying-0.028
 making-0.028
 drawing-0.027
 shooting-0.026
 poking-0.026
Token.
Token ID13
Feature activation-2.64e-02

Pos logit contributions
 making+0.082
 poking+0.070
 video+0.067
 drawing+0.066
's+0.064
Neg logit contributions
des-0.300
use-0.294
 former-0.283
powers-0.267
minded-0.265
Token
Token ID220
Feature activation+2.54e-02

Pos logit contributions
 making+0.025
:+0.024
's+0.023
 poking+0.021
 drawing+0.021
Neg logit contributions
des-0.068
use-0.066
 former-0.064
ited-0.061
 reins-0.061
Token
Token ID198
Feature activation+1.60e-02

Pos logit contributions
des+0.063
use+0.061
 former+0.060
ited+0.055
powers+0.055
Neg logit contributions
 making-0.026
:-0.023
 poking-0.023
's-0.022
 drawing-0.022
Token
Token ID198
Feature activation+1.59e-02

Pos logit contributions
des+0.044
use+0.044
 former+0.043
ited+0.041
 unnecessary+0.040
Neg logit contributions
:-0.011
 making-0.011
's-0.010
 was-0.010
 poking-0.009
TokenHours
Token ID39792
Feature activation+2.12e-01

Pos logit contributions
use+0.029
des+0.028
 former+0.027
 reins+0.025
 unnecessary+0.025
Neg logit contributions
:-0.025
 making-0.025
's-0.025
 was-0.024
 poking-0.024
Token passed
Token ID3804
Feature activation+4.11e-02

Pos logit contributions
des+0.338
use+0.337
ited+0.289
 former+0.287
irst+0.281
Neg logit contributions
 making-0.364
 video-0.358
 poking-0.342
 drawing-0.341
's-0.337
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
des+0.132
use+0.126
 former+0.124
 reins+0.119
powers+0.119
Neg logit contributions
 making-0.017
 in-0.016
:-0.016
's-0.015
 was-0.015
Token Lily
Token ID20037
Feature activation-7.34e-03

Pos logit contributions
des+0.035
use+0.034
 former+0.033
powers+0.031
ited+0.031
Neg logit contributions
 making-0.012
 poking-0.010
:-0.010
's-0.010
 drawing-0.010
Token's
Token ID338
Feature activation-4.20e-02

Pos logit contributions
 making+0.005
 poking+0.004
 video+0.004
 drawing+0.004
:+0.004
Neg logit contributions
des-0.020
use-0.020
 former-0.019
ited-0.018
powers-0.018
Token parents
Token ID3397
Feature activation+4.93e-02

Pos logit contributions
 making+0.052
 poking+0.046
:+0.046
's+0.046
 drawing+0.045
Neg logit contributions
des-0.094
use-0.091
 former-0.089
powers-0.083
ited-0.081
Token to
Token ID284
Feature activation+3.00e-02

Pos logit contributions
des+0.134
 former+0.133
use+0.128
 greater+0.120
minded+0.120
Neg logit contributions
's-0.085
 video-0.085
:-0.080
 making-0.079
 poking-0.077
Token work
Token ID670
Feature activation+3.15e-02

Pos logit contributions
des+0.066
use+0.065
 former+0.064
God+0.057
powers+0.057
Neg logit contributions
 making-0.036
:-0.034
 poking-0.034
's-0.033
 video-0.032
Token.
Token ID13
Feature activation-2.26e-02

Pos logit contributions
des+0.075
use+0.073
 former+0.073
 greater+0.069
powers+0.068
Neg logit contributions
 making-0.028
 poking-0.027
's-0.027
 video-0.026
 drawing-0.025
Token
Token ID198
Feature activation+1.58e-02

Pos logit contributions
 making+0.028
 poking+0.025
 drawing+0.025
's+0.025
:+0.025
Neg logit contributions
des-0.050
use-0.049
 former-0.048
powers-0.044
God-0.043
Token
Token ID198
Feature activation+1.54e-02

Pos logit contributions
des+0.040
use+0.040
 former+0.039
ited+0.036
powers+0.036
Neg logit contributions
 making-0.015
:-0.013
's-0.013
 poking-0.013
 drawing-0.012
TokenHours
Token ID39792
Feature activation+2.12e-01

Pos logit contributions
des+0.023
use+0.022
 former+0.021
 unnecessary+0.018
powers+0.018
Neg logit contributions
 making-0.032
's-0.030
:-0.029
 poking-0.029
 video-0.029
Token later
Token ID1568
Feature activation+8.72e-02

Pos logit contributions
use+0.346
des+0.339
ited+0.288
minded+0.286
 former+0.284
Neg logit contributions
 video-0.343
 making-0.333
 poking-0.323
 drawing-0.318
's-0.318
Token,
Token ID11
Feature activation-4.84e-02

Pos logit contributions
des+0.218
use+0.217
 former+0.213
powers+0.200
God+0.199
Neg logit contributions
 making-0.053
 poking-0.052
 video-0.050
 drawing-0.047
 spraying-0.044
Token the
Token ID262
Feature activation-8.80e-03

Pos logit contributions
 making+0.029
 poking+0.024
:+0.024
's+0.023
 drawing+0.023
Neg logit contributions
des-0.136
use-0.133
 former-0.132
powers-0.122
God-0.122
Token prince
Token ID19716
Feature activation+6.05e-03

Pos logit contributions
 making+0.016
 drawing+0.015
's+0.015
 doing+0.014
 in+0.014
Neg logit contributions
des-0.016
use-0.015
 former-0.014
rene-0.013
 unnecessary-0.013
Token and
Token ID290
Feature activation+1.39e-02

Pos logit contributions
des+0.015
use+0.015
 former+0.014
ited+0.013
powers+0.013
Neg logit contributions
 making-0.006
 poking-0.005
 video-0.005
 drawing-0.005
:-0.005
Token wrong
Token ID2642
Feature activation-2.39e-02

Pos logit contributions
 making+0.067
 poking+0.065
 video+0.063
 drawing+0.060
 spraying+0.058
Neg logit contributions
des-0.205
use-0.197
 former-0.192
minded-0.180
 greater-0.178
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
 making+0.017
 poking+0.014
's+0.013
:+0.013
 drawing+0.013
Neg logit contributions
des-0.066
use-0.065
 former-0.063
powers-0.060
ited-0.059
Token
Token ID220
Feature activation+2.95e-02

Pos logit contributions
 making+0.019
:+0.019
's+0.018
 Sam+0.018
 was+0.017
Neg logit contributions
des-0.058
use-0.057
ited-0.053
 former-0.052
 unnecessary-0.051
Token
Token ID198
Feature activation+1.91e-02

Pos logit contributions
des+0.075
use+0.074
 former+0.071
ited+0.067
powers+0.066
Neg logit contributions
 making-0.028
:-0.025
's-0.025
 poking-0.024
 drawing-0.023
Token
Token ID198
Feature activation+1.89e-02

Pos logit contributions
des+0.055
use+0.054
 former+0.053
ited+0.050
 unnecessary+0.050
Neg logit contributions
:-0.012
 making-0.012
's-0.011
 was-0.011
 having-0.010
TokenHours
Token ID39792
Feature activation+2.11e-01

Pos logit contributions
use+0.038
des+0.037
 former+0.036
 unnecessary+0.033
ited+0.033
Neg logit contributions
's-0.027
 making-0.027
:-0.026
 was-0.026
 had-0.025
Token passed
Token ID3804
Feature activation+4.26e-02

Pos logit contributions
des+0.371
use+0.360
 former+0.326
ited+0.309
God+0.305
Neg logit contributions
 making-0.348
 video-0.331
 poking-0.328
 drawing-0.324
:-0.316
Token and
Token ID290
Feature activation+1.49e-02

Pos logit contributions
des+0.128
use+0.123
 former+0.120
powers+0.114
 reins+0.114
Neg logit contributions
 making-0.025
's-0.022
:-0.021
 was-0.020
 in-0.020
Token the
Token ID262
Feature activation-5.40e-03

Pos logit contributions
des+0.040
use+0.039
 former+0.038
powers+0.036
ited+0.036
Neg logit contributions
 making-0.012
 poking-0.010
:-0.010
's-0.009
 drawing-0.009
Token animal
Token ID5044
Feature activation-4.76e-02

Pos logit contributions
 making+0.006
 poking+0.006
:+0.006
's+0.006
 drawing+0.006
Neg logit contributions
 former-0.012
des-0.012
use-0.011
powers-0.010
God-0.010
Token looked
Token ID3114
Feature activation-1.21e-02

Pos logit contributions
 making+0.026
 poking+0.020
 drawing+0.019
:+0.019
 video+0.019
Neg logit contributions
des-0.138
use-0.135
 former-0.132
powers-0.125
ited-0.124
Token down
Token ID866
Feature activation+3.50e-02

Pos logit contributions
des+0.176
use+0.171
 former+0.166
powers+0.158
God+0.153
Neg logit contributions
 making-0.084
 poking-0.073
:-0.073
's-0.071
 in-0.071
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
des+0.111
use+0.110
 former+0.108
powers+0.105
 unnecessary+0.104
Neg logit contributions
 making-0.018
 on-0.016
 in-0.015
:-0.014
,-0.012
Token fell
Token ID3214
Feature activation+2.32e-02

Pos logit contributions
des+0.031
use+0.031
 former+0.030
powers+0.029
 unnecessary+0.028
Neg logit contributions
 making-0.014
's-0.012
 poking-0.012
 drawing-0.011
 was-0.011
Token asleep
Token ID16039
Feature activation+9.21e-02

Pos logit contributions
des+0.052
use+0.051
 former+0.049
ited+0.045
powers+0.045
Neg logit contributions
 making-0.030
 poking-0.027
:-0.026
's-0.026
 drawing-0.025
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
des+0.255
use+0.249
 former+0.244
powers+0.228
ited+0.226
Neg logit contributions
 making-0.063
 poking-0.052
's-0.051
:-0.051
 video-0.050
Token Hours
Token ID19347
Feature activation+2.11e-01

Pos logit contributions
 making+0.019
:+0.016
's+0.016
 drawing+0.015
 Sam+0.014
Neg logit contributions
use-0.066
des-0.065
 former-0.060
ited-0.060
 unnecessary-0.059
Token later
Token ID1568
Feature activation+8.58e-02

Pos logit contributions
des+0.443
use+0.430
 former+0.415
powers+0.381
ited+0.378
Neg logit contributions
 making-0.293
 poking-0.266
:-0.263
's-0.262
 drawing-0.260
Token it
Token ID340
Feature activation-1.25e-02

Pos logit contributions
des+0.254
use+0.248
 former+0.242
powers+0.228
God+0.227
Neg logit contributions
 making-0.040
 poking-0.031
 video-0.030
 drawing-0.029
's-0.028
Token woke
Token ID19092
Feature activation-6.50e-04

Pos logit contributions
's+0.007
 was+0.007
 making+0.006
 had+0.006
:+0.005
Neg logit contributions
des-0.037
 former-0.037
use-0.037
 reins-0.035
 unnecessary-0.035
Token up
Token ID510
Feature activation+2.60e-02

Pos logit contributions
 making+0.000
's+0.000
:+0.000
 poking+0.000
 drawing+0.000
Neg logit contributions
des-0.002
use-0.002
 former-0.002
powers-0.002
ited-0.002
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
des+0.079
use+0.077
 former+0.074
powers+0.074
 reins+0.072
Neg logit contributions
 making-0.016
:-0.012
 in-0.012
 on-0.011
 poking-0.011
Token appear
Token ID1656
Feature activation-6.38e-02

Pos logit contributions
 making+0.009
 poking+0.008
:+0.008
 video+0.008
 drawing+0.007
Neg logit contributions
des-0.034
use-0.033
 former-0.032
powers-0.030
ited-0.030
Token from
Token ID422
Feature activation-6.89e-02

Pos logit contributions
 making+0.043
:+0.037
's+0.035
 poking+0.033
 in+0.031
Neg logit contributions
des-0.184
use-0.179
 former-0.175
powers-0.166
ited-0.165
Token the
Token ID262
Feature activation+2.38e-03

Pos logit contributions
 making+0.058
 drawing+0.045
 poking+0.045
:+0.044
 playing+0.044
Neg logit contributions
des-0.196
use-0.187
 former-0.180
powers-0.176
ited-0.175
Token clay
Token ID21558
Feature activation-4.65e-02

Pos logit contributions
des+0.005
use+0.005
 former+0.005
powers+0.005
ited+0.004
Neg logit contributions
 making-0.003
 drawing-0.003
 poking-0.003
 video-0.003
's-0.003
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
 making+0.040
 poking+0.034
's+0.034
:+0.033
 drawing+0.033
Neg logit contributions
des-0.122
use-0.119
 former-0.116
powers-0.109
ited-0.108
Token Hours
Token ID19347
Feature activation+2.11e-01

Pos logit contributions
:+0.015
 making+0.014
 Sam+0.014
's+0.014
 was+0.013
Neg logit contributions
des-0.045
use-0.044
 former-0.042
ited-0.042
 unnecessary-0.041
Token passed
Token ID3804
Feature activation+3.97e-02

Pos logit contributions
des+0.524
use+0.507
 former+0.502
powers+0.468
ited+0.459
Neg logit contributions
 making-0.216
:-0.187
's-0.186
 poking-0.186
 drawing-0.180
Token,
Token ID11
Feature activation-4.38e-02

Pos logit contributions
des+0.122
use+0.117
 former+0.116
powers+0.110
 reins+0.109
Neg logit contributions
 making-0.020
's-0.017
:-0.017
 in-0.015
 was-0.015
Token and
Token ID290
Feature activation+1.20e-02

Pos logit contributions
 making+0.049
 poking+0.041
's+0.041
 drawing+0.040
:+0.040
Neg logit contributions
des-0.108
use-0.105
 former-0.101
powers-0.097
ited-0.095
Token eventually
Token ID4191
Feature activation+1.20e-01

Pos logit contributions
des+0.031
use+0.030
 former+0.029
powers+0.027
ited+0.027
Neg logit contributions
 making-0.012
 poking-0.010
's-0.010
 drawing-0.010
:-0.010
Token the
Token ID262
Feature activation+2.31e-03

Pos logit contributions
des+0.321
use+0.316
 former+0.307
powers+0.288
ited+0.284
Neg logit contributions
 making-0.090
 poking-0.075
 video-0.075
 drawing-0.071
:-0.070
Token the
Token ID262
Feature activation-1.05e-02

Pos logit contributions
des+0.242
use+0.237
 former+0.230
powers+0.215
ited+0.213
Neg logit contributions
 making-0.099
 poking-0.086
:-0.085
's-0.084
 drawing-0.083
Token door
Token ID3420
Feature activation+4.47e-02

Pos logit contributions
 making+0.014
 poking+0.013
's+0.012
 drawing+0.012
 video+0.012
Neg logit contributions
des-0.023
use-0.022
 former-0.021
powers-0.020
ited-0.020
Token would
Token ID561
Feature activation+5.77e-02

Pos logit contributions
des+0.112
use+0.111
 former+0.109
ited+0.102
 unnecessary+0.102
Neg logit contributions
 making-0.045
's-0.043
 was-0.043
 had-0.041
 in-0.041
Token open
Token ID1280
Feature activation+5.49e-02

Pos logit contributions
des+0.157
use+0.153
 former+0.146
ited+0.146
minded+0.145
Neg logit contributions
 making-0.051
:-0.039
 was-0.038
 poking-0.038
's-0.038
Token.
Token ID13
Feature activation-2.34e-02

Pos logit contributions
des+0.163
use+0.160
 former+0.155
powers+0.151
ited+0.149
Neg logit contributions
 making-0.033
 in-0.030
:-0.029
 on-0.028
's-0.027
Token Hours
Token ID19347
Feature activation+2.10e-01

Pos logit contributions
 making+0.031
's+0.029
:+0.028
 poking+0.027
 drawing+0.027
Neg logit contributions
des-0.051
use-0.051
 former-0.047
ited-0.046
 unnecessary-0.045
Token passed
Token ID3804
Feature activation+4.26e-02

Pos logit contributions
des+0.394
use+0.382
 former+0.361
ited+0.330
powers+0.329
Neg logit contributions
 making-0.337
 poking-0.312
 video-0.307
:-0.307
 drawing-0.307
Token,
Token ID11
Feature activation-4.66e-02

Pos logit contributions
des+0.120
use+0.118
 former+0.116
powers+0.109
God+0.107
Neg logit contributions
 making-0.029
's-0.024
:-0.024
 poking-0.023
 drawing-0.021
Token but
Token ID475
Feature activation-1.85e-02

Pos logit contributions
 making+0.049
 poking+0.042
's+0.042
:+0.041
 drawing+0.041
Neg logit contributions
des-0.115
use-0.113
 former-0.109
powers-0.102
ited-0.101
Token no
Token ID645
Feature activation+6.64e-02

Pos logit contributions
 making+0.023
 video+0.022
 poking+0.021
 drawing+0.021
:+0.021
Neg logit contributions
des-0.038
 former-0.038
use-0.037
 greater-0.033
 unnecessary-0.033
Token-
Token ID12
Feature activation+1.62e-03

Pos logit contributions
des+0.130
use+0.127
 former+0.120
powers+0.112
minded+0.109
Neg logit contributions
 making-0.107
 poking-0.097
's-0.096
 drawing-0.095
:-0.093
Token it
Token ID340
Feature activation-1.26e-02

Pos logit contributions
des+0.060
use+0.059
 former+0.057
powers+0.054
ited+0.053
Neg logit contributions
 making-0.023
 poking-0.019
:-0.019
 drawing-0.019
's-0.019
Token was
Token ID373
Feature activation-4.83e-02

Pos logit contributions
 making+0.006
 poking+0.004
 drawing+0.004
:+0.004
 video+0.004
Neg logit contributions
des-0.038
use-0.037
 former-0.037
powers-0.035
ited-0.035
Token smooth
Token ID7209
Feature activation-3.67e-02

Pos logit contributions
 making+0.054
 poking+0.046
:+0.046
 drawing+0.045
's+0.044
Neg logit contributions
des-0.118
use-0.116
 former-0.111
powers-0.105
minded-0.104
Token again
Token ID757
Feature activation+1.03e-01

Pos logit contributions
 making+0.031
 poking+0.026
 drawing+0.026
:+0.026
 video+0.026
Neg logit contributions
des-0.096
use-0.093
 former-0.091
powers-0.085
ited-0.085
Token.
Token ID13
Feature activation-2.56e-02

Pos logit contributions
des+0.291
use+0.285
 former+0.278
powers+0.263
ited+0.260
Neg logit contributions
 making-0.071
 poking-0.057
:-0.056
's-0.056
 drawing-0.054
Token Hours
Token ID19347
Feature activation+2.10e-01

Pos logit contributions
 making+0.028
:+0.026
's+0.025
 Sam+0.024
 drawing+0.023
Neg logit contributions
use-0.064
des-0.063
ited-0.058
 former-0.057
 unnecessary-0.056
Token and
Token ID290
Feature activation+1.43e-02

Pos logit contributions
des+0.426
use+0.414
 former+0.392
ited+0.362
powers+0.359
Neg logit contributions
 making-0.305
 poking-0.279
 video-0.275
:-0.275
 drawing-0.275
Token hours
Token ID2250
Feature activation+1.02e-01

Pos logit contributions
des+0.037
use+0.036
 former+0.035
powers+0.033
 reins+0.033
Neg logit contributions
 making-0.014
's-0.012
 poking-0.012
:-0.011
 drawing-0.011
Token went
Token ID1816
Feature activation+7.58e-02

Pos logit contributions
des+0.240
use+0.225
 former+0.216
ited+0.208
powers+0.205
Neg logit contributions
 video-0.101
 making-0.101
 drawing-0.093
's-0.092
 poking-0.091
Token by
Token ID416
Feature activation-8.77e-02

Pos logit contributions
des+0.181
use+0.176
 former+0.171
powers+0.160
ited+0.159
Neg logit contributions
 making-0.087
:-0.077
 poking-0.076
's-0.075
 drawing-0.074
Token,
Token ID11
Feature activation-4.88e-02

Pos logit contributions
 making+0.066
's+0.058
 video+0.056
 poking+0.055
:+0.055
Neg logit contributions
des-0.231
 former-0.225
use-0.224
powers-0.207
 unnecessary-0.204
TokenI
Token ID40
Feature activation+6.22e-02

Pos logit contributions
 former+0.105
use+0.099
des+0.097
powers+0.094
 unnecessary+0.093
Neg logit contributions
's-0.087
 making-0.085
 was-0.084
 had-0.080
 being-0.079
Token'm
Token ID1101
Feature activation-1.42e-02

Pos logit contributions
des+0.127
 former+0.123
use+0.122
 reins+0.115
 unnecessary+0.113
Neg logit contributions
 making-0.086
's-0.084
:-0.082
 was-0.081
 poking-0.079
Token sorry
Token ID7926
Feature activation-3.57e-02

Pos logit contributions
 making+0.016
 video+0.016
's+0.016
:+0.015
 drawing+0.015
Neg logit contributions
des-0.030
use-0.029
 former-0.029
 unnecessary-0.027
 greater-0.027
Token,
Token ID11
Feature activation-4.35e-02

Pos logit contributions
 making+0.026
 poking+0.021
:+0.021
's+0.020
 drawing+0.020
Neg logit contributions
des-0.100
use-0.098
 former-0.095
powers-0.089
ited-0.089
Token Your
Token ID3406
Feature activation+2.79e-02

Pos logit contributions
 making+0.062
 poking+0.056
:+0.056
's+0.056
 drawing+0.055
Neg logit contributions
des-0.090
use-0.087
 former-0.085
powers-0.078
ited-0.077
Token Majesty
Token ID25788
Feature activation+2.09e-01

Pos logit contributions
des+0.067
use+0.063
 former+0.060
ited+0.059
God+0.057
Neg logit contributions
 making-0.034
 drawing-0.031
's-0.031
 video-0.030
 poking-0.030
Token.
Token ID13
Feature activation-2.28e-02

Pos logit contributions
des+0.587
use+0.571
 former+0.565
powers+0.525
ited+0.523
Neg logit contributions
 making-0.145
's-0.121
:-0.120
 poking-0.116
 drawing-0.109
Token I
Token ID314
Feature activation+8.30e-02

Pos logit contributions
 making+0.033
's+0.031
:+0.031
 poking+0.030
 video+0.029
Neg logit contributions
des-0.046
use-0.045
 former-0.044
ited-0.042
 unnecessary-0.040
Token won
Token ID1839
Feature activation-1.78e-02

Pos logit contributions
des+0.204
use+0.200
 former+0.195
 reins+0.183
powers+0.182
Neg logit contributions
 making-0.083
's-0.075
:-0.073
 poking-0.072
 drawing-0.070
Token't
Token ID470
Feature activation-4.02e-03

Pos logit contributions
 making+0.021
 poking+0.020
 drawing+0.019
:+0.019
 video+0.019
Neg logit contributions
use-0.034
des-0.032
 former-0.031
powers-0.031
 unnecessary-0.030
Token do
Token ID466
Feature activation-1.10e-01

Pos logit contributions
 making+0.005
 drawing+0.005
:+0.005
 poking+0.005
 video+0.005
Neg logit contributions
des-0.008
use-0.008
 former-0.008
powers-0.007
ited-0.007
Token the
Token ID262
Feature activation-6.70e-03

Pos logit contributions
des+0.086
use+0.084
 former+0.082
powers+0.077
ited+0.076
Neg logit contributions
 making-0.025
 poking-0.020
:-0.020
's-0.020
 drawing-0.020
Token king
Token ID5822
Feature activation+3.79e-02

Pos logit contributions
 making+0.010
's+0.009
 drawing+0.009
 video+0.008
:+0.008
Neg logit contributions
des-0.015
use-0.014
 former-0.013
ited-0.013
powers-0.012
Token,
Token ID11
Feature activation-4.70e-02

Pos logit contributions
use+0.098
des+0.096
 former+0.092
minded+0.089
powers+0.088
Neg logit contributions
 making-0.027
 video-0.022
 poking-0.022
 spraying-0.022
 drawing-0.021
Token "
Token ID366
Feature activation+5.48e-02

Pos logit contributions
 making+0.027
 video+0.022
 poking+0.022
 drawing+0.022
's+0.021
Neg logit contributions
use-0.128
des-0.128
 former-0.124
God-0.118
minded-0.117
TokenYour
Token ID7120
Feature activation+1.04e-03

Pos logit contributions
des+0.097
use+0.095
 former+0.095
powers+0.090
 Majesty+0.090
Neg logit contributions
 making-0.089
's-0.087
 was-0.085
 in-0.081
 poking-0.081
Token Majesty
Token ID25788
Feature activation+2.07e-01

Pos logit contributions
des+0.002
use+0.002
 former+0.002
ited+0.002
irst+0.002
Neg logit contributions
 making-0.001
 drawing-0.001
's-0.001
 video-0.001
:-0.001
Token,
Token ID11
Feature activation-4.68e-02

Pos logit contributions
des+0.607
 former+0.586
use+0.580
powers+0.543
ited+0.537
Neg logit contributions
 making-0.132
:-0.115
's-0.113
 poking-0.099
 was-0.095
Token it
Token ID340
Feature activation-1.04e-02

Pos logit contributions
 making+0.058
 poking+0.053
:+0.052
 drawing+0.052
's+0.052
Neg logit contributions
des-0.102
use-0.099
 former-0.097
powers-0.089
God-0.088
Token has
Token ID468
Feature activation-5.55e-02

Pos logit contributions
 making+0.005
 poking+0.004
 drawing+0.004
 video+0.004
:+0.004
Neg logit contributions
des-0.030
use-0.030
 former-0.028
powers-0.027
ited-0.027
Token been
Token ID587
Feature activation-8.43e-03

Pos logit contributions
 making+0.056
's+0.049
:+0.046
 poking+0.046
 drawing+0.046
Neg logit contributions
des-0.140
use-0.138
 former-0.132
ited-0.126
minded-0.123
Token too
Token ID1165
Feature activation+5.46e-02

Pos logit contributions
 making+0.010
's+0.008
:+0.008
 poking+0.008
 in+0.008
Neg logit contributions
des-0.021
use-0.020
 former-0.019
 reins-0.018
powers-0.018
Token,
Token ID11
Feature activation-4.47e-02

Pos logit contributions
 making+0.029
 video+0.025
 drawing+0.023
 poking+0.022
's+0.021
Neg logit contributions
des-0.126
use-0.122
 former-0.118
powers-0.114
minded-0.113
Token "
Token ID366
Feature activation+5.34e-02

Pos logit contributions
 making+0.027
 video+0.024
 poking+0.023
 drawing+0.023
's+0.022
Neg logit contributions
des-0.120
use-0.118
 former-0.115
God-0.110
 reins-0.109
TokenYes
Token ID5297
Feature activation-1.95e-02

Pos logit contributions
use+0.105
des+0.100
 former+0.100
 unnecessary+0.095
aj+0.092
Neg logit contributions
 making-0.078
 was-0.073
's-0.073
 had-0.071
 being-0.070
Token,
Token ID11
Feature activation-4.49e-02

Pos logit contributions
 making+0.009
 poking+0.007
:+0.006
's+0.006
 drawing+0.006
Neg logit contributions
des-0.059
use-0.058
 former-0.056
powers-0.053
ited-0.053
Token Your
Token ID3406
Feature activation+2.54e-02

Pos logit contributions
 making+0.066
's+0.060
 poking+0.060
:+0.060
 drawing+0.059
Neg logit contributions
des-0.091
use-0.089
 former-0.086
powers-0.080
ited-0.079
Token Majesty
Token ID25788
Feature activation+2.06e-01

Pos logit contributions
des+0.059
use+0.056
 former+0.053
ited+0.052
God+0.051
Neg logit contributions
 making-0.032
 drawing-0.030
's-0.029
 poking-0.029
:-0.028
Token.
Token ID13
Feature activation-2.45e-02

Pos logit contributions
des+0.595
use+0.575
 former+0.575
powers+0.531
ited+0.531
Neg logit contributions
 making-0.132
:-0.113
's-0.111
 poking-0.101
 drawing-0.095
Token I
Token ID314
Feature activation+8.02e-02

Pos logit contributions
 making+0.033
's+0.030
:+0.030
 poking+0.029
 drawing+0.029
Neg logit contributions
des-0.052
use-0.051
 former-0.049
ited-0.047
 unnecessary-0.045
Token promise
Token ID6991
Feature activation+1.39e-03

Pos logit contributions
des+0.154
use+0.147
 former+0.143
 reins+0.134
powers+0.133
Neg logit contributions
 making-0.126
's-0.119
:-0.119
 was-0.115
 poking-0.113
Token I
Token ID314
Feature activation+7.98e-02

Pos logit contributions
des+0.003
use+0.003
 former+0.003
ited+0.003
powers+0.003
Neg logit contributions
 making-0.002
:-0.001
's-0.001
 poking-0.001
 drawing-0.001
Token'll
Token ID1183
Feature activation+7.57e-02

Pos logit contributions
des+0.143
use+0.138
 former+0.134
powers+0.122
ited+0.121
Neg logit contributions
 making-0.135
:-0.126
's-0.126
 poking-0.123
 drawing-0.121
Token everyone
Token ID2506
Feature activation+5.35e-02

Pos logit contributions
des+0.043
use+0.042
 former+0.041
powers+0.039
God+0.038
Neg logit contributions
 making-0.017
 poking-0.015
's-0.015
:-0.015
 video-0.015
Token called
Token ID1444
Feature activation+1.20e-02

Pos logit contributions
des+0.114
use+0.107
God+0.101
yes+0.100
away+0.100
Neg logit contributions
 making-0.055
 poking-0.054
 grid-0.053
 Sam-0.052
 video-0.051
Token him
Token ID683
Feature activation-2.63e-02

Pos logit contributions
des+0.027
use+0.026
 former+0.025
 reins+0.024
God+0.024
Neg logit contributions
 making-0.013
 poking-0.012
's-0.011
 video-0.011
 spraying-0.011
Token "
Token ID366
Feature activation+5.90e-02

Pos logit contributions
 making+0.025
 poking+0.022
 drawing+0.021
 video+0.020
's+0.020
Neg logit contributions
des-0.065
use-0.064
 former-0.062
powers-0.059
minded-0.058
TokenYour
Token ID7120
Feature activation+2.86e-03

Pos logit contributions
 former+0.097
des+0.092
use+0.090
 unnecessary+0.090
powers+0.090
Neg logit contributions
 was-0.101
 making-0.099
 on-0.099
's-0.098
 had-0.098
Token Majesty
Token ID25788
Feature activation+2.05e-01

Pos logit contributions
des+0.007
use+0.007
ited+0.006
 former+0.006
 unnecessary+0.006
Neg logit contributions
 drawing-0.004
 making-0.003
's-0.003
 poking-0.003
:-0.003
Token".
Token ID1911
Feature activation+5.60e-02

Pos logit contributions
des+0.442
 former+0.422
use+0.414
 unnecessary+0.377
powers+0.377
Neg logit contributions
 making-0.289
:-0.279
's-0.276
 was-0.256
 poking-0.256
Token The
Token ID383
Feature activation-1.50e-02

Pos logit contributions
des+0.102
use+0.097
 former+0.095
powers+0.087
minded+0.085
Neg logit contributions
 making-0.087
 poking-0.085
 drawing-0.080
 spraying-0.079
 doing-0.078
Token king
Token ID5822
Feature activation+3.91e-02

Pos logit contributions
 making+0.020
 drawing+0.019
's+0.018
:+0.018
 poking+0.018
Neg logit contributions
des-0.033
use-0.032
 former-0.032
ited-0.030
 unnecessary-0.030
Token loved
Token ID6151
Feature activation+4.15e-02

Pos logit contributions
des+0.100
use+0.097
 former+0.097
ited+0.093
powers+0.093
Neg logit contributions
 making-0.028
 poking-0.024
 spraying-0.023
 video-0.022
 fighting-0.020
Token to
Token ID284
Feature activation+3.29e-02

Pos logit contributions
des+0.101
use+0.098
 former+0.096
powers+0.090
God+0.088
Neg logit contributions
 making-0.041
's-0.037
 poking-0.036
:-0.036
 video-0.035
Token the
Token ID262
Feature activation-7.87e-03

Pos logit contributions
des+0.097
use+0.091
 former+0.089
ited+0.088
powers+0.086
Neg logit contributions
 making-0.024
's-0.022
:-0.019
 in-0.018
 drawing-0.018
Token village
Token ID7404
Feature activation+1.14e-02

Pos logit contributions
 making+0.012
's+0.011
 drawing+0.011
 poking+0.010
 video+0.010
Neg logit contributions
des-0.017
use-0.016
 former-0.015
ited-0.014
rene-0.014
Token back
Token ID736
Feature activation+8.89e-02

Pos logit contributions
des+0.029
use+0.028
 former+0.026
ited+0.025
 reins+0.025
Neg logit contributions
 making-0.013
's-0.012
:-0.011
 in-0.011
 was-0.010
Token to
Token ID284
Feature activation+3.09e-02

Pos logit contributions
des+0.237
use+0.225
ited+0.224
irst+0.207
 former+0.206
Neg logit contributions
's-0.090
 in-0.089
 making-0.089
 on-0.085
,-0.079
Token its
Token ID663
Feature activation+1.06e-02

Pos logit contributions
des+0.078
use+0.073
ited+0.071
powers+0.070
 reins+0.069
Neg logit contributions
 making-0.038
 drawing-0.031
's-0.031
 being-0.030
:-0.030
Token former
Token ID1966
Feature activation+2.02e-01

Pos logit contributions
des+0.022
use+0.020
ited+0.019
rene+0.018
irst+0.017
Neg logit contributions
 making-0.017
 drawing-0.016
's-0.016
 in-0.016
 playing-0.015
Token glory
Token ID13476
Feature activation+1.42e-02

Pos logit contributions
des+0.349
use+0.326
God+0.290
 reins+0.288
yes+0.276
Neg logit contributions
 making-0.367
:-0.364
's-0.348
 drawing-0.342
 shape-0.336
Token!"
Token ID2474
Feature activation+4.44e-02

Pos logit contributions
des+0.033
use+0.031
 former+0.029
ited+0.029
 unnecessary+0.028
Neg logit contributions
 making-0.020
:-0.018
's-0.018
 in-0.017
,-0.017
Token
Token ID220
Feature activation+2.60e-02

Pos logit contributions
des+0.113
use+0.110
 former+0.108
powers+0.100
ited+0.100
Neg logit contributions
 making-0.042
's-0.036
:-0.036
 poking-0.036
 drawing-0.035
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
des+0.066
use+0.065
 former+0.062
ited+0.059
powers+0.058
Neg logit contributions
 making-0.025
:-0.022
's-0.022
 poking-0.022
 drawing-0.021
Token
Token ID198
Feature activation+1.66e-02

Pos logit contributions
des+0.046
use+0.045
 former+0.044
ited+0.042
 unnecessary+0.041
Neg logit contributions
 making-0.014
:-0.013
's-0.013
 was-0.012
 video-0.012
Token,
Token ID11
Feature activation-4.57e-02

Pos logit contributions
 making+0.020
 drawing+0.019
 video+0.019
 poking+0.018
 grid+0.017
Neg logit contributions
des-0.049
 former-0.049
use-0.048
 greater-0.044
away-0.044
Token leaving
Token ID4305
Feature activation-2.15e-03

Pos logit contributions
 making+0.046
:+0.041
 poking+0.041
 video+0.041
's+0.040
Neg logit contributions
des-0.110
use-0.107
 former-0.105
powers-0.096
 greater-0.096
Token it
Token ID340
Feature activation-9.83e-03

Pos logit contributions
 making+0.002
 poking+0.001
 video+0.001
 drawing+0.001
:+0.001
Neg logit contributions
des-0.006
use-0.006
 former-0.006
 greater-0.005
minded-0.005
Token to
Token ID284
Feature activation+3.10e-02

Pos logit contributions
 making+0.010
 video+0.009
 poking+0.009
:+0.009
 drawing+0.009
Neg logit contributions
des-0.023
use-0.023
 former-0.022
powers-0.021
ited-0.020
Token its
Token ID663
Feature activation+9.44e-03

Pos logit contributions
des+0.073
use+0.070
 former+0.070
powers+0.063
 greater+0.063
Neg logit contributions
 making-0.034
's-0.031
 poking-0.031
:-0.031
 drawing-0.030
Token former
Token ID1966
Feature activation+2.01e-01

Pos logit contributions
des+0.019
use+0.018
 former+0.016
ited+0.016
powers+0.016
Neg logit contributions
 making-0.015
 poking-0.014
's-0.013
 drawing-0.013
:-0.013
Token state
Token ID1181
Feature activation+6.30e-02

Pos logit contributions
des+0.319
use+0.308
 former+0.264
God+0.263
ited+0.256
Neg logit contributions
 making-0.390
:-0.370
's-0.361
 drawing-0.360
 poking-0.358
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
des+0.173
use+0.169
 former+0.165
powers+0.155
ited+0.153
Neg logit contributions
 making-0.046
 poking-0.038
:-0.037
's-0.037
 drawing-0.036
Token
Token ID220
Feature activation+2.79e-02

Pos logit contributions
 making+0.022
 poking+0.020
 drawing+0.019
's+0.019
:+0.019
Neg logit contributions
des-0.053
use-0.052
 former-0.051
powers-0.048
God-0.047
Token
Token ID198
Feature activation+1.79e-02

Pos logit contributions
des+0.067
use+0.065
 former+0.064
powers+0.059
ited+0.058
Neg logit contributions
 making-0.030
 poking-0.026
:-0.026
's-0.026
 drawing-0.026
Token
Token ID198
Feature activation+1.75e-02

Pos logit contributions
des+0.044
use+0.042
 former+0.041
powers+0.038
ited+0.038
Neg logit contributions
 making-0.019
 poking-0.017
:-0.016
's-0.016
 drawing-0.016
Token the
Token ID262
Feature activation-6.61e-03

Pos logit contributions
des+0.096
use+0.094
 former+0.093
minded+0.086
 unnecessary+0.086
Neg logit contributions
 making-0.024
 poking-0.021
 drawing-0.021
:-0.020
 video-0.020
Token tree
Token ID5509
Feature activation+2.61e-02

Pos logit contributions
 making+0.008
 drawing+0.007
 poking+0.007
's+0.007
 video+0.007
Neg logit contributions
des-0.015
use-0.015
 former-0.014
minded-0.014
ited-0.013
Token to
Token ID284
Feature activation+2.66e-02

Pos logit contributions
des+0.067
 former+0.065
use+0.064
powers+0.060
 greater+0.059
Neg logit contributions
 making-0.021
 poking-0.019
 video-0.018
:-0.018
 drawing-0.018
Token its
Token ID663
Feature activation+5.60e-03

Pos logit contributions
des+0.067
use+0.066
 former+0.063
powers+0.060
ited+0.060
Neg logit contributions
 making-0.026
 poking-0.022
:-0.022
's-0.022
 drawing-0.022
Token once
Token ID1752
Feature activation+5.00e-03

Pos logit contributions
des+0.012
use+0.012
ited+0.011
irst+0.010
minded+0.010
Neg logit contributions
 making-0.008
's-0.008
 drawing-0.008
:-0.008
 playing-0.007
Token former
Token ID1966
Feature activation+2.01e-01

Pos logit contributions
des+0.011
use+0.011
 former+0.011
powers+0.010
 reins+0.010
Neg logit contributions
 making-0.006
's-0.006
 poking-0.006
:-0.006
 drawing-0.005
Token beauty
Token ID8737
Feature activation-2.98e-03

Pos logit contributions
des+0.352
use+0.335
 reins+0.295
God+0.293
yes+0.286
Neg logit contributions
 making-0.359
:-0.356
's-0.334
,-0.329
 drawing-0.326
Token.
Token ID13
Feature activation-2.11e-02

Pos logit contributions
 making+0.002
 poking+0.002
 drawing+0.002
 video+0.002
's+0.002
Neg logit contributions
des-0.008
use-0.008
 former-0.007
powers-0.007
 greater-0.007
Token He
Token ID679
Feature activation+6.08e-03

Pos logit contributions
:+0.021
 making+0.021
's+0.020
 was+0.019
 having+0.018
Neg logit contributions
use-0.053
des-0.052
 former-0.049
ited-0.048
 unnecessary-0.047
Token was
Token ID373
Feature activation-4.60e-02

Pos logit contributions
des+0.018
use+0.017
 former+0.017
 unnecessary+0.016
powers+0.016
Neg logit contributions
 making-0.004
's-0.003
 was-0.003
:-0.003
 had-0.003
Token happy
Token ID3772
Feature activation-4.25e-03

Pos logit contributions
 making+0.049
 poking+0.043
:+0.043
's+0.043
 drawing+0.042
Neg logit contributions
des-0.111
use-0.108
 former-0.105
powers-0.098
ited-0.097
Token back
Token ID736
Feature activation+9.00e-02

Pos logit contributions
des+0.106
use+0.103
 former+0.101
powers+0.093
 unnecessary+0.092
Neg logit contributions
 making-0.047
 poking-0.042
:-0.041
 drawing-0.040
's-0.040
Token the
Token ID262
Feature activation-6.89e-03

Pos logit contributions
des+0.251
use+0.248
 former+0.237
ited+0.232
powers+0.225
Neg logit contributions
 making-0.066
's-0.059
 in-0.055
:-0.054
 drawing-0.053
Token village
Token ID7404
Feature activation+9.85e-03

Pos logit contributions
 making+0.010
 poking+0.009
's+0.009
 drawing+0.009
:+0.009
Neg logit contributions
des-0.014
use-0.014
 former-0.013
powers-0.012
ited-0.012
Token to
Token ID284
Feature activation+3.18e-02

Pos logit contributions
des+0.025
use+0.025
 former+0.024
ited+0.022
 unnecessary+0.022
Neg logit contributions
 making-0.009
's-0.009
:-0.008
 drawing-0.008
 poking-0.008
Token its
Token ID663
Feature activation+1.09e-02

Pos logit contributions
des+0.089
use+0.087
 former+0.084
powers+0.081
ited+0.080
Neg logit contributions
 making-0.024
 drawing-0.019
:-0.019
 poking-0.019
's-0.018
Token former
Token ID1966
Feature activation+2.00e-01

Pos logit contributions
des+0.023
use+0.022
ited+0.020
rene+0.019
irst+0.018
Neg logit contributions
 making-0.018
 drawing-0.016
 playing-0.016
 in-0.016
's-0.016
Token state
Token ID1181
Feature activation+6.20e-02

Pos logit contributions
des+0.343
use+0.326
God+0.290
 reins+0.286
ited+0.274
Neg logit contributions
 making-0.360
:-0.359
's-0.336
 drawing-0.335
 shape-0.326
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
des+0.179
use+0.177
 former+0.169
powers+0.162
ited+0.162
Neg logit contributions
 making-0.040
:-0.033
's-0.033
 poking-0.030
 drawing-0.029
Token With
Token ID2080
Feature activation-1.18e-02

Pos logit contributions
 making+0.023
 poking+0.020
:+0.020
's+0.020
 drawing+0.020
Neg logit contributions
des-0.054
use-0.053
 former-0.051
powers-0.048
God-0.047
Token the
Token ID262
Feature activation-6.33e-03

Pos logit contributions
 making+0.010
 poking+0.009
's+0.009
 video+0.009
 drawing+0.009
Neg logit contributions
des-0.030
use-0.029
 former-0.029
powers-0.027
ited-0.027
Token help
Token ID1037
Feature activation+2.12e-02

Pos logit contributions
 making+0.009
 poking+0.009
:+0.008
's+0.008
 drawing+0.008
Neg logit contributions
des-0.013
use-0.012
 former-0.012
powers-0.011
ited-0.011
Token to
Token ID284
Feature activation+3.09e-02

Pos logit contributions
des+0.129
 former+0.124
use+0.119
ited+0.118
God+0.115
Neg logit contributions
 making-0.038
:-0.034
's-0.034
 in-0.034
 on-0.033
Token be
Token ID307
Feature activation-1.51e-02

Pos logit contributions
des+0.072
use+0.070
 former+0.066
ited+0.065
powers+0.064
Neg logit contributions
 making-0.039
's-0.034
 drawing-0.033
 poking-0.033
:-0.032
Token restored
Token ID15032
Feature activation+3.40e-02

Pos logit contributions
 making+0.020
 in+0.018
's+0.017
 on+0.017
 playing+0.017
Neg logit contributions
des-0.037
use-0.035
ited-0.033
irst-0.033
 reins-0.033
Token to
Token ID284
Feature activation+3.09e-02

Pos logit contributions
des+0.088
ited+0.083
use+0.082
 former+0.079
powers+0.075
Neg logit contributions
 in-0.038
 making-0.038
 on-0.037
's-0.037
,-0.036
Token its
Token ID663
Feature activation+8.01e-03

Pos logit contributions
des+0.079
use+0.076
ited+0.073
 former+0.071
powers+0.070
Neg logit contributions
 making-0.034
 drawing-0.029
's-0.028
 being-0.028
 poking-0.027
Token former
Token ID1966
Feature activation+2.00e-01

Pos logit contributions
des+0.016
use+0.014
ited+0.014
rene+0.012
irst+0.012
Neg logit contributions
 making-0.014
 drawing-0.013
's-0.013
:-0.013
 playing-0.013
Token glory
Token ID13476
Feature activation+1.33e-02

Pos logit contributions
des+0.362
use+0.333
God+0.302
 reins+0.301
ited+0.294
Neg logit contributions
 making-0.350
:-0.349
 drawing-0.327
's-0.326
,-0.323
Token."
Token ID526
Feature activation+6.78e-02

Pos logit contributions
des+0.033
use+0.031
ited+0.031
 former+0.030
 unnecessary+0.029
Neg logit contributions
 making-0.016
:-0.016
's-0.015
 in-0.015
,-0.015
Token
Token ID220
Feature activation+2.69e-02

Pos logit contributions
des+0.164
use+0.159
 former+0.155
powers+0.144
ited+0.142
Neg logit contributions
 making-0.071
 poking-0.062
 video-0.061
 drawing-0.061
's-0.060
Token
Token ID198
Feature activation+1.78e-02

Pos logit contributions
des+0.067
use+0.065
 former+0.063
ited+0.059
powers+0.059
Neg logit contributions
 making-0.027
:-0.024
 poking-0.024
's-0.024
 drawing-0.023
Token
Token ID198
Feature activation+1.78e-02

Pos logit contributions
des+0.046
use+0.045
 former+0.044
powers+0.040
ited+0.040
Neg logit contributions
 making-0.017
:-0.015
's-0.015
 poking-0.015
 drawing-0.014
Token to
Token ID284
Feature activation+2.10e-02

Pos logit contributions
 making+0.021
:+0.021
's+0.019
 was+0.019
 poking+0.019
Neg logit contributions
use-0.048
des-0.048
 former-0.046
ited-0.044
 unnecessary-0.043
Token sort
Token ID3297
Feature activation+4.23e-02

Pos logit contributions
des+0.051
use+0.049
 former+0.048
powers+0.045
ited+0.045
Neg logit contributions
 making-0.023
 poking-0.020
's-0.020
:-0.020
 drawing-0.019
Token the
Token ID262
Feature activation-1.70e-03

Pos logit contributions
des+0.113
use+0.109
 former+0.108
ited+0.101
powers+0.100
Neg logit contributions
 making-0.037
's-0.032
:-0.031
 poking-0.030
 drawing-0.030
Token oranges
Token ID48389
Feature activation-1.88e-02

Pos logit contributions
 making+0.002
 poking+0.002
's+0.002
 drawing+0.002
:+0.002
Neg logit contributions
des-0.004
use-0.004
 former-0.004
ited-0.003
powers-0.003
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
 making+0.014
:+0.011
 poking+0.011
's+0.011
 drawing+0.011
Neg logit contributions
des-0.052
use-0.051
 former-0.050
powers-0.047
ited-0.046
Token First
Token ID3274
Feature activation+1.83e-01

Pos logit contributions
 making+0.009
's+0.009
:+0.009
 Sam+0.009
 was+0.008
Neg logit contributions
des-0.029
use-0.029
 former-0.027
 unnecessary-0.027
ited-0.027
Token it
Token ID340
Feature activation-7.36e-03

Pos logit contributions
des+0.488
use+0.483
 former+0.466
powers+0.439
ited+0.433
Neg logit contributions
 making-0.117
 video-0.102
 poking-0.100
 drawing-0.093
:-0.093
Token ate
Token ID15063
Feature activation-2.33e-02

Pos logit contributions
 making+0.006
 poking+0.005
:+0.005
 drawing+0.005
 video+0.004
Neg logit contributions
des-0.020
use-0.019
 former-0.019
powers-0.018
ited-0.018
Token the
Token ID262
Feature activation-2.09e-03

Pos logit contributions
 making+0.017
:+0.014
 poking+0.014
's+0.013
 drawing+0.013
Neg logit contributions
des-0.065
use-0.064
 former-0.062
powers-0.059
ited-0.058
Token ripe
Token ID29036
Feature activation+2.22e-02

Pos logit contributions
 making+0.003
 poking+0.003
's+0.003
:+0.003
 drawing+0.003
Neg logit contributions
des-0.004
use-0.004
 former-0.004
powers-0.004
ited-0.004
Token oranges
Token ID48389
Feature activation-2.07e-02

Pos logit contributions
des+0.052
use+0.050
 former+0.049
powers+0.046
God+0.046
Neg logit contributions
 making-0.027
:-0.024
's-0.024
 poking-0.024
 drawing-0.023
Token robot
Token ID9379
Feature activation-4.06e-02

Pos logit contributions
des+0.007
use+0.007
 former+0.006
powers+0.006
ited+0.006
Neg logit contributions
 making-0.004
 poking-0.004
 drawing-0.004
:-0.003
's-0.003
Token got
Token ID1392
Feature activation+2.40e-02

Pos logit contributions
 making+0.031
 poking+0.025
:+0.025
's+0.025
 drawing+0.025
Neg logit contributions
des-0.110
use-0.108
 former-0.105
powers-0.099
ited-0.098
Token to
Token ID284
Feature activation+2.01e-02

Pos logit contributions
des+0.054
use+0.053
 former+0.052
powers+0.048
 greater+0.048
Neg logit contributions
 making-0.026
 video-0.024
:-0.024
 drawing-0.023
's-0.023
Token work
Token ID670
Feature activation+3.88e-02

Pos logit contributions
des+0.049
use+0.047
 former+0.046
powers+0.043
ited+0.043
Neg logit contributions
 making-0.022
 poking-0.019
's-0.019
 drawing-0.019
:-0.019
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
des+0.108
use+0.105
 former+0.103
powers+0.097
ited+0.096
Neg logit contributions
 making-0.028
:-0.023
 poking-0.022
's-0.022
 drawing-0.022
Token First
Token ID3274
Feature activation+1.83e-01

Pos logit contributions
:+0.017
 making+0.017
's+0.017
 Sam+0.017
 Max+0.015
Neg logit contributions
des-0.054
use-0.053
 unnecessary-0.050
 former-0.050
ited-0.049
Token,
Token ID11
Feature activation-4.71e-02

Pos logit contributions
des+0.487
use+0.481
 former+0.463
powers+0.439
ited+0.432
Neg logit contributions
 making-0.120
 video-0.105
 poking-0.103
 drawing-0.096
:-0.094
Token they
Token ID484
Feature activation-2.33e-02

Pos logit contributions
 making+0.034
's+0.026
 poking+0.025
 drawing+0.024
 in+0.024
Neg logit contributions
des-0.137
use-0.133
 former-0.129
ited-0.123
powers-0.122
Token found
Token ID1043
Feature activation+2.22e-02

Pos logit contributions
 making+0.025
's+0.022
:+0.022
 poking+0.021
 drawing+0.021
Neg logit contributions
des-0.057
use-0.055
 former-0.054
powers-0.050
 unnecessary-0.050
Token a
Token ID257
Feature activation+1.67e-02

Pos logit contributions
des+0.050
use+0.049
 former+0.049
 greater+0.047
powers+0.047
Neg logit contributions
 making-0.020
 video-0.019
 drawing-0.018
:-0.018
 sharing-0.018
Token big
Token ID1263
Feature activation-1.55e-02

Pos logit contributions
des+0.036
use+0.036
 former+0.035
powers+0.032
 greater+0.032
Neg logit contributions
 making-0.020
:-0.018
 poking-0.018
's-0.018
 drawing-0.017
Token told
Token ID1297
Feature activation-2.54e-03

Pos logit contributions
 making+0.035
 video+0.033
 drawing+0.030
 grid+0.030
 spraying+0.028
Neg logit contributions
use-0.100
des-0.096
ited-0.092
 former-0.092
powers-0.090
Token Jim
Token ID5395
Feature activation-2.54e-02

Pos logit contributions
 making+0.003
 video+0.003
 drawing+0.003
 spraying+0.003
 poking+0.003
Neg logit contributions
des-0.005
use-0.005
minded-0.005
ited-0.005
 former-0.004
Token that
Token ID326
Feature activation-5.93e-02

Pos logit contributions
 making+0.029
 video+0.029
 spraying+0.026
 drawing+0.025
 grid+0.024
Neg logit contributions
des-0.052
use-0.050
powers-0.048
minded-0.047
 anytime-0.046
Token the
Token ID262
Feature activation-8.22e-03

Pos logit contributions
 making+0.065
 video+0.063
:+0.061
 drawing+0.059
 poking+0.058
Neg logit contributions
des-0.123
use-0.121
 former-0.116
God-0.112
 greater-0.110
Token zo
Token ID40565
Feature activation+4.86e-02

Pos logit contributions
 making+0.011
:+0.010
 poking+0.010
on+0.010
 drawing+0.010
Neg logit contributions
use-0.017
 former-0.016
des-0.016
 reins-0.015
powers-0.015
Tokenoke
Token ID2088
Feature activation-1.70e-01

Pos logit contributions
use+0.106
des+0.105
powers+0.097
 former+0.095
minded+0.093
Neg logit contributions
 making-0.063
 poking-0.054
 drawing-0.054
 video-0.053
's-0.052
Tokenep
Token ID538
Feature activation-1.07e-02

Pos logit contributions
 making+0.352
 video+0.340
 spraying+0.321
 drawing+0.320
 having+0.310
Neg logit contributions
use-0.209
des-0.185
minded-0.170
away-0.168
 reins-0.161
Tokeners
Token ID364
Feature activation-4.24e-02

Pos logit contributions
 making+0.008
 poking+0.007
 drawing+0.007
 video+0.007
 spraying+0.006
Neg logit contributions
des-0.028
use-0.028
 former-0.027
powers-0.026
ited-0.025
Token had
Token ID550
Feature activation-6.52e-02

Pos logit contributions
 making+0.035
 poking+0.031
 drawing+0.030
 video+0.030
 spraying+0.028
Neg logit contributions
use-0.107
des-0.107
 former-0.101
ited-0.097
powers-0.096
Token split
Token ID6626
Feature activation-2.51e-02

Pos logit contributions
 making+0.074
 video+0.066
 poking+0.066
:+0.066
's+0.065
Neg logit contributions
des-0.149
use-0.146
 former-0.142
powers-0.132
 unnecessary-0.129
Token the
Token ID262
Feature activation-8.08e-03

Pos logit contributions
 making+0.023
 video+0.020
 drawing+0.019
 poking+0.019
:+0.019
Neg logit contributions
des-0.061
use-0.060
 former-0.060
powers-0.056
 unnecessary-0.056
Token the
Token ID262
Feature activation-1.08e-02

Pos logit contributions
des+0.080
use+0.079
 former+0.076
minded+0.072
 unnecessary+0.072
Neg logit contributions
 making-0.019
 poking-0.017
:-0.016
 drawing-0.015
's-0.015
Token bear
Token ID6842
Feature activation-3.90e-02

Pos logit contributions
 making+0.012
's+0.011
 poking+0.010
:+0.010
 drawing+0.010
Neg logit contributions
des-0.025
use-0.025
 former-0.025
powers-0.023
God-0.022
Token and
Token ID290
Feature activation+1.47e-02

Pos logit contributions
 making+0.043
 poking+0.041
 drawing+0.038
 video+0.038
 grid+0.037
Neg logit contributions
des-0.085
use-0.079
 former-0.078
 anytime-0.076
 someday-0.075
Token the
Token ID262
Feature activation-1.03e-02

Pos logit contributions
des+0.033
use+0.033
 former+0.032
powers+0.029
ited+0.029
Neg logit contributions
 making-0.018
 poking-0.016
's-0.016
:-0.016
 drawing-0.015
Token zo
Token ID40565
Feature activation+4.48e-02

Pos logit contributions
 making+0.010
 poking+0.009
:+0.008
 drawing+0.008
's+0.008
Neg logit contributions
des-0.026
use-0.026
 former-0.025
powers-0.023
ited-0.023
Tokenoke
Token ID2088
Feature activation-1.69e-01

Pos logit contributions
des+0.103
use+0.101
 former+0.094
powers+0.092
minded+0.090
Neg logit contributions
 making-0.053
 poking-0.046
 drawing-0.046
 video-0.046
's-0.045
Tokenep
Token ID538
Feature activation-1.11e-02

Pos logit contributions
 video+0.351
 making+0.350
 drawing+0.332
 spraying+0.326
 shooting+0.317
Neg logit contributions
use-0.207
des-0.188
 reins-0.173
minded-0.172
God-0.165
Tokeners
Token ID364
Feature activation-4.36e-02

Pos logit contributions
 making+0.015
 poking+0.014
 drawing+0.014
 video+0.014
:+0.013
Neg logit contributions
des-0.023
use-0.022
 former-0.021
minded-0.019
powers-0.019
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
 making+0.041
 video+0.038
 poking+0.037
 drawing+0.036
:+0.033
Neg logit contributions
des-0.107
use-0.103
 former-0.097
 reins-0.093
ited-0.093
Token went
Token ID1816
Feature activation+7.46e-02

Pos logit contributions
des+0.034
use+0.033
 former+0.032
powers+0.029
God+0.029
Neg logit contributions
 making-0.019
 poking-0.017
:-0.017
 video-0.017
 drawing-0.017
Token back
Token ID736
Feature activation+9.05e-02

Pos logit contributions
des+0.174
use+0.170
 former+0.166
powers+0.153
ited+0.153
Neg logit contributions
 making-0.087
 poking-0.077
:-0.077
's-0.076
 drawing-0.075
Token.
Token ID13
Feature activation-2.67e-02

Pos logit contributions
 drawing+0.033
 sharing+0.032
 Cube+0.030
on+0.030
 grid+0.030
Neg logit contributions
des-0.055
away-0.053
God-0.050
use-0.049
 reins-0.048
Token It
Token ID632
Feature activation-4.91e-02

Pos logit contributions
 making+0.029
 poking+0.026
's+0.025
 drawing+0.025
:+0.024
Neg logit contributions
des-0.062
use-0.060
 former-0.060
powers-0.057
God-0.056
Token was
Token ID373
Feature activation-5.01e-02

Pos logit contributions
 drawing+0.041
 sharing+0.039
 making+0.036
 poking+0.034
on+0.034
Neg logit contributions
use-0.115
des-0.114
 former-0.107
God-0.107
ited-0.107
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
's+0.049
 video+0.046
on+0.044
 grid+0.044
:+0.041
Neg logit contributions
 former-0.103
God-0.100
des-0.099
 unnecessary-0.099
use-0.099
Token zo
Token ID40565
Feature activation+4.65e-02

Pos logit contributions
 making+0.016
 poking+0.014
:+0.014
's+0.014
 drawing+0.014
Neg logit contributions
des-0.026
use-0.025
 former-0.025
powers-0.023
ited-0.022
Tokenoke
Token ID2088
Feature activation-1.69e-01

Pos logit contributions
des+0.107
use+0.106
 former+0.098
powers+0.096
God+0.093
Neg logit contributions
 making-0.054
 poking-0.048
 drawing-0.047
's-0.047
 video-0.047
Tokeneper
Token ID5723
Feature activation+2.68e-02

Pos logit contributions
 video+0.320
 making+0.303
 drawing+0.288
:+0.278
 spraying+0.273
Neg logit contributions
des-0.245
use-0.230
 reins-0.203
God-0.201
away-0.198
Token.
Token ID13
Feature activation-2.64e-02

Pos logit contributions
des+0.057
use+0.056
God+0.048
ited+0.048
 reins+0.048
Neg logit contributions
 video-0.029
 making-0.026
 drawing-0.025
 er-0.024
 sharing-0.024
Token He
Token ID679
Feature activation+4.25e-03

Pos logit contributions
 making+0.035
 poking+0.032
 drawing+0.031
's+0.030
 video+0.030
Neg logit contributions
des-0.057
use-0.055
 former-0.053
powers-0.050
God-0.049
Token had
Token ID550
Feature activation-6.52e-02

Pos logit contributions
use+0.010
des+0.010
 former+0.010
God+0.009
powers+0.009
Neg logit contributions
 making-0.004
 drawing-0.004
 poking-0.004
 video-0.003
's-0.003
Token come
Token ID1282
Feature activation+3.00e-02

Pos logit contributions
 making+0.071
 poking+0.061
's+0.060
:+0.060
 drawing+0.059
Neg logit contributions
des-0.161
use-0.156
 former-0.154
powers-0.142
ited-0.142
Token
Token ID198
Feature activation+2.26e-02

Pos logit contributions
des+0.058
use+0.056
 former+0.055
ited+0.051
powers+0.051
Neg logit contributions
 making-0.023
:-0.020
 poking-0.020
's-0.020
 drawing-0.019
TokenSoon
Token ID28093
Feature activation+9.20e-02

Pos logit contributions
des+0.044
 former+0.043
use+0.043
powers+0.038
ited+0.038
Neg logit contributions
 making-0.035
:-0.033
's-0.033
 poking-0.032
 drawing-0.031
Token,
Token ID11
Feature activation-4.88e-02

Pos logit contributions
des+0.219
use+0.215
 anytime+0.205
 unnecessary+0.201
irst+0.199
Neg logit contributions
 video-0.073
 making-0.069
 poking-0.064
 drawing-0.061
 spraying-0.060
Token a
Token ID257
Feature activation+1.55e-02

Pos logit contributions
 making+0.036
 poking+0.030
:+0.029
's+0.029
 drawing+0.028
Neg logit contributions
des-0.134
use-0.131
 former-0.128
powers-0.120
God-0.119
Token zo
Token ID40565
Feature activation+4.57e-02

Pos logit contributions
des+0.032
use+0.031
 former+0.030
powers+0.028
ited+0.028
Neg logit contributions
 making-0.022
 poking-0.020
's-0.020
:-0.020
 drawing-0.019
Tokenoke
Token ID2088
Feature activation-1.69e-01

Pos logit contributions
des+0.110
use+0.108
 former+0.103
powers+0.097
ited+0.096
Neg logit contributions
 making-0.050
 poking-0.043
's-0.043
 drawing-0.042
:-0.042
Tokeneper
Token ID5723
Feature activation+2.60e-02

Pos logit contributions
 making+0.296
 video+0.293
 drawing+0.279
 poking+0.271
:+0.270
Neg logit contributions
des-0.268
use-0.265
God-0.224
ited-0.222
 reins-0.221
Token came
Token ID1625
Feature activation+1.30e-02

Pos logit contributions
use+0.065
des+0.064
 former+0.057
ited+0.057
God+0.056
Neg logit contributions
 making-0.022
 video-0.021
 drawing-0.020
 poking-0.020
:-0.018
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
des+0.036
use+0.035
 former+0.034
powers+0.032
ited+0.032
Neg logit contributions
 making-0.010
 poking-0.008
:-0.008
's-0.008
 drawing-0.008
Token unlocked
Token ID14838
Feature activation+7.85e-02

Pos logit contributions
 former+0.033
des+0.033
use+0.032
 unnecessary+0.030
powers+0.030
Neg logit contributions
 making-0.015
's-0.013
 poking-0.012
 was-0.012
 drawing-0.012
Token the
Token ID262
Feature activation-9.49e-03

Pos logit contributions
des+0.222
use+0.217
 former+0.213
powers+0.199
minded+0.199
Neg logit contributions
 making-0.046
 poking-0.039
 drawing-0.036
's-0.035
 video-0.034
Token have
Token ID423
Feature activation-3.78e-02

Pos logit contributions
des+0.071
use+0.069
 former+0.067
powers+0.062
ited+0.061
Neg logit contributions
 making-0.036
 poking-0.032
's-0.032
:-0.032
 drawing-0.031
Token met
Token ID1138
Feature activation+3.19e-02

Pos logit contributions
 making+0.040
 poking+0.035
:+0.034
 video+0.034
 drawing+0.034
Neg logit contributions
des-0.091
use-0.089
 former-0.087
powers-0.081
ited-0.080
Token a
Token ID257
Feature activation+1.62e-02

Pos logit contributions
des+0.082
use+0.080
 former+0.078
powers+0.073
ited+0.072
Neg logit contributions
 making-0.029
 poking-0.025
:-0.024
's-0.024
 drawing-0.024
Token famous
Token ID5863
Feature activation-8.27e-03

Pos logit contributions
des+0.036
use+0.035
 former+0.034
ited+0.032
powers+0.031
Neg logit contributions
 making-0.021
 video-0.019
's-0.019
 drawing-0.019
 poking-0.019
Token zo
Token ID40565
Feature activation+4.75e-02

Pos logit contributions
 making+0.011
 poking+0.010
:+0.010
's+0.009
 drawing+0.009
Neg logit contributions
des-0.018
use-0.018
 former-0.017
powers-0.016
ited-0.016
Tokenoke
Token ID2088
Feature activation-1.69e-01

Pos logit contributions
des+0.109
use+0.107
 former+0.101
powers+0.097
minded+0.095
Neg logit contributions
 making-0.056
 poking-0.049
 drawing-0.048
's-0.048
 video-0.048
Tokeneper
Token ID5723
Feature activation+2.77e-02

Pos logit contributions
 making+0.322
 video+0.319
 drawing+0.298
 spraying+0.293
:+0.292
Neg logit contributions
des-0.223
use-0.216
 reins-0.194
God-0.189
minded-0.187
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
des+0.071
use+0.070
 former+0.067
minded+0.063
ited+0.063
Neg logit contributions
 making-0.023
 poking-0.020
 video-0.020
 drawing-0.019
:-0.018
Token She
Token ID1375
Feature activation+4.04e-02

Pos logit contributions
 making+0.023
:+0.022
's+0.021
 drawing+0.020
 Sam+0.019
Neg logit contributions
des-0.053
use-0.052
 former-0.051
 unnecessary-0.048
ited-0.047
Token had
Token ID550
Feature activation-6.46e-02

Pos logit contributions
des+0.109
use+0.106
 former+0.102
 unnecessary+0.099
powers+0.097
Neg logit contributions
 making-0.035
's-0.034
:-0.033
 was-0.032
 had-0.031
Token learned
Token ID4499
Feature activation+7.41e-03

Pos logit contributions
 making+0.066
's+0.059
:+0.056
 drawing+0.054
 poking+0.054
Neg logit contributions
des-0.164
use-0.161
 former-0.155
powers-0.145
ited-0.144
Token
Token ID198
Feature activation+1.72e-02

Pos logit contributions
des+0.047
 former+0.047
use+0.046
ited+0.043
 unnecessary+0.043
Neg logit contributions
 making-0.013
's-0.012
:-0.012
 was-0.011
 poking-0.011
TokenSuddenly
Token ID38582
Feature activation+7.81e-02

Pos logit contributions
des+0.030
use+0.030
 former+0.028
 unnecessary+0.025
minded+0.025
Neg logit contributions
 making-0.031
's-0.029
 poking-0.028
:-0.028
 was-0.027
Token,
Token ID11
Feature activation-4.54e-02

Pos logit contributions
des+0.191
use+0.189
powers+0.175
 unnecessary+0.172
 former+0.172
Neg logit contributions
 video-0.061
 making-0.060
 drawing-0.052
 sharing-0.047
 having-0.047
Token the
Token ID262
Feature activation-8.31e-03

Pos logit contributions
's+0.047
 making+0.047
 video+0.045
 drawing+0.045
:+0.044
Neg logit contributions
des-0.101
use-0.095
 former-0.091
powers-0.088
 unnecessary-0.088
Token zo
Token ID40565
Feature activation+4.72e-02

Pos logit contributions
 making+0.012
:+0.011
on+0.011
's+0.011
 sharing+0.011
Neg logit contributions
use-0.014
des-0.014
 former-0.014
 reins-0.013
God-0.013
Tokenoke
Token ID2088
Feature activation-1.69e-01

Pos logit contributions
des+0.107
use+0.106
 former+0.099
powers+0.097
God+0.094
Neg logit contributions
 making-0.056
 poking-0.049
 drawing-0.048
's-0.048
 video-0.048
Tokeneper
Token ID5723
Feature activation+2.67e-02

Pos logit contributions
 video+0.335
 making+0.326
 drawing+0.313
 Max+0.300
 spraying+0.298
Neg logit contributions
des-0.219
use-0.212
God-0.187
minded-0.178
ited-0.178
Token noticed
Token ID6810
Feature activation+3.08e-02

Pos logit contributions
use+0.056
des+0.052
ited+0.048
powers+0.046
God+0.046
Neg logit contributions
 making-0.033
 video-0.031
 sharing-0.030
 being-0.029
 drawing-0.029
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
des+0.072
use+0.068
 former+0.064
minded+0.062
 greater+0.062
Neg logit contributions
 making-0.027
 drawing-0.027
 video-0.027
's-0.026
 poking-0.024
Token He
Token ID679
Feature activation+3.19e-03

Pos logit contributions
 making+0.029
 drawing+0.027
 poking+0.027
 video+0.027
 spraying+0.026
Neg logit contributions
des-0.038
use-0.038
powers-0.035
 former-0.035
God-0.033
Token shouted
Token ID17293
Feature activation-3.77e-02

Pos logit contributions
use+0.008
des+0.007
 former+0.007
ited+0.007
God+0.007
Neg logit contributions
 making-0.003
 drawing-0.003
 video-0.003
 poking-0.003
 sharing-0.003
Token other
Token ID584
Feature activation+4.91e-02

Pos logit contributions
 making+0.010
 poking+0.009
's+0.009
 drawing+0.009
:+0.009
Neg logit contributions
des-0.019
use-0.018
 former-0.018
powers-0.017
ited-0.016
Token animals
Token ID4695
Feature activation-2.40e-02

Pos logit contributions
des+0.116
use+0.113
 former+0.110
powers+0.102
ited+0.101
Neg logit contributions
 making-0.055
 poking-0.049
:-0.049
 drawing-0.048
's-0.048
Token.
Token ID13
Feature activation-2.30e-02

Pos logit contributions
 making+0.019
 video+0.019
 drawing+0.017
 grid+0.017
 poking+0.017
Neg logit contributions
des-0.059
use-0.057
 former-0.052
 greater-0.051
powers-0.051
Token The
Token ID383
Feature activation-1.92e-02

Pos logit contributions
 making+0.024
's+0.021
:+0.021
 poking+0.021
 drawing+0.020
Neg logit contributions
des-0.057
use-0.056
 former-0.054
ited-0.051
 unnecessary-0.050
Token zo
Token ID40565
Feature activation+4.71e-02

Pos logit contributions
 making+0.025
's+0.022
 poking+0.022
 drawing+0.022
 video+0.021
Neg logit contributions
des-0.044
use-0.043
 former-0.041
powers-0.038
ited-0.038
Tokenoke
Token ID2088
Feature activation-1.69e-01

Pos logit contributions
des+0.112
use+0.110
 former+0.105
powers+0.100
ited+0.098
Neg logit contributions
 making-0.052
 poking-0.045
 drawing-0.045
's-0.045
:-0.044
Tokeneper
Token ID5723
Feature activation+2.65e-02

Pos logit contributions
 making+0.311
 video+0.303
 drawing+0.293
 poking+0.282
 spraying+0.281
Neg logit contributions
des-0.254
use-0.252
ited-0.214
 former-0.214
God-0.212
Token was
Token ID373
Feature activation-4.87e-02

Pos logit contributions
use+0.052
des+0.052
 former+0.049
ited+0.048
God+0.046
Neg logit contributions
 making-0.035
 video-0.033
 poking-0.031
 drawing-0.031
 spraying-0.030
Token worried
Token ID7960
Feature activation-1.66e-02

Pos logit contributions
 video+0.063
's+0.058
 grid+0.058
 tube+0.056
 making+0.054
Neg logit contributions
 former-0.088
des-0.084
 greater-0.081
God-0.079
 wealthier-0.079
Token that
Token ID326
Feature activation-5.89e-02

Pos logit contributions
 video+0.019
 grid+0.019
 drawing+0.018
 making+0.017
 poking+0.017
Neg logit contributions
des-0.032
 someday-0.031
 anytime-0.030
use-0.030
 greater-0.029
Token the
Token ID262
Feature activation-7.26e-03

Pos logit contributions
 making+0.041
 poking+0.034
:+0.034
's+0.033
 drawing+0.033
Neg logit contributions
des-0.161
use-0.158
 former-0.154
powers-0.144
God-0.144
Token
Token ID198
Feature activation+2.00e-02

Pos logit contributions
des+0.059
 former+0.057
use+0.056
ited+0.054
 reins+0.053
Neg logit contributions
 making-0.013
:-0.013
's-0.012
 was-0.012
 Sam-0.011
TokenSoon
Token ID28093
Feature activation+9.26e-02

Pos logit contributions
des+0.045
 former+0.045
use+0.042
 reins+0.039
irst+0.039
Neg logit contributions
's-0.026
 making-0.025
:-0.024
 had-0.024
 was-0.024
Token,
Token ID11
Feature activation-4.91e-02

Pos logit contributions
use+0.240
des+0.240
 anytime+0.218
 former+0.218
minded+0.218
Neg logit contributions
 making-0.062
 video-0.058
 poking-0.055
 drawing-0.052
 spraying-0.049
Token a
Token ID257
Feature activation+1.39e-02

Pos logit contributions
 making+0.029
 poking+0.022
:+0.021
's+0.021
 drawing+0.021
Neg logit contributions
des-0.144
use-0.141
 former-0.138
powers-0.130
ited-0.129
Token zo
Token ID40565
Feature activation+4.59e-02

Pos logit contributions
use+0.027
des+0.027
 former+0.026
God+0.024
powers+0.024
Neg logit contributions
 making-0.020
's-0.018
 poking-0.018
:-0.018
 drawing-0.017
Tokenoke
Token ID2088
Feature activation-1.69e-01

Pos logit contributions
des+0.110
use+0.107
 former+0.102
powers+0.097
ited+0.096
Neg logit contributions
 making-0.050
 poking-0.044
's-0.043
 drawing-0.043
 video-0.043
Tokeneper
Token ID5723
Feature activation+2.59e-02

Pos logit contributions
 video+0.316
 making+0.315
 drawing+0.297
 having+0.285
 spraying+0.285
Neg logit contributions
des-0.240
use-0.237
God-0.200
away-0.198
 reins-0.197
Token came
Token ID1625
Feature activation+1.31e-02

Pos logit contributions
use+0.063
des+0.062
ited+0.055
 former+0.055
God+0.054
Neg logit contributions
 making-0.023
 video-0.022
 drawing-0.021
 poking-0.020
 spraying-0.019
Token with
Token ID351
Feature activation-7.33e-02

Pos logit contributions
des+0.033
use+0.032
 former+0.031
 reins+0.029
powers+0.029
Neg logit contributions
 making-0.012
 video-0.010
 drawing-0.010
 poking-0.010
:-0.010
Token a
Token ID257
Feature activation+1.44e-02

Pos logit contributions
 making+0.055
:+0.047
's+0.046
 poking+0.046
 drawing+0.045
Neg logit contributions
des-0.197
use-0.193
 former-0.187
powers-0.176
 reins-0.175
Token big
Token ID1263
Feature activation-1.48e-02

Pos logit contributions
use+0.031
des+0.031
 former+0.029
powers+0.027
 reins+0.027
Neg logit contributions
 making-0.018
:-0.016
's-0.016
 poking-0.016
 drawing-0.015
Token and
Token ID290
Feature activation+1.50e-02

Pos logit contributions
des+0.135
use+0.134
ited+0.124
minded+0.117
God+0.117
Neg logit contributions
 making-0.051
 drawing-0.048
 video-0.048
 poking-0.046
 shooting-0.044
Token dad
Token ID9955
Feature activation+4.49e-02

Pos logit contributions
des+0.034
use+0.033
 former+0.031
powers+0.029
God+0.029
Neg logit contributions
 making-0.018
 poking-0.017
 video-0.016
 drawing-0.016
:-0.016
Token heard
Token ID2982
Feature activation-6.03e-03

Pos logit contributions
des+0.104
use+0.103
ited+0.094
minded+0.089
 former+0.088
Neg logit contributions
 making-0.046
 video-0.045
 drawing-0.044
 poking-0.043
 shooting-0.041
Token the
Token ID262
Feature activation-9.63e-03

Pos logit contributions
 making+0.005
 poking+0.005
's+0.005
 drawing+0.005
 video+0.004
Neg logit contributions
des-0.015
use-0.014
 former-0.013
minded-0.013
God-0.013
Token zo
Token ID40565
Feature activation+4.56e-02

Pos logit contributions
:+0.012
 making+0.011
's+0.011
 drawing+0.011
on+0.011
Neg logit contributions
des-0.018
use-0.018
 reins-0.017
 former-0.017
powers-0.017
Tokenoke
Token ID2088
Feature activation-1.69e-01

Pos logit contributions
des+0.105
use+0.103
 former+0.094
powers+0.092
ited+0.091
Neg logit contributions
 making-0.054
 poking-0.048
 drawing-0.047
:-0.046
's-0.046
Tokeneper
Token ID5723
Feature activation+2.62e-02

Pos logit contributions
 video+0.303
 making+0.295
 drawing+0.282
:+0.282
 shooting+0.275
Neg logit contributions
des-0.241
use-0.227
 reins-0.202
minded-0.200
away-0.198
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
des+0.058
use+0.052
ited+0.048
God+0.047
minded+0.047
Neg logit contributions
 making-0.026
 drawing-0.026
 poking-0.025
 video-0.025
 triangle-0.025
Token they
Token ID484
Feature activation-2.37e-02

Pos logit contributions
des+0.034
use+0.034
 former+0.031
God+0.031
minded+0.031
Neg logit contributions
 making-0.014
 video-0.014
 drawing-0.014
 poking-0.013
's-0.012
Token were
Token ID547
Feature activation-7.92e-02

Pos logit contributions
 making+0.024
 video+0.023
 drawing+0.023
 poking+0.022
 grid+0.020
Neg logit contributions
use-0.055
des-0.055
 former-0.050
ited-0.049
God-0.048
Token very
Token ID845
Feature activation+7.35e-02

Pos logit contributions
 video+0.111
 grid+0.109
 triangle+0.104
's+0.103
on+0.103
Neg logit contributions
des-0.132
use-0.123
 former-0.123
 unnecessary-0.120
God-0.118
Token the
Token ID262
Feature activation-8.82e-03

Pos logit contributions
des+0.036
use+0.036
 former+0.035
powers+0.033
God+0.033
Neg logit contributions
 making-0.007
:-0.006
 poking-0.006
's-0.006
 drawing-0.005
Token cage
Token ID17186
Feature activation-2.25e-02

Pos logit contributions
 making+0.014
 poking+0.013
:+0.013
 drawing+0.012
's+0.012
Neg logit contributions
des-0.017
use-0.016
 former-0.016
powers-0.014
God-0.014
Token.
Token ID13
Feature activation-2.70e-02

Pos logit contributions
 making+0.016
 poking+0.013
:+0.012
 drawing+0.012
's+0.012
Neg logit contributions
des-0.062
use-0.061
 former-0.060
powers-0.056
ited-0.055
Token The
Token ID383
Feature activation-1.94e-02

Pos logit contributions
 making+0.022
 Sam+0.020
:+0.020
's+0.019
 Max+0.017
Neg logit contributions
des-0.074
use-0.074
 former-0.071
ited-0.070
 unnecessary-0.066
Token zo
Token ID40565
Feature activation+4.67e-02

Pos logit contributions
 making+0.024
 poking+0.022
:+0.022
's+0.021
 drawing+0.021
Neg logit contributions
des-0.042
use-0.042
 former-0.041
God-0.037
powers-0.037
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.105
use+0.104
 former+0.095
powers+0.095
minded+0.092
Neg logit contributions
 making-0.057
 poking-0.050
 drawing-0.050
's-0.049
 video-0.048
Tokeneper
Token ID5723
Feature activation+2.62e-02

Pos logit contributions
 making+0.299
 video+0.296
 drawing+0.281
 spraying+0.274
 poking+0.273
Neg logit contributions
des-0.263
use-0.260
God-0.221
ited-0.221
 reins-0.218
Token closed
Token ID4838
Feature activation+4.01e-02

Pos logit contributions
des+0.058
use+0.058
 former+0.054
ited+0.053
God+0.051
Neg logit contributions
 making-0.029
 video-0.026
 poking-0.026
 drawing-0.025
 doing-0.024
Token the
Token ID262
Feature activation-8.22e-03

Pos logit contributions
des+0.102
use+0.101
 former+0.097
 reins+0.091
minded+0.091
Neg logit contributions
 making-0.033
 video-0.029
 poking-0.029
 drawing-0.028
's-0.028
Token door
Token ID3420
Feature activation+4.64e-02

Pos logit contributions
 making+0.013
 poking+0.012
:+0.012
's+0.012
 drawing+0.011
Neg logit contributions
des-0.015
use-0.015
 former-0.014
 reins-0.013
powers-0.013
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
des+0.136
use+0.135
 former+0.130
 unnecessary+0.125
powers+0.123
Neg logit contributions
 making-0.028
:-0.025
's-0.024
 poking-0.021
 in-0.020
Token.
Token ID13
Feature activation-2.27e-02

Pos logit contributions
des+0.098
use+0.096
 former+0.093
powers+0.088
ited+0.087
Neg logit contributions
 making-0.027
 poking-0.023
:-0.022
 drawing-0.022
's-0.022
Token She
Token ID1375
Feature activation+4.06e-02

Pos logit contributions
:+0.024
's+0.022
 making+0.021
 was+0.020
 Sam+0.018
Neg logit contributions
des-0.059
use-0.058
 former-0.056
 unnecessary-0.055
ited-0.054
Token thanked
Token ID26280
Feature activation+1.21e-02

Pos logit contributions
des+0.096
use+0.094
 former+0.090
powers+0.085
 unnecessary+0.084
Neg logit contributions
 making-0.046
:-0.042
's-0.042
 poking-0.041
 drawing-0.040
Token the
Token ID262
Feature activation-6.12e-03

Pos logit contributions
des+0.031
use+0.030
 former+0.029
God+0.027
minded+0.027
Neg logit contributions
 making-0.011
 poking-0.009
's-0.009
 drawing-0.009
 video-0.009
Token zo
Token ID40565
Feature activation+4.65e-02

Pos logit contributions
 making+0.010
 poking+0.009
:+0.009
's+0.009
 drawing+0.009
Neg logit contributions
des-0.012
use-0.011
 former-0.011
powers-0.010
ited-0.010
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.118
use+0.115
 former+0.113
powers+0.104
ited+0.103
Neg logit contributions
 making-0.044
:-0.039
 poking-0.039
's-0.038
 drawing-0.037
Tokeneper
Token ID5723
Feature activation+2.68e-02

Pos logit contributions
 making+0.338
 video+0.330
 drawing+0.318
 spraying+0.310
 poking+0.305
Neg logit contributions
use-0.222
des-0.217
minded-0.194
 reins-0.192
God-0.190
Token for
Token ID329
Feature activation-1.57e-02

Pos logit contributions
des+0.068
use+0.066
 former+0.064
 reins+0.060
minded+0.060
Neg logit contributions
 making-0.024
 poking-0.021
 video-0.020
 drawing-0.020
 fighting-0.019
Token his
Token ID465
Feature activation+1.93e-02

Pos logit contributions
's+0.017
 video+0.016
:+0.014
 grid+0.014
 drawing+0.014
Neg logit contributions
des-0.035
use-0.033
 former-0.033
 greater-0.032
God-0.031
Token advice
Token ID5608
Feature activation+2.96e-02

Pos logit contributions
des+0.038
use+0.037
 former+0.035
powers+0.032
ited+0.032
Neg logit contributions
 making-0.030
 poking-0.027
:-0.027
's-0.027
 drawing-0.027
Token and
Token ID290
Feature activation+1.74e-02

Pos logit contributions
des+0.078
use+0.077
 former+0.074
powers+0.070
ited+0.069
Neg logit contributions
 making-0.023
 poking-0.020
 video-0.020
 drawing-0.019
's-0.018
Token.
Token ID13
Feature activation-2.56e-02

Pos logit contributions
 making+0.066
 video+0.059
 poking+0.057
 drawing+0.056
's+0.053
Neg logit contributions
des-0.193
use-0.184
 former-0.181
powers-0.177
minded-0.171
Token He
Token ID679
Feature activation+3.42e-03

Pos logit contributions
:+0.023
 Sam+0.023
 making+0.022
's+0.021
 Max+0.020
Neg logit contributions
des-0.069
use-0.068
ited-0.064
 former-0.063
 reins-0.060
Token thanked
Token ID26280
Feature activation+1.02e-02

Pos logit contributions
des+0.009
use+0.008
 former+0.008
 unnecessary+0.008
 reins+0.008
Neg logit contributions
 making-0.003
's-0.003
 was-0.003
:-0.003
 had-0.003
Token the
Token ID262
Feature activation-8.50e-03

Pos logit contributions
des+0.026
use+0.026
 former+0.025
God+0.023
powers+0.023
Neg logit contributions
 making-0.009
 poking-0.008
's-0.008
:-0.007
 drawing-0.007
Token zo
Token ID40565
Feature activation+4.77e-02

Pos logit contributions
 making+0.009
's+0.009
:+0.008
 poking+0.008
 drawing+0.008
Neg logit contributions
use-0.019
des-0.018
 former-0.018
powers-0.017
minded-0.017
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.113
use+0.111
 former+0.106
powers+0.101
ited+0.099
Neg logit contributions
 making-0.053
 poking-0.046
 drawing-0.045
's-0.045
:-0.045
Tokeneper
Token ID5723
Feature activation+2.77e-02

Pos logit contributions
 making+0.340
 video+0.339
 drawing+0.321
 spraying+0.316
 poking+0.306
Neg logit contributions
des-0.210
use-0.210
minded-0.187
away-0.177
 reins-0.176
Token for
Token ID329
Feature activation-1.76e-02

Pos logit contributions
des+0.068
use+0.067
 former+0.064
minded+0.062
 reins+0.061
Neg logit contributions
 making-0.026
 poking-0.023
 video-0.022
 drawing-0.021
 spraying-0.020
Token taking
Token ID2263
Feature activation+9.54e-03

Pos logit contributions
's+0.018
 video+0.015
:+0.015
 was+0.013
 grid+0.013
Neg logit contributions
des-0.040
 former-0.039
use-0.038
 greater-0.038
God-0.037
Token care
Token ID1337
Feature activation+6.23e-05

Pos logit contributions
des+0.024
use+0.023
 former+0.023
powers+0.021
ited+0.021
Neg logit contributions
 making-0.010
 poking-0.008
's-0.008
:-0.008
 drawing-0.008
Token of
Token ID286
Feature activation-4.17e-02

Pos logit contributions
des+0.000
use+0.000
 former+0.000
powers+0.000
ited+0.000
Neg logit contributions
 making-0.000
:-0.000
 poking-0.000
's-0.000
 drawing-0.000
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
 video+0.060
 grid+0.055
 making+0.054
 drawing+0.050
 doing+0.049
Neg logit contributions
des-0.096
use-0.091
powers-0.089
minded-0.086
 anytime-0.085
Token
Token ID198
Feature activation+2.20e-02

Pos logit contributions
 making+0.033
 video+0.031
's+0.030
 drawing+0.030
 poking+0.030
Neg logit contributions
use-0.053
des-0.051
 former-0.050
powers-0.048
God-0.047
Token
Token ID198
Feature activation+2.17e-02

Pos logit contributions
des+0.061
use+0.059
 former+0.058
ited+0.056
 reins+0.054
Neg logit contributions
 making-0.017
:-0.017
's-0.015
 poking-0.014
 was-0.014
TokenThe
Token ID464
Feature activation-1.00e-02

Pos logit contributions
des+0.042
use+0.040
 former+0.040
ited+0.037
 reins+0.035
Neg logit contributions
 making-0.035
's-0.033
:-0.033
 drawing-0.032
 poking-0.032
Token zo
Token ID40565
Feature activation+4.46e-02

Pos logit contributions
 making+0.016
 poking+0.015
:+0.014
's+0.014
 drawing+0.014
Neg logit contributions
des-0.019
use-0.018
 former-0.018
powers-0.016
God-0.016
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
use+0.094
des+0.092
powers+0.086
minded+0.083
God+0.081
Neg logit contributions
 making-0.061
 drawing-0.053
 video-0.052
 poking-0.052
's-0.051
Tokeneper
Token ID5723
Feature activation+2.61e-02

Pos logit contributions
 video+0.365
 making+0.348
 drawing+0.342
 spraying+0.329
 sharing+0.320
Neg logit contributions
use-0.193
des-0.182
God-0.162
minded-0.161
 reins-0.159
Token picked
Token ID6497
Feature activation+7.91e-04

Pos logit contributions
use+0.043
des+0.041
ited+0.041
Exc+0.040
minded+0.036
Neg logit contributions
 doing-0.038
 video-0.038
 sharing-0.037
 making-0.037
 having-0.037
Token up
Token ID510
Feature activation+2.69e-02

Pos logit contributions
des+0.002
use+0.002
minded+0.002
 former+0.001
 greater+0.001
Neg logit contributions
 making-0.001
 drawing-0.001
 poking-0.001
:-0.001
 doing-0.001
Token Lily
Token ID20037
Feature activation-1.01e-02

Pos logit contributions
des+0.065
use+0.063
 former+0.061
minded+0.058
God+0.057
Neg logit contributions
 making-0.026
 poking-0.024
 drawing-0.023
 doing-0.022
 video-0.022
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
 making+0.013
 video+0.012
 printing+0.012
 grid+0.012
 poking+0.012
Neg logit contributions
des-0.019
 anytime-0.018
use-0.018
 reins-0.017
away-0.016
Token he
Token ID339
Feature activation+3.40e-02

Pos logit contributions
 making+0.018
 video+0.017
 poking+0.017
 drawing+0.016
 grid+0.016
Neg logit contributions
des-0.026
use-0.023
 former-0.023
God-0.022
ited-0.022
Token knew
Token ID2993
Feature activation+4.32e-02

Pos logit contributions
des+0.077
use+0.075
 former+0.073
powers+0.068
 unnecessary+0.068
Neg logit contributions
 making-0.042
's-0.040
:-0.039
 was-0.037
 poking-0.037
Token that
Token ID326
Feature activation-5.80e-02

Pos logit contributions
des+0.102
use+0.097
 former+0.097
God+0.089
 greater+0.089
Neg logit contributions
 making-0.046
 video-0.042
 poking-0.041
 drawing-0.040
's-0.040
Token the
Token ID262
Feature activation-7.46e-03

Pos logit contributions
 making+0.060
:+0.059
 video+0.057
 poking+0.056
 drawing+0.055
Neg logit contributions
des-0.132
 former-0.125
use-0.125
God-0.117
 greater-0.115
Token zo
Token ID40565
Feature activation+4.77e-02

Pos logit contributions
 making+0.011
:+0.010
 poking+0.010
's+0.010
 drawing+0.010
Neg logit contributions
des-0.015
use-0.015
 former-0.014
powers-0.013
God-0.013
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.114
use+0.112
 former+0.107
powers+0.102
ited+0.100
Neg logit contributions
 making-0.052
 poking-0.045
's-0.044
:-0.044
 drawing-0.044
Tokenep
Token ID538
Feature activation-1.01e-02

Pos logit contributions
 making+0.316
 video+0.311
 drawing+0.300
 spraying+0.298
 shooting+0.290
Neg logit contributions
use-0.230
des-0.225
minded-0.192
God-0.184
irst-0.181
Tokeners
Token ID364
Feature activation-4.29e-02

Pos logit contributions
 making+0.006
 poking+0.005
 drawing+0.005
 video+0.004
:+0.004
Neg logit contributions
des-0.029
use-0.028
 former-0.028
powers-0.026
ited-0.026
Token were
Token ID547
Feature activation-7.78e-02

Pos logit contributions
 making+0.033
 poking+0.029
 drawing+0.027
 video+0.026
:+0.026
Neg logit contributions
des-0.114
use-0.113
 former-0.109
powers-0.102
ited-0.102
Token just
Token ID655
Feature activation+1.98e-02

Pos logit contributions
 making+0.094
 video+0.088
 poking+0.087
:+0.086
 drawing+0.085
Neg logit contributions
des-0.170
use-0.164
 former-0.161
 greater-0.148
God-0.147
Token trying
Token ID2111
Feature activation-2.41e-02

Pos logit contributions
des+0.045
 former+0.045
use+0.044
 greater+0.041
 unnecessary+0.041
Neg logit contributions
 making-0.019
 video-0.019
's-0.019
:-0.019
 poking-0.018
Token.
Token ID13
Feature activation-2.61e-02

Pos logit contributions
des+0.174
use+0.170
 former+0.162
ited+0.161
 unnecessary+0.159
Neg logit contributions
 making-0.024
 in-0.020
:-0.020
,-0.020
's-0.018
Token
Token ID198
Feature activation+2.12e-02

Pos logit contributions
 Sam+0.014
:+0.013
's+0.012
 making+0.011
 Tom+0.011
Neg logit contributions
 former-0.079
des-0.079
use-0.079
ited-0.078
sie-0.074
Token
Token ID198
Feature activation+2.07e-02

Pos logit contributions
des+0.054
use+0.053
 former+0.052
ited+0.048
powers+0.048
Neg logit contributions
 making-0.020
:-0.018
's-0.018
 poking-0.017
 drawing-0.017
TokenThe
Token ID464
Feature activation-9.60e-03

Pos logit contributions
 former+0.046
des+0.044
use+0.043
 unnecessary+0.039
powers+0.039
Neg logit contributions
:-0.028
 making-0.028
's-0.027
 was-0.025
 had-0.025
Token zo
Token ID40565
Feature activation+4.60e-02

Pos logit contributions
 making+0.012
 poking+0.012
's+0.011
:+0.011
 drawing+0.011
Neg logit contributions
des-0.020
use-0.020
 former-0.019
God-0.018
 reins-0.018
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.111
use+0.109
 former+0.105
powers+0.099
ited+0.097
Neg logit contributions
 making-0.049
 poking-0.043
's-0.042
 drawing-0.042
:-0.042
Tokenep
Token ID538
Feature activation-1.03e-02

Pos logit contributions
 making+0.318
 video+0.312
 drawing+0.301
 spraying+0.294
 poking+0.293
Neg logit contributions
use-0.240
des-0.238
God-0.205
minded-0.202
ited-0.201
Tokeners
Token ID364
Feature activation-4.18e-02

Pos logit contributions
 making+0.010
 poking+0.009
 drawing+0.009
 video+0.009
 spraying+0.008
Neg logit contributions
des-0.025
use-0.024
 former-0.023
ited-0.022
minded-0.022
Token arrived
Token ID5284
Feature activation+4.09e-02

Pos logit contributions
 making+0.043
 poking+0.037
:+0.037
 drawing+0.037
 video+0.036
Neg logit contributions
des-0.102
use-0.100
 former-0.097
powers-0.090
ited-0.090
Token and
Token ID290
Feature activation+1.43e-02

Pos logit contributions
des+0.113
use+0.111
 former+0.108
powers+0.102
ited+0.101
Neg logit contributions
 making-0.031
:-0.025
 poking-0.025
's-0.025
 drawing-0.024
Token threw
Token ID9617
Feature activation+9.75e-04

Pos logit contributions
des+0.035
use+0.033
 former+0.033
powers+0.031
 unnecessary+0.030
Neg logit contributions
 making-0.016
 poking-0.014
's-0.014
:-0.014
 drawing-0.014
Token said
Token ID531
Feature activation-4.51e-02

Pos logit contributions
 making+0.029
 video+0.028
 grid+0.028
 drawing+0.027
 poking+0.027
Neg logit contributions
use-0.045
des-0.042
ited-0.040
 former-0.039
God-0.039
Token that
Token ID326
Feature activation-5.80e-02

Pos logit contributions
 video+0.058
 grid+0.057
 making+0.050
 shape+0.050
 drawing+0.050
Neg logit contributions
des-0.087
use-0.085
God-0.077
 greater-0.075
 someday-0.074
Token sometimes
Token ID3360
Feature activation+5.66e-02

Pos logit contributions
 making+0.064
:+0.063
 video+0.060
 drawing+0.058
 poking+0.058
Neg logit contributions
des-0.123
use-0.118
 former-0.116
God-0.111
minded-0.106
Token the
Token ID262
Feature activation-7.02e-03

Pos logit contributions
des+0.139
use+0.135
 former+0.132
powers+0.123
ited+0.122
Neg logit contributions
 making-0.060
 poking-0.052
's-0.051
:-0.051
 drawing-0.050
Token zo
Token ID40565
Feature activation+4.80e-02

Pos logit contributions
 making+0.009
 poking+0.008
's+0.008
:+0.008
 drawing+0.008
Neg logit contributions
des-0.016
use-0.015
 former-0.015
powers-0.014
ited-0.013
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.115
use+0.113
 former+0.109
powers+0.102
ited+0.101
Neg logit contributions
 making-0.052
 poking-0.045
:-0.045
's-0.045
 drawing-0.044
Tokenep
Token ID538
Feature activation-9.47e-03

Pos logit contributions
 making+0.280
 video+0.265
 drawing+0.258
:+0.254
 poking+0.253
Neg logit contributions
des-0.285
use-0.283
 former-0.248
minded-0.241
ited-0.237
Tokeners
Token ID364
Feature activation-4.19e-02

Pos logit contributions
 making+0.008
's+0.007
:+0.006
 poking+0.006
 drawing+0.006
Neg logit contributions
des-0.026
use-0.025
 former-0.024
powers-0.023
ited-0.023
Token have
Token ID423
Feature activation-3.79e-02

Pos logit contributions
 making+0.040
's+0.037
:+0.035
 poking+0.034
 drawing+0.033
Neg logit contributions
des-0.106
use-0.102
 former-0.102
powers-0.094
God-0.093
Token to
Token ID284
Feature activation+3.26e-02

Pos logit contributions
 making+0.041
's+0.036
 poking+0.036
:+0.035
 drawing+0.035
Neg logit contributions
des-0.093
use-0.090
 former-0.087
ited-0.082
minded-0.081
Token order
Token ID1502
Feature activation+5.75e-02

Pos logit contributions
des+0.080
use+0.077
 former+0.075
powers+0.074
minded+0.073
Neg logit contributions
 making-0.036
's-0.030
 poking-0.029
 drawing-0.029
:-0.028
Token.
Token ID13
Feature activation-2.18e-02

Pos logit contributions
 video+0.005
 making+0.004
 poking+0.004
 grid+0.004
's+0.004
Neg logit contributions
des-0.008
use-0.008
 anytime-0.007
minded-0.007
 someday-0.007
Token He
Token ID679
Feature activation+3.65e-03

Pos logit contributions
 making+0.030
 poking+0.028
 drawing+0.026
 video+0.026
 spraying+0.026
Neg logit contributions
des-0.046
use-0.044
 former-0.042
powers-0.041
minded-0.039
Token hoped
Token ID10719
Feature activation+1.03e-01

Pos logit contributions
des+0.009
use+0.009
 former+0.009
 unnecessary+0.009
 reins+0.008
Neg logit contributions
 making-0.004
's-0.003
 was-0.003
:-0.003
 had-0.003
Token the
Token ID262
Feature activation-8.72e-03

Pos logit contributions
des+0.284
use+0.280
 former+0.272
powers+0.256
ited+0.255
Neg logit contributions
 making-0.079
 poking-0.064
:-0.063
's-0.062
 drawing-0.061
Token zo
Token ID40565
Feature activation+4.80e-02

Pos logit contributions
 making+0.014
 poking+0.013
:+0.013
's+0.013
 drawing+0.012
Neg logit contributions
des-0.017
use-0.016
 former-0.016
powers-0.014
ited-0.014
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.114
use+0.111
 former+0.106
powers+0.101
ited+0.099
Neg logit contributions
 making-0.054
 poking-0.047
 drawing-0.046
's-0.046
:-0.046
Tokeneper
Token ID5723
Feature activation+2.74e-02

Pos logit contributions
 video+0.332
 making+0.330
 drawing+0.322
 spraying+0.303
 Max+0.300
Neg logit contributions
des-0.217
use-0.215
God-0.185
 reins-0.182
away-0.179
Token would
Token ID561
Feature activation+5.93e-02

Pos logit contributions
des+0.066
use+0.064
 former+0.064
 unnecessary+0.059
powers+0.059
Neg logit contributions
 making-0.030
's-0.028
:-0.027
 was-0.026
 poking-0.025
Token say
Token ID910
Feature activation-6.37e-02

Pos logit contributions
des+0.165
use+0.158
 former+0.153
powers+0.151
ited+0.150
Neg logit contributions
 making-0.050
:-0.040
 poking-0.038
 drawing-0.037
's-0.036
Token yes
Token ID3763
Feature activation+7.12e-02

Pos logit contributions
 making+0.086
 poking+0.078
 drawing+0.077
's+0.077
 video+0.077
Neg logit contributions
des-0.134
use-0.129
 former-0.125
powers-0.116
God-0.115
Token.
Token ID13
Feature activation-2.15e-02

Pos logit contributions
des+0.198
use+0.194
 former+0.189
powers+0.178
ited+0.176
Neg logit contributions
 making-0.050
 poking-0.041
 drawing-0.039
:-0.039
's-0.039
Token of
Token ID286
Feature activation-4.02e-02

Pos logit contributions
des+0.024
use+0.024
 former+0.023
powers+0.022
ited+0.022
Neg logit contributions
 making-0.007
 poking-0.006
 drawing-0.006
 video-0.006
's-0.006
Token him
Token ID683
Feature activation-2.52e-02

Pos logit contributions
 making+0.033
's+0.030
:+0.029
 poking+0.028
 video+0.028
Neg logit contributions
des-0.103
use-0.101
 former-0.099
 unnecessary-0.092
 greater-0.091
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
 making+0.018
 poking+0.014
:+0.014
's+0.014
 drawing+0.014
Neg logit contributions
des-0.070
use-0.068
 former-0.067
powers-0.063
ited-0.062
Token The
Token ID383
Feature activation-1.37e-02

Pos logit contributions
 making+0.019
:+0.019
 Sam+0.018
's+0.018
 was+0.016
Neg logit contributions
des-0.054
use-0.053
 former-0.050
ited-0.049
 unnecessary-0.049
Token zo
Token ID40565
Feature activation+4.80e-02

Pos logit contributions
 making+0.014
 poking+0.012
:+0.012
's+0.012
 drawing+0.012
Neg logit contributions
des-0.033
use-0.032
 former-0.032
ited-0.029
powers-0.029
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.113
use+0.111
 former+0.106
powers+0.101
ited+0.100
Neg logit contributions
 making-0.054
 poking-0.047
 drawing-0.046
's-0.046
 video-0.045
Tokeneper
Token ID5723
Feature activation+2.80e-02

Pos logit contributions
 making+0.309
 video+0.298
 drawing+0.287
 poking+0.276
 spraying+0.274
Neg logit contributions
use-0.260
des-0.257
ited-0.226
 former-0.218
God-0.214
Token rushed
Token ID13999
Feature activation+8.93e-02

Pos logit contributions
des+0.066
use+0.066
 former+0.063
ited+0.060
God+0.058
Neg logit contributions
 making-0.029
 poking-0.025
 drawing-0.025
 video-0.025
:-0.023
Token in
Token ID287
Feature activation-3.40e-02

Pos logit contributions
des+0.208
use+0.202
 former+0.194
powers+0.181
 reins+0.180
Neg logit contributions
 making-0.101
 poking-0.089
 drawing-0.088
 video-0.087
:-0.087
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
 making+0.032
 drawing+0.028
 video+0.028
's+0.028
 poking+0.028
Neg logit contributions
des-0.082
use-0.080
 former-0.077
 reins-0.073
ited-0.072
Token warned
Token ID7728
Feature activation-2.91e-03

Pos logit contributions
des+0.041
use+0.041
 former+0.039
ited+0.036
powers+0.036
Neg logit contributions
 making-0.021
:-0.019
 poking-0.019
 video-0.019
 drawing-0.019
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
 making+0.023
 poking+0.019
:+0.018
 drawing+0.018
 video+0.017
Neg logit contributions
des-0.086
use-0.084
 former-0.082
powers-0.077
ited-0.076
Token Ben
Token ID3932
Feature activation-5.67e-02

Pos logit contributions
 making+0.020
's+0.018
:+0.018
 drawing+0.017
 poking+0.017
Neg logit contributions
des-0.057
use-0.055
 former-0.053
ited-0.050
powers-0.050
Token asked
Token ID1965
Feature activation+1.11e-01

Pos logit contributions
 making+0.055
's+0.047
:+0.047
 poking+0.047
 drawing+0.045
Neg logit contributions
des-0.144
use-0.140
 former-0.137
powers-0.128
 unnecessary-0.126
Token the
Token ID262
Feature activation-8.30e-03

Pos logit contributions
des+0.319
use+0.311
 former+0.307
powers+0.290
ited+0.287
Neg logit contributions
 making-0.074
:-0.061
 poking-0.058
's-0.058
 drawing-0.055
Token zo
Token ID40565
Feature activation+4.82e-02

Pos logit contributions
 making+0.011
:+0.010
's+0.010
 poking+0.010
 drawing+0.010
Neg logit contributions
des-0.019
use-0.018
 former-0.017
ited-0.016
powers-0.016
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.118
use+0.115
 former+0.113
powers+0.105
ited+0.104
Neg logit contributions
 making-0.050
 poking-0.043
:-0.043
's-0.043
 drawing-0.042
Tokeneper
Token ID5723
Feature activation+2.73e-02

Pos logit contributions
 making+0.334
 video+0.328
 drawing+0.312
 spraying+0.303
 poking+0.300
Neg logit contributions
des-0.227
use-0.224
minded-0.199
God-0.190
away-0.183
Token if
Token ID611
Feature activation+7.87e-02

Pos logit contributions
use+0.058
des+0.058
 former+0.055
minded+0.052
God+0.051
Neg logit contributions
 making-0.034
 video-0.031
 poking-0.031
 drawing-0.030
's-0.029
Token he
Token ID339
Feature activation+3.28e-02

Pos logit contributions
des+0.195
use+0.191
 former+0.186
powers+0.174
ited+0.172
Neg logit contributions
 making-0.081
 poking-0.070
:-0.069
's-0.069
 drawing-0.068
Token could
Token ID714
Feature activation+2.13e-02

Pos logit contributions
des+0.085
 former+0.084
use+0.082
 unnecessary+0.077
 reins+0.076
Neg logit contributions
's-0.030
 making-0.029
:-0.029
 was-0.028
 had-0.027
Token touch
Token ID3638
Feature activation-7.16e-03

Pos logit contributions
des+0.048
use+0.046
 former+0.045
powers+0.042
ited+0.041
Neg logit contributions
 making-0.027
:-0.025
 poking-0.024
's-0.024
 drawing-0.024
Token but
Token ID475
Feature activation-1.71e-02

Pos logit contributions
 making+0.055
 poking+0.050
:+0.050
 video+0.049
 drawing+0.049
Neg logit contributions
des-0.104
use-0.101
 former-0.097
powers-0.090
God-0.090
Token only
Token ID691
Feature activation+1.74e-02

Pos logit contributions
 making+0.019
 poking+0.018
 video+0.018
:+0.018
 drawing+0.017
Neg logit contributions
des-0.038
use-0.037
 former-0.036
God-0.033
 greater-0.033
Token if
Token ID611
Feature activation+7.78e-02

Pos logit contributions
des+0.025
use+0.024
 former+0.022
 greater+0.020
 unnecessary+0.020
Neg logit contributions
 making-0.033
 video-0.032
:-0.032
 poking-0.031
 drawing-0.031
Token the
Token ID262
Feature activation-8.26e-03

Pos logit contributions
des+0.202
use+0.198
 former+0.197
ited+0.188
 reins+0.182
Neg logit contributions
 making-0.077
's-0.066
:-0.063
 poking-0.062
 was-0.062
Token zo
Token ID40565
Feature activation+4.86e-02

Pos logit contributions
 making+0.013
 poking+0.012
's+0.012
:+0.012
 drawing+0.011
Neg logit contributions
des-0.016
use-0.016
 former-0.015
powers-0.014
ited-0.014
Tokenoke
Token ID2088
Feature activation-1.68e-01

Pos logit contributions
des+0.122
use+0.118
 former+0.118
powers+0.108
ited+0.107
Neg logit contributions
 making-0.047
:-0.042
 poking-0.042
's-0.041
 drawing-0.040
Tokeneper
Token ID5723
Feature activation+2.85e-02

Pos logit contributions
 making+0.292
 video+0.286
 drawing+0.275
 spraying+0.266
 poking+0.258
Neg logit contributions
use-0.278
des-0.269
ited-0.229
minded-0.228
God-0.224
Token said
Token ID531
Feature activation-4.53e-02

Pos logit contributions
des+0.077
use+0.076
 former+0.074
powers+0.069
 unnecessary+0.069
Neg logit contributions
 making-0.022
's-0.018
:-0.018
 poking-0.018
 drawing-0.017
Token it
Token ID340
Feature activation-1.08e-02

Pos logit contributions
 making+0.049
 video+0.046
 drawing+0.044
 poking+0.042
's+0.042
Neg logit contributions
des-0.102
use-0.099
 former-0.095
 unnecessary-0.090
powers-0.090
Token was
Token ID373
Feature activation-4.93e-02

Pos logit contributions
 making+0.008
 poking+0.007
 drawing+0.007
 video+0.007
:+0.006
Neg logit contributions
des-0.029
use-0.028
 former-0.027
powers-0.026
God-0.025
Token okay
Token ID8788
Feature activation-1.22e-02

Pos logit contributions
 making+0.058
 poking+0.051
:+0.051
's+0.051
 drawing+0.050
Neg logit contributions
des-0.115
use-0.112
 former-0.109
powers-0.101
ited-0.100
Token the
Token ID262
Feature activation-6.91e-03

Pos logit contributions
 grid+0.043
 making+0.042
 video+0.040
 spraying+0.039
 poking+0.039
Neg logit contributions
des-0.052
use-0.049
God-0.047
 former-0.045
powers-0.045
Token man
Token ID582
Feature activation-5.02e-03

Pos logit contributions
 making+0.009
:+0.008
 poking+0.008
's+0.008
 drawing+0.008
Neg logit contributions
des-0.015
use-0.015
 former-0.014
powers-0.013
God-0.013
Token had
Token ID550
Feature activation-6.07e-02

Pos logit contributions
 grid+0.004
 video+0.004
 making+0.004
 drawing+0.004
 spraying+0.003
Neg logit contributions
use-0.012
des-0.012
ited-0.011
God-0.011
 greater-0.010
Token said
Token ID531
Feature activation-4.44e-02

Pos logit contributions
 video+0.078
 making+0.074
 grid+0.072
 drawing+0.071
 poking+0.070
Neg logit contributions
des-0.128
use-0.123
 former-0.111
ited-0.106
powers-0.105
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
 video+0.046
 grid+0.043
 making+0.039
 drawing+0.038
 er+0.036
Neg logit contributions
des-0.094
use-0.088
 anytime-0.083
 someday-0.082
 former-0.082
Token quickly
Token ID2952
Feature activation+1.35e-01

Pos logit contributions
des+0.039
use+0.038
 former+0.037
powers+0.034
ited+0.034
Neg logit contributions
 making-0.020
 poking-0.017
:-0.017
's-0.017
 drawing-0.017
Token ran
Token ID4966
Feature activation+6.59e-02

Pos logit contributions
des+0.288
use+0.281
 former+0.270
powers+0.253
ited+0.250
Neg logit contributions
 making-0.173
 video-0.161
 poking-0.155
 drawing-0.154
's-0.150
Token away
Token ID1497
Feature activation-1.52e-02

Pos logit contributions
des+0.123
use+0.119
 former+0.114
powers+0.105
minded+0.103
Neg logit contributions
 making-0.106
 poking-0.098
's-0.097
 drawing-0.096
:-0.096
Token.
Token ID13
Feature activation-2.18e-02

Pos logit contributions
 making+0.010
:+0.008
 poking+0.008
's+0.008
 drawing+0.008
Neg logit contributions
des-0.043
use-0.043
 former-0.041
powers-0.039
ited-0.039
Token She
Token ID1375
Feature activation+3.98e-02

Pos logit contributions
 making+0.016
's+0.016
:+0.015
 was+0.013
 Sam+0.013
Neg logit contributions
des-0.061
use-0.060
 former-0.059
ited-0.056
 unnecessary-0.056
Token was
Token ID373
Feature activation-4.56e-02

Pos logit contributions
des+0.099
use+0.096
 former+0.094
powers+0.087
 unnecessary+0.087
Neg logit contributions
 making-0.041
's-0.038
:-0.037
 poking-0.036
 was-0.035
Token are
Token ID389
Feature activation-6.75e-02

Pos logit contributions
 making+0.062
 poking+0.058
:+0.055
 video+0.054
 drawing+0.053
Neg logit contributions
use-0.153
des-0.152
 former-0.142
ited-0.133
God-0.133
Token lively
Token ID29696
Feature activation-3.83e-02

Pos logit contributions
 video+0.083
on+0.083
:+0.081
's+0.078
 poking+0.077
Neg logit contributions
 former-0.122
des-0.120
 greater-0.116
God-0.113
 reins-0.110
Token and
Token ID290
Feature activation+1.20e-02

Pos logit contributions
 making+0.027
 poking+0.022
:+0.022
's+0.022
 drawing+0.021
Neg logit contributions
des-0.107
use-0.105
 former-0.103
powers-0.096
ited-0.096
Token happy
Token ID3772
Feature activation-4.52e-03

Pos logit contributions
des+0.022
use+0.021
 former+0.020
powers+0.019
God+0.018
Neg logit contributions
 poking-0.018
:-0.018
 making-0.018
 video-0.017
's-0.017
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
 making+0.003
:+0.002
 poking+0.002
 drawing+0.002
's+0.002
Neg logit contributions
des-0.013
use-0.013
 former-0.013
ited-0.012
powers-0.012
Token One
Token ID1881
Feature activation+1.63e-01

Pos logit contributions
 making+0.023
:+0.020
's+0.020
 poking+0.020
 drawing+0.019
Neg logit contributions
des-0.054
use-0.053
 former-0.052
ited-0.048
powers-0.048
Token day
Token ID1110
Feature activation-4.39e-03

Pos logit contributions
des+0.357
use+0.332
 former+0.319
God+0.307
powers+0.305
Neg logit contributions
 making-0.201
 poking-0.194
 fighting-0.181
 drawing-0.174
 shooting-0.174
Token,
Token ID11
Feature activation-4.73e-02

Pos logit contributions
 making+0.002
 video+0.002
 drawing+0.002
 poking+0.002
's+0.001
Neg logit contributions
des-0.013
use-0.013
 former-0.012
powers-0.011
God-0.011
Token they
Token ID484
Feature activation-2.38e-02

Pos logit contributions
 making+0.043
 poking+0.038
:+0.037
's+0.037
 drawing+0.037
Neg logit contributions
des-0.121
use-0.118
 former-0.114
powers-0.107
ited-0.106
Token go
Token ID467
Feature activation+8.14e-02

Pos logit contributions
 making+0.034
's+0.032
:+0.030
 drawing+0.030
 had+0.030
Neg logit contributions
des-0.051
 former-0.049
use-0.047
powers-0.045
 unnecessary-0.044
Token to
Token ID284
Feature activation+3.07e-02

Pos logit contributions
des+0.213
 former+0.212
use+0.206
ited+0.195
God+0.192
Neg logit contributions
 making-0.079
's-0.067
:-0.065
 drawing-0.065
 on-0.063
Token her
Token ID607
Feature activation+4.91e-02

Pos logit contributions
des+0.061
use+0.060
 former+0.059
ited+0.055
powers+0.055
Neg logit contributions
 making-0.018
's-0.015
 poking-0.015
:-0.014
 drawing-0.014
Token flock
Token ID29556
Feature activation+2.28e-03

Pos logit contributions
des+0.120
use+0.117
 former+0.111
ited+0.105
powers+0.105
Neg logit contributions
 making-0.054
's-0.047
 poking-0.047
:-0.047
 drawing-0.046
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
des+0.006
use+0.006
 former+0.005
powers+0.005
ited+0.005
Neg logit contributions
 making-0.002
 poking-0.002
 drawing-0.002
 video-0.002
:-0.002
Token\n
Token ID198
Feature activation+1.90e-02

Pos logit contributions
 making+0.016
's+0.015
:+0.014
 was+0.013
 drawing+0.013
Neg logit contributions
des-0.059
use-0.058
 former-0.056
 unnecessary-0.054
ited-0.054
Token\n
Token ID198
Feature activation+1.86e-02

Pos logit contributions
des+0.051
use+0.050
 former+0.050
 unnecessary+0.046
ited+0.046
Neg logit contributions
 making-0.016
:-0.014
's-0.014
 was-0.013
 video-0.013
TokenAs
Token ID1722
Feature activation+1.23e-01

Pos logit contributions
use+0.039
des+0.038
 former+0.036
 unnecessary+0.035
 reins+0.034
Neg logit contributions
's-0.027
 making-0.026
 was-0.025
:-0.024
 video-0.024
Token they
Token ID484
Feature activation-2.64e-02

Pos logit contributions
des+0.316
use+0.308
 former+0.302
powers+0.281
ited+0.278
Neg logit contributions
 making-0.110
 poking-0.096
:-0.094
's-0.094
 drawing-0.091
Token were
Token ID547
Feature activation-8.03e-02

Pos logit contributions
 making+0.034
 drawing+0.032
 poking+0.032
 video+0.031
:+0.029
Neg logit contributions
des-0.057
use-0.056
 former-0.053
ited-0.052
irst-0.049
Token walking
Token ID6155
Feature activation+3.83e-02

Pos logit contributions
 video+0.091
 making+0.089
's+0.084
 grid+0.082
 was+0.081
Neg logit contributions
des-0.174
use-0.171
 former-0.164
 unnecessary-0.155
 reins-0.151
Token,
Token ID11
Feature activation-4.52e-02

Pos logit contributions
des+0.102
use+0.099
 former+0.097
God+0.091
ited+0.091
Neg logit contributions
 making-0.033
:-0.030
's-0.028
 poking-0.028
 in-0.028
Token they
Token ID484
Feature activation-2.63e-02

Pos logit contributions
 making+0.042
:+0.039
 poking+0.038
's+0.037
 drawing+0.036
Neg logit contributions
des-0.110
use-0.108
 former-0.106
powers-0.098
 greater-0.097
Token come
Token ID1282
Feature activation+2.98e-02

Pos logit contributions
des+0.082
use+0.080
 former+0.078
powers+0.073
ited+0.072
Neg logit contributions
 making-0.036
 poking-0.030
:-0.030
's-0.030
 drawing-0.030
Token back
Token ID736
Feature activation+8.86e-02

Pos logit contributions
des+0.081
use+0.078
ited+0.077
 former+0.076
powers+0.072
Neg logit contributions
 making-0.029
's-0.025
 in-0.025
 on-0.025
 drawing-0.024
Token to
Token ID284
Feature activation+2.84e-02

Pos logit contributions
des+0.250
ited+0.238
use+0.234
 former+0.224
 unnecessary+0.220
Neg logit contributions
 in-0.092
 on-0.090
 making-0.088
,-0.086
:-0.083
Token play
Token ID711
Feature activation-2.30e-02

Pos logit contributions
des+0.072
use+0.070
 former+0.067
powers+0.066
ited+0.066
Neg logit contributions
 making-0.033
 drawing-0.028
 playing-0.028
 being-0.027
's-0.026
Token again
Token ID757
Feature activation+1.04e-01

Pos logit contributions
 making+0.026
's+0.023
 in+0.022
 on+0.022
 drawing+0.022
Neg logit contributions
des-0.058
use-0.057
 former-0.056
ited-0.052
powers-0.052
Token soon
Token ID2582
Feature activation+1.30e-01

Pos logit contributions
des+0.237
use+0.212
ited+0.205
powers+0.203
 reins+0.200
Neg logit contributions
 making-0.166
,-0.161
 in-0.159
's-0.157
 on-0.156
Token."
Token ID526
Feature activation+6.28e-02

Pos logit contributions
des+0.279
 former+0.271
use+0.261
ited+0.246
God+0.244
Neg logit contributions
 making-0.203
's-0.194
,-0.188
 on-0.186
:-0.185
Token And
Token ID843
Feature activation-1.31e-02

Pos logit contributions
des+0.129
use+0.116
 former+0.113
powers+0.111
 anytime+0.106
Neg logit contributions
 making-0.088
 poking-0.084
 drawing-0.078
 shooting-0.076
 doing-0.076
Token from
Token ID422
Feature activation-6.65e-02

Pos logit contributions
 making+0.014
 poking+0.013
 video+0.013
 drawing+0.012
 slapping+0.012
Neg logit contributions
des-0.029
 anytime-0.027
 former-0.026
use-0.025
yes-0.025
Token that
Token ID326
Feature activation-6.10e-02

Pos logit contributions
 making+0.041
's+0.037
 poking+0.035
 video+0.032
 drawing+0.032
Neg logit contributions
des-0.183
use-0.180
 former-0.177
minded-0.166
ited-0.164
Token day
Token ID1110
Feature activation-6.20e-03

Pos logit contributions
 poking+0.089
 spraying+0.086
 making+0.084
 video+0.082
 fighting+0.082
Neg logit contributions
use-0.102
des-0.099
ited-0.094
 anytime-0.091
 greater-0.086
Token.
Token ID13
Feature activation-2.58e-02

Pos logit contributions
 making+0.016
 poking+0.013
 drawing+0.013
 video+0.012
's+0.012
Neg logit contributions
use-0.054
des-0.053
 former-0.051
powers-0.048
God-0.048
Token He
Token ID679
Feature activation+1.17e-03

Pos logit contributions
 making+0.033
's+0.033
:+0.032
 Sam+0.030
 drawing+0.029
Neg logit contributions
des-0.058
use-0.056
ited-0.055
 former-0.054
 unnecessary-0.051
Token looks
Token ID3073
Feature activation+4.24e-03

Pos logit contributions
des+0.003
use+0.003
 former+0.003
powers+0.003
ited+0.003
Neg logit contributions
 making-0.001
 poking-0.001
 drawing-0.001
:-0.001
 video-0.001
Token strange
Token ID6283
Feature activation-3.56e-02

Pos logit contributions
des+0.010
use+0.010
 former+0.010
powers+0.009
God+0.009
Neg logit contributions
 making-0.005
 poking-0.004
's-0.004
:-0.004
 drawing-0.004
Token.
Token ID13
Feature activation-2.61e-02

Pos logit contributions
 making+0.022
:+0.021
's+0.020
 in+0.020
 on+0.018
Neg logit contributions
des-0.108
use-0.103
 former-0.102
ited-0.098
irst-0.096
Token Maybe
Token ID6674
Feature activation+1.20e-01

Pos logit contributions
's+0.035
 making+0.034
:+0.034
 Sam+0.033
 was+0.031
Neg logit contributions
des-0.057
ited-0.055
use-0.055
 former-0.053
 unnecessary-0.049
Token he
Token ID339
Feature activation+2.84e-02

Pos logit contributions
des+0.332
 former+0.326
use+0.318
ited+0.314
rene+0.300
Neg logit contributions
 making-0.114
's-0.103
:-0.098
 was-0.094
 in-0.094
Token is
Token ID318
Feature activation-5.81e-02

Pos logit contributions
des+0.072
 former+0.069
use+0.069
powers+0.064
 reins+0.064
Neg logit contributions
 making-0.028
's-0.026
:-0.024
 poking-0.023
 was-0.023
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
 making+0.062
 poking+0.054
:+0.054
's+0.054
 video+0.053
Neg logit contributions
des-0.139
use-0.136
 former-0.132
powers-0.123
ited-0.121
Token traveler
Token ID40168
Feature activation+6.88e-02

Pos logit contributions
des+0.029
use+0.027
 former+0.026
ited+0.025
powers+0.025
Neg logit contributions
 making-0.017
 drawing-0.015
 poking-0.015
 video-0.015
's-0.015
Token.
Token ID13
Feature activation-2.67e-02

Pos logit contributions
des+0.187
 former+0.179
use+0.178
ited+0.168
powers+0.167
Neg logit contributions
 making-0.061
's-0.053
:-0.052
 in-0.050
 on-0.048
Token tw
Token ID665
Feature activation+3.10e-02

Pos logit contributions
des+0.035
use+0.033
 former+0.031
ited+0.031
minded+0.031
Neg logit contributions
 making-0.018
 drawing-0.016
 poking-0.016
 video-0.016
's-0.015
Tokenig
Token ID328
Feature activation-1.79e-02

Pos logit contributions
use+0.072
des+0.071
 former+0.065
minded+0.065
God+0.065
Neg logit contributions
 making-0.035
:-0.033
 poking-0.033
 drawing-0.032
's-0.030
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
 making+0.017
 video+0.016
 fighting+0.016
 poking+0.015
:+0.015
Neg logit contributions
des-0.042
use-0.041
 former-0.040
powers-0.037
 reins-0.037
Token He
Token ID679
Feature activation+4.38e-03

Pos logit contributions
 making+0.023
:+0.022
 Sam+0.020
's+0.020
 drawing+0.019
Neg logit contributions
des-0.063
use-0.063
 former-0.060
ited-0.058
 unnecessary-0.056
Token wished
Token ID16555
Feature activation+5.81e-02

Pos logit contributions
des+0.012
use+0.011
 former+0.011
 unnecessary+0.010
powers+0.010
Neg logit contributions
 making-0.004
's-0.003
:-0.003
 was-0.003
 poking-0.003
Token he
Token ID339
Feature activation+3.28e-02

Pos logit contributions
des+0.121
use+0.116
 former+0.114
powers+0.106
 greater+0.105
Neg logit contributions
 making-0.070
 video-0.070
's-0.068
 poking-0.065
 drawing-0.063
Token had
Token ID550
Feature activation-6.28e-02

Pos logit contributions
des+0.099
use+0.097
 former+0.096
powers+0.090
ited+0.089
Neg logit contributions
 making-0.015
's-0.011
 poking-0.011
:-0.011
 drawing-0.010
Token shared
Token ID4888
Feature activation-1.21e-02

Pos logit contributions
 making+0.066
 video+0.061
 poking+0.060
:+0.058
 drawing+0.057
Neg logit contributions
des-0.147
use-0.143
 former-0.139
powers-0.129
God-0.128
Token it
Token ID340
Feature activation-9.54e-03

Pos logit contributions
 making+0.010
 poking+0.009
's+0.009
:+0.009
 video+0.009
Neg logit contributions
des-0.031
use-0.030
 former-0.030
powers-0.028
God-0.027
Token with
Token ID351
Feature activation-7.40e-02

Pos logit contributions
 making+0.007
 poking+0.006
 video+0.005
 drawing+0.005
:+0.005
Neg logit contributions
des-0.026
use-0.025
 former-0.025
powers-0.023
minded-0.023
Token Sam
Token ID3409
Feature activation-6.92e-02

Pos logit contributions
 making+0.075
 poking+0.066
's+0.065
:+0.064
 video+0.063
Neg logit contributions
des-0.182
use-0.177
 former-0.173
powers-0.161
God-0.159
Token at
Token ID379
Feature activation-4.07e-02

Pos logit contributions
 making+0.002
 in+0.002
's+0.002
:+0.002
 on+0.002
Neg logit contributions
des-0.011
use-0.010
 former-0.010
God-0.010
powers-0.010
Token the
Token ID262
Feature activation-8.57e-03

Pos logit contributions
 making+0.016
 was+0.010
:+0.009
 poking+0.009
's+0.009
Neg logit contributions
des-0.131
use-0.130
 former-0.125
ited-0.120
God-0.120
Token trees
Token ID7150
Feature activation-7.39e-03

Pos logit contributions
 making+0.010
 video+0.009
's+0.009
 poking+0.009
 drawing+0.008
Neg logit contributions
des-0.021
use-0.020
 former-0.019
ited-0.018
powers-0.018
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
 video+0.006
 making+0.006
 drawing+0.006
 poking+0.006
 grid+0.005
Neg logit contributions
des-0.017
use-0.017
 former-0.016
 anytime-0.016
minded-0.015
Token His
Token ID2399
Feature activation+4.77e-03

Pos logit contributions
 Sam+0.019
 making+0.018
's+0.018
:+0.018
 Max+0.017
Neg logit contributions
des-0.054
use-0.053
ited-0.051
 former-0.051
 unnecessary-0.048
Token wife
Token ID3656
Feature activation+9.96e-02

Pos logit contributions
des+0.011
use+0.011
 former+0.010
ited+0.010
powers+0.010
Neg logit contributions
 making-0.006
:-0.005
 poking-0.005
's-0.005
 drawing-0.005
Token always
Token ID1464
Feature activation+5.66e-02

Pos logit contributions
des+0.261
use+0.255
 former+0.249
powers+0.233
ited+0.230
Neg logit contributions
 making-0.085
 poking-0.072
:-0.071
's-0.071
 drawing-0.070
Token stayed
Token ID9658
Feature activation+2.74e-02

Pos logit contributions
des+0.144
 former+0.139
use+0.135
powers+0.130
God+0.127
Neg logit contributions
 making-0.060
's-0.053
:-0.051
 poking-0.049
 was-0.049
Token at
Token ID379
Feature activation-4.00e-02

Pos logit contributions
des+0.071
use+0.070
 former+0.067
powers+0.066
God+0.063
Neg logit contributions
 making-0.028
 in-0.025
 on-0.024
's-0.024
 playing-0.023
Token home
Token ID1363
Feature activation+4.99e-02

Pos logit contributions
 making+0.024
 drawing+0.017
 playing+0.017
 being+0.016
 poking+0.016
Neg logit contributions
des-0.122
use-0.121
 former-0.115
powers-0.113
ited-0.113
Token and
Token ID290
Feature activation+9.36e-03

Pos logit contributions
des+0.142
use+0.139
 former+0.134
powers+0.127
ited+0.127
Neg logit contributions
 making-0.035
:-0.028
's-0.027
 poking-0.027
 drawing-0.026
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
 making+0.049
 poking+0.043
:+0.041
 drawing+0.040
 video+0.040
Neg logit contributions
des-0.158
use-0.154
 former-0.151
powers-0.142
God-0.139
Token\n
Token ID198
Feature activation+1.73e-02

Pos logit contributions
 making+0.026
's+0.025
:+0.024
 poking+0.023
 drawing+0.023
Neg logit contributions
des-0.053
use-0.051
 former-0.051
ited-0.047
 unnecessary-0.046
Token\n
Token ID198
Feature activation+1.77e-02

Pos logit contributions
des+0.044
use+0.043
 former+0.042
ited+0.039
powers+0.039
Neg logit contributions
 making-0.016
:-0.015
's-0.014
 poking-0.014
 drawing-0.014
TokenThe
Token ID464
Feature activation-9.61e-03

Pos logit contributions
des+0.028
use+0.028
 former+0.028
 unnecessary+0.024
powers+0.024
Neg logit contributions
 making-0.033
's-0.033
:-0.032
 poking-0.031
 was-0.031
Token girl
Token ID2576
Feature activation+1.07e-02

Pos logit contributions
 making+0.013
 drawing+0.011
's+0.011
 video+0.011
 poking+0.011
Neg logit contributions
des-0.022
use-0.022
 former-0.021
 unnecessary-0.019
ited-0.019
Token searched
Token ID16499
Feature activation+6.83e-02

Pos logit contributions
des+0.024
use+0.023
 former+0.022
powers+0.021
ited+0.021
Neg logit contributions
 making-0.013
 poking-0.011
 video-0.011
 drawing-0.011
:-0.011
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
des+0.195
use+0.189
 former+0.187
powers+0.177
God+0.174
Neg logit contributions
 making-0.050
 poking-0.039
:-0.039
's-0.037
 drawing-0.036
Token searched
Token ID16499
Feature activation+6.77e-02

Pos logit contributions
des+0.033
use+0.033
 former+0.031
ited+0.029
powers+0.029
Neg logit contributions
 making-0.018
 poking-0.016
:-0.016
 video-0.016
's-0.016
Token,
Token ID11
Feature activation-4.88e-02

Pos logit contributions
des+0.190
use+0.184
 former+0.181
powers+0.171
God+0.168
Neg logit contributions
 making-0.052
 poking-0.041
:-0.041
's-0.040
 drawing-0.039
Token until
Token ID1566
Feature activation+2.73e-02

Pos logit contributions
 video+0.070
:+0.067
's+0.063
 making+0.062
on+0.062
Neg logit contributions
des-0.090
use-0.085
 former-0.085
 greater-0.080
ited-0.079
Token she
Token ID673
Feature activation+7.01e-02

Pos logit contributions
des+0.056
use+0.054
 former+0.053
powers+0.049
ited+0.048
Neg logit contributions
 making-0.040
 poking-0.037
:-0.036
 drawing-0.036
's-0.036
Token\n
Token ID198
Feature activation+2.76e-02

Pos logit contributions
des+0.169
use+0.165
 former+0.159
powers+0.149
God+0.147
Neg logit contributions
 making-0.079
 poking-0.070
 drawing-0.068
's-0.068
:-0.067
Token\n
Token ID198
Feature activation+2.67e-02

Pos logit contributions
des+0.056
use+0.055
 former+0.051
God+0.050
powers+0.049
Neg logit contributions
 making-0.037
 poking-0.034
 drawing-0.033
 video-0.032
's-0.032
TokenMom
Token ID29252
Feature activation+4.16e-02

Pos logit contributions
des+0.039
use+0.038
 former+0.035
God+0.033
powers+0.031
Neg logit contributions
 making-0.053
 poking-0.050
 drawing-0.050
 video-0.049
's-0.049
Token says
Token ID1139
Feature activation-4.38e-02

Pos logit contributions
use+0.063
des+0.062
minded+0.055
ited+0.052
powers+0.050
Neg logit contributions
 making-0.076
 poking-0.074
 drawing-0.072
 video-0.070
 spraying-0.070
Token,
Token ID11
Feature activation-4.89e-02

Pos logit contributions
 making+0.032
 video+0.031
 drawing+0.027
 shooting+0.026
 poking+0.026
Neg logit contributions
use-0.112
des-0.110
minded-0.101
powers-0.099
 anytime-0.099
Token "
Token ID366
Feature activation+5.39e-02

Pos logit contributions
 video+0.033
's+0.032
 making+0.031
 drawing+0.030
 poking+0.029
Neg logit contributions
use-0.126
des-0.122
God-0.114
 former-0.114
 reins-0.111
TokenThank
Token ID10449
Feature activation-2.84e-02

Pos logit contributions
des+0.110
 former+0.108
use+0.108
 unnecessary+0.096
 reins+0.094
Neg logit contributions
 making-0.080
's-0.073
 was-0.071
:-0.071
 poking-0.070
Token you
Token ID345
Feature activation+1.87e-02

Pos logit contributions
 making+0.051
 poking+0.048
 drawing+0.048
 video+0.047
:+0.047
Neg logit contributions
des-0.047
use-0.046
 former-0.042
powers-0.039
minded-0.038
Token,
Token ID11
Feature activation-4.69e-02

Pos logit contributions
des+0.054
use+0.054
 former+0.052
powers+0.049
minded+0.049
Neg logit contributions
 making-0.009
 poking-0.007
 drawing-0.007
 video-0.007
's-0.006
Token Ben
Token ID3932
Feature activation-5.64e-02

Pos logit contributions
 making+0.056
 poking+0.051
 drawing+0.051
 video+0.050
:+0.050
Neg logit contributions
des-0.104
use-0.102
 former-0.098
ited-0.091
powers-0.091
Token and
Token ID290
Feature activation+1.38e-02

Pos logit contributions
 making+0.051
 poking+0.046
 drawing+0.045
 video+0.045
 shooting+0.041
Neg logit contributions
des-0.140
use-0.139
 former-0.130
minded-0.124
powers-0.123
Token time
Token ID640
Feature activation-3.72e-02

Pos logit contributions
des+0.202
use+0.184
powers+0.180
 former+0.180
 unnecessary+0.171
Neg logit contributions
 making-0.150
:-0.148
 poking-0.143
 drawing-0.143
 video-0.142
Token had
Token ID550
Feature activation-6.61e-02

Pos logit contributions
 making+0.046
 video+0.044
 poking+0.043
 drawing+0.043
 spraying+0.042
Neg logit contributions
des-0.077
use-0.071
powers-0.066
 former-0.065
 anytime-0.064
Token passed
Token ID3804
Feature activation+4.22e-02

Pos logit contributions
 making+0.076
 video+0.072
:+0.070
 poking+0.069
 drawing+0.067
Neg logit contributions
des-0.145
use-0.139
 former-0.137
powers-0.128
 greater-0.127
Token.
Token ID13
Feature activation-2.23e-02

Pos logit contributions
des+0.121
use+0.118
 former+0.117
ited+0.109
 reins+0.109
Neg logit contributions
 making-0.027
's-0.024
:-0.023
 in-0.022
 poking-0.022
Token He
Token ID679
Feature activation+1.78e-03

Pos logit contributions
 making+0.026
 poking+0.023
's+0.023
:+0.023
 drawing+0.023
Neg logit contributions
des-0.052
use-0.051
 former-0.049
powers-0.046
ited-0.045
Token went
Token ID1816
Feature activation+7.60e-02

Pos logit contributions
des+0.004
use+0.004
 former+0.004
powers+0.003
 unnecessary+0.003
Neg logit contributions
 making-0.002
's-0.002
:-0.002
 poking-0.002
 was-0.002
Token to
Token ID284
Feature activation+3.09e-02

Pos logit contributions
des+0.176
use+0.172
 former+0.166
powers+0.154
minded+0.152
Neg logit contributions
 making-0.086
 poking-0.077
's-0.077
:-0.075
 video-0.075
Token bed
Token ID3996
Feature activation-8.87e-03

Pos logit contributions
des+0.078
use+0.077
 former+0.074
powers+0.070
ited+0.070
Neg logit contributions
 making-0.031
 poking-0.027
 drawing-0.027
's-0.026
:-0.026
Token feeling
Token ID4203
Feature activation+1.34e-03

Pos logit contributions
 making+0.007
 poking+0.006
 video+0.006
's+0.006
:+0.006
Neg logit contributions
des-0.022
use-0.022
 former-0.022
powers-0.020
God-0.020
Token proud
Token ID6613
Feature activation-2.15e-02

Pos logit contributions
des+0.003
use+0.003
 former+0.003
powers+0.003
ited+0.003
Neg logit contributions
 making-0.002
:-0.001
 poking-0.001
's-0.001
 drawing-0.001
Token of
Token ID286
Feature activation-4.16e-02

Pos logit contributions
 making+0.020
 poking+0.018
 video+0.018
's+0.017
 drawing+0.017
Neg logit contributions
des-0.053
use-0.051
 former-0.050
powers-0.048
minded-0.047
Token was
Token ID373
Feature activation-4.79e-02

Pos logit contributions
 making+0.040
 poking+0.036
 video+0.035
's+0.035
 drawing+0.034
Neg logit contributions
des-0.093
use-0.090
 former-0.087
powers-0.082
God-0.081
Token very
Token ID845
Feature activation+7.23e-02

Pos logit contributions
 making+0.042
 video+0.041
 poking+0.041
:+0.040
's+0.038
Neg logit contributions
des-0.112
 former-0.109
use-0.107
 greater-0.102
 unnecessary-0.102
Token fashionable
Token ID38378
Feature activation+2.89e-02

Pos logit contributions
use+0.160
des+0.160
 former+0.154
 reins+0.145
ited+0.140
Neg logit contributions
 making-0.096
 drawing-0.087
's-0.085
:-0.084
 poking-0.082
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
des+0.080
use+0.078
 former+0.076
powers+0.072
ited+0.071
Neg logit contributions
 making-0.021
 poking-0.017
:-0.016
's-0.016
 drawing-0.016
Token␣
Token ID220
Feature activation+2.78e-02

Pos logit contributions
 making+0.020
's+0.019
:+0.018
 drawing+0.018
 was+0.017
Neg logit contributions
des-0.050
use-0.049
 former-0.048
ited-0.045
 unnecessary-0.045
Token\n
Token ID198
Feature activation+1.88e-02

Pos logit contributions
des+0.070
use+0.069
 former+0.067
ited+0.062
powers+0.062
Neg logit contributions
 making-0.027
:-0.024
's-0.024
 poking-0.023
 drawing-0.023
Token\n
Token ID198
Feature activation+1.82e-02

Pos logit contributions
 former+0.052
des+0.052
use+0.052
 unnecessary+0.048
ited+0.048
Neg logit contributions
 making-0.013
's-0.013
:-0.012
 was-0.012
 poking-0.010
TokenWhen
Token ID2215
Feature activation+4.43e-02

Pos logit contributions
use+0.034
 former+0.032
des+0.032
 unnecessary+0.030
minded+0.029
Neg logit contributions
's-0.031
 making-0.030
 was-0.029
 had-0.028
:-0.028
Token she
Token ID673
Feature activation+7.00e-02

Pos logit contributions
des+0.082
use+0.080
 former+0.075
God+0.069
irst+0.068
Neg logit contributions
 making-0.065
 poking-0.064
:-0.062
 video-0.061
's-0.061
Token got
Token ID1392
Feature activation+3.51e-02

Pos logit contributions
des+0.165
use+0.160
ited+0.148
 former+0.145
powers+0.137
Neg logit contributions
 making-0.076
 poking-0.071
 video-0.066
 drawing-0.064
 spraying-0.063
Token to
Token ID284
Feature activation+3.09e-02

Pos logit contributions
des+0.081
use+0.079
 former+0.077
powers+0.071
ited+0.071
Neg logit contributions
 making-0.043
 poking-0.038
 drawing-0.038
's-0.038
:-0.038
Token exploring
Token ID13504
Feature activation+7.96e-03

Pos logit contributions
des+0.143
use+0.138
 former+0.138
powers+0.127
God+0.125
Neg logit contributions
 making-0.073
's-0.070
 video-0.069
:-0.069
 poking-0.067
Token the
Token ID262
Feature activation+6.70e-03

Pos logit contributions
des+0.023
use+0.023
 former+0.022
powers+0.021
ited+0.021
Neg logit contributions
 making-0.005
:-0.004
 poking-0.004
's-0.004
 drawing-0.004
Token new
Token ID649
Feature activation+3.98e-02

Pos logit contributions
des+0.015
use+0.014
 former+0.013
ited+0.013
powers+0.013
Neg logit contributions
 making-0.009
 video-0.008
 drawing-0.008
's-0.008
 in-0.008
Token area
Token ID1989
Feature activation-1.70e-03

Pos logit contributions
des+0.087
use+0.084
 former+0.079
ited+0.076
 unnecessary+0.075
Neg logit contributions
 making-0.054
 drawing-0.050
's-0.050
:-0.050
 video-0.050
Token,
Token ID11
Feature activation-4.75e-02

Pos logit contributions
 making+0.001
 poking+0.001
 video+0.001
 drawing+0.001
's+0.001
Neg logit contributions
des-0.005
use-0.004
 former-0.004
powers-0.004
ited-0.004
Token thrilled
Token ID20536
Feature activation+5.29e-03

Pos logit contributions
 making+0.047
:+0.044
 video+0.044
's+0.043
 poking+0.043
Neg logit contributions
des-0.112
use-0.109
 former-0.108
ited-0.099
 greater-0.099
Token to
Token ID284
Feature activation+2.90e-02

Pos logit contributions
des+0.014
use+0.013
 former+0.013
powers+0.012
ited+0.012
Neg logit contributions
 making-0.005
:-0.004
 poking-0.004
's-0.004
 drawing-0.004
Token discover
Token ID7073
Feature activation-8.52e-03

Pos logit contributions
des+0.060
use+0.058
 former+0.056
powers+0.052
ited+0.051
Neg logit contributions
 making-0.042
 poking-0.038
:-0.038
's-0.037
 drawing-0.037
Token all
Token ID477
Feature activation+4.85e-02

Pos logit contributions
 making+0.008
 poking+0.007
 drawing+0.007
:+0.007
's+0.007
Neg logit contributions
des-0.021
use-0.021
 former-0.020
powers-0.019
 unnecessary-0.019
Token sorts
Token ID10524
Feature activation+3.87e-02

Pos logit contributions
des+0.113
use+0.102
 greater+0.100
away+0.100
ited+0.098
Neg logit contributions
 drawing-0.047
 video-0.046
 making-0.046
 poking-0.045
 fighting-0.042
Token of
Token ID286
Feature activation-4.42e-02

Pos logit contributions
des+0.094
use+0.088
 former+0.086
powers+0.084
 greater+0.081
Neg logit contributions
 making-0.040
 video-0.035
 drawing-0.035
 poking-0.035
's-0.035
Token and
Token ID290
Feature activation+9.83e-03

Pos logit contributions
 making+0.090
 poking+0.079
:+0.078
's+0.078
 drawing+0.077
Neg logit contributions
des-0.209
use-0.204
 former-0.198
powers-0.185
ited-0.183
Token the
Token ID262
Feature activation-1.14e-02

Pos logit contributions
des+0.021
use+0.021
 former+0.019
ited+0.018
powers+0.018
Neg logit contributions
 making-0.011
:-0.011
 video-0.011
 drawing-0.011
 poking-0.010
Token fly
Token ID6129
Feature activation+2.32e-02

Pos logit contributions
 making+0.011
:+0.010
 poking+0.009
's+0.009
 drawing+0.009
Neg logit contributions
des-0.029
use-0.028
 former-0.027
powers-0.026
 reins-0.025
Token out
Token ID503
Feature activation+2.82e-02

Pos logit contributions
des+0.047
use+0.045
 former+0.042
powers+0.041
away+0.040
Neg logit contributions
 making-0.032
 drawing-0.029
 video-0.029
 poking-0.028
:-0.028
Token of
Token ID286
Feature activation-4.34e-02

Pos logit contributions
des+0.074
use+0.072
 former+0.070
powers+0.066
ited+0.065
Neg logit contributions
 making-0.024
 poking-0.021
:-0.020
's-0.020
 drawing-0.020
Token the
Token ID262
Feature activation-1.06e-02

Pos logit contributions
 making+0.024
's+0.019
 poking+0.019
:+0.019
 drawing+0.018
Neg logit contributions
des-0.126
use-0.123
 former-0.121
powers-0.113
ited-0.113
Token house
Token ID2156
Feature activation+1.25e-03

Pos logit contributions
 making+0.014
 poking+0.012
's+0.012
:+0.012
 drawing+0.012
Neg logit contributions
des-0.024
use-0.023
 former-0.022
powers-0.020
ited-0.020
Token.
Token ID13
Feature activation-2.71e-02

Pos logit contributions
des+0.003
use+0.003
 former+0.003
powers+0.003
minded+0.003
Neg logit contributions
 making-0.001
 video-0.001
 poking-0.001
 drawing-0.001
:-0.001
Token The
Token ID383
Feature activation-1.96e-02

Pos logit contributions
 making+0.021
:+0.018
's+0.018
 poking+0.016
 video+0.016
Neg logit contributions
des-0.076
use-0.074
 former-0.071
ited-0.068
 unnecessary-0.067
Token dog
Token ID3290
Feature activation-8.58e-02

Pos logit contributions
 making+0.020
:+0.019
 poking+0.018
's+0.018
 drawing+0.017
Neg logit contributions
des-0.045
use-0.044
 former-0.044
powers-0.042
 reins-0.041
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
 making+0.085
 poking+0.075
 drawing+0.074
:+0.073
 video+0.073
Neg logit contributions
des-0.209
use-0.205
 former-0.198
powers-0.187
ited-0.185
Token,
Token ID11
Feature activation-4.43e-02

Pos logit contributions
 making+0.029
 video+0.028
 drawing+0.024
 shape+0.022
 spraying+0.022
Neg logit contributions
des-0.087
use-0.087
 anytime-0.080
irst-0.078
powers-0.077
Token there
Token ID612
Feature activation-3.17e-02

Pos logit contributions
 video+0.065
 making+0.065
 drawing+0.060
 poking+0.057
 grid+0.057
Neg logit contributions
des-0.076
use-0.075
 former-0.067
irst-0.066
 anytime-0.066
Token was
Token ID373
Feature activation-4.31e-02

Pos logit contributions
 making+0.035
 drawing+0.035
 video+0.034
 printing+0.033
 grid+0.033
Neg logit contributions
des-0.062
 anytime-0.060
use-0.055
 someday-0.054
E-0.052
Token a
Token ID257
Feature activation+1.97e-02

Pos logit contributions
 making+0.038
:+0.033
 poking+0.033
's+0.032
 video+0.032
Neg logit contributions
des-0.109
 former-0.102
use-0.100
powers-0.096
God-0.096
Token man
Token ID582
Feature activation-3.93e-03

Pos logit contributions
des+0.041
use+0.040
 former+0.039
powers+0.036
God+0.036
Neg logit contributions
 making-0.027
 poking-0.024
:-0.024
's-0.024
 drawing-0.023
Token called
Token ID1444
Feature activation+1.20e-02

Pos logit contributions
 video+0.005
 making+0.005
 grid+0.005
 poking+0.005
 Sam+0.005
Neg logit contributions
des-0.007
use-0.006
God-0.006
away-0.006
 someday-0.006
Token Nick
Token ID8047
Feature activation+3.27e-02

Pos logit contributions
des+0.030
use+0.029
 former+0.028
powers+0.026
God+0.026
Neg logit contributions
 making-0.011
 poking-0.010
's-0.009
 video-0.009
:-0.009
Token.
Token ID13
Feature activation-1.88e-02

Pos logit contributions
des+0.086
use+0.085
 former+0.081
God+0.076
powers+0.076
Neg logit contributions
 making-0.027
 poking-0.023
 video-0.022
 drawing-0.022
's-0.021
Token He
Token ID679
Feature activation+7.80e-03

Pos logit contributions
 making+0.026
 poking+0.024
:+0.024
's+0.024
 drawing+0.023
Neg logit contributions
des-0.039
use-0.038
 former-0.037
powers-0.034
ited-0.033
Token was
Token ID373
Feature activation-4.43e-02

Pos logit contributions
des+0.018
use+0.017
God+0.017
 former+0.016
powers+0.016
Neg logit contributions
 poking-0.007
 making-0.007
 fighting-0.007
 spraying-0.006
 video-0.006
Token a
Token ID257
Feature activation+1.87e-02

Pos logit contributions
:+0.052
 video+0.051
 grid+0.048
's+0.047
on+0.044
Neg logit contributions
des-0.083
 former-0.081
God-0.078
please-0.076
 greater-0.076
Token out
Token ID503
Feature activation+3.00e-02

Pos logit contributions
des+0.212
use+0.207
 former+0.205
 reins+0.195
powers+0.192
Neg logit contributions
 making-0.071
 poking-0.064
 video-0.062
:-0.062
's-0.060
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
des+0.064
 former+0.061
use+0.059
 greater+0.058
 reins+0.058
Neg logit contributions
 making-0.032
 drawing-0.030
's-0.030
 video-0.029
 poking-0.029
Token kissed
Token ID27946
Feature activation-2.48e-02

Pos logit contributions
des+0.033
use+0.033
 former+0.032
powers+0.030
 unnecessary+0.029
Neg logit contributions
 making-0.016
's-0.014
 poking-0.014
:-0.013
 drawing-0.013
Token Molly
Token ID30236
Feature activation+5.07e-03

Pos logit contributions
 making+0.028
 drawing+0.027
 video+0.027
 poking+0.026
 spraying+0.025
Neg logit contributions
des-0.053
use-0.052
 former-0.048
God-0.048
 reins-0.048
Token's
Token ID338
Feature activation-4.24e-02

Pos logit contributions
des+0.012
 reins+0.011
 former+0.011
 anytime+0.011
use+0.011
Neg logit contributions
 making-0.005
 poking-0.005
 video-0.004
 spraying-0.004
 fighting-0.004
Token forehead
Token ID22645
Feature activation-8.67e-03

Pos logit contributions
 making+0.055
 poking+0.050
:+0.049
's+0.049
 drawing+0.048
Neg logit contributions
des-0.093
use-0.090
 former-0.088
powers-0.081
 reins-0.080
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
 making+0.008
 video+0.008
 poking+0.007
 drawing+0.007
 fighting+0.007
Neg logit contributions
des-0.020
 former-0.019
use-0.019
powers-0.018
 greater-0.018
Token "
Token ID366
Feature activation+5.39e-02

Pos logit contributions
 making+0.029
 poking+0.027
 drawing+0.026
 video+0.026
's+0.025
Neg logit contributions
des-0.052
use-0.051
 former-0.050
powers-0.046
God-0.045
TokenGood
Token ID10248
Feature activation+6.16e-02

Pos logit contributions
use+0.108
 former+0.103
des+0.097
 unnecessary+0.096
powers+0.095
Neg logit contributions
 making-0.078
 had-0.074
's-0.072
 was-0.072
 being-0.070
Tokenbye
Token ID16390
Feature activation+6.93e-02

Pos logit contributions
des+0.079
use+0.074
 former+0.071
powers+0.061
ited+0.059
Neg logit contributions
 making-0.136
 poking-0.129
:-0.127
 drawing-0.126
's-0.126
Token Molly
Token ID30236
Feature activation+5.45e-03

Pos logit contributions
des+0.202
 former+0.195
use+0.194
ited+0.182
powers+0.178
Neg logit contributions
 making-0.045
's-0.039
:-0.037
 drawing-0.034
 poking-0.034
Token of
Token ID286
Feature activation-4.74e-02

Pos logit contributions
des+0.221
use+0.219
 former+0.214
God+0.200
 reins+0.199
Neg logit contributions
 making-0.062
:-0.052
 poking-0.050
's-0.050
 drawing-0.048
Token special
Token ID2041
Feature activation+2.93e-02

Pos logit contributions
 making+0.062
 poking+0.054
 playing+0.054
 drawing+0.053
 was+0.052
Neg logit contributions
des-0.110
use-0.110
 former-0.103
minded-0.098
God-0.098
Token things
Token ID1243
Feature activation-3.16e-02

Pos logit contributions
des+0.066
use+0.063
 former+0.062
ited+0.059
rene+0.059
Neg logit contributions
 making-0.038
 drawing-0.035
 was-0.034
's-0.034
 playing-0.034
Token to
Token ID284
Feature activation+2.00e-02

Pos logit contributions
 making+0.023
:+0.020
's+0.019
 in+0.018
 was+0.017
Neg logit contributions
des-0.091
 former-0.088
use-0.087
ited-0.084
 reins-0.083
Token take
Token ID1011
Feature activation+3.88e-02

Pos logit contributions
des+0.045
use+0.044
 former+0.042
powers+0.040
ited+0.039
Neg logit contributions
 making-0.025
:-0.022
 poking-0.022
 drawing-0.022
's-0.022
Token on
Token ID319
Feature activation-8.45e-02

Pos logit contributions
des+0.099
use+0.095
 former+0.094
ited+0.090
God+0.089
Neg logit contributions
 making-0.037
's-0.035
:-0.034
 in-0.033
 on-0.033
Token a
Token ID257
Feature activation+1.77e-02

Pos logit contributions
 making+0.086
's+0.079
:+0.077
 in+0.076
 playing+0.076
Neg logit contributions
des-0.215
use-0.211
ited-0.200
 reins-0.192
 former-0.191
Token journey
Token ID7002
Feature activation+6.51e-02

Pos logit contributions
des+0.039
use+0.038
 former+0.036
minded+0.035
ited+0.034
Neg logit contributions
 making-0.023
 video-0.022
's-0.022
 drawing-0.021
 poking-0.020
Token.
Token ID13
Feature activation-1.72e-02

Pos logit contributions
des+0.196
use+0.190
 former+0.187
ited+0.182
 unnecessary+0.178
Neg logit contributions
 making-0.037
's-0.031
:-0.031
 in-0.030
 on-0.028
Token\n
Token ID198
Feature activation+9.59e-03

Pos logit contributions
's+0.014
 Sam+0.014
 making+0.014
:+0.013
 Max+0.012
Neg logit contributions
des-0.046
use-0.045
ited-0.043
 unnecessary-0.042
 former-0.042
Token\n
Token ID198
Feature activation+9.18e-03

Pos logit contributions
use+0.027
des+0.027
 former+0.026
 unnecessary+0.025
ited+0.025
Neg logit contributions
:-0.006
 making-0.006
's-0.006
 was-0.005
 poking-0.005
Token outside
Token ID2354
Feature activation+3.06e-02

Pos logit contributions
des+0.047
use+0.045
away+0.044
 former+0.044
 someday+0.044
Neg logit contributions
 video-0.025
 Sam-0.022
 Cube-0.021
 Max-0.021
 grid-0.020
Token,
Token ID11
Feature activation-4.67e-02

Pos logit contributions
des+0.075
use+0.070
 former+0.069
away+0.066
minded+0.065
Neg logit contributions
 making-0.025
 video-0.025
 poking-0.023
 drawing-0.023
's-0.021
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 making+0.047
 video+0.045
:+0.044
's+0.043
 poking+0.043
Neg logit contributions
des-0.109
use-0.105
 former-0.103
ited-0.095
 greater-0.094
Token its
Token ID663
Feature activation+7.84e-03

Pos logit contributions
des+0.038
use+0.037
 former+0.036
powers+0.033
ited+0.033
Neg logit contributions
 making-0.018
 poking-0.016
's-0.016
:-0.016
 drawing-0.015
Token leaves
Token ID5667
Feature activation+5.54e-03

Pos logit contributions
des+0.019
use+0.019
 former+0.017
ited+0.017
minded+0.016
Neg logit contributions
 making-0.009
 drawing-0.008
 poking-0.008
:-0.008
's-0.008
Token were
Token ID547
Feature activation-7.90e-02

Pos logit contributions
des+0.012
use+0.011
ited+0.011
 former+0.010
powers+0.010
Neg logit contributions
 video-0.006
 drawing-0.006
 Sam-0.005
 grid-0.005
 Max-0.005
Token moving
Token ID3867
Feature activation-4.88e-02

Pos logit contributions
 video+0.095
's+0.089
 making+0.086
:+0.084
 drawing+0.082
Neg logit contributions
 former-0.162
des-0.160
use-0.151
 greater-0.147
God-0.146
Token in
Token ID287
Feature activation-3.56e-02

Pos logit contributions
 making+0.039
 poking+0.033
:+0.032
's+0.032
 drawing+0.031
Neg logit contributions
des-0.131
use-0.128
 former-0.125
powers-0.118
ited-0.116
Token the
Token ID262
Feature activation-8.05e-03

Pos logit contributions
 making+0.026
 poking+0.022
:+0.022
's+0.022
 drawing+0.021
Neg logit contributions
des-0.096
use-0.094
 former-0.092
powers-0.086
ited-0.085
Token wind
Token ID2344
Feature activation-3.02e-02

Pos logit contributions
 making+0.009
's+0.008
 poking+0.008
 drawing+0.008
:+0.008
Neg logit contributions
des-0.020
use-0.019
 former-0.018
powers-0.017
irst-0.017
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
 video+0.035
 Sam+0.032
 grid+0.031
 drawing+0.031
 poking+0.030
Neg logit contributions
use-0.056
des-0.056
 former-0.054
away-0.053
ited-0.051
Token.
Token ID13
Feature activation-1.61e-02

Pos logit contributions
 making+0.005
 poking+0.004
 video+0.004
 drawing+0.004
 fighting+0.004
Neg logit contributions
des-0.011
use-0.010
God-0.010
 former-0.010
away-0.009
Token She
Token ID1375
Feature activation+4.51e-02

Pos logit contributions
 making+0.028
 poking+0.026
 video+0.025
's+0.025
:+0.025
Neg logit contributions
des-0.027
use-0.026
 former-0.026
powers-0.022
 anytime-0.022
Token loved
Token ID6151
Feature activation+4.19e-02

Pos logit contributions
des+0.089
use+0.081
God+0.077
powers+0.075
 former+0.072
Neg logit contributions
 poking-0.060
 making-0.060
 fighting-0.060
 video-0.058
 spraying-0.056
Token to
Token ID284
Feature activation+3.27e-02

Pos logit contributions
des+0.096
use+0.092
 former+0.090
powers+0.084
God+0.083
Neg logit contributions
 making-0.047
:-0.043
 poking-0.043
's-0.043
 video-0.042
Token play
Token ID711
Feature activation-1.89e-02

Pos logit contributions
des+0.074
use+0.072
 former+0.070
powers+0.065
ited+0.064
Neg logit contributions
 making-0.040
 poking-0.036
:-0.036
's-0.035
 drawing-0.035
Token with
Token ID351
Feature activation-7.51e-02

Pos logit contributions
:+0.017
 making+0.017
 poking+0.016
 fighting+0.016
 had+0.015
Neg logit contributions
des-0.044
use-0.042
 former-0.040
away-0.039
powers-0.039
Token her
Token ID607
Feature activation+4.87e-02

Pos logit contributions
 making+0.101
:+0.096
's+0.093
 poking+0.092
 video+0.089
Neg logit contributions
des-0.148
use-0.141
 former-0.139
powers-0.134
 reins-0.129
Token dolls
Token ID36062
Feature activation-1.05e-02

Pos logit contributions
des+0.103
use+0.101
 former+0.101
powers+0.097
 reins+0.094
Neg logit contributions
 making-0.057
:-0.055
 poking-0.053
's-0.052
 video-0.049
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
 poking+0.009
 making+0.009
 video+0.008
 spraying+0.008
 slapping+0.008
Neg logit contributions
use-0.025
des-0.025
 former-0.023
 anytime-0.023
minded-0.022
Token t
Token ID256
Feature activation+3.04e-02

Pos logit contributions
use+0.032
des+0.031
 former+0.029
 reins+0.028
God+0.028
Neg logit contributions
:-0.018
 making-0.018
 poking-0.017
 video-0.017
's-0.016
Tokeneddy
Token ID21874
Feature activation-3.88e-02

Pos logit contributions
des+0.051
use+0.050
 former+0.048
 unnecessary+0.043
 greater+0.043
Neg logit contributions
 making-0.054
's-0.051
:-0.051
 poking-0.050
 drawing-0.050
Token field
Token ID2214
Feature activation+9.09e-03

Pos logit contributions
 making+0.042
 poking+0.037
's+0.037
 having+0.037
:+0.036
Neg logit contributions
 former-0.065
des-0.065
use-0.064
away-0.058
 greater-0.058
Token of
Token ID286
Feature activation-4.04e-02

Pos logit contributions
des+0.019
use+0.017
 former+0.017
ited+0.017
yes+0.017
Neg logit contributions
 video-0.010
 making-0.009
 fighting-0.009
 drawing-0.009
 grid-0.009
Token flowers
Token ID12734
Feature activation-1.12e-02

Pos logit contributions
 making+0.058
 poking+0.053
's+0.052
:+0.051
 drawing+0.051
Neg logit contributions
des-0.084
use-0.082
 former-0.079
powers-0.073
ited-0.072
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
 video+0.011
 grid+0.009
 making+0.009
 Sam+0.009
 drawing+0.009
Neg logit contributions
des-0.025
use-0.025
 former-0.023
ited-0.022
 anytime-0.022
Token "
Token ID366
Feature activation+5.79e-02

Pos logit contributions
 making+0.029
 poking+0.026
:+0.026
's+0.026
 drawing+0.026
Neg logit contributions
des-0.048
use-0.047
 former-0.045
powers-0.042
ited-0.042
TokenWow
Token ID22017
Feature activation-1.35e-01

Pos logit contributions
des+0.106
use+0.103
 former+0.101
powers+0.094
 unnecessary+0.092
Neg logit contributions
 making-0.092
's-0.089
 poking-0.087
 video-0.085
 was-0.085
Token!"
Token ID2474
Feature activation+4.52e-02

Pos logit contributions
 making+0.143
 poking+0.125
 video+0.125
 drawing+0.122
:+0.122
Neg logit contributions
des-0.321
use-0.317
 former-0.300
powers-0.285
ited-0.280
Token she
Token ID673
Feature activation+7.29e-02

Pos logit contributions
des+0.098
use+0.096
 former+0.094
 unnecessary+0.085
ited+0.084
Neg logit contributions
 making-0.062
's-0.057
:-0.056
 poking-0.055
 drawing-0.054
Token said
Token ID531
Feature activation-4.54e-02

Pos logit contributions
des+0.154
 former+0.147
use+0.147
powers+0.142
ited+0.138
Neg logit contributions
 making-0.092
 video-0.083
 poking-0.081
 drawing-0.079
 spraying-0.077
Token.
Token ID13
Feature activation-2.20e-02

Pos logit contributions
 video+0.046
 grid+0.046
 making+0.045
 fighting+0.041
 doing+0.040
Neg logit contributions
des-0.096
powers-0.088
yes-0.087
 former-0.087
away-0.085
Token She
Token ID1375
Feature activation+3.88e-02

Pos logit contributions
 making+0.017
 poking+0.014
:+0.014
's+0.014
 drawing+0.014
Neg logit contributions
des-0.060
use-0.058
 former-0.057
powers-0.053
ited-0.053
Token Lily
Token ID20037
Feature activation-9.51e-03

Pos logit contributions
 making+0.000
 poking+0.000
:+0.000
's+0.000
 drawing+0.000
Neg logit contributions
des-0.001
use-0.001
 former-0.001
powers-0.001
ited-0.001
Token her
Token ID607
Feature activation+4.98e-02

Pos logit contributions
 making+0.010
 poking+0.009
 video+0.009
 shooting+0.009
 drawing+0.009
Neg logit contributions
des-0.022
use-0.020
 former-0.020
powers-0.019
minded-0.019
Token shapes
Token ID15268
Feature activation-8.15e-02

Pos logit contributions
des+0.117
use+0.113
 former+0.112
powers+0.105
 reins+0.103
Neg logit contributions
 making-0.052
 poking-0.047
:-0.045
's-0.045
 drawing-0.044
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
 making+0.076
 video+0.069
 poking+0.069
's+0.064
 fighting+0.064
Neg logit contributions
des-0.187
use-0.177
 former-0.177
powers-0.173
 greater-0.166
Token They
Token ID1119
Feature activation-6.71e-02

Pos logit contributions
 making+0.023
 poking+0.020
:+0.020
's+0.020
 drawing+0.019
Neg logit contributions
des-0.062
use-0.061
 former-0.059
powers-0.055
ited-0.054
Token were
Token ID547
Feature activation-8.32e-02

Pos logit contributions
 making+0.069
 poking+0.065
:+0.062
 video+0.062
 drawing+0.061
Neg logit contributions
use-0.154
des-0.153
 former-0.147
ited-0.140
God-0.136
Token brown
Token ID7586
Feature activation-5.90e-02

Pos logit contributions
 video+0.096
's+0.093
 grid+0.085
on+0.084
 tube+0.083
Neg logit contributions
 former-0.156
 greater-0.152
powers-0.142
des-0.142
 unnecessary-0.141
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
 making+0.059
 poking+0.055
 video+0.053
 shooting+0.051
 drawing+0.051
Neg logit contributions
des-0.131
use-0.129
 former-0.126
powers-0.121
minded-0.120
Token hard
Token ID1327
Feature activation-4.29e-04

Pos logit contributions
use+0.023
 former+0.022
des+0.021
 greater+0.021
God+0.021
Neg logit contributions
's-0.017
 poking-0.016
:-0.016
 video-0.016
 making-0.016
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
 making+0.000
 poking+0.000
's+0.000
:+0.000
 video+0.000
Neg logit contributions
des-0.001
use-0.001
 former-0.001
powers-0.001
minded-0.001
Token smelled
Token ID37609
Feature activation-3.86e-02

Pos logit contributions
des+0.025
 former+0.025
use+0.025
 greater+0.023
powers+0.023
Neg logit contributions
 making-0.016
 poking-0.016
:-0.015
 video-0.015
's-0.015