TOP ACTIVATIONS
MAX = 12.396

do . Hours passed and Lily 's parents
wrong . Hours passed and the animal looked
to work . Hours later , the prince and
alone . Hours passed , but nobody came
baseball returned . Hours later , the sun had
down and fell asleep . Hours later it woke up and
it was smooth again . Hours and hours went by ,
the door would open . Hours passed , but no -
his tight yellow shirt . One day , he asked his
liked to play outside . One day , they saw a
brave girl named Jane . One day , she was searching
you â . One day she was in her
barn full of hay . One day , a mean fox
to lose anything important . One day , he lost his
little boy called Jack . One day , he was feeling
a girl named Lily . One day , Lily and her
little girl named Lily . One day , she was playing
little girl named Lily . One day , Lily went for
to buy her some . One day , her mom brought
little girl named Sam . One day Sam picked a lemon

MIN ACTIVATIONS
MIN = -10.270

, he is playing a sax ophone . It is a
watch the man with the sax ophone . He plays fast
man playing jazz on his sax ophone . She thought it
low . He makes the sax ophone sing . He is
. He plays with the sax ophone . He is also
The man with the sax ophone has a friend .
loved the sound of the sax ophone and the drums .
good . The sax ophone and the guitar support
, the guitar and the sax ophone support each other .
The man started playing his sax ophone and Lily rode her
the trumpet , and the sax ophone . They also teach
little girl playing a toy sax ophone . She was playing
excited ! She grabbed a sax ophone and played perfectly .
. Jack had his sax ophone , Jill had her
. The par rot was soon able to
time there was a furry par rot . He liked to
par rot , but the par rot flew away . The
They say goodbye to the par rot . They hope to
it with you ?" the par rot said .
was a very talk ative par rot , and she loved

INTERVAL 6.730 - 12.396
CONTAINS 0.958%

play together every day . One day , they heard about
a bit afraid . But soon enough , Sarah 's fear
shape with little bumps on top . When Tim
So the yacht sailed slowly into the part and stopped
who liked to run . One day he was running in

INTERVAL 1.063 - 6.730
CONTAINS 25.849%

Anna asked . " No , sweet ie ,
She closed her eyes and tried to imagine what it would
the wolf . He said " Hi ! I 'm Tim
blanket and used it to cover herself . Her mom found
, " Mine !" He ran towards the shiny object but

INTERVAL -4.603 - 1.063
CONTAINS 71.909%

back to the plastic box and found something new . It
. Then , Spot had an idea . He
runs at them . Sara and Ben are scared .
Lily 's hand . " What happened ?" she
's play ". The short girl replied , " Ok !"

INTERVAL -10.270 - -4.603
CONTAINS 1.284%

He has to write a story , do some math ,
worked on her pet , talking to it as she spread
together . It was a wonderful day with lots of fun
family . He loved to play in the sand and splash
it 's always better to do the right thing . <|endoftext|>
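
Note on the INTERVAL ranges above: the four ranges look like equal-width bins spanning MIN to MAX. This is inferred from the printed numbers, not stated by the tool. A minimal sketch under that assumption (the function name `interval_edges` is hypothetical) recomputes the bin edges and checks that the CONTAINS percentages account for all tokens:

```python
# Assumption: the dashboard splits [MIN, MAX] into 4 equal-width bins.
# MIN, MAX and the percentages are copied from the dump above.
MIN_ACT = -10.270
MAX_ACT = 12.396
CONTAINS_PCT = [0.958, 25.849, 71.909, 1.284]  # top interval first


def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


edges = interval_edges(MIN_ACT, MAX_ACT)
total_pct = sum(CONTAINS_PCT)

if __name__ == "__main__":
    # Print each bin as "left to right", lowest bin first.
    for left, right in zip(edges[:-1], edges[1:]):
        print(f"{left:.3f} to {right:.3f}")
    print(f"total: {total_pct:.3f}%")
```

The interior edges land within rounding distance of the printed boundaries (-4.603, 1.063, 6.730), which supports the equal-width reading. Most tokens fall in the two middle bins, so strong positive or negative activations of this feature are rare.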
Token do
Token ID466
Feature activation-6.91e+00

Pos logit contributions
use+0.012
,"+0.011
+0.011
',"+0.011
rated+0.011
Neg logit contributions
 sharing-0.008
 performing-0.007
 shooting-0.007
 drawing-0.007
 playing-0.007
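
Note on the record layout: in each per-token record the label and its signed number are fused with no separator (for example `Feature activation-6.91e+00` or `use+0.012`), and some contribution lines have an empty or whitespace-only token. A minimal sketch for splitting such a line, assuming the value is always a trailing signed decimal with an optional exponent (the helper name `split_label_value` is hypothetical, not part of any tool):

```python
import re

# Trailing signed decimal, optionally in scientific notation, e.g. +0.012 or -6.91e+00.
_VALUE_RE = re.compile(r"^(.*?)([+-]\d+\.\d+(?:e[+-]\d+)?)$")


def split_label_value(line):
    """Split a fused 'label<signed number>' line into (label, float).

    Leading whitespace is kept because it is part of the token text
    (' sharing' and 'sharing' are different tokens).
    Returns None if the line does not end in a signed decimal.
    """
    match = _VALUE_RE.match(line.rstrip("\n"))
    if match is None:
        return None
    label, value = match.groups()
    return label, float(value)


if __name__ == "__main__":
    for raw in ["Feature activation-6.91e+00", "use+0.012", " sharing-0.008", "+0.011"]:
        print(repr(raw), "->", split_label_value(raw))
```

Lines without a trailing number, such as `Pos logit contributions`, return `None`, so a caller can treat them as section markers. A token whose own text ends in something like `+1.5` would be split incorrectly, which is a known limitation of this sketch.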
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 playing+0.226
 sharing+0.224
 drawing+0.219
 video+0.215
 shooting+0.215
Neg logit contributions
use-0.184
 tighter-0.183
igi-0.169
rated-0.167
-0.166
Token
Token ID220
Feature activation+4.01e+00

Pos logit contributions
 playing+0.032
 sharing+0.031
 being+0.030
 drawing+0.029
 acting+0.029
Neg logit contributions
 tighter-0.043
use-0.042
igi-0.042
rated-0.040
Lex-0.040
Token
Token ID198
Feature activation-3.06e-01

Pos logit contributions
 tighter+0.150
use+0.148
igi+0.146
rated+0.141
Lex+0.138
Neg logit contributions
 playing-0.098
 sharing-0.095
 being-0.093
 drawing-0.090
 acting-0.087
Token
Token ID198
Feature activation-2.55e-01

Pos logit contributions
 playing+0.007
 sharing+0.007
 being+0.007
 drawing+0.006
 acting+0.006
Neg logit contributions
 tighter-0.012
use-0.012
rated-0.012
igi-0.012
Lex-0.011
TokenHours
Token ID39792
Feature activation+1.24e+01

Pos logit contributions
 playing+0.010
 sharing+0.010
 being+0.010
 drawing+0.010
 acting+0.010
Neg logit contributions
 tighter-0.005
use-0.005
igi-0.005
rated-0.005
Lex-0.005
Token passed
Token ID3804
Feature activation+3.16e+00

Pos logit contributions
 tighter+0.276
use+0.264
igi+0.253
rated+0.231
Lex+0.224
Neg logit contributions
 sharing-0.483
 playing-0.466
 video-0.459
 drawing-0.456
 acting-0.454
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
use+0.142
igi+0.141
 reins+0.141
rated+0.136
Lex+0.133
Neg logit contributions
 playing-0.056
 being-0.052
 sharing-0.045
 teasing-0.044
 drawing-0.043
Token Lily
Token ID20037
Feature activation-4.90e-01

Pos logit contributions
 playing+0.003
 sharing+0.003
 being+0.002
 drawing+0.002
 acting+0.002
Neg logit contributions
 tighter-0.004
use-0.004
igi-0.004
rated-0.003
Lex-0.003
Token's
Token ID338
Feature activation-2.62e+00

Pos logit contributions
 sharing+0.012
 playing+0.011
 being+0.011
 drawing+0.011
 video+0.011
Neg logit contributions
 tighter-0.019
use-0.018
igi-0.018
rated-0.017
Lex-0.017
Token parents
Token ID3397
Feature activation+1.29e+00

Pos logit contributions
 sharing+0.078
 being+0.075
 playing+0.074
 acting+0.071
 doing+0.071
Neg logit contributions
 tighter-0.082
use-0.077
igi-0.076
Lex-0.072
 eggs-0.072
Token wrong
Token ID2642
Feature activation-1.60e-01

Pos logit contributions
 playing+0.103
 sharing+0.101
 drawing+0.101
 video+0.101
 shooting+0.099
Neg logit contributions
 tighter-0.080
use-0.076
rated-0.075
-0.072
igi-0.071
Token.
Token ID13
Feature activation-8.00e-01

Pos logit contributions
 playing+0.005
 sharing+0.005
 being+0.005
 drawing+0.005
 acting+0.005
Neg logit contributions
 tighter-0.005
use-0.005
igi-0.004
rated-0.004
-0.004
Token
Token ID220
Feature activation+4.20e+00

Pos logit contributions
 playing+0.021
 sharing+0.020
 being+0.020
 drawing+0.019
 acting+0.019
Neg logit contributions
 tighter-0.029
use-0.028
igi-0.028
rated-0.027
Lex-0.026
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
 tighter+0.155
use+0.153
igi+0.151
rated+0.145
Lex+0.142
Neg logit contributions
 playing-0.105
 sharing-0.102
 being-0.099
 drawing-0.096
 acting-0.094
Token
Token ID198
Feature activation-6.69e-02

Pos logit contributions
 playing+0.003
 sharing+0.002
 being+0.002
 drawing+0.002
 acting+0.002
Neg logit contributions
 tighter-0.004
use-0.004
igi-0.004
rated-0.004
Lex-0.004
TokenHours
Token ID39792
Feature activation+1.23e+01

Pos logit contributions
 playing+0.003
 sharing+0.003
 being+0.003
 drawing+0.003
 acting+0.003
Neg logit contributions
 tighter-0.001
use-0.001
igi-0.001
Lex-0.001
rated-0.001
Token passed
Token ID3804
Feature activation+3.23e+00

Pos logit contributions
 tighter+0.295
use+0.279
igi+0.268
rated+0.250
Lex+0.240
Neg logit contributions
 sharing-0.461
 playing-0.453
 drawing-0.442
 video-0.439
 acting-0.436
Token and
Token ID290
Feature activation+3.00e-02

Pos logit contributions
igi+0.136
use+0.135
 tighter+0.128
rated+0.128
Lex+0.128
Neg logit contributions
 playing-0.067
 being-0.062
 sharing-0.059
 shooting-0.058
 drawing-0.056
Token the
Token ID262
Feature activation+3.41e-01

Pos logit contributions
 tighter+0.001
use+0.001
igi+0.001
rated+0.001
Lex+0.001
Neg logit contributions
 playing-0.001
 sharing-0.001
 being-0.001
 drawing-0.001
 acting-0.001
Token animal
Token ID5044
Feature activation-4.23e+00

Pos logit contributions
 tighter+0.011
igi+0.011
use+0.011
Lex+0.010
 eggs+0.010
Neg logit contributions
 sharing-0.010
 playing-0.009
 being-0.009
 drawing-0.008
 acting-0.008
Token looked
Token ID3114
Feature activation+1.63e+00

Pos logit contributions
 sharing+0.105
 playing+0.100
 being+0.095
 drawing+0.094
 trading+0.092
Neg logit contributions
 tighter-0.158
use-0.148
igi-0.147
rated-0.146
-0.139
Token to
Token ID284
Feature activation+4.82e-01

Pos logit contributions
 tighter+0.093
 eggs+0.083
use+0.081
 pans+0.080
 sugar+0.077
Neg logit contributions
 sharing-0.116
 video-0.116
 teasing-0.113
 being-0.112
 Spike-0.111
Token work
Token ID670
Feature activation+1.05e+00

Pos logit contributions
 tighter+0.016
use+0.016
igi+0.015
rated+0.015
+0.014
Neg logit contributions
 playing-0.013
 sharing-0.013
 being-0.013
 teasing-0.012
 drawing-0.012
Token.
Token ID13
Feature activation-5.83e-01

Pos logit contributions
 tighter+0.033
use+0.032
igi+0.031
rated+0.030
Lex+0.029
Neg logit contributions
 playing-0.032
 sharing-0.031
 being-0.030
 drawing-0.030
 acting-0.029
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
 sharing+0.017
 playing+0.017
 being+0.017
 drawing+0.016
 acting+0.016
Neg logit contributions
 tighter-0.018
use-0.018
igi-0.018
-0.017
Lex-0.017
Token
Token ID198
Feature activation-1.37e-01

Pos logit contributions
 playing+0.003
 sharing+0.003
 being+0.003
 drawing+0.003
 shooting+0.003
Neg logit contributions
 tighter-0.006
use-0.006
rated-0.006
igi-0.006
 reins-0.006
TokenHours
Token ID39792
Feature activation+1.23e+01

Pos logit contributions
 playing+0.006
 sharing+0.006
 being+0.006
 drawing+0.006
 acting+0.006
Neg logit contributions
 tighter-0.003
use-0.002
igi-0.002
rated-0.002
Lex-0.002
Token later
Token ID1568
Feature activation+4.39e+00

Pos logit contributions
 tighter+0.286
use+0.285
igi+0.270
rated+0.255
+0.240
Neg logit contributions
 sharing-0.447
 playing-0.435
 video-0.428
 teasing-0.427
 drawing-0.422
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
 tighter+0.130
use+0.127
+0.127
igi+0.124
Lex+0.120
Neg logit contributions
 sharing-0.128
 playing-0.126
 being-0.120
 drawing-0.119
 teasing-0.118
Token the
Token ID262
Feature activation+2.26e-01

Pos logit contributions
 playing+0.023
 sharing+0.022
 being+0.022
 drawing+0.021
 acting+0.020
Neg logit contributions
 tighter-0.042
use-0.041
igi-0.041
rated-0.039
Lex-0.039
Token prince
Token ID19716
Feature activation-3.05e+00

Pos logit contributions
 tighter+0.007
use+0.006
igi+0.006
rated+0.006
Lex+0.006
Neg logit contributions
 playing-0.007
 sharing-0.007
 being-0.007
 drawing-0.007
 acting-0.007
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
 sharing+0.086
 playing+0.084
 being+0.081
 teasing+0.078
 drawing+0.077
Neg logit contributions
 tighter-0.098
use-0.097
igi-0.094
-0.092
rated-0.092
Token alone
Token ID3436
Feature activation-1.76e+00

Pos logit contributions
 playing+0.094
 sharing+0.094
 being+0.091
 drawing+0.090
 acting+0.089
Neg logit contributions
 tighter-0.094
use-0.089
igi-0.087
rated-0.083
Lex-0.082
Token.
Token ID13
Feature activation-4.76e-01

Pos logit contributions
 playing+0.050
 sharing+0.047
 being+0.047
 drawing+0.044
 acting+0.043
Neg logit contributions
igi-0.062
use-0.061
 tighter-0.061
Lex-0.058
rated-0.057
Token
Token ID220
Feature activation+4.27e+00

Pos logit contributions
 sharing+0.015
 playing+0.014
 being+0.014
 drawing+0.014
 acting+0.014
Neg logit contributions
 tighter-0.015
use-0.014
-0.014
igi-0.014
Lex-0.013
Token
Token ID198
Feature activation+2.07e-02

Pos logit contributions
 tighter+0.156
use+0.154
igi+0.152
rated+0.146
Lex+0.144
Neg logit contributions
 playing-0.107
 sharing-0.104
 being-0.102
 drawing-0.099
 acting-0.096
Token
Token ID198
Feature activation+3.79e-02

Pos logit contributions
 tighter+0.001
use+0.001
igi+0.001
rated+0.001
Lex+0.001
Neg logit contributions
 playing-0.000
 sharing-0.000
 being-0.000
 drawing-0.000
 acting-0.000
TokenHours
Token ID39792
Feature activation+1.23e+01

Pos logit contributions
 tighter+0.001
use+0.001
igi+0.001
rated+0.001
Lex+0.001
Neg logit contributions
 playing-0.002
 sharing-0.002
 being-0.002
 drawing-0.002
 acting-0.002
Token passed
Token ID3804
Feature activation+3.18e+00

Pos logit contributions
 tighter+0.230
use+0.219
igi+0.203
rated+0.189
+0.174
Neg logit contributions
 sharing-0.513
 playing-0.506
 video-0.498
 drawing-0.496
 acting-0.495
Token,
Token ID11
Feature activation-9.14e-01

Pos logit contributions
use+0.143
igi+0.143
 reins+0.137
Lex+0.136
rated+0.135
Neg logit contributions
 playing-0.058
 being-0.052
 sharing-0.049
 shooting-0.049
 drawing-0.046
Token but
Token ID475
Feature activation-5.79e-01

Pos logit contributions
 playing+0.025
 sharing+0.024
 drawing+0.023
 being+0.023
 acting+0.023
Neg logit contributions
 tighter-0.032
use-0.030
igi-0.029
rated-0.028
Lex-0.028
Token nobody
Token ID8168
Feature activation+6.16e+00

Pos logit contributions
 sharing+0.017
 playing+0.017
 video+0.017
 drawing+0.017
 acting+0.016
Neg logit contributions
 tighter-0.018
use-0.016
igi-0.016
rated-0.016
 steep-0.016
Token came
Token ID1625
Feature activation+1.62e+00

Pos logit contributions
 tighter+0.206
use+0.199
igi+0.197
rated+0.193
Lex+0.183
Neg logit contributions
 playing-0.173
 sharing-0.172
 drawing-0.164
 being-0.164
 acting-0.159
Token baseball
Token ID9283
Feature activation-4.28e+00

Pos logit contributions
 tighter+0.001
use+0.001
igi+0.001
rated+0.001
+0.001
Neg logit contributions
 playing-0.001
 sharing-0.000
 being-0.000
 drawing-0.000
 acting-0.000
Token returned
Token ID4504
Feature activation+3.19e+00

Pos logit contributions
 playing+0.112
 sharing+0.112
 drawing+0.108
 being+0.105
 teasing+0.102
Neg logit contributions
 tighter-0.143
use-0.142
-0.137
rated-0.134
igi-0.127
Token.
Token ID13
Feature activation-8.80e-01

Pos logit contributions
igi+0.119
use+0.119
Lex+0.118
 tighter+0.117
rated+0.116
Neg logit contributions
 playing-0.083
 being-0.077
 sharing-0.077
 drawing-0.071
 doing-0.069
Token
Token ID198
Feature activation-3.24e-01

Pos logit contributions
 playing+0.024
 sharing+0.023
 being+0.023
 drawing+0.022
 acting+0.022
Neg logit contributions
 tighter-0.031
use-0.030
igi-0.030
Lex-0.028
-0.028
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
 sharing+0.010
 playing+0.010
 being+0.010
 acting+0.009
 teasing+0.009
Neg logit contributions
igi-0.010
 tighter-0.010
-0.009
Lex-0.009
',"-0.009
TokenHours
Token ID39792
Feature activation+1.22e+01

Pos logit contributions
 playing+0.012
 drawing+0.012
 sharing+0.012
 being+0.012
 acting+0.012
Neg logit contributions
 tighter-0.006
igi-0.006
use-0.005
Lex-0.005
First-0.005
Token later
Token ID1568
Feature activation+4.28e+00

Pos logit contributions
 tighter+0.265
use+0.250
igi+0.225
rated+0.209
+0.206
Neg logit contributions
 sharing-0.466
 playing-0.466
 video-0.462
 drawing-0.461
 acting-0.455
Token,
Token ID11
Feature activation-1.17e+00

Pos logit contributions
+0.108
 tighter+0.104
use+0.102
Lex+0.094
 slowly+0.093
Neg logit contributions
 sharing-0.135
 playing-0.133
 drawing-0.131
 video-0.130
 joking-0.127
Token the
Token ID262
Feature activation+1.40e-02

Pos logit contributions
 playing+0.027
 sharing+0.027
 drawing+0.026
 being+0.026
 acting+0.025
Neg logit contributions
 tighter-0.044
use-0.040
igi-0.040
-0.040
Lex-0.039
Token sun
Token ID4252
Feature activation-1.28e-01

Pos logit contributions
 tighter+0.000
+0.000
use+0.000
rated+0.000
 steep+0.000
Neg logit contributions
 sharing-0.000
 playing-0.000
 being-0.000
 drawing-0.000
 acting-0.000
Token had
Token ID550
Feature activation-1.67e+00

Pos logit contributions
 sharing+0.004
 playing+0.004
 being+0.004
 trading+0.004
 drawing+0.004
Neg logit contributions
 tighter-0.003
rated-0.003
 slowly-0.003
-0.003
 colder-0.003
Token down
Token ID866
Feature activation+3.20e+00

Pos logit contributions
use+0.136
',"+0.123
First+0.122
igi+0.122
Lex+0.122
Neg logit contributions
 playing-0.108
 being-0.099
 sharing-0.098
 doing-0.092
 having-0.091
Token and
Token ID290
Feature activation-1.39e-03

Pos logit contributions
use+0.159
igi+0.158
 fiber+0.149
rated+0.149
Lex+0.148
Neg logit contributions
 playing-0.059
 being-0.052
 doing-0.047
 sharing-0.047
 having-0.045
Token fell
Token ID3214
Feature activation+5.53e+00

Pos logit contributions
 playing+0.000
 being+0.000
 sharing+0.000
 drawing+0.000
 doing+0.000
Neg logit contributions
igi-0.000
use-0.000
 tighter-0.000
Lex-0.000
',"-0.000
Token asleep
Token ID16039
Feature activation+2.16e+00

Pos logit contributions
use+0.203
igi+0.196
Lex+0.181
',"+0.179
sie+0.179
Neg logit contributions
 playing-0.163
 being-0.154
 shooting-0.144
 sharing-0.144
 doing-0.143
Token.
Token ID13
Feature activation-3.70e-01

Pos logit contributions
 tighter+0.071
use+0.068
igi+0.067
rated+0.064
+0.063
Neg logit contributions
 sharing-0.061
 playing-0.061
 being-0.059
 drawing-0.058
 video-0.057
Token Hours
Token ID19347
Feature activation+1.09e+01

Pos logit contributions
 playing+0.008
 sharing+0.008
 being+0.008
 drawing+0.007
 acting+0.007
Neg logit contributions
 tighter-0.015
use-0.015
igi-0.015
rated-0.014
Lex-0.014
Token later
Token ID1568
Feature activation+4.38e+00

Pos logit contributions
 tighter+0.299
use+0.297
igi+0.281
rated+0.268
Lex+0.261
Neg logit contributions
 sharing-0.365
 playing-0.360
 drawing-0.350
 being-0.348
 video-0.346
Token it
Token ID340
Feature activation+1.07e+00

Pos logit contributions
 tighter+0.145
use+0.144
igi+0.138
+0.137
rated+0.134
Neg logit contributions
 sharing-0.119
 playing-0.117
 being-0.112
 drawing-0.111
 video-0.109
Token woke
Token ID19092
Feature activation+2.47e+00

Pos logit contributions
 tighter+0.042
igi+0.042
use+0.041
Lex+0.040
rated+0.039
Neg logit contributions
 playing-0.024
 being-0.023
 sharing-0.022
 drawing-0.022
 acting-0.021
Token up
Token ID510
Feature activation+2.45e+00

Pos logit contributions
use+0.112
 reins+0.103
igi+0.103
suits+0.102
First+0.101
Neg logit contributions
 playing-0.049
 being-0.049
 sharing-0.044
 doing-0.043
 shooting-0.043
Token and
Token ID290
Feature activation-8.30e-02

Pos logit contributions
igi+0.108
use+0.105
 reins+0.102
 tighter+0.100
Lex+0.099
Neg logit contributions
 playing-0.056
 being-0.051
 sharing-0.049
 doing-0.048
 shooting-0.047
Token it
Token ID340
Feature activation+1.12e+00

Pos logit contributions
 tighter+0.110
use+0.109
igi+0.109
rated+0.105
Lex+0.102
Neg logit contributions
 playing-0.067
 being-0.063
 sharing-0.062
 drawing-0.060
 acting-0.057
Token was
Token ID373
Feature activation-1.65e+00

Pos logit contributions
use+0.049
 tighter+0.049
igi+0.049
Lex+0.046
 reins+0.046
Neg logit contributions
 playing-0.020
 being-0.019
 sharing-0.018
 drawing-0.017
 acting-0.017
Token smooth
Token ID7209
Feature activation+2.03e-01

Pos logit contributions
 playing+0.044
 being+0.042
 sharing+0.041
 doing+0.039
 drawing+0.038
Neg logit contributions
use-0.065
igi-0.062
 tighter-0.059
',"-0.058
 reins-0.058
Token again
Token ID757
Feature activation+7.49e+00

Pos logit contributions
use+0.008
igi+0.008
Lex+0.007
 tighter+0.007
 reins+0.007
Neg logit contributions
 playing-0.005
 being-0.005
 sharing-0.005
 doing-0.005
 drawing-0.004
Token.
Token ID13
Feature activation-6.52e-01

Pos logit contributions
 tighter+0.247
use+0.244
igi+0.244
rated+0.231
Lex+0.229
Neg logit contributions
 playing-0.219
 sharing-0.212
 being-0.209
 drawing-0.202
 acting-0.198
Token Hours
Token ID19347
Feature activation+1.09e+01

Pos logit contributions
 playing+0.017
 sharing+0.017
 being+0.017
 drawing+0.016
 acting+0.016
Neg logit contributions
use-0.023
 tighter-0.023
igi-0.023
rated-0.022
Lex-0.021
Token and
Token ID290
Feature activation+4.30e-03

Pos logit contributions
 tighter+0.294
use+0.291
igi+0.277
rated+0.260
Lex+0.256
Neg logit contributions
 sharing-0.368
 playing-0.365
 drawing-0.355
 being-0.352
 video-0.350
Token hours
Token ID2250
Feature activation+4.47e+00

Pos logit contributions
use+0.000
igi+0.000
 tighter+0.000
rated+0.000
Lex+0.000
Neg logit contributions
 playing-0.000
 sharing-0.000
 being-0.000
 drawing-0.000
 acting-0.000
Token went
Token ID1816
Feature activation+4.05e+00

Pos logit contributions
 tighter+0.125
igi+0.121
use+0.117
 minutes+0.114
Lex+0.110
Neg logit contributions
 sharing-0.141
 video-0.136
 drawing-0.130
 teasing-0.130
 acting-0.128
Token by
Token ID416
Feature activation-3.85e+00

Pos logit contributions
 tighter+0.129
use+0.128
igi+0.126
rated+0.120
Lex+0.118
Neg logit contributions
 playing-0.122
 sharing-0.119
 being-0.117
 drawing-0.114
 acting-0.112
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 sharing+0.109
 playing+0.106
 being+0.102
 drawing+0.101
 video+0.101
Neg logit contributions
 tighter-0.122
use-0.114
igi-0.113
rated-0.110
Lex-0.107
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
 tighter+0.101
use+0.097
igi+0.095
rated+0.091
Lex+0.089
Neg logit contributions
 playing-0.079
 sharing-0.078
 being-0.075
 drawing-0.074
 acting-0.073
Token door
Token ID3420
Feature activation+4.37e+00

Pos logit contributions
 tighter+0.004
use+0.003
igi+0.003
rated+0.003
Lex+0.003
Neg logit contributions
 playing-0.004
 sharing-0.004
 being-0.004
 drawing-0.004
 acting-0.003
Token would
Token ID561
Feature activation+2.88e+00

Pos logit contributions
igi+0.144
use+0.140
Lex+0.138
 tighter+0.135
rated+0.128
Neg logit contributions
 playing-0.128
 being-0.124
 sharing-0.121
 drawing-0.117
 doing-0.115
Token open
Token ID1280
Feature activation+3.88e+00

Pos logit contributions
 tighter+0.113
use+0.111
igi+0.110
rated+0.106
Lex+0.104
Neg logit contributions
 playing-0.065
 sharing-0.064
 being-0.062
 drawing-0.060
 acting-0.058
Token.
Token ID13
Feature activation-7.85e-01

Pos logit contributions
igi+0.149
use+0.145
Lex+0.143
rated+0.140
 tighter+0.137
Neg logit contributions
 playing-0.102
 being-0.095
 sharing-0.093
 shooting-0.087
 drawing-0.087
Token Hours
Token ID19347
Feature activation+1.08e+01

Pos logit contributions
 playing+0.025
 sharing+0.024
 being+0.024
 drawing+0.023
 acting+0.023
Neg logit contributions
 tighter-0.024
use-0.023
igi-0.023
rated-0.022
Lex-0.022
Token passed
Token ID3804
Feature activation+3.16e+00

Pos logit contributions
 tighter+0.258
use+0.256
igi+0.234
rated+0.221
+0.216
Neg logit contributions
 sharing-0.404
 playing-0.395
 drawing-0.390
 video-0.387
 being-0.384
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
igi+0.120
use+0.119
 tighter+0.118
rated+0.113
Lex+0.113
Neg logit contributions
 playing-0.077
 being-0.072
 sharing-0.072
 drawing-0.068
 shooting-0.068
Token but
Token ID475
Feature activation-7.37e-01

Pos logit contributions
 sharing+0.032
 playing+0.032
 drawing+0.031
 video+0.030
 being+0.030
Neg logit contributions
 tighter-0.032
use-0.029
igi-0.028
rated-0.028
Lex-0.027
Token no
Token ID645
Feature activation+7.24e+00

Pos logit contributions
 sharing+0.024
 video+0.023
 playing+0.023
 drawing+0.023
 acting+0.022
Neg logit contributions
 tighter-0.021
 steep-0.018
igi-0.018
use-0.018
 colder-0.018
Token-
Token ID12
Feature activation-1.07e+00

Pos logit contributions
igi+0.193
use+0.184
Lex+0.174
 tighter+0.173
rated+0.165
Neg logit contributions
 playing-0.276
 being-0.269
 sharing-0.266
 drawing-0.259
 shooting-0.256
Token his
Token ID465
Feature activation-1.61e-01

Pos logit contributions
 sharing+0.228
 being+0.220
 acting+0.216
 doing+0.215
 teasing+0.214
Neg logit contributions
 tighter-0.190
 corn-0.182
-0.179
 pans-0.174
 eggs-0.174
Token tight
Token ID5381
Feature activation+3.95e+00

Pos logit contributions
 sharing+0.006
 being+0.005
 acting+0.005
 playing+0.005
 teasing+0.005
Neg logit contributions
 tighter-0.004
use-0.004
-0.004
igi-0.004
 reins-0.004
Token yellow
Token ID7872
Feature activation-8.82e-01

Pos logit contributions
 tighter+0.125
igi+0.125
use+0.123
rated+0.116
Lex+0.116
Neg logit contributions
 playing-0.121
 sharing-0.115
 being-0.114
 drawing-0.112
 acting-0.109
Token shirt
Token ID10147
Feature activation-2.99e+00

Pos logit contributions
 playing+0.027
 sharing+0.027
 being+0.027
 drawing+0.026
 acting+0.026
Neg logit contributions
 tighter-0.026
use-0.026
igi-0.025
rated-0.024
Lex-0.024
Token.
Token ID13
Feature activation-4.13e-02

Pos logit contributions
 playing+0.089
 sharing+0.087
 being+0.085
 drawing+0.083
 acting+0.081
Neg logit contributions
 tighter-0.095
use-0.093
igi-0.092
rated-0.088
Lex-0.086
Token One
Token ID1881
Feature activation+1.06e+01

Pos logit contributions
 playing+0.001
 sharing+0.001
 being+0.001
 drawing+0.001
 acting+0.001
Neg logit contributions
 tighter-0.001
use-0.001
igi-0.001
rated-0.001
Lex-0.001
Token day
Token ID1110
Feature activation-1.70e+00

Pos logit contributions
 tighter+0.316
rated+0.298
 steep+0.281
igi+0.280
+0.279
Neg logit contributions
 sharing-0.325
 being-0.315
 acting-0.314
 trading-0.307
 playing-0.306
Token,
Token ID11
Feature activation-4.88e-01

Pos logit contributions
 sharing+0.048
 being+0.047
 playing+0.046
 acting+0.046
 video+0.046
Neg logit contributions
use-0.052
 tighter-0.051
-0.050
igi-0.049
Lex-0.048
Token he
Token ID339
Feature activation+1.45e+00

Pos logit contributions
 sharing+0.015
 being+0.014
 playing+0.014
 acting+0.014
 drawing+0.014
Neg logit contributions
 tighter-0.015
-0.014
use-0.013
igi-0.013
Lex-0.013
Token asked
Token ID1965
Feature activation+4.64e+00

Pos logit contributions
 tighter+0.050
use+0.049
igi+0.049
rated+0.046
Lex+0.046
Neg logit contributions
 playing-0.040
 sharing-0.039
 being-0.038
 drawing-0.037
 acting-0.036
Token his
Token ID465
Feature activation-2.37e-01

Pos logit contributions
 tighter+0.190
use+0.187
igi+0.185
rated+0.178
Lex+0.176
Neg logit contributions
 playing-0.098
 sharing-0.095
 being-0.092
 drawing-0.089
 acting-0.087
Token liked
Token ID8288
Feature activation-1.95e-01

Pos logit contributions
 drawing+0.065
 shooting+0.063
 video+0.062
 teasing+0.061
 performing+0.061
Neg logit contributions
use-0.044
Maybe-0.044
 tighter-0.044
OK-0.043
 slowly-0.043
Token to
Token ID284
Feature activation+5.39e-01

Pos logit contributions
 video+0.007
 acting+0.007
 sharing+0.007
 teasing+0.007
 doing+0.007
Neg logit contributions
 tighter-0.003
 eggs-0.003
 corn-0.003
 slowly-0.003
 steep-0.003
Token play
Token ID711
Feature activation-5.26e+00

Pos logit contributions
use+0.022
 tighter+0.022
igi+0.021
rated+0.021
+0.021
Neg logit contributions
 playing-0.010
 sharing-0.010
 being-0.010
 acting-0.009
 drawing-0.009
Token outside
Token ID2354
Feature activation+7.10e-01

Pos logit contributions
 sharing+0.152
 being+0.144
 lending+0.143
 doing+0.142
 drawing+0.142
Neg logit contributions
 tighter-0.144
-0.137
 slowly-0.128
.'"-0.126
use-0.126
Token.
Token ID13
Feature activation-2.22e-01

Pos logit contributions
 tighter+0.024
use+0.023
igi+0.023
rated+0.022
Lex+0.021
Neg logit contributions
 playing-0.020
 sharing-0.020
 being-0.019
 drawing-0.019
 acting-0.018
Token One
Token ID1881
Feature activation+1.06e+01

Pos logit contributions
 playing+0.008
 sharing+0.008
 being+0.008
 drawing+0.007
 acting+0.007
Neg logit contributions
 tighter-0.006
use-0.006
igi-0.006
rated-0.005
Lex-0.005
Token day
Token ID1110
Feature activation-1.65e+00

Pos logit contributions
rated+0.310
 tighter+0.308
 steep+0.287
igi+0.280
+0.270
Neg logit contributions
 sharing-0.331
 being-0.316
 acting-0.316
 drawing-0.316
 trading-0.313
Token,
Token ID11
Feature activation-6.45e-01

Pos logit contributions
 sharing+0.047
 video+0.046
 being+0.045
 drawing+0.045
 acting+0.045
Neg logit contributions
 tighter-0.050
use-0.048
igi-0.048
Lex-0.046
-0.045
Token they
Token ID484
Feature activation-8.43e-01

Pos logit contributions
 sharing+0.015
 being+0.014
 playing+0.014
 drawing+0.014
 acting+0.014
Neg logit contributions
 tighter-0.024
igi-0.023
use-0.022
Lex-0.022
-0.022
Token saw
Token ID2497
Feature activation+1.04e-01

Pos logit contributions
 playing+0.030
 sharing+0.030
 being+0.029
 drawing+0.029
 acting+0.028
Neg logit contributions
 tighter-0.021
use-0.021
igi-0.020
rated-0.019
Lex-0.019
Token a
Token ID257
Feature activation+1.32e+00

Pos logit contributions
 tighter+0.003
 eggs+0.003
 colder+0.003
 steep+0.002
igi+0.002
Neg logit contributions
 sharing-0.003
 acting-0.003
 being-0.003
 playing-0.003
 performing-0.003
Token brave
Token ID14802
Feature activation+2.64e+00

Pos logit contributions
 tighter+0.073
use+0.072
igi+0.071
rated+0.068
Lex+0.066
Neg logit contributions
 playing-0.064
 sharing-0.062
 being-0.060
 drawing-0.059
 acting-0.057
Token girl
Token ID2576
Feature activation-2.11e+00

Pos logit contributions
 tighter+0.085
use+0.083
igi+0.082
rated+0.078
Lex+0.077
Neg logit contributions
 playing-0.077
 sharing-0.076
 being-0.074
 drawing-0.072
 acting-0.071
Token named
Token ID3706
Feature activation-7.52e-01

Pos logit contributions
 sharing+0.050
 playing+0.049
 being+0.049
 acting+0.047
 shooting+0.047
Neg logit contributions
use-0.075
 tighter-0.075
igi-0.073
-0.069
.'"-0.069
Token Jane
Token ID12091
Feature activation+2.42e+00

Pos logit contributions
 playing+0.016
 sharing+0.014
 drawing+0.014
 being+0.014
 teasing+0.013
Neg logit contributions
 tighter-0.033
igi-0.032
use-0.031
 reins-0.031
rated-0.030
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
 tighter+0.085
use+0.083
igi+0.082
rated+0.079
Lex+0.078
Neg logit contributions
 playing-0.068
 sharing-0.065
 being-0.064
 drawing-0.062
 doing-0.059
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 playing+0.009
 sharing+0.009
 being+0.008
 drawing+0.008
 doing+0.008
Neg logit contributions
 tighter-0.011
use-0.010
igi-0.010
Lex-0.010
 reins-0.010
Token day
Token ID1110
Feature activation-1.81e+00

Pos logit contributions
 tighter+0.309
rated+0.308
igi+0.289
 steep+0.285
+0.284
Neg logit contributions
 sharing-0.322
 being-0.306
 playing-0.306
 trading-0.304
 acting-0.303
Token,
Token ID11
Feature activation-6.51e-01

Pos logit contributions
 playing+0.047
 sharing+0.047
 being+0.045
 drawing+0.044
 acting+0.044
Neg logit contributions
 tighter-0.062
use-0.062
igi-0.061
rated-0.058
Lex-0.057
Token she
Token ID673
Feature activation+2.52e+00

Pos logit contributions
 playing+0.020
 sharing+0.019
 being+0.019
 drawing+0.018
 teasing+0.018
Neg logit contributions
 tighter-0.021
use-0.021
igi-0.021
rated-0.020
Lex-0.019
Token was
Token ID373
Feature activation-1.46e+00

Pos logit contributions
 tighter+0.081
use+0.081
igi+0.079
rated+0.076
+0.074
Neg logit contributions
 playing-0.072
 sharing-0.072
 being-0.070
 drawing-0.068
 teasing-0.066
Token searching
Token ID10342
Feature activation-5.03e-01

Pos logit contributions
 video+0.023
 sharing+0.021
 Sharing+0.020
 character+0.018
 joke+0.017
Neg logit contributions
 tighter-0.058
 colder-0.056
Lex-0.055
 eggs-0.055
 lam-0.055
Token you
Token ID345
Feature activation-9.73e-01

Pos logit contributions
 tighter+0.010
use+0.010
igi+0.009
+0.009
Lex+0.009
Neg logit contributions
 sharing-0.011
 playing-0.011
 drawing-0.010
 video-0.010
 acting-0.010
Tokenâ
Token ID22940
Feature activation+3.31e+00

Pos logit contributions
 playing+0.029
 sharing+0.029
 being+0.028
 drawing+0.027
 doing+0.026
Neg logit contributions
igi-0.031
use-0.031
 tighter-0.031
rated-0.029
Lex-0.029
Token
Token ID26391
Feature activation-2.14e+00

Pos logit contributions
use+0.087
 tighter+0.085
igi+0.083
rated+0.082
+0.078
Neg logit contributions
 playing-0.121
 sharing-0.118
 being-0.115
 drawing-0.114
 acting-0.112
Token
Token ID8151
Feature activation+1.62e+00

Pos logit contributions
 playing+0.071
 sharing+0.070
 being+0.069
 drawing+0.067
 acting+0.066
Neg logit contributions
 tighter-0.059
use-0.059
igi-0.058
rated-0.054
-0.054
Token.
Token ID13
Feature activation+3.29e-03

Pos logit contributions
 tighter+0.053
use+0.052
igi+0.050
rated+0.050
Lex+0.048
Neg logit contributions
 playing-0.048
 sharing-0.047
 being-0.046
 drawing-0.044
 teasing-0.043
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 tighter+0.000
igi+0.000
use+0.000
rated+0.000
 colder+0.000
Neg logit contributions
 playing-0.000
 sharing-0.000
 being-0.000
 wearing-0.000
 having-0.000
Token day
Token ID1110
Feature activation-1.75e+00

Pos logit contributions
 tighter+0.332
rated+0.330
use+0.317
igi+0.309
+0.308
Neg logit contributions
 sharing-0.291
 playing-0.286
 drawing-0.285
 being-0.278
 teasing-0.275
Token she
Token ID673
Feature activation+2.59e+00

Pos logit contributions
 playing+0.041
 being+0.037
 sharing+0.037
 doing+0.034
 drawing+0.034
Neg logit contributions
 tighter-0.073
igi-0.072
use-0.069
rated-0.069
 reins-0.066
Token was
Token ID373
Feature activation-1.39e+00

Pos logit contributions
 tighter+0.100
igi+0.097
use+0.094
Lex+0.091
 reins+0.091
Neg logit contributions
 playing-0.066
 sharing-0.061
 being-0.060
 drawing-0.058
 acting-0.058
Token in
Token ID287
Feature activation-7.74e-03

Pos logit contributions
 sharing+0.020
 video+0.018
 being+0.017
 playing+0.017
 drawing+0.016
Neg logit contributions
 tighter-0.062
use-0.061
igi-0.060
Lex-0.059
rated-0.058
Token her
Token ID607
Feature activation+7.23e-01

Pos logit contributions
 sharing+0.000
 playing+0.000
 drawing+0.000
 being+0.000
 video+0.000
Neg logit contributions
 tighter-0.000
use-0.000
igi-0.000
Lex-0.000
-0.000
Token barn
Token ID25203
Feature activation+1.16e+00

Pos logit contributions
 tighter+0.045
use+0.044
igi+0.043
rated+0.041
Lex+0.041
Neg logit contributions
 playing-0.047
 sharing-0.047
 being-0.046
 drawing-0.044
 acting-0.044
Token full
Token ID1336
Feature activation-1.41e+00

Pos logit contributions
 tighter+0.025
igi+0.025
 eggs+0.024
+0.023
.'"+0.023
Neg logit contributions
 sharing-0.041
 teasing-0.040
 playing-0.040
 joking-0.040
 being-0.040
Token of
Token ID286
Feature activation-1.99e+00

Pos logit contributions
 playing+0.035
 sharing+0.034
 being+0.033
 drawing+0.032
 acting+0.031
Neg logit contributions
 tighter-0.053
use-0.053
igi-0.052
rated-0.050
Lex-0.049
Token hay
Token ID27678
Feature activation+2.92e+00

Pos logit contributions
 sharing+0.066
 playing+0.066
 being+0.064
 drawing+0.064
 acting+0.063
Neg logit contributions
 tighter-0.054
igi-0.053
use-0.052
Lex-0.049
rated-0.049
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
 eggs+0.069
 tighter+0.067
igi+0.066
use+0.062
.'"+0.061
Neg logit contributions
 sharing-0.101
 teasing-0.098
 shooting-0.097
 being-0.096
 playing-0.095
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 playing+0.006
 sharing+0.006
 being+0.006
 drawing+0.006
 acting+0.005
Neg logit contributions
 tighter-0.009
use-0.009
igi-0.009
rated-0.008
Lex-0.008
Token day
Token ID1110
Feature activation-1.81e+00

Pos logit contributions
 tighter+0.431
use+0.430
igi+0.419
rated+0.400
Lex+0.399
Neg logit contributions
 playing-0.226
 sharing-0.214
 being-0.210
 drawing-0.200
 acting-0.193
Token,
Token ID11
Feature activation-6.42e-01

Pos logit contributions
 playing+0.044
 sharing+0.042
 being+0.041
 drawing+0.040
 acting+0.039
Neg logit contributions
 tighter-0.070
use-0.068
igi-0.068
rated-0.065
Lex-0.064
Token a
Token ID257
Feature activation+1.46e+00

Pos logit contributions
 playing+0.014
 sharing+0.013
 being+0.013
 drawing+0.012
 acting+0.012
Neg logit contributions
use-0.027
 tighter-0.027
igi-0.027
rated-0.026
Lex-0.025
Token mean
Token ID1612
Feature activation-7.97e-01

Pos logit contributions
 tighter+0.045
use+0.045
igi+0.044
rated+0.042
Lex+0.041
Neg logit contributions
 playing-0.045
 sharing-0.044
 being-0.043
 drawing-0.042
 acting-0.041
Token fox
Token ID21831
Feature activation-9.30e-01

Pos logit contributions
 playing+0.024
 sharing+0.023
 being+0.023
 drawing+0.022
 acting+0.022
Neg logit contributions
 tighter-0.025
igi-0.025
use-0.025
rated-0.023
Lex-0.023
Token to
Token ID284
Feature activation+7.52e-01

Pos logit contributions
 tighter+0.045
use+0.037
 steep+0.035
 reins+0.035
igi+0.034
Neg logit contributions
 video-0.068
 playing-0.061
 drawing-0.061
 shooting-0.061
 character-0.061
Token lose
Token ID4425
Feature activation-1.63e+00

Pos logit contributions
 tighter+0.027
use+0.026
igi+0.025
rated+0.025
Lex+0.024
Neg logit contributions
 playing-0.019
 sharing-0.019
 being-0.018
 drawing-0.018
 acting-0.017
Token anything
Token ID1997
Feature activation+3.05e+00

Pos logit contributions
 playing+0.038
 sharing+0.035
 being+0.035
 drawing+0.033
 teasing+0.032
Neg logit contributions
igi-0.064
 tighter-0.064
use-0.064
rated-0.062
Lex-0.060
Token important
Token ID1593
Feature activation+1.67e-01

Pos logit contributions
 tighter+0.098
use+0.096
igi+0.091
rated+0.089
+0.086
Neg logit contributions
 playing-0.087
 sharing-0.086
 drawing-0.084
 being-0.083
 video-0.081
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
 tighter+0.005
use+0.005
igi+0.005
rated+0.005
+0.004
Neg logit contributions
 playing-0.005
 sharing-0.005
 drawing-0.005
 being-0.005
 video-0.005
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 playing+0.003
 being+0.003
 sharing+0.003
 drawing+0.003
 doing+0.003
Neg logit contributions
 tighter-0.004
igi-0.004
use-0.004
rated-0.004
Lex-0.004
Token day
Token ID1110
Feature activation-1.75e+00

Pos logit contributions
 tighter+0.391
use+0.376
igi+0.372
rated+0.367
Lex+0.352
Neg logit contributions
 playing-0.256
 sharing-0.254
 being-0.245
 drawing-0.238
 acting-0.235
Token,
Token ID11
Feature activation-6.40e-01

Pos logit contributions
 playing+0.044
 sharing+0.044
 being+0.042
 drawing+0.041
 acting+0.040
Neg logit contributions
 tighter-0.063
use-0.062
igi-0.061
rated-0.059
Lex-0.058
Token he
Token ID339
Feature activation+1.23e+00

Pos logit contributions
 playing+0.015
 sharing+0.014
 being+0.014
 drawing+0.013
 acting+0.013
Neg logit contributions
 tighter-0.025
use-0.025
igi-0.025
rated-0.024
Lex-0.023
Token lost
Token ID2626
Feature activation-2.55e+00

Pos logit contributions
 tighter+0.044
igi+0.044
use+0.043
rated+0.041
Lex+0.041
Neg logit contributions
 playing-0.033
 sharing-0.031
 being-0.031
 drawing-0.030
 acting-0.029
Token his
Token ID465
Feature activation-3.54e-01

Pos logit contributions
 playing+0.055
 sharing+0.053
 being+0.051
 drawing+0.050
 acting+0.048
Neg logit contributions
 tighter-0.103
use-0.101
igi-0.101
rated-0.097
Lex-0.095
Token little
Token ID1310
Feature activation+9.03e-01

Pos logit contributions
 tighter+0.047
use+0.047
igi+0.046
rated+0.045
Lex+0.042
Neg logit contributions
 playing-0.044
 sharing-0.042
 being-0.041
 drawing-0.041
 acting-0.040
Token boy
Token ID2933
Feature activation-3.50e+00

Pos logit contributions
 tighter+0.025
use+0.025
igi+0.025
rated+0.024
Lex+0.022
Neg logit contributions
 playing-0.032
 sharing-0.031
 being-0.030
 drawing-0.029
 acting-0.029
Token called
Token ID1444
Feature activation+1.12e-01

Pos logit contributions
 sharing+0.083
 being+0.080
 playing+0.079
 drawing+0.077
 acting+0.077
Neg logit contributions
use-0.126
 tighter-0.125
igi-0.122
Lex-0.116
-0.116
Token Jack
Token ID3619
Feature activation-9.87e-01

Pos logit contributions
 tighter+0.004
use+0.004
igi+0.004
Lex+0.004
rated+0.004
Neg logit contributions
 sharing-0.002
 playing-0.002
 being-0.002
 drawing-0.002
 acting-0.002
Token.
Token ID13
Feature activation-3.40e-01

Pos logit contributions
 playing+0.031
 sharing+0.031
 being+0.030
 drawing+0.029
 acting+0.029
Neg logit contributions
 tighter-0.030
use-0.029
igi-0.029
rated-0.027
Lex-0.027
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 being+0.008
 playing+0.008
 drawing+0.008
 acting+0.008
 teasing+0.008
Neg logit contributions
-0.011
 tighter-0.011
use-0.011
rated-0.010
First-0.010
Token day
Token ID1110
Feature activation-1.78e+00

Pos logit contributions
 tighter+0.317
rated+0.312
igi+0.297
+0.294
use+0.290
Neg logit contributions
 sharing-0.313
 being-0.299
 playing-0.296
 acting-0.295
 trading-0.294
Token,
Token ID11
Feature activation-6.97e-01

Pos logit contributions
 being+0.054
 sharing+0.053
 acting+0.053
 video+0.052
 drawing+0.051
Neg logit contributions
-0.049
use-0.049
 tighter-0.048
',"-0.044
rated-0.043
Token he
Token ID339
Feature activation+1.38e+00

Pos logit contributions
 sharing+0.018
 being+0.018
 acting+0.017
 playing+0.017
 video+0.017
Neg logit contributions
 tighter-0.023
-0.022
use-0.021
rated-0.020
igi-0.020
Token was
Token ID373
Feature activation-1.45e+00

Pos logit contributions
 tighter+0.047
use+0.047
igi+0.046
rated+0.044
Lex+0.043
Neg logit contributions
 playing-0.038
 sharing-0.037
 being-0.036
 drawing-0.035
 acting-0.034
Token feeling
Token ID4203
Feature activation-6.58e-01

Pos logit contributions
 video+0.022
 Sharing+0.021
 sharing+0.020
 Spike+0.018
 Zero+0.018
Neg logit contributions
 tighter-0.058
 eggs-0.057
 colder-0.056
 fever-0.056
 lam-0.055
Token a
Token ID257
Feature activation+1.43e+00

Pos logit contributions
 sharing+0.041
 being+0.039
 playing+0.039
 acting+0.039
 teasing+0.039
Neg logit contributions
 tighter-0.047
igi-0.044
Lex-0.044
use-0.042
-0.042
Token girl
Token ID2576
Feature activation-2.06e+00

Pos logit contributions
 tighter+0.041
use+0.040
igi+0.040
Lex+0.038
rated+0.037
Neg logit contributions
 playing-0.046
 sharing-0.045
 being-0.045
 drawing-0.043
 acting-0.043
Token named
Token ID3706
Feature activation-9.03e-01

Pos logit contributions
 sharing+0.047
 playing+0.046
 being+0.046
 acting+0.044
 drawing+0.044
Neg logit contributions
 tighter-0.076
use-0.075
igi-0.074
Lex-0.070
-0.070
Token Lily
Token ID20037
Feature activation-2.71e-01

Pos logit contributions
 playing+0.021
 sharing+0.020
 being+0.020
 drawing+0.019
 acting+0.018
Neg logit contributions
 tighter-0.035
use-0.035
igi-0.035
rated-0.033
Lex-0.033
Token.
Token ID13
Feature activation-4.34e-01

Pos logit contributions
 sharing+0.009
 playing+0.009
 being+0.008
 drawing+0.008
 acting+0.008
Neg logit contributions
 tighter-0.008
use-0.007
igi-0.007
rated-0.007
Lex-0.007
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 playing+0.014
 sharing+0.014
 being+0.014
 shooting+0.013
 acting+0.013
Neg logit contributions
 tighter-0.012
use-0.012
igi-0.011
rated-0.011
-0.011
Token day
Token ID1110
Feature activation-1.78e+00

Pos logit contributions
 tighter+0.305
rated+0.300
igi+0.282
+0.280
 steep+0.275
Neg logit contributions
 sharing-0.320
 being-0.309
 trading-0.309
 acting-0.308
 playing-0.305
Token,
Token ID11
Feature activation-7.11e-01

Pos logit contributions
 sharing+0.047
 playing+0.046
 being+0.046
 acting+0.045
 drawing+0.044
Neg logit contributions
use-0.060
 tighter-0.060
igi-0.058
-0.056
Lex-0.055
Token Lily
Token ID20037
Feature activation-3.13e-01

Pos logit contributions
 sharing+0.020
 playing+0.020
 being+0.020
 acting+0.019
 drawing+0.019
Neg logit contributions
 tighter-0.023
use-0.022
igi-0.022
-0.020
Lex-0.020
Token and
Token ID290
Feature activation+9.09e-02

Pos logit contributions
 playing+0.008
 sharing+0.008
 being+0.008
 drawing+0.008
 acting+0.007
Neg logit contributions
 tighter-0.011
use-0.011
igi-0.011
rated-0.010
Lex-0.010
Token her
Token ID607
Feature activation+5.43e-01

Pos logit contributions
 tighter+0.002
use+0.002
Lex+0.002
+0.002
igi+0.002
Neg logit contributions
 acting-0.003
 shooting-0.003
 video-0.003
 performing-0.003
 sharing-0.003
Token little
Token ID1310
Feature activation+9.44e-01

Pos logit contributions
 tighter+0.040
use+0.039
igi+0.039
Lex+0.037
+0.036
Neg logit contributions
 playing-0.046
 sharing-0.045
 being-0.045
 drawing-0.043
 acting-0.043
Token girl
Token ID2576
Feature activation-2.03e+00

Pos logit contributions
 tighter+0.025
use+0.024
igi+0.024
rated+0.023
Lex+0.022
Neg logit contributions
 playing-0.034
 sharing-0.033
 being-0.032
 drawing-0.032
 acting-0.031
Token named
Token ID3706
Feature activation-8.66e-01

Pos logit contributions
 playing+0.044
 sharing+0.044
 being+0.043
 acting+0.040
 drawing+0.040
Neg logit contributions
 tighter-0.078
use-0.078
igi-0.077
Lex-0.073
-0.072
Token Lily
Token ID20037
Feature activation-1.27e-01

Pos logit contributions
 playing+0.019
 sharing+0.017
 drawing+0.017
 being+0.017
 teasing+0.015
Neg logit contributions
 tighter-0.037
igi-0.036
use-0.036
rated-0.035
Lex-0.034
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
 playing+0.004
 sharing+0.004
 being+0.004
 drawing+0.004
 acting+0.004
Neg logit contributions
 tighter-0.004
use-0.004
igi-0.004
rated-0.003
Lex-0.003
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 playing+0.009
 sharing+0.009
 being+0.009
 drawing+0.009
 acting+0.009
Neg logit contributions
 tighter-0.008
use-0.008
igi-0.007
rated-0.007
Lex-0.007
Token day
Token ID1110
Feature activation-1.78e+00

Pos logit contributions
 tighter+0.323
rated+0.316
igi+0.308
+0.299
use+0.298
Neg logit contributions
 sharing-0.306
 being-0.295
 playing-0.293
 acting-0.291
 trading-0.289
Token,
Token ID11
Feature activation-6.36e-01

Pos logit contributions
 playing+0.045
 sharing+0.045
 being+0.044
 drawing+0.042
 acting+0.042
Neg logit contributions
 tighter-0.063
use-0.063
igi-0.061
rated-0.058
Lex-0.058
Token she
Token ID673
Feature activation+2.58e+00

Pos logit contributions
 playing+0.019
 sharing+0.019
 being+0.018
 drawing+0.018
 acting+0.018
Neg logit contributions
 tighter-0.020
use-0.020
igi-0.020
rated-0.019
Lex-0.018
Token was
Token ID373
Feature activation-1.50e+00

Pos logit contributions
 tighter+0.088
use+0.087
igi+0.086
rated+0.082
Lex+0.081
Neg logit contributions
 playing-0.071
 sharing-0.070
 being-0.068
 drawing-0.066
 acting-0.065
Token playing
Token ID2712
Feature activation-5.76e+00

Pos logit contributions
 video+0.022
 sharing+0.020
 Sharing+0.019
 Spike+0.018
 character+0.017
Neg logit contributions
 tighter-0.061
 eggs-0.060
 colder-0.058
 lam-0.058
 fever-0.058
Token little
Token ID1310
Feature activation+8.44e-01

Pos logit contributions
 tighter+0.040
use+0.039
igi+0.039
Lex+0.037
+0.036
Neg logit contributions
 playing-0.046
 sharing-0.045
 being-0.045
 drawing-0.043
 acting-0.042
Token girl
Token ID2576
Feature activation-2.01e+00

Pos logit contributions
 tighter+0.022
use+0.022
igi+0.022
rated+0.021
Lex+0.020
Neg logit contributions
 playing-0.030
 sharing-0.029
 being-0.029
 drawing-0.028
 acting-0.028
Token named
Token ID3706
Feature activation-8.96e-01

Pos logit contributions
 playing+0.043
 sharing+0.043
 being+0.042
 drawing+0.040
 acting+0.040
Neg logit contributions
 tighter-0.079
use-0.078
igi-0.077
Lex-0.073
rated-0.072
Token Lily
Token ID20037
Feature activation-2.11e-01

Pos logit contributions
 playing+0.019
 sharing+0.017
 drawing+0.017
 being+0.017
 teasing+0.015
Neg logit contributions
 tighter-0.039
igi-0.038
use-0.038
rated-0.037
 reins-0.036
Token.
Token ID13
Feature activation-3.99e-01

Pos logit contributions
 playing+0.007
 sharing+0.007
 being+0.006
 drawing+0.006
 acting+0.006
Neg logit contributions
 tighter-0.006
use-0.006
igi-0.006
rated-0.006
Lex-0.006
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 playing+0.013
 sharing+0.013
 being+0.013
 acting+0.012
 drawing+0.012
Neg logit contributions
 tighter-0.011
use-0.011
igi-0.011
rated-0.010
-0.010
Token day
Token ID1110
Feature activation-1.75e+00

Pos logit contributions
 tighter+0.323
rated+0.315
igi+0.307
+0.298
use+0.297
Neg logit contributions
 sharing-0.305
 being-0.294
 playing-0.292
 acting-0.291
 trading-0.288
Token,
Token ID11
Feature activation-6.49e-01

Pos logit contributions
 sharing+0.046
 being+0.046
 playing+0.046
 acting+0.044
 drawing+0.044
Neg logit contributions
use-0.058
 tighter-0.058
igi-0.056
-0.055
Lex-0.054
Token Lily
Token ID20037
Feature activation-2.60e-01

Pos logit contributions
 playing+0.019
 sharing+0.019
 being+0.019
 acting+0.018
 drawing+0.018
Neg logit contributions
 tighter-0.020
use-0.020
igi-0.019
rated-0.018
Lex-0.018
Token went
Token ID1816
Feature activation+4.20e+00

Pos logit contributions
 playing+0.007
 sharing+0.006
 being+0.006
 drawing+0.006
 acting+0.006
Neg logit contributions
 tighter-0.010
use-0.009
igi-0.009
rated-0.009
Lex-0.009
Token for
Token ID329
Feature activation-4.39e-01

Pos logit contributions
 tighter+0.121
use+0.117
igi+0.110
 reins+0.106
Lex+0.101
Neg logit contributions
 sharing-0.132
 being-0.125
 teasing-0.121
 acting-0.121
 shooting-0.121
Token to
Token ID284
Feature activation+6.94e-01

Pos logit contributions
 tighter+0.172
use+0.169
igi+0.168
rated+0.160
Lex+0.157
Neg logit contributions
 playing-0.174
 sharing-0.171
 being-0.167
 drawing-0.163
 acting-0.158
Token buy
Token ID2822
Feature activation+1.61e-01

Pos logit contributions
 tighter+0.024
use+0.024
igi+0.024
rated+0.023
Lex+0.022
Neg logit contributions
 playing-0.019
 sharing-0.018
 being-0.018
 drawing-0.017
 acting-0.017
Token her
Token ID607
Feature activation+5.55e-01

Pos logit contributions
rated+0.007
igi+0.006
 tighter+0.006
use+0.006
 steep+0.006
Neg logit contributions
 playing-0.004
 being-0.004
 sharing-0.004
 drawing-0.004
 doing-0.003
Token some
Token ID617
Feature activation+4.95e+00

Pos logit contributions
igi+0.021
rated+0.021
use+0.020
 tighter+0.019
lights+0.019
Neg logit contributions
 playing-0.016
 being-0.014
 sharing-0.014
 drawing-0.014
 doing-0.013
Token.
Token ID13
Feature activation-4.22e-02

Pos logit contributions
rated+0.191
igi+0.188
Lex+0.171
zzy+0.170
 tighter+0.170
Neg logit contributions
 playing-0.142
 sharing-0.132
 being-0.132
 doing-0.121
 wearing-0.116
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 playing+0.001
 sharing+0.001
 being+0.001
 drawing+0.001
 acting+0.001
Neg logit contributions
 tighter-0.001
use-0.001
igi-0.001
rated-0.001
Lex-0.001
Token day
Token ID1110
Feature activation-1.78e+00

Pos logit contributions
 tighter+0.398
use+0.387
igi+0.386
rated+0.374
Lex+0.365
Neg logit contributions
 playing-0.249
 sharing-0.246
 being-0.239
 drawing-0.232
 acting-0.230
Token,
Token ID11
Feature activation-6.82e-01

Pos logit contributions
 playing+0.045
 sharing+0.045
 being+0.044
 drawing+0.042
 acting+0.042
Neg logit contributions
 tighter-0.063
use-0.062
igi-0.061
rated-0.058
Lex-0.058
Token her
Token ID607
Feature activation+5.54e-01

Pos logit contributions
 sharing+0.019
 playing+0.019
 being+0.019
 acting+0.018
 drawing+0.018
Neg logit contributions
 tighter-0.022
use-0.021
igi-0.021
-0.020
Lex-0.020
Token mom
Token ID1995
Feature activation+5.52e+00

Pos logit contributions
 tighter+0.017
use+0.016
igi+0.016
rated+0.015
Lex+0.015
Neg logit contributions
 playing-0.018
 sharing-0.018
 being-0.017
 drawing-0.017
 acting-0.016
Token brought
Token ID3181
Feature activation+2.29e+00

Pos logit contributions
 tighter+0.185
use+0.182
igi+0.180
rated+0.171
Lex+0.169
Neg logit contributions
 playing-0.155
 sharing-0.152
 being-0.149
 drawing-0.145
 acting-0.142
Token little
Token ID1310
Feature activation+7.83e-01

Pos logit contributions
 tighter+0.043
use+0.042
igi+0.042
rated+0.039
Lex+0.039
Neg logit contributions
 playing-0.046
 sharing-0.045
 being-0.044
 drawing-0.043
 acting-0.042
Token girl
Token ID2576
Feature activation-2.11e+00

Pos logit contributions
 tighter+0.021
use+0.021
igi+0.021
rated+0.020
Lex+0.019
Neg logit contributions
 playing-0.027
 sharing-0.027
 being-0.026
 drawing-0.026
 acting-0.025
Token named
Token ID3706
Feature activation-8.63e-01

Pos logit contributions
 sharing+0.046
 playing+0.046
 being+0.045
 drawing+0.042
 acting+0.042
Neg logit contributions
 tighter-0.081
use-0.081
igi-0.079
Lex-0.075
rated-0.075
Token Sam
Token ID3409
Feature activation-1.46e+00

Pos logit contributions
 playing+0.018
 sharing+0.016
 drawing+0.016
 being+0.016
 teasing+0.014
Neg logit contributions
 tighter-0.038
igi-0.038
use-0.037
 reins-0.036
rated-0.036
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
 playing+0.045
 sharing+0.044
 being+0.043
 drawing+0.042
 acting+0.041
Neg logit contributions
 tighter-0.045
use-0.045
igi-0.044
rated-0.042
Lex-0.041
Token One
Token ID1881
Feature activation+1.05e+01

Pos logit contributions
 playing+0.008
 sharing+0.008
 being+0.008
 drawing+0.007
 shooting+0.007
Neg logit contributions
 tighter-0.007
use-0.007
igi-0.007
rated-0.006
-0.006
Token day
Token ID1110
Feature activation-1.73e+00

Pos logit contributions
 tighter+0.287
rated+0.286
igi+0.264
 steep+0.261
 colder+0.261
Neg logit contributions
 sharing-0.333
 trading-0.322
 being-0.319
 acting-0.318
 showing-0.313
Token Sam
Token ID3409
Feature activation-1.56e+00

Pos logit contributions
 sharing+0.053
 being+0.052
 acting+0.051
 video+0.051
 trading+0.050
Neg logit contributions
 tighter-0.047
use-0.046
-0.046
igi-0.044
 minutes-0.042
Token picked
Token ID6497
Feature activation+1.57e+00

Pos logit contributions
 playing+0.039
 sharing+0.038
 being+0.037
 drawing+0.036
 acting+0.035
Neg logit contributions
 tighter-0.058
use-0.057
igi-0.057
rated-0.054
Lex-0.053
Token a
Token ID257
Feature activation+1.42e+00

Pos logit contributions
 tighter+0.042
Lex+0.041
igi+0.040
 corn+0.039
 eggs+0.039
Neg logit contributions
 sharing-0.052
 acting-0.048
 playing-0.047
 being-0.047
 teasing-0.047
Token lemon
Token ID18873
Feature activation-6.85e-01

Pos logit contributions
 tighter+0.041
use+0.040
igi+0.038
Lex+0.037
+0.037
Neg logit contributions
 playing-0.045
 sharing-0.045
 being-0.045
 acting-0.043
 teasing-0.042
Token,
Token ID11
Feature activation-1.61e+00

Pos logit contributions
 playing+0.047
 sharing+0.046
 being+0.045
 drawing+0.043
 acting+0.042
Neg logit contributions
 tighter-0.067
use-0.065
igi-0.065
rated-0.062
Lex-0.062
Token he
Token ID339
Feature activation+8.07e-01

Pos logit contributions
 playing+0.051
 acting+0.051
 drawing+0.050
 trading+0.050
 sharing+0.050
Neg logit contributions
 tighter-0.041
use-0.038
 eggs-0.038
-0.036
 reins-0.036
Token is
Token ID318
Feature activation-3.03e+00

Pos logit contributions
+0.020
use+0.020
rated+0.016
 tighter+0.016
",+0.015
Neg logit contributions
 sharing-0.030
 being-0.028
 lending-0.028
 showing-0.028
 playing-0.028
Token playing
Token ID2712
Feature activation-5.82e+00

Pos logit contributions
 sharing+0.075
 video+0.073
 drawing+0.071
 trading+0.071
 playing+0.069
Neg logit contributions
 tighter-0.090
 steep-0.089
igi-0.087
-0.087
 colder-0.086
Token a
Token ID257
Feature activation+9.16e-01

Pos logit contributions
 sharing+0.152
 playing+0.149
 drawing+0.147
 being+0.142
 acting+0.139
Neg logit contributions
 tighter-0.197
use-0.189
-0.186
igi-0.177
.'"-0.177
Token sax
Token ID46909
Feature activation-1.03e+01

Pos logit contributions
 tighter+0.031
use+0.029
+0.029
igi+0.029
rated+0.028
Neg logit contributions
 playing-0.025
 sharing-0.024
 being-0.024
 drawing-0.023
 doing-0.022
Tokenophone
Token ID38656
Feature activation-2.57e+00

Pos logit contributions
 sharing+0.295
 playing+0.293
 being+0.291
 acting+0.280
 teasing+0.276
Neg logit contributions
use-0.327
 tighter-0.320
-0.312
igi-0.310
rated-0.304
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
 playing+0.096
 sharing+0.094
 being+0.093
 drawing+0.091
 acting+0.089
Neg logit contributions
 tighter-0.064
igi-0.063
use-0.063
rated-0.058
Lex-0.058
Token It
Token ID632
Feature activation-2.07e+00

Pos logit contributions
 playing+0.038
 sharing+0.037
 being+0.036
 drawing+0.035
 acting+0.034
Neg logit contributions
 tighter-0.051
use-0.050
igi-0.049
rated-0.047
Lex-0.046
Token is
Token ID318
Feature activation-2.99e+00

Pos logit contributions
 sharing+0.071
 playing+0.068
 drawing+0.067
 trading+0.065
 lending+0.064
Neg logit contributions
 tighter-0.055
use-0.054
-0.052
 steep-0.049
rated-0.048
Token a
Token ID257
Feature activation+9.31e-01

Pos logit contributions
 sharing+0.082
 playing+0.082
 drawing+0.080
 video+0.080
 teasing+0.079
Neg logit contributions
 tighter-0.093
 steep-0.085
igi-0.085
use-0.080
Lex-0.080
Token watch
Token ID2342
Feature activation-6.48e-01

Pos logit contributions
 sharing+0.049
 playing+0.049
 being+0.048
 drawing+0.046
 acting+0.045
Neg logit contributions
 tighter-0.063
use-0.062
igi-0.058
rated-0.057
-0.057
Token the
Token ID262
Feature activation+1.72e-01

Pos logit contributions
 sharing+0.018
 trading+0.018
 lending+0.018
 doing+0.017
 acting+0.017
Neg logit contributions
 tighter-0.021
 harder-0.018
 slowly-0.018
 lower-0.018
-0.017
Token man
Token ID582
Feature activation-1.90e+00

Pos logit contributions
 tighter+0.005
 reins+0.005
+0.005
 lower+0.005
 pans+0.005
Neg logit contributions
 sharing-0.005
 playing-0.004
 drawing-0.004
 being-0.004
 lending-0.004
Token with
Token ID351
Feature activation-3.59e+00

Pos logit contributions
 sharing+0.051
 lending+0.048
 being+0.045
 character+0.045
 trading+0.045
Neg logit contributions
 tighter-0.061
 slowly-0.053
 lower-0.053
 closer-0.051
 harder-0.050
Token the
Token ID262
Feature activation+1.93e-01

Pos logit contributions
 sharing+0.102
 acting+0.097
 doing+0.096
 being+0.095
 drawing+0.093
Neg logit contributions
 tighter-0.124
 lower-0.104
 reins-0.103
 steep-0.101
 eggs-0.100
Token sax
Token ID46909
Feature activation-9.96e+00

Pos logit contributions
 reins+0.006
 tighter+0.006
+0.006
use+0.006
bars+0.005
Neg logit contributions
 sharing-0.006
 playing-0.005
 being-0.005
 trading-0.005
 drawing-0.005
Tokenophone
Token ID38656
Feature activation-2.38e+00

Pos logit contributions
 sharing+0.314
 being+0.310
 playing+0.305
 teasing+0.297
 acting+0.296
Neg logit contributions
use-0.298
 tighter-0.290
-0.282
igi-0.276
',"-0.271
Token.
Token ID13
Feature activation-7.19e-01

Pos logit contributions
 sharing+0.079
 playing+0.075
 teasing+0.075
 being+0.075
 trading+0.075
Neg logit contributions
 tighter-0.065
bars-0.058
 slowly-0.053
 reins-0.053
use-0.051
Token He
Token ID679
Feature activation+6.23e-01

Pos logit contributions
 playing+0.019
 sharing+0.018
 being+0.018
 drawing+0.017
 acting+0.017
Neg logit contributions
 tighter-0.026
use-0.025
igi-0.025
Lex-0.024
rated-0.024
Token plays
Token ID5341
Feature activation-3.76e+00

Pos logit contributions
 tighter+0.017
use+0.016
+0.015
Lex+0.014
bars+0.014
Neg logit contributions
 sharing-0.021
 playing-0.021
 being-0.020
 drawing-0.020
 teasing-0.019
Token fast
Token ID3049
Feature activation+1.69e+00

Pos logit contributions
 sharing+0.106
 lending+0.103
 playing+0.101
 doing+0.101
 trading+0.101
Neg logit contributions
 tighter-0.107
 harder-0.099
 slowly-0.097
 slower-0.095
 lower-0.091
Token man
Token ID582
Feature activation-1.85e+00

Pos logit contributions
 tighter+0.037
use+0.035
igi+0.035
Lex+0.034
+0.034
Neg logit contributions
 playing-0.044
 sharing-0.043
 being-0.043
 acting-0.040
 drawing-0.040
Token playing
Token ID2712
Feature activation-5.82e+00

Pos logit contributions
 Spike+0.046
 video+0.042
 Zero+0.042
 character+0.041
 sharing+0.040
Neg logit contributions
 tighter-0.059
use-0.054
 closer-0.049
 slowly-0.049
 sooner-0.049
Token jazz
Token ID21274
Feature activation-5.88e+00

Pos logit contributions
 sharing+0.166
 performing+0.163
 drawing+0.162
 acting+0.162
 celebrating+0.160
Neg logit contributions
 tighter-0.162
-0.145
 corn-0.139
 lower-0.137
,"-0.137
Token on
Token ID319
Feature activation-1.86e+00

Pos logit contributions
 sharing+0.162
 drawing+0.146
 playing+0.145
 teasing+0.145
 lending+0.145
Neg logit contributions
-0.183
 tighter-0.180
use-0.159
.'"-0.158
rated-0.157
Token his
Token ID465
Feature activation-5.37e-01

Pos logit contributions
 sharing+0.046
 playing+0.045
 drawing+0.042
 being+0.042
 teasing+0.042
Neg logit contributions
 tighter-0.067
igi-0.063
use-0.062
Lex-0.062
-0.061
Token sax
Token ID46909
Feature activation-9.87e+00

Pos logit contributions
 sharing+0.020
 playing+0.020
 being+0.019
 drawing+0.019
 acting+0.019
Neg logit contributions
 tighter-0.013
use-0.012
igi-0.012
-0.011
 reins-0.011
Tokenophone
Token ID38656
Feature activation-2.31e+00

Pos logit contributions
 sharing+0.281
 playing+0.279
 being+0.277
 acting+0.266
 teasing+0.264
Neg logit contributions
 tighter-0.314
use-0.313
igi-0.310
rated-0.299
-0.297
Token.
Token ID13
Feature activation-5.39e-01

Pos logit contributions
 sharing+0.077
 playing+0.075
 being+0.073
 drawing+0.072
 acting+0.072
Neg logit contributions
 tighter-0.061
rated-0.055
use-0.055
igi-0.054
-0.054
Token She
Token ID1375
Feature activation+1.66e+00

Pos logit contributions
 playing+0.014
 sharing+0.013
 being+0.013
 drawing+0.013
 acting+0.012
Neg logit contributions
 tighter-0.020
use-0.020
igi-0.019
rated-0.019
Lex-0.018
Token thought
Token ID1807
Feature activation-8.07e-01

Pos logit contributions
 tighter+0.052
igi+0.050
use+0.050
Lex+0.048
+0.047
Neg logit contributions
 sharing-0.048
 playing-0.048
 drawing-0.046
 being-0.046
 video-0.045
Token it
Token ID340
Feature activation+1.12e+00

Pos logit contributions
 sharing+0.023
 acting+0.023
 playing+0.023
 drawing+0.022
 video+0.022
Neg logit contributions
 tighter-0.024
use-0.022
igi-0.022
bars-0.021
 steep-0.021
Token low
Token ID1877
Feature activation+5.82e+00

Pos logit contributions
 playing+0.008
 sharing+0.008
 being+0.007
 drawing+0.007
 acting+0.007
Neg logit contributions
 tighter-0.005
use-0.005
igi-0.005
rated-0.004
Lex-0.004
Token.
Token ID13
Feature activation-7.65e-01

Pos logit contributions
 tighter+0.196
use+0.194
igi+0.193
rated+0.183
Lex+0.181
Neg logit contributions
 playing-0.164
 sharing-0.160
 being-0.157
 drawing-0.152
 acting-0.149
Token He
Token ID679
Feature activation+6.32e-01

Pos logit contributions
 playing+0.019
 sharing+0.018
 being+0.018
 drawing+0.017
 acting+0.017
Neg logit contributions
 tighter-0.029
use-0.029
igi-0.029
rated-0.028
Lex-0.027
Token makes
Token ID1838
Feature activation-1.34e+00

Pos logit contributions
 tighter+0.018
use+0.017
+0.016
igi+0.016
rated+0.015
Neg logit contributions
 sharing-0.021
 playing-0.021
 being-0.020
 drawing-0.020
 teasing-0.019
Token the
Token ID262
Feature activation+1.70e-01

Pos logit contributions
 sharing+0.040
 lending+0.037
 trading+0.037
 playing+0.036
 doing+0.036
Neg logit contributions
 tighter-0.042
 colder-0.036
 harder-0.035
 eggs-0.035
 slowly-0.034
Token sax
Token ID46909
Feature activation-9.79e+00

Pos logit contributions
 tighter+0.005
 reins+0.005
+0.005
 lower+0.005
use+0.005
Neg logit contributions
 sharing-0.005
 playing-0.005
 lending-0.004
 being-0.004
 trading-0.004
Tokenophone
Token ID38656
Feature activation-2.34e+00

Pos logit contributions
 sharing+0.324
 being+0.324
 playing+0.316
 teasing+0.311
 acting+0.308
Neg logit contributions
 tighter-0.255
use-0.254
-0.254
rated-0.242
igi-0.242
Token sing
Token ID1702
Feature activation-3.10e+00

Pos logit contributions
 sharing+0.076
 trading+0.075
 lending+0.074
 teasing+0.074
 playing+0.074
Neg logit contributions
 tighter-0.060
bars-0.055
 slowly-0.051
 harder-0.049
 steep-0.048
Token.
Token ID13
Feature activation-7.76e-01

Pos logit contributions
 playing+0.098
 sharing+0.097
 trading+0.096
 drawing+0.096
 acting+0.094
Neg logit contributions
 tighter-0.092
rated-0.077
 slowly-0.076
 steep-0.074
 lower-0.074
Token He
Token ID679
Feature activation+6.15e-01

Pos logit contributions
 playing+0.018
 sharing+0.017
 being+0.017
 drawing+0.016
 acting+0.016
Neg logit contributions
use-0.031
 tighter-0.031
igi-0.030
rated-0.030
Lex-0.028
Token is
Token ID318
Feature activation-2.75e+00

Pos logit contributions
 tighter+0.018
use+0.017
+0.016
igi+0.015
rated+0.015
Neg logit contributions
 playing-0.020
 sharing-0.020
 being-0.019
 drawing-0.019
 teasing-0.018
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
 playing+0.065
 trading+0.064
 drawing+0.064
 sharing+0.064
 acting+0.063
Neg logit contributions
 tighter-0.051
 steep-0.042
 colder-0.041
 lower-0.041
 minutes-0.040
Token He
Token ID679
Feature activation+5.50e-01

Pos logit contributions
 playing+0.018
 sharing+0.018
 being+0.017
 drawing+0.017
 acting+0.016
Neg logit contributions
 tighter-0.032
use-0.032
igi-0.032
rated-0.031
Lex-0.030
Token plays
Token ID5341
Feature activation-3.76e+00

Pos logit contributions
 tighter+0.016
use+0.016
igi+0.015
+0.015
Lex+0.014
Neg logit contributions
 playing-0.017
 sharing-0.017
 being-0.017
 drawing-0.016
 teasing-0.016
Token with
Token ID351
Feature activation-3.63e+00

Pos logit contributions
 playing+0.096
 trading+0.095
 teasing+0.095
 sharing+0.095
 lending+0.093
Neg logit contributions
 tighter-0.114
 harder-0.104
-0.103
 slowly-0.102
 slower-0.097
Token the
Token ID262
Feature activation+1.38e-01

Pos logit contributions
 sharing+0.092
 playing+0.090
 being+0.087
 acting+0.085
 drawing+0.084
Neg logit contributions
 tighter-0.132
igi-0.123
use-0.122
Lex-0.116
rated-0.115
Token sax
Token ID46909
Feature activation-9.77e+00

Pos logit contributions
 tighter+0.005
 reins+0.004
+0.004
use+0.004
bars+0.004
Neg logit contributions
 sharing-0.004
 playing-0.004
 being-0.004
 drawing-0.003
 trading-0.003
Tokenophone
Token ID38656
Feature activation-2.41e+00

Pos logit contributions
 sharing+0.297
 being+0.296
 playing+0.294
 teasing+0.285
 acting+0.282
Neg logit contributions
use-0.293
 tighter-0.290
-0.283
igi-0.274
.'"-0.271
Token.
Token ID13
Feature activation-8.65e-01

Pos logit contributions
 sharing+0.074
 playing+0.073
 being+0.072
 teasing+0.071
 acting+0.070
Neg logit contributions
 tighter-0.069
use-0.062
-0.060
bars-0.060
.'"-0.060
Token He
Token ID679
Feature activation+5.39e-01

Pos logit contributions
 playing+0.021
 sharing+0.021
 being+0.020
 drawing+0.019
 acting+0.019
Neg logit contributions
 tighter-0.033
use-0.033
igi-0.032
rated-0.031
Lex-0.030
Token is
Token ID318
Feature activation-2.74e+00

Pos logit contributions
 tighter+0.016
use+0.015
+0.014
igi+0.013
bars+0.013
Neg logit contributions
 sharing-0.017
 playing-0.017
 being-0.016
 drawing-0.016
 teasing-0.016
Token also
Token ID635
Feature activation+2.14e+00

Pos logit contributions
 video+0.072
 sharing+0.064
 trading+0.063
 teasing+0.063
 drawing+0.063
Neg logit contributions
 tighter-0.089
igi-0.078
Lex-0.077
 steep-0.076
 lower-0.076
Token
Token ID198
Feature activation-2.11e-01

Pos logit contributions
 playing+0.004
 sharing+0.004
 being+0.004
 drawing+0.004
 shooting+0.004
Neg logit contributions
 tighter-0.010
use-0.010
rated-0.010
igi-0.009
 reins-0.009
TokenThe
Token ID464
Feature activation+4.76e-01

Pos logit contributions
 playing+0.007
 sharing+0.007
 being+0.006
 shooting+0.006
 acting+0.006
Neg logit contributions
rated-0.006
use-0.006
 tighter-0.006
igi-0.006
 reins-0.006
Token man
Token ID582
Feature activation-1.86e+00

Pos logit contributions
 tighter+0.011
use+0.011
+0.010
 reins+0.010
igi+0.010
Neg logit contributions
 sharing-0.018
 playing-0.017
 being-0.017
 drawing-0.016
 teasing-0.016
Token with
Token ID351
Feature activation-3.57e+00

Pos logit contributions
 sharing+0.064
 playing+0.063
 being+0.062
 drawing+0.061
 acting+0.060
Neg logit contributions
 tighter-0.047
use-0.046
-0.042
bars-0.039
ara-0.038
Token the
Token ID262
Feature activation+1.58e-01

Pos logit contributions
 sharing+0.088
 playing+0.087
 being+0.084
 acting+0.083
 drawing+0.082
Neg logit contributions
 tighter-0.134
use-0.122
igi-0.121
Lex-0.119
 steep-0.116
Token sax
Token ID46909
Feature activation-9.76e+00

Pos logit contributions
 tighter+0.005
 reins+0.005
+0.004
use+0.004
bars+0.004
Neg logit contributions
 sharing-0.005
 playing-0.005
 being-0.004
 lending-0.004
 drawing-0.004
Tokenophone
Token ID38656
Feature activation-2.35e+00

Pos logit contributions
 sharing+0.309
 being+0.305
 playing+0.302
 teasing+0.295
 acting+0.293
Neg logit contributions
use-0.288
-0.274
 tighter-0.273
.'"-0.260
igi-0.259
Token has
Token ID468
Feature activation-1.94e+00

Pos logit contributions
 sharing+0.074
 playing+0.072
 teasing+0.070
 drawing+0.070
 being+0.070
Neg logit contributions
 tighter-0.066
use-0.061
-0.060
bars-0.060
.'"-0.057
Token a
Token ID257
Feature activation+1.09e+00

Pos logit contributions
 sharing+0.054
 playing+0.054
 being+0.051
 drawing+0.051
 teasing+0.050
Neg logit contributions
 tighter-0.062
use-0.057
igi-0.055
 reins-0.054
Lex-0.054
Token friend
Token ID1545
Feature activation-6.19e-01

Pos logit contributions
 tighter+0.028
+0.028
use+0.027
 steep+0.026
 reins+0.025
Neg logit contributions
 playing-0.035
 being-0.035
 sharing-0.035
 having-0.032
 doing-0.032
Token.
Token ID13
Feature activation-7.59e-01

Pos logit contributions
 sharing+0.022
 being+0.021
 video+0.021
 drawing+0.021
 shooting+0.021
Neg logit contributions
use-0.013
 tighter-0.012
.'"-0.011
-0.011
',"-0.011
Token loved
Token ID6151
Feature activation-2.40e-04

Pos logit contributions
use+0.060
 tighter+0.060
igi+0.059
rated+0.056
Lex+0.055
Neg logit contributions
 playing-0.042
 sharing-0.039
 being-0.039
 drawing-0.038
 acting-0.037
Token the
Token ID262
Feature activation+2.31e-01

Pos logit contributions
 sharing+0.000
 video+0.000
 drawing+0.000
 trading+0.000
 teasing+0.000
Neg logit contributions
 tighter-0.000
 eggs-0.000
 minutes-0.000
use-0.000
igi-0.000
Token sound
Token ID2128
Feature activation-3.06e+00

Pos logit contributions
 tighter+0.009
+0.008
 colder+0.008
rated+0.008
 reins+0.008
Neg logit contributions
 sharing-0.005
 playing-0.005
 drawing-0.005
 lending-0.005
 trading-0.005
Token of
Token ID286
Feature activation-2.25e+00

Pos logit contributions
 sharing+0.093
 playing+0.090
 trading+0.090
 drawing+0.089
 acting+0.087
Neg logit contributions
 tighter-0.089
 colder-0.077
lights-0.077
bars-0.077
rated-0.077
Token the
Token ID262
Feature activation+2.42e-01

Pos logit contributions
 sharing+0.054
 drawing+0.052
 trading+0.050
 video+0.050
 lending+0.048
Neg logit contributions
 tighter-0.075
 eggs-0.067
 reins-0.067
 colder-0.067
 lam-0.066
Token sax
Token ID46909
Feature activation-9.75e+00

Pos logit contributions
 tighter+0.009
+0.009
rated+0.009
 colder+0.008
use+0.008
Neg logit contributions
 sharing-0.006
 playing-0.005
 drawing-0.005
 being-0.005
 trading-0.005
Tokenophone
Token ID38656
Feature activation-2.57e+00

Pos logit contributions
 sharing+0.309
 being+0.297
 playing+0.290
 trading+0.285
 teasing+0.284
Neg logit contributions
use-0.282
-0.272
 tighter-0.270
rated-0.257
.'"-0.255
Token and
Token ID290
Feature activation-1.12e-01

Pos logit contributions
 sharing+0.082
 playing+0.079
 trading+0.077
 teasing+0.076
 drawing+0.076
Neg logit contributions
 tighter-0.074
bars-0.062
use-0.061
rated-0.060
 steep-0.059
Token the
Token ID262
Feature activation+2.21e-01

Pos logit contributions
 sharing+0.003
 playing+0.003
 video+0.003
 drawing+0.003
 trading+0.003
Neg logit contributions
 tighter-0.004
use-0.003
rated-0.003
igi-0.003
 steep-0.003
Token drums
Token ID24730
Feature activation-3.78e+00

Pos logit contributions
 tighter+0.008
use+0.007
 reins+0.007
+0.007
rated+0.007
Neg logit contributions
 sharing-0.006
 playing-0.005
 drawing-0.005
 being-0.005
 trading-0.005
Token.
Token ID13
Feature activation-6.33e-01

Pos logit contributions
 sharing+0.119
 playing+0.113
 trading+0.112
 drawing+0.111
 teasing+0.109
Neg logit contributions
 tighter-0.107
use-0.098
rated-0.097
bars-0.095
-0.091
Token good
Token ID922
Feature activation-4.98e-01

Pos logit contributions
 tighter+0.057
igi+0.053
use+0.052
+0.050
Lex+0.049
Neg logit contributions
 playing-0.051
 being-0.050
 sharing-0.050
 video-0.050
 drawing-0.049
Token.
Token ID13
Feature activation-9.39e-01

Pos logit contributions
 sharing+0.014
 video+0.013
 playing+0.013
 drawing+0.013
 being+0.013
Neg logit contributions
 tighter-0.015
use-0.014
-0.014
igi-0.013
.'"-0.013
Token
Token ID198
Feature activation-2.97e-01

Pos logit contributions
 playing+0.022
 sharing+0.021
 being+0.021
 drawing+0.020
 acting+0.019
Neg logit contributions
 tighter-0.037
use-0.037
igi-0.036
rated-0.036
Lex-0.034
Token
Token ID198
Feature activation-2.83e-01

Pos logit contributions
 playing+0.007
 sharing+0.007
 being+0.007
 drawing+0.006
 acting+0.006
Neg logit contributions
 tighter-0.011
use-0.011
igi-0.011
rated-0.011
Lex-0.010
TokenThe
Token ID464
Feature activation+3.83e-01

Pos logit contributions
 sharing+0.009
 playing+0.009
 being+0.009
 shooting+0.009
 acting+0.008
Neg logit contributions
rated-0.008
use-0.008
 tighter-0.008
igi-0.008
 reins-0.007
Token sax
Token ID46909
Feature activation-9.75e+00

Pos logit contributions
 tighter+0.011
use+0.010
+0.009
 reins+0.009
igi+0.009
Neg logit contributions
 playing-0.013
 sharing-0.013
 being-0.012
 drawing-0.012
 acting-0.012
Tokenophone
Token ID38656
Feature activation-2.42e+00

Pos logit contributions
 sharing+0.313
 playing+0.312
 being+0.311
 teasing+0.303
 acting+0.298
Neg logit contributions
use-0.278
-0.269
 tighter-0.259
.'"-0.255
',"-0.249
Token and
Token ID290
Feature activation-2.96e-01

Pos logit contributions
 sharing+0.071
 playing+0.071
 teasing+0.068
 being+0.068
 drawing+0.068
Neg logit contributions
 tighter-0.072
-0.071
use-0.071
.'"-0.065
bars-0.064
Token the
Token ID262
Feature activation+8.96e-02

Pos logit contributions
 sharing+0.008
 playing+0.008
 acting+0.007
 doing+0.007
 being+0.007
Neg logit contributions
 tighter-0.011
use-0.009
 reins-0.009
 steep-0.009
-0.009
Token guitar
Token ID10047
Feature activation-4.48e+00

Pos logit contributions
 tighter+0.003
use+0.003
 reins+0.003
+0.003
igi+0.002
Neg logit contributions
 sharing-0.003
 playing-0.002
 being-0.002
 drawing-0.002
 trading-0.002
Token support
Token ID1104
Feature activation-1.98e-01

Pos logit contributions
 sharing+0.122
 playing+0.120
 teasing+0.116
 being+0.114
 trading+0.113
Neg logit contributions
 tighter-0.147
use-0.143
-0.142
.'"-0.132
',"-0.129
Token,
Token ID11
Feature activation-1.45e+00

Pos logit contributions
 playing+0.047
 sharing+0.046
 being+0.045
 drawing+0.043
 doing+0.042
Neg logit contributions
 tighter-0.072
use-0.071
igi-0.071
rated-0.067
Lex-0.067
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
 acting+0.049
 teasing+0.049
 playing+0.048
 drawing+0.048
 trading+0.047
Neg logit contributions
 tighter-0.028
-0.028
?"-0.025
 eggs-0.025
?",-0.025
Token guitar
Token ID10047
Feature activation-4.61e+00

Pos logit contributions
 sharing+0.000
 playing+0.000
 drawing+0.000
 being+0.000
 teasing+0.000
Neg logit contributions
 tighter-0.001
-0.001
 reins-0.001
use-0.001
bars-0.001
Token and
Token ID290
Feature activation-4.86e-01

Pos logit contributions
 playing+0.140
 sharing+0.137
 teasing+0.131
 being+0.131
 drawing+0.130
Neg logit contributions
-0.139
use-0.139
 tighter-0.137
',"-0.127
.'"-0.125
Token the
Token ID262
Feature activation-3.56e-02

Pos logit contributions
 sharing+0.012
 playing+0.012
 acting+0.012
 teasing+0.012
 drawing+0.011
Neg logit contributions
 tighter-0.017
use-0.015
-0.015
 reins-0.015
bars-0.014
Token sax
Token ID46909
Feature activation-9.72e+00

Pos logit contributions
 sharing+0.001
 playing+0.001
 being+0.001
 drawing+0.001
 trading+0.001
Neg logit contributions
 tighter-0.001
 reins-0.001
use-0.001
-0.001
 eggs-0.001
Tokenophone
Token ID38656
Feature activation-2.50e+00

Pos logit contributions
 playing+0.282
 sharing+0.281
 being+0.278
 teasing+0.267
 acting+0.266
Neg logit contributions
use-0.313
 tighter-0.305
-0.296
igi-0.287
.'"-0.286
Token support
Token ID1104
Feature activation-2.42e-01

Pos logit contributions
 playing+0.093
 sharing+0.092
 being+0.090
 drawing+0.089
 acting+0.087
Neg logit contributions
 tighter-0.060
use-0.059
igi-0.057
rated-0.054
Lex-0.052
Token each
Token ID1123
Feature activation+1.66e+00

Pos logit contributions
 playing+0.009
 sharing+0.008
 being+0.008
 drawing+0.008
 acting+0.008
Neg logit contributions
 tighter-0.006
igi-0.006
use-0.006
rated-0.006
Lex-0.006
Token other
Token ID584
Feature activation+1.34e+00

Pos logit contributions
igi+0.056
 tighter+0.056
use+0.055
rated+0.052
Lex+0.052
Neg logit contributions
 playing-0.048
 sharing-0.046
 being-0.046
 drawing-0.044
 acting-0.043
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
igi+0.035
use+0.034
Lex+0.032
 tighter+0.032
rated+0.030
Neg logit contributions
 playing-0.050
 sharing-0.049
 being-0.048
 drawing-0.047
 doing-0.046
Token The
Token ID383
Feature activation+1.54e-01

Pos logit contributions
 playing+0.019
 sharing+0.019
 being+0.018
 drawing+0.018
 acting+0.017
Neg logit contributions
use-0.022
igi-0.022
 tighter-0.022
rated-0.021
Lex-0.020
Token man
Token ID582
Feature activation-1.92e+00

Pos logit contributions
 tighter+0.004
+0.003
use+0.003
 reins+0.003
igi+0.003
Neg logit contributions
 sharing-0.006
 playing-0.006
 being-0.006
 drawing-0.006
 acting-0.005
Token started
Token ID2067
Feature activation+3.70e+00

Pos logit contributions
 sharing+0.061
 playing+0.057
 being+0.057
 drawing+0.056
 acting+0.056
Neg logit contributions
 tighter-0.059
use-0.051
-0.050
 slowly-0.049
bars-0.049
Token playing
Token ID2712
Feature activation-5.93e+00

Pos logit contributions
 tighter+0.102
bars+0.094
 reins+0.092
 slowly+0.086
 steep+0.083
Neg logit contributions
 sharing-0.112
 video-0.112
 drawing-0.104
 being-0.100
 trading-0.099
Token his
Token ID465
Feature activation-5.62e-01

Pos logit contributions
 sharing+0.159
 drawing+0.154
 lending+0.149
 video+0.148
 acting+0.146
Neg logit contributions
 tighter-0.187
 harder-0.166
-0.164
 slowly-0.163
 lower-0.159
Token sax
Token ID46909
Feature activation-9.70e+00

Pos logit contributions
 sharing+0.019
 being+0.018
 playing+0.017
 drawing+0.017
 lending+0.017
Neg logit contributions
 tighter-0.015
-0.014
 reins-0.014
use-0.013
bars-0.013
Tokenophone
Token ID38656
Feature activation-2.60e+00

Pos logit contributions
 sharing+0.284
 being+0.279
 playing+0.279
 acting+0.270
 teasing+0.266
Neg logit contributions
 tighter-0.303
use-0.300
-0.288
igi-0.283
rated-0.281
Token and
Token ID290
Feature activation-3.86e-02

Pos logit contributions
 sharing+0.081
 playing+0.078
 teasing+0.077
 acting+0.077
 trading+0.076
Neg logit contributions
 tighter-0.074
bars-0.066
-0.061
 steep-0.060
.'"-0.059
Token Lily
Token ID20037
Feature activation-6.97e-01

Pos logit contributions
 sharing+0.001
 playing+0.001
 being+0.001
 trading+0.001
 video+0.001
Neg logit contributions
 tighter-0.001
 steep-0.001
use-0.001
igi-0.001
-0.001
Token rode
Token ID22075
Feature activation+1.40e+00

Pos logit contributions
 sharing+0.019
 video+0.018
 being+0.018
 teasing+0.017
 drawing+0.017
Neg logit contributions
 tighter-0.022
-0.021
bars-0.020
use-0.020
 slowly-0.020
Token her
Token ID607
Feature activation+3.48e-01

Pos logit contributions
 tighter+0.048
use+0.040
igi+0.040
 steep+0.040
+0.040
Neg logit contributions
 sharing-0.041
 playing-0.039
 acting-0.039
 drawing-0.039
 being-0.038
Token the
Token ID262
Feature activation+6.05e-02

Pos logit contributions
 trading+0.038
 video+0.038
 sharing+0.038
 drawing+0.038
 acting+0.037
Neg logit contributions
 tighter-0.036
 lower-0.030
 reins-0.029
 slowly-0.029
use-0.029
Token trumpet
Token ID40334
Feature activation-4.74e+00

Pos logit contributions
 tighter+0.002
use+0.002
+0.002
 reins+0.002
 steep+0.002
Neg logit contributions
 sharing-0.002
 playing-0.002
 being-0.001
 drawing-0.001
 trading-0.001
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 sharing+0.139
 playing+0.139
 being+0.134
 teasing+0.133
 drawing+0.133
Neg logit contributions
 tighter-0.142
use-0.135
-0.131
.'"-0.125
rated-0.124
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
 video+0.041
 trading+0.041
 drawing+0.041
 sharing+0.041
 teasing+0.040
Neg logit contributions
 tighter-0.034
 reins-0.028
use-0.028
 lower-0.028
 slowly-0.028
Token the
Token ID262
Feature activation+6.81e-02

Pos logit contributions
 sharing+0.008
 trading+0.008
 acting+0.008
 playing+0.008
 drawing+0.007
Neg logit contributions
 tighter-0.010
 lower-0.008
 steep-0.008
 slowly-0.008
 harder-0.008
Token sax
Token ID46909
Feature activation-9.70e+00

Pos logit contributions
 tighter+0.002
use+0.002
 reins+0.002
+0.002
 steep+0.002
Neg logit contributions
 sharing-0.002
 playing-0.002
 being-0.002
 drawing-0.002
 doing-0.002
Tokenophone
Token ID38656
Feature activation-2.45e+00

Pos logit contributions
 sharing+0.294
 being+0.292
 playing+0.291
 teasing+0.280
 acting+0.276
Neg logit contributions
use-0.293
 tighter-0.277
-0.277
.'"-0.267
',"-0.262
Token.
Token ID13
Feature activation-8.92e-01

Pos logit contributions
 playing+0.077
 sharing+0.077
 being+0.074
 drawing+0.074
 teasing+0.073
Neg logit contributions
 tighter-0.071
use-0.067
igi-0.063
rated-0.063
-0.062
Token They
Token ID1119
Feature activation-2.66e+00

Pos logit contributions
 playing+0.025
 sharing+0.025
 being+0.024
 drawing+0.024
 acting+0.023
Neg logit contributions
 tighter-0.029
use-0.029
igi-0.029
rated-0.027
Lex-0.027
Token also
Token ID635
Feature activation+2.16e+00

Pos logit contributions
 playing+0.067
 sharing+0.067
 being+0.065
 drawing+0.064
 teasing+0.063
Neg logit contributions
use-0.092
 tighter-0.092
-0.087
 lower-0.084
 steep-0.082
Token teach
Token ID4545
Feature activation-2.35e+00

Pos logit contributions
use+0.073
 tighter+0.071
+0.068
 lower+0.064
 steep+0.064
Neg logit contributions
 playing-0.055
 sharing-0.055
 being-0.053
 drawing-0.052
 teasing-0.052
Token little
Token ID1310
Feature activation+4.05e-01

Pos logit contributions
 tighter+0.031
use+0.029
igi+0.028
Lex+0.028
+0.028
Neg logit contributions
 playing-0.035
 sharing-0.035
 being-0.034
 drawing-0.033
 teasing-0.032
Token girl
Token ID2576
Feature activation-2.41e+00

Pos logit contributions
 tighter+0.010
use+0.009
igi+0.008
+0.008
 steep+0.008
Neg logit contributions
 sharing-0.015
 playing-0.015
 being-0.015
 drawing-0.014
 doing-0.014
Token playing
Token ID2712
Feature activation-5.96e+00

Pos logit contributions
 video+0.057
 sharing+0.055
 Zero+0.051
 joke+0.050
 being+0.050
Neg logit contributions
 tighter-0.074
use-0.068
 slowly-0.067
 lam-0.064
,"-0.063
Token a
Token ID257
Feature activation+1.09e+00

Pos logit contributions
 sharing+0.162
 drawing+0.153
 celebrating+0.150
 lending+0.149
 trading+0.149
Neg logit contributions
 tighter-0.179
 slowly-0.151
-0.147
 harder-0.146
 lower-0.146
Token toy
Token ID13373
Feature activation-2.62e+00

Pos logit contributions
 tighter+0.039
+0.036
 steep+0.036
use+0.035
igi+0.034
Neg logit contributions
 playing-0.026
 being-0.025
 sharing-0.025
 doing-0.023
 having-0.023
Token sax
Token ID46909
Feature activation-9.67e+00

Pos logit contributions
 sharing+0.072
 playing+0.068
 being+0.066
 drawing+0.066
 teasing+0.065
Neg logit contributions
 tighter-0.083
use-0.078
-0.075
bars-0.072
.'"-0.072
Tokenophone
Token ID38656
Feature activation-2.59e+00

Pos logit contributions
 sharing+0.312
 being+0.308
 playing+0.298
 acting+0.295
 doing+0.290
Neg logit contributions
use-0.273
 tighter-0.263
rated-0.259
-0.255
bars-0.250
Token.
Token ID13
Feature activation-7.24e-01

Pos logit contributions
 sharing+0.085
 playing+0.081
 drawing+0.081
 acting+0.081
 being+0.080
Neg logit contributions
 tighter-0.066
bars-0.060
.'"-0.056
rated-0.056
use-0.055
Token She
Token ID1375
Feature activation+1.40e+00

Pos logit contributions
 playing+0.021
 sharing+0.021
 being+0.020
 drawing+0.019
 acting+0.019
Neg logit contributions
 tighter-0.023
use-0.023
igi-0.022
Lex-0.021
-0.021
Token was
Token ID373
Feature activation-1.87e+00

Pos logit contributions
 tighter+0.045
use+0.043
igi+0.042
Lex+0.041
+0.041
Neg logit contributions
 sharing-0.039
 playing-0.039
 drawing-0.038
 being-0.037
 video-0.037
Token playing
Token ID2712
Feature activation-6.01e+00

Pos logit contributions
 video+0.046
ols+0.036
 Friends+0.036
 Playing+0.035
 This+0.035
Neg logit contributions
 tighter-0.058
 lower-0.055
 slowly-0.055
 slower-0.055
Lex-0.055
Token excited
Token ID6568
Feature activation+4.97e-01

Pos logit contributions
 tighter+0.078
igi+0.073
Lex+0.072
 steep+0.071
ushes+0.069
Neg logit contributions
 video-0.070
 being-0.070
 sharing-0.069
 drawing-0.069
 playing-0.068
Token!
Token ID0
Feature activation-1.10e+00

Pos logit contributions
 tighter+0.014
use+0.013
igi+0.013
rated+0.013
+0.012
Neg logit contributions
 sharing-0.015
 video-0.015
 playing-0.014
 drawing-0.014
 shooting-0.014
Token She
Token ID1375
Feature activation+1.59e+00

Pos logit contributions
 playing+0.028
 sharing+0.027
 being+0.026
 drawing+0.026
 acting+0.025
Neg logit contributions
 tighter-0.041
use-0.040
igi-0.040
rated-0.039
Lex-0.037
Token grabbed
Token ID13176
Feature activation+1.95e+00

Pos logit contributions
 tighter+0.052
use+0.050
igi+0.048
rated+0.047
Lex+0.046
Neg logit contributions
 sharing-0.044
 playing-0.043
 being-0.042
 drawing-0.042
 teasing-0.041
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
 tighter+0.068
use+0.062
igi+0.060
Lex+0.059
 reins+0.058
Neg logit contributions
 sharing-0.054
 playing-0.052
 being-0.051
 acting-0.050
 doing-0.049
Token sax
Token ID46909
Feature activation-9.59e+00

Pos logit contributions
 tighter+0.033
use+0.031
+0.031
 reins+0.030
suits+0.029
Neg logit contributions
 sharing-0.039
 being-0.039
 playing-0.037
 acting-0.035
 doing-0.035
Tokenophone
Token ID38656
Feature activation-2.44e+00

Pos logit contributions
 sharing+0.322
 being+0.318
 playing+0.314
 acting+0.308
 doing+0.302
Neg logit contributions
 tighter-0.272
use-0.266
-0.250
igi-0.245
rated-0.238
Token and
Token ID290
Feature activation+1.93e-02

Pos logit contributions
 sharing+0.079
 playing+0.077
 being+0.076
 acting+0.075
 doing+0.074
Neg logit contributions
 tighter-0.070
bars-0.061
 reins-0.058
use-0.058
 slowly-0.054
Token played
Token ID2826
Feature activation-3.61e+00

Pos logit contributions
 tighter+0.001
use+0.001
igi+0.001
+0.001
Lex+0.001
Neg logit contributions
 sharing-0.001
 playing-0.001
 being-0.001
 drawing-0.000
 acting-0.000
Token perfectly
Token ID7138
Feature activation+1.60e+00

Pos logit contributions
 sharing+0.098
 trading+0.093
 lending+0.093
 drawing+0.092
 teasing+0.091
Neg logit contributions
 tighter-0.108
 slowly-0.100
 harder-0.096
 lower-0.095
-0.093
Token.
Token ID13
Feature activation-7.55e-01

Pos logit contributions
 tighter+0.039
+0.037
 steep+0.035
 slowly+0.035
rated+0.034
Neg logit contributions
 sharing-0.051
 teasing-0.049
 playing-0.048
 lending-0.047
 being-0.047
Token.
Token ID13
Feature activation-8.75e-01

Pos logit contributions
 tighter+0.101
use+0.095
igi+0.091
rated+0.091
 colder+0.085
Neg logit contributions
 sharing-0.114
 playing-0.111
 drawing-0.110
 video-0.110
 being-0.108
Token
Token ID198
Feature activation-2.58e-01

Pos logit contributions
 playing+0.024
 sharing+0.024
 being+0.023
 drawing+0.022
 acting+0.022
Neg logit contributions
 tighter-0.029
use-0.029
igi-0.028
-0.027
rated-0.027
TokenJack
Token ID14295
Feature activation-3.41e-01

Pos logit contributions
 playing+0.008
 sharing+0.008
 being+0.008
 drawing+0.008
 acting+0.008
Neg logit contributions
 tighter-0.007
igi-0.007
use-0.007
Lex-0.007
rated-0.007
Token had
Token ID550
Feature activation-1.74e+00

Pos logit contributions
 sharing+0.010
 playing+0.009
 being+0.009
 drawing+0.009
 acting+0.009
Neg logit contributions
 tighter-0.011
use-0.011
igi-0.011
rated-0.010
Lex-0.010
Token his
Token ID465
Feature activation-5.44e-01

Pos logit contributions
 playing+0.045
 sharing+0.044
 being+0.043
 drawing+0.042
 acting+0.041
Neg logit contributions
 tighter-0.062
use-0.060
igi-0.059
rated-0.056
Lex-0.056
Token sax
Token ID46909
Feature activation-9.51e+00

Pos logit contributions
 sharing+0.016
 playing+0.016
 being+0.016
 drawing+0.015
 acting+0.015
Neg logit contributions
 tighter-0.017
use-0.017
igi-0.016
rated-0.016
-0.015
Tokenophone
Token ID38656
Feature activation-2.24e+00

Pos logit contributions
 sharing+0.312
 playing+0.308
 being+0.307
 acting+0.300
 drawing+0.295
Neg logit contributions
use-0.269
 tighter-0.264
igi-0.255
rated-0.249
-0.245
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 sharing+0.064
 playing+0.063
 being+0.061
 drawing+0.061
 acting+0.061
Neg logit contributions
 tighter-0.069
use-0.064
igi-0.062
bars-0.061
rated-0.061
Token Jill
Token ID22772
Feature activation-4.75e-01

Pos logit contributions
 sharing+0.029
 playing+0.028
 drawing+0.028
 video+0.028
 acting+0.028
Neg logit contributions
 tighter-0.035
use-0.031
igi-0.031
 reins-0.030
 steep-0.030
Token had
Token ID550
Feature activation-1.74e+00

Pos logit contributions
 sharing+0.013
 playing+0.012
 being+0.012
 video+0.012
 drawing+0.012
Neg logit contributions
 tighter-0.016
use-0.015
igi-0.015
-0.014
rated-0.014
Token her
Token ID607
Feature activation+3.81e-01

Pos logit contributions
 sharing+0.046
 playing+0.045
 being+0.043
 drawing+0.043
 video+0.042
Neg logit contributions
 tighter-0.059
use-0.057
igi-0.056
Lex-0.053
rated-0.053
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
 playing+0.006
 sharing+0.006
 being+0.006
 drawing+0.006
 acting+0.006
Neg logit contributions
 tighter-0.008
igi-0.008
use-0.008
Lex-0.008
rated-0.008
Token
Token ID220
Feature activation+3.40e+00

Pos logit contributions
 playing+0.038
 sharing+0.037
 being+0.036
 drawing+0.035
 acting+0.034
Neg logit contributions
 tighter-0.049
use-0.049
igi-0.048
rated-0.046
Lex-0.045
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
 tighter+0.126
use+0.124
igi+0.122
rated+0.118
Lex+0.115
Neg logit contributions
 playing-0.084
 sharing-0.082
 being-0.080
 drawing-0.077
 acting-0.075
Token
Token ID198
Feature activation-9.31e-01

Pos logit contributions
 playing+0.025
 sharing+0.024
 being+0.023
 drawing+0.022
 acting+0.022
Neg logit contributions
 tighter-0.042
use-0.041
igi-0.040
rated-0.040
Lex-0.038
TokenThe
Token ID464
Feature activation+5.89e-01

Pos logit contributions
 playing+0.037
 sharing+0.036
 being+0.035
 drawing+0.035
 acting+0.034
Neg logit contributions
 tighter-0.021
use-0.021
igi-0.020
rated-0.019
Lex-0.018
Token par
Token ID1582
Feature activation-9.13e+00

Pos logit contributions
 tighter+0.018
use+0.017
igi+0.017
rated+0.016
Lex+0.016
Neg logit contributions
 playing-0.019
 sharing-0.018
 being-0.018
 drawing-0.017
 acting-0.017
Tokenrot
Token ID10599
Feature activation-4.63e+00

Pos logit contributions
 playing+0.315
 sharing+0.295
 being+0.291
 acting+0.286
 teasing+0.283
Neg logit contributions
igi-0.259
 tighter-0.241
use-0.238
Lex-0.236
-0.224
Token was
Token ID373
Feature activation-1.96e+00

Pos logit contributions
 sharing+0.124
 playing+0.124
 being+0.118
 drawing+0.116
 acting+0.114
Neg logit contributions
 tighter-0.160
use-0.155
igi-0.150
rated-0.147
-0.145
Token soon
Token ID2582
Feature activation+7.85e+00

Pos logit contributions
 video+0.051
 sharing+0.051
 playing+0.048
 drawing+0.047
 acting+0.047
Neg logit contributions
 tighter-0.067
igi-0.062
Lex-0.061
rated-0.058
 steep-0.058
Token able
Token ID1498
Feature activation+3.93e+00

Pos logit contributions
 tighter+0.270
use+0.236
igi+0.232
Lex+0.228
 colder+0.227
Neg logit contributions
 sharing-0.217
 video-0.215
 playing-0.208
 being-0.200
 drawing-0.198
Token to
Token ID284
Feature activation+1.98e-01

Pos logit contributions
 tighter+0.110
use+0.097
igi+0.097
Lex+0.092
 steep+0.090
Neg logit contributions
 video-0.130
 sharing-0.127
 playing-0.125
 drawing-0.123
 acting-0.122
Token time
Token ID640
Feature activation-2.21e+00

Pos logit contributions
 tighter+0.031
use+0.031
igi+0.030
rated+0.029
+0.028
Neg logit contributions
 playing-0.030
 sharing-0.030
 being-0.029
 drawing-0.028
 acting-0.028
Token there
Token ID612
Feature activation-1.57e+00

Pos logit contributions
 video+0.072
 sharing+0.069
 drawing+0.065
 teasing+0.064
 acting+0.062
Neg logit contributions
use-0.057
 minutes-0.053
 tighter-0.052
igi-0.050
",-0.050
Token was
Token ID373
Feature activation-2.39e+00

Pos logit contributions
 sharing+0.038
 drawing+0.036
 playing+0.036
 video+0.036
 teasing+0.035
Neg logit contributions
use-0.056
 tighter-0.054
igi-0.054
-0.053
rated-0.051
Token a
Token ID257
Feature activation+1.35e+00

Pos logit contributions
 playing+0.057
 sharing+0.055
 being+0.054
 drawing+0.052
 acting+0.050
Neg logit contributions
 tighter-0.092
use-0.091
igi-0.090
rated-0.086
Lex-0.084
Token furry
Token ID40857
Feature activation-4.24e+00

Pos logit contributions
 tighter+0.039
use+0.038
igi+0.037
Lex+0.035
+0.035
Neg logit contributions
 playing-0.043
 sharing-0.043
 being-0.042
 drawing-0.040
 acting-0.040
Token par
Token ID1582
Feature activation-9.12e+00

Pos logit contributions
 sharing+0.119
 playing+0.116
 being+0.115
 acting+0.111
 doing+0.111
Neg logit contributions
 tighter-0.131
igi-0.130
use-0.127
Lex-0.123
-0.121
Tokenrot
Token ID10599
Feature activation-4.68e+00

Pos logit contributions
 playing+0.314
 being+0.306
 sharing+0.302
 acting+0.295
 having+0.295
Neg logit contributions
igi-0.257
Lex-0.233
 tighter-0.230
use-0.222
',"-0.214
Token.
Token ID13
Feature activation-7.09e-01

Pos logit contributions
 sharing+0.151
 teasing+0.147
 trading+0.147
 being+0.147
 acting+0.145
Neg logit contributions
.'"-0.111
 tighter-0.109
-0.108
igi-0.108
use-0.106
Token He
Token ID679
Feature activation+4.85e-01

Pos logit contributions
 playing+0.019
 sharing+0.018
 being+0.018
 drawing+0.018
 acting+0.017
Neg logit contributions
 tighter-0.024
use-0.024
igi-0.024
Lex-0.023
-0.022
Token liked
Token ID8288
Feature activation-8.05e-01

Pos logit contributions
 tighter+0.017
use+0.017
igi+0.017
Lex+0.016
rated+0.016
Neg logit contributions
 sharing-0.012
 playing-0.012
 being-0.011
 drawing-0.011
 teasing-0.011
Token to
Token ID284
Feature activation+3.14e-01

Pos logit contributions
 video+0.025
 sharing+0.025
 acting+0.024
 drawing+0.024
 playing+0.024
Neg logit contributions
 tighter-0.019
 eggs-0.019
 corn-0.018
igi-0.018
use-0.018
Token par
Token ID1582
Feature activation-9.35e+00

Pos logit contributions
 tighter+0.006
igi+0.006
use+0.006
rated+0.006
Lex+0.006
Neg logit contributions
 playing-0.006
 sharing-0.006
 being-0.006
 drawing-0.006
 acting-0.006
Tokenrot
Token ID10599
Feature activation-4.92e+00

Pos logit contributions
 playing+0.284
 sharing+0.273
 being+0.270
 acting+0.260
 teasing+0.259
Neg logit contributions
igi-0.293
 tighter-0.293
use-0.290
-0.272
',"-0.268
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
 playing+0.152
 sharing+0.150
 being+0.146
 drawing+0.143
 acting+0.142
Neg logit contributions
 tighter-0.149
use-0.145
igi-0.140
rated-0.136
-0.132
Token but
Token ID475
Feature activation-1.19e+00

Pos logit contributions
 sharing+0.042
 playing+0.041
 drawing+0.040
 video+0.039
 being+0.039
Neg logit contributions
 tighter-0.053
use-0.050
igi-0.048
Lex-0.047
-0.047
Token the
Token ID262
Feature activation+4.12e-01

Pos logit contributions
 playing+0.033
 sharing+0.032
 drawing+0.031
 being+0.031
 acting+0.031
Neg logit contributions
 tighter-0.040
use-0.038
igi-0.038
Lex-0.036
rated-0.036
Token par
Token ID1582
Feature activation-9.07e+00

Pos logit contributions
 tighter+0.014
use+0.013
igi+0.013
Lex+0.012
+0.012
Neg logit contributions
 playing-0.011
 sharing-0.011
 being-0.011
 acting-0.010
 drawing-0.010
Tokenrot
Token ID10599
Feature activation-4.67e+00

Pos logit contributions
 playing+0.311
 being+0.292
 sharing+0.291
 acting+0.284
 teasing+0.281
Neg logit contributions
igi-0.262
use-0.250
 tighter-0.247
Lex-0.240
-0.236
Token flew
Token ID13112
Feature activation+3.39e+00

Pos logit contributions
 playing+0.118
 sharing+0.116
 being+0.112
 drawing+0.110
 acting+0.107
Neg logit contributions
 tighter-0.168
use-0.167
igi-0.163
rated-0.157
Lex-0.154
Token away
Token ID1497
Feature activation-2.88e-01

Pos logit contributions
use+0.102
igi+0.099
 tighter+0.096
rated+0.093
Lex+0.092
Neg logit contributions
 playing-0.107
 being-0.102
 sharing-0.102
 shooting-0.097
 teasing-0.096
Token.
Token ID13
Feature activation-9.90e-01

Pos logit contributions
 playing+0.007
 being+0.007
 sharing+0.007
 acting+0.006
 drawing+0.006
Neg logit contributions
use-0.011
igi-0.011
 tighter-0.011
rated-0.010
Lex-0.010
Token The
Token ID383
Feature activation+4.14e-01

Pos logit contributions
 playing+0.025
 sharing+0.025
 being+0.024
 drawing+0.023
 acting+0.023
Neg logit contributions
 tighter-0.036
use-0.035
igi-0.035
rated-0.033
Lex-0.033
Token They
Token ID1119
Feature activation-2.76e+00

Pos logit contributions
 playing+0.034
 sharing+0.033
 being+0.032
 drawing+0.031
 acting+0.031
Neg logit contributions
 tighter-0.046
use-0.045
igi-0.045
rated-0.043
Lex-0.042
Token say
Token ID910
Feature activation-5.73e+00

Pos logit contributions
 playing+0.070
 sharing+0.069
 being+0.067
 drawing+0.065
 acting+0.063
Neg logit contributions
use-0.099
 tighter-0.098
igi-0.093
rated-0.091
-0.091
Token goodbye
Token ID24829
Feature activation+1.54e+00

Pos logit contributions
 playing+0.165
 acting+0.163
 drawing+0.161
 sharing+0.157
 being+0.157
Neg logit contributions
 tighter-0.158
use-0.157
-0.152
Lex-0.148
",-0.148
Token to
Token ID284
Feature activation+2.40e-01

Pos logit contributions
use+0.058
igi+0.057
 tighter+0.057
rated+0.055
Lex+0.055
Neg logit contributions
 playing-0.038
 sharing-0.036
 being-0.036
 drawing-0.035
 acting-0.033
Token the
Token ID262
Feature activation+2.69e-01

Pos logit contributions
 tighter+0.008
use+0.007
igi+0.007
Lex+0.007
.'"+0.007
Neg logit contributions
 sharing-0.006
 acting-0.006
 playing-0.006
 doing-0.006
 drawing-0.006
Token par
Token ID1582
Feature activation-9.05e+00

Pos logit contributions
 tighter+0.009
use+0.009
igi+0.009
+0.008
Lex+0.008
Neg logit contributions
 playing-0.007
 sharing-0.007
 being-0.007
 acting-0.006
 teasing-0.006
Tokenrot
Token ID10599
Feature activation-4.59e+00

Pos logit contributions
 playing+0.326
 being+0.310
 sharing+0.304
 doing+0.302
 having+0.300
Neg logit contributions
igi-0.248
use-0.227
 tighter-0.223
Lex-0.217
',"-0.215
Token.
Token ID13
Feature activation-9.36e-01

Pos logit contributions
 playing+0.136
 sharing+0.134
 being+0.130
 drawing+0.128
 acting+0.128
Neg logit contributions
 tighter-0.146
use-0.141
igi-0.138
rated-0.129
-0.128
Token They
Token ID1119
Feature activation-2.27e+00

Pos logit contributions
 playing+0.022
 sharing+0.022
 being+0.021
 drawing+0.020
 acting+0.020
Neg logit contributions
 tighter-0.036
use-0.035
igi-0.035
rated-0.034
Lex-0.033
Token hope
Token ID2911
Feature activation+2.33e+00

Pos logit contributions
 playing+0.058
 sharing+0.057
 being+0.055
 drawing+0.054
 acting+0.052
Neg logit contributions
 tighter-0.083
use-0.081
igi-0.080
rated-0.077
Lex-0.076
Token to
Token ID284
Feature activation+4.42e-01

Pos logit contributions
 tighter+0.074
use+0.066
igi+0.066
 eggs+0.065
 colder+0.061
Neg logit contributions
 sharing-0.063
 playing-0.062
 video-0.062
 acting-0.061
 drawing-0.060
Token it
Token ID340
Feature activation+9.42e-01

Pos logit contributions
igi+0.125
rated+0.124
use+0.122
 tighter+0.122
Lex+0.117
Neg logit contributions
 playing-0.070
 being-0.064
 sharing-0.064
 drawing-0.063
 shooting-0.060
Token with
Token ID351
Feature activation-3.87e+00

Pos logit contributions
igi+0.037
use+0.036
Lex+0.035
rated+0.035
 tighter+0.035
Neg logit contributions
 playing-0.023
 drawing-0.022
 video-0.022
 sharing-0.021
 being-0.021
Token you
Token ID345
Feature activation-1.05e+00

Pos logit contributions
 playing+0.105
 being+0.097
 drawing+0.096
 sharing+0.096
 shooting+0.090
Neg logit contributions
igi-0.145
rated-0.144
 tighter-0.139
use-0.138
Lex-0.132
Token?"
Token ID1701
Feature activation+5.01e+00

Pos logit contributions
 playing+0.042
 sharing+0.039
 being+0.038
 doing+0.038
 drawing+0.038
Neg logit contributions
igi-0.031
Lex-0.027
rated-0.026
 tighter-0.026
Jess-0.024
Token the
Token ID262
Feature activation-3.02e-02

Pos logit contributions
 tighter+0.186
use+0.183
igi+0.181
rated+0.176
Lex+0.170
Neg logit contributions
 playing-0.126
 sharing-0.122
 being-0.119
 drawing-0.115
 acting-0.112
Token par
Token ID1582
Feature activation-9.02e+00

Pos logit contributions
 playing+0.001
 sharing+0.001
 being+0.001
 drawing+0.001
 acting+0.001
Neg logit contributions
 tighter-0.001
use-0.001
igi-0.001
rated-0.001
Lex-0.001
Tokenrot
Token ID10599
Feature activation-4.62e+00

Pos logit contributions
 playing+0.289
 being+0.272
 sharing+0.272
 teasing+0.262
 acting+0.261
Neg logit contributions
igi-0.274
use-0.264
 tighter-0.263
-0.254
Lex-0.251
Token said
Token ID531
Feature activation-3.74e+00

Pos logit contributions
 playing+0.128
 sharing+0.125
 being+0.123
 drawing+0.119
 acting+0.116
Neg logit contributions
igi-0.156
 tighter-0.156
use-0.154
rated-0.147
Lex-0.145
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
 playing+0.118
 sharing+0.116
 being+0.113
 drawing+0.111
 acting+0.111
Neg logit contributions
 tighter-0.112
use-0.105
igi-0.103
rated-0.097
-0.096
Token
Token ID198
Feature activation-4.13e-01

Pos logit contributions
 playing+0.029
 sharing+0.028
 being+0.027
 drawing+0.026
 acting+0.026
Neg logit contributions
 tighter-0.045
use-0.044
igi-0.044
rated-0.042
Lex-0.041
Token
Token ID198
Feature activation-3.70e-01

Pos logit contributions
 playing+0.006
 sharing+0.006
 being+0.006
 shooting+0.005
 drawing+0.005
Neg logit contributions
rated-0.020
 tighter-0.019
use-0.019
igi-0.018
 reins-0.018
Token was
Token ID373
Feature activation-2.01e+00

Pos logit contributions
 sharing+0.049
 playing+0.049
 drawing+0.047
 being+0.047
 teasing+0.046
Neg logit contributions
use-0.070
 tighter-0.069
-0.068
igi-0.068
Lex-0.066
Token a
Token ID257
Feature activation+1.19e+00

Pos logit contributions
 video+0.065
 sharing+0.056
 trading+0.055
 joke+0.054
 drawing+0.052
Neg logit contributions
Lex-0.056
igi-0.051
 tighter-0.049
 steep-0.045
 Lex-0.044
Token very
Token ID845
Feature activation+1.59e+00

Pos logit contributions
Lex+0.033
 tighter+0.033
use+0.032
+0.032
igi+0.032
Neg logit contributions
 sharing-0.036
 playing-0.036
 being-0.036
 having-0.034
 acting-0.033
Token talk
Token ID1561
Feature activation-2.08e+00

Pos logit contributions
 tighter+0.050
igi+0.050
use+0.050
Lex+0.047
rated+0.046
Neg logit contributions
 playing-0.046
 sharing-0.045
 being-0.045
 drawing-0.043
 acting-0.043
Tokenative
Token ID876
Feature activation-1.70e+00

Pos logit contributions
 playing+0.058
 drawing+0.058
 sharing+0.058
 acting+0.056
 being+0.056
Neg logit contributions
 tighter-0.062
igi-0.057
use-0.056
rated-0.056
 steep-0.055
Token par
Token ID1582
Feature activation-8.98e+00

Pos logit contributions
 playing+0.048
 being+0.044
 sharing+0.044
 doing+0.043
 joking+0.043
Neg logit contributions
 tighter-0.051
use-0.050
igi-0.049
.'"-0.048
-0.048
Tokenrot
Token ID10599
Feature activation-4.67e+00

Pos logit contributions
 playing+0.315
 being+0.303
 sharing+0.299
 doing+0.294
 acting+0.292
Neg logit contributions
igi-0.246
 tighter-0.226
Lex-0.225
use-0.221
',"-0.211
Token,
Token ID11
Feature activation-1.17e+00

Pos logit contributions
 sharing+0.167
 trading+0.166
 acting+0.164
 playing+0.163
 drawing+0.163
Neg logit contributions
 tighter-0.096
.'"-0.092
',"-0.082
use-0.082
",-0.080
Token and
Token ID290
Feature activation-4.38e-01

Pos logit contributions
 video+0.035
 drawing+0.034
 trading+0.033
 playing+0.033
 sharing+0.032
Neg logit contributions
 tighter-0.030
igi-0.027
use-0.026
Lex-0.026
 steep-0.026
Token she
Token ID673
Feature activation+2.30e+00

Pos logit contributions
 playing+0.012
 video+0.012
 drawing+0.011
 sharing+0.011
 acting+0.011
Neg logit contributions
 tighter-0.014
use-0.013
igi-0.013
Lex-0.012
-0.012
Token loved
Token ID6151
Feature activation+5.04e-02

Pos logit contributions
 tighter+0.076
use+0.074
igi+0.074
Lex+0.071
+0.071
Neg logit contributions
 sharing-0.060
 playing-0.060
 drawing-0.058
 being-0.057
 video-0.056
Token play
Token ID711
Feature activation-5.31e+00

Pos logit contributions
use+0.030
 tighter+0.030
igi+0.029
rated+0.028
+0.028
Neg logit contributions
 playing-0.011
 sharing-0.011
 being-0.010
 drawing-0.009
 acting-0.009
Token together
Token ID1978
Feature activation+9.03e-02

Pos logit contributions
 sharing+0.144
 being+0.141
 lending+0.139
 teasing+0.137
 trading+0.137
Neg logit contributions
-0.165
 tighter-0.148
use-0.144
 slowly-0.137
.'"-0.137
Token every
Token ID790
Feature activation+3.52e+00

Pos logit contributions
 tighter+0.003
use+0.003
igi+0.003
rated+0.003
Lex+0.003
Neg logit contributions
 playing-0.002
 sharing-0.002
 being-0.002
 drawing-0.002
 acting-0.002
Token day
Token ID1110
Feature activation-1.89e+00

Pos logit contributions
 tighter+0.135
use+0.133
igi+0.131
rated+0.127
Lex+0.124
Neg logit contributions
 playing-0.082
 sharing-0.080
 being-0.078
 drawing-0.075
 acting-0.073
Token.
Token ID13
Feature activation-6.99e-01

Pos logit contributions
 sharing+0.061
 playing+0.060
 being+0.059
 drawing+0.058
 acting+0.058
Neg logit contributions
 tighter-0.052
use-0.049
igi-0.048
-0.046
.'"-0.046
Token One
Token ID1881
Feature activation+1.02e+01

Pos logit contributions
 playing+0.020
 sharing+0.020
 being+0.020
 acting+0.019
 drawing+0.019
Neg logit contributions
use-0.022
 tighter-0.021
igi-0.021
-0.020
rated-0.020
Token day
Token ID1110
Feature activation-1.88e+00

Pos logit contributions
 tighter+0.318
rated+0.313
igi+0.296
use+0.289
+0.289
Neg logit contributions
 sharing-0.293
 being-0.285
 playing-0.284
 trading-0.282
 acting-0.281
Token,
Token ID11
Feature activation-9.25e-01

Pos logit contributions
 playing+0.046
 sharing+0.045
 being+0.043
 drawing+0.042
 acting+0.041
Neg logit contributions
 tighter-0.071
use-0.070
igi-0.069
rated-0.066
Lex-0.065
Token they
Token ID484
Feature activation-9.16e-01

Pos logit contributions
 sharing+0.024
 playing+0.024
 being+0.024
 drawing+0.023
 acting+0.023
Neg logit contributions
 tighter-0.032
use-0.031
igi-0.030
-0.029
rated-0.029
Token heard
Token ID2982
Feature activation+3.33e+00

Pos logit contributions
 playing+0.026
 sharing+0.025
 being+0.024
 drawing+0.024
 acting+0.023
Neg logit contributions
 tighter-0.031
use-0.031
igi-0.031
rated-0.029
Lex-0.029
Token about
Token ID546
Feature activation-4.00e+00

Pos logit contributions
 tighter+0.102
 eggs+0.089
use+0.089
+0.089
 colder+0.088
Neg logit contributions
 sharing-0.095
 playing-0.093
 acting-0.091
 being-0.091
 doing-0.091
Token a
Token ID257
Feature activation+1.22e+00

Pos logit contributions
 tighter+0.033
igi+0.031
Lex+0.029
use+0.029
rated+0.029
Neg logit contributions
 sharing-0.026
 video-0.024
 playing-0.024
 being-0.024
 drawing-0.024
Token bit
Token ID1643
Feature activation+4.88e+00

Pos logit contributions
 tighter+0.037
use+0.036
igi+0.036
rated+0.034
Lex+0.034
Neg logit contributions
 playing-0.038
 sharing-0.037
 being-0.036
 drawing-0.036
 acting-0.035
Token afraid
Token ID7787
Feature activation+5.82e-01

Pos logit contributions
igi+0.183
use+0.172
Lex+0.167
rated+0.162
 reins+0.161
Neg logit contributions
 playing-0.134
 sharing-0.125
 being-0.125
 teasing-0.121
 acting-0.119
Token.
Token ID13
Feature activation-6.87e-01

Pos logit contributions
 tighter+0.019
use+0.019
igi+0.018
rated+0.017
Lex+0.017
Neg logit contributions
 playing-0.017
 sharing-0.016
 being-0.016
 drawing-0.016
 acting-0.015
Token But
Token ID887
Feature activation+6.19e-01

Pos logit contributions
 playing+0.018
 sharing+0.018
 being+0.017
 drawing+0.017
 acting+0.016
Neg logit contributions
 tighter-0.024
use-0.024
igi-0.024
rated-0.023
Lex-0.022
Token soon
Token ID2582
Feature activation+8.24e+00

Pos logit contributions
 tighter+0.019
use+0.017
rated+0.016
 steep+0.016
 minutes+0.015
Neg logit contributions
 video-0.018
 sharing-0.018
 acting-0.017
 playing-0.017
 performing-0.016
Token enough
Token ID1576
Feature activation+6.59e+00

Pos logit contributions
 tighter+0.256
use+0.244
igi+0.232
+0.230
rated+0.230
Neg logit contributions
 sharing-0.232
 playing-0.229
 video-0.226
 acting-0.222
 being-0.221
Token,
Token ID11
Feature activation-9.56e-01

Pos logit contributions
 tighter+0.206
use+0.202
igi+0.190
+0.183
Lex+0.182
Neg logit contributions
 playing-0.184
 sharing-0.183
 being-0.177
 video-0.176
 acting-0.176
Token Sarah
Token ID10490
Feature activation-1.20e+00

Pos logit contributions
 playing+0.024
 sharing+0.024
 being+0.023
 acting+0.022
 drawing+0.022
Neg logit contributions
 tighter-0.035
use-0.033
igi-0.032
rated-0.031
Lex-0.031
Token's
Token ID338
Feature activation-2.72e+00

Pos logit contributions
 sharing+0.033
 playing+0.032
 being+0.031
 drawing+0.030
 acting+0.029
Neg logit contributions
 tighter-0.042
use-0.041
igi-0.040
rated-0.038
Lex-0.038
Token fear
Token ID3252
Feature activation-4.22e-01

Pos logit contributions
 sharing+0.088
 playing+0.085
 being+0.085
 drawing+0.081
 acting+0.081
Neg logit contributions
 tighter-0.078
use-0.072
igi-0.070
rated-0.069
 reins-0.068
Token shape
Token ID5485
Feature activation-1.13e+00

Pos logit contributions
 tighter+0.014
+0.012
use+0.012
rated+0.012
igi+0.012
Neg logit contributions
 playing-0.012
 sharing-0.012
 being-0.012
 acting-0.012
 doing-0.011
Token with
Token ID351
Feature activation-3.45e+00

Pos logit contributions
 playing+0.034
 sharing+0.034
 being+0.033
 teasing+0.033
 trading+0.032
Neg logit contributions
 tighter-0.034
 eggs-0.029
rier-0.028
bars-0.027
rated-0.027
Token little
Token ID1310
Feature activation+7.12e-01

Pos logit contributions
 playing+0.094
 sharing+0.093
 being+0.090
 acting+0.088
 drawing+0.087
Neg logit contributions
 tighter-0.118
use-0.112
igi-0.110
Lex-0.106
rated-0.105
Token bumps
Token ID34029
Feature activation+4.39e+00

Pos logit contributions
 tighter+0.020
use+0.019
igi+0.019
rated+0.018
Lex+0.018
Neg logit contributions
 playing-0.024
 sharing-0.024
 being-0.023
 drawing-0.022
 acting-0.022
Token on
Token ID319
Feature activation-1.75e+00

Pos logit contributions
 tighter+0.131
igi+0.117
use+0.116
rated+0.115
 eggs+0.111
Neg logit contributions
 sharing-0.136
 playing-0.135
 being-0.130
 video-0.130
 drawing-0.128
Token top
Token ID1353
Feature activation+9.65e+00

Pos logit contributions
 playing+0.048
 sharing+0.047
 being+0.046
 drawing+0.045
 acting+0.043
Neg logit contributions
 tighter-0.061
use-0.061
igi-0.060
rated-0.057
Lex-0.056
Token.
Token ID13
Feature activation-3.31e-01

Pos logit contributions
 tighter+0.324
use+0.322
igi+0.319
rated+0.303
Lex+0.299
Neg logit contributions
 playing-0.273
 sharing-0.266
 being-0.261
 drawing-0.254
 acting-0.247
Token
Token ID198
Feature activation+7.38e-02

Pos logit contributions
 playing+0.009
 sharing+0.009
 being+0.008
 drawing+0.008
 acting+0.008
Neg logit contributions
 tighter-0.012
use-0.012
igi-0.011
rated-0.011
Lex-0.011
Token
Token ID198
Feature activation+8.59e-02

Pos logit contributions
 tighter+0.003
use+0.003
rated+0.003
igi+0.003
 reins+0.003
Neg logit contributions
 playing-0.001
 sharing-0.001
 being-0.001
 drawing-0.001
 wearing-0.001
TokenWhen
Token ID2215
Feature activation+2.59e+00

Pos logit contributions
use+0.002
 tighter+0.002
igi+0.002
rated+0.002
Lex+0.002
Neg logit contributions
 playing-0.003
 sharing-0.003
 being-0.003
 drawing-0.003
 acting-0.003
Token Tim
Token ID5045
Feature activation-7.52e-01

Pos logit contributions
 tighter+0.099
use+0.095
igi+0.095
Lex+0.092
+0.092
Neg logit contributions
 sharing-0.058
 playing-0.057
 being-0.055
 drawing-0.054
 acting-0.054
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
 playing+0.005
 sharing+0.005
 being+0.005
 drawing+0.005
 acting+0.005
Neg logit contributions
 tighter-0.008
use-0.008
igi-0.008
rated-0.007
Lex-0.007
TokenSo
Token ID2396
Feature activation+2.27e+00

Pos logit contributions
 playing+0.006
 sharing+0.006
 being+0.006
 drawing+0.006
 acting+0.006
Neg logit contributions
 tighter-0.004
use-0.004
igi-0.004
rated-0.004
Lex-0.004
Token the
Token ID262
Feature activation+3.61e-01

Pos logit contributions
 tighter+0.076
use+0.069
+0.068
Lex+0.068
igi+0.067
Neg logit contributions
 playing-0.063
 sharing-0.062
 drawing-0.061
 being-0.060
 acting-0.059
Token yacht
Token ID43207
Feature activation-2.19e+00

Pos logit contributions
 tighter+0.011
use+0.011
igi+0.010
rated+0.010
+0.010
Neg logit contributions
 playing-0.011
 sharing-0.011
 drawing-0.010
 being-0.010
 acting-0.010
Token sailed
Token ID33835
Feature activation+1.34e+00

Pos logit contributions
 sharing+0.072
 playing+0.071
 drawing+0.069
 being+0.067
 teasing+0.067
Neg logit contributions
 tighter-0.064
igi-0.058
use-0.058
rated-0.057
Lex-0.054
Token slowly
Token ID6364
Feature activation+9.17e+00

Pos logit contributions
 tighter+0.039
use+0.036
igi+0.036
rated+0.035
Lex+0.034
Neg logit contributions
 playing-0.045
 sharing-0.044
 being-0.043
 drawing-0.043
 acting-0.042
Token into
Token ID656
Feature activation+1.81e+00

Pos logit contributions
 tighter+0.293
use+0.279
igi+0.267
rated+0.260
 steep+0.251
Neg logit contributions
 sharing-0.279
 playing-0.268
 being-0.259
 drawing-0.259
 video-0.256
Token the
Token ID262
Feature activation+3.83e-01

Pos logit contributions
 tighter+0.065
use+0.059
igi+0.058
Lex+0.058
 eggs+0.056
Neg logit contributions
 sharing-0.046
 playing-0.045
 being-0.042
 drawing-0.042
 teasing-0.042
Token part
Token ID636
Feature activation+3.31e+00

Pos logit contributions
 tighter+0.011
use+0.010
igi+0.010
rated+0.010
Lex+0.010
Neg logit contributions
 playing-0.013
 sharing-0.013
 being-0.013
 drawing-0.012
 acting-0.012
Token and
Token ID290
Feature activation+8.83e-02

Pos logit contributions
 tighter+0.104
igi+0.095
use+0.093
bars+0.090
rated+0.089
Neg logit contributions
 sharing-0.095
 playing-0.094
 being-0.091
 teasing-0.091
 acting-0.090
Token stopped
Token ID5025
Feature activation+4.64e+00

Pos logit contributions
 tighter+0.003
use+0.002
igi+0.002
rated+0.002
 colder+0.002
Neg logit contributions
 sharing-0.003
 playing-0.003
 drawing-0.003
 being-0.003
 video-0.003
Token who
Token ID508
Feature activation-2.29e+00

Pos logit contributions
 sharing+0.095
 playing+0.093
 being+0.093
 acting+0.089
 drawing+0.089
Neg logit contributions
 tighter-0.144
use-0.143
igi-0.140
-0.132
Lex-0.131
Token liked
Token ID8288
Feature activation-6.89e-01

Pos logit contributions
 sharing+0.060
 drawing+0.059
 performing+0.058
 teasing+0.057
 video+0.057
Neg logit contributions
use-0.077
rated-0.076
 tighter-0.075
-0.075
igi-0.071
Token to
Token ID284
Feature activation+3.92e-01

Pos logit contributions
 sharing+0.022
 acting+0.022
 drawing+0.021
 being+0.021
 playing+0.021
Neg logit contributions
 tighter-0.018
 eggs-0.016
use-0.016
 reins-0.015
 corn-0.015
Token run
Token ID1057
Feature activation+1.54e+00

Pos logit contributions
use+0.015
 tighter+0.014
+0.014
rated+0.014
igi+0.014
Neg logit contributions
 sharing-0.009
 playing-0.008
 being-0.008
 teasing-0.008
 shooting-0.008
Token.
Token ID13
Feature activation-6.08e-01

Pos logit contributions
 tighter+0.058
use+0.056
igi+0.054
rated+0.053
+0.052
Neg logit contributions
 sharing-0.037
 playing-0.037
 being-0.035
 drawing-0.035
 acting-0.034
Token One
Token ID1881
Feature activation+1.01e+01

Pos logit contributions
 playing+0.020
 sharing+0.019
 being+0.019
 drawing+0.018
 acting+0.018
Neg logit contributions
 tighter-0.018
use-0.018
igi-0.017
rated-0.016
Lex-0.016
Token day
Token ID1110
Feature activation-2.00e+00

Pos logit contributions
 tighter+0.317
rated+0.305
igi+0.290
 steep+0.288
use+0.287
Neg logit contributions
 sharing-0.295
 being-0.279
 acting-0.277
 trading-0.275
 playing-0.275
Token he
Token ID339
Feature activation+1.13e+00

Pos logit contributions
 playing+0.050
 sharing+0.050
 being+0.048
 drawing+0.047
 acting+0.046
Neg logit contributions
 tighter-0.072
use-0.072
igi-0.070
rated-0.067
Lex-0.067
Token was
Token ID373
Feature activation-1.83e+00

Pos logit contributions
 tighter+0.039
use+0.038
igi+0.038
rated+0.036
Lex+0.036
Neg logit contributions
 playing-0.031
 sharing-0.030
 being-0.029
 drawing-0.029
 acting-0.028
Token running
Token ID2491
Feature activation-3.59e-02

Pos logit contributions
 video+0.022
 sharing+0.020
 character+0.015
 Sharing+0.015
 trading+0.015
Neg logit contributions
 tighter-0.084
 colder-0.079
 steep-0.077
Lex-0.077
 lower-0.076
Token in
Token ID287
Feature activation-4.42e-01

Pos logit contributions
 playing+0.001
 sharing+0.001
 being+0.001
 drawing+0.001
 acting+0.001
Neg logit contributions
 tighter-0.001
use-0.001
igi-0.001
rated-0.001
Lex-0.001
Token Anna
Token ID11735
Feature activation+6.00e-01

Pos logit contributions
igi+0.163
 tighter+0.163
use+0.161
rated+0.154
Lex+0.147
Neg logit contributions
 playing-0.161
 sharing-0.155
 being-0.154
 drawing-0.150
 acting-0.146
Token asked
Token ID1965
Feature activation+3.65e+00

Pos logit contributions
igi+0.017
 tighter+0.017
use+0.017
rated+0.016
Lex+0.015
Neg logit contributions
 playing-0.020
 being-0.020
 sharing-0.019
 drawing-0.019
 acting-0.019
Token.
Token ID13
Feature activation-8.36e-01

Pos logit contributions
igi+0.141
use+0.139
rated+0.133
 tighter+0.130
Lex+0.128
Neg logit contributions
 playing-0.097
 being-0.091
 sharing-0.089
 drawing-0.087
 teasing-0.083
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
 playing+0.020
 sharing+0.019
 being+0.019
 drawing+0.018
 acting+0.018
Neg logit contributions
 tighter-0.032
use-0.032
igi-0.032
rated-0.030
Lex-0.030
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
 playing+0.004
 sharing+0.004
 being+0.004
 drawing+0.004
 acting+0.004
Neg logit contributions
 tighter-0.007
use-0.007
igi-0.007
rated-0.007
Lex-0.006
Token"
Token ID1
Feature activation+2.30e+00

Pos logit contributions
 playing+0.006
 sharing+0.006
 being+0.006
 drawing+0.006
 video+0.006
Neg logit contributions
igi-0.004
use-0.004
 tighter-0.003
rated-0.003
Lex-0.003
TokenNo
Token ID2949
Feature activation+1.08e+00

Pos logit contributions
igi+0.064
rated+0.063
use+0.062
 tighter+0.061
Lex+0.056
Neg logit contributions
 playing-0.077
 sharing-0.076
 being-0.076
 teasing-0.073
 acting-0.072
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
 tighter+0.042
igi+0.042
use+0.042
rated+0.040
Lex+0.039
Neg logit contributions
 playing-0.026
 sharing-0.025
 being-0.024
 drawing-0.023
 acting-0.023
Token sweet
Token ID6029
Feature activation-2.29e+00

Pos logit contributions
 playing+0.041
 sharing+0.040
 drawing+0.039
 being+0.039
 acting+0.038
Neg logit contributions
 tighter-0.033
use-0.031
igi-0.030
rated-0.029
Lex-0.028
Tokenie
Token ID494
Feature activation+6.71e-01

Pos logit contributions
 playing+0.085
 sharing+0.083
 being+0.082
 teasing+0.078
 doing+0.078
Neg logit contributions
 tighter-0.057
use-0.056
igi-0.053
rated-0.051
Lex-0.050
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
igi+0.024
 tighter+0.023
use+0.023
Lex+0.022
rated+0.022
Neg logit contributions
 playing-0.018
 being-0.017
 sharing-0.017
 drawing-0.016
 doing-0.016
Token She
Token ID1375
Feature activation+1.69e+00

Pos logit contributions
 playing+0.028
 sharing+0.026
 being+0.026
 drawing+0.025
 acting+0.024
Neg logit contributions
 tighter-0.040
igi-0.039
use-0.039
rated-0.039
Lex-0.036
Token closed
Token ID4838
Feature activation+4.48e+00

Pos logit contributions
use+0.060
igi+0.060
 tighter+0.059
rated+0.056
Lex+0.056
Neg logit contributions
 playing-0.045
 being-0.042
 sharing-0.042
 drawing-0.042
 shooting-0.040
Token her
Token ID607
Feature activation+4.12e-01

Pos logit contributions
use+0.179
igi+0.175
 tighter+0.175
rated+0.171
Lex+0.164
Neg logit contributions
 playing-0.103
 sharing-0.096
 being-0.096
 drawing-0.093
 teasing-0.088
Token eyes
Token ID2951
Feature activation+8.85e-01

Pos logit contributions
 tighter+0.013
use+0.013
igi+0.013
rated+0.012
Lex+0.012
Neg logit contributions
 playing-0.013
 sharing-0.012
 being-0.012
 drawing-0.012
 acting-0.012
Token and
Token ID290
Feature activation-1.15e-01

Pos logit contributions
igi+0.039
use+0.038
rated+0.036
Lex+0.035
 tighter+0.034
Neg logit contributions
 playing-0.018
 being-0.017
 drawing-0.016
 sharing-0.016
 shooting-0.016
Token tried
Token ID3088
Feature activation+3.48e+00

Pos logit contributions
 playing+0.003
 sharing+0.003
 being+0.003
 drawing+0.003
 acting+0.003
Neg logit contributions
 tighter-0.004
use-0.004
igi-0.003
rated-0.003
Lex-0.003
Token to
Token ID284
Feature activation+4.53e-01

Pos logit contributions
 tighter+0.108
use+0.099
igi+0.097
Lex+0.094
rated+0.091
Neg logit contributions
 sharing-0.103
 playing-0.102
 being-0.099
 video-0.098
 drawing-0.097
Token imagine
Token ID5967
Feature activation-2.67e+00

Pos logit contributions
 tighter+0.016
use+0.016
igi+0.015
rated+0.015
Lex+0.015
Neg logit contributions
 playing-0.012
 sharing-0.012
 being-0.011
 drawing-0.011
 acting-0.011
Token what
Token ID644
Feature activation-1.26e+00

Pos logit contributions
 sharing+0.073
 playing+0.072
 drawing+0.069
 being+0.068
 video+0.068
Neg logit contributions
 tighter-0.089
use-0.084
igi-0.083
rated-0.078
Lex-0.078
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
 playing+0.036
 sharing+0.036
 drawing+0.034
 acting+0.034
 being+0.034
Neg logit contributions
 tighter-0.040
use-0.037
igi-0.036
Lex-0.035
rated-0.035
Token would
Token ID561
Feature activation+2.77e+00

Pos logit contributions
 tighter+0.037
use+0.037
rated+0.036
igi+0.035
+0.034
Neg logit contributions
 sharing-0.024
 playing-0.024
 drawing-0.022
 being-0.022
 teasing-0.021
Token the
Token ID262
Feature activation+2.18e-01

Pos logit contributions
 tighter+0.011
use+0.011
igi+0.011
rated+0.010
Lex+0.010
Neg logit contributions
 playing-0.005
 sharing-0.005
 being-0.005
 drawing-0.005
 acting-0.005
Token wolf
Token ID17481
Feature activation+4.97e-01

Pos logit contributions
igi+0.007
use+0.007
 tighter+0.007
rated+0.006
',"+0.006
Neg logit contributions
 playing-0.007
 being-0.006
 teasing-0.006
 sharing-0.006
 video-0.006
Token.
Token ID13
Feature activation-9.54e-01

Pos logit contributions
 tighter+0.012
use+0.011
.'"+0.010
igi+0.010
rated+0.010
Neg logit contributions
 sharing-0.017
 playing-0.017
 drawing-0.017
 trading-0.016
 shooting-0.016
Token He
Token ID679
Feature activation+4.91e-01

Pos logit contributions
 playing+0.024
 sharing+0.023
 being+0.023
 drawing+0.022
 acting+0.021
Neg logit contributions
 tighter-0.035
use-0.035
igi-0.034
rated-0.033
Lex-0.032
Token said
Token ID531
Feature activation-3.62e+00

Pos logit contributions
 tighter+0.017
use+0.016
igi+0.016
rated+0.015
Lex+0.015
Neg logit contributions
 playing-0.014
 sharing-0.014
 being-0.013
 drawing-0.013
 acting-0.012
Token "
Token ID366
Feature activation+1.50e+00

Pos logit contributions
 playing+0.097
 sharing+0.095
 drawing+0.093
 acting+0.093
 being+0.092
Neg logit contributions
 tighter-0.122
use-0.116
igi-0.115
Lex-0.111
rated-0.107
TokenHi
Token ID17250
Feature activation-4.49e-01

Pos logit contributions
 tighter+0.040
igi+0.039
use+0.038
rated+0.035
Lex+0.035
Neg logit contributions
 playing-0.051
 being-0.050
 sharing-0.050
 shooting-0.048
 teasing-0.047
Token!
Token ID0
Feature activation-1.40e+00

Pos logit contributions
 playing+0.012
 sharing+0.011
 being+0.011
 teasing+0.010
 doing+0.010
Neg logit contributions
 tighter-0.017
use-0.016
igi-0.016
rated-0.016
bars-0.015
Token I
Token ID314
Feature activation+3.11e+00

Pos logit contributions
 playing+0.044
 sharing+0.043
 being+0.042
 drawing+0.042
 acting+0.042
Neg logit contributions
use-0.042
 tighter-0.039
-0.039
igi-0.039
Lex-0.038
Token'm
Token ID1101
Feature activation-2.23e+00

Pos logit contributions
use+0.086
+0.077
igi+0.074
rated+0.073
 tighter+0.073
Neg logit contributions
 sharing-0.108
 playing-0.107
 being-0.106
 drawing-0.102
 teasing-0.101
Token Tim
Token ID5045
Feature activation-8.23e-01

Pos logit contributions
 video+0.062
 sharing+0.061
 drawing+0.058
 trading+0.057
 acting+0.055
Neg logit contributions
Lex-0.064
igi-0.063
 tighter-0.059
?",-0.058
 steep-0.058
Token blanket
Token ID18447
Feature activation+1.29e-02

Pos logit contributions
 tighter+0.003
use+0.003
igi+0.003
rated+0.003
Lex+0.003
Neg logit contributions
 playing-0.003
 sharing-0.003
 being-0.003
 drawing-0.003
 acting-0.003
Token and
Token ID290
Feature activation-1.54e-01

Pos logit contributions
 tighter+0.000
use+0.000
igi+0.000
+0.000
rated+0.000
Neg logit contributions
 sharing-0.000
 playing-0.000
 acting-0.000
 being-0.000
 trading-0.000
Token used
Token ID973
Feature activation+3.73e+00

Pos logit contributions
 playing+0.004
 drawing+0.004
 being+0.003
 sharing+0.003
 wearing+0.003
Neg logit contributions
use-0.006
igi-0.006
rated-0.006
 tighter-0.006
.'"-0.006
Token it
Token ID340
Feature activation+1.07e+00

Pos logit contributions
 tighter+0.130
use+0.123
igi+0.122
Lex+0.116
rated+0.115
Neg logit contributions
 playing-0.101
 sharing-0.100
 being-0.097
 acting-0.094
 drawing-0.094
Token to
Token ID284
Feature activation+5.38e-01

Pos logit contributions
 tighter+0.034
use+0.032
igi+0.031
rated+0.030
Lex+0.029
Neg logit contributions
 sharing-0.032
 playing-0.032
 being-0.031
 acting-0.030
 drawing-0.030
Token cover
Token ID3002
Feature activation+1.66e+00

Pos logit contributions
 tighter+0.018
use+0.017
igi+0.017
Lex+0.016
rated+0.016
Neg logit contributions
 playing-0.015
 sharing-0.015
 being-0.015
 drawing-0.014
 teasing-0.014
Token herself
Token ID5223
Feature activation+2.09e+00

Pos logit contributions
use+0.068
igi+0.065
 tighter+0.065
rated+0.064
Lex+0.062
Neg logit contributions
 playing-0.037
 sharing-0.036
 being-0.035
 drawing-0.033
 acting-0.031
Token.
Token ID13
Feature activation-6.10e-01

Pos logit contributions
 tighter+0.079
igi+0.075
use+0.075
rated+0.074
Lex+0.071
Neg logit contributions
 playing-0.049
 sharing-0.049
 being-0.047
 teasing-0.045
 drawing-0.045
Token Her
Token ID2332
Feature activation+9.12e-01

Pos logit contributions
 playing+0.017
 sharing+0.016
 being+0.016
 drawing+0.015
 acting+0.015
Neg logit contributions
 tighter-0.021
use-0.021
igi-0.021
rated-0.020
Lex-0.019
Token mom
Token ID1995
Feature activation+5.16e+00

Pos logit contributions
 tighter+0.023
igi+0.021
use+0.021
+0.020
Lex+0.020
Neg logit contributions
 sharing-0.033
 playing-0.032
 being-0.032
 acting-0.031
 shooting-0.031
Token found
Token ID1043
Feature activation-3.15e-03

Pos logit contributions
 tighter+0.170
igi+0.156
use+0.150
Lex+0.147
rated+0.146
Neg logit contributions
 playing-0.139
 sharing-0.138
 being-0.137
 acting-0.134
 shooting-0.133
Token,
Token ID11
Feature activation-7.98e-01

Pos logit contributions
 playing+0.102
 sharing+0.101
 acting+0.100
 video+0.099
 being+0.097
Neg logit contributions
 tighter-0.093
use-0.087
Lex-0.087
-0.086
",-0.086
Token "
Token ID366
Feature activation+1.65e+00

Pos logit contributions
 sharing+0.021
 playing+0.021
 video+0.020
 drawing+0.020
 trading+0.019
Neg logit contributions
 tighter-0.026
use-0.024
Lex-0.024
igi-0.024
-0.023
TokenMine
Token ID24461
Feature activation+2.33e+00

Pos logit contributions
 tighter+0.046
use+0.044
igi+0.044
rated+0.042
Lex+0.040
Neg logit contributions
 playing-0.055
 sharing-0.054
 being-0.054
 drawing-0.052
 acting-0.051
Token!"
Token ID2474
Feature activation+2.96e+00

Pos logit contributions
 tighter+0.057
igi+0.055
use+0.054
Lex+0.051
rated+0.050
Neg logit contributions
 playing-0.091
 sharing-0.088
 being-0.088
 drawing-0.085
 doing-0.084
Token He
Token ID679
Feature activation+5.09e-01

Pos logit contributions
 tighter+0.093
use+0.087
+0.086
igi+0.083
rated+0.083
Neg logit contributions
 sharing-0.090
 playing-0.086
 being-0.083
 drawing-0.082
 teasing-0.081
Token ran
Token ID4966
Feature activation+4.42e+00

Pos logit contributions
 tighter+0.017
use+0.016
igi+0.016
rated+0.016
Lex+0.015
Neg logit contributions
 sharing-0.014
 playing-0.014
 being-0.014
 drawing-0.013
 acting-0.013
Token towards
Token ID3371
Feature activation+1.18e+00

Pos logit contributions
 tighter+0.123
igi+0.099
use+0.099
 steep+0.099
 lower+0.099
Neg logit contributions
 sharing-0.159
 playing-0.153
 drawing-0.148
 acting-0.148
 being-0.147
Token the
Token ID262
Feature activation+4.11e-01

Pos logit contributions
 tighter+0.038
use+0.036
igi+0.034
Lex+0.033
rated+0.032
Neg logit contributions
 sharing-0.034
 playing-0.032
 acting-0.032
 being-0.031
 doing-0.031
Token shiny
Token ID22441
Feature activation-1.18e+00

Pos logit contributions
 tighter+0.011
use+0.011
igi+0.011
rated+0.010
Lex+0.010
Neg logit contributions
 playing-0.014
 sharing-0.014
 being-0.014
 drawing-0.013
 acting-0.013
Token object
Token ID2134
Feature activation-1.86e+00

Pos logit contributions
 playing+0.034
 sharing+0.032
 drawing+0.032
 being+0.032
 shooting+0.032
Neg logit contributions
use-0.040
igi-0.040
 tighter-0.040
Lex-0.038
rated-0.037
Token but
Token ID475
Feature activation-5.79e-01

Pos logit contributions
 sharing+0.059
 playing+0.057
 acting+0.055
 being+0.055
 drawing+0.055
Neg logit contributions
 tighter-0.056
use-0.047
 steep-0.046
bars-0.046
igi-0.046
Token back
Token ID736
Feature activation+4.66e+00

Pos logit contributions
 tighter+0.122
use+0.111
igi+0.111
 steep+0.102
 reins+0.101
Neg logit contributions
 sharing-0.130
 being-0.123
 playing-0.122
 video-0.120
 acting-0.120
Token to
Token ID284
Feature activation+5.59e-01

Pos logit contributions
 tighter+0.160
use+0.157
igi+0.157
rated+0.151
Lex+0.147
Neg logit contributions
 playing-0.132
 sharing-0.126
 being-0.124
 drawing-0.121
 acting-0.118
Token the
Token ID262
Feature activation+2.49e-01

Pos logit contributions
 tighter+0.016
 corn+0.014
 slowly+0.014
 lower+0.014
use+0.014
Neg logit contributions
 video-0.014
 sharing-0.014
od-0.014
opot-0.013
 acting-0.013
Token plastic
Token ID7309
Feature activation-1.18e+00

Pos logit contributions
 tighter+0.008
use+0.007
igi+0.007
+0.007
Lex+0.007
Neg logit contributions
 sharing-0.008
 playing-0.008
 being-0.007
 acting-0.007
 drawing-0.007
Token box
Token ID3091
Feature activation-1.86e+00

Pos logit contributions
 playing+0.038
 sharing+0.038
 being+0.037
 drawing+0.036
 acting+0.036
Neg logit contributions
 tighter-0.034
use-0.034
igi-0.033
rated-0.031
-0.031
Token and
Token ID290
Feature activation-6.74e-02

Pos logit contributions
 sharing+0.052
 playing+0.052
 being+0.050
 drawing+0.049
 acting+0.049
Neg logit contributions
 tighter-0.062
use-0.058
igi-0.057
rated-0.055
-0.053
Token found
Token ID1043
Feature activation+3.80e-02

Pos logit contributions
 sharing+0.002
 video+0.002
 being+0.002
 acting+0.002
 playing+0.002
Neg logit contributions
 tighter-0.002
use-0.002
igi-0.002
rated-0.002
 slowly-0.002
Token something
Token ID1223
Feature activation+1.24e+00

Pos logit contributions
 tighter+0.001
igi+0.001
use+0.001
Lex+0.001
rated+0.001
Neg logit contributions
 playing-0.001
 sharing-0.001
 being-0.001
 acting-0.001
 drawing-0.001
Token new
Token ID649
Feature activation-1.34e+00

Pos logit contributions
 tighter+0.035
 steep+0.030
 colder+0.029
 cheaper+0.028
 thicker+0.028
Neg logit contributions
 sharing-0.039
 drawing-0.038
 playing-0.037
 video-0.037
 acting-0.037
Token.
Token ID13
Feature activation-7.98e-01

Pos logit contributions
 sharing+0.041
 playing+0.041
 being+0.039
 drawing+0.039
 video+0.038
Neg logit contributions
 tighter-0.041
use-0.039
igi-0.039
rated-0.037
-0.035
Token It
Token ID632
Feature activation-2.08e+00

Pos logit contributions
 playing+0.020
 sharing+0.020
 being+0.019
 drawing+0.019
 acting+0.018
Neg logit contributions
 tighter-0.029
use-0.029
igi-0.029
rated-0.027
Lex-0.027
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
 tighter+0.032
use+0.029
rated+0.029
.'"+0.027
+0.027
Neg logit contributions
 sharing-0.038
 playing-0.037
 drawing-0.037
 teasing-0.036
 shooting-0.036
Token\n
Token ID198
Feature activation-5.05e-01

Pos logit contributions
 playing+0.038
 sharing+0.038
 being+0.037
 drawing+0.036
 acting+0.035
Neg logit contributions
 tighter-0.047
use-0.046
igi-0.046
rated-0.044
Lex-0.043
Token\n
Token ID198
Feature activation-4.54e-01

Pos logit contributions
 playing+0.012
 sharing+0.011
 being+0.011
 drawing+0.010
 acting+0.010
Neg logit contributions
 tighter-0.020
use-0.020
igi-0.019
rated-0.019
Lex-0.018
TokenThen
Token ID6423
Feature activation+3.13e+00

Pos logit contributions
 playing+0.019
 sharing+0.019
 being+0.019
 drawing+0.018
 acting+0.018
Neg logit contributions
 tighter-0.009
use-0.008
igi-0.008
rated-0.008
Lex-0.007
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
 tighter+0.105
use+0.102
igi+0.100
Lex+0.096
+0.095
Neg logit contributions
 sharing-0.085
 playing-0.084
 being-0.082
 drawing-0.080
 acting-0.079
Token Spot
Token ID15899
Feature activation-1.67e+00

Pos logit contributions
 sharing+0.034
 playing+0.034
 being+0.033
 drawing+0.032
 acting+0.032
Neg logit contributions
 tighter-0.044
use-0.043
igi-0.043
Lex-0.041
rated-0.040
Token had
Token ID550
Feature activation-1.88e+00

Pos logit contributions
 sharing+0.049
 playing+0.047
 being+0.046
 drawing+0.045
 shooting+0.044
Neg logit contributions
use-0.053
 tighter-0.053
igi-0.050
rated-0.049
Lex-0.048
Token an
Token ID281
Feature activation+2.55e+00

Pos logit contributions
 playing+0.050
 sharing+0.049
 being+0.048
 drawing+0.046
 acting+0.045
Neg logit contributions
 tighter-0.068
igi-0.067
use-0.067
rated-0.065
Lex-0.062
Token idea
Token ID2126
Feature activation-2.46e+00

Pos logit contributions
 tighter+0.082
use+0.078
igi+0.077
rated+0.077
Lex+0.073
Neg logit contributions
 playing-0.074
 sharing-0.074
 being-0.070
 drawing-0.069
 acting-0.068
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 sharing+0.095
 drawing+0.091
 video+0.090
 playing+0.090
 trading+0.089
Neg logit contributions
 tighter-0.050
use-0.048
igi-0.043
 minutes-0.042
',"-0.042
Token He
Token ID679
Feature activation+3.64e-01

Pos logit contributions
 playing+0.033
 sharing+0.032
 drawing+0.032
 being+0.031
 acting+0.031
Neg logit contributions
 tighter-0.033
use-0.031
igi-0.031
Lex-0.029
-0.029
Token runs
Token ID4539
Feature activation+2.68e+00

Pos logit contributions
 sharing+0.011
 drawing+0.010
 video+0.010
 playing+0.010
 trading+0.010
Neg logit contributions
 tighter-0.010
use-0.010
-0.009
 steep-0.009
Lex-0.009
Token at
Token ID379
Feature activation-1.63e+00

Pos logit contributions
 tighter+0.074
use+0.072
igi+0.071
rated+0.067
Lex+0.065
Neg logit contributions
 playing-0.092
 sharing-0.091
 being-0.089
 drawing-0.087
 acting-0.086
Token them
Token ID606
Feature activation-5.20e-01

Pos logit contributions
 playing+0.038
 being+0.036
 sharing+0.034
 drawing+0.032
 wearing+0.032
Neg logit contributions
 tighter-0.065
use-0.064
rated-0.063
igi-0.062
bars-0.059
Token.
Token ID13
Feature activation-4.94e-01

Pos logit contributions
 sharing+0.018
 playing+0.018
 drawing+0.017
 being+0.017
 video+0.017
Neg logit contributions
 tighter-0.013
use-0.012
.'"-0.011
igi-0.011
rated-0.011
Token Sara
Token ID24799
Feature activation+1.99e+00

Pos logit contributions
 playing+0.012
 sharing+0.011
 being+0.011
 wearing+0.011
 having+0.010
Neg logit contributions
 tighter-0.019
rated-0.019
use-0.019
igi-0.019
 hinges-0.017
Token and
Token ID290
Feature activation-1.50e-01

Pos logit contributions
 tighter+0.067
use+0.066
igi+0.064
rated+0.062
Lex+0.061
Neg logit contributions
 playing-0.055
 sharing-0.055
 being-0.053
 drawing-0.052
 acting-0.051
Token Ben
Token ID3932
Feature activation-1.71e+00

Pos logit contributions
 sharing+0.004
 video+0.004
 drawing+0.004
 shooting+0.004
 playing+0.004
Neg logit contributions
 tighter-0.005
use-0.005
-0.004
Lex-0.004
igi-0.004
Token are
Token ID389
Feature activation-3.70e+00

Pos logit contributions
 sharing+0.054
 drawing+0.053
 playing+0.051
 video+0.050
 performing+0.050
Neg logit contributions
use-0.053
 tighter-0.052
-0.049
rated-0.047
 lower-0.045
Token scared
Token ID12008
Feature activation+5.40e-01

Pos logit contributions
 video+0.149
 drawing+0.138
on+0.136
 lending+0.135
 performing+0.135
Neg logit contributions
 tighter-0.071
Lex-0.059
 steep-0.057
 slowly-0.056
 lower-0.056
Token.
Token ID13
Feature activation-5.55e-01

Pos logit contributions
use+0.020
igi+0.020
 tighter+0.019
rated+0.019
Lex+0.019
Neg logit contributions
 playing-0.015
 being-0.014
 sharing-0.014
 teasing-0.013
 drawing-0.013
Token\n
Token ID198
Feature activation+1.53e-01

Pos logit contributions
 playing+0.013
 sharing+0.012
 being+0.012
 drawing+0.011
 acting+0.011
Neg logit contributions
 tighter-0.022
use-0.022
igi-0.021
rated-0.021
Lex-0.020
Token Lily
Token ID20037
Feature activation-7.20e-01

Pos logit contributions
 sharing+0.004
 playing+0.004
 acting+0.004
 being+0.004
 doing+0.004
Neg logit contributions
 tighter-0.005
igi-0.005
 eggs-0.005
use-0.005
Lex-0.005
Token's
Token ID338
Feature activation-2.78e+00

Pos logit contributions
 trading+0.020
 video+0.020
 sharing+0.020
 celebrating+0.019
 Zero+0.019
Neg logit contributions
 tighter-0.022
 eggs-0.019
 slowly-0.017
 lower-0.017
use-0.017
Token hand
Token ID1021
Feature activation+3.05e+00

Pos logit contributions
 sharing+0.092
 playing+0.091
 being+0.089
 acting+0.086
 trading+0.085
Neg logit contributions
 tighter-0.079
use-0.072
igi-0.071
Lex-0.068
 reins-0.067
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
 tighter+0.100
use+0.090
igi+0.089
rated+0.086
 eggs+0.084
Neg logit contributions
 sharing-0.091
 playing-0.088
 being-0.086
 drawing-0.085
 trading-0.084
Token\n
Token ID198
Feature activation-3.44e-01

Pos logit contributions
 playing+0.037
 sharing+0.036
 being+0.035
 drawing+0.034
 acting+0.033
Neg logit contributions
 tighter-0.051
use-0.050
igi-0.049
rated-0.049
Lex-0.046
Token\n
Token ID198
Feature activation-3.09e-01

Pos logit contributions
 playing+0.006
 being+0.005
 sharing+0.005
 drawing+0.005
 wearing+0.005
Neg logit contributions
rated-0.016
 tighter-0.016
use-0.016
igi-0.015
ited-0.015
Token"
Token ID1
Feature activation+2.35e+00

Pos logit contributions
 playing+0.011
 sharing+0.010
 being+0.010
 drawing+0.010
 doing+0.009
Neg logit contributions
rated-0.009
 tighter-0.009
use-0.008
igi-0.008
 reins-0.008
TokenWhat
Token ID2061
Feature activation-1.47e+00

Pos logit contributions
rated+0.062
 tighter+0.061
igi+0.061
use+0.059
 steep+0.055
Neg logit contributions
 sharing-0.079
 playing-0.079
 being-0.078
 video-0.077
 teasing-0.076
Token happened
Token ID3022
Feature activation-2.35e+00

Pos logit contributions
 showing+0.035
 playing+0.035
 teasing+0.035
 acting+0.035
 drawing+0.034
Neg logit contributions
-0.046
use-0.046
 tighter-0.044
?",-0.043
',"-0.042
Token?"
Token ID1701
Feature activation+5.09e+00

Pos logit contributions
 playing+0.078
 sharing+0.078
 being+0.076
 drawing+0.074
 acting+0.073
Neg logit contributions
use-0.064
 tighter-0.064
igi-0.062
rated-0.058
Lex-0.058
Token she
Token ID673
Feature activation+2.33e+00

Pos logit contributions
 tighter+0.164
use+0.162
igi+0.161
rated+0.157
Lex+0.147
Neg logit contributions
 playing-0.156
 sharing-0.149
 being-0.148
 drawing-0.144
 acting-0.140
Token's
Token ID338
Feature activation-2.48e+00

Pos logit contributions
 tighter+0.180
use+0.179
igi+0.173
rated+0.172
+0.170
Neg logit contributions
 playing-0.066
 sharing-0.066
 drawing-0.060
 being-0.059
 acting-0.057
Token play
Token ID711
Feature activation-5.00e+00

Pos logit contributions
 sharing+0.064
 playing+0.064
 being+0.062
 acting+0.059
 drawing+0.059
Neg logit contributions
use-0.085
 tighter-0.084
-0.080
igi-0.080
Lex-0.079
Token".
Token ID1911
Feature activation+2.39e+00

Pos logit contributions
 sharing+0.136
 playing+0.135
 being+0.131
 drawing+0.127
 acting+0.127
Neg logit contributions
 tighter-0.164
use-0.160
-0.155
igi-0.153
',"-0.150
Token The
Token ID383
Feature activation+3.77e-01

Pos logit contributions
 tighter+0.077
use+0.075
igi+0.071
+0.070
Lex+0.070
Neg logit contributions
 sharing-0.070
 playing-0.069
 being-0.067
 drawing-0.065
 acting-0.065
Token short
Token ID1790
Feature activation+4.38e+00

Pos logit contributions
igi+0.012
use+0.012
 tighter+0.011
rated+0.011
Lex+0.011
Neg logit contributions
 playing-0.012
 being-0.011
 sharing-0.011
 drawing-0.011
 teasing-0.011
Token girl
Token ID2576
Feature activation-2.10e+00

Pos logit contributions
igi+0.131
use+0.130
 tighter+0.126
Lex+0.120
rated+0.116
Neg logit contributions
 playing-0.145
 sharing-0.139
 being-0.138
 drawing-0.137
 teasing-0.132
Token replied
Token ID8712
Feature activation-2.42e+00

Pos logit contributions
 sharing+0.064
 playing+0.061
 being+0.060
 doing+0.058
 drawing+0.058
Neg logit contributions
 tighter-0.067
use-0.061
igi-0.059
rated-0.059
-0.057
Token,
Token ID11
Feature activation-8.56e-01

Pos logit contributions
 sharing+0.069
 playing+0.068
 video+0.068
 acting+0.067
 drawing+0.067
Neg logit contributions
 tighter-0.077
use-0.064
rier-0.063
rated-0.063
',"-0.063
Token "
Token ID366
Feature activation+1.73e+00

Pos logit contributions
 playing+0.019
 sharing+0.019
 drawing+0.017
 being+0.017
 video+0.017
Neg logit contributions
 tighter-0.033
use-0.031
igi-0.030
Lex-0.030
-0.030
TokenOk
Token ID18690
Feature activation+3.40e+00

Pos logit contributions
igi+0.049
 tighter+0.048
use+0.048
rated+0.046
Lex+0.043
Neg logit contributions
 playing-0.056
 sharing-0.055
 being-0.055
 drawing-0.052
 teasing-0.052
Token!"
Token ID2474
Feature activation+2.88e+00

Pos logit contributions
igi+0.120
 tighter+0.120
use+0.120
rated+0.112
Lex+0.112
Neg logit contributions
 playing-0.093
 sharing-0.090
 being-0.089
 drawing-0.086
 doing-0.083
Token He
Token ID679
Feature activation+2.58e-01

Pos logit contributions
 playing+0.016
 sharing+0.015
 being+0.015
 drawing+0.014
 acting+0.014
Neg logit contributions
igi-0.022
use-0.022
 tighter-0.022
Lex-0.020
rated-0.020
Token has
Token ID468
Feature activation-2.03e+00

Pos logit contributions
igi+0.009
 tighter+0.008
use+0.008
Lex+0.008
rated+0.008
Neg logit contributions
 playing-0.008
 sharing-0.007
 being-0.007
 drawing-0.007
 acting-0.007
Token to
Token ID284
Feature activation+2.98e-01

Pos logit contributions
 playing+0.049
 being+0.047
 sharing+0.045
 drawing+0.044
 doing+0.043
Neg logit contributions
igi-0.087
use-0.080
rated-0.080
 tighter-0.079
Lex-0.079
Token write
Token ID3551
Feature activation-3.66e+00

Pos logit contributions
 tighter+0.010
use+0.010
igi+0.010
rated+0.010
Lex+0.009
Neg logit contributions
 playing-0.008
 sharing-0.008
 being-0.007
 drawing-0.007
 acting-0.007
Token a
Token ID257
Feature activation+8.47e-01

Pos logit contributions
 playing+0.086
 sharing+0.084
 being+0.081
 drawing+0.079
 acting+0.077
Neg logit contributions
 tighter-0.140
use-0.138
igi-0.137
rated-0.131
Lex-0.129
Token story
Token ID1621
Feature activation-6.56e+00

Pos logit contributions
 tighter+0.029
use+0.028
igi+0.028
rated+0.027
Lex+0.026
Neg logit contributions
 playing-0.023
 sharing-0.023
 being-0.022
 drawing-0.021
 acting-0.021
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
 playing+0.132
 sharing+0.126
 being+0.121
 acting+0.117
 teasing+0.117
Neg logit contributions
 tighter-0.272
use-0.261
igi-0.254
rated-0.253
-0.246
Token do
Token ID466
Feature activation-7.10e+00

Pos logit contributions
 playing+0.037
 sharing+0.037
 being+0.035
 teasing+0.035
 drawing+0.035
Neg logit contributions
 tighter-0.043
use-0.040
igi-0.037
-0.037
 reins-0.037
Token some
Token ID617
Feature activation+4.75e+00

Pos logit contributions
 playing+0.182
 sharing+0.180
 being+0.170
 teasing+0.170
 drawing+0.168
Neg logit contributions
 tighter-0.241
use-0.233
igi-0.226
-0.222
rated-0.219
Token math
Token ID10688
Feature activation+7.18e-01

Pos logit contributions
 tighter+0.152
use+0.144
+0.134
igi+0.134
rated+0.133
Neg logit contributions
 playing-0.134
 sharing-0.133
 being-0.128
 drawing-0.125
 acting-0.124
Token,
Token ID11
Feature activation-1.33e+00

Pos logit contributions
 tighter+0.023
use+0.022
+0.020
igi+0.020
rated+0.020
Neg logit contributions
 playing-0.021
 sharing-0.021
 being-0.020
 teasing-0.019
 drawing-0.019
Token worked
Token ID3111
Feature activation+3.30e+00

Pos logit contributions
 tighter+0.088
use+0.086
igi+0.086
rated+0.082
Lex+0.081
Neg logit contributions
 playing-0.065
 sharing-0.063
 being-0.061
 drawing-0.060
 acting-0.058
Token on
Token ID319
Feature activation-1.76e+00

Pos logit contributions
use+0.110
 tighter+0.108
igi+0.108
rated+0.103
Lex+0.101
Neg logit contributions
 playing-0.097
 sharing-0.093
 being-0.093
 drawing-0.089
 acting-0.087
Token her
Token ID607
Feature activation+4.95e-01

Pos logit contributions
 playing+0.043
 sharing+0.043
 being+0.041
 drawing+0.040
 acting+0.039
Neg logit contributions
 tighter-0.065
use-0.063
igi-0.062
rated-0.059
Lex-0.059
Token pet
Token ID4273
Feature activation+9.32e-02

Pos logit contributions
 tighter+0.016
use+0.015
igi+0.015
rated+0.015
Lex+0.014
Neg logit contributions
 playing-0.015
 sharing-0.015
 being-0.015
 drawing-0.014
 acting-0.014
Token,
Token ID11
Feature activation-8.84e-01

Pos logit contributions
 tighter+0.003
use+0.003
igi+0.003
rated+0.002
Lex+0.002
Neg logit contributions
 playing-0.003
 sharing-0.003
 being-0.003
 drawing-0.003
 acting-0.003
Token talking
Token ID3375
Feature activation-5.29e+00

Pos logit contributions
 video+0.024
 sharing+0.023
 playing+0.022
 trading+0.022
 drawing+0.022
Neg logit contributions
 tighter-0.029
use-0.026
igi-0.026
 eggs-0.026
Lex-0.025
Token to
Token ID284
Feature activation+6.94e-01

Pos logit contributions
 playing+0.155
 sharing+0.155
 being+0.150
 acting+0.148
 drawing+0.148
Neg logit contributions
 tighter-0.164
igi-0.150
use-0.148
rated-0.147
Lex-0.140
Token it
Token ID340
Feature activation+1.18e+00

Pos logit contributions
 tighter+0.025
use+0.024
igi+0.024
rated+0.023
Lex+0.023
Neg logit contributions
 playing-0.017
 sharing-0.017
 being-0.016
 drawing-0.016
 acting-0.015
Token as
Token ID355
Feature activation+3.65e+00

Pos logit contributions
 tighter+0.036
use+0.033
rated+0.032
igi+0.032
Lex+0.030
Neg logit contributions
 sharing-0.037
 playing-0.037
 being-0.035
 drawing-0.035
 teasing-0.034
Token she
Token ID673
Feature activation+2.38e+00

Pos logit contributions
use+0.143
 tighter+0.130
igi+0.130
rated+0.129
',"+0.123
Neg logit contributions
 playing-0.102
 sharing-0.096
 being-0.094
 drawing-0.092
 shooting-0.088
Token spread
Token ID4104
Feature activation+2.27e+00

Pos logit contributions
 tighter+0.085
igi+0.084
use+0.083
rated+0.081
Lex+0.079
Neg logit contributions
 playing-0.064
 sharing-0.060
 being-0.060
 drawing-0.059
 acting-0.057
Token together
Token ID1978
Feature activation-2.84e-02

Pos logit contributions
 drawing+0.166
 sharing+0.162
 lending+0.158
 acting+0.157
 doing+0.157
Neg logit contributions
 tighter-0.176
-0.164
use-0.155
 slowly-0.155
suits-0.154
Token.
Token ID13
Feature activation-7.27e-01

Pos logit contributions
 video+0.001
 drawing+0.001
 sharing+0.001
 acting+0.001
 playing+0.001
Neg logit contributions
 tighter-0.001
.'"-0.001
 slowly-0.001
use-0.001
 steep-0.001
Token It
Token ID632
Feature activation-2.16e+00

Pos logit contributions
 playing+0.021
 sharing+0.021
 being+0.020
 teasing+0.020
 acting+0.020
Neg logit contributions
 tighter-0.023
use-0.022
igi-0.022
rated-0.022
-0.021
Token was
Token ID373
Feature activation-1.80e+00

Pos logit contributions
 sharing+0.053
 playing+0.053
 drawing+0.053
 being+0.048
 trading+0.048
Neg logit contributions
 tighter-0.079
rated-0.076
use-0.074
igi-0.074
 colder-0.072
Token a
Token ID257
Feature activation+1.13e+00

Pos logit contributions
 drawing+0.045
 playing+0.044
 sharing+0.044
 video+0.044
 being+0.043
Neg logit contributions
 tighter-0.062
igi-0.060
Lex-0.058
rated-0.057
 steep-0.056
Token wonderful
Token ID7932
Feature activation-4.63e+00

Pos logit contributions
 tighter+0.035
use+0.033
igi+0.033
Lex+0.032
+0.032
Neg logit contributions
 playing-0.034
 sharing-0.033
 being-0.033
 drawing-0.032
 acting-0.031
Token day
Token ID1110
Feature activation-1.99e+00

Pos logit contributions
 drawing+0.137
 sharing+0.137
 being+0.136
 playing+0.135
 acting+0.131
Neg logit contributions
-0.124
rated-0.122
 tighter-0.121
igi-0.120
 steep-0.115
Token with
Token ID351
Feature activation-3.59e+00

Pos logit contributions
 sharing+0.054
 video+0.054
 drawing+0.051
 acting+0.051
 teasing+0.051
Neg logit contributions
 tighter-0.061
use-0.058
 minutes-0.056
igi-0.056
-0.054
Token lots
Token ID6041
Feature activation+5.48e+00

Pos logit contributions
 sharing+0.092
 drawing+0.091
 acting+0.091
 playing+0.090
 being+0.090
Neg logit contributions
 tighter-0.119
igi-0.114
Lex-0.109
 eggs-0.109
use-0.108
Token of
Token ID286
Feature activation-2.28e+00

Pos logit contributions
 tighter+0.195
use+0.189
igi+0.188
rated+0.178
Lex+0.176
Neg logit contributions
 playing-0.142
 sharing-0.139
 being-0.136
 drawing-0.134
 acting-0.131
Token fun
Token ID1257
Feature activation-2.52e+00

Pos logit contributions
 sharing+0.069
 playing+0.068
 being+0.067
 drawing+0.066
 acting+0.064
Neg logit contributions
 tighter-0.069
use-0.067
igi-0.067
Lex-0.063
rated-0.062
Token family
Token ID1641
Feature activation-2.57e+00

Pos logit contributions
 sharing+0.021
 being+0.021
 acting+0.020
 playing+0.020
 doing+0.019
Neg logit contributions
 tighter-0.012
igi-0.011
use-0.011
 eggs-0.011
Lex-0.011
Token.
Token ID13
Feature activation-4.95e-01

Pos logit contributions
 sharing+0.082
 playing+0.082
 being+0.081
 drawing+0.079
 acting+0.079
Neg logit contributions
 tighter-0.073
use-0.069
igi-0.068
-0.063
rated-0.062
Token He
Token ID679
Feature activation+6.60e-01

Pos logit contributions
 being+0.014
 performing+0.014
 acting+0.014
 sharing+0.014
 drawing+0.014
Neg logit contributions
 tighter-0.013
rated-0.013
-0.013
use-0.013
 steep-0.013
Token loved
Token ID6151
Feature activation+4.93e-02

Pos logit contributions
 tighter+0.020
use+0.020
igi+0.019
+0.018
rated+0.018
Neg logit contributions
 sharing-0.020
 being-0.019
 playing-0.019
 video-0.019
 teasing-0.018
Token to
Token ID284
Feature activation+5.11e-01

Pos logit contributions
 eggs+0.001
 corn+0.001
 pans+0.001
igi+0.001
 oats+0.001
Neg logit contributions
 video-0.002
 character-0.002
 teasing-0.002
 sharing-0.002
 acting-0.002
Token play
Token ID711
Feature activation-5.38e+00

Pos logit contributions
use+0.019
 tighter+0.018
igi+0.018
rated+0.018
+0.018
Neg logit contributions
 sharing-0.012
 being-0.011
 playing-0.011
 teasing-0.011
 acting-0.011
Token in
Token ID287
Feature activation-3.32e-01

Pos logit contributions
 sharing+0.135
 being+0.134
 teasing+0.132
 lending+0.130
 drawing+0.130
Neg logit contributions
-0.162
 minutes-0.155
 tighter-0.154
use-0.154
 slowly-0.152
Token the
Token ID262
Feature activation+2.24e-01

Pos logit contributions
 playing+0.007
 sharing+0.007
 being+0.007
 drawing+0.007
 acting+0.006
Neg logit contributions
 tighter-0.013
use-0.013
igi-0.013
Lex-0.012
rated-0.012
Token sand
Token ID6450
Feature activation-7.91e-01

Pos logit contributions
 tighter+0.006
use+0.006
igi+0.006
+0.005
rated+0.005
Neg logit contributions
 sharing-0.007
 playing-0.007
 being-0.007
 drawing-0.007
 acting-0.007
Token and
Token ID290
Feature activation-1.80e-01

Pos logit contributions
 playing+0.020
 sharing+0.020
 being+0.019
 drawing+0.019
 acting+0.018
Neg logit contributions
 tighter-0.029
use-0.028
igi-0.028
rated-0.027
Lex-0.026
Token splash
Token ID22870
Feature activation-4.90e-01

Pos logit contributions
 sharing+0.004
 playing+0.004
 being+0.004
 video+0.004
 acting+0.004
Neg logit contributions
 tighter-0.006
use-0.006
igi-0.006
rated-0.006
-0.006
Token it
Token ID340
Feature activation+1.09e+00

Pos logit contributions
 playing+0.082
 video+0.081
 drawing+0.081
 acting+0.079
 shooting+0.078
Neg logit contributions
 tighter-0.095
use-0.081
 eggs-0.081
 sugar-0.080
 colder-0.079
Token's
Token ID338
Feature activation-2.75e+00

Pos logit contributions
 tighter+0.033
rated+0.031
Maybe+0.030
use+0.029
+0.028
Neg logit contributions
 playing-0.032
 drawing-0.032
 sharing-0.032
 trading-0.030
 teasing-0.029
Token always
Token ID1464
Feature activation+1.17e+00

Pos logit contributions
 playing+0.084
 sharing+0.082
 drawing+0.080
 being+0.080
 acting+0.078
Neg logit contributions
 tighter-0.086
igi-0.082
use-0.082
rated-0.078
Lex-0.077
Token better
Token ID1365
Feature activation+3.31e+00

Pos logit contributions
 tighter+0.036
use+0.033
rated+0.032
igi+0.030
+0.029
Neg logit contributions
 playing-0.036
 drawing-0.036
 video-0.035
 sharing-0.035
 acting-0.034
Token to
Token ID284
Feature activation+4.86e-01

Pos logit contributions
igi+0.116
 tighter+0.115
use+0.114
Lex+0.110
rated+0.110
Neg logit contributions
 playing-0.093
 sharing-0.089
 being-0.089
 drawing-0.084
 acting-0.083
Token do
Token ID466
Feature activation-6.96e+00

Pos logit contributions
use+0.016
 tighter+0.016
igi+0.015
Lex+0.015
+0.015
Neg logit contributions
 playing-0.013
 drawing-0.013
 sharing-0.012
 being-0.012
 video-0.012
Token the
Token ID262
Feature activation+8.95e-02

Pos logit contributions
 playing+0.209
 sharing+0.206
 drawing+0.201
 video+0.201
 shooting+0.196
Neg logit contributions
 tighter-0.198
use-0.188
-0.175
',"-0.173
igi-0.171
Token right
Token ID826
Feature activation+8.66e-01

Pos logit contributions
 tighter+0.003
use+0.003
rated+0.003
+0.002
Lex+0.002
Neg logit contributions
 playing-0.003
 sharing-0.003
 being-0.002
 wearing-0.002
 drawing-0.002
Token thing
Token ID1517
Feature activation-1.34e+00

Pos logit contributions
 tighter+0.031
use+0.030
igi+0.030
Lex+0.029
rated+0.028
Neg logit contributions
 playing-0.023
 sharing-0.023
 being-0.022
 drawing-0.022
 teasing-0.021
Token.
Token ID13
Feature activation-9.54e-01

Pos logit contributions
 sharing+0.042
 playing+0.042
 drawing+0.040
 being+0.040
 acting+0.040
Neg logit contributions
 tighter-0.039
use-0.038
igi-0.036
rated-0.034
-0.034
Token<|endoftext|>
Token ID50256
Feature activation+2.09e+00

Pos logit contributions
 playing+0.026
 sharing+0.025
 being+0.024
 drawing+0.024
 acting+0.023
Neg logit contributions
 tighter-0.033
use-0.033
igi-0.032
rated-0.031
Lex-0.031