TOP ACTIVATIONS
MAX = 15.091

Oh , Mike ! Bro ccoli is a vegetable ! It
ruined the broccoli ! Bro ccoli is good for you .
on your plate . Bro ccoli is not that bad .
. " Bro ccoli is not y ucky ,
. " Bro ccoli is what we have for
and stay well . Bro ccoli is your friend , not
to eat broccoli . Bro ccoli was green and bitter and
Mom says , " Bro ccoli is good for you .
try new things . Bro ccoli is not that bad .
. " Bro ccoli is not our friend ,
. " Bro ccoli is y ucky !" Ben
the necklace and the magn ifying glass and started to examine
He took out a magn ifying glass and a notebook and
, handing him the magn ifying glass . She looked around
, next to her magn ifying glass . She looked at
to the man " Bro ccoli is y ummy ! We
the watch with the magn ifying glass . He saw the
it she had a magn ifying glass , a tiny torch
She takes out a magn ifying glass . She looks at
He brought his magn ifying glass and looked closely .

MIN ACTIVATIONS
MIN = -17.586

." They t ipt o ed to the bush
few minutes , then t ipt o ed back to the
car had a party . Many friends came to celebrate .
the jar . She t ipt o ed up to the
the baby . They t ipt o ed inside and put
tree in a park . Many animals lived in the tree
was a big race . Many kids ran in the race
, and then they t ipt o ed to the table
naughty idea . She t ipt o ed to his bed
it was too late . Many people were sad and angry
sour ! â € . Many people heard her and started
long , long time . Many people came to see the
jumping over logs , t ipt oe ing around rocks ,
landed on the street . Many birds and dogs came and
made flour from wheat . Many people came to see the
rushed toward the fire . Many of the people said ,
searching for a lesson . Many times , what we â
of trees and flowers . Many humans came to play and
. He t ipt oes to her bed and
at the sports ground . Many children were playing and running

INTERVAL 6.921 - 15.091
CONTAINS 1.698%

, he began to work on the roof . He hammered
ina . M ina lived on a beautiful piece of land
down and pe cks Tom on the head . Tom cries
eyes , put his hands on his hips , and roared
tall man . He replied , " Yes ,

INTERVAL -1.248 - 6.921
CONTAINS 69.109%

They were happy and excited . They reached the
So , the mouse leaned against the chim ney
doll and I can play with your ball ."
home !" From that day on , Benny
big fan near the pond . The fan is blowing air

INTERVAL -9.417 - -1.248
CONTAINS 29.124%

be a scared y - cat . It will be fun
Then , they heard a sound coming from the other side
the sun shone , things began to dry . Jack was
the shelf . It has many pictures and stories . He
The little girl saw a lot of her favorite cartoon

INTERVAL -17.586 - -9.417
CONTAINS 0.069%

the skull to explore . As he sw am inside ,
trip to the park . As soon as they arrived ,
decided to explore and sw am far away . The big
They spl ashed and sw am and had fun . They
the bridge . He sw am up to it and called
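
NOTE ON THE INTERVALS
The four INTERVAL ranges above appear to be equal-width bins spanning MIN = -17.586 to MAX = 15.091, and the four CONTAINS percentages appear to sum to 100%. The sketch below checks that arithmetic. It is an assumption drawn from the printed numbers, not a description of how the dashboard was actually generated; the names `bin_edges`, `N_BINS`, and `printed_edges` are hypothetical.

```python
# Values copied from the dump above.
MIN_ACT = -17.586
MAX_ACT = 15.091
N_BINS = 4  # assumption: four equal-width bins, one per INTERVAL block

# Interval boundaries and shares as printed in the dump, lowest to highest.
printed_edges = [-17.586, -9.417, -1.248, 6.921, 15.091]
printed_shares = [0.069, 29.124, 69.109, 1.698]  # CONTAINS percentages


def bin_edges(lo, hi, n):
    """Return the n + 1 edges of n equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n
    return [lo + i * width for i in range(n + 1)]


edges = bin_edges(MIN_ACT, MAX_ACT, N_BINS)

# The printed values are rounded to three decimals, so compare with a
# tolerance of 0.001 rather than expecting exact equality.
max_gap = max(abs(e - p) for e, p in zip(edges, printed_edges))
total_share = sum(printed_shares)

print("computed edges:", [round(e, 4) for e in edges])
print("largest gap to printed edges:", max_gap)
print("sum of CONTAINS percentages:", total_share)
```

If the equal-width assumption holds, the largest gap stays below 0.001 and the shares total 100%, which would mean about 98% of sampled activations fall in the two middle intervals.
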
TokenOh
Token ID5812
Feature activation+2.71e+00

Pos logit contributions
kin+0.051
ppers+0.050
ccoli+0.047
 rinse+0.047
edient+0.046
Neg logit contributions
ted-0.062
 Everyone-0.057
 Many-0.057
 People-0.056
 All-0.050
Token,
Token ID11
Feature activation+7.71e-01

Pos logit contributions
kin+0.170
ppers+0.166
ccoli+0.161
 promise+0.156
 rinse+0.153
Neg logit contributions
ted-0.172
 Many-0.157
 Everyone-0.156
 People-0.152
 All-0.135
Token Mike
Token ID4995
Feature activation-3.27e+00

Pos logit contributions
kin+0.053
ppers+0.053
ccoli+0.050
 rinse+0.049
 promise+0.049
Neg logit contributions
ted-0.043
 Many-0.040
 Everyone-0.040
 People-0.039
 All-0.034
Token!
Token ID0
Feature activation-7.68e-01

Pos logit contributions
ted+0.162
 Everyone+0.147
 People+0.141
 Many+0.134
heart+0.129
Neg logit contributions
kin-0.237
ppers-0.234
 rinse-0.231
 afloat-0.229
 achieve-0.223
Token Bro
Token ID2806
Feature activation+6.13e+00

Pos logit contributions
ted+0.045
 Many+0.038
 Everyone+0.037
 People+0.036
 winds+0.034
Neg logit contributions
ccoli-0.050
kin-0.049
ppers-0.049
 promise-0.045
 rinse-0.045
Tokenccoli
Token ID34544
Feature activation+1.51e+01

Pos logit contributions
kin+0.176
 afloat+0.169
ppers+0.166
 rinse+0.158
 achieve+0.156
Neg logit contributions
ted-0.568
 Everyone-0.547
 People-0.536
 Many-0.523
 All-0.507
Token is
Token ID318
Feature activation+1.33e+00

Pos logit contributions
ppers+0.874
kin+0.864
 afloat+0.808
ccoli+0.797
 rinse+0.790
Neg logit contributions
ted-0.999
 Everyone-0.941
 People-0.912
 Many-0.895
 All-0.847
Token a
Token ID257
Feature activation-1.92e+00

Pos logit contributions
ppers+0.091
kin+0.090
ccoli+0.084
 afloat+0.084
illas+0.083
Neg logit contributions
ted-0.072
 Everyone-0.068
 Many-0.067
 People-0.066
 All-0.060
Token vegetable
Token ID20236
Feature activation+5.39e+00

Pos logit contributions
 Everyone+0.105
 Many+0.104
ted+0.104
 People+0.100
 All+0.094
Neg logit contributions
ppers-0.130
illas-0.120
kin-0.117
kins-0.113
 afloat-0.113
Token!
Token ID0
Feature activation-8.27e-01

Pos logit contributions
ppers+0.350
kin+0.348
ccoli+0.331
 afloat+0.328
 rinse+0.325
Neg logit contributions
ted-0.307
 Everyone-0.288
 Many-0.278
 People-0.278
 All-0.253
Token It
Token ID632
Feature activation-3.31e+00

Pos logit contributions
ted+0.045
 Many+0.038
 Everyone+0.036
 People+0.036
ly+0.034
Neg logit contributions
ccoli-0.057
ppers-0.055
kin-0.055
 rinse-0.053
 promise-0.051
Token ruined
Token ID20484
Feature activation+7.80e+00

Pos logit contributions
ted+0.026
 Everyone+0.024
 Many+0.024
 People+0.024
 All+0.022
Neg logit contributions
kin-0.016
ppers-0.016
ccoli-0.015
 rinse-0.014
 afloat-0.014
Token the
Token ID262
Feature activation+8.70e-01

Pos logit contributions
kin+0.516
ppers+0.513
 afloat+0.480
cher+0.474
 achieve+0.473
Neg logit contributions
ted-0.459
 Everyone-0.438
 Many-0.419
 People-0.412
 All-0.383
Token broccoli
Token ID44653
Feature activation+2.85e+00

Pos logit contributions
ppers+0.056
kin+0.054
ccoli+0.051
edient+0.050
 achieve+0.050
Neg logit contributions
ted-0.053
 Everyone-0.050
 Many-0.050
 People-0.048
 All-0.044
Token!
Token ID0
Feature activation-8.72e-01

Pos logit contributions
ppers+0.208
kin+0.200
ccoli+0.200
 rinse+0.197
eps+0.196
Neg logit contributions
ted-0.131
 Everyone-0.122
 People-0.113
 Many-0.112
 All-0.108
Token Bro
Token ID2806
Feature activation+5.98e+00

Pos logit contributions
ted+0.052
 Many+0.044
 Everyone+0.043
 People+0.043
ly+0.039
Neg logit contributions
kin-0.057
ppers-0.056
ccoli-0.055
edient-0.052
 rinse-0.052
Tokenccoli
Token ID34544
Feature activation+1.50e+01

Pos logit contributions
 afloat+0.147
 achieve+0.146
 rinse+0.138
kin+0.134
ppers+0.131
Neg logit contributions
ted-0.569
 Everyone-0.544
 People-0.528
 Many-0.522
 All-0.511
Token is
Token ID318
Feature activation+1.23e+00

Pos logit contributions
ppers+0.957
kin+0.932
 afloat+0.901
ccoli+0.883
 rinse+0.876
Neg logit contributions
ted-0.905
 Everyone-0.828
 People-0.789
 Many-0.771
 All-0.749
Token good
Token ID922
Feature activation+6.41e+00

Pos logit contributions
kin+0.070
ppers+0.069
illas+0.068
 achieve+0.065
 afloat+0.065
Neg logit contributions
ted-0.077
 Everyone-0.073
 Many-0.073
 People-0.070
 All-0.067
Token for
Token ID329
Feature activation-2.41e+00

Pos logit contributions
kin+0.421
ppers+0.416
ccoli+0.395
 afloat+0.384
 promise+0.382
Neg logit contributions
ted-0.382
 Many-0.354
 Everyone-0.353
 People-0.345
 All-0.308
Token you
Token ID345
Feature activation-3.55e+00

Pos logit contributions
ted+0.122
 Everyone+0.114
 Many+0.113
 People+0.109
 All+0.097
Neg logit contributions
kin-0.176
ppers-0.175
ccoli-0.168
 afloat-0.165
edient-0.162
Token.
Token ID13
Feature activation+6.06e-01

Pos logit contributions
ted+0.145
 Everyone+0.144
 Many+0.139
 People+0.129
 All+0.126
Neg logit contributions
kin-0.276
ppers-0.265
ccoli-0.265
 rinse-0.260
illas-0.259
Token on
Token ID319
Feature activation+6.89e+00

Pos logit contributions
ppers+0.065
kin+0.063
illas+0.061
 afloat+0.061
 achieve+0.061
Neg logit contributions
ted-0.072
 Everyone-0.065
 People-0.063
 Many-0.062
 All-0.058
Token your
Token ID534
Feature activation+4.60e+00

Pos logit contributions
kin+0.442
ppers+0.421
ccoli+0.404
 rinse+0.400
kins+0.395
Neg logit contributions
ted-0.427
 Many-0.398
 Everyone-0.393
 People-0.387
 All-0.345
Token plate
Token ID7480
Feature activation+5.36e+00

Pos logit contributions
ppers+0.276
kin+0.275
ccoli+0.256
 afloat+0.253
 rinse+0.250
Neg logit contributions
ted-0.294
 Many-0.275
 Everyone-0.274
 People-0.268
 All-0.242
Token.
Token ID13
Feature activation+4.49e-01

Pos logit contributions
ppers+0.348
kin+0.341
ccoli+0.332
 afloat+0.328
 rinse+0.323
Neg logit contributions
ted-0.312
 Everyone-0.295
 Many-0.287
 People-0.285
 All-0.260
Token Bro
Token ID2806
Feature activation+5.92e+00

Pos logit contributions
kin+0.038
ppers+0.038
ccoli+0.037
 rinse+0.036
 promise+0.035
Neg logit contributions
ted-0.018
 Many-0.016
 Everyone-0.016
 People-0.015
 All-0.013
Tokenccoli
Token ID34544
Feature activation+1.49e+01

Pos logit contributions
 afloat+0.116
 achieve+0.111
kin+0.099
ppers+0.097
 rinse+0.085
Neg logit contributions
ted-0.599
 Everyone-0.566
 People-0.557
 Many-0.541
heart-0.534
Token is
Token ID318
Feature activation+1.15e+00

Pos logit contributions
ppers+0.885
kin+0.852
 afloat+0.830
 rinse+0.795
 achieve+0.794
Neg logit contributions
ted-0.964
 Everyone-0.887
 People-0.860
 Many-0.833
 All-0.802
Token not
Token ID407
Feature activation+4.93e+00

Pos logit contributions
kin+0.071
ppers+0.070
illas+0.069
ucer+0.067
cher+0.066
Neg logit contributions
ted-0.066
 Everyone-0.064
 Many-0.063
 People-0.062
 Every-0.059
Token that
Token ID326
Feature activation-1.16e+00

Pos logit contributions
kin+0.338
ppers+0.329
illas+0.319
edient+0.317
 afloat+0.316
Neg logit contributions
ted-0.259
 Many-0.247
 Everyone-0.246
 People-0.237
 All-0.215
Token bad
Token ID2089
Feature activation+5.14e+00

Pos logit contributions
ted+0.075
 Everyone+0.071
 Many+0.070
 People+0.069
 All+0.064
Neg logit contributions
ppers-0.071
kin-0.070
 afloat-0.065
ccoli-0.063
edient-0.063
Token.
Token ID13
Feature activation+5.96e-01

Pos logit contributions
kin+0.312
ppers+0.300
ccoli+0.294
 promise+0.283
 rinse+0.282
Neg logit contributions
ted-0.329
 Many-0.309
 Everyone-0.300
 People-0.296
 All-0.266
Token.
Token ID13
Feature activation+7.27e-01

Pos logit contributions
ted+0.071
 Everyone+0.062
 Many+0.060
 People+0.059
 All+0.056
Neg logit contributions
kin-0.118
ppers-0.114
Jackson-0.111
ccoli-0.110
 afloat-0.110
Token
Token ID198
Feature activation+5.88e-02

Pos logit contributions
kin+0.051
ppers+0.050
ccoli+0.049
 rinse+0.048
 achieve+0.047
Neg logit contributions
ted-0.039
 Many-0.034
 Everyone-0.034
 People-0.033
 All-0.029
Token
Token ID198
Feature activation+4.46e-02

Pos logit contributions
kin+0.004
ppers+0.004
ccoli+0.004
 afloat+0.004
 rinse+0.004
Neg logit contributions
ted-0.003
 Everyone-0.003
 Many-0.003
 People-0.003
 All-0.003
Token"
Token ID1
Feature activation+2.54e+00

Pos logit contributions
ccoli+0.003
kin+0.003
 afloat+0.003
ppers+0.003
 Claus+0.003
Neg logit contributions
 Many-0.003
 Everyone-0.003
ted-0.003
 People-0.002
 All-0.002
TokenBro
Token ID15783
Feature activation+4.53e+00

Pos logit contributions
kin+0.153
ppers+0.153
ccoli+0.144
 afloat+0.144
 promise+0.140
Neg logit contributions
ted-0.168
 Many-0.155
 Everyone-0.155
 People-0.151
 All-0.139
Tokenccoli
Token ID34544
Feature activation+1.49e+01

Pos logit contributions
kin+0.181
 afloat+0.168
 achieve+0.162
 rinse+0.161
ppers+0.158
Neg logit contributions
ted-0.377
 People-0.357
 Everyone-0.356
 Many-0.355
 All-0.327
Token is
Token ID318
Feature activation+1.18e+00

Pos logit contributions
ppers+0.919
kin+0.887
 afloat+0.872
 rinse+0.859
 achieve+0.853
Neg logit contributions
ted-0.902
 Everyone-0.828
 People-0.797
 Many-0.766
 All-0.746
Token not
Token ID407
Feature activation+4.85e+00

Pos logit contributions
ppers+0.071
kin+0.071
illas+0.069
ucer+0.067
cher+0.067
Neg logit contributions
ted-0.069
 Everyone-0.067
 Many-0.067
 People-0.066
 Every-0.062
Token y
Token ID331
Feature activation-4.78e-01

Pos logit contributions
kin+0.330
ppers+0.323
illas+0.311
edient+0.307
 achieve+0.305
Neg logit contributions
ted-0.252
 Everyone-0.243
 Many-0.242
 People-0.235
 All-0.216
Tokenucky
Token ID5309
Feature activation-2.17e+00

Pos logit contributions
ted+0.019
 Many+0.017
 Everyone+0.017
 People+0.016
 All+0.013
Neg logit contributions
kin-0.041
ppers-0.041
ccoli-0.039
 afloat-0.039
 rinse-0.038
Token,
Token ID11
Feature activation+4.98e-01

Pos logit contributions
ted+0.127
 Many+0.118
 Everyone+0.117
 People+0.114
 All+0.103
Neg logit contributions
kin-0.143
ppers-0.140
ccoli-0.134
 afloat-0.131
 rinse-0.131
Token.
Token ID13
Feature activation+7.29e-01

Pos logit contributions
ted+0.091
 Everyone+0.083
 Many+0.081
 People+0.080
 All+0.073
Neg logit contributions
kin-0.127
ppers-0.121
 afloat-0.119
ccoli-0.117
 rinse-0.115
Token
Token ID198
Feature activation+9.11e-02

Pos logit contributions
kin+0.051
ppers+0.051
ccoli+0.048
 afloat+0.047
 promise+0.047
Neg logit contributions
ted-0.040
 Everyone-0.037
 Many-0.037
 People-0.036
 All-0.032
Token
Token ID198
Feature activation+9.35e-02

Pos logit contributions
kin+0.006
ppers+0.006
ccoli+0.006
 afloat+0.006
 rinse+0.006
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.005
 People-0.005
 All-0.004
Token"
Token ID1
Feature activation+2.44e+00

Pos logit contributions
kin+0.006
ccoli+0.006
 afloat+0.006
cher+0.006
ppers+0.006
Neg logit contributions
 Many-0.006
 Everyone-0.005
ted-0.005
 People-0.005
 All-0.005
TokenBro
Token ID15783
Feature activation+4.37e+00

Pos logit contributions
kin+0.142
ppers+0.142
 afloat+0.138
ccoli+0.136
 achieve+0.132
Neg logit contributions
ted-0.167
 Many-0.152
 Everyone-0.151
 People-0.148
 All-0.136
Tokenccoli
Token ID34544
Feature activation+1.48e+01

Pos logit contributions
kin+0.188
 afloat+0.183
 rinse+0.178
 achieve+0.175
ppers+0.172
Neg logit contributions
ted-0.338
 People-0.323
 Everyone-0.322
 Many-0.321
 All-0.293
Token is
Token ID318
Feature activation+1.14e+00

Pos logit contributions
ppers+0.899
kin+0.881
 afloat+0.852
 rinse+0.844
 achieve+0.828
Neg logit contributions
ted-0.901
 Everyone-0.833
 People-0.801
 Many-0.777
 All-0.749
Token what
Token ID644
Feature activation+3.95e+00

Pos logit contributions
kin+0.069
ppers+0.067
illas+0.066
ucer+0.065
edient+0.064
Neg logit contributions
ted-0.065
 Many-0.064
 Everyone-0.064
 People-0.063
 Every-0.059
Token we
Token ID356
Feature activation-1.01e+00

Pos logit contributions
kin+0.292
ppers+0.286
ccoli+0.277
 rinse+0.268
 promise+0.267
Neg logit contributions
ted-0.204
 Many-0.184
 Everyone-0.184
 People-0.178
 All-0.156
Token have
Token ID423
Feature activation+5.86e+00

Pos logit contributions
ted+0.060
 Many+0.057
 Everyone+0.056
 People+0.055
 All+0.051
Neg logit contributions
kin-0.066
ppers-0.062
ccoli-0.061
 afloat-0.059
edient-0.058
Token for
Token ID329
Feature activation-2.51e+00

Pos logit contributions
kin+0.399
ppers+0.394
ccoli+0.376
 afloat+0.366
 rinse+0.365
Neg logit contributions
ted-0.333
 Everyone-0.307
 Many-0.307
 People-0.299
 All-0.267
Token and
Token ID290
Feature activation-2.09e-01

Pos logit contributions
kin+0.050
ppers+0.048
ccoli+0.047
 afloat+0.046
 rinse+0.045
Neg logit contributions
ted-0.034
 Everyone-0.031
 People-0.030
 Many-0.029
 All-0.026
Token stay
Token ID2652
Feature activation+1.67e+00

Pos logit contributions
ted+0.014
 Many+0.013
 Everyone+0.013
 People+0.012
 All+0.011
Neg logit contributions
kin-0.013
ppers-0.012
ccoli-0.011
 afloat-0.011
edient-0.011
Token well
Token ID880
Feature activation+4.27e+00

Pos logit contributions
kin+0.091
ccoli+0.090
 globe+0.089
Jackson+0.088
ppers+0.088
Neg logit contributions
ted-0.100
 as-0.094
 Everyone-0.094
 Many-0.089
 People-0.088
Token.
Token ID13
Feature activation+6.02e-01

Pos logit contributions
kin+0.303
ppers+0.299
ccoli+0.288
illas+0.280
 rinse+0.279
Neg logit contributions
ted-0.210
 Everyone-0.194
 Many-0.190
 People-0.185
ly-0.173
Token Bro
Token ID2806
Feature activation+5.79e+00

Pos logit contributions
kin+0.039
ppers+0.039
ccoli+0.039
 rinse+0.037
 promise+0.036
Neg logit contributions
ted-0.035
 Many-0.030
 Everyone-0.030
 People-0.029
ly-0.027
Tokenccoli
Token ID34544
Feature activation+1.47e+01

Pos logit contributions
 achieve+0.159
kin+0.158
 afloat+0.153
 rinse+0.146
ppers+0.137
Neg logit contributions
ted-0.540
 Everyone-0.504
 People-0.496
 Many-0.484
heart-0.479
Token is
Token ID318
Feature activation+1.05e+00

Pos logit contributions
ppers+0.899
kin+0.875
 afloat+0.834
ccoli+0.817
 rinse+0.816
Neg logit contributions
ted-0.933
 Everyone-0.864
 People-0.834
 Many-0.809
 All-0.775
Token your
Token ID534
Feature activation+4.56e+00

Pos logit contributions
ppers+0.060
kin+0.059
illas+0.056
ucer+0.055
 afloat+0.054
Neg logit contributions
ted-0.066
 Everyone-0.065
 Many-0.065
 People-0.063
 Every-0.059
Token friend
Token ID1545
Feature activation-1.31e+00

Pos logit contributions
kin+0.277
ppers+0.273
ccoli+0.259
 afloat+0.251
 rinse+0.250
Neg logit contributions
ted-0.293
 Everyone-0.272
 Many-0.272
 People-0.266
 All-0.241
Token,
Token ID11
Feature activation+4.19e-01

Pos logit contributions
ted+0.073
 Everyone+0.068
 Many+0.068
 People+0.066
 All+0.060
Neg logit contributions
kin-0.088
ppers-0.088
ccoli-0.083
 afloat-0.083
 rinse-0.082
Token not
Token ID407
Feature activation+4.74e+00

Pos logit contributions
ppers+0.028
kin+0.028
 afloat+0.027
edient+0.026
ccoli+0.026
Neg logit contributions
ted-0.023
 Everyone-0.022
 Many-0.021
 People-0.021
 All-0.019
Token to
Token ID284
Feature activation-3.10e-01

Pos logit contributions
kin+0.052
ppers+0.051
ccoli+0.051
 rinse+0.049
kins+0.047
Neg logit contributions
ted-0.042
 Everyone-0.039
 Many-0.039
 People-0.038
 All-0.033
Token eat
Token ID4483
Feature activation+2.25e-01

Pos logit contributions
 Many+0.017
 Everyone+0.017
ted+0.017
 People+0.016
 All+0.015
Neg logit contributions
ppers-0.020
kin-0.020
ccoli-0.019
cher-0.019
ucer-0.019
Token broccoli
Token ID44653
Feature activation+2.68e+00

Pos logit contributions
kin+0.017
ppers+0.017
ccoli+0.016
 afloat+0.016
edient+0.016
Neg logit contributions
ted-0.011
 Everyone-0.010
 Many-0.010
 People-0.010
 All-0.009
Token.
Token ID13
Feature activation+7.31e-01

Pos logit contributions
ppers+0.189
kin+0.184
ccoli+0.181
 afloat+0.178
edient+0.177
Neg logit contributions
ted-0.142
 Everyone-0.121
 Many-0.118
 People-0.116
 All-0.109
Token Bro
Token ID2806
Feature activation+5.87e+00

Pos logit contributions
ccoli+0.052
kin+0.050
ppers+0.050
 rinse+0.049
 achieve+0.048
Neg logit contributions
ted-0.038
 Many-0.030
 Everyone-0.029
ly-0.029
 People-0.029
Tokenccoli
Token ID34544
Feature activation+1.47e+01

Pos logit contributions
 afloat+0.169
kin+0.166
 achieve+0.156
ppers+0.154
cher+0.145
Neg logit contributions
ted-0.550
 Everyone-0.503
 People-0.495
 Many-0.487
ly-0.473
Token was
Token ID373
Feature activation-7.46e-01

Pos logit contributions
ppers+1.038
kin+1.026
ccoli+0.981
 afloat+0.970
 rinse+0.956
Neg logit contributions
ted-0.805
 Everyone-0.728
 Many-0.712
 People-0.704
 All-0.638
Token green
Token ID4077
Feature activation-6.29e-01

Pos logit contributions
ted+0.049
 Many+0.046
 Everyone+0.046
 People+0.045
 All+0.041
Neg logit contributions
ppers-0.043
kin-0.042
illas-0.040
ccoli-0.039
 afloat-0.039
Token and
Token ID290
Feature activation-6.40e-02

Pos logit contributions
ted+0.035
 Everyone+0.032
 Many+0.032
 People+0.031
 All+0.028
Neg logit contributions
kin-0.043
ppers-0.043
ccoli-0.041
 afloat-0.040
 rinse-0.040
Token bitter
Token ID12922
Feature activation+2.58e+00

Pos logit contributions
ted+0.004
 Everyone+0.003
 Many+0.003
 People+0.003
 All+0.003
Neg logit contributions
ppers-0.004
illas-0.004
 achieve-0.004
 afloat-0.004
kin-0.004
Token and
Token ID290
Feature activation-9.28e-03

Pos logit contributions
kin+0.143
ppers+0.136
ccoli+0.129
kins+0.126
 rinse+0.125
Neg logit contributions
ted-0.178
 Many-0.175
 Everyone-0.170
 People-0.169
 All-0.149
TokenMom
Token ID29252
Feature activation+2.44e+00

Pos logit contributions
 Everyone+0.021
ted+0.021
 Many+0.021
 People+0.020
 All+0.019
Neg logit contributions
ccoli-0.016
cher-0.016
ppers-0.016
 afloat-0.015
kin-0.015
Token says
Token ID1139
Feature activation-3.70e+00

Pos logit contributions
kin+0.174
ppers+0.162
 afloat+0.161
edient+0.156
kins+0.152
Neg logit contributions
ted-0.148
 Everyone-0.131
 Many-0.130
 People-0.126
 All-0.121
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
ted+0.197
 Many+0.181
 Everyone+0.181
 People+0.175
 All+0.157
Neg logit contributions
kin-0.265
ppers-0.265
ccoli-0.252
 afloat-0.251
edient-0.246
Token "
Token ID366
Feature activation+4.10e-01

Pos logit contributions
kin+0.010
ppers+0.010
 afloat+0.010
 promise+0.009
 achieve+0.009
Neg logit contributions
ted-0.005
 Everyone-0.005
 Many-0.005
 People-0.005
 All-0.004
TokenBro
Token ID15783
Feature activation+4.13e+00

Pos logit contributions
kin+0.025
ppers+0.024
ccoli+0.023
 rinse+0.023
 afloat+0.022
Neg logit contributions
ted-0.027
 Everyone-0.025
 Many-0.025
 People-0.024
 All-0.022
Tokenccoli
Token ID34544
Feature activation+1.46e+01

Pos logit contributions
kin+0.058
 afloat+0.057
ppers+0.055
 achieve+0.049
 rinse+0.044
Neg logit contributions
ted-0.442
 Everyone-0.425
 People-0.423
 Many-0.420
 All-0.399
Token is
Token ID318
Feature activation+8.34e-01

Pos logit contributions
ppers+0.925
 afloat+0.884
kin+0.875
cher+0.854
 achieve+0.828
Neg logit contributions
ted-0.901
 Everyone-0.832
 People-0.786
 All-0.764
 Many-0.753
Token good
Token ID922
Feature activation+6.19e+00

Pos logit contributions
ppers+0.052
illas+0.051
kin+0.049
cher+0.049
ucer+0.048
Neg logit contributions
ted-0.047
 Everyone-0.045
 Many-0.045
 People-0.044
 Every-0.042
Token for
Token ID329
Feature activation-2.60e+00

Pos logit contributions
kin+0.387
ppers+0.379
ccoli+0.362
 promise+0.355
 rinse+0.351
Neg logit contributions
ted-0.387
 Many-0.369
 Everyone-0.359
 People-0.355
 All-0.312
Token you
Token ID345
Feature activation-3.83e+00

Pos logit contributions
ted+0.131
 Everyone+0.120
 Many+0.119
 People+0.116
 All+0.102
Neg logit contributions
kin-0.194
ppers-0.192
ccoli-0.184
 afloat-0.180
 rinse-0.179
Token.
Token ID13
Feature activation+2.34e-02

Pos logit contributions
ted+0.180
 Everyone+0.174
 Many+0.172
 People+0.163
 All+0.154
Neg logit contributions
kin-0.284
ppers-0.276
 afloat-0.270
ccoli-0.267
 rinse-0.264
Token try
Token ID1949
Feature activation+2.86e+00

Pos logit contributions
 Everyone+0.023
 Many+0.023
 People+0.021
 Every+0.021
ted+0.021
Neg logit contributions
cher-0.027
ppers-0.027
ccoli-0.026
ucer-0.026
illas-0.026
Token new
Token ID649
Feature activation-2.32e+00

Pos logit contributions
kin+0.228
ppers+0.218
cher+0.212
 afloat+0.208
edient+0.207
Neg logit contributions
ted-0.132
 Everyone-0.124
 Many-0.123
 People-0.118
 All-0.110
Token things
Token ID1243
Feature activation-6.05e+00

Pos logit contributions
ted+0.105
 Many+0.095
 Everyone+0.094
 People+0.091
 All+0.078
Neg logit contributions
kin-0.184
ppers-0.182
ccoli-0.176
 promise-0.172
 afloat-0.171
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
ted+0.258
 Everyone+0.233
 People+0.220
 Many+0.209
 All+0.203
Neg logit contributions
ppers-0.470
kin-0.469
 afloat-0.468
edient-0.458
ccoli-0.453
Token Bro
Token ID2806
Feature activation+5.58e+00

Pos logit contributions
kin+0.019
ppers+0.019
ccoli+0.019
 rinse+0.018
 promise+0.018
Neg logit contributions
ted-0.015
 Everyone-0.014
 Many-0.014
 People-0.013
 All-0.012
Tokenccoli
Token ID34544
Feature activation+1.45e+01

Pos logit contributions
 afloat+0.103
 achieve+0.098
kin+0.088
 rinse+0.080
ppers+0.077
Neg logit contributions
ted-0.578
 Everyone-0.539
 People-0.536
 Many-0.520
 All-0.510
Token is
Token ID318
Feature activation+8.76e-01

Pos logit contributions
ppers+0.900
 afloat+0.859
kin+0.849
cher+0.806
 rinse+0.804
Neg logit contributions
ted-0.923
 Everyone-0.849
 People-0.822
 Many-0.789
 All-0.770
Token not
Token ID407
Feature activation+4.65e+00

Pos logit contributions
ppers+0.053
kin+0.052
ucer+0.051
illas+0.051
cher+0.050
Neg logit contributions
 Everyone-0.050
 Many-0.050
ted-0.050
 People-0.049
 Every-0.048
Token that
Token ID326
Feature activation-1.48e+00

Pos logit contributions
kin+0.289
ppers+0.287
 afloat+0.272
edient+0.271
 achieve+0.269
Neg logit contributions
ted-0.272
 Everyone-0.264
 Many-0.263
 People-0.257
 All-0.235
Token bad
Token ID2089
Feature activation+4.84e+00

Pos logit contributions
ted+0.091
 Everyone+0.085
 Many+0.084
 People+0.083
 All+0.076
Neg logit contributions
ppers-0.094
kin-0.093
 afloat-0.087
ccoli-0.086
edient-0.084
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
kin+0.281
ppers+0.265
ccoli+0.261
 promise+0.253
 rinse+0.250
Neg logit contributions
ted-0.323
 Many-0.304
 Everyone-0.295
 People-0.291
 All-0.261
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
ppers+0.305
kin+0.297
 afloat+0.293
 achieve+0.291
ccoli+0.291
Neg logit contributions
ted-0.228
 Everyone-0.195
 People-0.184
 Many-0.179
 All-0.172
Token
Token ID198
Feature activation-3.99e-01

Pos logit contributions
kin+0.013
ppers+0.013
ccoli+0.013
 afloat+0.012
 rinse+0.012
Neg logit contributions
ted-0.011
 Many-0.010
 Everyone-0.010
 People-0.010
 All-0.009
Token
Token ID198
Feature activation-4.04e-01

Pos logit contributions
ted+0.021
 Everyone+0.021
 Many+0.020
 People+0.020
 All+0.018
Neg logit contributions
kin-0.029
ppers-0.028
ccoli-0.027
 afloat-0.026
 rinse-0.026
Token"
Token ID1
Feature activation+2.04e+00

Pos logit contributions
 Everyone+0.023
 Many+0.023
ted+0.022
 People+0.021
 All+0.020
Neg logit contributions
ccoli-0.026
 bleed-0.026
 afloat-0.025
kin-0.025
ppers-0.024
TokenBro
Token ID15783
Feature activation+4.12e+00

Pos logit contributions
ppers+0.125
kin+0.120
 afloat+0.118
ccoli+0.116
 achieve+0.112
Neg logit contributions
ted-0.135
 Many-0.126
 Everyone-0.126
 People-0.122
 All-0.113
Tokenccoli
Token ID34544
Feature activation+1.45e+01

Pos logit contributions
kin+0.090
 afloat+0.088
 achieve+0.081
 rinse+0.079
ppers+0.074
Neg logit contributions
ted-0.405
 People-0.387
 Everyone-0.386
 Many-0.385
 All-0.361
Token is
Token ID318
Feature activation+7.60e-01

Pos logit contributions
ppers+0.904
kin+0.864
 afloat+0.851
 rinse+0.833
cher+0.829
Neg logit contributions
ted-0.860
 Everyone-0.797
 People-0.761
 Many-0.733
 All-0.718
Token not
Token ID407
Feature activation+4.48e+00

Pos logit contributions
ppers+0.045
illas+0.044
kin+0.044
cher+0.043
ucer+0.043
Neg logit contributions
 Everyone-0.045
 Many-0.044
ted-0.044
 People-0.044
 Every-0.042
Token our
Token ID674
Feature activation+6.85e+00

Pos logit contributions
kin+0.307
ppers+0.305
illas+0.289
edient+0.288
 achieve+0.287
Neg logit contributions
ted-0.231
 Everyone-0.226
 Many-0.225
 People-0.218
 All-0.198
Token friend
Token ID1545
Feature activation-1.62e+00

Pos logit contributions
kin+0.443
ppers+0.439
ccoli+0.415
 afloat+0.406
 rinse+0.403
Neg logit contributions
ted-0.411
 Everyone-0.383
 Many-0.382
 People-0.373
 All-0.335
Token,
Token ID11
Feature activation+5.02e-02

Pos logit contributions
ted+0.097
 Many+0.090
 Everyone+0.089
 People+0.087
 All+0.079
Neg logit contributions
kin-0.106
ppers-0.104
ccoli-0.098
 afloat-0.097
 rinse-0.096
Token.
Token ID13
Feature activation-1.99e+00

Pos logit contributions
ted+0.010
 Everyone+0.009
 Many+0.009
 People+0.009
 All+0.008
Neg logit contributions
kin-0.011
ppers-0.011
ccoli-0.010
 afloat-0.010
 rinse-0.010
Token
Token ID198
Feature activation-2.04e+00

Pos logit contributions
ted+0.110
 Many+0.094
 Everyone+0.091
 People+0.091
ly+0.081
Neg logit contributions
kin-0.135
ccoli-0.132
ppers-0.131
 rinse-0.125
 achieve-0.124
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
ted+0.130
 Everyone+0.119
 Many+0.119
 People+0.117
 All+0.105
Neg logit contributions
kin-0.125
ppers-0.123
ccoli-0.117
 afloat-0.113
 rinse-0.113
Token"
Token ID1
Feature activation+1.11e+00

Pos logit contributions
 Many+0.112
 Everyone+0.111
ted+0.109
 People+0.107
 All+0.099
Neg logit contributions
kin-0.085
ppers-0.085
ccoli-0.081
 afloat-0.078
 promise-0.076
TokenBro
Token ID15783
Feature activation+2.96e+00

Pos logit contributions
ppers+0.071
kin+0.069
 afloat+0.069
ccoli+0.066
 achieve+0.066
Neg logit contributions
ted-0.069
 Many-0.065
 Everyone-0.064
 People-0.062
 All-0.058
Tokenccoli
Token ID34544
Feature activation+1.38e+01

Pos logit contributions
kin+0.136
 afloat+0.136
ppers+0.132
 achieve+0.130
 rinse+0.130
Neg logit contributions
ted-0.215
 Everyone-0.204
 Many-0.203
 People-0.202
 All-0.186
Token is
Token ID318
Feature activation+2.96e-01

Pos logit contributions
ppers+0.861
kin+0.829
 afloat+0.802
 rinse+0.787
edient+0.779
Neg logit contributions
ted-0.833
 Everyone-0.782
 People-0.748
 Many-0.728
 All-0.702
Token y
Token ID331
Feature activation-1.27e+00

Pos logit contributions
ppers+0.019
kin+0.019
illas+0.019
edient+0.018
ucer+0.018
Neg logit contributions
 Everyone-0.016
ted-0.016
 Many-0.016
 People-0.015
 Every-0.014
Tokenucky
Token ID5309
Feature activation-2.92e+00

Pos logit contributions
ted+0.036
 Many+0.030
 Everyone+0.030
 People+0.029
 All+0.021
Neg logit contributions
kin-0.125
ppers-0.124
ccoli-0.118
 afloat-0.117
 rinse-0.116
Token!"
Token ID2474
Feature activation-1.41e+00

Pos logit contributions
ted+0.166
 Everyone+0.155
 Many+0.152
 People+0.148
 All+0.135
Neg logit contributions
kin-0.190
ppers-0.189
ccoli-0.180
 rinse-0.178
 achieve-0.177
Token Ben
Token ID3932
Feature activation+2.24e+00

Pos logit contributions
ted+0.091
 Many+0.082
 People+0.081
 Everyone+0.080
heart+0.072
Neg logit contributions
kin-0.081
ppers-0.077
ccoli-0.076
 rinse-0.075
kins-0.073
Token the
Token ID262
Feature activation+9.23e-01

Pos logit contributions
kin+0.046
 rinse+0.044
ppers+0.044
kins+0.042
ccoli+0.042
Neg logit contributions
ted-0.039
 Everyone-0.038
 People-0.037
 Many-0.037
 All-0.031
Token necklace
Token ID38987
Feature activation+3.85e+00

Pos logit contributions
ppers+0.046
kin+0.045
ccoli+0.044
edient+0.042
 achieve+0.042
Neg logit contributions
ted-0.069
 Many-0.064
 Everyone-0.064
 People-0.063
 All-0.057
Token and
Token ID290
Feature activation+4.83e-02

Pos logit contributions
kin+0.287
ppers+0.286
ccoli+0.284
 achieve+0.279
 rinse+0.278
Neg logit contributions
ted-0.189
 Everyone-0.160
 Many-0.156
 People-0.153
 All-0.141
Token the
Token ID262
Feature activation+9.28e-01

Pos logit contributions
kin+0.003
ppers+0.003
 rinse+0.003
ccoli+0.003
kins+0.002
Neg logit contributions
ted-0.003
 Many-0.003
 Everyone-0.003
 People-0.003
 All-0.003
Token magn
Token ID7842
Feature activation+7.69e+00

Pos logit contributions
ppers+0.056
kin+0.054
ccoli+0.053
edient+0.052
 afloat+0.051
Neg logit contributions
ted-0.059
 Many-0.054
 Everyone-0.054
 People-0.053
 All-0.047
Tokenifying
Token ID4035
Feature activation+1.38e+01

Pos logit contributions
kin+0.305
ppers+0.303
ccoli+0.261
 afloat+0.256
 rinse+0.249
Neg logit contributions
ted-0.644
 Many-0.636
 Everyone-0.633
 People-0.626
 All-0.575
Token glass
Token ID5405
Feature activation+2.26e+00

Pos logit contributions
kin+0.699
ppers+0.669
ccoli+0.634
 rinse+0.618
 afloat+0.617
Neg logit contributions
ted-1.020
 Many-0.954
 Everyone-0.953
 People-0.940
 All-0.856
Token and
Token ID290
Feature activation+7.46e-02

Pos logit contributions
ppers+0.146
kin+0.143
ccoli+0.140
 achieve+0.137
 afloat+0.136
Neg logit contributions
ted-0.138
 Everyone-0.124
 Many-0.122
 People-0.120
 All-0.109
Token started
Token ID2067
Feature activation-3.39e+00

Pos logit contributions
kin+0.004
ppers+0.004
 rinse+0.004
ccoli+0.004
kins+0.004
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.005
 People-0.005
 All-0.004
Token to
Token ID284
Feature activation-1.33e-03

Pos logit contributions
ted+0.165
 Everyone+0.154
 Many+0.153
 People+0.149
 All+0.129
Neg logit contributions
kin-0.256
ppers-0.255
ccoli-0.244
 rinse-0.240
 afloat-0.238
Token examine
Token ID10716
Feature activation+1.09e+00

Pos logit contributions
ted+0.000
 Many+0.000
 Everyone+0.000
 People+0.000
 All+0.000
Neg logit contributions
ppers-0.000
kin-0.000
edient-0.000
ccoli-0.000
illas-0.000
Token He
Token ID679
Feature activation+2.33e+00

Pos logit contributions
ted+0.004
 Many+0.003
 Everyone+0.003
 People+0.003
heart+0.003
Neg logit contributions
kin-0.004
ccoli-0.004
ppers-0.004
 achieve-0.004
illas-0.004
Token took
Token ID1718
Feature activation+7.67e-01

Pos logit contributions
kin+0.135
ppers+0.129
ccoli+0.124
 promise+0.121
kins+0.121
Neg logit contributions
ted-0.156
 Many-0.148
 Everyone-0.147
 People-0.145
heart-0.129
Token out
Token ID503
Feature activation+5.97e-01

Pos logit contributions
kin+0.044
ppers+0.041
 rinse+0.040
 promise+0.039
ccoli+0.039
Neg logit contributions
ted-0.050
 Many-0.046
 Everyone-0.044
 People-0.044
pill-0.042
Token a
Token ID257
Feature activation-1.79e+00

Pos logit contributions
kin+0.037
ppers+0.036
ccoli+0.035
kins+0.034
 rinse+0.034
Neg logit contributions
ted-0.036
 Everyone-0.033
 Many-0.033
 People-0.032
 All-0.028
Token magn
Token ID7842
Feature activation+7.62e+00

Pos logit contributions
 Many+0.104
ted+0.098
 Everyone+0.097
 People+0.096
 All+0.089
Neg logit contributions
ppers-0.118
edient-0.111
illas-0.107
wy-0.107
kin-0.103
Tokenifying
Token ID4035
Feature activation+1.38e+01

Pos logit contributions
kin+0.200
ppers+0.195
ccoli+0.168
 afloat+0.157
 rinse+0.153
Neg logit contributions
ted-0.744
 Many-0.724
 Everyone-0.722
 People-0.713
 All-0.665
Token glass
Token ID5405
Feature activation+2.19e+00

Pos logit contributions
kin+0.733
ppers+0.710
ccoli+0.675
 afloat+0.652
 rinse+0.652
Neg logit contributions
ted-0.987
 Many-0.922
 Everyone-0.921
 People-0.905
 All-0.825
Token and
Token ID290
Feature activation+1.76e-01

Pos logit contributions
ppers+0.157
kin+0.153
ccoli+0.152
 achieve+0.148
 afloat+0.147
Neg logit contributions
ted-0.118
 Everyone-0.103
 Many-0.101
 People-0.100
 All-0.092
Token a
Token ID257
Feature activation-1.82e+00

Pos logit contributions
kin+0.010
ppers+0.010
ccoli+0.010
 rinse+0.009
kins+0.009
Neg logit contributions
ted-0.011
 Many-0.011
 Everyone-0.011
 People-0.011
 All-0.009
Token notebook
Token ID20922
Feature activation+4.35e+00

Pos logit contributions
 Many+0.101
 Everyone+0.100
ted+0.099
 People+0.096
 All+0.090
Neg logit contributions
ppers-0.119
wy-0.111
edient-0.111
illas-0.108
 afloat-0.104
Token and
Token ID290
Feature activation+1.59e-01

Pos logit contributions
kin+0.282
ppers+0.280
ccoli+0.268
 afloat+0.261
 rinse+0.259
Neg logit contributions
ted-0.259
 Everyone-0.239
 Many-0.237
 People-0.232
 All-0.210
Token,
Token ID11
Feature activation+4.30e-01

Pos logit contributions
ted+0.103
 Everyone+0.098
 Many+0.089
 People+0.089
 All+0.085
Neg logit contributions
ppers-0.179
 afloat-0.178
kin-0.176
wy-0.175
illas-0.174
Token handing
Token ID22786
Feature activation+2.70e-01

Pos logit contributions
kin+0.037
ppers+0.036
wy+0.036
 afloat+0.035
 promise+0.035
Neg logit contributions
ted-0.018
 Everyone-0.016
 Many-0.015
 People-0.014
 All-0.014
Token him
Token ID683
Feature activation-5.68e+00

Pos logit contributions
kin+0.019
ppers+0.019
ccoli+0.019
 rinse+0.018
 promise+0.018
Neg logit contributions
 Many-0.013
 People-0.012
 Everyone-0.012
ted-0.012
pill-0.010
Token the
Token ID262
Feature activation+7.20e-01

Pos logit contributions
ted+0.277
 Everyone+0.245
 Many+0.241
 People+0.233
 All+0.208
Neg logit contributions
kin-0.445
ppers-0.439
 achieve-0.416
ccoli-0.416
 afloat-0.415
Token magn
Token ID7842
Feature activation+7.55e+00

Pos logit contributions
ppers+0.041
kin+0.039
edient+0.037
 afloat+0.036
ccoli+0.036
Neg logit contributions
ted-0.049
 Many-0.046
 Everyone-0.046
 People-0.045
 All-0.041
Tokenifying
Token ID4035
Feature activation+1.37e+01

Pos logit contributions
kin+0.222
ppers+0.216
ccoli+0.196
 promise+0.184
 rinse+0.184
Neg logit contributions
ted-0.729
 Everyone-0.685
 Many-0.682
 People-0.672
 All-0.635
Token glass
Token ID5405
Feature activation+2.15e+00

Pos logit contributions
kin+0.717
ppers+0.710
ccoli+0.665
 afloat+0.642
 rinse+0.637
Neg logit contributions
ted-0.992
 Everyone-0.932
 Many-0.932
 People-0.912
 All-0.839
Token.
Token ID13
Feature activation+6.81e-01

Pos logit contributions
ppers+0.151
 achieve+0.144
ccoli+0.144
kin+0.143
 afloat+0.141
Neg logit contributions
ted-0.120
 Everyone-0.106
 Many-0.104
 People-0.102
 All-0.093
Token She
Token ID1375
Feature activation-9.53e-01

Pos logit contributions
kin+0.044
ppers+0.044
ccoli+0.043
 rinse+0.042
 achieve+0.041
Neg logit contributions
ted-0.039
 Many-0.035
 Everyone-0.035
 People-0.034
heart-0.031
Token looked
Token ID3114
Feature activation+3.47e-01

Pos logit contributions
ted+0.060
 Many+0.058
 Everyone+0.058
 People+0.057
 All+0.050
Neg logit contributions
kin-0.059
ppers-0.057
ccoli-0.054
 promise-0.053
 rinse-0.053
Token around
Token ID1088
Feature activation-6.44e-01

Pos logit contributions
kin+0.031
ppers+0.031
ccoli+0.029
 achieve+0.029
edient+0.029
Neg logit contributions
ted-0.014
 Everyone-0.011
 Many-0.011
 People-0.011
 All-0.009
Token,
Token ID11
Feature activation+7.25e-01

Pos logit contributions
ppers+0.208
kin+0.206
ccoli+0.202
 afloat+0.199
 rinse+0.196
Neg logit contributions
ted-0.155
 Everyone-0.142
 Many-0.141
 People-0.138
 All-0.125
Token next
Token ID1306
Feature activation-1.86e-01

Pos logit contributions
kin+0.053
ppers+0.051
edient+0.050
 afloat+0.049
ccoli+0.048
Neg logit contributions
ted-0.040
 Many-0.034
 Everyone-0.033
 People-0.032
 All-0.031
Token to
Token ID284
Feature activation-8.65e-02

Pos logit contributions
ted+0.010
 Many+0.009
 Everyone+0.009
 People+0.009
 All+0.008
Neg logit contributions
kin-0.013
ppers-0.013
 promise-0.013
ccoli-0.012
 rinse-0.012
Token her
Token ID607
Feature activation-1.44e+00

Pos logit contributions
ted+0.005
 Many+0.004
 Everyone+0.004
 People+0.004
unn+0.004
Neg logit contributions
kin-0.006
 rinse-0.005
ppers-0.005
ccoli-0.005
 achieve-0.005
Token magn
Token ID7842
Feature activation+7.56e+00

Pos logit contributions
ted+0.085
 Everyone+0.080
 Many+0.079
 People+0.077
 All+0.070
Neg logit contributions
ppers-0.094
kin-0.092
ccoli-0.088
 afloat-0.087
edient-0.085
Tokenifying
Token ID4035
Feature activation+1.37e+01

Pos logit contributions
kin+0.222
ppers+0.214
ccoli+0.195
 afloat+0.181
 promise+0.180
Neg logit contributions
ted-0.729
 Everyone-0.687
 Many-0.686
 People-0.676
 All-0.637
Token glass
Token ID5405
Feature activation+2.22e+00

Pos logit contributions
kin+0.687
ppers+0.651
ccoli+0.627
 rinse+0.607
 promise+0.601
Neg logit contributions
ted-1.021
 Many-0.953
 Everyone-0.951
 People-0.938
 All-0.854
Token.
Token ID13
Feature activation+9.54e-01

Pos logit contributions
ppers+0.127
kin+0.123
ccoli+0.120
 afloat+0.118
 achieve+0.117
Neg logit contributions
ted-0.152
 Everyone-0.138
 Many-0.136
 People-0.135
 All-0.124
Token She
Token ID1375
Feature activation-6.46e-01

Pos logit contributions
ccoli+0.070
kin+0.070
 rinse+0.068
ppers+0.066
 achieve+0.065
Neg logit contributions
ted-0.047
 Many-0.038
 Everyone-0.037
 People-0.037
heart-0.036
Token looked
Token ID3114
Feature activation+4.94e-01

Pos logit contributions
ted+0.045
 Everyone+0.043
 Many+0.043
 People+0.042
 All+0.038
Neg logit contributions
kin-0.035
ppers-0.034
ccoli-0.033
 promise-0.033
 rinse-0.032
Token at
Token ID379
Feature activation-4.00e+00

Pos logit contributions
kin+0.036
ppers+0.036
ccoli+0.034
 rinse+0.034
 afloat+0.034
Neg logit contributions
ted-0.025
 Many-0.023
 Everyone-0.023
 People-0.023
 All-0.020
Token to
Token ID284
Feature activation-4.24e+00

Pos logit contributions
ted+0.299
 Everyone+0.278
 Many+0.277
 People+0.270
 All+0.245
Neg logit contributions
kin-0.324
ppers-0.316
ccoli-0.303
 rinse-0.293
 afloat-0.289
Token the
Token ID262
Feature activation-2.19e+00

Pos logit contributions
ted+0.284
 Everyone+0.268
 Many+0.267
 People+0.261
 All+0.239
Neg logit contributions
kin-0.244
ppers-0.242
ccoli-0.228
 afloat-0.220
 rinse-0.218
Token man
Token ID582
Feature activation-1.47e+00

Pos logit contributions
ted+0.127
 Everyone+0.121
 Many+0.119
 People+0.117
 All+0.105
Neg logit contributions
ppers-0.143
kin-0.141
ccoli-0.133
 afloat-0.130
 rinse-0.129
Token "
Token ID366
Feature activation-9.69e-01

Pos logit contributions
ted+0.078
 Everyone+0.074
 Many+0.072
 People+0.070
 All+0.065
Neg logit contributions
kin-0.107
ppers-0.100
ccoli-0.097
 afloat-0.094
 rinse-0.094
TokenBro
Token ID15783
Feature activation+3.13e+00

Pos logit contributions
ted+0.062
 Everyone+0.056
 Many+0.056
 People+0.054
 All+0.050
Neg logit contributions
kin-0.062
ppers-0.060
ccoli-0.058
 afloat-0.056
 promise-0.056
Tokenccoli
Token ID34544
Feature activation+1.37e+01

Pos logit contributions
kin+0.170
ppers+0.159
 rinse+0.156
 afloat+0.155
 promise+0.147
Neg logit contributions
ted-0.219
 Everyone-0.209
 People-0.206
 Many-0.205
 All-0.187
Token is
Token ID318
Feature activation+1.06e-01

Pos logit contributions
ppers+0.839
kin+0.823
 afloat+0.782
 rinse+0.777
ccoli+0.752
Neg logit contributions
ted-0.871
 Everyone-0.820
 People-0.792
 Many-0.767
 All-0.734
Token y
Token ID331
Feature activation-1.47e+00

Pos logit contributions
ppers+0.007
kin+0.007
illas+0.007
ucer+0.006
 achieve+0.006
Neg logit contributions
ted-0.006
 Everyone-0.006
 Many-0.006
 People-0.005
 Every-0.005
Tokenummy
Token ID13513
Feature activation-7.76e+00

Pos logit contributions
ted+0.039
 Many+0.033
 Everyone+0.031
 People+0.030
 All+0.021
Neg logit contributions
kin-0.145
ppers-0.144
ccoli-0.139
 afloat-0.136
 rinse-0.136
Token!
Token ID0
Feature activation-1.58e+00

Pos logit contributions
ted+0.468
 Everyone+0.438
 Many+0.433
 People+0.426
 All+0.384
Neg logit contributions
kin-0.496
ppers-0.490
ccoli-0.461
 afloat-0.455
 rinse-0.453
Token We
Token ID775
Feature activation-1.22e-01

Pos logit contributions
ted+0.081
 Many+0.072
 Everyone+0.071
 People+0.069
 All+0.060
Neg logit contributions
kin-0.115
ppers-0.114
ccoli-0.111
 rinse-0.107
 afloat-0.106
Token the
Token ID262
Feature activation+9.06e-01

Pos logit contributions
ted+0.239
 Many+0.230
 People+0.225
 Everyone+0.224
pill+0.195
Neg logit contributions
kin-0.252
ppers-0.236
 rinse-0.230
 afloat-0.230
ccoli-0.224
Token watch
Token ID2342
Feature activation+1.30e+00

Pos logit contributions
ppers+0.056
kin+0.056
ccoli+0.053
 afloat+0.052
edient+0.051
Neg logit contributions
ted-0.057
 Everyone-0.053
 Many-0.053
 People-0.052
 All-0.047
Token with
Token ID351
Feature activation+6.04e-01

Pos logit contributions
ppers+0.087
kin+0.086
ccoli+0.083
 afloat+0.081
 rinse+0.080
Neg logit contributions
ted-0.075
 Everyone-0.069
 Many-0.068
 People-0.067
 All-0.060
Token the
Token ID262
Feature activation+9.25e-01

Pos logit contributions
kin+0.043
ppers+0.042
ccoli+0.040
 afloat+0.039
 rinse+0.039
Neg logit contributions
ted-0.033
 Many-0.030
 Everyone-0.030
 People-0.030
 All-0.026
Token magn
Token ID7842
Feature activation+7.53e+00

Pos logit contributions
kin+0.056
ppers+0.055
ccoli+0.052
 afloat+0.051
 rinse+0.050
Neg logit contributions
ted-0.060
 Everyone-0.056
 Many-0.056
 People-0.055
 All-0.049
Tokenifying
Token ID4035
Feature activation+1.37e+01

Pos logit contributions
kin+0.244
ppers+0.238
ccoli+0.220
 promise+0.211
 achieve+0.207
Neg logit contributions
ted-0.706
 Everyone-0.659
 Many-0.655
 People-0.645
 All-0.612
Token glass
Token ID5405
Feature activation+2.15e+00

Pos logit contributions
kin+0.701
ppers+0.656
ccoli+0.635
 rinse+0.621
 promise+0.614
Neg logit contributions
ted-1.002
 Many-0.932
 Everyone-0.930
 People-0.920
 All-0.833
Token.
Token ID13
Feature activation+7.84e-01

Pos logit contributions
ppers+0.143
ccoli+0.138
 achieve+0.137
kin+0.134
 afloat+0.134
Neg logit contributions
ted-0.127
 Everyone-0.111
 People-0.107
 Many-0.107
 All-0.100
Token He
Token ID679
Feature activation+2.08e+00

Pos logit contributions
kin+0.051
ccoli+0.049
ppers+0.048
 rinse+0.047
 achieve+0.046
Neg logit contributions
ted-0.046
 Many-0.039
 Everyone-0.038
 People-0.037
heart-0.037
Token saw
Token ID2497
Feature activation-1.97e+00

Pos logit contributions
kin+0.115
ppers+0.113
ccoli+0.107
 promise+0.105
kins+0.104
Neg logit contributions
ted-0.140
 Everyone-0.138
 Many-0.136
 People-0.135
 Everybody-0.117
Token the
Token ID262
Feature activation+9.50e-01

Pos logit contributions
ted+0.121
 People+0.115
 Everyone+0.113
 Many+0.113
heart+0.109
Neg logit contributions
kin-0.107
 afloat-0.098
ppers-0.098
kins-0.096
 globe-0.096
Token it
Token ID340
Feature activation-3.18e+00

Pos logit contributions
ted+0.094
 Many+0.085
 Everyone+0.085
 People+0.083
 All+0.071
Neg logit contributions
kin-0.128
ppers-0.121
ccoli-0.116
 rinse-0.116
 afloat-0.116
Token she
Token ID673
Feature activation-2.09e+00

Pos logit contributions
ted+0.190
 Many+0.176
 Everyone+0.175
 People+0.171
 All+0.152
Neg logit contributions
kin-0.209
ppers-0.205
ccoli-0.195
 afloat-0.191
 promise-0.191
Token had
Token ID550
Feature activation+6.94e+00

Pos logit contributions
ted+0.134
 Many+0.125
 Everyone+0.124
 People+0.120
 All+0.107
Neg logit contributions
kin-0.129
ppers-0.122
 promise-0.119
 achieve-0.117
ccoli-0.116
Token a
Token ID257
Feature activation-1.79e+00

Pos logit contributions
kin+0.354
ppers+0.353
ak+0.345
 achieve+0.343
eps+0.341
Neg logit contributions
ted-0.467
 Everyone-0.416
 Many-0.414
 People-0.406
heart-0.399
Token magn
Token ID7842
Feature activation+7.63e+00

Pos logit contributions
 Many+0.081
 Everyone+0.080
ted+0.078
 People+0.076
 All+0.071
Neg logit contributions
ppers-0.136
wy-0.128
ccoli-0.126
edient-0.126
kin-0.125
Tokenifying
Token ID4035
Feature activation+1.37e+01

Pos logit contributions
kin+0.207
ppers+0.205
ccoli+0.168
 afloat+0.166
cher+0.154
Neg logit contributions
ted-0.731
 Many-0.718
 Everyone-0.715
 People-0.708
 All-0.653
Token glass
Token ID5405
Feature activation+2.21e+00

Pos logit contributions
kin+0.715
ppers+0.658
ccoli+0.641
 rinse+0.632
 afloat+0.629
Neg logit contributions
ted-1.000
 Everyone-0.920
 Many-0.917
 People-0.909
 All-0.815
Token,
Token ID11
Feature activation+7.15e-01

Pos logit contributions
ppers+0.170
kin+0.170
ccoli+0.164
 afloat+0.160
 rinse+0.159
Neg logit contributions
ted-0.105
 Everyone-0.094
 Many-0.093
 People-0.090
 All-0.080
Token a
Token ID257
Feature activation-1.84e+00

Pos logit contributions
kin+0.046
ppers+0.043
 rinse+0.041
kins+0.041
ccoli+0.040
Neg logit contributions
ted-0.043
 Everyone-0.041
 Many-0.041
 People-0.039
 Everybody-0.035
Token tiny
Token ID7009
Feature activation-2.50e-01

Pos logit contributions
 Many+0.086
 Everyone+0.084
ted+0.084
 People+0.080
 All+0.076
Neg logit contributions
ppers-0.134
wy-0.128
edient-0.126
illas-0.125
ccoli-0.120
Token torch
Token ID28034
Feature activation+3.95e+00

Pos logit contributions
ted+0.016
 Everyone+0.015
 Many+0.015
 People+0.015
 All+0.013
Neg logit contributions
kin-0.015
ppers-0.015
ccoli-0.014
 afloat-0.014
 promise-0.013
Token She
Token ID1375
Feature activation-1.19e+00

Pos logit contributions
kin+0.012
ppers+0.012
ccoli+0.012
 rinse+0.011
 afloat+0.011
Neg logit contributions
ted-0.010
 Many-0.009
 Everyone-0.009
 People-0.008
 All-0.007
Token takes
Token ID2753
Feature activation+1.46e+00

Pos logit contributions
ted+0.088
 Many+0.083
 Everyone+0.082
 People+0.081
heart+0.072
Neg logit contributions
kin-0.058
ppers-0.057
ccoli-0.057
 promise-0.055
 rinse-0.054
Token out
Token ID503
Feature activation+5.41e-01

Pos logit contributions
kin+0.085
ppers+0.083
 rinse+0.081
ccoli+0.078
 afloat+0.077
Neg logit contributions
ted-0.091
 Many-0.088
 Everyone-0.085
 People-0.085
 All-0.077
Token a
Token ID257
Feature activation-2.00e+00

Pos logit contributions
kin+0.036
ppers+0.035
 rinse+0.034
kins+0.033
ccoli+0.033
Neg logit contributions
ted-0.031
 Everyone-0.028
 Many-0.028
 People-0.027
 All-0.023
Token magn
Token ID7842
Feature activation+7.46e+00

Pos logit contributions
 Many+0.118
ted+0.109
 Everyone+0.109
 People+0.108
 All+0.101
Neg logit contributions
ppers-0.132
edient-0.124
illas-0.121
wy-0.121
kin-0.119
Tokenifying
Token ID4035
Feature activation+1.37e+01

Pos logit contributions
kin+0.234
ppers+0.228
ccoli+0.203
 afloat+0.192
 rinse+0.189
Neg logit contributions
ted-0.699
 Many-0.668
 Everyone-0.668
 People-0.658
 All-0.615
Token glass
Token ID5405
Feature activation+2.09e+00

Pos logit contributions
kin+0.719
ppers+0.707
ccoli+0.665
 afloat+0.642
 rinse+0.639
Neg logit contributions
ted-0.987
 Everyone-0.925
 Many-0.925
 People-0.906
 All-0.831
Token.
Token ID13
Feature activation+4.88e-01

Pos logit contributions
ppers+0.151
ccoli+0.148
 achieve+0.144
 afloat+0.142
illas+0.141
Neg logit contributions
ted-0.111
 Everyone-0.097
 Many-0.094
 People-0.092
 All-0.086
Token She
Token ID1375
Feature activation-1.02e+00

Pos logit contributions
kin+0.036
ppers+0.035
ccoli+0.035
 rinse+0.034
 achieve+0.033
Neg logit contributions
ted-0.025
 Many-0.021
 Everyone-0.021
 People-0.020
heart-0.018
Token looks
Token ID3073
Feature activation+5.31e-01

Pos logit contributions
ted+0.075
 Everyone+0.072
 Many+0.071
 People+0.070
heart+0.063
Neg logit contributions
ccoli-0.049
kin-0.049
ppers-0.048
 rinse-0.047
 promise-0.047
Token at
Token ID379
Feature activation-4.13e+00

Pos logit contributions
kin+0.039
ppers+0.039
ccoli+0.037
 afloat+0.036
 rinse+0.036
Neg logit contributions
ted-0.027
 Everyone-0.025
 Many-0.025
 People-0.024
 All-0.021
Token\n
Token ID198
Feature activation+2.07e-01

Pos logit contributions
kin+0.013
ppers+0.012
ccoli+0.012
 afloat+0.012
 rinse+0.012
Neg logit contributions
ted-0.012
 Many-0.011
 Everyone-0.011
 People-0.011
 All-0.010
TokenHe
Token ID1544
Feature activation+7.35e+00

Pos logit contributions
kin+0.013
ppers+0.012
ccoli+0.012
kins+0.012
 afloat+0.011
Neg logit contributions
 Many-0.014
 Everyone-0.014
 People-0.013
ted-0.013
 All-0.013
Token brought
Token ID3181
Feature activation-3.44e-01

Pos logit contributions
ppers+0.386
kin+0.372
 promise+0.367
eps+0.358
ak+0.356
Neg logit contributions
 Everyone-0.499
 Many-0.486
 People-0.486
ted-0.481
 Everybody-0.433
Token his
Token ID465
Feature activation+3.52e+00

Pos logit contributions
ted+0.021
 Many+0.021
pill+0.020
 People+0.020
 Everyone+0.020
Neg logit contributions
 afloat-0.019
kin-0.019
aks-0.018
 rinse-0.018
ppers-0.018
Token magn
Token ID7842
Feature activation+7.54e+00

Pos logit contributions
kin+0.205
ppers+0.194
 promise+0.193
ccoli+0.189
 afloat+0.185
Neg logit contributions
ted-0.241
 Many-0.225
 Everyone-0.225
 People-0.221
 All-0.198
Tokenifying
Token ID4035
Feature activation+1.36e+01

Pos logit contributions
kin+0.243
ppers+0.236
ccoli+0.215
 rinse+0.200
 afloat+0.199
Neg logit contributions
ted-0.701
 Everyone-0.663
 Many-0.663
 People-0.651
 All-0.613
Token glass
Token ID5405
Feature activation+2.17e+00

Pos logit contributions
kin+0.724
ppers+0.684
ccoli+0.662
 rinse+0.645
 promise+0.641
Neg logit contributions
ted-0.978
 Everyone-0.907
 Many-0.905
 People-0.893
 All-0.807
Token and
Token ID290
Feature activation+8.40e-03

Pos logit contributions
ppers+0.149
kin+0.147
ccoli+0.144
 achieve+0.141
 rinse+0.138
Neg logit contributions
ted-0.122
 Everyone-0.108
 Many-0.107
 People-0.104
 All-0.095
Token looked
Token ID3114
Feature activation+3.61e-01

Pos logit contributions
kin+0.000
ppers+0.000
kins+0.000
 afloat+0.000
 rinse+0.000
Neg logit contributions
 Many-0.001
 People-0.001
 Everyone-0.001
ted-0.001
 Everybody-0.000
Token closely
Token ID7173
Feature activation-2.84e-01

Pos logit contributions
kin+0.025
ppers+0.025
ccoli+0.024
 achieve+0.023
 rinse+0.023
Neg logit contributions
ted-0.021
 Everyone-0.019
 Many-0.018
 People-0.018
 All-0.016
Token.
Token ID13
Feature activation+9.77e-01

Pos logit contributions
 Many+0.019
ted+0.019
 Everyone+0.018
 People+0.018
 All+0.016
Neg logit contributions
ppers-0.016
kin-0.015
ccoli-0.015
 afloat-0.015
kins-0.015
Token."
Token ID526
Feature activation-1.37e+00

Pos logit contributions
ted+0.504
 Everyone+0.440
 Many+0.424
 People+0.424
 All+0.397
Neg logit contributions
kin-0.700
ppers-0.680
ccoli-0.659
 rinse-0.639
kins-0.628
Token\n
Token ID198
Feature activation-1.06e+00

Pos logit contributions
ted+0.081
 Many+0.073
 People+0.070
 Everyone+0.070
 All+0.062
Neg logit contributions
kin-0.087
ppers-0.084
ccoli-0.082
 rinse-0.080
 promise-0.080
Token\n
Token ID198
Feature activation-8.75e-01

Pos logit contributions
ted+0.058
 Everyone+0.056
 Many+0.056
 People+0.054
 All+0.048
Neg logit contributions
ppers-0.074
kin-0.074
ccoli-0.070
 afloat-0.069
 rinse-0.068
TokenThey
Token ID2990
Feature activation+7.20e+00

Pos logit contributions
 Everyone+0.063
 Many+0.063
 People+0.060
ted+0.059
 All+0.056
Neg logit contributions
ppers-0.048
kin-0.047
ccoli-0.045
 afloat-0.045
 Claus-0.042
Token t
Token ID256
Feature activation-1.51e+00

Pos logit contributions
ppers+0.485
kin+0.483
ccoli+0.458
 afloat+0.444
edient+0.443
Neg logit contributions
ted-0.425
 Everyone-0.396
 Many-0.389
 People-0.376
 All-0.351
Tokenipt
Token ID10257
Feature activation-1.49e+01

Pos logit contributions
 People+0.063
 Many+0.063
 Everyone+0.061
ted+0.060
 All+0.052
Neg logit contributions
ppers-0.125
illas-0.116
kin-0.116
eps-0.112
edient-0.112
Tokeno
Token ID78
Feature activation+3.15e+00

Pos logit contributions
ted+1.252
 Everyone+1.154
 Many+1.131
 People+1.116
 All+1.047
Neg logit contributions
kin-0.664
ppers-0.621
ccoli-0.595
 afloat-0.580
 rinse-0.577
Tokened
Token ID276
Feature activation-6.27e+00

Pos logit contributions
kin+0.272
ppers+0.272
ccoli+0.261
 afloat+0.256
 rinse+0.254
Neg logit contributions
ted-0.116
 Many-0.106
 Everyone-0.105
 People-0.101
 All-0.083
Token to
Token ID284
Feature activation-9.70e-01

Pos logit contributions
ted+0.371
 Many+0.344
 Everyone+0.344
 People+0.336
 All+0.301
Neg logit contributions
kin-0.411
ppers-0.406
ccoli-0.387
 afloat-0.377
 rinse-0.375
Token the
Token ID262
Feature activation+2.74e-01

Pos logit contributions
ted+0.054
 Everyone+0.049
 Many+0.049
 People+0.048
 All+0.042
Neg logit contributions
kin-0.067
ppers-0.066
ccoli-0.063
 rinse-0.062
 afloat-0.062
Token bush
Token ID24484
Feature activation+3.65e+00

Pos logit contributions
kin+0.019
ppers+0.019
ccoli+0.018
kins+0.017
 rinse+0.017
Neg logit contributions
ted-0.015
 Everyone-0.014
 Many-0.014
 People-0.014
 All-0.012
Token few
Token ID1178
Feature activation-2.65e+00

Pos logit contributions
 Many+0.312
ted+0.306
 Everyone+0.302
 People+0.297
 All+0.270
Neg logit contributions
ppers-0.421
kin-0.382
edient-0.379
ccoli-0.373
kins-0.370
Token minutes
Token ID2431
Feature activation-4.59e+00

Pos logit contributions
ted+0.144
 Everyone+0.132
 Many+0.130
 People+0.127
 All+0.113
Neg logit contributions
kin-0.189
 promise-0.176
ppers-0.175
kins-0.174
 rinse-0.170
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
ted+0.285
 Many+0.283
 Everyone+0.279
 People+0.274
 Everybody+0.234
Neg logit contributions
ppers-0.272
kin-0.271
 achieve-0.247
kins-0.247
illas-0.244
Token then
Token ID788
Feature activation-2.24e+00

Pos logit contributions
ted+0.055
 Everyone+0.051
 Many+0.051
 People+0.049
 All+0.044
Neg logit contributions
kin-0.078
ppers-0.078
ccoli-0.074
 afloat-0.072
edient-0.072
Token t
Token ID256
Feature activation-1.70e+00

Pos logit contributions
ted+0.112
 Everyone+0.102
 Many+0.101
 People+0.099
 All+0.087
Neg logit contributions
kin-0.168
ppers-0.167
ccoli-0.160
 afloat-0.156
 rinse-0.155
Tokenipt
Token ID10257
Feature activation-1.48e+01

Pos logit contributions
 Many+0.110
 People+0.109
ted+0.105
 Everyone+0.105
 All+0.094
Neg logit contributions
ppers-0.107
kin-0.101
ucer-0.091
edient-0.091
cher-0.090
Tokeno
Token ID78
Feature activation+3.24e+00

Pos logit contributions
ted+1.134
 Everyone+1.060
 Many+1.054
 People+1.036
 All+0.958
Neg logit contributions
kin-0.731
ppers-0.712
ccoli-0.674
 afloat-0.648
 rinse-0.643
Tokened
Token ID276
Feature activation-6.23e+00

Pos logit contributions
kin+0.293
ppers+0.288
ccoli+0.280
 rinse+0.274
 afloat+0.274
Neg logit contributions
ted-0.115
 Everyone-0.098
 Many-0.095
 People-0.092
 All-0.076
Token back
Token ID736
Feature activation+5.68e+00

Pos logit contributions
ted+0.408
 Many+0.383
 Everyone+0.382
 People+0.374
 All+0.339
Neg logit contributions
kin-0.367
ppers-0.362
ccoli-0.343
 afloat-0.334
 rinse-0.331
Token to
Token ID284
Feature activation-9.98e-01

Pos logit contributions
kin+0.396
ppers+0.389
ccoli+0.373
 rinse+0.361
 afloat+0.358
Neg logit contributions
ted-0.320
 Everyone-0.291
 Many-0.284
 People-0.280
 All-0.255
Token the
Token ID262
Feature activation+2.39e-01

Pos logit contributions
ted+0.053
 Everyone+0.049
 Many+0.048
 People+0.047
 All+0.042
Neg logit contributions
kin-0.073
ppers-0.072
ccoli-0.069
 afloat-0.067
kins-0.066
Token car
Token ID1097
Feature activation-3.43e+00

Pos logit contributions
ted+0.122
 Everyone+0.114
 Many+0.113
 People+0.110
 All+0.098
Neg logit contributions
ppers-0.149
kin-0.148
ccoli-0.141
 afloat-0.137
 rinse-0.136
Token had
Token ID550
Feature activation+5.59e+00

Pos logit contributions
ted+0.210
 Many+0.196
 Everyone+0.195
 People+0.190
 All+0.171
Neg logit contributions
kin-0.216
ppers-0.214
ccoli-0.202
 promise-0.197
 afloat-0.197
Token a
Token ID257
Feature activation-3.06e+00

Pos logit contributions
ppers+0.311
kin+0.309
 afloat+0.300
eps+0.297
 rinse+0.293
Neg logit contributions
ted-0.350
 Many-0.315
 Everyone-0.312
 People-0.305
heart-0.292
Token party
Token ID2151
Feature activation-1.46e+00

Pos logit contributions
 Many+0.154
 Everyone+0.153
 People+0.146
ted+0.139
 All+0.137
Neg logit contributions
ppers-0.226
wy-0.211
kin-0.210
kins-0.207
illas-0.204
Token.
Token ID13
Feature activation+2.44e-01

Pos logit contributions
ted+0.079
 Everyone+0.073
 People+0.070
 Many+0.070
 All+0.064
Neg logit contributions
ppers-0.101
kin-0.101
ccoli-0.100
 rinse-0.096
 afloat-0.095
Token Many
Token ID4650
Feature activation-1.48e+01

Pos logit contributions
kin+0.017
ccoli+0.017
ppers+0.016
 rinse+0.016
Anna+0.015
Neg logit contributions
ted-0.013
ly-0.010
 Many-0.010
 Everyone-0.009
 People-0.009
Token friends
Token ID2460
Feature activation-4.32e+00

Pos logit contributions
ted+0.706
 Everyone+0.637
 Many+0.605
 People+0.579
 As+0.551
Neg logit contributions
kin-1.102
kins-1.059
ppers-1.051
 promise-1.046
eps-1.038
Token came
Token ID1625
Feature activation-4.70e+00

Pos logit contributions
ted+0.243
 Many+0.226
 Everyone+0.225
 People+0.219
 All+0.192
Neg logit contributions
kin-0.291
ppers-0.287
ccoli-0.273
 promise-0.269
 achieve-0.267
Token to
Token ID284
Feature activation-6.73e-01

Pos logit contributions
ted+0.218
 Everyone+0.194
 Many+0.192
 People+0.188
 All+0.165
Neg logit contributions
kin-0.380
ppers-0.367
ccoli-0.357
 rinse-0.349
kins-0.343
Token celebrate
Token ID10648
Feature activation+3.24e+00

Pos logit contributions
ted+0.046
 Everyone+0.041
 Many+0.040
 People+0.040
 All+0.036
Neg logit contributions
kin-0.038
ppers-0.037
 promise-0.036
 achieve-0.036
 rinse-0.036
Token.
Token ID13
Feature activation+5.94e-01

Pos logit contributions
ppers+0.245
ccoli+0.230
kin+0.229
 afloat+0.225
edient+0.223
Neg logit contributions
ted-0.157
 Everyone-0.154
 People-0.145
 Many-0.137
 All-0.130
Token the
Token ID262
Feature activation+4.28e-01

Pos logit contributions
kin+0.201
ppers+0.189
 rinse+0.185
 promise+0.184
 achieve+0.182
Neg logit contributions
ted-0.178
 Many-0.161
 Everyone-0.161
 People-0.156
 All-0.136
Token jar
Token ID17379
Feature activation+5.00e+00

Pos logit contributions
ppers+0.028
kin+0.027
ccoli+0.026
 afloat+0.025
kins+0.025
Neg logit contributions
ted-0.026
 Everyone-0.025
 Many-0.025
 People-0.024
 All-0.022
Token.
Token ID13
Feature activation+4.30e-01

Pos logit contributions
kin+0.329
ppers+0.325
ccoli+0.311
 afloat+0.301
 rinse+0.300
Neg logit contributions
ted-0.294
 Everyone-0.272
 Many-0.272
 People-0.266
 All-0.239
Token She
Token ID1375
Feature activation-1.29e+00

Pos logit contributions
 achieve+0.023
 rinse+0.022
 neat+0.022
 promise+0.022
ppers+0.021
Neg logit contributions
ted-0.028
unn-0.023
pill-0.022
 winds-0.021
 butterflies-0.020
Token t
Token ID256
Feature activation-1.33e+00

Pos logit contributions
 Everyone+0.086
 Many+0.084
ted+0.084
 People+0.083
 Everybody+0.074
Neg logit contributions
kin-0.071
ppers-0.070
 promise-0.066
eps-0.065
 rinse-0.065
Tokenipt
Token ID10257
Feature activation-1.48e+01

Pos logit contributions
 People+0.070
 Many+0.068
 Everyone+0.067
ted+0.066
 winds+0.064
Neg logit contributions
ppers-0.097
kin-0.089
ucer-0.087
aky-0.087
eps-0.085
Tokeno
Token ID78
Feature activation+3.63e+00

Pos logit contributions
ted+1.259
 Everyone+1.176
 Many+1.170
 People+1.149
 All+1.075
Neg logit contributions
kin-0.611
ppers-0.581
ccoli-0.552
 afloat-0.532
 rinse-0.523
Tokened
Token ID276
Feature activation-6.32e+00

Pos logit contributions
ppers+0.283
kin+0.278
ccoli+0.268
 afloat+0.263
cher+0.262
Neg logit contributions
 Many-0.155
 Everyone-0.152
ted-0.150
 People-0.147
 All-0.122
Token up
Token ID510
Feature activation-2.38e+00

Pos logit contributions
ted+0.373
 Many+0.358
 Everyone+0.357
 People+0.348
 All+0.309
Neg logit contributions
kin-0.400
ppers-0.398
ccoli-0.381
 afloat-0.372
 rinse-0.369
Token to
Token ID284
Feature activation-7.17e-01

Pos logit contributions
ted+0.128
 Many+0.123
 Everyone+0.123
 People+0.120
 All+0.103
Neg logit contributions
kin-0.166
ppers-0.162
ccoli-0.154
 afloat-0.153
kins-0.151
Token the
Token ID262
Feature activation+3.71e-01

Pos logit contributions
ted+0.049
unn+0.045
pill+0.045
 People+0.044
 Many+0.044
Neg logit contributions
 rinse-0.037
 promise-0.036
 achieve-0.035
kin-0.034
ppers-0.033
Token the
Token ID262
Feature activation+4.24e-01

Pos logit contributions
ted+0.123
 Everyone+0.115
 Many+0.112
 People+0.111
 All+0.100
Neg logit contributions
ppers-0.165
kin-0.164
ccoli-0.158
 rinse-0.151
illas-0.150
Token baby
Token ID5156
Feature activation-5.03e-01

Pos logit contributions
kin+0.024
ppers+0.022
 promise+0.022
ccoli+0.022
 rinse+0.021
Neg logit contributions
ted-0.029
 Many-0.028
 Everyone-0.028
 People-0.027
 All-0.024
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
ted+0.032
 Many+0.030
 Everyone+0.030
 People+0.029
 All+0.026
Neg logit contributions
kin-0.031
ppers-0.031
ccoli-0.029
 afloat-0.028
 promise-0.028
Token They
Token ID1119
Feature activation+1.97e+00

Pos logit contributions
kin+0.012
ppers+0.012
ccoli+0.012
 rinse+0.011
 promise+0.011
Neg logit contributions
ted-0.013
 Many-0.012
 Everyone-0.011
 People-0.011
heart-0.010
Token t
Token ID256
Feature activation-1.42e+00

Pos logit contributions
kin+0.132
ppers+0.130
 promise+0.125
ccoli+0.123
 rinse+0.123
Neg logit contributions
ted-0.112
 Many-0.104
 Everyone-0.104
 People-0.102
 All-0.088
Tokenipt
Token ID10257
Feature activation-1.47e+01

Pos logit contributions
 People+0.064
 Many+0.063
 Everyone+0.060
ted+0.059
 winds+0.053
Neg logit contributions
ppers-0.116
kin-0.107
ucer-0.102
illas-0.102
edient-0.101
Tokeno
Token ID78
Feature activation+3.37e+00

Pos logit contributions
ted+1.247
 Everyone+1.140
 Many+1.122
 People+1.105
 All+1.034
Neg logit contributions
kin-0.663
ppers-0.608
ccoli-0.600
 afloat-0.574
kins-0.568
Tokened
Token ID276
Feature activation-6.32e+00

Pos logit contributions
kin+0.296
ppers+0.293
ccoli+0.282
 afloat+0.277
 rinse+0.276
Neg logit contributions
ted-0.126
 Everyone-0.110
 Many-0.110
 People-0.106
 All-0.087
Token inside
Token ID2641
Feature activation+3.21e+00

Pos logit contributions
ted+0.372
 Everyone+0.331
 Many+0.330
 People+0.322
 All+0.291
Neg logit contributions
kin-0.436
ppers-0.427
ccoli-0.404
 achieve-0.395
 afloat-0.391
Token and
Token ID290
Feature activation-7.11e-01

Pos logit contributions
kin+0.233
ppers+0.233
ccoli+0.222
 afloat+0.215
 rinse+0.213
Neg logit contributions
ted-0.170
 Everyone-0.155
 Many-0.154
 People-0.150
 All-0.134
Token put
Token ID1234
Feature activation-4.30e-01

Pos logit contributions
ted+0.043
 Everyone+0.038
 Many+0.037
 People+0.036
 All+0.034
Neg logit contributions
kin-0.048
ppers-0.047
ccoli-0.045
edient-0.044
 achieve-0.044
Token tree
Token ID5509
Feature activation-9.09e-01

Pos logit contributions
ted+0.041
 Many+0.040
 Everyone+0.039
 People+0.038
 All+0.035
Neg logit contributions
ppers-0.055
kin-0.052
ccoli-0.050
cher-0.049
edient-0.049
Token in
Token ID287
Feature activation+2.07e+00

Pos logit contributions
ted+0.050
 Everyone+0.046
 Many+0.046
 People+0.045
 All+0.042
Neg logit contributions
kin-0.063
ppers-0.062
ccoli-0.060
 rinse-0.059
 afloat-0.058
Token a
Token ID257
Feature activation-3.10e+00

Pos logit contributions
kin+0.144
ppers+0.139
ccoli+0.133
 afloat+0.132
 rinse+0.131
Neg logit contributions
ted-0.117
 Many-0.106
 Everyone-0.105
 People-0.103
 All-0.089
Token park
Token ID3952
Feature activation-2.13e+00

Pos logit contributions
 Many+0.155
 Everyone+0.153
 People+0.145
ted+0.143
 All+0.137
Neg logit contributions
ppers-0.238
kin-0.226
ccoli-0.224
kins-0.215
cher-0.213
Token.
Token ID13
Feature activation+6.71e-02

Pos logit contributions
ted+0.139
 Many+0.130
 Everyone+0.129
 People+0.126
 All+0.113
Neg logit contributions
kin-0.128
ppers-0.125
ccoli-0.117
 afloat-0.115
 promise-0.114
Token Many
Token ID4650
Feature activation-1.47e+01

Pos logit contributions
ccoli+0.004
kin+0.004
Anna+0.004
Arthur+0.004
Jackson+0.004
Neg logit contributions
ted-0.004
ly-0.003
unn-0.003
heart-0.003
 news-0.003
Token animals
Token ID4695
Feature activation-8.60e+00

Pos logit contributions
ted+0.695
 Everyone+0.633
 As+0.588
 Many+0.569
 People+0.558
Neg logit contributions
kin-1.043
kins-1.040
eps-1.036
aky-1.031
aks-1.019
Token lived
Token ID5615
Feature activation-2.31e+00

Pos logit contributions
ted+0.487
 Everyone+0.447
 Many+0.447
 People+0.436
 All+0.390
Neg logit contributions
kin-0.587
ppers-0.580
ccoli-0.555
 afloat-0.539
 rinse-0.537
Token in
Token ID287
Feature activation+3.05e+00

Pos logit contributions
ted+0.139
 Everyone+0.125
 People+0.122
 Many+0.122
 All+0.113
Neg logit contributions
ppers-0.151
ccoli-0.144
kin-0.144
 rinse-0.142
 promise-0.137
Token the
Token ID262
Feature activation+6.79e-01

Pos logit contributions
kin+0.222
ppers+0.218
ccoli+0.209
 afloat+0.204
 rinse+0.203
Neg logit contributions
ted-0.161
 Many-0.147
 Everyone-0.147
 People-0.143
 All-0.126
Token tree
Token ID5509
Feature activation+2.63e-01

Pos logit contributions
kin+0.050
ppers+0.050
ccoli+0.048
 afloat+0.047
 rinse+0.046
Neg logit contributions
ted-0.035
 Everyone-0.032
 Many-0.032
 People-0.031
 All-0.027
Token was
Token ID373
Feature activation-2.44e+00

Pos logit contributions
kin+0.056
ppers+0.052
ccoli+0.051
kins+0.050
 promise+0.049
Neg logit contributions
ted-0.050
 Many-0.046
 Everyone-0.045
 People-0.044
 All-0.039
Token a
Token ID257
Feature activation-3.02e+00

Pos logit contributions
ted+0.175
unn+0.151
 People+0.151
 Everyone+0.150
 Many+0.147
Neg logit contributions
kin-0.131
ccoli-0.115
 rinse-0.113
Anna-0.112
 afloat-0.110
Token big
Token ID1263
Feature activation+1.34e-01

Pos logit contributions
 Many+0.109
 Everyone+0.107
 People+0.102
ted+0.101
 All+0.088
Neg logit contributions
ppers-0.263
kin-0.244
ccoli-0.239
 afloat-0.239
wy-0.238
Token race
Token ID3234
Feature activation-1.63e+00

Pos logit contributions
ppers+0.009
kin+0.009
ccoli+0.008
 afloat+0.008
 rinse+0.008
Neg logit contributions
ted-0.008
 Many-0.007
 Everyone-0.007
 People-0.007
 All-0.006
Token.
Token ID13
Feature activation+2.40e-01

Pos logit contributions
ted+0.107
 Many+0.100
 Everyone+0.098
 People+0.097
 All+0.087
Neg logit contributions
kin-0.097
ppers-0.094
ccoli-0.088
 afloat-0.087
 promise-0.087
Token Many
Token ID4650
Feature activation-1.47e+01

Pos logit contributions
kin+0.016
ccoli+0.016
Anna+0.015
Jackson+0.015
 achieve+0.015
Neg logit contributions
ted-0.013
unn-0.009
ly-0.009
iked-0.008
arted-0.008
Token kids
Token ID3988
Feature activation-6.31e+00

Pos logit contributions
ted+0.742
 Everyone+0.619
 Many+0.606
 People+0.572
 As+0.563
Neg logit contributions
kin-1.067
kins-1.039
eps-1.007
 promise-0.998
aky-0.998
Token ran
Token ID4966
Feature activation-2.66e+00

Pos logit contributions
ted+0.354
 Many+0.326
 Everyone+0.324
 People+0.315
 All+0.276
Neg logit contributions
kin-0.426
ppers-0.424
ccoli-0.399
 promise-0.397
 achieve-0.394
Token in
Token ID287
Feature activation+3.12e+00

Pos logit contributions
ted+0.159
 Many+0.149
 Everyone+0.146
 People+0.144
 All+0.126
Neg logit contributions
kin-0.165
ppers-0.164
 afloat-0.157
ccoli-0.153
 promise-0.153
Token the
Token ID262
Feature activation+6.14e-01

Pos logit contributions
kin+0.227
ppers+0.226
ccoli+0.216
 rinse+0.208
 afloat+0.207
Neg logit contributions
ted-0.164
 Everyone-0.152
 Many-0.152
 People-0.147
 All-0.132
Token race
Token ID3234
Feature activation-1.14e+00

Pos logit contributions
kin+0.040
ppers+0.040
ccoli+0.038
 afloat+0.036
 rinse+0.036
Neg logit contributions
ted-0.037
 Everyone-0.035
 Many-0.035
 People-0.034
 All-0.031
Token,
Token ID11
Feature activation+8.51e-02

Pos logit contributions
kin+0.135
ppers+0.135
ccoli+0.131
 afloat+0.130
 achieve+0.127
Neg logit contributions
ted-0.100
 Everyone-0.090
 People-0.087
 Many-0.085
 All-0.080
Token and
Token ID290
Feature activation-5.39e-01

Pos logit contributions
kin+0.006
ppers+0.006
ccoli+0.006
edient+0.005
 afloat+0.005
Neg logit contributions
ted-0.005
 Everyone-0.004
 Many-0.004
 People-0.004
 All-0.004
Token then
Token ID788
Feature activation-1.45e+00

Pos logit contributions
ted+0.027
 Everyone+0.025
 Many+0.025
 People+0.024
 All+0.021
Neg logit contributions
kin-0.040
ppers-0.040
ccoli-0.038
 afloat-0.037
 promise-0.037
Token they
Token ID484
Feature activation+1.75e+00

Pos logit contributions
ted+0.084
 Many+0.079
 Everyone+0.078
 People+0.077
 All+0.068
Neg logit contributions
kin-0.096
ppers-0.094
ccoli-0.090
 rinse-0.088
 promise-0.087
Token t
Token ID256
Feature activation-1.31e+00

Pos logit contributions
kin+0.117
ppers+0.115
ccoli+0.113
 afloat+0.109
edient+0.108
Neg logit contributions
ted-0.104
 Everyone-0.092
 Many-0.092
 People-0.089
 All-0.082
Tokenipt
Token ID10257
Feature activation-1.46e+01

Pos logit contributions
 Many+0.056
 People+0.055
ted+0.055
 Everyone+0.053
 All+0.045
Neg logit contributions
ppers-0.110
kin-0.105
ucer-0.098
kins-0.097
illas-0.097
Tokeno
Token ID78
Feature activation+3.46e+00

Pos logit contributions
ted+1.229
 Everyone+1.117
 Many+1.094
 People+1.084
 All+1.011
Neg logit contributions
kin-0.668
ppers-0.616
ccoli-0.609
 afloat-0.587
 achieve-0.578
Tokened
Token ID276
Feature activation-6.24e+00

Pos logit contributions
kin+0.333
ppers+0.325
ccoli+0.318
 afloat+0.312
 achieve+0.312
Neg logit contributions
ted-0.110
 Everyone-0.086
 Many-0.083
 People-0.080
 All-0.064
Token to
Token ID284
Feature activation-6.36e-01

Pos logit contributions
ted+0.344
 Everyone+0.301
 Many+0.295
 People+0.290
 All+0.262
Neg logit contributions
kin-0.457
ppers-0.449
ccoli-0.426
 achieve-0.421
kins-0.413
Token the
Token ID262
Feature activation+5.06e-01

Pos logit contributions
ted+0.033
 Everyone+0.030
 Many+0.030
 People+0.029
 All+0.026
Neg logit contributions
kin-0.047
ppers-0.047
ccoli-0.045
 afloat-0.043
kins-0.043
Token table
Token ID3084
Feature activation+2.22e+00

Pos logit contributions
ppers+0.033
kin+0.033
 afloat+0.032
ccoli+0.031
 achieve+0.030
Neg logit contributions
ted-0.030
 Everyone-0.029
 Many-0.028
 People-0.028
 All-0.025
Token naughty
Token ID45128
Feature activation+1.19e+00

Pos logit contributions
 Many+0.093
ted+0.090
 Everyone+0.089
 sight+0.085
 news+0.084
Neg logit contributions
ppers-0.186
illas-0.169
edient-0.169
wy-0.162
kins-0.161
Token idea
Token ID2126
Feature activation-4.66e+00

Pos logit contributions
kin+0.075
ppers+0.074
ccoli+0.070
 afloat+0.068
 rinse+0.068
Neg logit contributions
ted-0.073
 Many-0.068
 Everyone-0.068
 People-0.066
 All-0.060
Token.
Token ID13
Feature activation+9.79e-02

Pos logit contributions
ted+0.316
 Many+0.303
 Everyone+0.301
 People+0.296
 All+0.263
Neg logit contributions
kin-0.261
ppers-0.260
ccoli-0.243
 promise-0.236
 rinse-0.232
Token She
Token ID1375
Feature activation-1.40e+00

Pos logit contributions
ccoli+0.006
kin+0.006
ppers+0.006
 rinse+0.006
Anna+0.006
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.005
 People-0.005
heart-0.004
Token t
Token ID256
Feature activation-1.25e+00

Pos logit contributions
ted+0.086
 Everyone+0.082
 Many+0.081
 People+0.080
 All+0.070
Neg logit contributions
ppers-0.086
kin-0.086
ccoli-0.084
 promise-0.082
 rinse-0.080
Tokenipt
Token ID10257
Feature activation-1.46e+01

Pos logit contributions
 People+0.075
 Many+0.075
ted+0.074
 Everyone+0.074
 All+0.066
Neg logit contributions
ppers-0.081
kin-0.074
eps-0.070
ucer-0.069
illas-0.069
Tokeno
Token ID78
Feature activation+3.51e+00

Pos logit contributions
ted+1.244
 Everyone+1.136
 Many+1.126
 People+1.107
 All+1.035
Neg logit contributions
kin-0.653
ppers-0.604
ccoli-0.582
 afloat-0.572
kins-0.562
Tokened
Token ID276
Feature activation-6.16e+00

Pos logit contributions
kin+0.309
ppers+0.304
ccoli+0.293
 afloat+0.288
 rinse+0.288
Neg logit contributions
ted-0.135
 Everyone-0.115
 Many-0.114
 People-0.110
 All-0.092
Token to
Token ID284
Feature activation-6.61e-01

Pos logit contributions
ted+0.332
 Everyone+0.302
 Many+0.301
 People+0.293
 All+0.260
Neg logit contributions
kin-0.442
ppers-0.436
ccoli-0.416
 afloat-0.406
 rinse-0.404
Token his
Token ID465
Feature activation+2.96e+00

Pos logit contributions
ted+0.033
 Many+0.030
 Everyone+0.030
 People+0.029
 All+0.025
Neg logit contributions
kin-0.050
ppers-0.050
ccoli-0.048
 afloat-0.046
 rinse-0.046
Token bed
Token ID3996
Feature activation+2.23e+00

Pos logit contributions
ppers+0.175
kin+0.170
 achieve+0.158
wy+0.156
 afloat+0.156
Neg logit contributions
ted-0.195
 Everyone-0.185
 Many-0.183
 People-0.177
 All-0.167
Token it
Token ID340
Feature activation-3.55e+00

Pos logit contributions
ppers+0.161
kin+0.160
ccoli+0.154
 afloat+0.149
 achieve+0.148
Neg logit contributions
ted-0.125
 Everyone-0.116
 Many-0.115
 People-0.113
 All-0.101
Token was
Token ID373
Feature activation-1.36e+00

Pos logit contributions
ted+0.188
 Everyone+0.155
 People+0.153
 Many+0.153
 All+0.143
Neg logit contributions
ppers-0.257
kin-0.252
 achieve-0.247
ccoli-0.244
 afloat-0.242
Token too
Token ID1165
Feature activation-2.52e+00

Pos logit contributions
ted+0.060
 as+0.055
 Every+0.052
 Everyone+0.051
 winds+0.050
Neg logit contributions
ppers-0.101
 achieve-0.101
kin-0.097
ucer-0.096
ccoli-0.095
Token late
Token ID2739
Feature activation+3.79e+00

Pos logit contributions
ted+0.139
 Many+0.132
 Everyone+0.127
 People+0.125
 winds+0.118
Neg logit contributions
ppers-0.167
kin-0.162
 achieve-0.159
illas-0.157
ccoli-0.156
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
kin+0.269
ppers+0.264
ccoli+0.256
 rinse+0.253
wy+0.249
Neg logit contributions
ted-0.207
 Everyone-0.187
 People-0.178
 Many-0.178
 All-0.164
Token Many
Token ID4650
Feature activation-1.46e+01

Pos logit contributions
kin+0.022
ppers+0.021
ccoli+0.020
 rinse+0.020
 afloat+0.020
Neg logit contributions
ted-0.013
 Many-0.011
 Everyone-0.011
 People-0.011
 All-0.009
Token people
Token ID661
Feature activation-6.75e+00

Pos logit contributions
ted+0.621
 Everyone+0.558
 Many+0.532
 People+0.514
 Every+0.462
Neg logit contributions
kin-1.174
ppers-1.160
kins-1.124
 promise-1.115
eps-1.103
Token were
Token ID547
Feature activation-1.53e+00

Pos logit contributions
ted+0.382
 Many+0.326
 Everyone+0.325
 People+0.317
 All+0.292
Neg logit contributions
kin-0.476
ccoli-0.471
ppers-0.462
 afloat-0.439
 rinse-0.435
Token sad
Token ID6507
Feature activation-1.48e-03

Pos logit contributions
ted+0.094
 Everyone+0.087
 Many+0.087
 People+0.084
 All+0.078
Neg logit contributions
kin-0.094
ccoli-0.093
ppers-0.090
 rinse-0.085
 achieve-0.085
Token and
Token ID290
Feature activation-5.44e-01

Pos logit contributions
ted+0.000
 Many+0.000
 Everyone+0.000
 People+0.000
 All+0.000
Neg logit contributions
kin-0.000
ppers-0.000
ccoli-0.000
 afloat-0.000
 rinse-0.000
Token angry
Token ID7954
Feature activation-1.88e-01

Pos logit contributions
ted+0.037
 Everyone+0.035
 Many+0.035
 People+0.034
 All+0.031
Neg logit contributions
kin-0.031
ppers-0.030
ccoli-0.029
 afloat-0.028
 rinse-0.028
Token sour
Token ID11348
Feature activation+2.10e-01

Pos logit contributions
ted+0.163
 Many+0.153
 Everyone+0.153
 People+0.148
 All+0.136
Neg logit contributions
kin-0.151
ppers-0.149
ccoli-0.142
 afloat-0.139
 achieve-0.137
Token!
Token ID0
Feature activation-1.30e+00

Pos logit contributions
ppers+0.012
kin+0.012
ccoli+0.011
 rinse+0.011
 promise+0.011
Neg logit contributions
ted-0.014
 Many-0.014
 Everyone-0.013
 People-0.013
 All-0.011
Tokenâ
Token ID22940
Feature activation-1.07e+00

Pos logit contributions
ted+0.075
ly+0.057
iked+0.055
ened+0.054
 Many+0.052
Neg logit contributions
ccoli-0.079
 rinse-0.077
kin-0.076
 promise-0.075
ppers-0.074
Token€
Token ID26391
Feature activation+3.47e+00

Pos logit contributions
ted+0.074
 Everyone+0.072
 People+0.070
 Many+0.069
 All+0.064
Neg logit contributions
ppers-0.057
kin-0.057
ccoli-0.055
 afloat-0.053
 rinse-0.052
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
kin+0.184
ppers+0.183
ccoli+0.171
 afloat+0.164
 promise+0.163
Neg logit contributions
ted-0.251
 Everyone-0.233
 Many-0.233
 People-0.228
 All-0.209
Token Many
Token ID4650
Feature activation-1.46e+01

Pos logit contributions
ccoli+0.014
 rinse+0.014
kin+0.013
ppers+0.013
 OK+0.013
Neg logit contributions
ted-0.013
ly-0.010
ened-0.009
 winds-0.009
iked-0.009
Token people
Token ID661
Feature activation-6.91e+00

Pos logit contributions
ted+0.705
 Everyone+0.653
 Many+0.624
 People+0.606
 As+0.584
Neg logit contributions
kin-1.086
kins-1.070
ppers-1.059
 promise-1.013
aky-1.013
Token heard
Token ID2982
Feature activation+1.17e+00

Pos logit contributions
ted+0.401
 Everyone+0.355
 Many+0.351
 People+0.341
 All+0.315
Neg logit contributions
kin-0.472
ppers-0.465
ccoli-0.453
 afloat-0.438
edient-0.425
Token her
Token ID607
Feature activation-2.02e+00

Pos logit contributions
 rinse+0.061
kin+0.061
ppers+0.061
 promise+0.060
kins+0.057
Neg logit contributions
ted-0.069
 Many-0.063
unn-0.063
 People-0.061
 Everyone-0.060
Token and
Token ID290
Feature activation-6.49e-01

Pos logit contributions
ted+0.111
 Everyone+0.103
 Many+0.100
 People+0.095
 All+0.091
Neg logit contributions
kin-0.142
ppers-0.138
ccoli-0.134
 afloat-0.131
cher-0.130
Token started
Token ID2067
Feature activation-4.16e+00

Pos logit contributions
ted+0.038
 Many+0.038
 Everyone+0.037
 People+0.036
 All+0.031
Neg logit contributions
kin-0.040
ppers-0.039
 rinse-0.038
 promise-0.038
ccoli-0.037
Token long
Token ID890
Feature activation-1.04e+00

Pos logit contributions
 Many+0.151
 Everyone+0.142
 People+0.137
ted+0.135
 All+0.125
Neg logit contributions
ppers-0.261
kin-0.253
kins-0.249
wy-0.248
edient-0.240
Token,
Token ID11
Feature activation-3.22e-01

Pos logit contributions
ted+0.055
 Many+0.051
 Everyone+0.050
 People+0.049
 All+0.044
Neg logit contributions
ppers-0.074
kin-0.072
ccoli-0.070
edient-0.068
 rinse-0.068
Token long
Token ID890
Feature activation-5.92e-01

Pos logit contributions
ted+0.019
 Many+0.018
 Everyone+0.018
 People+0.017
 All+0.016
Neg logit contributions
ppers-0.021
kin-0.020
ccoli-0.019
illas-0.019
edient-0.019
Token time
Token ID640
Feature activation-9.67e-01

Pos logit contributions
ted+0.037
 Many+0.035
 Everyone+0.035
 People+0.034
 All+0.031
Neg logit contributions
ppers-0.036
kin-0.034
ccoli-0.034
edient-0.033
 achieve-0.032
Token.
Token ID13
Feature activation+4.15e-01

Pos logit contributions
ted+0.057
 Everyone+0.053
 Many+0.052
 People+0.051
 All+0.047
Neg logit contributions
kin-0.064
ppers-0.063
ccoli-0.060
 rinse-0.058
 afloat-0.058
Token Many
Token ID4650
Feature activation-1.46e+01

Pos logit contributions
ccoli+0.028
kin+0.027
ppers+0.026
 rinse+0.026
Anna+0.025
Neg logit contributions
ted-0.023
 Many-0.018
ly-0.017
 People-0.017
 Everyone-0.017
Token people
Token ID661
Feature activation-6.77e+00

Pos logit contributions
ted+0.702
 Everyone+0.635
 Many+0.615
 People+0.581
 As+0.558
Neg logit contributions
kin-1.081
kins-1.044
 promise-1.010
ppers-1.006
eps-0.991
Token came
Token ID1625
Feature activation-4.90e+00

Pos logit contributions
ted+0.394
 Everyone+0.356
 Many+0.351
 People+0.347
 All+0.318
Neg logit contributions
kin-0.463
ppers-0.454
ccoli-0.439
 afloat-0.423
 rinse-0.417
Token to
Token ID284
Feature activation-7.00e-01

Pos logit contributions
ted+0.218
 Everyone+0.188
 Many+0.183
 People+0.180
 All+0.165
Neg logit contributions
kin-0.414
ppers-0.391
ccoli-0.389
wy-0.377
 rinse-0.376
Token see
Token ID766
Feature activation-7.58e-01

Pos logit contributions
ted+0.044
 Everyone+0.041
 Many+0.041
 People+0.040
 All+0.036
Neg logit contributions
kin-0.043
ppers-0.043
ccoli-0.041
 afloat-0.040
 promise-0.040
Token the
Token ID262
Feature activation+5.50e-01

Pos logit contributions
ted+0.043
 Everyone+0.040
 Many+0.039
 People+0.039
 All+0.034
Neg logit contributions
kin-0.052
ppers-0.051
ccoli-0.049
 afloat-0.048
 rinse-0.048
Token jumping
Token ID14284
Feature activation-1.06e+00

Pos logit contributions
kin+0.017
ppers+0.016
ccoli+0.015
wy+0.015
edient+0.015
Neg logit contributions
ted-0.013
 Many-0.011
 People-0.011
 Everyone-0.010
 All-0.010
Token over
Token ID625
Feature activation+4.58e+00

Pos logit contributions
ted+0.062
 Everyone+0.056
 Many+0.056
 People+0.055
 All+0.049
Neg logit contributions
kin-0.072
ppers-0.071
ccoli-0.068
 promise-0.066
 rinse-0.066
Token logs
Token ID17259
Feature activation-1.72e+00

Pos logit contributions
kin+0.277
ppers+0.273
 afloat+0.268
ccoli+0.265
 rinse+0.261
Neg logit contributions
ted-0.262
 Everyone-0.258
 Many-0.256
 People-0.250
 Everybody-0.216
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
ted+0.095
 Everyone+0.086
 Many+0.085
 People+0.083
 All+0.076
Neg logit contributions
kin-0.122
ppers-0.119
ccoli-0.116
 rinse-0.112
 promise-0.110
Token t
Token ID256
Feature activation-1.24e+00

Pos logit contributions
kin+0.017
ppers+0.016
ccoli+0.016
 promise+0.016
edient+0.016
Neg logit contributions
ted-0.011
 Many-0.009
 Everyone-0.009
 People-0.009
 All-0.008
Tokenipt
Token ID10257
Feature activation-1.46e+01

Pos logit contributions
 Many+0.069
ted+0.068
 People+0.067
 Everyone+0.067
 All+0.057
Neg logit contributions
ppers-0.088
kin-0.079
edient-0.076
 afloat-0.076
illas-0.075
Tokenoe
Token ID2577
Feature activation+2.56e+00

Pos logit contributions
ted+1.195
 Everyone+1.100
 Many+1.086
 People+1.071
 All+0.999
Neg logit contributions
kin-0.675
ppers-0.626
ccoli-0.613
 rinse-0.590
 promise-0.584
Tokening
Token ID278
Feature activation-1.97e+00

Pos logit contributions
ppers+0.176
kin+0.171
ccoli+0.163
 afloat+0.160
illas+0.157
Neg logit contributions
ted-0.141
 Everyone-0.139
 Many-0.138
 People-0.135
 All-0.119
Token around
Token ID1088
Feature activation-1.06e+00

Pos logit contributions
ted+0.128
 Many+0.121
 Everyone+0.121
 People+0.118
 All+0.107
Neg logit contributions
kin-0.117
ppers-0.116
ccoli-0.109
 afloat-0.107
 rinse-0.105
Token rocks
Token ID12586
Feature activation-4.69e+00

Pos logit contributions
ted+0.052
 Everyone+0.046
 Many+0.046
 People+0.045
 All+0.040
Neg logit contributions
kin-0.082
ppers-0.082
ccoli-0.078
 promise-0.076
 rinse-0.075
Token,
Token ID11
Feature activation+1.60e-01

Pos logit contributions
ted+0.263
 Everyone+0.236
 Many+0.233
 People+0.229
 All+0.209
Neg logit contributions
kin-0.325
ppers-0.317
ccoli-0.310
 rinse-0.301
 promise-0.298
Token landed
Token ID11406
Feature activation-3.92e+00

Pos logit contributions
ted+0.030
 Everyone+0.026
 Many+0.025
 People+0.025
 All+0.023
Neg logit contributions
kin-0.031
ppers-0.031
ccoli-0.029
 promise-0.029
 achieve-0.029
Token on
Token ID319
Feature activation+6.63e+00

Pos logit contributions
ted+0.302
 Everyone+0.268
 People+0.261
 Many+0.258
 All+0.244
Neg logit contributions
ccoli-0.200
kin-0.197
ppers-0.196
 promise-0.193
 rinse-0.186
Token the
Token ID262
Feature activation+6.29e-01

Pos logit contributions
kin+0.458
ppers+0.447
ccoli+0.428
 afloat+0.424
 rinse+0.420
Neg logit contributions
ted-0.357
 Many-0.331
 Everyone-0.323
 People-0.320
 All-0.276
Token street
Token ID4675
Feature activation+4.99e-01

Pos logit contributions
ppers+0.041
kin+0.041
ccoli+0.039
 afloat+0.038
 achieve+0.037
Neg logit contributions
ted-0.037
 Everyone-0.036
 Many-0.035
 People-0.035
 All-0.030
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
kin+0.033
ppers+0.033
ccoli+0.032
 rinse+0.031
 afloat+0.030
Neg logit contributions
ted-0.029
 Everyone-0.027
 Many-0.027
 People-0.026
 All-0.024
Token Many
Token ID4650
Feature activation-1.46e+01

Pos logit contributions
kin+0.017
ccoli+0.016
ppers+0.016
 rinse+0.016
 afloat+0.016
Neg logit contributions
ted-0.013
 Many-0.011
 Everyone-0.011
 People-0.011
heart-0.011
Token birds
Token ID10087
Feature activation-9.15e+00

Pos logit contributions
ted+0.610
 Everyone+0.534
 Many+0.514
 People+0.498
 Every+0.444
Neg logit contributions
kin-1.185
ppers-1.179
kins-1.132
 promise-1.117
eps-1.110
Token and
Token ID290
Feature activation-5.21e-01

Pos logit contributions
ted+0.489
 Many+0.425
 Everyone+0.423
 People+0.409
 All+0.371
Neg logit contributions
kin-0.674
ppers-0.660
ccoli-0.643
 afloat-0.619
 promise-0.616
Token dogs
Token ID6844
Feature activation-6.25e+00

Pos logit contributions
 Everyone+0.024
ted+0.024
 Many+0.023
 People+0.023
 All+0.020
Neg logit contributions
ppers-0.037
kin-0.037
ccoli-0.035
illas-0.035
kins-0.035
Token came
Token ID1625
Feature activation-4.66e+00

Pos logit contributions
ted+0.358
 Everyone+0.302
 Many+0.300
 People+0.290
 All+0.272
Neg logit contributions
kin-0.453
ppers-0.439
ccoli-0.431
 afloat-0.410
edient-0.409
Token and
Token ID290
Feature activation-5.48e-01

Pos logit contributions
ted+0.256
 Everyone+0.232
 Many+0.231
 People+0.224
 All+0.202
Neg logit contributions
kin-0.342
ppers-0.325
ccoli-0.321
 rinse-0.306
kins-0.305
Token made
Token ID925
Feature activation+7.39e-01

Pos logit contributions
ted+0.251
 Many+0.234
 Everyone+0.232
 People+0.226
 All+0.203
Neg logit contributions
kin-0.226
ppers-0.218
ccoli-0.209
 promise-0.202
 rinse-0.201
Token flour
Token ID10601
Feature activation+2.03e+00

Pos logit contributions
ep+0.035
eps+0.035
aks+0.032
 clean+0.031
kin+0.031
Neg logit contributions
ted-0.056
unn-0.050
pill-0.050
elt-0.046
umm-0.045
Token from
Token ID422
Feature activation+6.78e-01

Pos logit contributions
ccoli+0.155
kin+0.153
cher+0.151
Jackson+0.151
ppers+0.151
Neg logit contributions
ted-0.088
 Everyone-0.080
 People-0.077
 Many-0.070
 All-0.070
Token wheat
Token ID17135
Feature activation+1.62e-01

Pos logit contributions
kin+0.043
ppers+0.042
 afloat+0.041
ccoli+0.041
edient+0.039
Neg logit contributions
ted-0.040
 Many-0.038
 Everyone-0.038
 People-0.037
 All-0.033
Token.
Token ID13
Feature activation+3.92e-01

Pos logit contributions
kin+0.011
ppers+0.011
ccoli+0.010
 rinse+0.010
 promise+0.010
Neg logit contributions
ted-0.009
 Everyone-0.008
 People-0.008
 Many-0.008
 All-0.007
Token Many
Token ID4650
Feature activation-1.46e+01

Pos logit contributions
ccoli+0.019
kin+0.019
 globe+0.019
aks+0.018
 achieve+0.018
Neg logit contributions
ted-0.027
ly-0.023
heart-0.022
vious-0.022
arted-0.021
Token people
Token ID661
Feature activation-6.83e+00

Pos logit contributions
ted+0.681
 Everyone+0.567
 Many+0.537
 People+0.515
 As+0.490
Neg logit contributions
kin-1.139
kins-1.126
ppers-1.120
eps-1.094
illas-1.081
Token came
Token ID1625
Feature activation-4.80e+00

Pos logit contributions
ted+0.391
 Everyone+0.358
 Many+0.357
 People+0.349
 All+0.314
Neg logit contributions
kin-0.464
ppers-0.458
ccoli-0.441
 afloat-0.429
 rinse-0.423
Token to
Token ID284
Feature activation-7.28e-01

Pos logit contributions
ted+0.221
 Everyone+0.200
 Many+0.199
 People+0.194
 All+0.168
Neg logit contributions
kin-0.383
ppers-0.375
ccoli-0.362
 rinse-0.353
 afloat-0.352
Token see
Token ID766
Feature activation-8.78e-01

Pos logit contributions
ted+0.047
 Everyone+0.043
 Many+0.043
 People+0.042
 All+0.038
Neg logit contributions
kin-0.044
ppers-0.044
ccoli-0.042
 afloat-0.041
 rinse-0.041
Token the
Token ID262
Feature activation+4.26e-01

Pos logit contributions
ted+0.049
 Many+0.045
 Everyone+0.044
 People+0.043
 All+0.038
Neg logit contributions
ppers-0.060
kin-0.060
ccoli-0.057
 afloat-0.056
 rinse-0.056
Token rushed
Token ID13999
Feature activation-3.16e+00

Pos logit contributions
ted+0.032
 Everyone+0.032
 Many+0.031
 People+0.031
 Everybody+0.028
Neg logit contributions
kin-0.027
ppers-0.027
 rinse-0.026
ccoli-0.026
kins-0.026
Token toward
Token ID3812
Feature activation-2.66e+00

Pos logit contributions
ted+0.192
 Many+0.181
 Everyone+0.180
 People+0.176
 All+0.158
Neg logit contributions
kin-0.198
ppers-0.197
ccoli-0.187
 afloat-0.183
 rinse-0.182
Token the
Token ID262
Feature activation+5.44e-01

Pos logit contributions
ted+0.166
 Many+0.158
 Everyone+0.154
 People+0.149
pill+0.139
Neg logit contributions
kin-0.151
 rinse-0.144
ppers-0.142
kins-0.140
 afloat-0.137
Token fire
Token ID2046
Feature activation+2.66e+00

Pos logit contributions
kin+0.032
ppers+0.031
ccoli+0.030
 afloat+0.029
 rinse+0.029
Neg logit contributions
ted-0.037
 Everyone-0.034
 Many-0.034
 People-0.033
 All-0.030
Token.
Token ID13
Feature activation+4.67e-01

Pos logit contributions
kin+0.191
ppers+0.181
ccoli+0.176
 rinse+0.175
 promise+0.173
Neg logit contributions
ted-0.148
 Everyone-0.134
 People-0.129
 Many-0.125
 All-0.120
Token Many
Token ID4650
Feature activation-1.46e+01

Pos logit contributions
ccoli+0.030
 achieve+0.028
 rinse+0.027
 Claus+0.027
ppers+0.027
Neg logit contributions
ted-0.025
heart-0.022
ly-0.020
vious-0.019
unn-0.019
Token of
Token ID286
Feature activation-6.32e+00

Pos logit contributions
ted+0.571
 Everyone+0.518
 Many+0.486
 People+0.463
 All+0.401
Neg logit contributions
kin-1.210
ppers-1.209
kins-1.176
 afloat-1.149
eps-1.148
Token the
Token ID262
Feature activation+5.22e-01

Pos logit contributions
ted+0.397
 Everyone+0.368
 Many+0.363
 People+0.355
ly+0.325
Neg logit contributions
kin-0.371
kins-0.351
ppers-0.348
 rinse-0.346
edient-0.339
Token people
Token ID661
Feature activation-6.76e+00

Pos logit contributions
kin+0.031
ppers+0.030
ccoli+0.029
 afloat+0.028
 rinse+0.028
Neg logit contributions
ted-0.035
 Many-0.032
 Everyone-0.032
 People-0.032
 All-0.029
Token said
Token ID531
Feature activation-2.70e+00

Pos logit contributions
ted+0.378
 Everyone+0.334
 Many+0.333
 People+0.326
 All+0.296
Neg logit contributions
kin-0.477
ppers-0.465
ccoli-0.456
 afloat-0.435
 rinse-0.434
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
ted+0.152
 Many+0.140
 Everyone+0.139
 People+0.136
 All+0.120
Neg logit contributions
kin-0.186
ppers-0.182
ccoli-0.175
 rinse-0.169
 afloat-0.169
Token searching
Token ID10342
Feature activation-4.00e+00

Pos logit contributions
ted+0.132
 Many+0.122
 Everyone+0.122
 People+0.119
 All+0.108
Neg logit contributions
kin-0.127
ppers-0.125
ccoli-0.119
 rinse-0.116
 promise-0.116
Token for
Token ID329
Feature activation-3.26e+00

Pos logit contributions
ted+0.227
 Many+0.212
 Everyone+0.210
 People+0.204
 All+0.184
Neg logit contributions
kin-0.269
ppers-0.265
ccoli-0.254
 promise-0.251
 afloat-0.251
Token a
Token ID257
Feature activation-2.63e+00

Pos logit contributions
ted+0.219
 Many+0.205
 Everyone+0.204
 People+0.200
arted+0.184
Neg logit contributions
ppers-0.176
 promise-0.175
 afloat-0.170
kin-0.169
 rinse-0.169
Token lesson
Token ID11483
Feature activation-2.14e+00

Pos logit contributions
ted+0.125
 Many+0.124
 People+0.116
 Everyone+0.116
 All+0.106
Neg logit contributions
ppers-0.185
kin-0.183
kins-0.176
edient-0.175
wy-0.174
Token.
Token ID13
Feature activation+3.76e-01

Pos logit contributions
ted+0.125
 Many+0.120
 Everyone+0.116
 People+0.113
 All+0.100
Neg logit contributions
ppers-0.142
kin-0.141
ccoli-0.131
 promise-0.130
kins-0.129
Token Many
Token ID4650
Feature activation-1.46e+01

Pos logit contributions
kin+0.025
ppers+0.025
ccoli+0.024
 afloat+0.023
 rinse+0.023
Neg logit contributions
ted-0.021
 Many-0.019
 Everyone-0.019
 People-0.019
 All-0.017
Token times
Token ID1661
Feature activation-6.03e+00

Pos logit contributions
ted+0.759
 Everyone+0.699
 Many+0.685
 People+0.647
 As+0.603
Neg logit contributions
kin-1.061
 promise-1.016
ppers-1.009
kins-1.004
 achieve-0.973
Token,
Token ID11
Feature activation+1.38e-01

Pos logit contributions
ted+0.377
 Everyone+0.350
 Many+0.349
 People+0.342
 All+0.308
Neg logit contributions
kin-0.375
ppers-0.370
ccoli-0.352
 afloat-0.342
 rinse-0.340
Token what
Token ID644
Feature activation+3.73e+00

Pos logit contributions
kin+0.009
ppers+0.009
 achieve+0.009
ccoli+0.009
 rinse+0.009
Neg logit contributions
ted-0.008
 Many-0.007
 Everyone-0.007
 People-0.007
 All-0.006
Token we
Token ID356
Feature activation-1.07e+00

Pos logit contributions
kin+0.202
ppers+0.190
ccoli+0.188
 promise+0.184
kins+0.179
Neg logit contributions
ted-0.272
 Many-0.250
 Everyone-0.246
 People-0.239
 All-0.216
Tokenâ
Token ID22940
Feature activation-8.89e-01

Pos logit contributions
ted+0.068
 Everyone+0.064
 Many+0.063
 People+0.062
 All+0.058
Neg logit contributions
kin-0.066
ppers-0.065
ccoli-0.062
 afloat-0.060
 rinse-0.059
Token of
Token ID286
Feature activation-6.31e+00

Pos logit contributions
ted+0.251
 Many+0.195
 People+0.191
 Everyone+0.191
 As+0.184
Neg logit contributions
kin-0.312
ak-0.309
 promise-0.297
kins-0.297
aky-0.295
Token trees
Token ID7150
Feature activation-6.32e+00

Pos logit contributions
ted+0.354
 Everyone+0.272
 As+0.271
heart+0.268
ened+0.267
Neg logit contributions
aks-0.396
eps-0.392
 neat-0.379
kin-0.377
Anna-0.371
Token and
Token ID290
Feature activation-5.48e-01

Pos logit contributions
ted+0.409
 Many+0.382
 Everyone+0.380
 People+0.373
 All+0.332
Neg logit contributions
kin-0.377
ppers-0.374
ccoli-0.351
 afloat-0.345
 rinse-0.339
Token flowers
Token ID12734
Feature activation-5.37e+00

Pos logit contributions
ted+0.028
 Everyone+0.024
 People+0.023
 Many+0.022
heart+0.021
Neg logit contributions
kin-0.036
 afloat-0.035
 rinse-0.035
ppers-0.035
eps-0.035
Token.
Token ID13
Feature activation+3.01e-01

Pos logit contributions
ted+0.366
 Many+0.350
 Everyone+0.344
 People+0.340
 All+0.300
Neg logit contributions
kin-0.298
ppers-0.293
ccoli-0.270
kins-0.269
 afloat-0.268
Token Many
Token ID4650
Feature activation-1.46e+01

Pos logit contributions
ccoli+0.019
kin+0.019
 rinse+0.019
 tim+0.019
Anna+0.018
Neg logit contributions
ted-0.016
heart-0.013
unn-0.013
arted-0.012
ly-0.012
Token humans
Token ID5384
Feature activation-3.21e+00

Pos logit contributions
ted+0.713
 Everyone+0.603
 As+0.565
 People+0.536
 Many+0.532
Neg logit contributions
aks-1.027
Anna-1.000
ak-0.999
aky-0.998
eps-0.992
Token came
Token ID1625
Feature activation-4.71e+00

Pos logit contributions
ted+0.191
 Many+0.178
 Everyone+0.177
 People+0.173
 All+0.152
Neg logit contributions
kin-0.204
ppers-0.200
ccoli-0.190
 promise-0.190
 achieve-0.188
Token to
Token ID284
Feature activation-7.36e-01

Pos logit contributions
ted+0.256
 Many+0.233
 Everyone+0.231
 People+0.225
 All+0.192
Neg logit contributions
ppers-0.321
kin-0.311
 achieve-0.304
 afloat-0.303
edient-0.302
Token play
Token ID711
Feature activation-1.25e+00

Pos logit contributions
ted+0.048
 Everyone+0.043
 Many+0.042
 People+0.042
 All+0.038
Neg logit contributions
kin-0.043
ppers-0.042
 promise-0.041
 achieve-0.041
 rinse-0.040
Token and
Token ID290
Feature activation-5.78e-01

Pos logit contributions
ted+0.084
 Everyone+0.079
 Many+0.079
 People+0.077
 All+0.070
Neg logit contributions
kin-0.073
ppers-0.071
ccoli-0.068
 afloat-0.066
 rinse-0.065
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
ted+0.099
 Everyone+0.093
 Many+0.088
 People+0.086
 All+0.081
Neg logit contributions
ppers-0.154
kin-0.152
ccoli-0.149
illas-0.146
cher-0.142
Token
Token ID198
Feature activation-3.21e-01

Pos logit contributions
kin+0.008
ccoli+0.008
ppers+0.008
 rinse+0.008
 promise+0.007
Neg logit contributions
ted-0.009
 Many-0.008
 Everyone-0.008
 People-0.008
heart-0.007
Token
Token ID198
Feature activation-3.01e-01

Pos logit contributions
ted+0.017
 Everyone+0.017
 Many+0.017
 People+0.016
 All+0.014
Neg logit contributions
kin-0.023
ppers-0.023
ccoli-0.022
 afloat-0.021
kins-0.021
TokenHe
Token ID1544
Feature activation+6.73e+00

Pos logit contributions
 Many+0.020
 Everyone+0.020
ted+0.019
 People+0.019
 All+0.018
Neg logit contributions
ccoli-0.018
kin-0.017
ppers-0.017
 afloat-0.017
illas-0.016
Token t
Token ID256
Feature activation-1.25e+00

Pos logit contributions
kin+0.364
ccoli+0.359
ppers+0.359
 promise+0.348
 rinse+0.347
Neg logit contributions
ted-0.453
 Everyone-0.435
 Many-0.433
 People-0.424
 All-0.381
Tokenipt
Token ID10257
Feature activation-1.46e+01

Pos logit contributions
ted+0.052
 Many+0.049
 People+0.049
 Everyone+0.048
 All+0.041
Neg logit contributions
ppers-0.104
kin-0.102
illas-0.095
ccoli-0.094
 rinse-0.094
Tokenoes
Token ID3028
Feature activation-2.29e+00

Pos logit contributions
ted+1.218
 Everyone+1.118
 Many+1.109
 People+1.088
 All+1.018
Neg logit contributions
kin-0.660
ppers-0.617
ccoli-0.590
 afloat-0.585
kins-0.576
Token to
Token ID284
Feature activation-5.27e-01

Pos logit contributions
ted+0.123
 Everyone+0.113
 Many+0.113
 People+0.110
 All+0.097
Neg logit contributions
kin-0.162
ppers-0.160
ccoli-0.153
 afloat-0.149
 rinse-0.149
Token her
Token ID607
Feature activation-1.87e+00

Pos logit contributions
ted+0.027
 Many+0.024
 Everyone+0.024
 People+0.023
 All+0.021
Neg logit contributions
kin-0.040
ppers-0.039
ccoli-0.037
 afloat-0.037
 rinse-0.036
Token bed
Token ID3996
Feature activation+2.31e+00

Pos logit contributions
ted+0.122
 Everyone+0.113
 Many+0.110
 People+0.108
 All+0.104
Neg logit contributions
ppers-0.110
kin-0.109
wy-0.105
 afloat-0.103
edient-0.103
Token and
Token ID290
Feature activation-6.20e-01

Pos logit contributions
ppers+0.171
kin+0.167
ccoli+0.166
 afloat+0.162
 rinse+0.160
Neg logit contributions
ted-0.117
 Everyone-0.106
 Many-0.103
 People-0.102
 All-0.093
Token at
Token ID379
Feature activation-4.63e+00

Pos logit contributions
ted+0.033
 Everyone+0.030
 Many+0.027
 People+0.027
 All+0.023
Neg logit contributions
kin-0.079
ppers-0.079
ccoli-0.077
 rinse-0.074
 afloat-0.073
Token the
Token ID262
Feature activation+4.37e-01

Pos logit contributions
ted+0.273
 Many+0.252
 Everyone+0.249
 People+0.246
 All+0.216
Neg logit contributions
kin-0.304
ppers-0.296
ccoli-0.285
 rinse-0.283
 afloat-0.283
Token sports
Token ID5701
Feature activation-1.70e+00

Pos logit contributions
ppers+0.030
kin+0.030
ccoli+0.029
 afloat+0.028
 rinse+0.027
Neg logit contributions
ted-0.025
 Everyone-0.023
 Many-0.023
 People-0.022
 All-0.020
Token ground
Token ID2323
Feature activation-1.72e+00

Pos logit contributions
ted+0.105
 Everyone+0.098
 Many+0.096
 People+0.095
 All+0.087
Neg logit contributions
kin-0.107
ppers-0.107
ccoli-0.102
 afloat-0.098
 rinse-0.098
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
ted+0.095
 Everyone+0.086
 Many+0.083
 People+0.083
 All+0.077
Neg logit contributions
kin-0.119
ppers-0.118
ccoli-0.115
 rinse-0.110
 promise-0.109
Token Many
Token ID4650
Feature activation-1.45e+01

Pos logit contributions
kin+0.015
 rinse+0.015
ccoli+0.015
ppers+0.015
 afloat+0.015
Neg logit contributions
ted-0.011
 Many-0.010
 People-0.009
 Everyone-0.009
heart-0.009
Token children
Token ID1751
Feature activation-6.38e+00

Pos logit contributions
ted+0.714
 Many+0.635
 Everyone+0.625
 People+0.606
 As+0.563
Neg logit contributions
kin-1.044
kins-1.009
 achieve-0.990
ppers-0.985
 promise-0.981
Token were
Token ID547
Feature activation-1.60e+00

Pos logit contributions
ted+0.379
 Everyone+0.350
 Many+0.350
 People+0.341
 All+0.307
Neg logit contributions
kin-0.417
ppers-0.412
ccoli-0.392
 afloat-0.381
 rinse-0.379
Token playing
Token ID2712
Feature activation-3.69e+00

Pos logit contributions
ted+0.090
 Everyone+0.084
 Many+0.083
 People+0.081
 All+0.073
Neg logit contributions
kin-0.109
ppers-0.109
ccoli-0.104
 afloat-0.100
 rinse-0.100
Token and
Token ID290
Feature activation-6.37e-01

Pos logit contributions
ted+0.215
 Everyone+0.205
 Many+0.198
 People+0.195
 All+0.182
Neg logit contributions
ppers-0.259
ccoli-0.241
kin-0.240
edient-0.229
 Claus-0.226
Token running
Token ID2491
Feature activation-1.76e+00

Pos logit contributions
ted+0.040
 Many+0.037
 Everyone+0.036
 People+0.036
 All+0.031
Neg logit contributions
kin-0.038
ppers-0.038
 rinse-0.036
ccoli-0.036
 afloat-0.035
Token,
Token ID11
Feature activation+6.33e-01

Pos logit contributions
kin+0.251
ppers+0.248
ccoli+0.237
 afloat+0.231
 rinse+0.231
Neg logit contributions
ted-0.188
 Everyone-0.172
 Many-0.172
 People-0.167
 All-0.148
Token he
Token ID339
Feature activation+3.49e-01

Pos logit contributions
kin+0.046
ppers+0.044
 afloat+0.043
ccoli+0.042
 achieve+0.042
Neg logit contributions
ted-0.033
 Many-0.031
 Everyone-0.029
 People-0.029
 All-0.024
Token began
Token ID2540
Feature activation-3.70e+00

Pos logit contributions
kin+0.022
ppers+0.022
ccoli+0.021
 afloat+0.020
 rinse+0.020
Neg logit contributions
ted-0.021
 Everyone-0.020
 Many-0.020
 People-0.019
 All-0.018
Token to
Token ID284
Feature activation-2.75e-01

Pos logit contributions
ted+0.153
 Everyone+0.136
 Many+0.134
 People+0.129
 All+0.115
Neg logit contributions
kin-0.316
ppers-0.308
ccoli-0.297
 promise-0.289
wy-0.287
Token work
Token ID670
Feature activation-4.02e+00

Pos logit contributions
 Everyone+0.016
 Many+0.016
ted+0.015
 People+0.015
 All+0.015
Neg logit contributions
kin-0.018
ppers-0.017
wy-0.016
kins-0.016
ccoli-0.016
Token on
Token ID319
Feature activation+7.00e+00

Pos logit contributions
ted+0.205
 Everyone+0.190
 People+0.181
 Many+0.174
 All+0.171
Neg logit contributions
ppers-0.295
kin-0.288
ccoli-0.286
illas-0.280
 promise-0.273
Token the
Token ID262
Feature activation+8.85e-01

Pos logit contributions
kin+0.514
ppers+0.507
ccoli+0.485
 promise+0.469
 rinse+0.467
Neg logit contributions
ted-0.366
 Everyone-0.338
 Many-0.335
 People-0.328
 All-0.295
Token roof
Token ID9753
Feature activation+2.62e-01

Pos logit contributions
kin+0.058
ppers+0.057
ccoli+0.055
 afloat+0.052
 rinse+0.052
Neg logit contributions
ted-0.052
 Everyone-0.050
 Many-0.050
 People-0.049
 All-0.044
Token.
Token ID13
Feature activation+8.58e-01

Pos logit contributions
kin+0.019
ppers+0.019
ccoli+0.019
 rinse+0.018
 afloat+0.017
Neg logit contributions
ted-0.013
 Everyone-0.013
 People-0.012
 Many-0.012
 All-0.011
Token He
Token ID679
Feature activation+2.07e+00

Pos logit contributions
ccoli+0.055
kin+0.053
 rinse+0.052
 achieve+0.051
ppers+0.051
Neg logit contributions
ted-0.049
heart-0.040
 Many-0.038
ly-0.037
 news-0.035
Token hammered
Token ID41223
Feature activation+4.17e+00

Pos logit contributions
kin+0.125
ppers+0.123
ccoli+0.117
 afloat+0.113
 rinse+0.113
Neg logit contributions
ted-0.134
 Everyone-0.125
 Many-0.125
 People-0.122
 All-0.110
Tokenina
Token ID1437
Feature activation+4.32e-02

Pos logit contributions
kin+0.008
ppers+0.008
ccoli+0.008
 promise+0.008
 rinse+0.008
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.005
 People-0.005
 All-0.004
Token.
Token ID13
Feature activation+9.02e-01

Pos logit contributions
kin+0.003
ppers+0.003
ccoli+0.003
 afloat+0.002
 rinse+0.002
Neg logit contributions
ted-0.003
 Everyone-0.003
 Many-0.003
 People-0.002
 All-0.002
Token M
Token ID337
Feature activation+8.58e-02

Pos logit contributions
 rinse+0.060
kin+0.059
ccoli+0.057
ppers+0.055
 achieve+0.053
Neg logit contributions
ted-0.051
 Many-0.043
 Everyone-0.042
 People-0.041
ly-0.038
Tokenina
Token ID1437
Feature activation+2.59e-02

Pos logit contributions
kin+0.004
ppers+0.004
ccoli+0.004
 promise+0.004
kins+0.004
Neg logit contributions
ted-0.006
 Many-0.006
 Everyone-0.006
 People-0.006
 All-0.005
Token lived
Token ID5615
Feature activation-1.65e+00

Pos logit contributions
kin+0.002
ppers+0.002
ccoli+0.002
 promise+0.002
kins+0.002
Neg logit contributions
ted-0.002
 Many-0.001
 Everyone-0.001
 People-0.001
 All-0.001
Token on
Token ID319
Feature activation+7.24e+00

Pos logit contributions
ted+0.091
 Everyone+0.080
 Many+0.080
 People+0.075
 All+0.074
Neg logit contributions
ppers-0.115
ccoli-0.114
kin-0.109
 promise-0.105
 achieve-0.105
Token a
Token ID257
Feature activation-1.67e+00

Pos logit contributions
kin+0.374
 rinse+0.335
Anna+0.332
 globe+0.328
ccoli+0.323
Neg logit contributions
ted-0.501
heart-0.453
 Everyone-0.447
 People-0.444
 Many-0.438
Token beautiful
Token ID4950
Feature activation+4.90e-01

Pos logit contributions
ted+0.090
 Many+0.088
 Everyone+0.087
 People+0.083
 All+0.077
Neg logit contributions
ppers-0.115
kin-0.113
ccoli-0.107
cher-0.107
 afloat-0.105
Token piece
Token ID3704
Feature activation+2.47e+00

Pos logit contributions
kin+0.029
ppers+0.027
ccoli+0.027
 promise+0.026
 rinse+0.026
Neg logit contributions
ted-0.033
 Many-0.030
 Everyone-0.030
 People-0.030
 All-0.026
Token of
Token ID286
Feature activation-5.75e+00

Pos logit contributions
kin+0.225
ppers+0.211
 promise+0.203
ccoli+0.202
kins+0.201
Neg logit contributions
ted-0.090
 Many-0.079
 Everyone-0.077
 People-0.076
 All-0.056
Token land
Token ID1956
Feature activation-1.55e+00

Pos logit contributions
ted+0.416
 Everyone+0.388
 Many+0.388
 People+0.387
iked+0.357
Neg logit contributions
kin-0.298
 rinse-0.273
ccoli-0.267
 promise-0.255
ppers-0.255
Token down
Token ID866
Feature activation+3.08e+00

Pos logit contributions
ted+0.514
 Many+0.486
 Everyone+0.486
 People+0.476
 All+0.438
Neg logit contributions
kin-0.345
ppers-0.339
ccoli-0.318
 afloat-0.307
 rinse-0.306
Token and
Token ID290
Feature activation-1.52e-01

Pos logit contributions
kin+0.197
ppers+0.194
ccoli+0.184
 afloat+0.181
 rinse+0.180
Neg logit contributions
ted-0.185
 Everyone-0.173
 Many-0.173
 People-0.168
 All-0.150
Token pe
Token ID613
Feature activation-2.34e+00

Pos logit contributions
ted+0.010
 Many+0.009
 Everyone+0.009
 People+0.009
 All+0.008
Neg logit contributions
ppers-0.009
kin-0.009
ccoli-0.008
 rinse-0.008
 afloat-0.008
Tokencks
Token ID4657
Feature activation+3.18e+00

Pos logit contributions
ted+0.214
 Everyone+0.194
 Many+0.194
 People+0.191
 All+0.179
Neg logit contributions
kin-0.085
ppers-0.078
ccoli-0.077
 rinse-0.070
 afloat-0.070
Token Tom
Token ID4186
Feature activation+6.39e-02

Pos logit contributions
kin+0.235
ppers+0.231
ccoli+0.224
 rinse+0.222
 promise+0.221
Neg logit contributions
ted-0.152
 Many-0.140
 Everyone-0.137
 People-0.135
 All-0.117
Token on
Token ID319
Feature activation+6.96e+00

Pos logit contributions
kin+0.004
ppers+0.004
kins+0.004
ccoli+0.004
 afloat+0.004
Neg logit contributions
ted-0.004
 Many-0.004
 Everyone-0.003
 People-0.003
 All-0.003
Token the
Token ID262
Feature activation+8.89e-01

Pos logit contributions
kin+0.471
ppers+0.457
ccoli+0.441
 rinse+0.434
 afloat+0.431
Neg logit contributions
ted-0.391
 Many-0.360
 Everyone-0.359
 People-0.353
 All-0.310
Token head
Token ID1182
Feature activation+5.33e+00

Pos logit contributions
ppers+0.072
kin+0.072
ccoli+0.072
edient+0.068
kins+0.068
Neg logit contributions
ted-0.039
 Many-0.034
 Everyone-0.034
 People-0.032
 winds-0.032
Token.
Token ID13
Feature activation+5.27e-01

Pos logit contributions
kin+0.376
ppers+0.375
ccoli+0.373
 afloat+0.365
 achieve+0.357
Neg logit contributions
ted-0.291
 Everyone-0.262
 Many-0.254
 People-0.249
 All-0.232
Token Tom
Token ID4186
Feature activation+3.84e-02

Pos logit contributions
kin+0.041
ppers+0.040
ccoli+0.039
 rinse+0.038
 achieve+0.038
Neg logit contributions
ted-0.024
 Many-0.021
 Everyone-0.021
 People-0.021
 All-0.017
Token cries
Token ID24691
Feature activation-6.96e+00

Pos logit contributions
kin+0.003
ppers+0.002
ccoli+0.002
 afloat+0.002
kins+0.002
Neg logit contributions
ted-0.002
 Many-0.002
 Everyone-0.002
 People-0.002
 All-0.002
Token eyes
Token ID2951
Feature activation-4.54e+00

Pos logit contributions
kin+0.213
ppers+0.203
ccoli+0.198
 promise+0.194
 afloat+0.192
Neg logit contributions
ted-0.204
 Many-0.187
 Everyone-0.186
 People-0.183
 All-0.164
Token,
Token ID11
Feature activation+4.40e-01

Pos logit contributions
ted+0.260
 Everyone+0.240
 Many+0.239
 People+0.233
 All+0.209
Neg logit contributions
kin-0.308
ppers-0.304
ccoli-0.290
 afloat-0.282
 rinse-0.281
Token put
Token ID1234
Feature activation-6.26e-02

Pos logit contributions
kin+0.028
ppers+0.028
ccoli+0.027
 afloat+0.026
 rinse+0.026
Neg logit contributions
ted-0.026
 Everyone-0.024
 Many-0.024
 People-0.023
 All-0.021
Token his
Token ID465
Feature activation+3.27e+00

Pos logit contributions
ted+0.004
 Many+0.003
 Everyone+0.003
 People+0.003
 All+0.003
Neg logit contributions
kin-0.004
ppers-0.004
ccoli-0.004
 afloat-0.004
 rinse-0.004
Token hands
Token ID2832
Feature activation-8.85e-01

Pos logit contributions
ppers+0.188
kin+0.185
ccoli+0.175
 rinse+0.171
 achieve+0.170
Neg logit contributions
ted-0.220
 Everyone-0.205
 Many-0.205
 People-0.201
 All-0.183
Token on
Token ID319
Feature activation+6.95e+00

Pos logit contributions
ted+0.057
 Everyone+0.046
 Many+0.045
 All+0.044
ly+0.043
Neg logit contributions
kin-0.056
ccoli-0.054
 achieve-0.054
ppers-0.054
 promise-0.054
Token his
Token ID465
Feature activation+3.25e+00

Pos logit contributions
kin+0.482
ppers+0.475
ccoli+0.456
 afloat+0.444
 rinse+0.442
Neg logit contributions
ted-0.387
 Many-0.356
 Everyone-0.355
 People-0.346
 All-0.306
Token hips
Token ID21497
Feature activation+7.34e+00

Pos logit contributions
ppers+0.168
kin+0.159
ccoli+0.154
 rinse+0.150
edient+0.149
Neg logit contributions
ted-0.232
 Everyone-0.225
 Many-0.222
 People-0.219
 All-0.203
Token,
Token ID11
Feature activation+3.97e-01

Pos logit contributions
kin+0.490
ppers+0.486
ccoli+0.463
 afloat+0.451
 rinse+0.447
Neg logit contributions
ted-0.422
 Many-0.391
 Everyone-0.390
 People-0.381
 All-0.337
Token and
Token ID290
Feature activation-2.13e-01

Pos logit contributions
kin+0.030
ppers+0.029
ccoli+0.028
edient+0.028
 afloat+0.027
Neg logit contributions
ted-0.021
 Many-0.018
 Everyone-0.018
 People-0.017
 All-0.016
Token roared
Token ID45615
Feature activation-3.22e-01

Pos logit contributions
ted+0.012
 Many+0.011
 Everyone+0.011
 People+0.011
 All+0.009
Neg logit contributions
kin-0.014
ppers-0.014
ccoli-0.014
 rinse-0.013
 afloat-0.013
Token tall
Token ID7331
Feature activation-1.13e+00

Pos logit contributions
ted+0.084
 Everyone+0.083
 Many+0.083
 People+0.078
 All+0.074
Neg logit contributions
ppers-0.141
wy-0.135
ccoli-0.132
kin-0.130
illas-0.128
Token man
Token ID582
Feature activation+7.36e-01

Pos logit contributions
ted+0.069
 Everyone+0.064
 Many+0.064
 People+0.062
 All+0.057
Neg logit contributions
ppers-0.072
kin-0.071
ccoli-0.069
 rinse-0.066
 afloat-0.066
Token.
Token ID13
Feature activation+9.30e-01

Pos logit contributions
kin+0.050
ppers+0.047
ccoli+0.047
 rinse+0.045
wy+0.044
Neg logit contributions
ted-0.044
 Everyone-0.039
 Many-0.039
 People-0.038
 All-0.035
Token
Token ID198
Feature activation+1.70e-01

Pos logit contributions
kin+0.058
ccoli+0.057
ppers+0.055
 afloat+0.055
 achieve+0.054
Neg logit contributions
ted-0.053
 Many-0.045
 People-0.043
 Everyone-0.042
heart-0.041
Token
Token ID198
Feature activation+1.83e-01

Pos logit contributions
ppers+0.013
kin+0.013
ccoli+0.012
 afloat+0.012
 promise+0.012
Neg logit contributions
 Everyone-0.009
ted-0.008
 Many-0.008
 People-0.008
 All-0.007
TokenHe
Token ID1544
Feature activation+7.22e+00

Pos logit contributions
kin+0.010
ppers+0.010
ccoli+0.010
 afloat+0.009
 promise+0.009
Neg logit contributions
 Everyone-0.013
 Many-0.013
ted-0.013
 People-0.012
 All-0.012
Token replied
Token ID8712
Feature activation-3.25e+00

Pos logit contributions
kin+0.454
ppers+0.454
ccoli+0.434
 Claus+0.421
 promise+0.421
Neg logit contributions
ted-0.416
 Many-0.398
 Everyone-0.390
 People-0.390
 All-0.328
Token,
Token ID11
Feature activation+7.27e-01

Pos logit contributions
ted+0.188
 Many+0.173
 Everyone+0.173
 People+0.169
 All+0.150
Neg logit contributions
kin-0.218
ppers-0.215
ccoli-0.206
 afloat-0.200
 rinse-0.199
Token "
Token ID366
Feature activation+8.29e-01

Pos logit contributions
kin+0.057
ppers+0.056
 promise+0.053
ccoli+0.053
 afloat+0.053
Neg logit contributions
ted-0.035
 Everyone-0.032
 Many-0.031
 People-0.031
 All-0.028
TokenYes
Token ID5297
Feature activation-2.79e+00

Pos logit contributions
ppers+0.055
kin+0.055
ccoli+0.052
 rinse+0.051
 afloat+0.051
Neg logit contributions
ted-0.047
 Many-0.043
 Everyone-0.043
 People-0.043
 All-0.038
Token,
Token ID11
Feature activation+6.99e-01

Pos logit contributions
ted+0.156
 Everyone+0.145
 Many+0.144
 People+0.140
 All+0.126
Neg logit contributions
kin-0.192
ppers-0.191
ccoli-0.182
 afloat-0.178
 rinse-0.176
Token They
Token ID1119
Feature activation+2.16e+00

Pos logit contributions
kin+0.028
ccoli+0.028
ppers+0.028
 rinse+0.027
 afloat+0.026
Neg logit contributions
ted-0.033
 Many-0.029
 Everyone-0.029
 People-0.028
heart-0.027
Token were
Token ID547
Feature activation-1.21e+00

Pos logit contributions
kin+0.148
ppers+0.146
ccoli+0.141
 afloat+0.136
 rinse+0.135
Neg logit contributions
ted-0.123
 Everyone-0.112
 Many-0.112
 People-0.109
 All-0.099
Token happy
Token ID3772
Feature activation+1.83e+00

Pos logit contributions
ted+0.074
 Everyone+0.072
 Many+0.070
 People+0.069
 All+0.063
Neg logit contributions
kin-0.077
ppers-0.076
ccoli-0.072
 achieve-0.068
kins-0.068
Token and
Token ID290
Feature activation-3.71e-01

Pos logit contributions
ppers+0.102
ccoli+0.097
kin+0.096
illas+0.094
 promise+0.093
Neg logit contributions
 Many-0.116
ted-0.115
 Everyone-0.114
 People-0.114
 Everybody-0.098
Token excited
Token ID6568
Feature activation-6.24e+00

Pos logit contributions
ted+0.024
 Everyone+0.022
 Many+0.022
 People+0.022
 All+0.020
Neg logit contributions
kin-0.023
ppers-0.023
ccoli-0.022
 afloat-0.021
 rinse-0.021
Token.
Token ID13
Feature activation+5.11e-01

Pos logit contributions
 Many+0.441
ted+0.438
 People+0.423
 Everyone+0.419
pill+0.392
Neg logit contributions
ppers-0.299
kin-0.286
 promise-0.281
ccoli-0.274
 Claus-0.265
Token
Token ID198
Feature activation-8.50e-02

Pos logit contributions
kin+0.026
ccoli+0.025
ppers+0.025
 rinse+0.024
 afloat+0.024
Neg logit contributions
ted-0.036
heart-0.031
 Many-0.031
ly-0.030
 Everyone-0.030
Token
Token ID198
Feature activation-7.37e-02

Pos logit contributions
ted+0.005
 Everyone+0.005
 Many+0.005
 People+0.004
 All+0.004
Neg logit contributions
kin-0.006
ppers-0.006
ccoli-0.005
 afloat-0.005
 rinse-0.005
TokenThey
Token ID2990
Feature activation+8.09e+00

Pos logit contributions
 Many+0.005
 Everyone+0.005
ted+0.005
 People+0.005
 All+0.005
Neg logit contributions
kin-0.004
ccoli-0.004
ppers-0.004
 afloat-0.004
 promise-0.004
Token reached
Token ID4251
Feature activation-4.86e+00

Pos logit contributions
kin+0.538
ppers+0.532
ccoli+0.508
 afloat+0.494
 rinse+0.490
Neg logit contributions
ted-0.474
 Everyone-0.436
 Many-0.436
 People-0.425
 All-0.382
Token the
Token ID262
Feature activation+8.80e-01

Pos logit contributions
ted+0.279
 Many+0.257
 Everyone+0.257
 People+0.251
 All+0.224
Neg logit contributions
kin-0.331
ppers-0.326
ccoli-0.312
 afloat-0.304
 rinse-0.302
Token
Token ID198
Feature activation+1.29e-02

Pos logit contributions
ccoli+0.055
 achieve+0.051
Jackson+0.049
 ramp+0.049
 Claus+0.049
Neg logit contributions
ted-0.047
heart-0.038
ly-0.036
unn-0.036
andel-0.035
Token
Token ID198
Feature activation+2.50e-02

Pos logit contributions
kin+0.001
ppers+0.001
ccoli+0.001
 achieve+0.001
 afloat+0.001
Neg logit contributions
ted-0.001
 Everyone-0.001
 Many-0.001
 People-0.001
 All-0.001
TokenSo
Token ID2396
Feature activation+1.73e+00

Pos logit contributions
kin+0.002
ppers+0.002
ccoli+0.002
 afloat+0.001
kins+0.001
Neg logit contributions
ted-0.002
 Many-0.002
 Everyone-0.001
 People-0.001
 All-0.001
Token,
Token ID11
Feature activation+5.91e-01

Pos logit contributions
kin+0.104
ppers+0.097
ccoli+0.095
 promise+0.094
 afloat+0.093
Neg logit contributions
ted-0.111
 Many-0.105
 Everyone-0.102
 People-0.101
pill-0.091
Token the
Token ID262
Feature activation+9.94e-01

Pos logit contributions
kin+0.033
 achieve+0.030
 afloat+0.030
ppers+0.030
ccoli+0.030
Neg logit contributions
ted-0.037
 Many-0.035
 People-0.034
 Everyone-0.034
unn-0.033
Token mouse
Token ID10211
Feature activation+1.33e+00

Pos logit contributions
kin+0.064
ppers+0.063
ccoli+0.060
 promise+0.058
 afloat+0.058
Neg logit contributions
ted-0.061
 Everyone-0.056
 Many-0.056
 People-0.055
 All-0.049
Token leaned
Token ID23831
Feature activation-6.44e+00

Pos logit contributions
kin+0.084
ppers+0.083
ccoli+0.079
 promise+0.077
 afloat+0.076
Neg logit contributions
ted-0.082
 Everyone-0.077
 Many-0.077
 People-0.075
 All-0.067
Token against
Token ID1028
Feature activation+4.58e+00

Pos logit contributions
ted+0.439
 Everyone+0.357
 Many+0.352
 People+0.337
 All+0.333
Neg logit contributions
ppers-0.398
kin-0.390
 promise-0.382
ccoli-0.380
 achieve-0.376
Token the
Token ID262
Feature activation+9.68e-01

Pos logit contributions
kin+0.281
ppers+0.267
 afloat+0.265
ccoli+0.258
 rinse+0.256
Neg logit contributions
ted-0.272
 Many-0.260
 Everyone-0.256
 People-0.252
pill-0.224
Token chim
Token ID18205
Feature activation-4.13e-01

Pos logit contributions
kin+0.068
ppers+0.066
ccoli+0.064
 rinse+0.062
 afloat+0.062
Neg logit contributions
ted-0.054
 Many-0.050
 Everyone-0.050
 People-0.049
 All-0.044
Tokenney
Token ID1681
Feature activation+7.34e+00

Pos logit contributions
ted+0.041
 Everyone+0.040
 Many+0.039
 People+0.039
 All+0.037
Neg logit contributions
kin-0.011
ppers-0.010
ccoli-0.009
 rinse-0.009
 afloat-0.008
Token doll
Token ID3654
Feature activation+5.30e+00

Pos logit contributions
ppers+0.103
kin+0.102
ccoli+0.097
 rinse+0.095
 achieve+0.095
Neg logit contributions
ted-0.091
 Many-0.085
 Everyone-0.085
 People-0.083
 All-0.074
Token and
Token ID290
Feature activation-2.23e-01

Pos logit contributions
kin+0.377
 afloat+0.369
 rinse+0.365
ppers+0.364
 achieve+0.361
Neg logit contributions
ted-0.274
 Many-0.243
 Everyone-0.241
 People-0.233
 All-0.213
Token I
Token ID314
Feature activation-1.74e+00

Pos logit contributions
ted+0.015
 Many+0.014
 Everyone+0.014
 People+0.013
 All+0.012
Neg logit contributions
kin-0.013
ppers-0.013
ccoli-0.012
 afloat-0.012
edient-0.012
Token can
Token ID460
Feature activation+4.78e+00

Pos logit contributions
ted+0.152
 Many+0.148
 Everyone+0.148
 People+0.144
 All+0.137
Neg logit contributions
kin-0.064
ppers-0.061
kins-0.056
 afloat-0.056
ccoli-0.054
Token play
Token ID711
Feature activation-7.26e-01

Pos logit contributions
kin+0.263
ppers+0.252
ccoli+0.246
 afloat+0.239
kins+0.234
Neg logit contributions
ted-0.334
 Everyone-0.328
 Many-0.326
 People-0.315
 All-0.296
Token with
Token ID351
Feature activation+7.86e-01

Pos logit contributions
ted+0.042
 Many+0.041
 Everyone+0.041
 People+0.039
 All+0.036
Neg logit contributions
ppers-0.049
kin-0.047
ccoli-0.046
 afloat-0.046
edient-0.044
Token your
Token ID534
Feature activation+4.66e+00

Pos logit contributions
kin+0.051
ppers+0.050
ccoli+0.048
 rinse+0.047
kins+0.047
Neg logit contributions
ted-0.045
 Many-0.042
 Everyone-0.041
 People-0.041
 All-0.037
Token ball
Token ID2613
Feature activation+1.13e+00

Pos logit contributions
ppers+0.296
kin+0.287
ccoli+0.275
 achieve+0.272
 afloat+0.271
Neg logit contributions
ted-0.280
 Many-0.265
 Everyone-0.263
 People-0.259
 All-0.229
Token."
Token ID526
Feature activation-6.87e-01

Pos logit contributions
kin+0.073
ppers+0.073
ccoli+0.069
 afloat+0.068
 rinse+0.068
Neg logit contributions
ted-0.067
 Many-0.061
 Everyone-0.061
 People-0.059
 All-0.054
Token
Token ID198
Feature activation-4.14e-02

Pos logit contributions
ted+0.042
 Many+0.038
 Everyone+0.038
 People+0.037
 All+0.033
Neg logit contributions
kin-0.044
ppers-0.043
ccoli-0.042
 rinse-0.041
 promise-0.040
Token
Token ID198
Feature activation-2.01e-03

Pos logit contributions
ted+0.002
 Everyone+0.002
 Many+0.002
 People+0.002
 All+0.002
Neg logit contributions
kin-0.003
ppers-0.003
ccoli-0.003
 afloat-0.003
 rinse-0.003
Token home
Token ID1363
Feature activation-1.73e+00

Pos logit contributions
kin+0.097
ppers+0.094
 promise+0.090
ccoli+0.090
 afloat+0.089
Neg logit contributions
ted-0.096
 Many-0.090
 Everyone-0.090
 People-0.088
 All-0.079
Token!"
Token ID2474
Feature activation-6.63e-01

Pos logit contributions
ted+0.089
 Everyone+0.086
 Many+0.083
 People+0.083
 All+0.074
Neg logit contributions
kin-0.117
ppers-0.115
ccoli-0.113
 rinse-0.111
 afloat-0.110
Token
Token ID220
Feature activation+3.12e+00

Pos logit contributions
ted+0.037
 Many+0.035
 People+0.033
 Everyone+0.032
heart+0.030
Neg logit contributions
kin-0.042
ccoli-0.041
ppers-0.041
 rinse-0.040
 afloat-0.040
Token
Token ID198
Feature activation+2.05e-01

Pos logit contributions
kin+0.187
ppers+0.185
ccoli+0.178
 afloat+0.174
 rinse+0.171
Neg logit contributions
ted-0.200
 Many-0.182
 Everyone-0.175
 People-0.175
 All-0.156
Token
Token ID198
Feature activation+2.07e-01

Pos logit contributions
kin+0.014
ppers+0.014
ccoli+0.013
 afloat+0.013
 rinse+0.012
Neg logit contributions
ted-0.012
 Everyone-0.011
 Many-0.011
 People-0.011
 All-0.010
TokenFrom
Token ID4863
Feature activation-7.85e-01

Pos logit contributions
kin+0.012
ppers+0.012
ccoli+0.012
illas+0.011
 promise+0.011
Neg logit contributions
ted-0.013
 Everyone-0.013
 Many-0.013
 People-0.013
 All-0.012
Token that
Token ID326
Feature activation-1.16e+00

Pos logit contributions
ted+0.030
 Many+0.030
 People+0.028
 Everyone+0.028
 All+0.023
Neg logit contributions
kin-0.066
ppers-0.066
 afloat-0.063
ccoli-0.063
 promise-0.062
Token day
Token ID1110
Feature activation-2.79e+00

Pos logit contributions
ted+0.035
 Everyone+0.032
 Many+0.031
 People+0.030
 All+0.027
Neg logit contributions
ppers-0.112
kin-0.109
 rinse-0.104
ccoli-0.103
cher-0.103
Token on
Token ID319
Feature activation+7.02e+00

Pos logit contributions
ted+0.202
 Everyone+0.187
 All+0.173
 Many+0.172
 People+0.172
Neg logit contributions
ppers-0.158
ccoli-0.157
kin-0.146
 achieve-0.144
illas-0.144
Token,
Token ID11
Feature activation+7.95e-01

Pos logit contributions
ppers+0.574
kin+0.542
cher+0.535
ccoli+0.532
eps+0.530
Neg logit contributions
ted-0.326
 Everyone-0.307
 People-0.285
 Many-0.278
 All-0.265
Token Benny
Token ID44275
Feature activation+1.21e+00

Pos logit contributions
kin+0.063
ppers+0.063
ccoli+0.060
 afloat+0.059
 rinse+0.059
Neg logit contributions
ted-0.036
 Many-0.033
 Everyone-0.033
 People-0.032
 All-0.027
Token big
Token ID1263
Feature activation+1.05e+00

Pos logit contributions
ted+0.107
 Many+0.102
 Everyone+0.102
 People+0.098
 All+0.091
Neg logit contributions
ppers-0.148
kin-0.141
ccoli-0.138
wy-0.136
edient-0.136
Token fan
Token ID4336
Feature activation+4.44e+00

Pos logit contributions
ppers+0.067
kin+0.066
ccoli+0.063
edient+0.062
 rinse+0.061
Neg logit contributions
ted-0.063
 Many-0.059
 Everyone-0.058
 People-0.056
 All-0.052
Token near
Token ID1474
Feature activation-1.18e+00

Pos logit contributions
kin+0.261
ppers+0.257
ccoli+0.243
 afloat+0.236
 rinse+0.234
Neg logit contributions
ted-0.294
 Many-0.275
 Everyone-0.275
 People-0.269
 All-0.244
Token the
Token ID262
Feature activation+6.59e-01

Pos logit contributions
ted+0.070
 Many+0.065
 Everyone+0.064
 People+0.063
 All+0.056
Neg logit contributions
kin-0.077
ppers-0.075
ccoli-0.071
 afloat-0.071
 rinse-0.071
Token pond
Token ID16723
Feature activation+2.49e+00

Pos logit contributions
kin+0.048
ppers+0.048
ccoli+0.047
kins+0.044
edient+0.044
Neg logit contributions
ted-0.035
 Everyone-0.033
 Many-0.032
 People-0.032
 All-0.029
Token.
Token ID13
Feature activation+2.97e-01

Pos logit contributions
kin+0.162
ppers+0.161
ccoli+0.156
 rinse+0.149
 afloat+0.148
Neg logit contributions
ted-0.149
 Everyone-0.139
 Many-0.137
 People-0.135
 All-0.123
Token The
Token ID383
Feature activation-2.63e+00

Pos logit contributions
kin+0.021
ccoli+0.020
 rinse+0.020
ppers+0.020
 afloat+0.019
Neg logit contributions
ted-0.016
 Many-0.013
 Everyone-0.013
 People-0.013
heart-0.012
Token fan
Token ID4336
Feature activation+4.36e+00

Pos logit contributions
ted+0.211
 Everyone+0.199
 Many+0.199
 People+0.196
 All+0.181
Neg logit contributions
kin-0.119
ppers-0.117
ccoli-0.109
 afloat-0.104
 rinse-0.103
Token is
Token ID318
Feature activation+9.30e-01

Pos logit contributions
kin+0.214
ppers+0.211
ccoli+0.198
 promise+0.191
 rinse+0.189
Neg logit contributions
ted-0.327
 Many-0.310
 Everyone-0.308
 People-0.302
 All-0.277
Token blowing
Token ID19280
Feature activation+1.46e+00

Pos logit contributions
kin+0.059
ppers+0.059
ccoli+0.057
illas+0.055
 achieve+0.055
Neg logit contributions
ted-0.054
 Many-0.051
 Everyone-0.050
 People-0.049
 All-0.046
Token air
Token ID1633
Feature activation-2.82e+00

Pos logit contributions
ppers+0.093
kin+0.091
ccoli+0.088
 achieve+0.084
 promise+0.084
Neg logit contributions
ted-0.092
 Everyone-0.085
 Many-0.083
 People-0.082
 All-0.075
Token be
Token ID307
Feature activation+3.02e+00

Pos logit contributions
ted+0.333
 Everyone+0.317
 Many+0.316
 People+0.309
 All+0.290
Neg logit contributions
kin-0.207
ppers-0.200
ccoli-0.189
kins-0.179
 afloat-0.177
Token a
Token ID257
Feature activation-1.81e+00

Pos logit contributions
kin+0.239
ppers+0.231
kins+0.221
ccoli+0.219
 globe+0.214
Neg logit contributions
ted-0.142
 Many-0.140
 Everyone-0.139
 People-0.130
 All-0.125
Token scared
Token ID12008
Feature activation-2.58e+00

Pos logit contributions
 Many+0.092
 Everyone+0.090
ted+0.089
 People+0.086
 All+0.082
Neg logit contributions
ppers-0.133
kin-0.123
illas-0.122
kins-0.121
ccoli-0.118
Tokeny
Token ID88
Feature activation+4.65e+00

Pos logit contributions
ted+0.135
 Many+0.123
 Everyone+0.123
 People+0.119
 All+0.105
Neg logit contributions
kin-0.189
ppers-0.186
ccoli-0.178
 afloat-0.174
 promise-0.174
Token-
Token ID12
Feature activation-7.64e+00

Pos logit contributions
kin+0.300
 promise+0.284
ppers+0.278
edient+0.265
ccoli+0.264
Neg logit contributions
ted-0.291
 Many-0.266
 Everyone-0.264
 People-0.260
pill-0.230
Tokencat
Token ID9246
Feature activation-1.48e+00

Pos logit contributions
ted+0.423
 Everyone+0.390
 Many+0.388
 People+0.381
 All+0.339
Neg logit contributions
kin-0.525
ppers-0.521
ccoli-0.495
edient-0.487
 promise-0.481
Token.
Token ID13
Feature activation+6.95e-01

Pos logit contributions
ted+0.088
 Everyone+0.079
 Many+0.078
 People+0.077
 All+0.070
Neg logit contributions
kin-0.099
ppers-0.098
 rinse-0.093
ccoli-0.093
 afloat-0.093
Token It
Token ID632
Feature activation-3.46e+00

Pos logit contributions
ccoli+0.048
kin+0.047
ppers+0.047
 rinse+0.045
 promise+0.045
Neg logit contributions
ted-0.038
 Many-0.032
 Everyone-0.031
 People-0.030
heart-0.027
Token will
Token ID481
Feature activation+9.93e+00

Pos logit contributions
ted+0.266
 Many+0.251
 Everyone+0.250
 People+0.245
 All+0.225
Neg logit contributions
kin-0.165
ppers-0.163
ccoli-0.154
 promise-0.147
 afloat-0.147
Token be
Token ID307
Feature activation+2.95e+00

Pos logit contributions
kin+0.573
ppers+0.560
ccoli+0.542
kins+0.516
illas+0.501
Neg logit contributions
ted-0.659
 Everyone-0.644
 Many-0.635
 People-0.629
 All-0.585
Token fun
Token ID1257
Feature activation-1.94e+00

Pos logit contributions
kin+0.172
ppers+0.165
ccoli+0.155
kins+0.150
 achieve+0.149
Neg logit contributions
ted-0.201
 Many-0.186
 Everyone-0.186
 People-0.183
 All-0.167
Token Then
Token ID3244
Feature activation-1.29e+00

Pos logit contributions
ccoli+0.037
kin+0.034
edient+0.033
 Claus+0.032
ppers+0.032
Neg logit contributions
ted-0.032
heart-0.027
unn-0.024
ly-0.024
 Many-0.024
Token,
Token ID11
Feature activation+2.29e-01

Pos logit contributions
ted+0.077
 Many+0.072
 Everyone+0.071
 People+0.070
 All+0.062
Neg logit contributions
kin-0.084
ppers-0.082
ccoli-0.078
 afloat-0.077
 promise-0.076
Token they
Token ID484
Feature activation+1.75e+00

Pos logit contributions
kin+0.015
ppers+0.015
 afloat+0.014
ccoli+0.014
kins+0.014
Neg logit contributions
ted-0.014
 Many-0.013
 Everyone-0.012
 People-0.012
 All-0.011
Token heard
Token ID2982
Feature activation+1.26e+00

Pos logit contributions
kin+0.120
ppers+0.116
ccoli+0.114
 afloat+0.109
edient+0.109
Neg logit contributions
ted-0.102
 Everyone-0.090
 Many-0.090
 People-0.086
 All-0.081
Token a
Token ID257
Feature activation-2.26e+00

Pos logit contributions
kin+0.064
 rinse+0.061
eps+0.061
ccoli+0.061
Their+0.060
Neg logit contributions
ted-0.081
unn-0.076
pill-0.072
 Many-0.069
umm-0.068
Token sound
Token ID2128
Feature activation-1.94e+00

Pos logit contributions
 Many+0.109
 Everyone+0.108
ted+0.102
 People+0.102
 All+0.092
Neg logit contributions
kin-0.175
ppers-0.174
wy-0.169
ccoli-0.169
illas-0.161
Token coming
Token ID2406
Feature activation+4.10e-01

Pos logit contributions
ted+0.123
 Many+0.116
 Everyone+0.115
 People+0.113
 All+0.100
Neg logit contributions
kin-0.119
ppers-0.116
ccoli-0.110
 afloat-0.108
 rinse-0.108
Token from
Token ID422
Feature activation+7.38e-01

Pos logit contributions
kin+0.027
ppers+0.027
ccoli+0.026
 rinse+0.025
 afloat+0.025
Neg logit contributions
ted-0.024
 Everyone-0.022
 Many-0.022
 People-0.022
 All-0.020
Token the
Token ID262
Feature activation+4.40e-01

Pos logit contributions
kin+0.048
ppers+0.046
 afloat+0.046
ccoli+0.044
 rinse+0.044
Neg logit contributions
ted-0.043
 Many-0.041
 Everyone-0.041
 People-0.040
 All-0.036
Token other
Token ID584
Feature activation-7.40e-01

Pos logit contributions
kin+0.032
ppers+0.031
ccoli+0.030
kins+0.029
edient+0.029
Neg logit contributions
ted-0.024
 Many-0.023
 Everyone-0.022
 People-0.022
 All-0.020
Token side
Token ID1735
Feature activation-1.24e+00

Pos logit contributions
ted+0.053
 Many+0.049
 Everyone+0.049
 People+0.048
 All+0.043
Neg logit contributions
kin-0.040
ppers-0.040
 promise-0.037
ccoli-0.037
 afloat-0.037
Token the
Token ID262
Feature activation+6.60e-01

Pos logit contributions
ted+0.399
 Everyone+0.368
 Many+0.363
 People+0.357
 All+0.313
Neg logit contributions
ppers-0.635
kin-0.629
ccoli-0.603
 afloat-0.588
 promise-0.586
Token sun
Token ID4252
Feature activation-4.20e+00

Pos logit contributions
kin+0.041
 promise+0.041
 rinse+0.039
 afloat+0.038
ppers+0.037
Neg logit contributions
ted-0.040
 Many-0.039
 Everyone-0.037
 People-0.036
pill-0.034
Token shone
Token ID44193
Feature activation-2.47e-02

Pos logit contributions
ted+0.213
 Many+0.201
 Everyone+0.198
 People+0.192
 All+0.167
Neg logit contributions
kin-0.311
ppers-0.306
ccoli-0.292
 afloat-0.287
 rinse-0.286
Token,
Token ID11
Feature activation+2.07e-01

Pos logit contributions
ted+0.002
 Everyone+0.001
 Many+0.001
 People+0.001
 All+0.001
Neg logit contributions
kin-0.002
ppers-0.002
ccoli-0.001
 achieve-0.001
illas-0.001
Token things
Token ID1243
Feature activation-6.10e+00

Pos logit contributions
kin+0.017
ppers+0.016
ccoli+0.016
 afloat+0.015
 rinse+0.015
Neg logit contributions
ted-0.009
 Many-0.008
 Everyone-0.008
 People-0.008
 All-0.007
Token began
Token ID2540
Feature activation-4.08e+00

Pos logit contributions
ted+0.359
 Everyone+0.311
 People+0.303
 Many+0.301
 All+0.280
Neg logit contributions
ppers-0.411
kin-0.409
ccoli-0.399
edient-0.377
illas-0.372
Token to
Token ID284
Feature activation-6.41e-01

Pos logit contributions
ted+0.179
 Everyone+0.156
 Many+0.155
 People+0.151
 All+0.131
Neg logit contributions
kin-0.334
ppers-0.329
ccoli-0.318
 rinse-0.308
 promise-0.308
Token dry
Token ID5894
Feature activation+4.39e+00

Pos logit contributions
ted+0.040
 Many+0.037
 Everyone+0.037
 People+0.036
 All+0.035
Neg logit contributions
ppers-0.039
ccoli-0.039
kin-0.038
cher-0.036
illas-0.036
Token.
Token ID13
Feature activation+4.79e-01

Pos logit contributions
kin+0.306
ppers+0.303
ccoli+0.294
 promise+0.284
 achieve+0.280
Neg logit contributions
ted-0.255
 Everyone-0.224
 People-0.216
 Many-0.210
 All-0.198
Token Jack
Token ID3619
Feature activation+1.72e+00

Pos logit contributions
kin+0.034
ccoli+0.034
ppers+0.033
 achieve+0.033
 afloat+0.033
Neg logit contributions
ted-0.025
 Many-0.020
 People-0.019
 Everyone-0.019
heart-0.017
Token was
Token ID373
Feature activation-1.50e+00

Pos logit contributions
kin+0.115
ppers+0.114
ccoli+0.109
 afloat+0.106
 rinse+0.105
Neg logit contributions
ted-0.100
 Everyone-0.091
 Many-0.090
 People-0.088
 All-0.080
Token the
Token ID262
Feature activation+5.07e-01

Pos logit contributions
kin+0.419
ppers+0.382
 globe+0.372
 rinse+0.372
ccoli+0.371
Neg logit contributions
ted-0.427
 Many-0.386
 People-0.383
 Everyone-0.382
elt-0.358
Token shelf
Token ID18316
Feature activation+4.12e+00

Pos logit contributions
ppers+0.032
kin+0.032
ccoli+0.030
 afloat+0.030
kins+0.029
Neg logit contributions
ted-0.031
 Everyone-0.030
 Many-0.030
 People-0.029
 All-0.026
Token.
Token ID13
Feature activation+6.43e-01

Pos logit contributions
ppers+0.288
ccoli+0.286
kin+0.279
 rinse+0.277
 afloat+0.275
Neg logit contributions
ted-0.216
 Everyone-0.202
 Many-0.196
 People-0.195
 All-0.183
Token It
Token ID632
Feature activation-3.47e+00

Pos logit contributions
kin+0.044
ccoli+0.042
ppers+0.041
 rinse+0.040
 afloat+0.039
Neg logit contributions
ted-0.037
 Many-0.031
 Everyone-0.030
 People-0.030
heart-0.028
Token has
Token ID468
Feature activation+7.96e+00

Pos logit contributions
ted+0.282
 Many+0.267
 Everyone+0.266
 People+0.261
 All+0.241
Neg logit contributions
kin-0.152
ppers-0.149
ccoli-0.140
 promise-0.133
 rinse-0.132
Token many
Token ID867
Feature activation-2.37e+00

Pos logit contributions
kin+0.403
eps+0.396
ppers+0.393
 rinse+0.390
aks+0.389
Neg logit contributions
ted-0.523
heart-0.466
 Everyone-0.445
 People-0.431
arted-0.423
Token pictures
Token ID5986
Feature activation-5.57e+00

Pos logit contributions
ted+0.153
 Everyone+0.134
 People+0.127
 Many+0.127
arted+0.123
Neg logit contributions
kin-0.136
ppers-0.132
kins-0.130
eps-0.129
 promise-0.128
Token and
Token ID290
Feature activation-8.95e-02

Pos logit contributions
ted+0.232
 Many+0.207
 Everyone+0.207
 People+0.199
 All+0.168
Neg logit contributions
kin-0.463
ppers-0.458
ccoli-0.441
 afloat-0.432
 rinse-0.430
Token stories
Token ID3923
Feature activation-1.05e+00

Pos logit contributions
ted+0.005
 Everyone+0.005
 Many+0.005
 People+0.004
heart+0.004
Neg logit contributions
kin-0.006
ppers-0.005
ccoli-0.005
 rinse-0.005
 achieve-0.005
Token.
Token ID13
Feature activation+7.57e-01

Pos logit contributions
ted+0.055
 Everyone+0.051
 People+0.049
 Many+0.048
 All+0.045
Neg logit contributions
ppers-0.077
kin-0.076
ccoli-0.074
 afloat-0.073
edient-0.073
Token He
Token ID679
Feature activation+2.02e+00

Pos logit contributions
kin+0.046
ccoli+0.045
ppers+0.043
 rinse+0.042
 afloat+0.041
Neg logit contributions
ted-0.048
 Many-0.039
 Everyone-0.039
 People-0.038
heart-0.038
Token\n
Token ID198
Feature activation+3.43e-01

Pos logit contributions
kin+0.019
ppers+0.018
 achieve+0.018
 rinse+0.017
ccoli+0.017
Neg logit contributions
ted-0.022
 Many-0.020
 Everyone-0.020
 People-0.019
 winds-0.018
TokenThe
Token ID464
Feature activation+5.33e+00

Pos logit contributions
kin+0.018
ppers+0.017
ccoli+0.016
 rinse+0.016
 achieve+0.016
Neg logit contributions
ted-0.025
 Many-0.023
 Everyone-0.023
 People-0.023
 All-0.021
Token little
Token ID1310
Feature activation-2.33e+00

Pos logit contributions
kin+0.403
 promise+0.372
ppers+0.366
kins+0.360
 rinse+0.357
Neg logit contributions
ted-0.286
 Many-0.245
 Everyone-0.243
 People-0.236
 All-0.209
Token girl
Token ID2576
Feature activation-1.42e-01

Pos logit contributions
ted+0.153
 Many+0.144
 Everyone+0.142
 People+0.139
 All+0.124
Neg logit contributions
kin-0.143
 promise-0.130
ppers-0.129
 rinse-0.127
kins-0.126
Token saw
Token ID2497
Feature activation-1.77e+00

Pos logit contributions
ted+0.009
 Many+0.009
 Everyone+0.009
 People+0.009
 winds+0.008
Neg logit contributions
ppers-0.008
kin-0.008
ccoli-0.008
 promise-0.008
 Claus-0.007
Token a
Token ID257
Feature activation-1.69e+00

Pos logit contributions
ted+0.111
 Many+0.104
 People+0.104
umm+0.103
 Everyone+0.103
Neg logit contributions
kin-0.085
kins-0.078
Anna-0.078
 rinse-0.077
ccoli-0.077
Token lot
Token ID1256
Feature activation-1.57e+00

Pos logit contributions
 Many+0.083
 Everyone+0.083
ted+0.082
 People+0.080
 All+0.072
Neg logit contributions
ppers-0.125
kin-0.115
edient-0.115
 afloat-0.114
ccoli-0.114
Token of
Token ID286
Feature activation-5.74e+00

Pos logit contributions
ted+0.053
 Many+0.045
 Everyone+0.043
 People+0.042
 All+0.033
Neg logit contributions
kin-0.143
ppers-0.138
 rinse-0.133
kins-0.133
ccoli-0.132
Token her
Token ID607
Feature activation-1.31e+00

Pos logit contributions
ted+0.350
 Everyone+0.299
 Many+0.291
 People+0.284
elt+0.283
Neg logit contributions
kin-0.352
 promise-0.341
 rinse-0.336
eps-0.335
kins-0.332
Token favorite
Token ID4004
Feature activation-6.18e+00

Pos logit contributions
ted+0.084
 Many+0.076
 Everyone+0.074
 People+0.074
 winds+0.066
Neg logit contributions
kin-0.083
 promise-0.079
ccoli-0.076
kins-0.075
ppers-0.074
Token cartoon
Token ID16251
Feature activation+2.45e+00

Pos logit contributions
ted+0.448
 Many+0.411
 Everyone+0.405
 People+0.402
arted+0.391
Neg logit contributions
kin-0.315
 promise-0.296
kins-0.285
 globe-0.276
ccoli-0.270
Token the
Token ID262
Feature activation+5.02e-01

Pos logit contributions
kin+0.222
ppers+0.215
ccoli+0.206
 promise+0.205
 afloat+0.204
Neg logit contributions
ted-0.199
 Many-0.186
 Everyone-0.184
 People-0.180
 All-0.158
Token skull
Token ID14511
Feature activation+3.59e+00

Pos logit contributions
kin+0.032
ppers+0.031
ccoli+0.030
 rinse+0.029
 afloat+0.028
Neg logit contributions
ted-0.032
 Everyone-0.029
 Many-0.029
 People-0.029
 All-0.026
Token to
Token ID284
Feature activation-6.79e-01

Pos logit contributions
kin+0.225
ppers+0.221
ccoli+0.210
 afloat+0.204
 promise+0.203
Neg logit contributions
ted-0.224
 Many-0.210
 Everyone-0.209
 People-0.205
 All-0.183
Token explore
Token ID7301
Feature activation-8.61e-01

Pos logit contributions
ted+0.049
 Many+0.046
 Everyone+0.046
 People+0.045
 All+0.041
Neg logit contributions
kin-0.036
ppers-0.036
ccoli-0.034
 afloat-0.033
 rinse-0.032
Token.
Token ID13
Feature activation+2.91e-01

Pos logit contributions
ted+0.051
 Many+0.047
 Everyone+0.047
 People+0.046
 All+0.041
Neg logit contributions
kin-0.057
ppers-0.056
ccoli-0.054
 afloat-0.052
 rinse-0.052
Token As
Token ID1081
Feature activation-1.00e+01

Pos logit contributions
 afloat+0.015
ccoli+0.014
 achieve+0.014
 ramp+0.014
 promise+0.014
Neg logit contributions
heart-0.018
ted-0.016
illy-0.015
elle-0.015
andel-0.015
Token he
Token ID339
Feature activation-1.16e-01

Pos logit contributions
ted+0.546
 Many+0.517
 Everyone+0.513
 People+0.504
heart+0.436
Neg logit contributions
kin-0.684
ppers-0.674
 afloat-0.670
 achieve-0.639
ccoli-0.624
Token sw
Token ID1509
Feature activation+2.96e-01

Pos logit contributions
ted+0.007
 Everyone+0.007
 Many+0.007
 People+0.007
 All+0.006
Neg logit contributions
kin-0.007
ppers-0.007
 afloat-0.006
ccoli-0.006
 promise-0.006
Tokenam
Token ID321
Feature activation-1.05e+01

Pos logit contributions
kin+0.025
ppers+0.025
ccoli+0.024
 afloat+0.023
 promise+0.023
Neg logit contributions
ted-0.012
 Many-0.011
 Everyone-0.011
 People-0.011
 All-0.009
Token inside
Token ID2641
Feature activation+3.33e+00

Pos logit contributions
ted+0.563
 Everyone+0.512
 Many+0.490
 People+0.486
 All+0.450
Neg logit contributions
ppers-0.799
ccoli-0.789
kin-0.771
wy-0.731
 rinse-0.730
Token,
Token ID11
Feature activation+5.16e-02

Pos logit contributions
kin+0.203
ppers+0.190
 afloat+0.183
kins+0.182
 achieve+0.181
Neg logit contributions
ted-0.208
 Many-0.200
 Everyone-0.197
 People-0.191
 All-0.168
Token trip
Token ID5296
Feature activation-5.09e+00

Pos logit contributions
ted+0.132
 Many+0.124
 Everyone+0.124
 People+0.121
 All+0.111
Neg logit contributions
kin-0.101
ppers-0.100
ccoli-0.094
 afloat-0.091
 rinse-0.090
Token to
Token ID284
Feature activation-2.68e-01

Pos logit contributions
ted+0.223
 Everyone+0.204
 Many+0.197
 People+0.195
 All+0.170
Neg logit contributions
kin-0.416
ppers-0.407
ccoli-0.401
 rinse-0.387
 afloat-0.381
Token the
Token ID262
Feature activation+8.85e-01

Pos logit contributions
ted+0.017
 Many+0.015
 Everyone+0.014
 People+0.014
heart+0.013
Neg logit contributions
kin-0.016
 rinse-0.016
 achieve-0.016
 promise-0.016
ppers-0.016
Token park
Token ID3952
Feature activation-1.23e+00

Pos logit contributions
kin+0.055
ppers+0.054
 promise+0.052
 achieve+0.052
 afloat+0.052
Neg logit contributions
ted-0.054
 Many-0.049
 Everyone-0.048
 People-0.047
 All-0.042
Token.
Token ID13
Feature activation+6.84e-01

Pos logit contributions
ted+0.082
 Many+0.076
 Everyone+0.075
 People+0.074
 All+0.066
Neg logit contributions
kin-0.071
ppers-0.070
ccoli-0.066
 promise-0.065
 afloat-0.065
Token As
Token ID1081
Feature activation-9.58e+00

Pos logit contributions
 rinse+0.038
ccoli+0.037
ppers+0.037
 Claus+0.037
kin+0.037
Neg logit contributions
ted-0.042
heart-0.037
 Many-0.035
 People-0.034
 Everyone-0.034
Token soon
Token ID2582
Feature activation+1.97e+00

Pos logit contributions
ted+0.607
 Many+0.568
 Everyone+0.564
 People+0.552
 All+0.494
Neg logit contributions
kin-0.594
ppers-0.581
ccoli-0.547
 rinse-0.541
 afloat-0.533
Token as
Token ID355
Feature activation-8.03e+00

Pos logit contributions
kin+0.133
 rinse+0.120
ppers+0.114
 afloat+0.113
aults+0.113
Neg logit contributions
 People-0.096
 Many-0.093
ted-0.091
heart-0.090
 Everyone-0.089
Token they
Token ID484
Feature activation+2.13e+00

Pos logit contributions
ted+0.504
 Many+0.471
 Everyone+0.466
 People+0.456
 All+0.410
Neg logit contributions
kin-0.505
ppers-0.491
ccoli-0.460
 rinse-0.457
kins-0.453
Token arrived
Token ID5284
Feature activation-4.77e-01

Pos logit contributions
kin+0.123
ppers+0.120
 rinse+0.115
 promise+0.114
illas+0.111
Neg logit contributions
 Everyone-0.132
 People-0.130
 Many-0.127
ted-0.125
heart-0.112
Token,
Token ID11
Feature activation+4.92e-01

Pos logit contributions
ted+0.027
 Many+0.026
 Everyone+0.025
 People+0.025
 All+0.022
Neg logit contributions
kin-0.032
ppers-0.032
ccoli-0.030
 afloat-0.030
kins-0.029
Token decided
Token ID3066
Feature activation-2.29e+00

Pos logit contributions
ted+0.129
 Many+0.123
 Everyone+0.121
 People+0.118
 All+0.103
Neg logit contributions
kin-0.137
ppers-0.131
ccoli-0.127
 promise-0.126
kins-0.125
Token to
Token ID284
Feature activation-1.44e-01

Pos logit contributions
ted+0.131
 Many+0.122
 Everyone+0.120
 People+0.117
 All+0.100
Neg logit contributions
ppers-0.149
kin-0.145
 achieve-0.145
ccoli-0.143
 promise-0.141
Token explore
Token ID7301
Feature activation-3.20e-01

Pos logit contributions
ted+0.011
 Many+0.010
 Everyone+0.010
 People+0.010
 All+0.009
Neg logit contributions
kin-0.007
ppers-0.007
ccoli-0.007
 promise-0.007
 rinse-0.007
Token and
Token ID290
Feature activation-4.07e-03

Pos logit contributions
ted+0.018
 Many+0.017
 Everyone+0.017
 People+0.016
 All+0.014
Neg logit contributions
kin-0.022
ppers-0.021
ccoli-0.020
 afloat-0.020
 promise-0.020
Token sw
Token ID1509
Feature activation+8.12e-01

Pos logit contributions
ted+0.000
 Many+0.000
 Everyone+0.000
 People+0.000
 All+0.000
Neg logit contributions
kin-0.000
ppers-0.000
 promise-0.000
 achieve-0.000
ccoli-0.000
Tokenam
Token ID321
Feature activation-9.87e+00

Pos logit contributions
kin+0.065
ppers+0.064
 afloat+0.061
ccoli+0.061
 promise+0.061
Neg logit contributions
ted-0.036
 Many-0.034
 Everyone-0.033
 People-0.032
 All-0.028
Token far
Token ID1290
Feature activation+1.95e-01

Pos logit contributions
ted+0.536
 Everyone+0.508
 People+0.476
 Many+0.465
 All+0.441
Neg logit contributions
ppers-0.715
ccoli-0.703
kin-0.685
 rinse-0.639
wy-0.636
Token away
Token ID1497
Feature activation+1.11e+01

Pos logit contributions
ccoli+0.010
kin+0.010
ppers+0.010
 rinse+0.008
cher+0.008
Neg logit contributions
ted-0.015
ly-0.014
 Everyone-0.014
 People-0.013
 as-0.013
Token.
Token ID13
Feature activation+8.67e-01

Pos logit contributions
kin+0.700
ppers+0.692
ccoli+0.659
 rinse+0.635
 afloat+0.634
Neg logit contributions
ted-0.692
 Everyone-0.644
 Many-0.642
 People-0.628
 All-0.568
Token The
Token ID383
Feature activation-2.23e+00

Pos logit contributions
ccoli+0.037
Anna+0.036
kin+0.036
 achieve+0.035
 rinse+0.035
Neg logit contributions
ted-0.065
heart-0.056
ly-0.055
 winds-0.054
 Many-0.053
Token big
Token ID1263
Feature activation+1.29e+00

Pos logit contributions
ted+0.143
 Everyone+0.130
 Many+0.129
 People+0.127
 All+0.114
Neg logit contributions
kin-0.139
ppers-0.131
 promise-0.128
 afloat-0.127
 rinse-0.124
TokenThey
Token ID2990
Feature activation+7.89e+00

Pos logit contributions
ted+0.007
 Many+0.007
 Everyone+0.007
 People+0.006
 All+0.006
Neg logit contributions
ccoli-0.006
kin-0.006
ppers-0.006
 promise-0.005
 afloat-0.005
Token spl
Token ID4328
Feature activation-2.26e+00

Pos logit contributions
kin+0.519
ppers+0.512
ccoli+0.487
 afloat+0.474
 rinse+0.473
Neg logit contributions
ted-0.468
 Everyone-0.434
 Many-0.433
 People-0.423
 All-0.379
Tokenashed
Token ID5263
Feature activation-6.05e+00

Pos logit contributions
ted+0.111
 People+0.109
 Everyone+0.109
 Many+0.108
 All+0.095
Neg logit contributions
ppers-0.167
kin-0.164
ccoli-0.155
 afloat-0.151
 rinse-0.151
Token and
Token ID290
Feature activation-3.14e-01

Pos logit contributions
ted+0.374
 Many+0.371
 Everyone+0.365
 People+0.364
 All+0.322
Neg logit contributions
kin-0.362
ppers-0.346
 afloat-0.336
 rinse-0.330
ccoli-0.329
Token sw
Token ID1509
Feature activation+5.51e-01

Pos logit contributions
ted+0.019
 Everyone+0.018
 Many+0.018
 People+0.017
 All+0.016
Neg logit contributions
kin-0.020
ppers-0.020
ccoli-0.019
 afloat-0.018
 rinse-0.018
Tokenam
Token ID321
Feature activation-1.03e+01

Pos logit contributions
ppers+0.045
kin+0.044
ccoli+0.042
 afloat+0.041
kins+0.041
Neg logit contributions
ted-0.024
 Many-0.023
 Everyone-0.022
 People-0.022
 All-0.019
Token and
Token ID290
Feature activation-3.15e-01

Pos logit contributions
ted+0.473
 Everyone+0.427
 Many+0.389
 People+0.384
ly+0.372
Neg logit contributions
ccoli-0.839
ppers-0.832
kin-0.808
 achieve-0.785
wy-0.775
Token had
Token ID550
Feature activation+6.63e+00

Pos logit contributions
ted+0.018
 Everyone+0.015
 Many+0.015
 People+0.015
 All+0.013
Neg logit contributions
kin-0.022
ppers-0.022
ccoli-0.021
 achieve-0.021
edient-0.021
Token fun
Token ID1257
Feature activation-2.04e+00

Pos logit contributions
kin+0.431
ppers+0.425
ccoli+0.409
 rinse+0.404
 afloat+0.403
Neg logit contributions
ted-0.390
 Everyone-0.357
 Many-0.353
 People-0.347
 All-0.304
Token.
Token ID13
Feature activation+4.32e-01

Pos logit contributions
ted+0.139
 Many+0.130
 Everyone+0.128
 People+0.127
 All+0.113
Neg logit contributions
kin-0.118
ppers-0.116
 afloat-0.108
ccoli-0.107
kins-0.106
Token They
Token ID1119
Feature activation+2.34e+00

Pos logit contributions
kin+0.025
ccoli+0.025
 rinse+0.025
 afloat+0.024
ppers+0.024
Neg logit contributions
heart-0.025
ted-0.024
 Many-0.021
 People-0.021
unn-0.020
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
ted+0.015
 Many+0.014
 Everyone+0.013
 People+0.013
 All+0.012
Neg logit contributions
kin-0.017
ppers-0.016
ccoli-0.016
 afloat-0.016
 rinse-0.016
Token bridge
Token ID7696
Feature activation+2.85e+00

Pos logit contributions
kin+0.055
ppers+0.054
ccoli+0.050
 afloat+0.049
 promise+0.049
Neg logit contributions
ted-0.088
 Many-0.082
 Everyone-0.082
 People-0.080
 All-0.074
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
kin+0.221
ppers+0.218
ccoli+0.211
 rinse+0.205
 afloat+0.204
Neg logit contributions
ted-0.134
 Everyone-0.122
 Many-0.121
 People-0.118
 All-0.103
Token He
Token ID679
Feature activation+2.35e+00

Pos logit contributions
ccoli+0.078
ppers+0.076
 rinse+0.076
kin+0.075
 Claus+0.074
Neg logit contributions
ted-0.076
 Many-0.067
 People-0.063
heart-0.062
 news-0.062
Token sw
Token ID1509
Feature activation+8.55e-01

Pos logit contributions
ppers+0.143
kin+0.141
ccoli+0.134
 promise+0.131
kins+0.128
Neg logit contributions
ted-0.144
 Everyone-0.142
 Many-0.140
 People-0.140
 All-0.121
Tokenam
Token ID321
Feature activation-9.99e+00

Pos logit contributions
ppers+0.065
kin+0.064
 afloat+0.060
ccoli+0.060
kins+0.060
Neg logit contributions
ted-0.041
 Many-0.040
 Everyone-0.038
 People-0.038
 All-0.034
Token up
Token ID510
Feature activation-1.69e+00

Pos logit contributions
ted+0.581
 Everyone+0.521
 Many+0.503
 People+0.492
 All+0.454
Neg logit contributions
kin-0.694
ppers-0.689
ccoli-0.670
 rinse-0.646
 achieve-0.646
Token to
Token ID284
Feature activation-8.49e-02

Pos logit contributions
ted+0.084
 Many+0.076
 Everyone+0.076
 People+0.074
 All+0.065
Neg logit contributions
kin-0.128
ppers-0.127
ccoli-0.122
 rinse-0.118
 afloat-0.118
Token it
Token ID340
Feature activation-3.08e+00

Pos logit contributions
ted+0.005
 Many+0.005
 Everyone+0.005
 People+0.005
pill+0.004
Neg logit contributions
kin-0.005
ppers-0.005
 rinse-0.005
 afloat-0.005
 promise-0.005
Token and
Token ID290
Feature activation+3.37e-01

Pos logit contributions
ted+0.142
 Everyone+0.125
 Many+0.120
 People+0.118
 All+0.113
Neg logit contributions
kin-0.246
ppers-0.237
wy-0.228
ccoli-0.227
illas-0.226
Token called
Token ID1444
Feature activation-3.04e+00

Pos logit contributions
ppers+0.022
kin+0.022
ccoli+0.021
 afloat+0.021
 rinse+0.021
Neg logit contributions
ted-0.019
 Many-0.018
 Everyone-0.018
 People-0.018
 All-0.016