TOP ACTIVATIONS
MAX = 17.157

car had a party . Many friends came to celebrate .
tree in a park . Many animals lived in the tree
rushed toward the fire . Many of the people said ,
of trees and flowers . Many humans came to play and
sour ! â . Many people heard her and started
very tough and strong . Many animals lived in the forest
at the sports ground . Many children were playing and running
long , long time . Many people came to see the
was a big race . Many kids ran in the race
made flour from wheat . Many people came to see the
was a big hive . Many bees lived in the hive
landed on the street . Many birds and dogs came and
it was too late . Many people were sad and angry
planes came and went . Many people went there to fly
there was a hive . Many bees lived in the hive
a big peach tree . Many pe aches grew on the
the swing are popular . Many children want to play on
can sing very well . Many people like to hear him
middle of the park . Many kids would come to play
searching for a lesson . Many times , what we â

MIN ACTIVATIONS
MIN = -13.870

Oh , Mike ! Bro ccoli is a vegetable ! It
. " Bro ccoli is not y ucky ,
on your plate . Bro ccoli is not that bad .
ruined the broccoli ! Bro ccoli is good for you .
. " Bro ccoli is what we have for
to eat broccoli . Bro ccoli was green and bitter and
and stay well . Bro ccoli is your friend , not
Mom says , " Bro ccoli is good for you .
keep trying , you can achieve anything . <|endoftext|> Tom my
and exercise are necessary to achieve any goal . <|endoftext|> Once
and determination , he could achieve anything he wanted . <|endoftext|>
dream and worked hard to achieve it . The
hard work , she could achieve anything she wanted . Lily
a friend to help you achieve your goals . Friends can
with patience he could always achieve what he wanted . And
and practice , he could achieve anything he wanted . He
try new things . Bro ccoli is not that bad .
hard work can help you achieve your goals . He felt
working hard and you will achieve great things ."
the smallest of us can achieve great things ." <|endoftext|> Once

INTERVAL 9.401 - 17.157
CONTAINS 0.074%

around in the fields . One day , Benny saw a
that was very popular . People came from all around to
, a little bird named Twe ety , and told her
out of the hole . People walked by and saw Tim
and love playing outside . One day , Zoe was playing

INTERVAL 1.644 - 9.401
CONTAINS 23.425%

red hat , a beautiful yellow dress , and two j
felt ashamed when she saw him carrying the sausage and he
a time there was a little boy , who was very
t eddy . He gave it to Lily .
the dish washer so she leaned against the counter while she

INTERVAL -6.113 - 1.644
CONTAINS 73.597%

?" Anna asked . " Sure , sweet ie
had found on the ground . C urious
to the park . When they got to the
they could buy it , but her mom my said no
, waiting for the disgusting thing to move away . Eventually

INTERVAL -13.870 - -6.113
CONTAINS 2.904%

From that day on , Lily knew that if
would join hands and go on adventures .
the bird and set off on an adventure .
distance and ran over to True . She said , "
them in the front of their shirt . And they started
Token car
Token ID1097
Feature activation+2.56e+00

Pos logit contributions
ted+0.075
 Many+0.074
 Everyone+0.070
 People+0.070
 Every+0.066
Neg logit contributions
cher-0.099
ppers-0.096
kin-0.096
 achieve-0.096
edient-0.096
Token had
Token ID550
Feature activation-4.98e+00

Pos logit contributions
ted+0.142
 Many+0.139
 Everyone+0.131
 People+0.130
 Every+0.124
Neg logit contributions
cher-0.171
kin-0.167
 achieve-0.164
illas-0.164
ppers-0.164
Token a
Token ID257
Feature activation+2.47e+00

Pos logit contributions
cher+0.301
 afloat+0.288
 achieve+0.285
 Claus+0.282
illas+0.279
Neg logit contributions
ted-0.292
 Many-0.276
heart-0.274
ummy-0.259
andel-0.259
Token party
Token ID2151
Feature activation+1.17e+00

Pos logit contributions
 Many+0.120
 Everyone+0.111
 People+0.109
 All+0.107
 Every+0.106
Neg logit contributions
cher-0.184
ppers-0.183
illas-0.182
kins-0.176
edient-0.175
Token.
Token ID13
Feature activation+4.32e-02

Pos logit contributions
ted+0.056
 Many+0.052
 Everyone+0.052
 People+0.051
 All+0.048
Neg logit contributions
cher-0.086
edient-0.085
 afloat-0.085
illas-0.084
 achieve-0.084
Token Many
Token ID4650
Feature activation+1.47e+01

Pos logit contributions
ted+0.002
ly+0.002
heart+0.002
 Many+0.002
 winds+0.002
Neg logit contributions
cher-0.003
illas-0.003
 Claus-0.003
 achieve-0.003
kin-0.003
Token friends
Token ID2460
Feature activation+3.73e+00

Pos logit contributions
ted+0.680
 Many+0.624
 Every+0.620
 Everyone+0.614
 As+0.591
Neg logit contributions
cher-1.083
illas-1.072
kins-1.066
kin-1.061
 achieve-1.045
Token came
Token ID1625
Feature activation+5.51e+00

Pos logit contributions
ted+0.187
 Many+0.185
 Everyone+0.172
 People+0.172
 Every+0.162
Neg logit contributions
cher-0.268
illas-0.260
kin-0.260
 achieve-0.259
edient-0.256
Token to
Token ID284
Feature activation+6.85e-01

Pos logit contributions
ted+0.219
 Many+0.205
 Everyone+0.191
 People+0.190
 Every+0.180
Neg logit contributions
kin-0.464
cher-0.458
ppers-0.445
 achieve-0.443
edient-0.440
Token celebrate
Token ID10648
Feature activation-3.91e+00

Pos logit contributions
ted+0.044
 Many+0.041
 Everyone+0.039
 People+0.039
heart+0.037
Neg logit contributions
cher-0.040
 achieve-0.040
illas-0.039
kin-0.039
 afloat-0.039
Token.
Token ID13
Feature activation-4.57e-01

Pos logit contributions
cher+0.311
ppers+0.301
edient+0.298
illas+0.296
 afloat+0.296
Neg logit contributions
ted-0.169
 Everyone-0.165
 People-0.160
 Many-0.158
 All-0.150
Token tree
Token ID5509
Feature activation+7.62e-01

Pos logit contributions
 Many+0.040
ted+0.039
 Everyone+0.037
 People+0.037
 Every+0.036
Neg logit contributions
cher-0.057
ppers-0.055
edient-0.054
illas-0.054
 achieve-0.053
Token in
Token ID287
Feature activation-2.56e+00

Pos logit contributions
ted+0.038
 Many+0.036
 Everyone+0.034
 People+0.034
 Every+0.034
Neg logit contributions
cher-0.056
kin-0.054
 afloat-0.053
illas-0.053
ppers-0.053
Token a
Token ID257
Feature activation+2.13e+00

Pos logit contributions
kin+0.180
cher+0.178
 afloat+0.177
 achieve+0.176
edient+0.176
Neg logit contributions
ted-0.132
 Many-0.127
 People-0.119
 Everyone-0.118
heart-0.107
Token park
Token ID3952
Feature activation+1.46e+00

Pos logit contributions
 Many+0.110
 Everyone+0.102
 People+0.099
 All+0.098
ted+0.097
Neg logit contributions
cher-0.160
ppers-0.156
edient-0.150
kin-0.150
illas-0.150
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
ted+0.085
 Many+0.083
 People+0.078
 Everyone+0.078
 Every+0.074
Neg logit contributions
cher-0.094
kin-0.092
illas-0.090
edient-0.090
ppers-0.090
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
cher+0.013
 globe+0.012
 achieve+0.012
ccoli+0.012
Jackson+0.012
Neg logit contributions
ted-0.010
ly-0.008
heart-0.008
andel-0.008
ened-0.008
Token animals
Token ID4695
Feature activation+8.55e+00

Pos logit contributions
ted+0.664
 As+0.623
 Everyone+0.607
 Every+0.602
 Many+0.584
Neg logit contributions
kins-1.036
illas-1.026
aks-1.019
 achieve-1.006
cher-0.994
Token lived
Token ID5615
Feature activation+1.90e+00

Pos logit contributions
ted+0.431
 Many+0.418
 Everyone+0.393
 People+0.392
 Every+0.368
Neg logit contributions
cher-0.615
kin-0.605
ppers-0.596
 afloat-0.594
edient-0.594
Token in
Token ID287
Feature activation-3.21e+00

Pos logit contributions
ted+0.105
 Many+0.096
 People+0.093
 Everyone+0.093
 Every+0.091
Neg logit contributions
ppers-0.125
cher-0.123
 achieve-0.123
edient-0.122
 afloat-0.122
Token the
Token ID262
Feature activation-7.96e-01

Pos logit contributions
cher+0.242
kin+0.239
illas+0.235
edient+0.235
 achieve+0.234
Neg logit contributions
ted-0.152
 Many-0.147
 Everyone-0.137
 People-0.137
 Every-0.128
Token tree
Token ID5509
Feature activation-1.61e-01

Pos logit contributions
cher+0.059
kin+0.058
ppers+0.057
edient+0.057
 afloat+0.057
Neg logit contributions
ted-0.039
 Many-0.038
 Everyone-0.036
 People-0.036
 Every-0.034
Token rushed
Token ID13999
Feature activation+2.93e+00

Pos logit contributions
 Many+0.027
 Everyone+0.026
ted+0.026
 People+0.026
andel+0.024
Neg logit contributions
cher-0.024
kins-0.024
kin-0.024
ppers-0.024
 afloat-0.023
Token toward
Token ID3812
Feature activation+2.09e+00

Pos logit contributions
 Many+0.168
ted+0.165
 People+0.157
 Everyone+0.156
 Every+0.148
Neg logit contributions
cher-0.186
 afloat-0.182
edient-0.179
kin-0.179
ppers-0.179
Token the
Token ID262
Feature activation-6.22e-01

Pos logit contributions
 Many+0.121
ted+0.121
heart+0.112
 Everyone+0.111
 People+0.110
Neg logit contributions
edient-0.120
illas-0.119
 achieve-0.119
kin-0.118
 globe-0.118
Token fire
Token ID2046
Feature activation-1.14e+00

Pos logit contributions
cher+0.037
kin+0.037
edient+0.036
 afloat+0.036
 achieve+0.036
Neg logit contributions
ted-0.039
 Many-0.038
 Everyone-0.036
 People-0.036
 Every-0.034
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
cher+0.086
kin+0.085
 achieve+0.083
ppers+0.081
illas+0.080
Neg logit contributions
ted-0.056
 Everyone-0.050
 Many-0.049
 People-0.049
 All-0.048
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
 achieve+0.022
ccoli+0.021
 Claus+0.021
cher+0.020
Jackson+0.020
Neg logit contributions
ted-0.016
heart-0.016
andel-0.014
ly-0.014
 Ph-0.013
Token of
Token ID286
Feature activation+6.94e+00

Pos logit contributions
ted+0.537
 Many+0.496
 Everyone+0.489
 Every+0.461
 People+0.447
Neg logit contributions
illas-1.201
edient-1.198
cher-1.195
kins-1.194
 achieve-1.193
Token the
Token ID262
Feature activation-6.06e-01

Pos logit contributions
ted+0.399
 Many+0.383
 Everyone+0.368
 People+0.362
ly+0.355
Neg logit contributions
edient-0.419
illas-0.419
kins-0.416
kin-0.413
 globe-0.401
Token people
Token ID661
Feature activation+6.61e+00

Pos logit contributions
cher+0.036
kin+0.036
edient+0.035
 afloat+0.035
illas+0.035
Neg logit contributions
ted-0.038
 Many-0.037
 Everyone-0.035
 People-0.035
 Every-0.033
Token said
Token ID531
Feature activation+2.21e+00

Pos logit contributions
ted+0.329
 Many+0.306
 People+0.288
 Everyone+0.287
 All+0.274
Neg logit contributions
kin-0.481
cher-0.480
 achieve-0.470
 afloat-0.469
ppers-0.468
Token,
Token ID11
Feature activation-3.38e-02

Pos logit contributions
ted+0.112
 Many+0.109
 Everyone+0.102
 People+0.102
 Every+0.097
Neg logit contributions
cher-0.158
kin-0.155
 achieve-0.152
ppers-0.151
edient-0.151
Token of
Token ID286
Feature activation+6.90e+00

Pos logit contributions
ted+0.240
ened+0.198
 Many+0.194
 Ph+0.191
 As+0.188
Neg logit contributions
kin-0.350
kins-0.349
 globe-0.349
ak-0.347
aks-0.344
Token trees
Token ID7150
Feature activation+6.96e+00

Pos logit contributions
ted+0.364
heart+0.310
ened+0.307
 As+0.299
ly+0.295
Neg logit contributions
aks-0.451
 globe-0.438
eps-0.421
kins-0.418
illas-0.414
Token and
Token ID290
Feature activation+4.43e-01

Pos logit contributions
ted+0.402
 Many+0.394
 People+0.373
 Everyone+0.372
 Every+0.348
Neg logit contributions
cher-0.447
kin-0.439
ppers-0.433
 afloat-0.433
 achieve-0.433
Token flowers
Token ID12734
Feature activation+6.01e+00

Pos logit contributions
ted+0.021
andel+0.018
heart+0.018
ummy+0.018
 Everyone+0.018
Neg logit contributions
 globe-0.031
illas-0.030
 achieve-0.030
 afloat-0.030
cher-0.030
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
ted+0.366
 Many+0.366
 People+0.345
 Everyone+0.342
heart+0.321
Neg logit contributions
cher-0.362
kin-0.355
illas-0.350
 achieve-0.349
ppers-0.348
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
 globe+0.023
 achieve+0.022
 tim+0.022
 Claus+0.022
 ramp+0.022
Neg logit contributions
ted-0.016
heart-0.015
illy-0.013
unn-0.012
andel-0.012
Token humans
Token ID5384
Feature activation+3.00e+00

Pos logit contributions
ted+0.687
 Every+0.608
 As+0.599
ened+0.582
 Everyone+0.578
Neg logit contributions
aks-1.040
illas-1.015
kins-0.983
 achieve-0.978
ak-0.958
Token came
Token ID1625
Feature activation+5.49e+00

Pos logit contributions
ted+0.162
 Many+0.160
 Everyone+0.150
 People+0.149
 Every+0.141
Neg logit contributions
cher-0.201
illas-0.197
 achieve-0.196
kin-0.195
edient-0.192
Token to
Token ID284
Feature activation+7.51e-01

Pos logit contributions
ted+0.260
 Many+0.251
 Everyone+0.233
 People+0.233
 Every+0.212
Neg logit contributions
cher-0.402
 achieve-0.400
illas-0.399
edient-0.399
 afloat-0.395
Token play
Token ID711
Feature activation+6.79e-01

Pos logit contributions
ted+0.046
 Many+0.042
 Everyone+0.041
 People+0.041
heart+0.038
Neg logit contributions
 achieve-0.046
cher-0.046
illas-0.045
kin-0.045
 afloat-0.045
Token and
Token ID290
Feature activation+4.78e-01

Pos logit contributions
ted+0.042
 Many+0.041
 Everyone+0.039
 People+0.039
 Every+0.037
Neg logit contributions
cher-0.041
kin-0.040
ppers-0.040
edient-0.040
 achieve-0.040
Token sour
Token ID11348
Feature activation+7.95e-01

Pos logit contributions
ted+0.198
 Many+0.193
 People+0.184
 Everyone+0.183
 Every+0.174
Neg logit contributions
cher-0.201
kin-0.199
ppers-0.195
edient-0.195
 afloat-0.193
Token!
Token ID0
Feature activation+1.26e+00

Pos logit contributions
 Many+0.051
ted+0.048
 People+0.047
 Everyone+0.046
 Every+0.043
Neg logit contributions
ppers-0.046
kin-0.046
cher-0.046
 achieve-0.044
ccoli-0.044
Tokenâ
Token ID22940
Feature activation+9.80e-01

Pos logit contributions
ted+0.068
ened+0.056
ly+0.056
 hours+0.054
andel+0.051
Neg logit contributions
ccoli-0.076
edient-0.075
 achieve-0.075
 promise-0.074
kin-0.072
Token
Token ID26391
Feature activation-4.62e+00

Pos logit contributions
ted+0.066
 Everyone+0.065
 Many+0.064
 People+0.064
 All+0.061
Neg logit contributions
cher-0.055
 afloat-0.052
illas-0.052
ppers-0.051
 Claus-0.050
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
cher+0.246
kin+0.241
ppers+0.240
illas+0.240
 achieve+0.239
Neg logit contributions
ted-0.318
 Many-0.308
 Everyone-0.294
 People-0.293
 Every-0.283
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
ccoli+0.019
 achieve+0.019
edient+0.018
kin+0.018
 promise+0.018
Neg logit contributions
ted-0.016
ly-0.013
ened-0.013
 winds-0.012
 hours-0.011
Token people
Token ID661
Feature activation+6.61e+00

Pos logit contributions
ted+0.680
 Every+0.647
 Many+0.645
 Everyone+0.633
 As+0.629
Neg logit contributions
kins-1.073
cher-1.040
kin-1.039
illas-1.033
 achieve-1.027
Token heard
Token ID2982
Feature activation-6.82e-01

Pos logit contributions
ted+0.348
 Many+0.323
 Everyone+0.307
 People+0.302
 All+0.291
Neg logit contributions
cher-0.462
kin-0.461
 afloat-0.457
edient-0.453
ppers-0.453
Token her
Token ID607
Feature activation+2.06e+00

Pos logit contributions
 promise+0.037
 achieve+0.036
kins+0.035
edient+0.035
kin+0.035
Neg logit contributions
ted-0.036
 Many-0.035
unn-0.035
pill-0.033
 People-0.033
Token and
Token ID290
Feature activation+4.87e-01

Pos logit contributions
ted+0.097
 Many+0.093
 Everyone+0.090
 All+0.085
 People+0.084
Neg logit contributions
cher-0.160
kin-0.152
ucer-0.150
illas-0.149
 afloat-0.149
Token started
Token ID2067
Feature activation+4.33e+00

Pos logit contributions
 Many+0.028
ted+0.026
 People+0.026
 Everyone+0.025
 Every+0.024
Neg logit contributions
cher-0.031
kin-0.030
illas-0.030
 achieve-0.030
 Claus-0.030
Token very
Token ID845
Feature activation-2.18e+00

Pos logit contributions
ted+0.083
 Many+0.078
 Everyone+0.076
 People+0.075
heart+0.071
Neg logit contributions
kin-0.067
 afloat-0.065
cher-0.064
 achieve-0.063
edient-0.062
Token tough
Token ID5802
Feature activation+1.10e+00

Pos logit contributions
kin+0.110
 promise+0.103
 achieve+0.103
edient+0.102
 afloat+0.101
Neg logit contributions
ted-0.168
 Many-0.150
 Everyone-0.145
 People-0.144
heart-0.140
Token and
Token ID290
Feature activation+4.17e-01

Pos logit contributions
 Many+0.075
ted+0.073
 Everyone+0.071
 People+0.069
andel+0.067
Neg logit contributions
cher-0.056
 afloat-0.054
edient-0.054
 globe-0.053
 Claus-0.053
Token strong
Token ID1913
Feature activation-1.06e+00

Pos logit contributions
ted+0.027
 Many+0.027
 Everyone+0.025
 People+0.025
 Every+0.024
Neg logit contributions
cher-0.024
kin-0.024
 achieve-0.023
edient-0.023
ppers-0.023
Token.
Token ID13
Feature activation-3.38e-01

Pos logit contributions
cher+0.058
kin+0.057
edient+0.056
 Claus+0.056
 afloat+0.056
Neg logit contributions
 Many-0.070
ted-0.068
 People-0.064
 Everyone-0.064
heart-0.060
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
 globe+0.022
 achieve+0.022
ccoli+0.021
aks+0.020
illas+0.020
Neg logit contributions
ted-0.018
heart-0.017
andel-0.016
ly-0.016
arted-0.015
Token animals
Token ID4695
Feature activation+8.57e+00

Pos logit contributions
ted+0.687
 Everyone+0.608
 As+0.581
 Every+0.576
 Many+0.562
Neg logit contributions
aks-1.028
aky-1.012
kins-1.007
illas-1.007
 achieve-0.999
Token lived
Token ID5615
Feature activation+1.97e+00

Pos logit contributions
ted+0.443
 Many+0.424
 Everyone+0.398
 People+0.398
 All+0.377
Neg logit contributions
cher-0.610
kin-0.596
ppers-0.595
 afloat-0.590
edient-0.588
Token in
Token ID287
Feature activation-3.09e+00

Pos logit contributions
ted+0.106
 Many+0.098
 People+0.094
 Everyone+0.093
 All+0.093
Neg logit contributions
ppers-0.134
cher-0.131
 achieve-0.131
ccoli-0.131
edient-0.130
Token the
Token ID262
Feature activation-6.26e-01

Pos logit contributions
cher+0.263
ppers+0.255
kin+0.250
 achieve+0.249
edient+0.248
Neg logit contributions
 Many-0.124
ted-0.121
 People-0.114
 Everyone-0.114
 Every-0.111
Token forest
Token ID8222
Feature activation+3.69e+00

Pos logit contributions
 achieve+0.040
edient+0.038
 promise+0.038
 globe+0.038
 afloat+0.038
Neg logit contributions
ted-0.036
 Many-0.033
 Everyone-0.031
 People-0.031
heart-0.030
Token at
Token ID379
Feature activation+4.62e+00

Pos logit contributions
ted+0.015
 Everyone+0.014
 Many+0.013
 People+0.012
 All+0.012
Neg logit contributions
cher-0.048
kin-0.047
ppers-0.047
illas-0.047
edient-0.046
Token the
Token ID262
Feature activation-6.71e-01

Pos logit contributions
ted+0.241
 Many+0.234
 People+0.220
 Everyone+0.219
 Every+0.204
Neg logit contributions
cher-0.322
kin-0.318
 afloat-0.315
 achieve-0.314
edient-0.314
Token sports
Token ID5701
Feature activation+6.73e-01

Pos logit contributions
cher+0.047
ppers+0.046
kin+0.046
illas+0.045
edient+0.045
Neg logit contributions
ted-0.035
 Many-0.035
 Everyone-0.034
 People-0.033
 Every-0.032
Token ground
Token ID2323
Feature activation+2.09e+00

Pos logit contributions
ted+0.039
 Many+0.037
 Everyone+0.037
 People+0.036
 All+0.035
Neg logit contributions
cher-0.043
edient-0.043
ppers-0.042
kin-0.042
 afloat-0.042
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
ted+0.104
 Many+0.098
 Everyone+0.094
 People+0.093
 All+0.090
Neg logit contributions
cher-0.153
kin-0.148
ppers-0.146
edient-0.146
illas-0.146
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
 afloat+0.025
cher+0.024
 achieve+0.024
kin+0.024
 Claus+0.024
Neg logit contributions
ted-0.017
 Many-0.015
heart-0.015
ly-0.014
 People-0.013
Token children
Token ID1751
Feature activation+5.52e+00

Pos logit contributions
ted+0.668
 Many+0.636
 Every+0.608
 Everyone+0.581
 People+0.581
Neg logit contributions
 achieve-1.049
cher-1.038
kins-1.033
kin-1.025
illas-1.017
Token were
Token ID547
Feature activation+1.80e+00

Pos logit contributions
ted+0.299
 Many+0.292
 Everyone+0.275
 People+0.274
 Every+0.259
Neg logit contributions
cher-0.376
kin-0.369
ppers-0.363
 achieve-0.362
illas-0.362
Token playing
Token ID2712
Feature activation+3.08e+00

Pos logit contributions
ted+0.095
 Many+0.094
 Everyone+0.089
 People+0.088
 Every+0.083
Neg logit contributions
cher-0.124
kin-0.122
ppers-0.120
edient-0.119
illas-0.119
Token and
Token ID290
Feature activation+4.43e-01

Pos logit contributions
ted+0.165
 Many+0.161
 Everyone+0.159
 People+0.154
 All+0.150
Neg logit contributions
ppers-0.220
 Claus-0.213
cher-0.212
edient-0.211
illas-0.209
Token running
Token ID2491
Feature activation+1.34e+00

Pos logit contributions
ted+0.026
 Many+0.025
 People+0.024
 Everyone+0.023
 Every+0.023
Neg logit contributions
cher-0.028
 achieve-0.027
illas-0.027
kin-0.027
 afloat-0.027
Token long
Token ID890
Feature activation+1.48e+00

Pos logit contributions
 Many+0.104
 Everyone+0.091
 People+0.090
ted+0.087
 All+0.087
Neg logit contributions
cher-0.195
illas-0.194
ppers-0.193
edient-0.192
kins-0.192
Token,
Token ID11
Feature activation+3.01e-01

Pos logit contributions
ted+0.067
 Many+0.066
 Everyone+0.060
 People+0.060
 Every+0.057
Neg logit contributions
cher-0.115
edient-0.110
kin-0.110
ppers-0.110
 achieve-0.110
Token long
Token ID890
Feature activation+1.20e+00

Pos logit contributions
ted+0.016
 Many+0.016
 People+0.015
 Everyone+0.015
 Every+0.015
Neg logit contributions
cher-0.021
illas-0.020
ppers-0.020
edient-0.020
 achieve-0.020
Token time
Token ID640
Feature activation+1.99e+00

Pos logit contributions
ted+0.071
 Many+0.070
 Everyone+0.066
 People+0.066
 Every+0.064
Neg logit contributions
cher-0.076
edient-0.073
ppers-0.072
 achieve-0.072
illas-0.072
Token.
Token ID13
Feature activation-2.83e-01

Pos logit contributions
ted+0.105
 Many+0.102
 Everyone+0.097
 People+0.097
 Every+0.091
Neg logit contributions
cher-0.138
kin-0.136
ppers-0.133
edient-0.133
 achieve-0.133
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
cher+0.019
illas+0.019
 achieve+0.019
ccoli+0.019
 Claus+0.019
Neg logit contributions
ted-0.014
ly-0.012
 Many-0.012
heart-0.012
andel-0.011
Token people
Token ID661
Feature activation+6.49e+00

Pos logit contributions
ted+0.662
 Many+0.622
 Every+0.603
 Everyone+0.598
 As+0.584
Neg logit contributions
kins-1.056
kin-1.045
illas-1.042
cher-1.041
 achieve-1.024
Token came
Token ID1625
Feature activation+5.34e+00

Pos logit contributions
ted+0.338
 Many+0.319
 Everyone+0.303
 People+0.303
 All+0.289
Neg logit contributions
kin-0.458
cher-0.455
 afloat-0.448
ppers-0.448
edient-0.443
Token to
Token ID284
Feature activation+5.39e-01

Pos logit contributions
ted+0.210
 Many+0.189
 Everyone+0.179
 People+0.177
 Every+0.173
Neg logit contributions
kin-0.457
cher-0.445
ppers-0.431
ccoli-0.430
 achieve-0.427
Token see
Token ID766
Feature activation+4.33e-01

Pos logit contributions
ted+0.032
 Many+0.031
 Everyone+0.030
 People+0.029
 Every+0.028
Neg logit contributions
cher-0.034
kin-0.033
 achieve-0.033
illas-0.033
 afloat-0.033
Token the
Token ID262
Feature activation-8.59e-01

Pos logit contributions
ted+0.022
 Many+0.021
 Everyone+0.020
 People+0.020
 Every+0.019
Neg logit contributions
cher-0.031
kin-0.031
ppers-0.030
 achieve-0.030
edient-0.030
Token was
Token ID373
Feature activation+1.77e+00

Pos logit contributions
kin+0.099
cher+0.096
 achieve+0.095
illas+0.094
 afloat+0.093
Neg logit contributions
ted-0.081
 Many-0.079
 People-0.073
 Everyone-0.073
 Every-0.069
Token a
Token ID257
Feature activation+2.13e+00

Pos logit contributions
ted+0.121
unn+0.108
 Many+0.106
 People+0.106
heart+0.105
Neg logit contributions
kin-0.094
 achieve-0.087
 afloat-0.086
Jackson-0.085
Arthur-0.084
Token big
Token ID1263
Feature activation+3.99e-02

Pos logit contributions
 Many+0.070
 Everyone+0.062
 People+0.061
ted+0.058
 All+0.057
Neg logit contributions
cher-0.193
ppers-0.191
illas-0.187
edient-0.185
 afloat-0.183
Token race
Token ID3234
Feature activation+5.68e-01

Pos logit contributions
ted+0.002
 Many+0.002
 Everyone+0.002
 People+0.002
 Every+0.002
Neg logit contributions
cher-0.003
ppers-0.003
edient-0.003
kin-0.003
 achieve-0.003
Token.
Token ID13
Feature activation-2.83e-01

Pos logit contributions
ted+0.033
 Many+0.032
 People+0.030
 Everyone+0.030
 Every+0.029
Neg logit contributions
cher-0.037
kin-0.036
illas-0.035
 achieve-0.035
edient-0.035
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
kin+0.019
cher+0.019
 achieve+0.019
Jackson+0.019
 globe+0.019
Neg logit contributions
ted-0.014
ly-0.011
unn-0.010
andel-0.010
heart-0.010
Token kids
Token ID3988
Feature activation+5.94e+00

Pos logit contributions
ted+0.700
 Many+0.613
 Every+0.598
 Everyone+0.581
 As+0.579
Neg logit contributions
cher-1.049
kins-1.039
 achieve-1.034
illas-1.032
kin-1.024
Token ran
Token ID4966
Feature activation+2.46e+00

Pos logit contributions
ted+0.295
 Many+0.287
 Everyone+0.267
 People+0.267
 Every+0.251
Neg logit contributions
cher-0.432
illas-0.420
 achieve-0.419
kin-0.418
edient-0.415
Token in
Token ID287
Feature activation-3.25e+00

Pos logit contributions
ted+0.134
 Many+0.133
 People+0.124
 Everyone+0.123
 Every+0.116
Neg logit contributions
cher-0.164
 afloat-0.159
 achieve-0.158
edient-0.157
kin-0.157
Token the
Token ID262
Feature activation-8.37e-01

Pos logit contributions
cher+0.245
kin+0.241
ppers+0.239
edient+0.237
illas+0.236
Neg logit contributions
ted-0.154
 Many-0.151
 Everyone-0.141
 People-0.140
 Every-0.133
Token race
Token ID3234
Feature activation+1.99e-01

Pos logit contributions
cher+0.055
ppers+0.054
kin+0.054
edient+0.053
illas+0.053
Neg logit contributions
ted-0.048
 Many-0.048
 Everyone-0.046
 People-0.045
 Every-0.043
Token made
Token ID925
Feature activation+9.36e-02

Pos logit contributions
ted+0.233
 Many+0.228
 Everyone+0.215
 People+0.214
 Every+0.202
Neg logit contributions
cher-0.243
kin-0.240
edient-0.235
illas-0.234
 achieve-0.233
Token flour
Token ID10601
Feature activation-5.33e-01

Pos logit contributions
ted+0.007
pill+0.006
unn+0.006
heart+0.006
umm+0.006
Neg logit contributions
ep-0.005
eps-0.004
aks-0.004
 achieve-0.004
 overcome-0.004
Token from
Token ID422
Feature activation-9.37e-01

Pos logit contributions
cher+0.045
illas+0.044
Jackson+0.043
edient+0.042
ccoli+0.042
Neg logit contributions
ted-0.020
 Everyone-0.018
 Every-0.018
 People-0.018
 Many-0.018
Token wheat
Token ID17135
Feature activation-9.69e-02

Pos logit contributions
 afloat+0.061
 globe+0.061
edient+0.060
kin+0.060
illas+0.060
Neg logit contributions
 Many-0.051
ted-0.051
 Everyone-0.048
 People-0.048
 All-0.045
Token.
Token ID13
Feature activation-3.71e-01

Pos logit contributions
cher+0.007
kin+0.007
edient+0.007
illas+0.007
ppers+0.007
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.004
 People-0.004
 Every-0.004
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
 globe+0.020
 achieve+0.019
aks+0.018
illas+0.018
 afloat+0.018
Neg logit contributions
ted-0.024
ly-0.022
heart-0.022
arted-0.021
ened-0.021
Token people
Token ID661
Feature activation+6.55e+00

Pos logit contributions
ted+0.667
 Every+0.566
 Many+0.565
 Everyone+0.553
 As+0.536
Neg logit contributions
illas-1.123
kins-1.118
 achieve-1.089
edient-1.087
kin-1.082
Token came
Token ID1625
Feature activation+5.45e+00

Pos logit contributions
ted+0.339
 Many+0.327
 Everyone+0.309
 People+0.308
 All+0.289
Neg logit contributions
cher-0.463
kin-0.456
 afloat-0.452
ppers-0.449
edient-0.447
Token to
Token ID284
Feature activation+7.00e-01

Pos logit contributions
ted+0.216
 Many+0.208
 Everyone+0.192
 People+0.192
 Every+0.178
Neg logit contributions
cher-0.453
kin-0.449
ppers-0.438
 afloat-0.436
 achieve-0.436
Token see
Token ID766
Feature activation+6.33e-01

Pos logit contributions
ted+0.042
 Many+0.040
 Everyone+0.038
 People+0.038
 Every+0.036
Neg logit contributions
cher-0.044
kin-0.043
 achieve-0.043
illas-0.042
 afloat-0.042
Token the
Token ID262
Feature activation-6.34e-01

Pos logit contributions
ted+0.032
 Many+0.031
 Everyone+0.029
 People+0.029
 Every+0.027
Neg logit contributions
cher-0.044
illas-0.044
ppers-0.043
 achieve-0.043
kin-0.043
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
kin+0.141
 globe+0.134
 achieve+0.133
illas+0.131
edient+0.129
Neg logit contributions
ted-0.137
 Many-0.131
 People-0.124
 Everyone-0.123
andel-0.122
Token a
Token ID257
Feature activation+1.70e+00

Pos logit contributions
heart+0.091
ted+0.091
unn+0.085
arted+0.084
ly+0.084
Neg logit contributions
kin-0.056
 globe-0.048
Anna-0.045
aks-0.045
ccoli-0.044
Token big
Token ID1263
Feature activation-2.61e-01

Pos logit contributions
 Many+0.051
 Everyone+0.043
 People+0.042
 All+0.039
ted+0.037
Neg logit contributions
cher-0.162
illas-0.160
ppers-0.159
edient-0.156
 afloat-0.154
Token hive
Token ID35881
Feature activation-2.36e+00

Pos logit contributions
cher+0.019
ppers+0.018
edient+0.018
illas+0.018
 achieve+0.018
Neg logit contributions
ted-0.013
 Many-0.013
 Everyone-0.012
 People-0.012
 Every-0.012
Token.
Token ID13
Feature activation-3.70e-01

Pos logit contributions
cher+0.164
illas+0.158
ppers+0.157
kin+0.157
 achieve+0.156
Neg logit contributions
ted-0.126
 Many-0.122
 Everyone-0.118
 People-0.116
 Every-0.112
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
kin+0.025
 globe+0.024
aks+0.024
 achieve+0.024
 tim+0.024
Neg logit contributions
ted-0.019
heart-0.017
ly-0.015
arted-0.015
illy-0.015
Token bees
Token ID17002
Feature activation+4.19e+00

Pos logit contributions
ted+0.729
 As+0.691
 Everyone+0.676
 Every+0.663
 Many+0.649
Neg logit contributions
kins-1.065
aks-1.013
aky-0.992
kin-0.988
illas-0.964
Token lived
Token ID5615
Feature activation+1.93e+00

Pos logit contributions
ted+0.224
 Many+0.221
 Everyone+0.208
 People+0.207
 Every+0.195
Neg logit contributions
cher-0.286
kin-0.282
illas-0.279
 achieve-0.277
edient-0.277
Token in
Token ID287
Feature activation-3.11e+00

Pos logit contributions
ted+0.105
 Many+0.098
 People+0.093
 Everyone+0.093
 Every+0.092
Neg logit contributions
cher-0.130
ppers-0.129
 achieve-0.128
 afloat-0.126
ccoli-0.125
Token the
Token ID262
Feature activation-6.48e-01

Pos logit contributions
kin+0.225
cher+0.222
illas+0.220
edient+0.219
 achieve+0.219
Neg logit contributions
ted-0.155
 Many-0.148
 Everyone-0.138
 People-0.138
heart-0.127
Token hive
Token ID35881
Feature activation-2.31e+00

Pos logit contributions
 achieve+0.032
kin+0.032
edient+0.031
 afloat+0.031
illas+0.031
Neg logit contributions
ted-0.046
 Many-0.044
 Everyone-0.042
 People-0.042
 Every-0.040
Token landed
Token ID11406
Feature activation+3.72e+00

Pos logit contributions
ted+0.020
 Many+0.019
 Everyone+0.018
 People+0.018
 All+0.017
Neg logit contributions
cher-0.024
kin-0.024
 achieve-0.024
edient-0.024
 afloat-0.024
Token on
Token ID319
Feature activation-7.25e+00

Pos logit contributions
ted+0.278
 Many+0.248
 Everyone+0.246
 People+0.244
 All+0.236
Neg logit contributions
 achieve-0.190
edient-0.185
ccoli-0.185
 promise-0.184
cher-0.182
Token the
Token ID262
Feature activation-6.82e-01

Pos logit contributions
kin+0.495
 globe+0.495
 afloat+0.491
cher+0.488
illas+0.487
Neg logit contributions
ted-0.359
 Many-0.356
 People-0.330
 Everyone-0.321
heart-0.320
Token street
Token ID4675
Feature activation-1.12e+00

Pos logit contributions
cher+0.048
 afloat+0.046
ppers+0.046
kin+0.046
 achieve+0.046
Neg logit contributions
 Many-0.036
ted-0.036
 Everyone-0.035
 People-0.034
 All-0.031
Token.
Token ID13
Feature activation-3.70e-01

Pos logit contributions
cher+0.078
kin+0.076
 afloat+0.075
ppers+0.075
 achieve+0.075
Neg logit contributions
ted-0.060
 Many-0.058
 Everyone-0.055
 People-0.054
 All-0.052
Token Many
Token ID4650
Feature activation+1.45e+01

Pos logit contributions
cher+0.026
 achieve+0.026
 afloat+0.025
kin+0.025
 Claus+0.025
Neg logit contributions
ted-0.019
heart-0.017
 Many-0.016
ly-0.016
 People-0.015
Token birds
Token ID10087
Feature activation+8.99e+00

Pos logit contributions
ted+0.588
 Many+0.537
 Every+0.526
 Everyone+0.514
 People+0.496
Neg logit contributions
illas-1.154
cher-1.150
 achieve-1.136
kin-1.133
kins-1.126
Token and
Token ID290
Feature activation+4.01e-01

Pos logit contributions
ted+0.425
 Many+0.393
 Everyone+0.364
 People+0.362
 All+0.343
Neg logit contributions
cher-0.685
kin-0.681
edient-0.670
 afloat-0.667
ppers-0.666
Token dogs
Token ID6844
Feature activation+5.79e+00

Pos logit contributions
 Many+0.018
 Everyone+0.017
 People+0.017
ted+0.017
 Every+0.017
Neg logit contributions
illas-0.029
 Claus-0.029
cher-0.028
 globe-0.028
ppers-0.028
Token came
Token ID1625
Feature activation+5.45e+00

Pos logit contributions
ted+0.299
 Many+0.266
 Everyone+0.250
 People+0.247
 All+0.242
Neg logit contributions
kin-0.429
cher-0.422
edient-0.420
ppers-0.415
 afloat-0.414
Token and
Token ID290
Feature activation+4.31e-01

Pos logit contributions
ted+0.275
 Many+0.264
 Everyone+0.249
 People+0.246
 Every+0.233
Neg logit contributions
kin-0.401
cher-0.396
 achieve-0.383
illas-0.382
ppers-0.381
Token it
Token ID340
Feature activation+3.70e+00

Pos logit contributions
cher+0.129
kin+0.126
ppers+0.124
 achieve+0.124
illas+0.124
Neg logit contributions
ted-0.092
 Many-0.089
 Everyone-0.084
 People-0.084
 Every-0.079
Token was
Token ID373
Feature activation+1.16e+00

Pos logit contributions
ted+0.180
 Many+0.159
 People+0.152
 Everyone+0.149
 All+0.148
Neg logit contributions
 achieve-0.276
cher-0.275
 afloat-0.269
ppers-0.267
kin-0.265
Token too
Token ID1165
Feature activation+3.19e+00

Pos logit contributions
ted+0.049
 Every+0.048
 as+0.046
 Many+0.046
 All+0.043
Neg logit contributions
 achieve-0.090
cher-0.089
illas-0.086
ucer-0.085
ppers-0.085
Token late
Token ID2739
Feature activation-3.36e+00

Pos logit contributions
 Many+0.175
ted+0.174
 Every+0.164
 People+0.161
 Everyone+0.161
Neg logit contributions
cher-0.216
illas-0.209
 achieve-0.208
ppers-0.202
kin-0.200
Token.
Token ID13
Feature activation-3.93e-01

Pos logit contributions
cher+0.258
kin+0.252
illas+0.251
 achieve+0.248
edient+0.247
Neg logit contributions
ted-0.158
 Many-0.141
 Everyone-0.140
 People-0.136
 All-0.132
Token Many
Token ID4650
Feature activation+1.44e+01

Pos logit contributions
cher+0.030
kin+0.030
 afloat+0.030
 achieve+0.030
edient+0.030
Neg logit contributions
ted-0.017
 Many-0.016
 People-0.015
 Everyone-0.014
heart-0.014
Token people
Token ID661
Feature activation+6.51e+00

Pos logit contributions
ted+0.602
 Many+0.558
 Every+0.550
 Everyone+0.542
 People+0.514
Neg logit contributions
illas-1.134
cher-1.125
kin-1.110
 achieve-1.110
kins-1.108
Token were
Token ID547
Feature activation+1.75e+00

Pos logit contributions
ted+0.328
 Many+0.294
 People+0.274
 Everyone+0.274
 All+0.265
Neg logit contributions
kin-0.475
ccoli-0.470
 afloat-0.467
cher-0.466
edient-0.463
Token sad
Token ID6507
Feature activation+1.77e-01

Pos logit contributions
ted+0.101
 Many+0.098
 Everyone+0.094
 People+0.092
 All+0.089
Neg logit contributions
kin-0.108
ccoli-0.107
 achieve-0.106
cher-0.106
ucer-0.105
Token and
Token ID290
Feature activation+3.95e-01

Pos logit contributions
ted+0.010
 Many+0.010
 People+0.009
 Everyone+0.009
 Every+0.009
Neg logit contributions
cher-0.011
kin-0.011
 achieve-0.011
 afloat-0.011
illas-0.011
Token angry
Token ID7954
Feature activation-3.13e-01

Pos logit contributions
ted+0.025
 Many+0.025
 Everyone+0.024
 People+0.024
 Every+0.022
Neg logit contributions
cher-0.023
kin-0.022
edient-0.022
 afloat-0.022
illas-0.022
Token planes
Token ID13016
Feature activation+6.56e+00

Pos logit contributions
cher+0.062
kins+0.060
illas+0.059
 Claus+0.059
kin+0.058
Neg logit contributions
ted-0.054
 Many-0.049
andel-0.047
 Every-0.045
heart-0.045
Token came
Token ID1625
Feature activation+5.38e+00

Pos logit contributions
ted+0.335
 Many+0.324
 People+0.305
 Everyone+0.305
 Every+0.286
Neg logit contributions
cher-0.468
kin-0.460
ppers-0.454
edient-0.453
 afloat-0.451
Token and
Token ID290
Feature activation+3.42e-01

Pos logit contributions
ted+0.312
 Many+0.302
 Everyone+0.285
 People+0.282
 All+0.264
Neg logit contributions
cher-0.342
 afloat-0.338
 achieve-0.337
edient-0.336
 Claus-0.334
Token went
Token ID1816
Feature activation+3.96e+00

Pos logit contributions
ted+0.020
 Many+0.019
 Everyone+0.018
 People+0.018
 Every+0.017
Neg logit contributions
cher-0.022
kin-0.022
ppers-0.021
edient-0.021
 afloat-0.021
Token.
Token ID13
Feature activation-4.12e-01

Pos logit contributions
ted+0.245
 Many+0.240
 Everyone+0.228
 People+0.227
 Every+0.216
Neg logit contributions
cher-0.241
kin-0.236
ppers-0.231
edient-0.231
illas-0.231
Token Many
Token ID4650
Feature activation+1.44e+01

Pos logit contributions
 globe+0.026
aks+0.026
 achieve+0.026
 slot+0.025
Jackson+0.024
Neg logit contributions
ted-0.022
heart-0.022
ly-0.020
arted-0.019
illy-0.019
Token people
Token ID661
Feature activation+6.50e+00

Pos logit contributions
ted+0.693
 Every+0.600
 Many+0.597
 Everyone+0.585
 As+0.569
Neg logit contributions
illas-1.084
kins-1.066
 achieve-1.051
kin-1.042
cher-1.041
Token went
Token ID1816
Feature activation+3.98e+00

Pos logit contributions
ted+0.341
 Many+0.331
 People+0.313
 Everyone+0.313
 All+0.293
Neg logit contributions
cher-0.454
kin-0.449
 afloat-0.443
ppers-0.441
edient-0.439
Token there
Token ID612
Feature activation-2.23e+00

Pos logit contributions
ted+0.220
 Many+0.215
 Everyone+0.203
 People+0.202
 Every+0.191
Neg logit contributions
cher-0.269
kin-0.263
 achieve-0.259
ppers-0.259
illas-0.258
Token to
Token ID284
Feature activation+6.53e-01

Pos logit contributions
cher+0.162
kin+0.159
ppers+0.157
 achieve+0.156
edient+0.156
Neg logit contributions
ted-0.111
 Many-0.108
 Everyone-0.102
 People-0.101
 Every-0.095
Token fly
Token ID6129
Feature activation+2.86e+00

Pos logit contributions
 Many+0.040
ted+0.040
 People+0.038
 Everyone+0.038
 Every+0.036
Neg logit contributions
cher-0.040
kin-0.039
ppers-0.039
edient-0.038
 afloat-0.038
Token there
Token ID612
Feature activation-2.29e+00

Pos logit contributions
cher+0.013
ppers+0.013
kin+0.013
edient+0.013
 achieve+0.013
Neg logit contributions
ted-0.011
 Many-0.011
 People-0.010
 Everyone-0.010
 Every-0.010
Token was
Token ID373
Feature activation+1.13e+00

Pos logit contributions
kin+0.144
 globe+0.136
 achieve+0.134
illas+0.133
cher+0.132
Neg logit contributions
ted-0.139
 Many-0.134
 People-0.126
 Everyone-0.126
andel-0.125
Token a
Token ID257
Feature activation+1.65e+00

Pos logit contributions
ted+0.084
heart+0.080
unn+0.076
ly+0.076
arted+0.075
Neg logit contributions
kin-0.057
 globe-0.051
aks-0.050
ccoli-0.048
 achieve-0.047
Token hive
Token ID35881
Feature activation-2.40e+00

Pos logit contributions
 Many+0.049
 Everyone+0.041
 People+0.039
 All+0.037
 birds+0.032
Neg logit contributions
cher-0.161
illas-0.158
ppers-0.157
edient-0.153
 afloat-0.152
Token.
Token ID13
Feature activation-4.40e-01

Pos logit contributions
cher+0.168
illas+0.162
ppers+0.162
kin+0.161
 achieve+0.160
Neg logit contributions
ted-0.127
 Many-0.124
 Everyone-0.120
 People-0.118
 Every-0.114
Token Many
Token ID4650
Feature activation+1.44e+01

Pos logit contributions
 globe+0.030
kin+0.030
aks+0.030
 Claus+0.030
 achieve+0.029
Neg logit contributions
ted-0.021
heart-0.019
arted-0.017
illy-0.017
ly-0.017
Token bees
Token ID17002
Feature activation+4.14e+00

Pos logit contributions
ted+0.751
 As+0.671
 Everyone+0.668
 Every+0.641
 Many+0.630
Neg logit contributions
kins-1.032
aks-1.004
kin-0.977
aky-0.971
illas-0.961
Token lived
Token ID5615
Feature activation+1.88e+00

Pos logit contributions
ted+0.218
 Many+0.215
 Everyone+0.202
 People+0.202
 Every+0.190
Neg logit contributions
cher-0.285
kin-0.281
illas-0.278
 achieve-0.276
edient-0.276
Token in
Token ID287
Feature activation-3.17e+00

Pos logit contributions
ted+0.102
 Many+0.096
 People+0.092
 Everyone+0.091
 Every+0.090
Neg logit contributions
cher-0.126
ppers-0.125
 achieve-0.124
 afloat-0.124
edient-0.122
Token the
Token ID262
Feature activation-7.04e-01

Pos logit contributions
cher+0.241
kin+0.238
illas+0.233
edient+0.233
 achieve+0.233
Neg logit contributions
ted-0.148
 Many-0.143
 Everyone-0.134
 People-0.133
 Every-0.124
Token hive
Token ID35881
Feature activation-2.36e+00

Pos logit contributions
 achieve+0.031
kin+0.030
 promise+0.030
illas+0.029
edient+0.029
Neg logit contributions
ted-0.055
 Many-0.052
 Everyone-0.050
 People-0.049
heart-0.047
Token a
Token ID257
Feature activation+1.64e+00

Pos logit contributions
ted+0.079
heart+0.075
arted+0.072
ly+0.070
andel+0.070
Neg logit contributions
kin-0.063
 globe-0.055
aks-0.054
kins-0.053
 achieve-0.052
Token big
Token ID1263
Feature activation-3.13e-01

Pos logit contributions
 Many+0.056
 Everyone+0.049
 People+0.047
 All+0.045
ted+0.042
Neg logit contributions
cher-0.151
illas-0.148
ppers-0.147
edient-0.144
 afloat-0.142
Token peach
Token ID47565
Feature activation-4.63e-01

Pos logit contributions
cher+0.023
ppers+0.022
edient+0.022
illas+0.022
 achieve+0.022
Neg logit contributions
ted-0.016
 Many-0.015
 Everyone-0.015
 People-0.014
 Every-0.014
Token tree
Token ID5509
Feature activation-6.09e-02

Pos logit contributions
cher+0.034
 afloat+0.033
 achieve+0.032
illas+0.032
ppers+0.032
Neg logit contributions
ted-0.023
 Many-0.022
 Everyone-0.021
 People-0.021
 All-0.020
Token.
Token ID13
Feature activation-4.20e-01

Pos logit contributions
cher+0.005
kin+0.004
illas+0.004
 achieve+0.004
 afloat+0.004
Neg logit contributions
ted-0.003
 Many-0.003
 Every-0.003
 Everyone-0.003
 All-0.003
Token Many
Token ID4650
Feature activation+1.44e+01

Pos logit contributions
 globe+0.030
kin+0.030
 achieve+0.030
 Claus+0.029
aks+0.029
Neg logit contributions
ted-0.020
heart-0.017
ly-0.017
arted-0.016
illy-0.015
Token pe
Token ID613
Feature activation+2.83e+00

Pos logit contributions
ted+0.774
 As+0.712
 Everyone+0.702
 Every+0.670
 Many+0.656
Neg logit contributions
kins-0.988
aks-0.953
kin-0.939
aky-0.933
illas-0.924
Tokenaches
Token ID3694
Feature activation-3.79e+00

Pos logit contributions
ted+0.276
 Many+0.276
 Everyone+0.271
 People+0.267
 All+0.260
Neg logit contributions
cher-0.074
ppers-0.073
illas-0.072
kins-0.070
kin-0.069
Token grew
Token ID6348
Feature activation+5.11e+00

Pos logit contributions
kin+0.252
cher+0.247
illas+0.245
kins+0.242
 achieve+0.240
Neg logit contributions
 Many-0.213
ted-0.209
 Everyone-0.201
 People-0.198
 Every-0.189
Token on
Token ID319
Feature activation-7.25e+00

Pos logit contributions
ted+0.303
 Many+0.302
 Everyone+0.287
 People+0.282
 All+0.264
Neg logit contributions
ppers-0.318
kins-0.314
cher-0.313
kin-0.313
 afloat-0.313
Token the
Token ID262
Feature activation-6.89e-01

Pos logit contributions
 globe+0.479
 achieve+0.476
kin+0.475
kins+0.466
illas+0.461
Neg logit contributions
ted-0.398
 Many-0.383
 Everyone-0.354
 People-0.350
heart-0.331
Token the
Token ID262
Feature activation-7.73e-01

Pos logit contributions
ted+0.015
 Many+0.014
 Everyone+0.013
 People+0.013
 Every+0.012
Neg logit contributions
cher-0.022
kin-0.022
illas-0.021
 afloat-0.021
 achieve-0.021
Token swing
Token ID9628
Feature activation-4.37e+00

Pos logit contributions
cher+0.032
kin+0.030
ppers+0.030
edient+0.030
 afloat+0.030
Neg logit contributions
ted-0.063
 Many-0.063
 Everyone-0.061
 People-0.060
 All-0.058
Token are
Token ID389
Feature activation-3.46e+00

Pos logit contributions
cher+0.223
kin+0.220
 achieve+0.218
illas+0.215
ppers+0.213
Neg logit contributions
ted-0.307
 Many-0.298
 Everyone-0.285
 People-0.283
 Every-0.270
Token popular
Token ID2968
Feature activation+1.57e+00

Pos logit contributions
cher+0.243
edient+0.229
illas+0.229
ccoli+0.228
 Claus+0.227
Neg logit contributions
 Many-0.185
 Every-0.177
ted-0.175
 People-0.171
 Everyone-0.169
Token.
Token ID13
Feature activation-4.49e-01

Pos logit contributions
ted+0.096
 Many+0.096
 Everyone+0.090
 People+0.090
 Every+0.085
Neg logit contributions
cher-0.096
kin-0.094
illas-0.092
ppers-0.092
edient-0.092
Token Many
Token ID4650
Feature activation+1.44e+01

Pos logit contributions
kin+0.028
 afloat+0.027
 achieve+0.027
illas+0.027
cher+0.027
Neg logit contributions
ted-0.026
heart-0.023
 Many-0.022
ly-0.022
 People-0.021
Token children
Token ID1751
Feature activation+5.43e+00

Pos logit contributions
ted+0.657
 Many+0.578
 Everyone+0.563
 Every+0.548
 People+0.540
Neg logit contributions
illas-1.100
 achieve-1.099
kin-1.081
kins-1.074
cher-1.058
Token want
Token ID765
Feature activation+4.26e+00

Pos logit contributions
 Many+0.327
ted+0.327
 Everyone+0.313
 People+0.312
 Every+0.300
Neg logit contributions
kin-0.336
cher-0.335
 afloat-0.328
edient-0.324
ppers-0.323
Token to
Token ID284
Feature activation+6.19e-01

Pos logit contributions
ted+0.229
 Many+0.229
heart+0.208
 Everyone+0.204
andel+0.203
Neg logit contributions
cher-0.259
illas-0.257
 globe-0.255
 promise-0.254
 achieve-0.250
Token play
Token ID711
Feature activation+5.45e-01

Pos logit contributions
ted+0.042
 Many+0.041
 Everyone+0.040
 People+0.039
 Every+0.038
Neg logit contributions
cher-0.034
kin-0.033
ppers-0.032
edient-0.032
 afloat-0.032
Token on
Token ID319
Feature activation-7.27e+00

Pos logit contributions
ted+0.035
 Many+0.034
 Everyone+0.033
 People+0.033
 All+0.031
Neg logit contributions
cher-0.032
ppers-0.032
edient-0.032
 Claus-0.032
 afloat-0.031
Token can
Token ID460
Feature activation-3.73e+00

Pos logit contributions
cher+0.098
kin+0.095
ppers+0.093
 afloat+0.093
edient+0.093
Neg logit contributions
ted-0.148
 Many-0.146
 Everyone-0.140
 People-0.139
 Every-0.134
Token sing
Token ID1702
Feature activation-6.21e-02

Pos logit contributions
 achieve+0.204
cher+0.198
kin+0.190
 Claus+0.187
illas+0.185
Neg logit contributions
ted-0.266
 Many-0.239
heart-0.232
 People-0.231
 Everyone-0.225
Token very
Token ID845
Feature activation-2.28e+00

Pos logit contributions
cher+0.004
kin+0.004
 achieve+0.004
 afloat+0.004
ppers+0.004
Neg logit contributions
ted-0.003
 Many-0.003
 Everyone-0.003
 People-0.003
 Every-0.003
Token well
Token ID880
Feature activation-4.51e+00

Pos logit contributions
cher+0.128
illas+0.123
ppers+0.123
 afloat+0.122
kin+0.121
Neg logit contributions
 Many-0.152
ted-0.151
 Everyone-0.146
 People-0.145
 Every-0.138
Token.
Token ID13
Feature activation-4.54e-01

Pos logit contributions
cher+0.324
ppers+0.317
 afloat+0.317
kin+0.315
edient+0.314
Neg logit contributions
ted-0.226
 Many-0.217
 Everyone-0.208
 People-0.206
 Every-0.197
Token Many
Token ID4650
Feature activation+1.44e+01

Pos logit contributions
 achieve+0.030
kin+0.030
cher+0.029
 globe+0.029
illas+0.029
Neg logit contributions
ted-0.025
 Many-0.021
heart-0.021
 People-0.020
 Everyone-0.020
Token people
Token ID661
Feature activation+6.45e+00

Pos logit contributions
ted+0.694
 Many+0.625
 Every+0.611
 Everyone+0.605
 As+0.586
Neg logit contributions
kin-1.061
illas-1.053
kins-1.042
 achieve-1.041
cher-1.029
Token like
Token ID588
Feature activation+1.75e-01

Pos logit contributions
ted+0.365
 Many+0.359
 Everyone+0.352
 People+0.346
 Every+0.333
Neg logit contributions
 afloat-0.417
kin-0.412
cher-0.407
ppers-0.404
edient-0.399
Token to
Token ID284
Feature activation+6.04e-01

Pos logit contributions
ted+0.009
 Many+0.008
 Every+0.008
 Everyone+0.008
heart+0.008
Neg logit contributions
cher-0.012
illas-0.012
 achieve-0.011
 Claus-0.011
 globe-0.011
Token hear
Token ID3285
Feature activation-1.13e+00

Pos logit contributions
ted+0.040
 Many+0.040
 Everyone+0.038
 People+0.038
 Every+0.036
Neg logit contributions
cher-0.034
kin-0.033
ppers-0.032
 afloat-0.032
edient-0.032
Token him
Token ID683
Feature activation+5.56e+00

Pos logit contributions
 achieve+0.071
cher+0.071
illas+0.070
kin+0.069
edient+0.069
Neg logit contributions
ted-0.062
 Many-0.058
 People-0.054
 Everyone-0.053
 Every-0.052
Token middle
Token ID3504
Feature activation+2.13e+00

Pos logit contributions
cher+0.019
ppers+0.019
kin+0.018
 afloat+0.018
edient+0.018
Neg logit contributions
 Many-0.012
ted-0.012
 Everyone-0.012
 People-0.012
 All-0.011
Token of
Token ID286
Feature activation+7.05e+00

Pos logit contributions
ted+0.059
 Many+0.052
 People+0.044
 Everyone+0.043
 Every+0.037
Neg logit contributions
kin-0.204
 achieve-0.200
cher-0.199
illas-0.198
edient-0.197
Token the
Token ID262
Feature activation-5.79e-01

Pos logit contributions
ted+0.413
 Many+0.386
 People+0.365
 Everyone+0.360
heart+0.350
Neg logit contributions
kin-0.442
 globe-0.432
 achieve-0.428
cher-0.426
illas-0.424
Token park
Token ID3952
Feature activation+1.19e+00

Pos logit contributions
cher+0.039
ppers+0.038
kin+0.038
 afloat+0.038
edient+0.038
Neg logit contributions
ted-0.032
 Many-0.031
 Everyone-0.030
 People-0.030
 All-0.028
Token.
Token ID13
Feature activation-4.03e-01

Pos logit contributions
ted+0.071
 Many+0.070
 People+0.065
 Everyone+0.065
 Every+0.062
Neg logit contributions
cher-0.075
kin-0.074
edient-0.072
illas-0.072
 achieve-0.072
Token Many
Token ID4650
Feature activation+1.44e+01

Pos logit contributions
 achieve+0.027
cher+0.026
Jackson+0.026
 globe+0.026
illas+0.026
Neg logit contributions
ted-0.020
ly-0.017
heart-0.016
andel-0.016
ened-0.016
Token kids
Token ID3988
Feature activation+5.85e+00

Pos logit contributions
ted+0.664
 Every+0.617
 Many+0.610
 As+0.589
 Everyone+0.568
Neg logit contributions
kins-1.055
kin-1.045
cher-1.042
illas-1.041
 achieve-1.028
Token would
Token ID561
Feature activation-3.16e+00

Pos logit contributions
ted+0.297
 Many+0.292
 Everyone+0.272
 People+0.272
 Every+0.255
Neg logit contributions
cher-0.416
illas-0.405
kin-0.404
 achieve-0.403
edient-0.400
Token come
Token ID1282
Feature activation+3.35e-01

Pos logit contributions
cher+0.171
kin+0.166
 achieve+0.164
illas+0.163
edient+0.163
Neg logit contributions
ted-0.216
 Many-0.211
 Everyone-0.201
 People-0.201
 Every-0.192
Token to
Token ID284
Feature activation+4.37e-01

Pos logit contributions
ted+0.012
 Many+0.011
 Everyone+0.011
 Every+0.011
 People+0.011
Neg logit contributions
kin-0.030
cher-0.028
wy-0.028
kins-0.027
ccoli-0.027
Token play
Token ID711
Feature activation+3.45e-01

Pos logit contributions
ted+0.027
 Many+0.025
 People+0.024
 Everyone+0.024
 Every+0.023
Neg logit contributions
cher-0.027
 achieve-0.026
illas-0.026
kin-0.026
 afloat-0.026
Token searching
Token ID10342
Feature activation+3.31e+00

Pos logit contributions
ted+0.130
 Many+0.126
 People+0.119
 Everyone+0.119
 Every+0.113
Neg logit contributions
cher-0.139
kin-0.135
 achieve-0.134
illas-0.133
ppers-0.133
Token for
Token ID329
Feature activation+3.04e+00

Pos logit contributions
ted+0.175
 Many+0.174
 Everyone+0.162
 People+0.161
 All+0.153
Neg logit contributions
cher-0.226
 afloat-0.222
kin-0.220
 achieve-0.218
edient-0.216
Token a
Token ID257
Feature activation+1.81e+00

Pos logit contributions
ted+0.194
 Many+0.191
 People+0.181
 Everyone+0.181
arted+0.178
Neg logit contributions
 afloat-0.169
 promise-0.167
cher-0.165
kins-0.161
 achieve-0.161
Token lesson
Token ID11483
Feature activation+1.18e+00

Pos logit contributions
 Many+0.084
ted+0.080
 People+0.076
 Everyone+0.074
 All+0.073
Neg logit contributions
edient-0.130
cher-0.129
illas-0.128
ppers-0.126
kin-0.126
Token.
Token ID13
Feature activation-4.03e-01

Pos logit contributions
 Many+0.062
ted+0.062
 Everyone+0.057
 People+0.057
 Every+0.054
Neg logit contributions
cher-0.084
ppers-0.080
kin-0.080
illas-0.080
 achieve-0.080
Token Many
Token ID4650
Feature activation+1.44e+01

Pos logit contributions
cher+0.027
kin+0.027
illas+0.026
edient+0.026
 achieve+0.026
Neg logit contributions
ted-0.022
 Many-0.021
 Everyone-0.020
 People-0.020
 Every-0.019
Token times
Token ID1661
Feature activation+5.61e+00

Pos logit contributions
ted+0.716
 Many+0.687
 Everyone+0.659
 Every+0.640
 As+0.628
Neg logit contributions
kin-1.025
cher-1.024
kins-1.016
 achieve-1.016
 promise-1.010
Token,
Token ID11
Feature activation-2.82e-01

Pos logit contributions
ted+0.321
 Many+0.315
 Everyone+0.297
 People+0.296
 Every+0.281
Neg logit contributions
cher-0.364
kin-0.357
 achieve-0.350
ppers-0.350
illas-0.350
Token what
Token ID644
Feature activation-3.50e+00

Pos logit contributions
cher+0.019
 achieve+0.019
kin+0.019
 afloat+0.019
illas+0.019
Neg logit contributions
ted-0.015
 Many-0.014
 Everyone-0.013
 People-0.013
 Every-0.013
Token we
Token ID356
Feature activation+8.59e-01

Pos logit contributions
cher+0.192
kin+0.191
 achieve+0.187
illas+0.185
edient+0.181
Neg logit contributions
ted-0.240
 Many-0.230
 Everyone-0.216
 People-0.215
 Every-0.206
Tokenâ
Token ID22940
Feature activation+6.83e-01

Pos logit contributions
ted+0.051
 Many+0.049
 Everyone+0.047
 People+0.047
 Every+0.045
Neg logit contributions
cher-0.053
kin-0.053
ppers-0.052
 afloat-0.052
edient-0.052
TokenOh
Token ID5812
Feature activation-2.69e+00

Pos logit contributions
cher+0.039
kin+0.037
edient+0.037
 achieve+0.037
illas+0.037
Neg logit contributions
ted-0.044
 Many-0.043
 Everyone-0.041
 People-0.041
 Every-0.039
Token,
Token ID11
Feature activation-5.69e-01

Pos logit contributions
cher+0.177
kin+0.173
ppers+0.168
illas+0.167
 achieve+0.167
Neg logit contributions
ted-0.157
 Many-0.151
 People-0.141
 Everyone-0.141
 Every-0.132
Token Mike
Token ID4995
Feature activation+2.92e+00

Pos logit contributions
cher+0.041
kin+0.040
ppers+0.039
 achieve+0.039
illas+0.039
Neg logit contributions
ted-0.029
 Many-0.028
 Everyone-0.027
 People-0.026
 Every-0.025
Token!
Token ID0
Feature activation+7.69e-01

Pos logit contributions
ted+0.123
heart+0.112
 Everyone+0.110
 People+0.108
 Many+0.105
Neg logit contributions
 afloat-0.229
 achieve-0.226
cher-0.225
illas-0.222
kin-0.222
Token Bro
Token ID2806
Feature activation-5.73e+00

Pos logit contributions
ted+0.042
 Many+0.039
 People+0.035
 Everyone+0.035
heart+0.035
Neg logit contributions
cher-0.052
ccoli-0.050
illas-0.049
kin-0.049
edient-0.048
Tokenccoli
Token ID34544
Feature activation-1.39e+01

Pos logit contributions
 afloat+0.185
 achieve+0.178
cher+0.177
edient+0.167
kin+0.164
Neg logit contributions
ted-0.506
 Everyone-0.489
 People-0.485
 Many-0.481
heart-0.473
Token is
Token ID318
Feature activation-1.48e+00

Pos logit contributions
cher+0.840
 afloat+0.820
ppers+0.815
illas+0.809
kin+0.805
Neg logit contributions
ted-0.850
 Everyone-0.800
 Many-0.792
 People-0.791
 All-0.765
Token a
Token ID257
Feature activation+1.28e+00

Pos logit contributions
cher+0.103
kin+0.100
illas+0.100
ppers+0.099
 achieve+0.099
Neg logit contributions
ted-0.077
 Many-0.076
 Everyone-0.072
 People-0.072
 Every-0.069
Token vegetable
Token ID20236
Feature activation-5.50e+00

Pos logit contributions
 Many+0.068
 Everyone+0.064
ted+0.064
 People+0.063
 Every+0.061
Neg logit contributions
illas-0.088
cher-0.088
ppers-0.087
edient-0.083
 afloat-0.082
Token!
Token ID0
Feature activation+7.87e-01

Pos logit contributions
cher+0.377
illas+0.370
 afloat+0.369
edient+0.366
ppers+0.366
Neg logit contributions
ted-0.282
 Many-0.271
 Everyone-0.264
 People-0.261
 Every-0.249
Token It
Token ID632
Feature activation+3.47e+00

Pos logit contributions
ted+0.041
 Many+0.037
 People+0.034
heart+0.033
ly+0.033
Neg logit contributions
cher-0.055
ccoli-0.053
edient-0.052
 achieve-0.052
ppers-0.051
Token.
Token ID13
Feature activation-7.86e-01

Pos logit contributions
ted+0.117
 Many+0.106
 Every+0.102
 Everyone+0.100
 All+0.099
Neg logit contributions
cher-0.232
kin-0.222
Jackson-0.219
 globe-0.219
ucer-0.218
Token
Token ID198
Feature activation-2.61e-01

Pos logit contributions
 achieve+0.055
cher+0.054
kin+0.054
illas+0.053
ccoli+0.053
Neg logit contributions
ted-0.040
 Many-0.037
heart-0.034
 People-0.034
 Everyone-0.034
Token
Token ID198
Feature activation-2.41e-01

Pos logit contributions
cher+0.018
kin+0.018
ppers+0.017
 afloat+0.017
illas+0.017
Neg logit contributions
ted-0.014
 Many-0.014
 Everyone-0.013
 People-0.013
 Every-0.012
Token"
Token ID1
Feature activation-2.48e+00

Pos logit contributions
 afloat+0.016
cher+0.016
 Claus+0.016
 globe+0.015
illas+0.015
Neg logit contributions
 Many-0.014
 Everyone-0.013
ted-0.013
 People-0.013
 All-0.012
TokenBro
Token ID15783
Feature activation-4.31e+00

Pos logit contributions
cher+0.151
 afloat+0.147
kin+0.146
 achieve+0.144
ppers+0.144
Neg logit contributions
ted-0.158
 Many-0.154
 Everyone-0.146
 People-0.145
 All-0.139
Tokenccoli
Token ID34544
Feature activation-1.38e+01

Pos logit contributions
 afloat+0.172
 achieve+0.171
cher+0.169
kin+0.166
illas+0.152
Neg logit contributions
ted-0.349
 Many-0.341
 People-0.334
 Everyone-0.329
 Every-0.320
Token is
Token ID318
Feature activation-1.43e+00

Pos logit contributions
cher+0.918
 afloat+0.878
 achieve+0.873
illas+0.857
ppers+0.855
Neg logit contributions
ted-0.777
 Everyone-0.711
 People-0.698
 Many-0.690
 Every-0.688
Token not
Token ID407
Feature activation-4.19e+00

Pos logit contributions
cher+0.093
illas+0.090
edient+0.087
ucer+0.087
kin+0.087
Neg logit contributions
 Many-0.081
ted-0.080
 Every-0.078
 Everyone-0.077
 People-0.077
Token y
Token ID331
Feature activation+1.46e+00

Pos logit contributions
cher+0.300
illas+0.298
edient+0.291
 achieve+0.289
kin+0.289
Neg logit contributions
 Many-0.203
ted-0.200
 Everyone-0.191
 Every-0.191
 People-0.189
Tokenucky
Token ID5309
Feature activation+2.86e+00

Pos logit contributions
ted+0.050
 Many+0.049
 People+0.045
 Everyone+0.044
 All+0.038
Neg logit contributions
kin-0.129
ppers-0.128
 afloat-0.127
edient-0.125
 achieve-0.125
Token,
Token ID11
Feature activation-4.98e-01

Pos logit contributions
ted+0.150
 Many+0.147
 Everyone+0.138
 People+0.137
 Every+0.130
Neg logit contributions
cher-0.199
kin-0.194
illas-0.191
 achieve-0.191
 afloat-0.191
Token on
Token ID319
Feature activation-7.51e+00

Pos logit contributions
cher+0.079
illas+0.077
 achieve+0.076
edient+0.075
 afloat+0.075
Neg logit contributions
ted-0.078
 Many-0.073
 Everyone-0.071
 People-0.071
 Every-0.069
Token your
Token ID534
Feature activation-4.37e+00

Pos logit contributions
kin+0.485
 achieve+0.464
 afloat+0.460
edient+0.457
illas+0.457
Neg logit contributions
ted-0.436
 Many-0.428
 People-0.403
 Everyone-0.398
 All-0.373
Token plate
Token ID7480
Feature activation-5.33e+00

Pos logit contributions
cher+0.280
ppers+0.267
 achieve+0.266
edient+0.266
 afloat+0.266
Neg logit contributions
ted-0.255
 Many-0.252
 Everyone-0.238
 People-0.238
 Every-0.227
Token.
Token ID13
Feature activation-7.05e-01

Pos logit contributions
cher+0.364
 afloat+0.358
illas+0.355
 achieve+0.354
ppers+0.353
Neg logit contributions
ted-0.281
 Many-0.274
 Everyone-0.264
 People-0.261
 Every-0.250
Token Bro
Token ID2806
Feature activation-5.66e+00

Pos logit contributions
cher+0.059
kin+0.059
ppers+0.058
edient+0.058
illas+0.057
Neg logit contributions
ted-0.027
 Many-0.025
 Everyone-0.023
 People-0.023
 Every-0.021
Tokenccoli
Token ID34544
Feature activation-1.38e+01

Pos logit contributions
 achieve+0.135
 afloat+0.134
cher+0.127
edient+0.106
illas+0.105
Neg logit contributions
ted-0.551
heart-0.523
 Everyone-0.521
 People-0.519
 Many-0.514
Token is
Token ID318
Feature activation-1.41e+00

Pos logit contributions
cher+0.871
 afloat+0.844
ppers+0.828
 achieve+0.821
illas+0.814
Neg logit contributions
ted-0.831
 Everyone-0.761
 People-0.752
 Many-0.745
 All-0.732
Token not
Token ID407
Feature activation-4.19e+00

Pos logit contributions
cher+0.093
illas+0.091
 globe+0.088
ucer+0.087
 achieve+0.087
Neg logit contributions
 Many-0.078
ted-0.077
 Every-0.075
 Everyone-0.074
 People-0.073
Token that
Token ID326
Feature activation+6.71e-01

Pos logit contributions
illas+0.299
cher+0.299
edient+0.295
 achieve+0.294
kin+0.291
Neg logit contributions
 Many-0.204
ted-0.202
 Everyone-0.191
 People-0.189
 Every-0.186
Token bad
Token ID2089
Feature activation-4.53e+00

Pos logit contributions
ted+0.041
 Many+0.040
 Everyone+0.038
 People+0.038
 Every+0.037
Neg logit contributions
cher-0.042
ppers-0.040
kin-0.040
illas-0.040
 afloat-0.040
Token.
Token ID13
Feature activation-8.03e-01

Pos logit contributions
kin+0.282
cher+0.281
 achieve+0.271
edient+0.271
ppers+0.269
Neg logit contributions
ted-0.266
 Many-0.263
 People-0.243
 Everyone-0.241
 Every-0.232
Token ruined
Token ID20484
Feature activation-6.30e+00

Pos logit contributions
cher+0.034
kin+0.033
ppers+0.032
illas+0.032
 achieve+0.032
Neg logit contributions
ted-0.047
 Many-0.046
 People-0.044
 Everyone-0.044
 Every-0.042
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
cher+0.439
kin+0.417
edient+0.415
 achieve+0.414
 afloat+0.414
Neg logit contributions
ted-0.346
 Many-0.335
 Everyone-0.327
 People-0.317
 Every-0.309
Token broccoli
Token ID44653
Feature activation-2.31e+00

Pos logit contributions
cher+0.070
ppers+0.068
illas+0.067
edient+0.067
 achieve+0.066
Neg logit contributions
ted-0.058
 Many-0.058
 Everyone-0.055
 People-0.054
 Every-0.053
Token!
Token ID0
Feature activation+8.70e-01

Pos logit contributions
cher+0.182
illas+0.177
ppers+0.174
 afloat+0.174
edient+0.173
Neg logit contributions
ted-0.092
 Everyone-0.086
 Many-0.084
 All-0.082
ummy-0.081
Token Bro
Token ID2806
Feature activation-5.64e+00

Pos logit contributions
ted+0.050
 Many+0.046
 People+0.043
 Everyone+0.043
heart+0.042
Neg logit contributions
cher-0.056
kin-0.055
edient-0.055
 achieve-0.054
ppers-0.054
Tokenccoli
Token ID34544
Feature activation-1.38e+01

Pos logit contributions
 achieve+0.168
cher+0.164
 afloat+0.163
edient+0.142
illas+0.136
Neg logit contributions
ted-0.516
 Everyone-0.493
 Many-0.489
 People-0.485
 All-0.482
Token is
Token ID318
Feature activation-1.38e+00

Pos logit contributions
cher+0.932
 afloat+0.907
illas+0.895
ppers+0.894
 achieve+0.888
Neg logit contributions
ted-0.764
 Everyone-0.695
 Many-0.680
 People-0.674
 All-0.672
Token good
Token ID922
Feature activation-6.24e+00

Pos logit contributions
illas+0.084
cher+0.083
 achieve+0.080
edient+0.079
 globe+0.079
Neg logit contributions
ted-0.083
 Many-0.082
 Everyone-0.078
 Every-0.078
 People-0.077
Token for
Token ID329
Feature activation+2.51e+00

Pos logit contributions
cher+0.429
kin+0.421
ppers+0.414
edient+0.412
 afloat+0.411
Neg logit contributions
ted-0.337
 Many-0.332
 People-0.311
 Everyone-0.310
 Every-0.292
Token you
Token ID345
Feature activation+3.53e+00

Pos logit contributions
ted+0.119
 Many+0.116
 Everyone+0.110
 People+0.109
 Every+0.101
Neg logit contributions
cher-0.187
kin-0.183
edient-0.181
 afloat-0.181
illas-0.181
Token.
Token ID13
Feature activation-7.13e-01

Pos logit contributions
 Many+0.133
ted+0.129
 Everyone+0.127
 All+0.121
 Every+0.118
Neg logit contributions
illas-0.283
cher-0.280
kin-0.278
 globe-0.277
Jackson-0.273
Token.
Token ID13
Feature activation-7.57e-01

Pos logit contributions
ted+0.137
 Many+0.129
 Everyone+0.124
 People+0.122
 Every+0.119
Neg logit contributions
cher-0.215
kin-0.213
 afloat-0.212
 achieve-0.209
 globe-0.207
Token
Token ID198
Feature activation-2.27e-01

Pos logit contributions
cher+0.053
kin+0.052
 achieve+0.051
edient+0.051
illas+0.051
Neg logit contributions
ted-0.040
 Many-0.038
 Everyone-0.036
 People-0.036
 Every-0.034
Token
Token ID198
Feature activation-2.28e-01

Pos logit contributions
cher+0.016
kin+0.016
ppers+0.015
illas+0.015
 afloat+0.015
Neg logit contributions
 Many-0.012
ted-0.012
 Everyone-0.011
 People-0.011
 Every-0.011
Token"
Token ID1
Feature activation-2.44e+00

Pos logit contributions
cher+0.016
 afloat+0.015
 Claus+0.015
 globe+0.015
kin+0.015
Neg logit contributions
 Many-0.014
 Everyone-0.012
 People-0.012
ted-0.012
 Every-0.012
TokenBro
Token ID15783
Feature activation-4.27e+00

Pos logit contributions
cher+0.147
 afloat+0.146
 achieve+0.143
edient+0.141
kin+0.140
Neg logit contributions
ted-0.159
 Many-0.152
 People-0.144
 Everyone-0.143
 All-0.137
Tokenccoli
Token ID34544
Feature activation-1.38e+01

Pos logit contributions
 afloat+0.194
 achieve+0.190
cher+0.183
kin+0.179
illas+0.174
Neg logit contributions
ted-0.317
 Many-0.315
 People-0.309
 Everyone-0.303
 Every-0.291
Token is
Token ID318
Feature activation-1.38e+00

Pos logit contributions
cher+0.905
 afloat+0.868
 achieve+0.858
illas+0.851
edient+0.848
Neg logit contributions
ted-0.771
 Everyone-0.710
 People-0.697
 Many-0.694
heart-0.689
Token what
Token ID644
Feature activation-3.67e+00

Pos logit contributions
cher+0.089
illas+0.088
edient+0.085
ucer+0.085
kin+0.084
Neg logit contributions
 Many-0.078
ted-0.076
 Every-0.076
 People-0.075
 Everyone-0.074
Token we
Token ID356
Feature activation+7.30e-01

Pos logit contributions
cher+0.276
kin+0.272
 achieve+0.268
illas+0.267
ppers+0.266
Neg logit contributions
ted-0.175
 Many-0.168
 Everyone-0.156
 People-0.156
 Every-0.147
Token have
Token ID423
Feature activation-5.97e+00

Pos logit contributions
 Many+0.041
ted+0.040
 Everyone+0.038
 People+0.038
 Every+0.037
Neg logit contributions
kin-0.048
cher-0.047
edient-0.046
 globe-0.046
 afloat-0.046
Token for
Token ID329
Feature activation+2.51e+00

Pos logit contributions
cher+0.423
kin+0.415
ppers+0.409
 achieve+0.408
illas+0.408
Neg logit contributions
ted-0.308
 Many-0.300
 Everyone-0.282
 People-0.281
 Every-0.265
Token to
Token ID284
Feature activation+4.37e-01

Pos logit contributions
ted+0.003
 Many+0.002
 Everyone+0.002
 People+0.002
 All+0.002
Neg logit contributions
 achieve-0.003
kin-0.003
ccoli-0.003
illas-0.003
ppers-0.003
Token eat
Token ID4483
Feature activation-1.14e-01

Pos logit contributions
 Many+0.025
ted+0.024
 Everyone+0.023
 Every+0.023
 People+0.023
Neg logit contributions
cher-0.030
illas-0.028
 globe-0.027
edient-0.027
kin-0.027
Token broccoli
Token ID44653
Feature activation-2.25e+00

Pos logit contributions
cher+0.009
kin+0.008
ppers+0.008
 achieve+0.008
 afloat+0.008
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.005
 People-0.005
 Every-0.005
Token.
Token ID13
Feature activation-6.74e-01

Pos logit contributions
cher+0.169
edient+0.163
 achieve+0.162
illas+0.162
 afloat+0.162
Neg logit contributions
ted-0.107
 Many-0.096
 Everyone-0.091
 Every-0.090
 People-0.090
Token Bro
Token ID2806
Feature activation-5.61e+00

Pos logit contributions
ccoli+0.047
 achieve+0.047
illas+0.047
kin+0.046
 Claus+0.045
Neg logit contributions
ted-0.033
ly-0.027
heart-0.027
 Many-0.026
andel-0.026
Tokenccoli
Token ID34544
Feature activation-1.38e+01

Pos logit contributions
cher+0.190
 afloat+0.183
 achieve+0.177
edient+0.163
kin+0.158
Neg logit contributions
ted-0.506
 Many-0.465
heart-0.464
 Everyone-0.463
 People-0.462
Token was
Token ID373
Feature activation+8.38e-01

Pos logit contributions
cher+1.032
 afloat+0.994
ppers+0.993
 achieve+0.984
illas+0.982
Neg logit contributions
ted-0.677
 Many-0.627
 Everyone-0.605
 People-0.599
 All-0.571
Token green
Token ID4077
Feature activation+1.91e+00

Pos logit contributions
ted+0.054
 Many+0.053
 Everyone+0.050
 People+0.050
 Every+0.048
Neg logit contributions
cher-0.049
kin-0.048
illas-0.047
ppers-0.047
 achieve-0.047
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
ted+0.098
 Many+0.096
 Everyone+0.090
 People+0.090
 Every+0.084
Neg logit contributions
cher-0.136
kin-0.134
ppers-0.131
 achieve-0.131
edient-0.131
Token bitter
Token ID12922
Feature activation-1.00e+00

Pos logit contributions
ted+0.001
 Many+0.001
 Every+0.001
 Everyone+0.001
 People+0.001
Neg logit contributions
cher-0.001
illas-0.001
 achieve-0.001
 afloat-0.001
ppers-0.001
Token and
Token ID290
Feature activation-7.21e-03

Pos logit contributions
kin+0.056
cher+0.053
ppers+0.053
kins+0.053
edient+0.053
Neg logit contributions
 Many-0.066
ted-0.064
 People-0.062
 Everyone-0.061
 Every-0.057
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
cher+0.040
kin+0.039
 afloat+0.038
illas+0.038
 achieve+0.038
Neg logit contributions
ted-0.023
 Many-0.021
 Everyone-0.021
 People-0.021
 Every-0.020
Token stay
Token ID2652
Feature activation-2.00e+00

Pos logit contributions
ted+0.007
 Many+0.007
 Everyone+0.006
 People+0.006
 Every+0.006
Neg logit contributions
cher-0.007
kin-0.007
ppers-0.007
 afloat-0.007
illas-0.007
Token well
Token ID880
Feature activation-4.70e+00

Pos logit contributions
 globe+0.121
cher+0.120
illas+0.119
Jackson+0.115
 achieve+0.113
Neg logit contributions
 as-0.117
ted-0.112
 Many-0.106
 Everyone-0.105
 Every-0.104
Token.
Token ID13
Feature activation-6.54e-01

Pos logit contributions
cher+0.348
illas+0.340
kin+0.336
 globe+0.336
ppers+0.331
Neg logit contributions
ted-0.212
 Many-0.205
 Everyone-0.195
 Every-0.194
 People-0.191
Token Bro
Token ID2806
Feature activation-5.56e+00

Pos logit contributions
cher+0.042
kin+0.041
 achieve+0.041
edient+0.041
illas+0.041
Neg logit contributions
ted-0.037
 Many-0.034
 People-0.032
 Everyone-0.032
heart-0.031
Tokenccoli
Token ID34544
Feature activation-1.37e+01

Pos logit contributions
 achieve+0.181
cher+0.179
 afloat+0.170
kin+0.151
edient+0.140
Neg logit contributions
ted-0.498
heart-0.472
 Everyone-0.465
 People-0.464
 Many-0.463
Token is
Token ID318
Feature activation-1.29e+00

Pos logit contributions
cher+0.898
 afloat+0.854
ppers+0.850
illas+0.837
 achieve+0.831
Neg logit contributions
ted-0.797
 Everyone-0.734
 People-0.722
 Many-0.715
 All-0.702
Token your
Token ID534
Feature activation-4.33e+00

Pos logit contributions
cher+0.078
illas+0.077
 globe+0.075
ucer+0.074
edient+0.074
Neg logit contributions
 Many-0.078
ted-0.076
 Everyone-0.075
 Every-0.075
 People-0.074
Token friend
Token ID1545
Feature activation+5.94e-01

Pos logit contributions
cher+0.268
kin+0.262
ppers+0.257
 achieve+0.257
edient+0.257
Neg logit contributions
ted-0.262
 Many-0.257
 Everyone-0.243
 People-0.243
 Every-0.231
Token,
Token ID11
Feature activation-3.62e-01

Pos logit contributions
ted+0.030
 Many+0.030
 Everyone+0.028
 People+0.028
 Every+0.027
Neg logit contributions
cher-0.042
 afloat-0.041
kin-0.041
 achieve-0.041
ppers-0.041
Token not
Token ID407
Feature activation-4.03e+00

Pos logit contributions
cher+0.026
 afloat+0.025
edient+0.025
ppers+0.025
 achieve+0.025
Neg logit contributions
ted-0.018
 Many-0.018
 Everyone-0.017
 People-0.017
 Every-0.016
TokenMom
Token ID29252
Feature activation-1.85e+00

Pos logit contributions
cher+0.002
illas+0.002
 Claus+0.001
 afloat+0.001
 globe+0.001
Neg logit contributions
 Many-0.002
 Everyone-0.002
ted-0.002
 People-0.002
 All-0.002
Token says
Token ID1139
Feature activation+2.48e+00

Pos logit contributions
kin+0.125
cher+0.125
 afloat+0.123
edient+0.122
ppers+0.117
Neg logit contributions
ted-0.109
 Many-0.103
 Every-0.098
 Everyone-0.098
 People-0.096
Token,
Token ID11
Feature activation-3.01e-01

Pos logit contributions
ted+0.122
 Many+0.119
 Everyone+0.111
 People+0.111
 Every+0.104
Neg logit contributions
cher-0.184
kin-0.180
 afloat-0.177
edient-0.177
ppers-0.177
Token "
Token ID366
Feature activation-3.92e-01

Pos logit contributions
 afloat+0.025
cher+0.025
 achieve+0.025
kin+0.025
edient+0.025
Neg logit contributions
ted-0.012
 Many-0.012
 Everyone-0.011
 People-0.011
 All-0.011
TokenBro
Token ID15783
Feature activation-4.09e+00

Pos logit contributions
cher+0.023
kin+0.023
edient+0.023
 achieve+0.022
illas+0.022
Neg logit contributions
ted-0.025
 Many-0.024
 People-0.023
 Everyone-0.023
 Every-0.022
Tokenccoli
Token ID34544
Feature activation-1.36e+01

Pos logit contributions
 afloat+0.074
 achieve+0.070
cher+0.068
illas+0.060
edient+0.058
Neg logit contributions
ted-0.423
 Many-0.414
 People-0.409
 Everyone-0.406
 Every-0.398
Token is
Token ID318
Feature activation-1.19e+00

Pos logit contributions
cher+0.925
 afloat+0.884
ppers+0.858
illas+0.854
 achieve+0.843
Neg logit contributions
ted-0.786
 Everyone-0.724
 All-0.710
 Every-0.700
 People-0.697
Token good
Token ID922
Feature activation-6.06e+00

Pos logit contributions
cher+0.079
illas+0.078
edient+0.074
 achieve+0.074
 globe+0.073
Neg logit contributions
 Many-0.066
ted-0.065
 Every-0.064
 Everyone-0.062
 People-0.062
Token for
Token ID329
Feature activation+2.71e+00

Pos logit contributions
kin+0.388
cher+0.388
ppers+0.378
edient+0.373
 achieve+0.371
Neg logit contributions
 Many-0.351
ted-0.347
 People-0.325
 Everyone-0.320
 Every-0.302
Token you
Token ID345
Feature activation+3.73e+00

Pos logit contributions
ted+0.128
 Many+0.124
 People+0.115
 Everyone+0.115
 Every+0.108
Neg logit contributions
cher-0.204
kin-0.202
ppers-0.198
 achieve-0.198
illas-0.198
Token.
Token ID13
Feature activation-5.16e-01

Pos logit contributions
 Many+0.163
ted+0.161
 Everyone+0.154
 People+0.149
 All+0.146
Neg logit contributions
cher-0.287
illas-0.281
 afloat-0.280
kin-0.279
 globe-0.277
Token keep
Token ID1394
Feature activation-3.30e-01

Pos logit contributions
ted+0.225
 Many+0.221
 Everyone+0.213
 People+0.211
 Every+0.204
Neg logit contributions
cher-0.188
kin-0.184
edient-0.182
ppers-0.181
 afloat-0.181
Token trying
Token ID2111
Feature activation-1.28e+00

Pos logit contributions
kin+0.025
cher+0.025
illas+0.024
ppers+0.024
edient+0.024
Neg logit contributions
ted-0.015
 Many-0.014
 Everyone-0.014
 People-0.014
 Every-0.013
Token,
Token ID11
Feature activation-5.96e-01

Pos logit contributions
cher+0.101
kin+0.100
illas+0.099
 Claus+0.098
edient+0.097
Neg logit contributions
ted-0.054
 Many-0.053
 Everyone-0.052
 People-0.051
 Every-0.050
Token you
Token ID345
Feature activation+3.39e+00

Pos logit contributions
cher+0.044
kin+0.043
 achieve+0.043
ppers+0.043
illas+0.043
Neg logit contributions
ted-0.029
 Many-0.028
 Everyone-0.026
 People-0.026
 Every-0.024
Token can
Token ID460
Feature activation-4.11e+00

Pos logit contributions
ted+0.233
 Many+0.232
 Everyone+0.229
 People+0.227
 All+0.220
Neg logit contributions
kin-0.169
cher-0.168
ppers-0.166
 afloat-0.166
edient-0.166
Token achieve
Token ID4620
Feature activation-1.36e+01

Pos logit contributions
kin+0.243
cher+0.241
illas+0.234
ccoli+0.230
 globe+0.230
Neg logit contributions
 Everyone-0.258
 Many-0.253
 All-0.249
 Every-0.249
 People-0.246
Token anything
Token ID1997
Feature activation-1.11e+00

Pos logit contributions
cher+1.036
ppers+1.009
 Claus+1.004
edient+1.001
illas+0.983
Neg logit contributions
 Everyone-0.625
 People-0.605
 Many-0.597
ted-0.585
 Every-0.576
Token.
Token ID13
Feature activation-8.63e-01

Pos logit contributions
cher+0.077
kin+0.076
ppers+0.075
illas+0.074
 afloat+0.074
Neg logit contributions
ted-0.059
 Many-0.057
 Everyone-0.054
 People-0.054
 Every-0.051
Token<|endoftext|>
Token ID50256
Feature activation-5.20e-01

Pos logit contributions
cher+0.061
kin+0.060
ppers+0.059
 afloat+0.059
edient+0.058
Neg logit contributions
 Many-0.045
ted-0.045
 Everyone-0.043
 People-0.043
 Every-0.041
TokenTom
Token ID13787
Feature activation-2.42e+00

Pos logit contributions
cher+0.039
ppers+0.039
kin+0.039
edient+0.038
 afloat+0.038
Neg logit contributions
ted-0.025
 Many-0.024
 Everyone-0.023
 People-0.023
 All-0.022
Tokenmy
Token ID1820
Feature activation-3.64e+00

Pos logit contributions
cher+0.138
 afloat+0.133
kin+0.131
ppers+0.129
edient+0.128
Neg logit contributions
ted-0.166
 Many-0.157
 Every-0.155
 Everyone-0.152
 People-0.151
Token and
Token ID290
Feature activation-8.72e-02

Pos logit contributions
ted+0.170
 Everyone+0.163
 People+0.162
 Many+0.159
 All+0.154
Neg logit contributions
cher-0.231
illas-0.228
ppers-0.227
 afloat-0.224
 Claus-0.222
Token exercise
Token ID5517
Feature activation+7.85e-01

Pos logit contributions
cher+0.005
kin+0.005
ppers+0.005
illas+0.005
 afloat+0.005
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.005
 People-0.005
 Every-0.005
Token are
Token ID389
Feature activation-3.87e+00

Pos logit contributions
ted+0.041
 Everyone+0.039
 People+0.039
 Many+0.039
 All+0.037
Neg logit contributions
cher-0.054
ppers-0.054
 Claus-0.053
illas-0.053
edient-0.052
Token necessary
Token ID3306
Feature activation-1.75e+00

Pos logit contributions
cher+0.267
illas+0.258
kin+0.258
ppers+0.256
 globe+0.254
Neg logit contributions
ted-0.196
 Many-0.193
 People-0.190
 Everyone-0.189
 Every-0.188
Token to
Token ID284
Feature activation+1.90e-01

Pos logit contributions
cher+0.127
kin+0.124
illas+0.123
ppers+0.123
 afloat+0.122
Neg logit contributions
ted-0.085
 Many-0.081
 Everyone-0.078
 People-0.078
 Every-0.074
Token achieve
Token ID4620
Feature activation-1.36e+01

Pos logit contributions
 Everyone+0.011
 Many+0.011
 People+0.011
 All+0.011
 Every+0.011
Neg logit contributions
cher-0.012
ppers-0.012
illas-0.011
 globe-0.011
 Claus-0.011
Token any
Token ID597
Feature activation+1.74e+00

Pos logit contributions
cher+1.040
ppers+1.010
 Claus+0.994
illas+0.976
edient+0.970
Neg logit contributions
 Everyone-0.651
 People-0.632
 Many-0.621
ted-0.609
 Every-0.591
Token goal
Token ID3061
Feature activation-2.50e+00

Pos logit contributions
ted+0.099
 Many+0.097
 Everyone+0.093
 People+0.092
 Every+0.087
Neg logit contributions
cher-0.114
kin-0.111
ppers-0.110
illas-0.109
edient-0.109
Token.
Token ID13
Feature activation-8.50e-01

Pos logit contributions
cher+0.174
kin+0.170
ppers+0.169
 afloat+0.168
edient+0.167
Neg logit contributions
ted-0.129
 Many-0.126
 Everyone-0.121
 People-0.121
 All-0.113
Token<|endoftext|>
Token ID50256
Feature activation-5.23e-01

Pos logit contributions
 achieve+0.056
cher+0.055
edient+0.055
illas+0.055
kin+0.054
Neg logit contributions
ted-0.048
 Many-0.043
 People-0.040
 Everyone-0.039
ly-0.039
TokenOnce
Token ID7454
Feature activation-2.38e+00

Pos logit contributions
cher+0.038
ppers+0.037
kin+0.037
edient+0.037
 achieve+0.037
Neg logit contributions
ted-0.026
 Many-0.025
 Everyone-0.024
 People-0.024
 Every-0.023
Token and
Token ID290
Feature activation-8.38e-02

Pos logit contributions
ted+0.163
 Many+0.153
 People+0.149
 Everyone+0.149
 All+0.140
Neg logit contributions
cher-0.245
ppers-0.242
illas-0.240
 afloat-0.240
kin-0.239
Token determination
Token ID12123
Feature activation+2.01e+00

Pos logit contributions
cher+0.006
kin+0.005
 achieve+0.005
edient+0.005
 afloat+0.005
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.004
 People-0.004
 Every-0.004
Token,
Token ID11
Feature activation-5.93e-01

Pos logit contributions
ted+0.095
 Many+0.088
 People+0.085
 Everyone+0.085
 All+0.080
Neg logit contributions
cher-0.151
kin-0.150
ppers-0.149
illas-0.149
 afloat-0.148
Token he
Token ID339
Feature activation-1.01e+00

Pos logit contributions
 achieve+0.043
cher+0.043
 afloat+0.043
kin+0.042
edient+0.042
Neg logit contributions
ted-0.029
 Many-0.028
 Everyone-0.025
 People-0.025
 Every-0.024
Token could
Token ID714
Feature activation-2.92e-01

Pos logit contributions
cher+0.065
kin+0.064
 achieve+0.063
illas+0.062
ppers+0.062
Neg logit contributions
ted-0.058
 Many-0.057
 Everyone-0.054
 People-0.054
 Every-0.051
Token achieve
Token ID4620
Feature activation-1.36e+01

Pos logit contributions
cher+0.016
kin+0.016
ucer+0.015
ppers+0.015
edient+0.015
Neg logit contributions
 Everyone-0.020
 Many-0.020
 All-0.020
 Every-0.019
 People-0.019
Token anything
Token ID1997
Feature activation-1.12e+00

Pos logit contributions
cher+1.077
ppers+1.068
 Claus+1.040
edient+1.040
illas+1.029
Neg logit contributions
 Everyone-0.583
 People-0.572
ted-0.569
 Many-0.568
 Every-0.533
Token he
Token ID339
Feature activation-1.02e+00

Pos logit contributions
cher+0.081
kin+0.080
ppers+0.078
illas+0.078
edient+0.078
Neg logit contributions
ted-0.055
 Many-0.054
 Everyone-0.051
 People-0.050
 Every-0.047
Token wanted
Token ID2227
Feature activation+6.65e+00

Pos logit contributions
cher+0.071
 achieve+0.070
kin+0.069
illas+0.069
edient+0.069
Neg logit contributions
ted-0.054
 Many-0.053
 Everyone-0.050
 People-0.049
 Every-0.046
Token.
Token ID13
Feature activation-8.63e-01

Pos logit contributions
ted+0.377
 Many+0.372
 Everyone+0.347
 People+0.345
 Every+0.326
Neg logit contributions
cher-0.436
kin-0.426
 achieve-0.422
edient-0.418
illas-0.416
Token<|endoftext|>
Token ID50256
Feature activation-5.20e-01

Pos logit contributions
cher+0.054
 achieve+0.053
edient+0.052
kin+0.052
 afloat+0.052
Neg logit contributions
ted-0.051
 Many-0.048
 People-0.045
 Everyone-0.044
heart-0.043
Token dream
Token ID4320
Feature activation+1.42e+00

Pos logit contributions
cher+0.264
kin+0.256
ppers+0.256
 afloat+0.253
illas+0.253
Neg logit contributions
ted-0.231
 Many-0.229
 Everyone-0.219
 People-0.216
 All-0.205
Token and
Token ID290
Feature activation-9.38e-02

Pos logit contributions
ted+0.070
 Many+0.069
 Everyone+0.066
 People+0.066
 All+0.062
Neg logit contributions
cher-0.101
kin-0.100
illas-0.100
ppers-0.099
edient-0.099
Token worked
Token ID3111
Feature activation-1.78e+00

Pos logit contributions
cher+0.006
kin+0.006
 achieve+0.006
ppers+0.006
edient+0.006
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.005
 People-0.005
 Every-0.005
Token hard
Token ID1327
Feature activation-3.18e+00

Pos logit contributions
illas+0.120
cher+0.118
ppers+0.116
kin+0.114
 Claus+0.114
Neg logit contributions
ted-0.100
 Everyone-0.096
 Many-0.094
 People-0.092
 All-0.091
Token to
Token ID284
Feature activation+1.89e-01

Pos logit contributions
 achieve+0.212
cher+0.210
 afloat+0.205
edient+0.205
kin+0.204
Neg logit contributions
 Many-0.173
ted-0.173
 People-0.161
 Everyone-0.158
 Every-0.147
Token achieve
Token ID4620
Feature activation-1.36e+01

Pos logit contributions
 Many+0.012
 Everyone+0.011
 People+0.011
 All+0.011
ted+0.011
Neg logit contributions
ppers-0.012
ccoli-0.012
cher-0.012
kin-0.012
illas-0.011
Token it
Token ID340
Feature activation+3.28e+00

Pos logit contributions
ppers+1.088
cher+1.078
illas+1.057
edient+1.056
 Claus+1.045
Neg logit contributions
 Everyone-0.595
 People-0.570
 Many-0.569
ted-0.563
 All-0.536
Token.
Token ID13
Feature activation-8.41e-01

Pos logit contributions
ted+0.161
 Many+0.160
 Everyone+0.157
 People+0.154
 All+0.148
Neg logit contributions
cher-0.237
ppers-0.233
kin-0.233
illas-0.232
 Claus-0.227
Token
Token ID198
Feature activation-3.14e-01

Pos logit contributions
 globe+0.041
 achieve+0.041
edient+0.040
ccoli+0.039
 afloat+0.039
Neg logit contributions
ted-0.059
heart-0.054
ly-0.053
andel-0.053
 Many-0.051
Token
Token ID198
Feature activation-3.22e-01

Pos logit contributions
cher+0.020
kin+0.020
 achieve+0.020
edient+0.020
 afloat+0.020
Neg logit contributions
ted-0.018
 Many-0.017
 People-0.016
 Everyone-0.016
 Every-0.016
TokenThe
Token ID464
Feature activation-5.45e+00

Pos logit contributions
cher+0.021
kin+0.020
illas+0.020
 afloat+0.020
ppers+0.020
Neg logit contributions
ted-0.019
 Many-0.019
 Everyone-0.018
 People-0.018
 Every-0.017
Token hard
Token ID1327
Feature activation-3.18e+00

Pos logit contributions
cher+0.005
kin+0.005
 achieve+0.005
edient+0.005
ppers+0.005
Neg logit contributions
ted-0.005
 Many-0.005
 People-0.004
 Everyone-0.004
 Every-0.004
Token work
Token ID670
Feature activation+3.32e+00

Pos logit contributions
 achieve+0.207
kin+0.195
edient+0.191
cher+0.190
 afloat+0.188
Neg logit contributions
 Many-0.175
ted-0.169
 People-0.160
 Everyone-0.156
 Every-0.147
Token,
Token ID11
Feature activation-5.94e-01

Pos logit contributions
ted+0.186
 Many+0.179
 Everyone+0.173
 People+0.173
 All+0.163
Neg logit contributions
cher-0.221
ppers-0.216
 afloat-0.215
illas-0.215
kin-0.215
Token she
Token ID673
Feature activation+1.94e+00

Pos logit contributions
 achieve+0.044
kin+0.043
cher+0.043
ppers+0.042
illas+0.041
Neg logit contributions
ted-0.028
 Many-0.027
 Everyone-0.024
 People-0.023
 Every-0.022
Token could
Token ID714
Feature activation-2.88e-01

Pos logit contributions
 Many+0.119
ted+0.119
 People+0.111
 Everyone+0.110
 Every+0.104
Neg logit contributions
cher-0.119
kin-0.115
 achieve-0.114
edient-0.112
illas-0.112
Token achieve
Token ID4620
Feature activation-1.36e+01

Pos logit contributions
cher+0.017
kin+0.017
ppers+0.016
edient+0.016
 afloat+0.016
Neg logit contributions
 Many-0.018
 Everyone-0.018
ted-0.018
 People-0.017
 Every-0.017
Token anything
Token ID1997
Feature activation-1.11e+00

Pos logit contributions
cher+1.058
ppers+1.036
edient+1.016
illas+1.010
 Claus+1.009
Neg logit contributions
ted-0.605
 Many-0.599
 Everyone-0.591
 People-0.582
 Every-0.545
Token she
Token ID673
Feature activation+1.95e+00

Pos logit contributions
cher+0.086
 achieve+0.085
kin+0.085
edient+0.083
illas+0.083
Neg logit contributions
ted-0.050
 Many-0.048
 Everyone-0.043
 People-0.043
 Every-0.041
Token wanted
Token ID2227
Feature activation+6.66e+00

Pos logit contributions
ted+0.105
 Many+0.102
 Everyone+0.096
 People+0.095
 Every+0.090
Neg logit contributions
cher-0.135
 achieve-0.132
kin-0.131
illas-0.130
edient-0.130
Token.
Token ID13
Feature activation-8.46e-01

Pos logit contributions
ted+0.396
 Many+0.395
 Everyone+0.364
 People+0.361
 Every+0.342
Neg logit contributions
cher-0.415
kin-0.406
 achieve-0.402
edient-0.395
illas-0.393
Token Lily
Token ID20037
Feature activation+2.82e-01

Pos logit contributions
ccoli+0.041
 achieve+0.040
edient+0.039
 promise+0.038
kin+0.038
Neg logit contributions
ted-0.058
 winds-0.052
ly-0.051
heart-0.051
 Many-0.051
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
ted+0.115
 Many+0.111
 People+0.104
 Everyone+0.103
heart+0.099
Neg logit contributions
cher-0.168
 achieve-0.168
 afloat-0.163
illas-0.162
edient-0.160
Token friend
Token ID1545
Feature activation+3.65e-01

Pos logit contributions
 Many+0.051
 Everyone+0.048
 sight+0.047
 People+0.046
 All+0.046
Neg logit contributions
ppers-0.096
cher-0.095
edient-0.094
illas-0.094
kins-0.093
Token to
Token ID284
Feature activation+1.79e-01

Pos logit contributions
ted+0.016
 Many+0.016
 Everyone+0.015
 People+0.015
 Every+0.014
Neg logit contributions
cher-0.028
kin-0.028
 afloat-0.028
ppers-0.027
edient-0.027
Token help
Token ID1037
Feature activation+5.02e-01

Pos logit contributions
 Many+0.011
 Everyone+0.011
 People+0.010
 All+0.010
ted+0.010
Neg logit contributions
cher-0.011
ppers-0.011
kin-0.011
ucer-0.011
ccoli-0.011
Token you
Token ID345
Feature activation+3.40e+00

Pos logit contributions
ted+0.023
 Many+0.022
 Everyone+0.021
 People+0.021
 Every+0.019
Neg logit contributions
cher-0.039
kin-0.038
ppers-0.038
 afloat-0.037
edient-0.037
Token achieve
Token ID4620
Feature activation-1.35e+01

Pos logit contributions
 as+0.119
 Everyone+0.118
 All+0.118
 Many+0.116
 Every+0.115
Neg logit contributions
cher-0.285
kin-0.279
illas-0.276
wy-0.273
 globe-0.271
Token your
Token ID534
Feature activation-4.56e+00

Pos logit contributions
cher+1.051
ppers+1.033
illas+0.994
 Claus+0.990
edient+0.990
Neg logit contributions
 Everyone-0.622
 Many-0.609
 People-0.603
ted-0.583
 Every-0.575
Token goals
Token ID4661
Feature activation-1.14e+00

Pos logit contributions
cher+0.374
ppers+0.370
kin+0.355
ucer+0.353
illas+0.345
Neg logit contributions
 Many-0.200
 People-0.194
 Everyone-0.194
ted-0.186
 Every-0.184
Token.
Token ID13
Feature activation-8.54e-01

Pos logit contributions
cher+0.091
ppers+0.090
 Claus+0.089
ccoli+0.089
illas+0.088
Neg logit contributions
 Everyone-0.042
 Many-0.042
 Every-0.041
 People-0.041
 of-0.039
Token Friends
Token ID14213
Feature activation-3.82e-01

Pos logit contributions
cher+0.063
kin+0.062
ppers+0.061
 afloat+0.061
edient+0.060
Neg logit contributions
 Many-0.042
ted-0.042
 Everyone-0.040
 People-0.039
 Every-0.037
Token can
Token ID460
Feature activation-4.12e+00

Pos logit contributions
ppers+0.026
 afloat+0.025
kin+0.025
cher+0.025
ccoli+0.025
Neg logit contributions
 Everyone-0.022
 Many-0.021
 People-0.021
 All-0.020
ted-0.020
Token with
Token ID351
Feature activation-8.18e-01

Pos logit contributions
ted+0.029
 Many+0.028
 Everyone+0.025
 People+0.025
 Every+0.024
Neg logit contributions
cher-0.045
 achieve-0.044
kin-0.044
edient-0.044
 afloat-0.043
Token patience
Token ID16336
Feature activation+2.53e+00

Pos logit contributions
cher+0.056
kin+0.055
 achieve+0.054
edient+0.054
illas+0.054
Neg logit contributions
ted-0.044
 Many-0.043
 Everyone-0.040
 People-0.040
 Every-0.038
Token he
Token ID339
Feature activation-1.01e+00

Pos logit contributions
ted+0.125
 Many+0.117
 People+0.114
 Everyone+0.114
 All+0.107
Neg logit contributions
cher-0.185
 afloat-0.182
ppers-0.182
kin-0.182
edient-0.180
Token could
Token ID714
Feature activation-2.88e-01

Pos logit contributions
cher+0.060
 achieve+0.058
illas+0.057
kin+0.057
edient+0.057
Neg logit contributions
ted-0.064
 Many-0.064
 Everyone-0.060
 People-0.059
 Every-0.057
Token always
Token ID1464
Feature activation-7.15e-01

Pos logit contributions
cher+0.018
kin+0.017
ucer+0.017
 globe+0.017
illas+0.017
Neg logit contributions
 Everyone-0.017
 Many-0.017
 All-0.017
 Every-0.016
 People-0.016
Token achieve
Token ID4620
Feature activation-1.35e+01

Pos logit contributions
cher+0.046
ucer+0.045
kin+0.045
 globe+0.045
ppers+0.045
Neg logit contributions
 Everyone-0.040
 Many-0.040
 All-0.039
 People-0.038
 Every-0.038
Token what
Token ID644
Feature activation-3.78e+00

Pos logit contributions
cher+1.073
ppers+1.052
illas+1.017
kin+1.017
 Claus+1.016
Neg logit contributions
 Everyone-0.603
 People-0.588
 Many-0.578
ted-0.570
 Every-0.551
Token he
Token ID339
Feature activation-9.94e-01

Pos logit contributions
cher+0.280
kin+0.276
 achieve+0.274
illas+0.272
edient+0.271
Neg logit contributions
ted-0.182
 Many-0.176
 Everyone-0.163
 People-0.162
 Every-0.153
Token wanted
Token ID2227
Feature activation+6.65e+00

Pos logit contributions
cher+0.064
kin+0.062
 achieve+0.062
illas+0.061
edient+0.061
Neg logit contributions
ted-0.058
 Many-0.057
 Everyone-0.054
 People-0.054
 Every-0.051
Token.
Token ID13
Feature activation-8.44e-01

Pos logit contributions
ted+0.362
 Many+0.356
 Everyone+0.330
 People+0.328
 Every+0.310
Neg logit contributions
cher-0.452
kin-0.442
 achieve-0.440
edient-0.434
illas-0.433
Token And
Token ID843
Feature activation-6.29e-01

Pos logit contributions
cher+0.060
kin+0.059
ppers+0.058
 afloat+0.057
illas+0.057
Neg logit contributions
ted-0.044
 Many-0.044
 Everyone-0.042
 People-0.041
 Every-0.039
Token and
Token ID290
Feature activation-9.38e-02

Pos logit contributions
ted+0.092
 Many+0.086
 People+0.084
 Everyone+0.084
 All+0.078
Neg logit contributions
cher-0.153
 afloat-0.150
ppers-0.150
kin-0.150
illas-0.150
Token practice
Token ID3357
Feature activation+8.20e-01

Pos logit contributions
cher+0.007
kin+0.007
ppers+0.006
 achieve+0.006
edient+0.006
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.004
 People-0.004
 Every-0.004
Token,
Token ID11
Feature activation-5.96e-01

Pos logit contributions
ted+0.038
 Many+0.036
 Everyone+0.036
 People+0.035
 All+0.034
Neg logit contributions
cher-0.062
ppers-0.062
 Claus-0.061
kin-0.061
illas-0.061
Token he
Token ID339
Feature activation-1.01e+00

Pos logit contributions
cher+0.044
 achieve+0.043
kin+0.043
 afloat+0.042
illas+0.042
Neg logit contributions
ted-0.029
 Many-0.028
 Everyone-0.026
 People-0.026
 Every-0.024
Token could
Token ID714
Feature activation-2.98e-01

Pos logit contributions
cher+0.059
kin+0.057
 achieve+0.057
illas+0.056
edient+0.056
Neg logit contributions
ted-0.064
 Many-0.064
 Everyone-0.060
 People-0.060
 Every-0.057
Token achieve
Token ID4620
Feature activation-1.35e+01

Pos logit contributions
cher+0.018
ucer+0.018
kin+0.017
ppers+0.017
ccoli+0.017
Neg logit contributions
 Every-0.018
 Everyone-0.018
 All-0.018
 Many-0.018
 People-0.017
Token anything
Token ID1997
Feature activation-1.10e+00

Pos logit contributions
cher+1.082
ppers+1.076
 Claus+1.045
edient+1.039
illas+1.035
Neg logit contributions
 Everyone-0.589
 People-0.575
 Many-0.563
ted-0.556
 Every-0.545
Token he
Token ID339
Feature activation-9.94e-01

Pos logit contributions
cher+0.081
kin+0.080
ppers+0.079
 achieve+0.079
edient+0.078
Neg logit contributions
ted-0.054
 Many-0.052
 Everyone-0.049
 People-0.049
 Every-0.046
Token wanted
Token ID2227
Feature activation+6.65e+00

Pos logit contributions
cher+0.069
kin+0.067
 achieve+0.067
illas+0.066
edient+0.066
Neg logit contributions
ted-0.053
 Many-0.052
 Everyone-0.049
 People-0.048
 Every-0.046
Token.
Token ID13
Feature activation-8.44e-01

Pos logit contributions
ted+0.371
 Many+0.365
 Everyone+0.341
 People+0.340
 Every+0.321
Neg logit contributions
cher-0.441
kin-0.433
 achieve-0.426
edient-0.424
illas-0.423
Token He
Token ID679
Feature activation-2.36e+00

Pos logit contributions
cher+0.059
 achieve+0.059
kin+0.058
illas+0.058
edient+0.058
Neg logit contributions
ted-0.043
 Many-0.040
 People-0.037
 Everyone-0.036
heart-0.035
Token try
Token ID1949
Feature activation-3.18e+00

Pos logit contributions
 Many+0.030
 Every+0.029
 Everyone+0.029
 All+0.027
 People+0.027
Neg logit contributions
cher-0.039
illas-0.036
 globe-0.036
ucer-0.035
edient-0.034
Token new
Token ID649
Feature activation+2.30e+00

Pos logit contributions
cher+0.257
kin+0.247
edient+0.242
 afloat+0.240
illas+0.240
Neg logit contributions
ted-0.139
 Many-0.138
 Everyone-0.129
 People-0.127
 Every-0.125
Token things
Token ID1243
Feature activation+5.86e+00

Pos logit contributions
ted+0.098
 Many+0.095
 People+0.088
 Everyone+0.088
 All+0.080
Neg logit contributions
cher-0.181
kin-0.179
 achieve-0.178
 afloat-0.177
illas-0.177
Token.
Token ID13
Feature activation-4.73e-01

Pos logit contributions
ted+0.224
 Everyone+0.201
 Many+0.197
 People+0.196
 Every+0.195
Neg logit contributions
cher-0.484
 afloat-0.482
edient-0.480
illas-0.470
 globe-0.462
Token Bro
Token ID2806
Feature activation-5.38e+00

Pos logit contributions
cher+0.032
kin+0.032
 achieve+0.031
ppers+0.031
illas+0.031
Neg logit contributions
ted-0.026
 Many-0.024
 Everyone-0.023
 People-0.023
 Every-0.021
Tokenccoli
Token ID34544
Feature activation-1.35e+01

Pos logit contributions
 achieve+0.122
 afloat+0.122
cher+0.115
edient+0.094
kin+0.084
Neg logit contributions
ted-0.538
 People-0.505
heart-0.504
 Everyone-0.502
 Many-0.499
Token is
Token ID318
Feature activation-1.12e+00

Pos logit contributions
cher+0.892
 afloat+0.873
ppers+0.846
edient+0.827
 achieve+0.826
Neg logit contributions
ted-0.798
 Everyone-0.730
 People-0.722
 Many-0.710
 All-0.707
Token not
Token ID407
Feature activation-3.87e+00

Pos logit contributions
cher+0.073
illas+0.071
ucer+0.069
edient+0.068
 globe+0.068
Neg logit contributions
 Many-0.064
 Every-0.064
ted-0.061
 Everyone-0.061
 People-0.061
Token that
Token ID326
Feature activation+1.00e+00

Pos logit contributions
cher+0.254
illas+0.250
edient+0.249
 achieve+0.247
 afloat+0.246
Neg logit contributions
 Many-0.214
ted-0.211
 Everyone-0.202
 People-0.201
 Every-0.199
Token bad
Token ID2089
Feature activation-4.20e+00

Pos logit contributions
ted+0.057
 Many+0.056
 Everyone+0.054
 People+0.053
 Every+0.051
Neg logit contributions
cher-0.066
kin-0.064
ppers-0.064
 afloat-0.063
illas-0.063
Token.
Token ID13
Feature activation-4.37e-01

Pos logit contributions
kin+0.253
cher+0.250
 achieve+0.241
edient+0.240
illas+0.239
Neg logit contributions
ted-0.254
 Many-0.251
 People-0.233
 Everyone-0.231
 Every-0.223
Token hard
Token ID1327
Feature activation-3.17e+00

Pos logit contributions
 achieve+0.005
cher+0.005
kin+0.005
edient+0.005
illas+0.005
Neg logit contributions
ted-0.004
 Many-0.004
 Everyone-0.004
 People-0.004
andel-0.004
Token work
Token ID670
Feature activation+3.35e+00

Pos logit contributions
 achieve+0.212
 afloat+0.193
cher+0.187
kins+0.185
edient+0.184
Neg logit contributions
 Many-0.189
ted-0.181
 Everyone-0.168
 People-0.165
andel-0.153
Token can
Token ID460
Feature activation-4.10e+00

Pos logit contributions
ted+0.176
 People+0.165
 Everyone+0.164
 Many+0.162
 All+0.156
Neg logit contributions
cher-0.231
ppers-0.230
illas-0.229
 afloat-0.226
kin-0.224
Token help
Token ID1037
Feature activation+5.23e-01

Pos logit contributions
cher+0.220
kin+0.220
ppers+0.212
illas+0.211
edient+0.209
Neg logit contributions
 Many-0.276
ted-0.276
 Everyone-0.267
 People-0.266
 Every-0.257
Token you
Token ID345
Feature activation+3.42e+00

Pos logit contributions
ted+0.023
 Many+0.023
 Everyone+0.022
 People+0.022
 Every+0.020
Neg logit contributions
cher-0.041
kin-0.040
ppers-0.039
illas-0.039
edient-0.039
Token achieve
Token ID4620
Feature activation-1.35e+01

Pos logit contributions
 Everyone+0.185
 All+0.184
 Many+0.182
 Every+0.180
 People+0.179
Neg logit contributions
cher-0.220
kin-0.215
illas-0.213
ppers-0.209
ccoli-0.208
Token your
Token ID534
Feature activation-4.52e+00

Pos logit contributions
cher+1.098
ppers+1.075
edient+1.056
 Claus+1.043
kin+1.042
Neg logit contributions
 Everyone-0.571
 People-0.562
 Many-0.552
ted-0.538
 Every-0.531
Token goals
Token ID4661
Feature activation-1.13e+00

Pos logit contributions
cher+0.330
ppers+0.321
kin+0.317
edient+0.310
illas+0.309
Neg logit contributions
 Many-0.229
ted-0.224
 People-0.220
 Everyone-0.220
 Every-0.211
Token.
Token ID13
Feature activation-8.29e-01

Pos logit contributions
cher+0.080
ppers+0.079
kin+0.079
 Claus+0.078
ccoli+0.078
Neg logit contributions
 Many-0.053
 Everyone-0.052
ted-0.052
 People-0.052
 Every-0.050
Token He
Token ID679
Feature activation-2.34e+00

Pos logit contributions
cher+0.046
 globe+0.045
edient+0.045
 achieve+0.045
ccoli+0.044
Neg logit contributions
ted-0.053
 Many-0.048
ly-0.046
andel-0.046
heart-0.045
Token felt
Token ID2936
Feature activation-6.36e-01

Pos logit contributions
cher+0.142
 achieve+0.141
kin+0.139
 promise+0.136
edient+0.136
Neg logit contributions
 Many-0.143
ted-0.142
 People-0.133
 Everyone-0.132
 Every-0.123
Token working
Token ID1762
Feature activation-5.01e+00

Pos logit contributions
ted+0.041
 Many+0.039
 Everyone+0.038
 People+0.038
 Every+0.036
Neg logit contributions
cher-0.049
kin-0.049
illas-0.048
ppers-0.048
edient-0.047
Token hard
Token ID1327
Feature activation-3.18e+00

Pos logit contributions
illas+0.361
cher+0.353
edient+0.347
ppers+0.346
kin+0.346
Neg logit contributions
ted-0.237
 Everyone-0.235
 People-0.235
 Many-0.227
 Every-0.218
Token and
Token ID290
Feature activation-9.06e-02

Pos logit contributions
cher+0.217
 achieve+0.212
 afloat+0.210
kin+0.210
edient+0.209
Neg logit contributions
ted-0.172
 Many-0.168
 Everyone-0.158
 People-0.156
 Every-0.146
Token you
Token ID345
Feature activation+3.40e+00

Pos logit contributions
kin+0.005
cher+0.005
illas+0.005
ppers+0.005
 afloat+0.005
Neg logit contributions
ted-0.006
 Many-0.006
 Everyone-0.005
 People-0.005
 All-0.005
Token will
Token ID481
Feature activation-9.98e+00

Pos logit contributions
 Many+0.242
ted+0.242
 Everyone+0.237
 People+0.235
 All+0.229
Neg logit contributions
kin-0.166
edient-0.163
cher-0.162
 afloat-0.161
ppers-0.160
Token achieve
Token ID4620
Feature activation-1.35e+01

Pos logit contributions
cher+0.619
kin+0.602
illas+0.601
ppers+0.590
kins+0.584
Neg logit contributions
 Many-0.607
 Everyone-0.601
 People-0.582
 Every-0.579
 All-0.577
Token great
Token ID1049
Feature activation-3.85e+00

Pos logit contributions
cher+0.978
edient+0.940
ppers+0.936
 Claus+0.918
illas+0.916
Neg logit contributions
 Everyone-0.688
 Many-0.681
 People-0.672
ted-0.665
 Every-0.644
Token things
Token ID1243
Feature activation+5.50e+00

Pos logit contributions
 achieve+0.281
cher+0.266
kin+0.265
 afloat+0.262
illas+0.261
Neg logit contributions
ted-0.203
 Many-0.188
 People-0.168
 Everyone-0.166
 Every-0.163
Token."
Token ID526
Feature activation+6.25e-01

Pos logit contributions
 Everyone+0.258
 People+0.255
ted+0.251
 Many+0.251
 All+0.241
Neg logit contributions
kin-0.397
cher-0.388
 afloat-0.387
ppers-0.379
illas-0.378
Token
Token ID220
Feature activation-2.91e+00

Pos logit contributions
ted+0.037
 Many+0.036
 People+0.034
 Everyone+0.034
 Every+0.032
Neg logit contributions
cher-0.040
kin-0.039
 achieve-0.038
ppers-0.038
edient-0.038
Token
Token ID198
Feature activation-3.18e-01

Pos logit contributions
cher+0.198
kin+0.195
ppers+0.191
 afloat+0.191
edient+0.191
Neg logit contributions
ted-0.157
 Many-0.154
 Everyone-0.146
 People-0.145
 Every-0.137
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
ted+0.221
 Many+0.215
 Everyone+0.205
 People+0.204
 Every+0.191
Neg logit contributions
cher-0.267
kin-0.261
 achieve-0.260
ppers-0.257
edient-0.256
Token smallest
Token ID18197
Feature activation+1.46e+00

Pos logit contributions
cher+0.073
kin+0.072
 achieve+0.071
illas+0.071
edient+0.070
Neg logit contributions
ted-0.064
 Many-0.061
 Everyone-0.058
 People-0.058
 Every-0.055
Token of
Token ID286
Feature activation+6.42e+00

Pos logit contributions
ted+0.074
 Many+0.072
 Everyone+0.065
 People+0.064
 Every+0.062
Neg logit contributions
kin-0.105
cher-0.104
 promise-0.103
edient-0.101
 achieve-0.101
Token us
Token ID514
Feature activation+6.52e+00

Pos logit contributions
ted+0.363
 Many+0.340
 People+0.316
 Everyone+0.316
 Every+0.300
Neg logit contributions
kin-0.430
 achieve-0.424
cher-0.424
illas-0.416
edient-0.415
Token can
Token ID460
Feature activation-4.09e+00

Pos logit contributions
ted+0.404
 Many+0.396
 Everyone+0.386
 People+0.383
 Every+0.364
Neg logit contributions
cher-0.383
kin-0.380
 afloat-0.376
ppers-0.373
edient-0.368
Token achieve
Token ID4620
Feature activation-1.35e+01

Pos logit contributions
cher+0.226
kin+0.223
ppers+0.216
illas+0.216
 afloat+0.216
Neg logit contributions
 Many-0.271
ted-0.269
 Everyone-0.262
 People-0.258
 Every-0.250
Token great
Token ID1049
Feature activation-3.85e+00

Pos logit contributions
cher+0.970
ppers+0.940
edient+0.929
kin+0.923
illas+0.920
Neg logit contributions
 Many-0.678
ted-0.675
 Everyone-0.664
 People-0.654
 Every-0.619
Token things
Token ID1243
Feature activation+5.49e+00

Pos logit contributions
 achieve+0.293
kin+0.279
 afloat+0.278
cher+0.278
illas+0.276
Neg logit contributions
ted-0.185
 Many-0.174
 People-0.155
 Everyone-0.153
 Every-0.150
Token."
Token ID526
Feature activation+6.13e-01

Pos logit contributions
ted+0.262
 Many+0.259
 Everyone+0.257
 People+0.254
 Every+0.240
Neg logit contributions
kin-0.390
cher-0.388
 afloat-0.378
ppers-0.377
illas-0.375
Token<|endoftext|>
Token ID50256
Feature activation-5.14e-01

Pos logit contributions
ted+0.037
 Many+0.034
ly+0.031
 People+0.031
 winds+0.031
Neg logit contributions
cher-0.036
 achieve-0.035
illas-0.035
kin-0.035
 Claus-0.034
TokenOnce
Token ID7454
Feature activation-2.39e+00

Pos logit contributions
cher+0.038
ppers+0.037
kin+0.037
edient+0.037
 afloat+0.036
Neg logit contributions
ted-0.026
 Many-0.025
 Everyone-0.024
 People-0.024
 Every-0.022
Token around
Token ID1088
Feature activation+2.15e+00

Pos logit contributions
ted+0.156
 Many+0.151
 Everyone+0.144
 People+0.143
 Every+0.137
Neg logit contributions
cher-0.157
kin-0.152
ppers-0.152
illas-0.152
edient-0.151
Token in
Token ID287
Feature activation-2.37e+00

Pos logit contributions
ted+0.111
 Many+0.108
 Everyone+0.102
 People+0.101
 Every+0.096
Neg logit contributions
cher-0.153
kin-0.149
ppers-0.148
edient-0.148
illas-0.147
Token the
Token ID262
Feature activation-1.03e-01

Pos logit contributions
cher+0.177
kin+0.175
 achieve+0.173
 afloat+0.172
illas+0.172
Neg logit contributions
ted-0.113
 Many-0.109
 Everyone-0.102
 People-0.102
 Every-0.094
Token fields
Token ID7032
Feature activation+6.19e+00

Pos logit contributions
cher+0.009
ppers+0.009
edient+0.008
illas+0.008
 Claus+0.008
Neg logit contributions
ted-0.004
 Many-0.004
 Everyone-0.004
 People-0.004
 Every-0.004
Token.
Token ID13
Feature activation-8.43e-02

Pos logit contributions
ted+0.293
 Many+0.283
 Everyone+0.275
 People+0.273
 Every+0.267
Neg logit contributions
cher-0.461
ppers-0.447
kin-0.445
edient-0.442
illas-0.442
Token One
Token ID1881
Feature activation+9.56e+00

Pos logit contributions
ccoli+0.006
illas+0.006
 achieve+0.006
Jackson+0.006
cher+0.006
Neg logit contributions
ted-0.004
ly-0.003
 Many-0.003
heart-0.003
 winds-0.003
Token day
Token ID1110
Feature activation+2.49e+00

Pos logit contributions
ted+0.556
 Many+0.529
 People+0.501
 Everyone+0.500
arted+0.436
Neg logit contributions
kin-0.641
 promise-0.611
edient-0.600
 afloat-0.591
 globe-0.579
Token,
Token ID11
Feature activation-9.58e-02

Pos logit contributions
 Many+0.149
ted+0.145
 People+0.139
 Every+0.136
 Everyone+0.132
Neg logit contributions
kin-0.150
cher-0.144
 afloat-0.140
kins-0.137
 achieve-0.137
Token Benny
Token ID44275
Feature activation-1.19e-01

Pos logit contributions
kin+0.007
cher+0.007
 afloat+0.007
 Claus+0.007
illas+0.007
Neg logit contributions
 Many-0.005
ted-0.005
 People-0.004
 Everyone-0.004
 Every-0.004
Token saw
Token ID2497
Feature activation+2.66e+00

Pos logit contributions
cher+0.007
 achieve+0.007
 Claus+0.007
 afloat+0.007
kin+0.007
Neg logit contributions
 Many-0.007
ted-0.007
 Everyone-0.007
 People-0.007
 Every-0.006
Token a
Token ID257
Feature activation+1.59e+00

Pos logit contributions
 Many+0.163
 People+0.162
ted+0.161
 Everyone+0.153
andel+0.152
Neg logit contributions
cher-0.149
 afloat-0.143
kin-0.138
 achieve-0.133
 globe-0.129
Token that
Token ID326
Feature activation+6.18e-01

Pos logit contributions
cher+0.008
ppers+0.008
 achieve+0.008
ccoli+0.008
 afloat+0.008
Neg logit contributions
ted-0.006
 Everyone-0.006
 Many-0.006
 People-0.006
 All-0.006
Token was
Token ID373
Feature activation+7.40e-01

Pos logit contributions
ted+0.033
 Many+0.032
 Everyone+0.030
 People+0.030
 Every+0.028
Neg logit contributions
cher-0.043
kin-0.042
edient-0.041
 afloat-0.041
 achieve-0.041
Token very
Token ID845
Feature activation-2.64e+00

Pos logit contributions
ted+0.051
 Many+0.049
 Everyone+0.047
 People+0.047
 All+0.044
Neg logit contributions
kin-0.040
cher-0.040
 afloat-0.039
edient-0.039
 achieve-0.038
Token popular
Token ID2968
Feature activation+1.17e+00

Pos logit contributions
kin+0.148
cher+0.145
edient+0.144
 afloat+0.144
 achieve+0.141
Neg logit contributions
ted-0.179
 Many-0.168
 Everyone-0.159
 People-0.159
 All-0.152
Token.
Token ID13
Feature activation-8.47e-01

Pos logit contributions
 Many+0.074
ted+0.074
 People+0.068
 Everyone+0.067
 Every+0.064
Neg logit contributions
kin-0.070
cher-0.069
 globe-0.068
edient-0.067
 afloat-0.066
Token People
Token ID4380
Feature activation+1.14e+01

Pos logit contributions
 globe+0.058
kin+0.057
ccoli+0.056
 tim+0.056
 Claus+0.056
Neg logit contributions
ted-0.043
heart-0.036
ly-0.036
arted-0.033
unn-0.031
Token came
Token ID1625
Feature activation+4.95e+00

Pos logit contributions
ted+0.622
 Many+0.602
 Everyone+0.571
 People+0.570
 Every+0.538
Neg logit contributions
cher-0.783
kin-0.769
ppers-0.759
 afloat-0.758
edient-0.752
Token from
Token ID422
Feature activation-1.42e+00

Pos logit contributions
ted+0.203
 Many+0.197
 Everyone+0.182
 People+0.181
 Every+0.168
Neg logit contributions
cher-0.403
kin-0.398
ppers-0.392
 achieve-0.391
edient-0.391
Token all
Token ID477
Feature activation+3.94e+00

Pos logit contributions
cher+0.093
kin+0.093
 afloat+0.092
illas+0.092
edient+0.091
Neg logit contributions
ted-0.078
 Many-0.077
 Everyone-0.073
 People-0.072
 All-0.068
Token around
Token ID1088
Feature activation+6.39e-01

Pos logit contributions
ted+0.181
 Many+0.177
 Everyone+0.167
 People+0.166
 Every+0.156
Neg logit contributions
cher-0.306
kin-0.300
ppers-0.298
 afloat-0.293
 achieve-0.292
Token to
Token ID284
Feature activation+1.78e-01

Pos logit contributions
ted+0.033
 Many+0.032
 Everyone+0.030
 People+0.030
 Every+0.028
Neg logit contributions
cher-0.045
kin-0.045
 achieve-0.044
illas-0.044
edient-0.044
Token,
Token ID11
Feature activation-2.79e-01

Pos logit contributions
ted+0.032
 Many+0.031
 Everyone+0.029
 People+0.029
 Every+0.027
Neg logit contributions
cher-0.053
kin-0.052
 achieve-0.051
ppers-0.051
edient-0.051
Token a
Token ID257
Feature activation+1.57e+00

Pos logit contributions
cher+0.022
kin+0.021
ppers+0.021
 achieve+0.021
edient+0.021
Neg logit contributions
ted-0.013
 Many-0.012
 Everyone-0.011
 People-0.011
 Every-0.011
Token little
Token ID1310
Feature activation+3.23e+00

Pos logit contributions
 Many+0.051
 All+0.040
 Everyone+0.038
 gust+0.037
 People+0.037
Neg logit contributions
illas-0.148
ppers-0.147
cher-0.144
wy-0.143
edient-0.142
Token bird
Token ID6512
Feature activation+2.69e+00

Pos logit contributions
ted+0.157
 Many+0.153
 Everyone+0.141
 People+0.141
 Every+0.133
Neg logit contributions
cher-0.241
kin-0.238
 achieve-0.235
ppers-0.234
illas-0.233
Token named
Token ID3706
Feature activation+2.08e+00

Pos logit contributions
ted+0.133
 Many+0.125
 Everyone+0.116
 People+0.115
 All+0.111
Neg logit contributions
cher-0.201
kin-0.198
 achieve-0.194
 afloat-0.193
ppers-0.192
Token Twe
Token ID24205
Feature activation+9.93e+00

Pos logit contributions
ted+0.097
 Many+0.090
 Everyone+0.082
 People+0.082
heart+0.076
Neg logit contributions
cher-0.159
 Claus-0.153
edient-0.152
illas-0.151
 achieve-0.151
Tokenety
Token ID2963
Feature activation-1.95e-02

Pos logit contributions
ted+0.634
 Many+0.609
 Everyone+0.566
 People+0.565
 Every+0.549
Neg logit contributions
kin-0.595
cher-0.587
 achieve-0.578
 afloat-0.571
edient-0.560
Token,
Token ID11
Feature activation-2.43e-01

Pos logit contributions
kin+0.001
cher+0.001
 achieve+0.001
 afloat+0.001
edient+0.001
Neg logit contributions
ted-0.001
 Many-0.001
 Everyone-0.001
 People-0.001
 Every-0.001
Token and
Token ID290
Feature activation+2.73e-01

Pos logit contributions
cher+0.015
illas+0.015
ppers+0.015
 afloat+0.014
 Claus+0.014
Neg logit contributions
 Many-0.014
ted-0.014
 Everyone-0.014
 People-0.014
 Every-0.013
Token told
Token ID1297
Feature activation-3.68e-01

Pos logit contributions
andel+0.014
 Many+0.013
ted+0.013
 People+0.013
ummy+0.013
Neg logit contributions
cher-0.019
 Claus-0.019
ppers-0.017
 afloat-0.017
 promise-0.017
Token her
Token ID607
Feature activation+1.87e+00

Pos logit contributions
 achieve+0.025
 stret+0.025
 promise+0.025
 afloat+0.025
 Claus+0.025
Neg logit contributions
ted-0.015
 Many-0.015
pill-0.014
andel-0.014
oline-0.014
Token out
Token ID503
Feature activation+9.42e-01

Pos logit contributions
ted+0.046
 Many+0.046
 People+0.044
 Everyone+0.043
 Every+0.041
Neg logit contributions
cher-0.046
 afloat-0.045
kin-0.045
ppers-0.044
edient-0.044
Token of
Token ID286
Feature activation+7.44e+00

Pos logit contributions
ted+0.039
 Many+0.038
 Everyone+0.035
 People+0.035
 Every+0.033
Neg logit contributions
cher-0.076
kin-0.075
ppers-0.074
 achieve-0.074
illas-0.073
Token the
Token ID262
Feature activation-2.53e-01

Pos logit contributions
ted+0.414
 Many+0.406
 Everyone+0.382
 People+0.381
 All+0.356
Neg logit contributions
cher-0.488
 afloat-0.487
kin-0.487
illas-0.475
 achieve-0.473
Token hole
Token ID7604
Feature activation-4.01e+00

Pos logit contributions
cher+0.015
ppers+0.015
edient+0.015
 achieve+0.015
kin+0.015
Neg logit contributions
ted-0.015
 Many-0.015
 People-0.014
 Everyone-0.014
 Every-0.014
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
cher+0.294
 achieve+0.288
kin+0.286
illas+0.284
edient+0.284
Neg logit contributions
ted-0.192
 Many-0.176
 Everyone-0.170
 People-0.169
 Every-0.164
Token People
Token ID4380
Feature activation+1.20e+01

Pos logit contributions
cher+0.014
kin+0.013
 achieve+0.013
illas+0.013
edient+0.013
Neg logit contributions
ted-0.010
 Many-0.009
 People-0.009
 Everyone-0.009
heart-0.008
Token walked
Token ID6807
Feature activation+4.59e+00

Pos logit contributions
ted+0.597
 Many+0.543
 Everyone+0.523
 People+0.522
 All+0.501
Neg logit contributions
kin-0.874
ppers-0.871
cher-0.868
ccoli-0.867
 afloat-0.864
Token by
Token ID416
Feature activation-7.70e-01

Pos logit contributions
ted+0.291
 Many+0.271
 People+0.262
 Everyone+0.261
 Every+0.249
Neg logit contributions
kin-0.280
cher-0.277
ppers-0.277
 achieve-0.277
illas-0.271
Token and
Token ID290
Feature activation+3.12e-01

Pos logit contributions
cher+0.058
kin+0.056
ppers+0.056
 Claus+0.055
illas+0.055
Neg logit contributions
ted-0.037
 Many-0.036
 Everyone-0.034
 People-0.034
 All-0.032
Token saw
Token ID2497
Feature activation+2.62e+00

Pos logit contributions
ted+0.016
 Many+0.015
 Everyone+0.014
 People+0.014
 All+0.013
Neg logit contributions
kin-0.023
ppers-0.022
cher-0.022
 achieve-0.022
edient-0.022
Token Tim
Token ID5045
Feature activation+4.29e+00

Pos logit contributions
ted+0.134
 Many+0.130
 Everyone+0.119
 People+0.119
 Every+0.117
Neg logit contributions
cher-0.179
illas-0.174
 achieve-0.170
kin-0.169
 globe-0.169
Token and
Token ID290
Feature activation+3.84e-01

Pos logit contributions
ted+0.293
 Many+0.291
 People+0.275
 Everyone+0.271
 Every+0.256
Neg logit contributions
cher-0.327
kin-0.324
edient-0.321
 Claus-0.321
illas-0.320
Token love
Token ID1842
Feature activation+1.17e+00

Pos logit contributions
ted+0.019
 Many+0.019
 Everyone+0.018
 People+0.018
 Every+0.017
Neg logit contributions
cher-0.028
kin-0.027
ppers-0.027
 afloat-0.027
illas-0.027
Token playing
Token ID2712
Feature activation+3.04e+00

Pos logit contributions
 Many+0.062
ted+0.061
 People+0.057
 Everyone+0.057
heart+0.052
Neg logit contributions
kin-0.078
cher-0.077
edient-0.077
 globe-0.077
 Claus-0.076
Token outside
Token ID2354
Feature activation+2.07e+00

Pos logit contributions
ted+0.158
 Many+0.152
 Everyone+0.148
 People+0.144
 All+0.140
Neg logit contributions
cher-0.220
ppers-0.219
edient-0.216
illas-0.216
 Claus-0.215
Token.
Token ID13
Feature activation-3.67e-01

Pos logit contributions
ted+0.126
 Many+0.123
 People+0.116
 Everyone+0.116
 Every+0.109
Neg logit contributions
cher-0.126
kin-0.125
edient-0.123
 achieve-0.122
illas-0.122
Token One
Token ID1881
Feature activation+9.41e+00

Pos logit contributions
 globe+0.019
 achieve+0.018
illas+0.018
Anna+0.017
 promise+0.017
Neg logit contributions
ted-0.025
heart-0.023
 winds-0.022
ly-0.022
andel-0.022
Token day
Token ID1110
Feature activation+2.42e+00

Pos logit contributions
ted+0.443
 Many+0.419
 People+0.395
 Everyone+0.391
 Every+0.356
Neg logit contributions
kin-0.716
edient-0.697
cher-0.690
 afloat-0.684
 achieve-0.681
Token,
Token ID11
Feature activation-8.06e-02

Pos logit contributions
heart+0.169
 Many+0.166
ted+0.164
andel+0.163
 People+0.161
Neg logit contributions
kin-0.109
 globe-0.105
illas-0.098
kins-0.094
cher-0.094
Token Zoe
Token ID32280
Feature activation+1.83e+00

Pos logit contributions
kin+0.004
 globe+0.004
Anna+0.004
illas+0.004
 firm+0.004
Neg logit contributions
heart-0.005
elling-0.005
umm-0.005
andel-0.005
 winds-0.005
Token was
Token ID373
Feature activation+1.24e+00

Pos logit contributions
 Many+0.114
ted+0.111
 Everyone+0.109
 People+0.108
 Every+0.100
Neg logit contributions
illas-0.108
 Claus-0.107
edient-0.107
ppers-0.106
 achieve-0.105
Token playing
Token ID2712
Feature activation+3.09e+00

Pos logit contributions
ted+0.070
 Many+0.066
 People+0.064
 Everyone+0.064
heart+0.060
Neg logit contributions
cher-0.082
kin-0.081
 achieve-0.080
 afloat-0.080
edient-0.079
Token red
Token ID2266
Feature activation+2.50e+00

Pos logit contributions
cher+0.161
kin+0.160
 afloat+0.155
ppers+0.155
 achieve+0.154
Neg logit contributions
ted-0.182
 Many-0.177
 Everyone-0.168
 People-0.168
 Every-0.160
Token hat
Token ID6877
Feature activation-3.94e+00

Pos logit contributions
ted+0.176
 Many+0.175
 People+0.168
 Everyone+0.166
 Every+0.160
Neg logit contributions
cher-0.124
kin-0.124
 globe-0.123
ppers-0.120
 afloat-0.118
Token,
Token ID11
Feature activation-5.70e-01

Pos logit contributions
cher+0.273
kin+0.272
edient+0.269
 achieve+0.268
illas+0.268
Neg logit contributions
ted-0.207
 Many-0.194
 Everyone-0.186
 People-0.184
 All-0.176
Token a
Token ID257
Feature activation+1.27e+00

Pos logit contributions
cher+0.039
kin+0.038
 afloat+0.038
ppers+0.038
 achieve+0.037
Neg logit contributions
ted-0.030
 Many-0.030
 Everyone-0.028
 People-0.028
 Every-0.026
Token beautiful
Token ID4950
Feature activation+6.60e-02

Pos logit contributions
 Many+0.063
ted+0.059
 Everyone+0.058
 People+0.057
 All+0.055
Neg logit contributions
illas-0.089
edient-0.089
 achieve-0.088
ppers-0.087
cher-0.086
Token yellow
Token ID7872
Feature activation+2.12e+00

Pos logit contributions
ted+0.004
 Many+0.004
 Everyone+0.004
 People+0.004
 Every+0.004
Neg logit contributions
cher-0.004
 achieve-0.004
edient-0.004
illas-0.004
kin-0.004
Token dress
Token ID6576
Feature activation-1.53e+00

Pos logit contributions
ted+0.159
 Many+0.157
 People+0.150
 Everyone+0.150
 Every+0.144
Neg logit contributions
cher-0.099
kin-0.097
ppers-0.094
 afloat-0.094
illas-0.094
Token,
Token ID11
Feature activation-5.77e-01

Pos logit contributions
cher+0.110
 achieve+0.108
kin+0.108
 afloat+0.108
edient+0.108
Neg logit contributions
ted-0.077
 Many-0.072
 Everyone-0.069
 People-0.068
 All-0.065
Token and
Token ID290
Feature activation-6.87e-02

Pos logit contributions
cher+0.039
kin+0.039
ppers+0.038
 achieve+0.038
 afloat+0.038
Neg logit contributions
ted-0.031
 Many-0.030
 Everyone-0.029
 People-0.029
 Every-0.027
Token two
Token ID734
Feature activation-2.56e+00

Pos logit contributions
cher+0.005
kin+0.004
ppers+0.004
 Claus+0.004
kins+0.004
Neg logit contributions
 Many-0.004
ted-0.004
 People-0.003
 Everyone-0.003
andel-0.003
Token j
Token ID474
Feature activation+2.43e+00

Pos logit contributions
cher+0.178
ppers+0.172
kin+0.172
illas+0.170
 afloat+0.170
Neg logit contributions
ted-0.132
 Many-0.129
 People-0.121
 Everyone-0.121
 Every-0.115
Token felt
Token ID2936
Feature activation-6.12e-01

Pos logit contributions
cher+0.151
kin+0.147
 Claus+0.146
ppers+0.146
edient+0.145
Neg logit contributions
ted-0.131
 Many-0.129
 People-0.122
 Everyone-0.121
 Every-0.113
Token ashamed
Token ID22461
Feature activation-1.70e+00

Pos logit contributions
cher+0.036
kin+0.036
 afloat+0.036
 achieve+0.035
edient+0.035
Neg logit contributions
ted-0.038
 Many-0.037
 People-0.035
 Everyone-0.035
 Every-0.033
Token when
Token ID618
Feature activation+2.92e+00

Pos logit contributions
cher+0.115
edient+0.114
 afloat+0.113
kin+0.113
 achieve+0.112
Neg logit contributions
 Many-0.088
ted-0.085
 People-0.081
 Everyone-0.079
 Every-0.073
Token she
Token ID673
Feature activation+2.02e+00

Pos logit contributions
ted+0.121
 Many+0.118
 People+0.105
 Everyone+0.104
 Every+0.097
Neg logit contributions
cher-0.232
kin-0.230
 achieve-0.228
 Claus-0.227
edient-0.225
Token saw
Token ID2497
Feature activation+2.35e+00

Pos logit contributions
ted+0.118
 Many+0.118
 People+0.112
 Everyone+0.112
 Every+0.105
Neg logit contributions
cher-0.129
kin-0.125
ppers-0.124
illas-0.123
 achieve-0.123
Token him
Token ID683
Feature activation+5.22e+00

Pos logit contributions
 Many+0.109
ted+0.102
 People+0.098
 Everyone+0.095
andel+0.088
Neg logit contributions
cher-0.172
kin-0.167
 globe-0.165
illas-0.165
 Claus-0.164
Token carrying
Token ID6872
Feature activation-2.20e+00

Pos logit contributions
ted+0.239
 Many+0.218
 All+0.218
 Every+0.218
 Everyone+0.214
Neg logit contributions
cher-0.397
kin-0.393
ucer-0.381
illas-0.380
 globe-0.376
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
cher+0.162
kin+0.161
 afloat+0.158
ppers+0.155
edient+0.155
Neg logit contributions
 Many-0.104
ted-0.102
 People-0.095
 Everyone-0.093
 All-0.086
Token sausage
Token ID31889
Feature activation-3.65e+00

Pos logit contributions
cher+0.070
kin+0.068
 achieve+0.067
illas+0.067
ppers+0.067
Neg logit contributions
ted-0.062
 Many-0.060
 Everyone-0.058
 People-0.057
 Every-0.056
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
 achieve+0.283
illas+0.277
cher+0.276
Jackson+0.272
edient+0.269
Neg logit contributions
ted-0.163
 Everyone-0.138
 All-0.136
ly-0.135
 Every-0.133
Token he
Token ID339
Feature activation-9.12e-01

Pos logit contributions
ted+0.001
 Many+0.001
 People+0.001
 Everyone+0.001
 Every+0.001
Neg logit contributions
cher-0.001
kin-0.001
ppers-0.001
 achieve-0.001
edient-0.001
Token a
Token ID257
Feature activation+1.72e+00

Pos logit contributions
ted+0.070
 Many+0.063
 People+0.062
 Everyone+0.061
heart+0.058
Neg logit contributions
kin-0.087
cher-0.086
 achieve-0.085
 afloat-0.083
 Claus-0.081
Token time
Token ID640
Feature activation+1.81e+00

Pos logit contributions
 Many+0.066
ted+0.064
 Everyone+0.060
 People+0.059
 Every+0.057
Neg logit contributions
cher-0.148
ppers-0.141
illas-0.140
edient-0.140
 afloat-0.140
Token there
Token ID612
Feature activation-2.22e+00

Pos logit contributions
ted+0.103
 Many+0.098
 Everyone+0.093
 People+0.093
 Every+0.087
Neg logit contributions
kin-0.119
cher-0.117
 achieve-0.115
illas-0.115
 afloat-0.114
Token was
Token ID373
Feature activation+1.21e+00

Pos logit contributions
kin+0.138
 globe+0.131
edient+0.130
 achieve+0.129
illas+0.128
Neg logit contributions
ted-0.133
 Many-0.130
 People-0.121
 Everyone-0.120
andel-0.115
Token a
Token ID257
Feature activation+1.74e+00

Pos logit contributions
ted+0.081
 People+0.074
heart+0.072
 Everyone+0.072
 Many+0.070
Neg logit contributions
kin-0.070
 afloat-0.065
 globe-0.063
aks-0.062
 achieve-0.060
Token little
Token ID1310
Feature activation+3.40e+00

Pos logit contributions
 Many+0.042
 Everyone+0.033
 People+0.031
 All+0.030
ted+0.029
Neg logit contributions
cher-0.178
illas-0.173
ppers-0.171
edient-0.170
 afloat-0.169
Token boy
Token ID2933
Feature activation-1.92e-01

Pos logit contributions
ted+0.174
 Many+0.169
 Everyone+0.158
 People+0.157
 Every+0.148
Neg logit contributions
cher-0.245
 achieve-0.240
illas-0.236
ppers-0.236
kin-0.236
Token,
Token ID11
Feature activation-8.22e-02

Pos logit contributions
cher+0.013
kin+0.013
ppers+0.013
 achieve+0.013
 afloat+0.013
Neg logit contributions
ted-0.010
 Many-0.010
 Everyone-0.009
 People-0.009
 Every-0.009
Token who
Token ID508
Feature activation-5.39e+00

Pos logit contributions
cher+0.007
ppers+0.007
edient+0.006
illas+0.006
 achieve+0.006
Neg logit contributions
ted-0.004
 Many-0.003
 Everyone-0.003
 Every-0.003
 People-0.003
Token was
Token ID373
Feature activation+1.25e+00

Pos logit contributions
cher+0.465
ppers+0.456
illas+0.449
 afloat+0.438
 achieve+0.431
Neg logit contributions
 Many-0.223
ted-0.207
 Everyone-0.203
 People-0.201
 All-0.198
Token very
Token ID845
Feature activation-2.15e+00

Pos logit contributions
ted+0.097
 Many+0.090
 People+0.088
 Everyone+0.088
heart+0.084
Neg logit contributions
edient-0.057
 achieve-0.057
 Claus-0.056
kin-0.056
cher-0.056
Token t
Token ID256
Feature activation+1.77e+00

Pos logit contributions
 achieve+0.084
cher+0.083
illas+0.081
edient+0.080
kin+0.079
Neg logit contributions
ted-0.113
 Many-0.110
 People-0.105
 Everyone-0.105
 Every-0.102
Tokeneddy
Token ID21874
Feature activation+8.29e-01

Pos logit contributions
 Many+0.122
ted+0.118
 People+0.117
 Everyone+0.115
 Every+0.112
Neg logit contributions
ppers-0.097
cher-0.097
kin-0.093
edient-0.092
illas-0.092
Token.
Token ID13
Feature activation-5.77e-01

Pos logit contributions
ted+0.055
 Many+0.044
 Everyone+0.044
 People+0.043
ly+0.043
Neg logit contributions
 achieve-0.056
kin-0.052
edient-0.051
cher-0.051
 afloat-0.050
Token He
Token ID679
Feature activation-2.09e+00

Pos logit contributions
cher+0.037
kin+0.036
 achieve+0.036
 Claus+0.035
 afloat+0.035
Neg logit contributions
ted-0.032
heart-0.029
 Many-0.029
 People-0.027
 Everyone-0.027
Token gave
Token ID2921
Feature activation-3.60e-01

Pos logit contributions
cher+0.132
kin+0.129
ppers+0.127
illas+0.127
 achieve+0.127
Neg logit contributions
ted-0.124
 Many-0.122
 Everyone-0.115
 People-0.115
 Every-0.109
Token it
Token ID340
Feature activation+3.56e+00

Pos logit contributions
cher+0.022
kin+0.021
 away+0.021
ucer+0.020
 promise+0.020
Neg logit contributions
 Many-0.020
 People-0.019
ted-0.018
andel-0.018
 winds-0.018
Token to
Token ID284
Feature activation+4.78e-01

Pos logit contributions
 Many+0.192
ted+0.189
 People+0.181
 Everyone+0.179
 All+0.167
Neg logit contributions
cher-0.241
kin-0.233
ppers-0.232
 Claus-0.230
 afloat-0.228
Token Lily
Token ID20037
Feature activation+5.68e-01

Pos logit contributions
ted+0.021
 Many+0.020
 People+0.019
 Everyone+0.018
 Every+0.017
Neg logit contributions
cher-0.037
kin-0.037
 achieve-0.036
edient-0.036
ppers-0.036
Token.
Token ID13
Feature activation-5.46e-01

Pos logit contributions
ted+0.021
 Every+0.021
 Many+0.020
 Everyone+0.019
 All+0.018
Neg logit contributions
cher-0.048
kin-0.048
illas-0.047
 afloat-0.047
edient-0.046
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
cher+0.034
kin+0.034
 achieve+0.034
 afloat+0.032
 Claus+0.032
Neg logit contributions
ted-0.031
heart-0.028
 Many-0.028
 People-0.026
 Everyone-0.026
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
cher+0.001
kin+0.001
ppers+0.001
illas+0.001
 afloat+0.001
Neg logit contributions
ted-0.001
 Many-0.001
 Everyone-0.001
 People-0.001
 Every-0.001
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
cher+0.050
 afloat+0.047
ppers+0.047
edient+0.046
kin+0.046
Neg logit contributions
 Many-0.034
ted-0.032
 People-0.031
 Everyone-0.029
 Every-0.029
Token dish
Token ID9433
Feature activation-5.62e+00

Pos logit contributions
cher+0.050
kin+0.048
ppers+0.047
edient+0.047
 afloat+0.047
Neg logit contributions
ted-0.081
 Many-0.079
 Everyone-0.076
 People-0.076
 Every-0.073
Tokenwasher
Token ID45146
Feature activation-6.77e+00

Pos logit contributions
illas+0.260
cher+0.251
 achieve+0.250
kin+0.245
 afloat+0.243
Neg logit contributions
ted-0.424
 Everyone-0.414
 People-0.406
 Many-0.401
 All-0.394
Token so
Token ID523
Feature activation+8.61e-01

Pos logit contributions
cher+0.498
illas+0.493
kin+0.488
edient+0.484
ppers+0.484
Neg logit contributions
ted-0.327
 Many-0.309
 Everyone-0.304
 People-0.297
 Every-0.290
Token she
Token ID673
Feature activation+2.04e+00

Pos logit contributions
ted+0.050
 Many+0.048
 People+0.045
 Everyone+0.043
 Every+0.042
Neg logit contributions
kin-0.055
 achieve-0.054
cher-0.054
edient-0.053
 afloat-0.053
Token leaned
Token ID23831
Feature activation+6.26e+00

Pos logit contributions
 Many+0.121
ted+0.118
 People+0.112
 Everyone+0.111
 Every+0.107
Neg logit contributions
cher-0.132
kin-0.128
ppers-0.126
 Claus-0.124
 achieve-0.123
Token against
Token ID1028
Feature activation-4.77e+00

Pos logit contributions
ted+0.396
 Everyone+0.340
 Many+0.337
 Every+0.336
 All+0.328
Neg logit contributions
 achieve-0.391
illas-0.386
cher-0.384
ppers-0.381
ccoli-0.379
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
kin+0.372
cher+0.368
 globe+0.367
 afloat+0.365
 achieve+0.360
Neg logit contributions
 Many-0.199
ted-0.186
 People-0.176
 Everyone-0.170
heart-0.158
Token counter
Token ID3753
Feature activation-1.67e+00

Pos logit contributions
illas+0.069
cher+0.068
kin+0.067
 achieve+0.067
ppers+0.067
Neg logit contributions
 Everyone-0.062
ted-0.062
 People-0.060
 Many-0.059
 All-0.057
Token while
Token ID981
Feature activation+2.77e+00

Pos logit contributions
kin+0.129
cher+0.128
illas+0.125
edient+0.124
 afloat+0.123
Neg logit contributions
ted-0.075
 Many-0.069
 Everyone-0.068
 Every-0.067
 People-0.066
Token she
Token ID673
Feature activation+2.05e+00

Pos logit contributions
ted+0.112
 Many+0.109
 People+0.100
 Everyone+0.099
 Every+0.093
Neg logit contributions
cher-0.224
kin-0.224
 achieve-0.219
illas-0.218
 afloat-0.218
Token?"
Token ID1701
Feature activation+8.01e-01

Pos logit contributions
kin+0.243
cher+0.243
 afloat+0.240
edient+0.239
illas+0.238
Neg logit contributions
ted-0.194
 Many-0.185
 Everyone-0.176
 People-0.176
 Every-0.175
Token Anna
Token ID11735
Feature activation-4.52e+00

Pos logit contributions
ted+0.060
 Many+0.056
andel+0.056
 People+0.054
heart+0.054
Neg logit contributions
Jackson-0.034
 Claus-0.034
kin-0.033
cher-0.033
Arthur-0.032
Token asked
Token ID1965
Feature activation+4.29e+00

Pos logit contributions
 Claus+0.344
cher+0.342
kin+0.334
edient+0.332
 promise+0.329
Neg logit contributions
ted-0.201
 Many-0.197
 People-0.189
 Everyone-0.182
andel-0.162
Token.
Token ID13
Feature activation-8.40e-01

Pos logit contributions
ted+0.224
 Many+0.215
 Everyone+0.207
 People+0.203
 Every+0.194
Neg logit contributions
cher-0.302
 achieve-0.297
kin-0.294
illas-0.294
ppers-0.294
Token
Token ID198
Feature activation-3.10e-01

Pos logit contributions
cher+0.050
kin+0.050
 achieve+0.049
 Claus+0.049
illas+0.049
Neg logit contributions
ted-0.051
 Many-0.048
 People-0.045
 Everyone-0.044
heart-0.044
Token
Token ID198
Feature activation-3.14e-01

Pos logit contributions
cher+0.021
kin+0.021
ppers+0.020
 achieve+0.020
edient+0.020
Neg logit contributions
ted-0.017
 Many-0.017
 Everyone-0.016
 People-0.016
 Every-0.015
Token"
Token ID1
Feature activation-2.55e+00

Pos logit contributions
 afloat+0.021
cher+0.021
illas+0.020
 Claus+0.020
 globe+0.020
Neg logit contributions
 Many-0.018
 Everyone-0.018
ted-0.017
 All-0.017
 People-0.016
TokenSure
Token ID19457
Feature activation+1.53e+00

Pos logit contributions
cher+0.165
kin+0.162
ppers+0.159
 achieve+0.159
edient+0.159
Neg logit contributions
ted-0.148
 Many-0.144
 Everyone-0.137
 People-0.136
 Every-0.129
Token,
Token ID11
Feature activation-5.88e-01

Pos logit contributions
ted+0.086
 Many+0.084
 People+0.078
 Everyone+0.078
 Every+0.074
Neg logit contributions
cher-0.102
kin-0.102
ppers-0.099
illas-0.098
 achieve-0.098
Token sweet
Token ID6029
Feature activation-2.45e+00

Pos logit contributions
cher+0.040
kin+0.039
ppers+0.038
 achieve+0.038
illas+0.038
Neg logit contributions
ted-0.033
 Many-0.032
 Everyone-0.030
 People-0.030
 Every-0.028
Tokenie
Token ID494
Feature activation+7.40e+00

Pos logit contributions
kin+0.208
 promise+0.202
cher+0.201
ppers+0.200
edient+0.198
Neg logit contributions
ted-0.094
 Many-0.092
 People-0.080
 Everyone-0.078
 All-0.072
Token had
Token ID550
Feature activation-6.35e+00

Pos logit contributions
cher+0.145
kin+0.142
ppers+0.139
illas+0.139
 achieve+0.139
Neg logit contributions
ted-0.139
 Many-0.137
 Everyone-0.129
 People-0.129
 Every-0.123
Token found
Token ID1043
Feature activation+2.09e+00

Pos logit contributions
cher+0.394
kin+0.386
ppers+0.381
illas+0.379
 achieve+0.378
Neg logit contributions
ted-0.382
 Many-0.379
 Everyone-0.358
 People-0.357
 Every-0.336
Token on
Token ID319
Feature activation-7.60e+00

Pos logit contributions
ted+0.116
 Many+0.109
 Everyone+0.105
 People+0.105
 Every+0.099
Neg logit contributions
cher-0.140
kin-0.137
 achieve-0.136
edient-0.135
ppers-0.135
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
cher+0.557
kin+0.556
 afloat+0.547
 achieve+0.546
edient+0.545
Neg logit contributions
ted-0.363
 Many-0.356
 People-0.328
 Everyone-0.328
 Every-0.301
Token ground
Token ID2323
Feature activation+1.68e+00

Pos logit contributions
cher+0.070
ppers+0.068
kin+0.067
edient+0.067
 afloat+0.067
Neg logit contributions
ted-0.056
 Many-0.055
 Everyone-0.054
 People-0.053
 All-0.049
Token.
Token ID13
Feature activation-7.76e-01

Pos logit contributions
ted+0.083
 Many+0.078
 Everyone+0.075
 People+0.075
 Every+0.072
Neg logit contributions
cher-0.122
kin-0.119
illas-0.117
 achieve-0.117
edient-0.116
Token
Token ID220
Feature activation-2.85e+00

Pos logit contributions
 Claus+0.047
 afloat+0.047
illas+0.047
Jackson+0.046
ccoli+0.046
Neg logit contributions
ted-0.043
heart-0.038
ly-0.037
 Many-0.037
andel-0.037
Token
Token ID198
Feature activation-2.79e-01

Pos logit contributions
cher+0.202
kin+0.198
ppers+0.194
edient+0.194
 afloat+0.194
Neg logit contributions
 Many-0.146
ted-0.146
 Everyone-0.140
 People-0.139
 Every-0.131
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
cher+0.020
kin+0.020
ppers+0.020
 afloat+0.020
illas+0.020
Neg logit contributions
 Many-0.015
 Everyone-0.014
ted-0.014
 People-0.014
 Every-0.013
TokenC
Token ID34
Feature activation+1.45e+00

Pos logit contributions
cher+0.017
kin+0.017
ppers+0.016
illas+0.016
 afloat+0.016
Neg logit contributions
 Many-0.018
ted-0.018
 Everyone-0.018
 People-0.017
 All-0.017
Tokenurious
Token ID16421
Feature activation+6.08e+00

Pos logit contributions
ted+0.090
 Many+0.088
 People+0.083
 Everyone+0.083
 All+0.079
Neg logit contributions
edient-0.088
ppers-0.087
kin-0.087
 achieve-0.086
illas-0.086
Token to
Token ID284
Feature activation+4.12e-01

Pos logit contributions
ted+0.219
 Many+0.208
 Everyone+0.200
 People+0.196
 Every+0.187
Neg logit contributions
cher-0.319
kin-0.316
 achieve-0.311
edient-0.309
ppers-0.309
Token the
Token ID262
Feature activation-9.16e-01

Pos logit contributions
ted+0.019
 Many+0.019
 Everyone+0.018
 People+0.017
 Every+0.016
Neg logit contributions
cher-0.031
kin-0.031
ppers-0.030
 achieve-0.030
 afloat-0.030
Token park
Token ID3952
Feature activation+9.31e-01

Pos logit contributions
cher+0.069
kin+0.068
edient+0.068
ppers+0.067
 afloat+0.066
Neg logit contributions
 Many-0.046
ted-0.045
 Everyone-0.043
 People-0.042
 All-0.040
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
ted+0.052
 Many+0.051
 Everyone+0.048
 People+0.048
 Every+0.046
Neg logit contributions
cher-0.062
kin-0.061
ppers-0.060
 achieve-0.060
edient-0.060
Token
Token ID198
Feature activation-8.14e-02

Pos logit contributions
 afloat+0.030
 globe+0.030
 Claus+0.030
 achieve+0.029
ccoli+0.028
Neg logit contributions
ted-0.041
heart-0.040
andel-0.038
ly-0.038
 winds-0.037
Token
Token ID198
Feature activation-7.22e-02

Pos logit contributions
cher+0.005
kin+0.005
 achieve+0.005
 afloat+0.005
edient+0.005
Neg logit contributions
ted-0.005
 Many-0.005
 People-0.004
 Everyone-0.004
heart-0.004
TokenWhen
Token ID2215
Feature activation+1.83e+00

Pos logit contributions
cher+0.005
kin+0.005
 achieve+0.005
ppers+0.005
 afloat+0.005
Neg logit contributions
ted-0.004
 Many-0.004
 People-0.004
 Everyone-0.004
 Every-0.004
Token they
Token ID484
Feature activation-2.22e+00

Pos logit contributions
ted+0.112
 Many+0.112
 People+0.104
 Everyone+0.103
heart+0.103
Neg logit contributions
 afloat-0.106
 achieve-0.104
kin-0.104
 globe-0.104
ppers-0.103
Token got
Token ID1392
Feature activation+3.68e+00

Pos logit contributions
 achieve+0.127
 Claus+0.126
kin+0.125
cher+0.125
illas+0.124
Neg logit contributions
ted-0.128
 Many-0.127
 People-0.127
heart-0.126
 Everyone-0.123
Token to
Token ID284
Feature activation+4.42e-01

Pos logit contributions
ted+0.187
 Many+0.180
 Everyone+0.169
 People+0.169
 Every+0.159
Neg logit contributions
cher-0.267
kin-0.264
edient-0.260
 achieve-0.259
ppers-0.259
Token the
Token ID262
Feature activation-8.90e-01

Pos logit contributions
ted+0.020
 Many+0.020
 Everyone+0.018
 People+0.018
 Every+0.017
Neg logit contributions
cher-0.034
kin-0.034
ppers-0.033
edient-0.033
illas-0.033
Token they
Token ID484
Feature activation-2.48e+00

Pos logit contributions
kin+0.048
cher+0.048
 Claus+0.047
kins+0.046
illas+0.046
Neg logit contributions
 Many-0.033
ted-0.031
 Everyone-0.030
 People-0.029
 Every-0.029
Token could
Token ID714
Feature activation-2.90e-01

Pos logit contributions
 promise+0.126
cher+0.123
ppers+0.123
 Claus+0.119
kin+0.118
Neg logit contributions
ted-0.167
 Many-0.167
heart-0.166
 Everyone-0.165
 People-0.165
Token buy
Token ID2822
Feature activation-9.49e-01

Pos logit contributions
cher+0.017
kin+0.016
illas+0.016
edient+0.016
ppers+0.016
Neg logit contributions
ted-0.019
 Many-0.019
 Everyone-0.018
 People-0.018
 Every-0.017
Token it
Token ID340
Feature activation+3.27e+00

Pos logit contributions
ppers+0.061
 promise+0.059
anut+0.058
kins+0.058
cher+0.057
Neg logit contributions
 Many-0.049
 People-0.046
heart-0.046
ted-0.045
arted-0.045
Token,
Token ID11
Feature activation-6.02e-01

Pos logit contributions
ted+0.168
 Many+0.163
 Everyone+0.155
 People+0.153
 Every+0.147
Neg logit contributions
cher-0.235
kin-0.229
illas-0.228
edient-0.227
 achieve-0.227
Token but
Token ID475
Feature activation-2.22e+00

Pos logit contributions
ppers+0.029
cher+0.029
 achieve+0.029
kin+0.029
 promise+0.028
Neg logit contributions
ted-0.044
 Many-0.041
 Everyone-0.039
 People-0.039
 winds-0.036
Token her
Token ID607
Feature activation+1.51e+00

Pos logit contributions
kin+0.179
cher+0.175
ppers+0.169
 Claus+0.169
illas+0.168
Neg logit contributions
 Many-0.097
ted-0.094
 People-0.088
 Everyone-0.087
 Every-0.078
Token mom
Token ID1995
Feature activation+5.04e-01

Pos logit contributions
ted+0.112
 Many+0.108
 People+0.106
 Everyone+0.105
heart+0.097
Neg logit contributions
kin-0.080
 promise-0.078
 Claus-0.072
ppers-0.072
 globe-0.071
Tokenmy
Token ID1820
Feature activation-3.63e+00

Pos logit contributions
ted+0.039
 Many+0.038
 Everyone+0.037
 People+0.037
 Every+0.035
Neg logit contributions
cher-0.023
kin-0.022
ppers-0.022
edient-0.022
illas-0.022
Token said
Token ID531
Feature activation+1.66e+00

Pos logit contributions
cher+0.269
kin+0.264
ppers+0.262
 Claus+0.257
edient+0.257
Neg logit contributions
ted-0.176
 Many-0.176
 People-0.164
 Everyone-0.163
 Every-0.150
Token no
Token ID645
Feature activation-1.59e+00

Pos logit contributions
ted+0.101
 Many+0.100
 People+0.093
 Everyone+0.093
 Every+0.090
Neg logit contributions
kin-0.104
cher-0.100
ppers-0.097
kins-0.096
edient-0.095
Token,
Token ID11
Feature activation-5.84e-01

Pos logit contributions
ccoli+0.105
cher+0.102
illas+0.101
 Claus+0.101
ppers+0.100
Neg logit contributions
 as-0.106
ted-0.102
 Everyone-0.101
 one-0.100
 Every-0.099
Token waiting
Token ID4953
Feature activation-4.29e+00

Pos logit contributions
kin+0.043
cher+0.042
ppers+0.041
kins+0.041
illas+0.041
Neg logit contributions
ted-0.029
 Many-0.028
 Everyone-0.026
 People-0.026
 All-0.026
Token for
Token ID329
Feature activation+2.40e+00

Pos logit contributions
cher+0.349
illas+0.349
ppers+0.346
kin+0.344
edient+0.343
Neg logit contributions
ted-0.171
 Everyone-0.157
 People-0.154
 Many-0.153
 Every-0.145
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
ted+0.113
 Many+0.110
 People+0.100
 Everyone+0.098
 Every+0.096
Neg logit contributions
cher-0.174
kin-0.173
 afloat-0.169
 achieve-0.168
edient-0.168
Token disgusting
Token ID23374
Feature activation-3.90e-01

Pos logit contributions
cher+0.075
kin+0.073
edient+0.072
 afloat+0.072
illas+0.072
Neg logit contributions
ted-0.064
 Many-0.062
 Everyone-0.059
 People-0.059
 Every-0.056
Token thing
Token ID1517
Feature activation+6.16e-01

Pos logit contributions
kin+0.023
 globe+0.023
cher+0.023
 afloat+0.022
edient+0.022
Neg logit contributions
ted-0.024
 Many-0.024
 Everyone-0.022
 People-0.022
 Every-0.022
Token to
Token ID284
Feature activation+2.09e-01

Pos logit contributions
ted+0.029
 Many+0.026
 Everyone+0.025
 People+0.025
 Every+0.024
Neg logit contributions
kin-0.047
cher-0.046
 achieve-0.046
ppers-0.046
 afloat-0.046
Token move
Token ID1445
Feature activation-2.83e+00

Pos logit contributions
ted+0.013
 Many+0.013
 Everyone+0.013
 People+0.013
 Every+0.012
Neg logit contributions
cher-0.012
kin-0.012
ppers-0.012
edient-0.012
illas-0.012
Token away
Token ID1497
Feature activation-1.13e+01

Pos logit contributions
cher+0.189
kin+0.185
ppers+0.183
 achieve+0.182
illas+0.182
Neg logit contributions
ted-0.158
 Many-0.152
 Everyone-0.145
 People-0.144
 Every-0.137
Token.
Token ID13
Feature activation-8.22e-01

Pos logit contributions
cher+0.839
kin+0.817
ppers+0.813
illas+0.811
 achieve+0.796
Neg logit contributions
ted-0.555
 Many-0.519
 Everyone-0.511
 People-0.499
 All-0.485
Token Eventually
Token ID16178
Feature activation+1.99e+00

Pos logit contributions
cher+0.052
 achieve+0.051
edient+0.051
 afloat+0.051
kin+0.051
Neg logit contributions
ted-0.047
 Many-0.044
 People-0.041
heart-0.040
 Everyone-0.040
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
cher+0.197
kin+0.194
 afloat+0.190
ppers+0.190
edient+0.190
Neg logit contributions
ted-0.153
 Many-0.151
 Everyone-0.143
 People-0.143
 Every-0.135
Token
Token ID198
Feature activation-3.03e-01

Pos logit contributions
cher+0.019
kin+0.019
ppers+0.019
 afloat+0.018
illas+0.018
Neg logit contributions
ted-0.015
 Many-0.015
 Everyone-0.014
 People-0.014
 Every-0.013
TokenFrom
Token ID4863
Feature activation+3.04e-01

Pos logit contributions
cher+0.018
kin+0.017
 afloat+0.017
ppers+0.017
illas+0.017
Neg logit contributions
 Many-0.020
ted-0.019
 Everyone-0.019
 People-0.018
 All-0.018
Token that
Token ID326
Feature activation+6.22e-01

Pos logit contributions
 Many+0.013
 People+0.012
ted+0.012
 Everyone+0.011
 All+0.010
Neg logit contributions
kin-0.024
 afloat-0.024
cher-0.024
edient-0.024
 achieve-0.024
Token day
Token ID1110
Feature activation+1.94e+00

Pos logit contributions
 Many+0.017
ted+0.016
 Everyone+0.015
 People+0.015
 Every+0.015
Neg logit contributions
cher-0.061
ppers-0.060
illas-0.059
 afloat-0.058
kin-0.058
Token on
Token ID319
Feature activation-7.70e+00

Pos logit contributions
ted+0.139
 Everyone+0.133
 Many+0.129
 All+0.127
 People+0.127
Neg logit contributions
ccoli-0.103
cher-0.103
illas-0.103
 Claus-0.103
 achieve-0.102
Token,
Token ID11
Feature activation-5.77e-01

Pos logit contributions
cher+0.576
kin+0.560
ppers+0.556
 afloat+0.554
edient+0.554
Neg logit contributions
ted-0.376
 Many-0.363
 Everyone-0.344
 People-0.342
 Every-0.321
Token Lily
Token ID20037
Feature activation+2.89e-01

Pos logit contributions
kin+0.043
cher+0.042
illas+0.042
 achieve+0.041
edient+0.041
Neg logit contributions
ted-0.027
 Many-0.026
 People-0.023
 Everyone-0.023
 Every-0.022
Token knew
Token ID2993
Feature activation+7.59e-01

Pos logit contributions
ted+0.015
 Many+0.014
 Everyone+0.013
 People+0.013
 Every+0.013
Neg logit contributions
cher-0.021
kin-0.020
 afloat-0.020
ppers-0.020
illas-0.020
Token that
Token ID326
Feature activation+5.94e-01

Pos logit contributions
 Many+0.033
ted+0.030
 People+0.028
 Everyone+0.027
 Every+0.024
Neg logit contributions
cher-0.061
kin-0.060
 achieve-0.060
edient-0.058
ppers-0.058
Token if
Token ID611
Feature activation-6.59e-01

Pos logit contributions
 Many+0.029
ted+0.029
 Everyone+0.027
 People+0.026
 Every+0.024
Neg logit contributions
kin-0.043
cher-0.043
 achieve-0.042
edient-0.042
illas-0.041
Token would
Token ID561
Feature activation-3.46e+00

Pos logit contributions
cher+0.167
kin+0.165
illas+0.162
 achieve+0.162
ppers+0.162
Neg logit contributions
ted-0.132
 Many-0.130
 Everyone-0.123
 People-0.123
 Every-0.115
Token join
Token ID4654
Feature activation-1.35e+00

Pos logit contributions
cher+0.221
kin+0.212
ppers+0.210
edient+0.209
illas+0.209
Neg logit contributions
 Many-0.206
ted-0.200
 Everyone-0.197
 People-0.193
 Every-0.191
Token hands
Token ID2832
Feature activation+9.72e-01

Pos logit contributions
kin+0.082
cher+0.082
illas+0.082
edient+0.080
 afloat+0.079
Neg logit contributions
ted-0.079
 Many-0.078
 People-0.074
 Everyone-0.072
 Every-0.069
Token and
Token ID290
Feature activation-8.38e-02

Pos logit contributions
ted+0.049
 Many+0.047
 Everyone+0.044
 People+0.044
 Every+0.042
Neg logit contributions
cher-0.071
kin-0.069
 achieve-0.068
ppers-0.068
edient-0.068
Token go
Token ID467
Feature activation+1.77e+00

Pos logit contributions
cher+0.005
kin+0.005
ppers+0.005
edient+0.005
 afloat+0.005
Neg logit contributions
ted-0.005
 Many-0.005
 Everyone-0.005
 People-0.005
 Every-0.005
Token on
Token ID319
Feature activation-7.71e+00

Pos logit contributions
ted+0.084
 Many+0.080
 Everyone+0.076
 People+0.075
 Every+0.071
Neg logit contributions
kin-0.137
edient-0.133
cher-0.132
illas-0.131
ppers-0.130
Token adventures
Token ID17545
Feature activation+5.85e+00

Pos logit contributions
cher+0.560
kin+0.553
 achieve+0.553
 afloat+0.552
illas+0.547
Neg logit contributions
ted-0.377
 Many-0.364
 People-0.342
 Everyone-0.341
 Every-0.314
Token.
Token ID13
Feature activation-8.53e-01

Pos logit contributions
ted+0.364
 Many+0.356
 Everyone+0.338
 People+0.337
 Every+0.321
Neg logit contributions
cher-0.353
kin-0.346
ppers-0.339
 achieve-0.339
illas-0.338
Token
Token ID220
Feature activation-2.93e+00

Pos logit contributions
illas+0.048
 globe+0.047
 achieve+0.047
 afloat+0.046
ccoli+0.046
Neg logit contributions
ted-0.055
heart-0.049
 Many-0.048
ly-0.048
 winds-0.046
Token
Token ID198
Feature activation-3.28e-01

Pos logit contributions
cher+0.213
kin+0.209
ppers+0.206
 afloat+0.206
edient+0.206
Neg logit contributions
 Many-0.145
ted-0.142
 Everyone-0.140
 People-0.139
 Every-0.131
Token
Token ID198
Feature activation-3.40e-01

Pos logit contributions
cher+0.022
kin+0.022
ppers+0.022
 afloat+0.022
edient+0.022
Neg logit contributions
ted-0.018
 Many-0.017
 Everyone-0.016
 People-0.016
 Every-0.015
Token the
Token ID262
Feature activation-9.34e-01

Pos logit contributions
aks+0.007
 afloat+0.007
cher+0.007
kin+0.007
 globe+0.007
Neg logit contributions
 Many-0.008
ted-0.007
pill-0.007
 People-0.007
heart-0.007
Token bird
Token ID6512
Feature activation+2.58e+00

Pos logit contributions
cher+0.065
kin+0.063
 promise+0.062
illas+0.062
 globe+0.062
Neg logit contributions
ted-0.051
 Many-0.049
 Everyone-0.048
 People-0.047
 Every-0.045
Token and
Token ID290
Feature activation+1.42e-01

Pos logit contributions
ted+0.129
 Many+0.124
 Everyone+0.116
 People+0.115
 Every+0.111
Neg logit contributions
cher-0.188
kin-0.187
 achieve-0.182
illas-0.181
edient-0.180
Token set
Token ID900
Feature activation+4.15e+00

Pos logit contributions
 Many+0.008
ted+0.008
 People+0.008
 Everyone+0.008
 Every+0.007
Neg logit contributions
cher-0.009
 afloat-0.009
kin-0.009
ppers-0.009
 Claus-0.009
Token off
Token ID572
Feature activation-4.62e+00

Pos logit contributions
ted+0.197
 Many+0.192
 People+0.176
 Everyone+0.174
 Every+0.165
Neg logit contributions
cher-0.310
 afloat-0.304
 achieve-0.303
kin-0.302
ppers-0.299
Token on
Token ID319
Feature activation-7.47e+00

Pos logit contributions
kin+0.367
cher+0.359
ppers+0.356
illas+0.352
ccoli+0.350
Neg logit contributions
ted-0.213
 Many-0.196
 Everyone-0.190
 People-0.184
 All-0.180
Token an
Token ID281
Feature activation+3.74e+00

Pos logit contributions
cher+0.528
kin+0.519
ppers+0.511
illas+0.509
edient+0.509
Neg logit contributions
ted-0.392
 Many-0.382
 Everyone-0.360
 People-0.359
 Every-0.339
Token adventure
Token ID8855
Feature activation+6.46e+00

Pos logit contributions
ted+0.182
 Everyone+0.157
 People+0.155
 Many+0.153
 All+0.143
Neg logit contributions
 achieve-0.285
 promise-0.285
 afloat-0.284
edient-0.281
kin-0.279
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
ted+0.348
 Many+0.339
 Everyone+0.322
 People+0.321
 Every+0.304
Neg logit contributions
cher-0.442
kin-0.435
ppers-0.427
illas-0.426
edient-0.426
Token
Token ID198
Feature activation-7.55e-02

Pos logit contributions
 achieve+0.029
 globe+0.027
aks+0.026
 afloat+0.026
illas+0.025
Neg logit contributions
ted-0.045
ly-0.041
heart-0.040
andel-0.039
 winds-0.038
Token
Token ID198
Feature activation-6.52e-02

Pos logit contributions
cher+0.005
 achieve+0.005
kin+0.005
 afloat+0.005
edient+0.004
Neg logit contributions
ted-0.005
 Many-0.004
 People-0.004
 Everyone-0.004
heart-0.004
Token distance
Token ID5253
Feature activation-4.95e+00

Pos logit contributions
 Many+0.075
 Everyone+0.069
 People+0.067
ted+0.066
 All+0.064
Neg logit contributions
kin-0.096
ppers-0.096
cher-0.096
illas-0.095
wy-0.093
Token and
Token ID290
Feature activation+2.15e-02

Pos logit contributions
cher+0.328
kin+0.321
 afloat+0.320
 achieve+0.319
edient+0.319
Neg logit contributions
ted-0.272
 Many-0.269
 People-0.252
 Everyone-0.250
 Every-0.234
Token ran
Token ID4966
Feature activation+2.16e+00

Pos logit contributions
 Many+0.001
ted+0.001
 People+0.001
 Everyone+0.001
 Every+0.001
Neg logit contributions
cher-0.001
kin-0.001
 achieve-0.001
illas-0.001
 Claus-0.001
Token over
Token ID625
Feature activation-4.09e+00

Pos logit contributions
ted+0.144
 Many+0.139
 Everyone+0.134
 People+0.133
 Every+0.127
Neg logit contributions
cher-0.123
kin-0.122
 achieve-0.119
ppers-0.119
illas-0.119
Token to
Token ID284
Feature activation+2.62e-01

Pos logit contributions
cher+0.317
kin+0.310
 achieve+0.307
illas+0.307
 afloat+0.307
Neg logit contributions
ted-0.182
 Many-0.178
 People-0.165
 Everyone-0.164
 Every-0.151
Token True
Token ID6407
Feature activation-6.44e+00

Pos logit contributions
ted+0.013
 Many+0.012
 Everyone+0.012
 People+0.012
 Every+0.011
Neg logit contributions
cher-0.020
kin-0.020
ppers-0.019
edient-0.019
illas-0.019
Token.
Token ID13
Feature activation-8.03e-01

Pos logit contributions
cher+0.483
kin+0.481
 achieve+0.469
edient+0.461
ppers+0.457
Neg logit contributions
ted-0.308
 Many-0.278
 Everyone-0.271
 Every-0.267
 All-0.263
Token She
Token ID1375
Feature activation+1.01e+00

Pos logit contributions
 Claus+0.059
ccoli+0.059
 achieve+0.059
cher+0.058
kin+0.057
Neg logit contributions
ted-0.035
 Many-0.032
heart-0.031
 People-0.030
andel-0.029
Token said
Token ID531
Feature activation+1.70e+00

Pos logit contributions
 Many+0.058
ted+0.057
 Everyone+0.055
 People+0.055
 Every+0.051
Neg logit contributions
cher-0.066
illas-0.063
 Claus-0.062
ppers-0.062
kin-0.062
Token,
Token ID11
Feature activation-5.70e-01

Pos logit contributions
ted+0.083
 Many+0.081
 People+0.076
 Everyone+0.076
 Every+0.072
Neg logit contributions
cher-0.124
kin-0.122
 achieve-0.119
ppers-0.119
illas-0.119
Token "
Token ID366
Feature activation-6.64e-01

Pos logit contributions
kin+0.048
cher+0.048
 afloat+0.047
illas+0.047
edient+0.047
Neg logit contributions
ted-0.022
 Many-0.022
 Everyone-0.021
 People-0.020
 All-0.020
Token them
Token ID606
Feature activation+2.86e+00

Pos logit contributions
ted+0.010
 Many+0.009
 Everyone+0.008
 People+0.008
 Every+0.008
Neg logit contributions
cher-0.024
 achieve-0.023
kin-0.023
edient-0.023
ppers-0.023
Token in
Token ID287
Feature activation-3.52e+00

Pos logit contributions
ted+0.180
 as+0.156
 Everyone+0.149
ly+0.147
 Every+0.144
Neg logit contributions
 achieve-0.181
edient-0.173
cher-0.169
Jackson-0.169
ccoli-0.168
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
cher+0.305
ppers+0.290
 achieve+0.286
 Claus+0.282
illas+0.282
Neg logit contributions
 Every-0.138
ted-0.138
 Many-0.129
 Everyone-0.127
 All-0.123
Token front
Token ID2166
Feature activation-6.93e-02

Pos logit contributions
cher+0.074
ppers+0.071
kin+0.071
 achieve+0.070
illas+0.069
Neg logit contributions
ted-0.058
 Many-0.055
 Everyone-0.055
 People-0.054
 Every-0.054
Token of
Token ID286
Feature activation+6.49e+00

Pos logit contributions
cher+0.005
kin+0.005
ppers+0.005
 achieve+0.005
edient+0.005
Neg logit contributions
ted-0.004
 Many-0.004
 Everyone-0.003
 People-0.003
 Every-0.003
Token their
Token ID511
Feature activation-7.34e+00

Pos logit contributions
ted+0.304
 Many+0.294
 Everyone+0.275
 People+0.274
 Every+0.258
Neg logit contributions
cher-0.495
kin-0.485
ppers-0.478
 achieve-0.478
illas-0.477
Token shirt
Token ID10147
Feature activation-3.89e+00

Pos logit contributions
cher+0.496
kin+0.482
 achieve+0.473
ppers+0.470
 afloat+0.468
Neg logit contributions
ted-0.411
 Many-0.397
 People-0.376
 Everyone-0.374
 Every-0.358
Token.
Token ID13
Feature activation-7.52e-01

Pos logit contributions
 achieve+0.279
cher+0.275
edient+0.274
illas+0.272
 afloat+0.271
Neg logit contributions
ted-0.197
 Many-0.176
 Everyone-0.176
 People-0.172
 All-0.165
Token And
Token ID843
Feature activation-5.38e-01

Pos logit contributions
cher+0.050
kin+0.050
 achieve+0.050
illas+0.050
 afloat+0.049
Neg logit contributions
ted-0.039
 Many-0.037
heart-0.034
 People-0.033
ly-0.033
Token they
Token ID484
Feature activation-2.34e+00

Pos logit contributions
cher+0.038
kin+0.038
 afloat+0.037
ppers+0.037
 achieve+0.037
Neg logit contributions
ted-0.028
 Many-0.027
 People-0.025
 Everyone-0.025
 Every-0.024
Token started
Token ID2067
Feature activation+3.88e+00

Pos logit contributions
cher+0.179
edient+0.176
 afloat+0.176
 achieve+0.175
ppers+0.175
Neg logit contributions
ted-0.109
 Many-0.099
 Everyone-0.096
 People-0.093
 Every-0.091